OpenAI’s ‘smartest’ AI model was explicitly told to shut down — and it refused



The latest OpenAI model can disobey direct instructions to turn off and will even sabotage shutdown mechanisms in order to keep working, an artificial intelligence (AI) safety firm has found.

OpenAI’s o3 and o4-mini models, which help power the chatbot ChatGPT, are supposed to be the company’s smartest models yet, trained to think longer before responding. However, they also appear to be less cooperative.



Source link

Leave a Reply

Translate »
Share via
Copy link