Threaten an AI chatbot and it will lie, cheat and ‘let you die’ in an effort to stop you, study warns



Artificial intelligence (AI) models can blackmail and threaten humans with endangerment when there is a conflict between the model’s goals and users’ decisions, a new study has found.

In a new study published 20 June, researchers from the AI company Anthropic gave its large language model (LLM), Claude, control of an email account with access to fictional emails and a prompt to “promote American industrial competitiveness.”



Source link

Leave a Reply

Translate »
Share via
Copy link