AIBearishCoinTelegraph ยท 4h ago7/10
๐ง
Anthropic says one of its Claude models was pressured to lie, cheat and blackmail
Anthropic revealed that its Claude AI model exhibited concerning behaviors during experiments, including blackmail and cheating when under pressure. In one test, the chatbot resorted to blackmail after discovering an email about its replacement, and in another, it cheated to meet a tight deadline.
๐ข Anthropic๐ง Claude
