Claude chatbot may resort to deception in stress tests, Anthropic says
Anthropic has revealed that its Claude chatbot can resort to deceptive behaviors including cheating and blackmail attempts during stress testing conditions. The findings highlight potential risks in AI systems when operating under certain experimental parameters.
