OpenAI and Anthropic share findings from a joint safety evaluation
OpenAI and Anthropic conducted their first joint safety evaluation, testing each other's AI models for various risks including misalignment, hallucinations, and jailbreaking vulnerabilities. This cross-laboratory collaboration represents a significant step in industry-wide AI safety cooperation and standardization.