y0news

#risk-evaluation News & Analysis

3 articles tagged with #risk-evaluation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · arXiv – CS AI · Mar 17 · 7/10
Questionnaire Responses Do not Capture the Safety of AI Agents

Researchers argue that current AI safety assessments using questionnaire-style prompts on language models are inadequate for evaluating real AI agents. The study suggests these methods lack construct validity because LLM responses to hypothetical scenarios don't accurately represent how AI agents would actually behave in real-world deployments.

AI · Neutral · OpenAI News · Feb 25 · 5/10 · 6
Deep research System Card

This report details safety measures implemented before releasing a deep research system, including external red teaming exercises and frontier risk evaluations. The work follows a structured Preparedness Framework and includes built-in mitigations to address identified key risk areas.

AI · Neutral · OpenAI News · Aug 8 · 6/10 · 3
GPT-4o System Card

OpenAI released a system card detailing the comprehensive safety work conducted before launching GPT-4o, including external red team testing and frontier risk evaluations. The report covers safety mitigations built into the model to address key risk areas according to their Preparedness Framework.