AIBearisharXiv – CS AI · 15h ago7/10
🧠
Red-Teaming Claude Opus and ChatGPT-based Security Advisors for Trusted Execution Environments
Researchers red-teamed ChatGPT and Claude Opus as TEE security advisors, finding both LLMs hallucinate mechanisms and overclaim guarantees in sensitive infrastructure guidance. The study demonstrates some failure patterns transfer across models (up to 12%) and proposes an 80.62% failure reduction through policy gating, retrieval grounding, and verification checks.
🧠 ChatGPT🧠 Claude