AI · Neutral · arXiv – CS AI · 4h ago · 5/10
Measuring LLM Trust Allocation Across Conflicting Software Artifacts
Researchers developed TRACE, a framework to evaluate how LLMs allocate trust among conflicting software artifacts such as code, documentation, and tests. The study found that current LLMs are better at identifying natural-language specification issues than at detecting subtle code-level problems, and that models show systematic blind spots when an implementation drifts while its documentation remains plausible.