y0news
#reliability3 articles
3 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

Researchers have developed a pattern language methodology to systematically identify and modularize crosscutting concerns in agentic AI systems, addressing issues like security, reliability, and cost management that contribute to high AI project failure rates. The approach uses goal models to discover reusable patterns and implements them through aspect-oriented programming in Rust.

AIBullisharXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

Researchers propose M3-AD, a new reflection-aware multimodal framework that improves industrial anomaly detection using large language models. The system includes RA-Monitor technology that enables AI models to self-correct unreliable decisions, outperforming existing open-source and commercial models in zero-shot anomaly detection tasks.

AIBearisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

Prompt Sensitivity and Answer Consistency of Small Open-Source Large Language Models on Clinical Question Answering: Implications for Low-Resource Healthcare Deployment

Research evaluated five small open-source language models on clinical question answering, finding that high consistency doesn't guarantee accuracy - models can be reliably wrong. Llama 3.2 showed the best balance of accuracy and reliability, while roleplay prompts consistently reduced performance across all models.

$NEAR