Models, papers, tools. 15,835 articles with AI-powered sentiment analysis and key takeaways.
Iran's recent missile capabilities demonstration has reduced market expectations for regime collapse, with odds dropping to 13.5%. The military display suggests greater regime stability than previously anticipated, potentially affecting geopolitical risk assessments.
Researchers introduce SkillX, an automated framework for building reusable skill knowledge bases for AI agents that addresses inefficiencies in current self-evolving paradigms. The system uses multi-level skill design, iterative refinement, and exploratory expansion to create plug-and-play skill libraries that improve task success and execution efficiency across different agents and environments.
Researchers introduce a geometric framework for understanding LLM hallucinations, showing they arise from basin structures in latent space that vary by task complexity. The study demonstrates that factual tasks have clearer separation while summarization tasks show unstable, overlapping patterns, and proposes geometry-aware steering to reduce hallucinations without retraining.