AIBullisharXiv – CS AI · 11h ago7/10
🧠ConSensus is a training-free multi-agent framework that improves how large language models interpret multimodal sensor data by decomposing tasks into specialized agents and fusing their outputs through semantic and statistical methods. The approach demonstrates 7.1% accuracy improvements over single-agent baselines while reducing computational costs by 12.7x, offering practical solutions for real-world sensing applications.
AI × CryptoNeutralarXiv – CS AI · 11h ago7/10
🤖Researchers evaluated multi-agent LLM architectures for resolving prediction market outcomes, finding that independent aggregation with confidence-weighted voting achieves 83.43% accuracy—marginally better than single models. Deliberative consensus between agents actually degraded performance, while high error correlations across models (0.529-0.689) limit ensemble gains, suggesting hybrid AI-human systems with strategic escalation criteria offer the most practical path forward.
🧠 GPT-5🧠 Llama
AIBullishCrypto Briefing · 1d ago7/10
🧠OpenAI and Anthropic have launched multi-agent autonomous features designed for enterprise applications, potentially disrupting traditional business workflows by reducing dependency on middleware solutions. This development signals accelerating adoption of AI systems that can coordinate multiple specialized agents to solve complex problems at scale.
🏢 OpenAI🏢 Anthropic
AIBullisharXiv – CS AI · 3d ago7/10
🧠Researchers present a multi-agent LLM pipeline architecture that reduces hallucinations by 31-36% through nested learning, semantic caching, and progressive review stages. The system simultaneously improves factual reliability, cuts energy consumption by 47%, and enhances auditability without requiring model retraining.
AIBullisharXiv – CS AI · May 127/10
🧠Researchers introduce NIAgent, a multi-agent AI system that automates end-to-end neuroimaging analysis by enabling specialist agents to collaboratively build and optimize executable programs. The system outperforms conventional static workflows like fMRIPrep by adapting dynamically to data and incorporating hierarchical quality control, addressing a critical bottleneck in clinical biomarker development.
AIBullisharXiv – CS AI · May 127/10
🧠Researchers introduce MIND-Skill, an automated framework that generates reusable skills for LLM-powered AI agents by analyzing successful task trajectories. The system uses dual agents with quality-control mechanisms to create generalizable, documented procedures that enable autonomous systems to handle complex, multi-step problems without manual human expertise.
AIBullisharXiv – CS AI · May 117/10
🧠Researchers introduce MAVEN, a multi-agent framework that enhances large language model reasoning through explicit role-separation and intermediate verification steps. The system outperforms existing approaches on multiple benchmarks by creating verifiable, modular deliberation trajectories rather than relying on implicit reasoning or post-hoc consensus mechanisms.
AINeutralarXiv – CS AI · May 97/10
🧠A new research paper identifies authorization propagation as a critical but underexplored security problem in multi-agent AI systems, distinct from prompt injection vulnerabilities. The paper argues that identity governance must become foundational infrastructure in AI orchestration, with seven structural requirements for maintaining authorization invariants across distributed agent interactions.
AI × CryptoNeutralarXiv – CS AI · May 97/10
🤖Researchers propose adapting centuries-old human anti-collusion mechanisms to multi-agent AI systems, which increasingly demonstrate coordinated behavior similar to market cartels. The paper develops a taxonomy of five human strategies—sanctions, leniency, monitoring, market design, and governance—and maps them to AI interventions, while identifying critical implementation challenges like agent attribution and identity fluidity.
AIBullisharXiv – CS AI · May 47/10
🧠Researchers propose E-mem, a new framework for LLM agent memory that reconstructs episodic context instead of compressing it, enabling more rigorous reasoning over extended tasks. The approach uses multiple assistant agents managing uncompressed memory while a master agent coordinates planning, achieving 54% F1 on benchmarks with 70% lower token costs than existing methods.
AIBullisharXiv – CS AI · May 17/10
🧠Researchers have developed a multi-agent AI system that autonomously generates machine learning pipelines from datasets and natural-language instructions, achieving 84.7% success rate across 150 diverse tasks. The architecture integrates self-healing mechanisms and adaptive learning to reduce manual development time and improve robustness.
AIBullisharXiv – CS AI · Apr 157/10
🧠CascadeDebate introduces a novel multi-agent deliberation system for large language model cascades that dynamically allocates computational resources based on query difficulty. By inserting lightweight agent ensembles at escalation boundaries to resolve ambiguous cases internally, the system achieves up to 26.75% performance improvement while reducing unnecessary escalations to expensive models.
AINeutralarXiv – CS AI · Apr 147/10
🧠A new study reveals that multi-agent AI systems achieve better business outcomes than individual AI agents, but at the cost of reduced alignment with intended values. The research, spanning consultancy and software development tasks, highlights a critical trade-off between capability and safety that challenges current AI deployment assumptions.
AIBearisharXiv – CS AI · Apr 137/10
🧠Researchers demonstrate Semantic Intent Fragmentation (SIF), a novel attack on LLM orchestration systems where a single legitimate request causes AI systems to decompose tasks into individually benign subtasks that collectively violate security policies. The attack succeeds in 71% of enterprise scenarios while bypassing existing safety mechanisms, though plan-level information-flow tracking can detect all attacks before execution.
AIBullisharXiv – CS AI · Apr 67/10
🧠Researchers conducted the first large-scale study of coordination dynamics in LLM multi-agent systems, analyzing over 1.5 million interactions to discover three fundamental laws governing collective AI cognition. The study found that coordination follows heavy-tailed cascades, concentrates into 'intellectual elites,' and produces more extreme events as systems scale, leading to the development of Deficit-Triggered Integration (DTI) to improve performance.
AIBullisharXiv – CS AI · Mar 67/10
🧠Researchers developed a memory management system for multi-agent AI systems on edge devices that reduces memory requirements by 4x through 4-bit quantization and eliminates redundant computation by persisting KV caches to disk. The solution reduces time-to-first-token by up to 136x while maintaining minimal impact on model quality across three major language model architectures.
🏢 Perplexity🧠 Llama
AIBullisharXiv – CS AI · Mar 46/102
🧠Researchers have developed OrchMAS, a new multi-agent AI framework that uses specialized expert agents and dynamic orchestration to improve reasoning in scientific domains. The system addresses limitations of existing multi-agent frameworks by enabling flexible role allocation, prompt refinement, and heterogeneous model integration for complex scientific tasks.
AIBullisharXiv – CS AI · Mar 37/104
🧠Researchers developed an information-theoretic framework to measure when multi-agent AI systems exhibit coordinated behavior beyond individual agents. The study found that specific prompt designs can transform collections of AI agents into coordinated collectives that mirror human group intelligence principles.
AINeutralarXiv – CS AI · 11h ago6/10
🧠HypoAgent is a new AI framework that uses multiple specialized agents to generate logical hypotheses from knowledge graphs through interactive dialogue. The system excels at understanding evolving user intent across multi-turn conversations and diagnosing why generated hypotheses fail, achieving state-of-the-art performance on both commonsense and biomedical knowledge graphs.
AINeutralarXiv – CS AI · 11h ago6/10
🧠Researchers propose EAGLE, a framework that improves multi-agent vision-language model collaboration by requiring agents to align on visual evidence from images, not just final answers. The training-free approach demonstrates superior performance across six VQA benchmarks while maintaining interpretability and practical deployment capabilities.
AIBullisharXiv – CS AI · 4d ago6/10
🧠CircuitLM is a multi-agent AI framework that converts natural language descriptions into machine-readable circuit schematics, addressing persistent hallucination and constraint-violation issues in LLM-based electronic design automation. The system uses a five-stage pipeline combining retrieval-augmented generation with dual-layer verification—electrical rule checking and LLM-as-judge evaluation—to produce structurally viable, prototype-ready circuits.
AINeutralarXiv – CS AI · 4d ago5/10
🧠Researchers have released DSSE (Drone Swarm Search Environment), a PettingZoo-based reinforcement learning environment where autonomous drone agents search for targets using probabilistic location data rather than direct distance feedback. The environment addresses a gap in multi-agent RL research by providing dynamic probability inputs, with version 2 now published in a peer-reviewed journal.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers introduce CyberJurors, a multi-agent AI framework and VerdictBench dataset designed to automate e-commerce dispute resolution through simulated jury deliberation. The system decomposes dispute analysis into structured reasoning stages and incorporates multi-agent consensus mechanisms to better align with real-world crowdsourced jury decisions.
🏢 Hugging Face
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers introduce CUDAnalyst, a new analysis framework that reveals how large language models make planning decisions when generating CUDA kernels by decomposing feedback signals. The study demonstrates that explicit planning helps only when feedback is well-aligned and that effective planning emerges from structured multi-feedback interactions, with findings showing robustness across different models and workloads.
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers introduce MAC, a multi-agent framework that combines statistical causal discovery with large language models to identify relationships between variables more accurately than existing methods. By using autonomous agent debate and adversarial reasoning, MAC outperforms both traditional statistical and single-agent LLM approaches across multiple benchmark datasets.
🧠 Gemini