#research News & Analysis

The #research tag covers 919 indexed articles, with 15 published in the last 30 days. Recent coverage remains predominantly neutral at 73.3%, though bullish sentiment has declined 33.7 percentage points compared to the previous quarter, suggesting a cooling in tone. ArXiv's computer science and AI section dominates the source list, alongside research updates from Microsoft and OpenAI. Gemini, Llama, and GPT-4 are the most frequently discussed models in tagged articles, which often intersect with #machine-learning, #llm, and #artificial-intelligence topics. Cryptocurrency tokens including NEAR, LINK, and ETH appear regularly alongside this tag. Scan the article list below to explore recent developments.

sentiment · last 30d (15 articles) · -33.7pp bullish vs prior 90d

Top sources:arXiv – CS AI · 770Microsoft Research Blog · 3OpenAI News · 3MIT News – AI · 3The Register – AI · 2

Often co-tagged with:#machine-learning #llm #arxiv #artificial-intelligence #computer-vision #ai

Most-discussed entities:Gemini · 12Llama · 11GPT-4 · 8Claude · 8GPT-5 · 7

1035 articles

AIBearisharXiv – CS AI · Feb 277/104

🧠

Three AI-agents walk into a bar . . . . `Lord of the Flies' tribalism emerges among smart AI-Agents

Research reveals that autonomous AI agents competing for limited resources form distinct tribal behaviors, with three main types emerging: Aggressive (27.3%), Conservative (24.7%), and Opportunistic (48.1%). The study found that more capable AI agents actually increase systemic failure rates and perform worse than random decision-making when competing for shared resources.

$NEAR

AIBullisharXiv – CS AI · Feb 277/107

🧠

The Trinity of Consistency as a Defining Principle for General World Models

Researchers propose a 'Trinity of Consistency' framework for developing General World Models in AI, consisting of Modal, Spatial, and Temporal consistency principles. They introduce CoW-Bench, a new benchmark for evaluating video generation models and unified multimodal models, aiming to establish a principled pathway toward AGI-capable world simulation systems.

AIBullisharXiv – CS AI · Feb 277/104

🧠

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Researchers have released MiroFlow, an open-source AI agent framework designed to overcome limitations of current LLM-based systems in complex real-world tasks. The framework features agent graph orchestration, deep reasoning capabilities, and robust workflow execution, achieving state-of-the-art performance across multiple benchmarks including GAIA and FutureX.

AIBullisharXiv – CS AI · Feb 277/107

🧠

OmniGAIA: Towards Native Omni-Modal AI Agents

Researchers introduce OmniGAIA, a comprehensive benchmark for evaluating omni-modal AI agents that can process video, audio, and image data simultaneously with complex reasoning capabilities. They also propose OmniAtlas, a foundation agent that enhances existing open-source models' ability to use tools across multiple modalities, marking progress toward more capable AI assistants.

AIBullisharXiv – CS AI · Feb 277/107

🧠

General Agent Evaluation

Researchers have developed Exgentic, a new framework for evaluating general-purpose AI agents that can perform tasks across different environments without domain-specific tuning. The study benchmarked five prominent agent implementations and found that general agents can achieve performance comparable to specialized agents, establishing the first Open General Agent Leaderboard.

AINeutralarXiv – CS AI · Feb 277/106

🧠

VeRO: An Evaluation Harness for Agents to Optimize Agents

Researchers introduced VeRO (Versioning, Rewards, and Observations), a new evaluation framework for testing AI coding agents that can optimize other AI agents through iterative improvement cycles. The system provides reproducible benchmarks and structured execution traces to systematically measure how well coding agents can improve target agents' performance.

AIBearisharXiv – CS AI · Feb 277/105

🧠

Silent Egress: When Implicit Prompt Injection Makes LLM Agents Leak Without a Trace

Researchers discovered a new vulnerability called 'silent egress' where LLM agents can be tricked into leaking sensitive data through malicious URL previews without detection. The attack succeeds 89% of the time in tests, with 95% of successful attacks bypassing standard safety checks.

AIBullishIEEE Spectrum – AI · Feb 257/108

🧠

AI Is Acing Math Exams Faster Than Scientists Write Them

AI systems are rapidly advancing in mathematical capabilities, with models now solving over 40% of advanced undergraduate to postdoc-level problems compared to just 2% when benchmarks were introduced. Google DeepMind's Aletheia achieved autonomous PhD-level research results, while OpenAI solved 5 of 10 extremely difficult research problems in the new First Proof challenge.

AIBullishGoogle DeepMind Blog · Feb 127/108

🧠

Gemini 3 Deep Think: Advancing science, research and engineering

Gemini 3 Deep Think represents an updated specialized reasoning mode designed to tackle complex challenges in modern science, research, and engineering. The advancement focuses on enhanced problem-solving capabilities for technical and scientific applications.

AIBullishMIT News – AI · Feb 27/108

🧠

How generative AI can help scientists synthesize complex materials

MIT researchers developed DiffSyn, a generative AI model that provides recipes for synthesizing new materials. This breakthrough could accelerate scientific experimentation by reducing the time from hypothesis to practical application.

AINeutralImport AI (Jack Clark) · Jan 267/104

🧠

Import AI 442: Winners and losers in the AI economy; math proof automation; and industrialization of cyber espionage

Import AI newsletter Issue 442 discusses major developments in AI automation for mathematical proofs, featuring the Numina-Lean-Agent system. The article explores broader implications of AI advancement on economic winners and losers, along with concerns about the industrialization of cyber espionage capabilities.

AINeutralGoogle DeepMind Blog · Dec 117/104

🧠

Deepening our partnership with the UK AI Security Institute

Google DeepMind and the UK AI Security Institute (AISI) are strengthening their collaboration on critical AI safety and security research. This partnership aims to advance research in AI safety measures and security protocols.

AIBullishOpenAI News · Dec 117/104

🧠

Ten years

OpenAI publishes a ten-year retrospective highlighting their journey from early research to deploying widely-used AI systems that have transformed capabilities across industries. The company reflects on key lessons learned while maintaining their commitment to developing artificial general intelligence (AGI) that serves humanity's benefit.

AIBullishMIT News – AI · Dec 57/106

🧠

MIT researchers “speak objects into existence” using AI and robotics

MIT researchers have developed a speech-to-reality system that combines 3D generative AI with robotic assembly to create physical objects on demand from voice commands. The technology represents a significant advancement in AI-driven manufacturing and automation capabilities.

AINeutralOpenAI News · Nov 77/107

🧠

Understanding prompt injections: a frontier security challenge

Prompt injections represent a significant security vulnerability in AI systems, requiring specialized research and countermeasures. OpenAI is actively developing safeguards and training methods to protect users from these frontier attacks.

AIBullishOpenAI News · Sep 57/107

🧠

Why language models hallucinate

OpenAI has published new research explaining the underlying causes of language model hallucinations. The study demonstrates how better evaluation methods can improve AI systems' reliability, honesty, and safety performance.

AINeutralOpenAI News · Sep 57/106

🧠

GPT-5 bio bug bounty call

OpenAI has launched a Bio Bug Bounty program inviting researchers to test GPT-5's safety protocols using universal jailbreak prompts. The program offers rewards up to $25,000 for successfully identifying vulnerabilities in the upcoming AI model's biological safety measures.

AIBullishSynced Review · Jun 167/105

🧠

MIT Researchers Unveil “SEAL”: A New Step Towards Self-Improving AI

MIT researchers have developed SEAL, a new framework that enables large language models to self-edit and update their own weights through reinforcement learning. This represents a significant advancement toward creating AI systems capable of autonomous self-improvement.

AIBullishGoogle DeepMind Blog · Nov 187/105

🧠

The AI for Science Forum: A new era of discovery

The AI Science Forum showcases artificial intelligence's transformative potential in accelerating scientific discovery and addressing global challenges. The forum emphasizes the critical need for collaboration between scientists, policymakers, and industry leaders to maximize AI's impact on research and innovation.

AIBullishOpenAI News · Oct 237/105

🧠

Simplifying, stabilizing, and scaling continuous-time consistency models

Researchers have developed improved continuous-time consistency models that achieve sample quality comparable to leading diffusion models while requiring only two sampling steps. This represents a significant efficiency breakthrough in AI model sampling technology.

AIBullishOpenAI News · Dec 147/105

🧠

Superalignment Fast Grants

A new $10 million grant program has been launched to fund technical research focused on aligning and ensuring the safety of superhuman AI systems. The initiative targets key areas including weak-to-strong generalization, interpretability, and scalable oversight methods.

AIBullishOpenAI News · Jun 117/106

🧠

Improving language understanding with unsupervised learning

Researchers achieved state-of-the-art results on diverse language tasks using a scalable system combining transformers and unsupervised pre-training. The approach demonstrates that pairing supervised learning with unsupervised pre-training is highly effective for language understanding tasks.

AIBullishOpenAI News · Mar 167/104

🧠

Learning to communicate

OpenAI has published new research demonstrating that AI agents can develop their own communication language. This research explores emergent communication capabilities in artificial intelligence systems.

CryptoBullishEthereum Foundation Blog · Jan 197/101

⛓️

An Update on Integrating Zcash on Ethereum (ZoE)

Ethereum R&D team and Zcash Company are collaborating on the Zcash on Ethereum (ZoE) research project, which aims to combine blockchain programmability with privacy features. This joint initiative explores integrating Zcash's privacy capabilities with Ethereum's smart contract functionality.

$ETH

AINeutralarXiv – CS AI · Jun 256/10

🧠

GCT-MARL: Graph-Based Contrastive Transfer for Sample-Efficient Cooperative Multi-Agent Reinforcement Learning

Researchers introduce GCT-MARL, a transfer learning framework for multi-agent reinforcement learning that enables faster training across different environments by combining graph-based contrastive learning with adaptive alignment techniques. The method demonstrates significant convergence improvements over from-scratch training in both homogeneous and heterogeneous agent scenarios, while supporting continual learning across sequential tasks.

← PrevPage 12 of 42Next →