y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#research News & Analysis

913 articles tagged with #research. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

913 articles
AIBullisharXiv – CS AI · Feb 275/106
🧠

Invariant Transformation and Resampling based Epistemic-Uncertainty Reduction

Researchers propose a new AI inference method that uses invariant transformations and resampling to reduce epistemic uncertainty and improve model accuracy. The approach involves applying multiple transformed versions of an input to a trained AI model and aggregating the outputs for more reliable results.

AIBullisharXiv – CS AI · Feb 276/106
🧠

PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering

Researchers have developed PATRA, a new AI model that improves time series question answering by better understanding patterns like trends and seasonality. The model addresses limitations in existing LLM approaches that treat time series data as simple text or images, introducing pattern-aware mechanisms and balanced learning across tasks of varying difficulty.

AIBullisharXiv – CS AI · Feb 276/107
🧠

On Sample-Efficient Generalized Planning via Learned Transition Models

Researchers propose a new approach to generalized planning that learns explicit transition models rather than directly predicting action sequences. This method achieves better out-of-distribution performance with fewer training instances and smaller models compared to Transformer-based planners like PlanGPT.

AIBullisharXiv – CS AI · Feb 276/108
🧠

Deep Sequence Modeling with Quantum Dynamics: Language as a Wave Function

Researchers introduce a quantum-inspired sequence modeling framework that uses complex-valued wave functions and quantum interference for language processing. The approach shows theoretical advantages over traditional recurrent neural networks by utilizing quantum dynamics and the Born rule for token probability extraction.

AINeutralarXiv – CS AI · Feb 276/107
🧠

SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy

Researchers have developed SPM-Bench, a PhD-level benchmark for testing large language models on scanning probe microscopy tasks. The benchmark uses automated data synthesis from scientific papers and introduces new evaluation metrics to assess AI reasoning capabilities in specialized scientific domains.

AINeutralarXiv – CS AI · Feb 276/104
🧠

Correcting Human Labels for Rater Effects in AI Evaluation: An Item Response Theory Approach

Researchers propose using psychometric modeling to correct systematic biases in human evaluations of AI systems, demonstrating how Item Response Theory can separate true AI output quality from rater behavior inconsistencies. The approach was tested on OpenAI's summarization dataset and showed improved reliability in measuring AI model performance.

AIBullisharXiv – CS AI · Feb 276/107
🧠

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

Researchers introduce AMA-Bench, a new benchmark for evaluating long-horizon memory in AI agents deployed in real-world applications. The study reveals existing memory systems underperform due to lack of causality and objective information, while their proposed AMA-Agent system achieves 57.22% accuracy, surpassing baselines by 11.16%.

AIBullishMicrosoft Research Blog · Feb 266/102
🧠

CORPGEN advances AI agents for real work

Microsoft Research introduces CORPGEN, a new approach to advance AI agents for real-world workplace scenarios. The system aims to help AI agents handle multiple interdependent tasks simultaneously, similar to how knowledge workers juggle various responsibilities throughout their workday.

AIBullishOpenAI News · Feb 136/107
🧠

Scaling social science research

OpenAI has released GABRIEL, an open-source toolkit that leverages GPT to convert qualitative text and images into quantitative data for social science research. This tool enables researchers to analyze large-scale qualitative data more efficiently and systematically.

AIBullishGoogle DeepMind Blog · Jan 296/106
🧠

Project Genie: Experimenting with infinite, interactive worlds

Google has launched Project Genie, an experimental AI research prototype that allows Google AI Ultra subscribers in the U.S. to create and explore interactive virtual worlds. The project represents Google's continued expansion into AI-powered creative tools and immersive experiences.

AINeutralGoogle Research Blog · Jan 276/105
🧠

ATLAS: Practical scaling laws for multilingual models

ATLAS presents new scaling laws for multilingual generative AI models, providing practical frameworks for understanding how model performance scales across different languages and model sizes. This research offers valuable insights for optimizing multilingual AI system development and deployment strategies.

AIBullishMicrosoft Research Blog · Jan 206/101
🧠

Multimodal reinforcement learning with agentic verifier for AI agents

Microsoft Research introduces Argos, a multimodal reinforcement learning approach that uses an agentic verifier to evaluate whether AI agents' reasoning aligns with their observations over time. The system reduces visual hallucinations and creates more reliable, data-efficient agents for real-world applications.

Multimodal reinforcement learning with agentic verifier for AI agents
AIBullishMIT News – AI · Jan 145/109
🧠

At MIT, a continued commitment to understanding intelligence

MIT has renamed and expanded its intelligence research initiative to the MIT Siegel Family Quest for Intelligence with support from the Siegel Family Endowment. The program focuses on understanding how brains produce intelligence and developing methods to replicate this intelligence for practical problem-solving applications.

AINeutralMIT News – AI · Jan 56/104
🧠

MIT scientists investigate memorization risk in the age of clinical AI

MIT researchers have developed methods to test AI models used in clinical settings to prevent them from inadvertently revealing anonymized patient health data through memorization. This research addresses a critical privacy and security concern as healthcare AI systems become more prevalent.

AIBullishGoogle Research Blog · Dec 186/105
🧠

Google Research 2025: Bolder breakthroughs, bigger impact

Google Research published their 2025 outlook highlighting planned breakthroughs and expanded impact across their research initiatives. The article appears to be a year-end review focusing on Google's research achievements and future direction.

AIBullishMIT News – AI · Dec 186/107
🧠

Guided learning lets “untrainable” neural networks realize their potential

CSAIL researchers have developed a guidance method that enables previously "untrainable" neural networks to learn effectively by leveraging the built-in biases of other networks. This breakthrough could unlock the potential of neural network architectures that were previously considered ineffective for training.

AIBullishGoogle DeepMind Blog · Oct 295/104
🧠

Accelerating discovery with the AI for Math Initiative

The AI for Math Initiative is launching with participation from leading research institutions worldwide to advance the application of artificial intelligence in mathematical research. This collaborative effort aims to accelerate mathematical discovery through AI-powered tools and methodologies.

AIBullishOpenAI News · Aug 286/106
🧠

Supporting nonprofit and community innovation

OpenAI announces a $50 million People-First AI Fund to support U.S. nonprofits in scaling their impact through AI applications. The fund will provide grants for organizations working in education, healthcare, and research, with applications opening from September 8 to October 8, 2025.

AINeutralOpenAI News · Jul 176/106
🧠

Agent bio bug bounty call

OpenAI has launched a Bio Bug Bounty program inviting researchers to test ChatGPT agent's safety mechanisms using universal jailbreak prompts. The program offers rewards up to $25,000 for identifying vulnerabilities in the AI system's safety protocols.

AIBullishHugging Face Blog · Jul 106/108
🧠

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover represents a breakthrough in formal reasoning by applying test-time reinforcement learning search to large language models. This approach enhances mathematical proof generation and formal verification capabilities, potentially advancing AI's ability to handle complex logical reasoning tasks.

AIBullishGoogle Research Blog · Jun 236/105
🧠

Unlocking rich genetic insights through multimodal AI with M-REGLE

The article introduces M-REGLE, a new multimodal AI system designed to unlock genetic insights through advanced artificial intelligence techniques. This represents a significant advancement in the application of AI to genetic research and analysis.

← PrevPage 25 of 37Next →