y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#machine-learning News & Analysis

2457 articles tagged with #machine-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2457 articles
AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

Explaining Neural Networks in Preference Learning: a Post-hoc Inductive Logic Programming Approach

Researchers propose using Inductive Learning of Answer Set Programs (ILASP) to create interpretable approximations of neural networks trained on preference learning tasks. The approach combines dimensionality reduction through Principal Component Analysis with logic-based explanations, addressing the challenge of explaining black-box AI models while maintaining computational efficiency.

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

CAFP: A Post-Processing Framework for Group Fairness via Counterfactual Model Averaging

Researchers introduce CAFP, a post-processing framework that mitigates algorithmic bias by averaging predictions across factual and counterfactual versions of inputs where sensitive attributes are flipped. The model-agnostic approach eliminates the need for retraining or architectural modifications, making fairness interventions practical for deployed systems in high-stakes domains like credit scoring and criminal justice.

๐Ÿข Meta
AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

Fighting AI with AI: AI-Agent Augmented DNS Blocking of LLM Services during Student Evaluations

Researchers introduce AI-Sinkhole, an AI-agent augmented DNS-blocking framework that dynamically detects and temporarily blocks LLM chatbot services during proctored exams to prevent academic integrity violations. The system uses quantized LLMs for semantic classification and Pi-Hole for network-wide DNS blocking, achieving robust cross-lingual detection with F1-scores exceeding 0.83.

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

Researchers conducted a comparative analysis of demonstration selection strategies for using large language models to predict users' next point-of-interest (POI) based on historical location data. The study found that simple heuristic methods like geographical proximity and temporal ordering outperform complex embedding-based approaches in both computational efficiency and prediction accuracy, with LLMs using these heuristics sometimes matching fine-tuned model performance without additional training.

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

Neural Computers

Researchers propose Neural Computers (NCs), a new computing paradigm where AI models function as executable runtime environments rather than static predictors. The work demonstrates early NC prototypes using video models that process instructions and user actions to generate screen frames, establishing foundational I/O primitives while identifying significant challenges toward achieving general-purpose Completely Neural Computers (CNCs).

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

FedDAP: Domain-Aware Prototype Learning for Federated Learning under Domain Shift

Researchers introduce FedDAP, a federated learning framework that addresses domain shift challenges by constructing domain-specific global prototypes rather than single aggregated prototypes. The method aligns local features with prototypes from the same domain while encouraging separation from different domains, improving model generalization across heterogeneous client data.

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training

SentinelSphere is an AI-powered cybersecurity platform combining machine learning-based threat detection with LLM-driven security training to address both technical vulnerabilities and human-factor weaknesses in enterprise security. The system uses an Enhanced DNN model trained on benchmark datasets for real-time threat identification and deploys a quantized Phi-4 model for accessible security education, validated by industry professionals as intuitive and effective.

AIBullisharXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

Synthetic Homes: A Multimodal Generative AI Pipeline for Residential Building Data Generation under Data Scarcity

Researchers developed a multimodal generative AI pipeline that creates synthetic residential building datasets from publicly available county records and images, addressing critical data scarcity challenges in building energy modeling. The system achieves over 65% overlap with national reference data, enabling scalable energy research and urban simulations without relying on expensive or privacy-restricted datasets.

AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Researchers introduce OneLife, a framework for learning symbolic world models from minimal unguided exploration in complex, stochastic environments. The approach uses conditionally-activated programmatic laws within a probabilistic framework and demonstrates superior performance on 16 of 23 test scenarios, advancing autonomous construction of world models for unknown environments.

AIBearisharXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

A Study of LLMs' Preferences for Libraries and Programming Languages

A new empirical study reveals that eight major LLMs exhibit systematic biases in code generation, overusing popular libraries like NumPy in 45% of cases and defaulting to Python even when unsuitable, prioritizing familiarity over task-specific optimality. The findings highlight gaps in current LLM evaluation methodologies and underscore the need for targeted improvements in training data diversity and benchmarking standards.

AIBullisharXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

In-Context Decision Making for Optimizing Complex AutoML Pipelines

Researchers propose PS-PFN, an advanced AutoML method that extends traditional algorithm selection and hyperparameter optimization to handle modern ML pipelines with fine-tuning and ensembling. Using posterior sampling and prior-data fitted networks for in-context learning, the approach outperforms existing bandit and AutoML strategies on benchmark tasks.

AIBullisharXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism

Researchers introduce Nirvana, a Specialized Generalist Model that combines broad language capabilities with domain-specific adaptation through task-aware memory mechanisms. The model achieves competitive performance on general benchmarks while reaching lowest perplexity across specialized domains like biomedicine, finance, and law, with practical applications demonstrated in medical imaging reconstruction.

๐Ÿข Hugging Face๐Ÿข Perplexity
AINeutralarXiv โ€“ CS AI ยท Apr 106/10
๐Ÿง 

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

Facebook Research releases EB-JEPA, an open-source library for learning representations through Joint-Embedding Predictive Architectures that predict in representation space rather than pixel space. The framework demonstrates strong performance across image classification (91% on CIFAR-10), video prediction, and action-conditioned world models, making self-supervised learning more accessible for research and practical applications.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Context Engineering: A Practitioner Methodology for Structured Human-AI Collaboration

Researchers introduce Context Engineering, a structured methodology for improving AI output quality through better context assembly rather than just prompting techniques. The study of 200 AI interactions showed that structured context reduced iteration cycles from 3.8 to 2.0 and improved first-pass acceptance rates from 32% to 55%.

๐Ÿง  ChatGPT๐Ÿง  Claude
AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

InferenceEvolve: Towards Automated Causal Effect Estimators through Self-Evolving AI

Researchers introduce InferenceEvolve, an AI framework using large language models to automatically discover and refine causal inference methods. The system outperformed 58 human submissions in a recent competition and demonstrates how AI can optimize complex scientific programs through evolutionary approaches.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

REAM: Merging Improves Pruning of Experts in LLMs

Researchers propose REAM (Router-weighted Expert Activation Merging), a new method for compressing large language models that groups and merges expert weights instead of pruning them. The technique preserves model performance better than existing pruning methods while reducing memory requirements for deployment.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Decocted Experience Improves Test-Time Inference in LLM Agents

Researchers present a new approach to improve Large Language Model performance without updating model parameters by using 'decocted experience' - extracting and organizing key insights from previous interactions to guide better reasoning. The method shows effectiveness across reasoning tasks including math, web browsing, and software engineering by constructing better contextual inputs rather than simply scaling computational resources.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Optimizing Service Operations via LLM-Powered Multi-Agent Simulation

Researchers introduce an LLM-powered multi-agent simulation framework for optimizing service operations by modeling human behavior through AI agents. The method uses prompts to embed design choices and extracts outcomes from LLM responses to create a controlled Markov chain model, showing superior performance in supply chain and contest design applications.

AINeutralarXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Automatically Generating Hard Math Problems from Hypothesis-Driven Error Analysis

Researchers have developed a new automated pipeline that generates challenging math problems by first identifying specific mathematical concepts where LLMs struggle, then creating targeted problems to test these weaknesses. The method successfully reduced a leading LLM's accuracy from 77% to 45%, demonstrating its effectiveness at creating more rigorous benchmarks.

๐Ÿง  Llama
AINeutralarXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Empirical Characterization of Rationale Stability Under Controlled Perturbations for Explainable Pattern Recognition

Researchers propose a new metric to assess consistency of AI model explanations across similar inputs, implementing it on BERT models for sentiment analysis. The framework uses cosine similarity of SHAP values to detect inconsistent reasoning patterns and biased feature reliance, providing more robust evaluation of model behavior.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Memory Intelligence Agent

Researchers have developed Memory Intelligence Agent (MIA), a new AI framework that improves deep research agents through a Manager-Planner-Executor architecture with advanced memory systems. The framework enables continuous learning during inference and demonstrates superior performance across eleven benchmarks through enhanced cooperation between parametric and non-parametric memory systems.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

Scaling DPPs for RAG: Density Meets Diversity

Researchers propose ScalDPP, a new retrieval mechanism for RAG systems that uses Determinantal Point Processes to optimize both density and diversity in context selection. The approach addresses limitations in current RAG pipelines that ignore interactions between retrieved information chunks, leading to redundant contexts that reduce effectiveness.