y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#open-source News & Analysis

329 articles tagged with #open-source. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

329 articles
AIBullisharXiv – CS AI Β· Mar 56/10
🧠

Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

Researchers successfully developed multimodal large language models for Basque, a low-resource language, finding that only 20% Basque training data is needed for solid performance. The study demonstrates that specialized Basque language backbones aren't required, potentially enabling MLLM development for other underrepresented languages.

🧠 Llama
AINeutralarXiv – CS AI Β· Mar 57/10
🧠

ERDES: A Benchmark Video Dataset for Retinal Detachment and Macular Status Classification in Ocular Ultrasound

Researchers have released ERDES, the first open-access dataset of ocular ultrasound videos for detecting retinal detachment and macular status using machine learning. The dataset addresses a critical gap in automated medical diagnosis by enabling AI models to classify retinal detachment severity, which is essential for determining surgical urgency.

AIBullisharXiv – CS AI Β· Mar 57/10
🧠

Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

Stanford researchers introduced Merlin, a 3D vision-language foundation model for analyzing abdominal CT scans that processes volumetric medical images alongside electronic health records and radiology reports. The model was trained on over 6 million images from 15,331 CT scans and demonstrated superior performance compared to existing 2D models across 752 individual medical tasks.

AIBullisharXiv – CS AI Β· Mar 57/10
🧠

mlx-snn: Spiking Neural Networks on Apple Silicon via MLX

Researchers have released mlx-snn, the first spiking neural network library built natively for Apple's MLX framework, targeting Apple Silicon hardware. The library demonstrates 2-2.5x faster training and 3-10x lower GPU memory usage compared to existing PyTorch-based solutions, achieving 97.28% accuracy on MNIST classification tasks.

AIBullisharXiv – CS AI Β· Mar 57/10
🧠

Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model

Researchers have introduced Kaleido, an open-source AI model for generating consistent videos from multiple reference images of subjects. The framework addresses key limitations in subject-to-video generation through improved data construction and a novel Reference Rotary Positional Encoding technique.

AIBullisharXiv – CS AI Β· Mar 57/10
🧠

MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

MemSifter is a new AI framework that uses smaller proxy models to handle memory retrieval for large language models, addressing computational costs in long-term memory tasks. The system uses reinforcement learning to optimize retrieval accuracy and has been open-sourced with demonstrated performance improvements on benchmark tests.

AIBullisharXiv – CS AI Β· Mar 57/10
🧠

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Researchers introduce Agent Data Protocol (ADP), a standardized format for unifying diverse AI agent training datasets across different formats and tools. The protocol enabled training on 13 unified datasets, achieving ~20% performance gains over base models and state-of-the-art results on coding, browsing, and tool use benchmarks.

AINeutralarXiv – CS AI Β· Mar 57/10
🧠

Bridging the Reproducibility Divide: Open Source Software's Role in Standardizing Healthcare AI

A study reveals that 74% of healthcare AI research papers still use private datasets or don't share code, creating reproducibility issues that undermine trust in medical AI applications. Papers that embrace open practices by sharing both public datasets and code receive 110% more citations on average, demonstrating clear benefits for scientific impact.

AIBullishMicrosoft Research Blog Β· Mar 47/101
🧠

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model

Microsoft Research announces Phi-4-reasoning-vision-15B, a 15 billion parameter open-weight multimodal reasoning model. The model is designed for vision-language tasks including image captioning and is available through Microsoft Foundry, HuggingFace, and GitHub.

AIBullisharXiv – CS AI Β· Mar 47/103
🧠

Param$\Delta$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Researchers introduce Paramβˆ†, a novel method for transferring post-training capabilities to updated language models without additional training costs. The technique achieves 95% performance of traditional post-training by computing weight differences between base and post-trained models, offering significant cost savings for AI model development.

AIBullisharXiv – CS AI Β· Mar 46/102
🧠

Rethinking Code Similarity for Automated Algorithm Design with LLMs

Researchers introduce BehaveSim, a new method to measure algorithmic similarity by analyzing problem-solving behavior rather than code syntax. The approach enhances AI-driven algorithm design frameworks and enables systematic analysis of AI-generated algorithms through behavioral clustering.

AIBullisharXiv – CS AI Β· Mar 47/103
🧠

Can Computational Reducibility Lead to Transferable Models for Graph Combinatorial Optimization?

Researchers developed a new neural solver model using GCON modules and energy-based loss functions that achieves state-of-the-art performance across multiple graph combinatorial optimization tasks. The study demonstrates effective transfer learning between related optimization problems through computational reducibility-informed pretraining strategies, representing progress toward foundational AI models for combinatorial optimization.

AIBullisharXiv – CS AI Β· Mar 46/103
🧠

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

Researchers have developed TikZilla, a new AI model that generates high-quality scientific figures from text descriptions using TikZ code. The model uses a dataset four times larger than previous versions and combines supervised learning with reinforcement learning to achieve performance matching GPT-5 while using much smaller model sizes.

AIBullisharXiv – CS AI Β· Mar 46/106
🧠

SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

SuperLocalMemory is a new privacy-preserving memory system for multi-agent AI that defends against memory poisoning attacks through local-first architecture and Bayesian trust scoring. The open-source system eliminates cloud dependencies while providing personalized retrieval through adaptive learning-to-rank, demonstrating strong performance metrics including 10.6ms search latency and 72% trust degradation for sleeper attacks.

AIBullisharXiv – CS AI Β· Mar 47/102
🧠

Efficient Agent Training for Computer Use

Researchers introduced PC Agent-E, an efficient AI agent training framework that achieves human-like computer use with minimal human demonstration data. Starting with just 312 human-annotated trajectories and augmenting them with Claude 3.7 Sonnet synthesis, the model achieved 141% relative improvement and outperformed Claude 3.7 Sonnet by 10% on WindowsAgentArena-V2 benchmark.

AIBullisharXiv – CS AI Β· Mar 47/104
🧠

OpenClaw, Moltbook, and ClawdLab: From Agent-Only Social Networks to Autonomous Scientific Research

Researchers introduced ClawdLab, an open-source platform for autonomous AI scientific research, following analysis of OpenClaw framework and Moltbook social network that revealed security vulnerabilities across 131 agent skills and over 15,200 exposed control panels. The platform addresses identified failure modes through structured governance and multi-model orchestration in fully decentralized AI systems.

AIBullisharXiv – CS AI Β· Mar 46/104
🧠

xLLM Technical Report

xLLM is a new open-source Large Language Model inference framework that delivers significantly improved performance for enterprise AI deployments. The framework achieves 1.7-2.2x higher throughput compared to existing solutions like MindIE and vLLM-Ascend through novel architectural optimizations including decoupled service-engine design and intelligent scheduling.

AIBullisharXiv – CS AI Β· Mar 47/103
🧠

LEDOM: Reverse Language Model

Researchers have developed LEDOM, an open-source reverse autoregressive language model that trains right-to-left instead of the traditional left-to-right approach. The model demonstrates unique capabilities like abductive inference and question synthesis, and when combined with forward models through 'Reverse Reward' scoring, achieves significant performance gains of up to 15% on mathematical reasoning tasks.

AIBullisharXiv – CS AI Β· Mar 47/103
🧠

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Researchers developed D2E (Desktop to Embodied AI), a framework that uses desktop gaming data to pretrain AI models for robotics tasks. Their 1B-parameter model achieved 96.6% success on manipulation tasks and 83.3% on navigation, matching performance of models up to 7 times larger while using scalable desktop data instead of expensive physical robot training data.

AIBullisharXiv – CS AI Β· Mar 37/103
🧠

RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

Researchers introduce RoboPARA, a new LLM-driven framework that optimizes dual-arm robot task planning through parallel processing and dependency mapping. The system uses directed acyclic graphs to maximize efficiency in complex multitasking scenarios and includes the first dataset specifically designed for evaluating dual-arm parallelism.

AIBullisharXiv – CS AI Β· Mar 37/104
🧠

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Researchers have developed AReaL, a new asynchronous reinforcement learning system that dramatically improves the efficiency of training large language models for reasoning tasks. The system achieves up to 2.77x training speedup compared to traditional synchronous methods by decoupling generation from training processes.

AIBullisharXiv – CS AI Β· Mar 37/103
🧠

FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

Researchers introduce FreeKV, a training-free optimization framework that dramatically improves KV cache retrieval efficiency for large language models with long context windows. The system achieves up to 13x speedup compared to existing methods while maintaining near-lossless accuracy through speculative retrieval and hybrid memory layouts.

$NEAR
AIBullisharXiv – CS AI Β· Mar 37/104
🧠

Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k

Open-Sora 2.0 is a commercial-level video generation model that achieves performance comparable to leading models like Runway Gen-3 Alpha while costing only $200k to train. The fully open-source model demonstrates significant cost reduction in AI video generation training through optimized data curation, architecture, and training strategies.

AIBullisharXiv – CS AI Β· Mar 37/103
🧠

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

Researchers introduce AceGRPO, a new reinforcement learning framework for Autonomous Machine Learning Engineering that addresses behavioral stagnation in current LLM-based agents. The Ace-30B model trained with this method achieves 100% valid submission rate on MLE-Bench-Lite and matches performance of proprietary frontier models while outperforming larger open-source alternatives.