#weak-supervision News & Analysis

11 articles tagged with #weak-supervision. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

11 articles

AIBullisharXiv – CS AI · Jun 27/10

🧠

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

Researchers propose Preference Delta Aggregation (PDA), a framework that combines weak preference signals from multiple smaller language model pairs into LoRA adapters, then merges them using Geometric Alignment Merging to improve larger models. The approach achieves 6.8-7.3 point improvements on knowledge reasoning and agentic search benchmarks by effectively composing complementary capabilities.

AINeutralarXiv – CS AI · Jun 27/10

🧠

Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight

Researchers propose On-Policy Critique Distillation (OPCD), a method enabling weak AI models to effectively supervise stronger ones by providing revision guidance rather than direct answers. The approach filters high-quality critiques and distills them into stronger models through adaptive learning, advancing scalable oversight for complex tasks.

AIBullisharXiv – CS AI · May 127/10

🧠

Weakly Supervised Concept Learning for Object-centric Visual Reasoning

Researchers present a weakly supervised learning approach that combines neural networks with symbolic AI for object-centric reasoning tasks, requiring only 1% of typical labels while outperforming foundation models in domain generalization. The method bridges perception and logical reasoning by using slot-based architectures and VAEs to ground symbolic outputs for frameworks like Inductive Logic Programming.

AIBullisharXiv – CS AI · Apr 107/10

🧠

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations

Researchers developed a weak supervision framework to detect hallucinations in large language models by distilling grounding signals into transformer representations during training. Using substring matching, sentence embeddings, and LLM judges, they created a 15,000-sample dataset and trained five probing classifiers that achieve hallucination detection from internal activations alone at inference time, eliminating the need for external verification systems.

AIBullisharXiv – CS AI · Mar 177/10

🧠

Incentivizing Strong Reasoning from Weak Supervision

Researchers have developed a novel method to enhance large language model reasoning capabilities using supervision from weaker models, achieving 94% of expensive reinforcement learning gains at a fraction of the cost. This weak-to-strong supervision paradigm offers a promising alternative to costly traditional methods for improving LLM reasoning performance.

AINeutralarXiv – CS AI · Mar 117/10

🧠

A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

Researchers have developed Guardian, an AI system using multiple large language models (LLMs) to assist in missing-person investigations during the critical first 72 hours. The system employs a consensus-driven pipeline that coordinates specialized LLM models for information extraction and processing, with fine-tuning using QLoRA methodology.

AINeutralarXiv – CS AI · Jun 116/10

🧠

LASA: A Weak Supervision Method for Open-Vocabulary Scene Sketch Semantic Segmentation

Researchers introduce LASA, a weak supervision method for open-vocabulary sketch semantic segmentation that aggregates multi-layer Vision Transformer attention maps to capture complementary spatial cues. The approach achieves significant improvements over baselines without requiring pixel-level annotations, advancing computer vision capabilities for sparse line drawing interpretation.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Researchers propose WMSS, a post-training optimization method that leverages weak model checkpoints to improve strong language models beyond conventional saturation points. The approach identifies and addresses learning gaps through entropy dynamics, achieving performance gains in mathematical reasoning and code generation without additional inference costs.

AINeutralarXiv – CS AI · May 276/10

🧠

Rethinking Weak Supervision in Anomaly Detection: A Comprehensive Benchmark

Researchers introduce WSADBench, the first unified benchmark for weakly supervised anomaly detection (WSAD) that evaluates 36 algorithms across 4 modalities and over 700K experiments. The study reveals that specialized WSAD methods only outperform in extreme label-scarcity scenarios, while general foundation models and classification approaches dominate with increased supervision, fundamentally challenging current research isolation.

AINeutralarXiv – CS AI · May 126/10

🧠

WISTERIA: Learning Clinical Representations from Noisy Supervision via Multi-View Consistency in Electronic Health Records

WISTERIA is a machine learning framework that improves clinical AI by treating noisy medical labels as uncertain observations rather than ground truth. By enforcing consistency across multiple weak supervision sources and incorporating medical ontologies, the method achieves better generalization across healthcare institutions and demonstrates robustness to label noise.

AINeutralarXiv – CS AI · May 126/10

🧠

Multi-Tier Labeling and Physics-Informed Learning for Orbital Anomaly Detection at Scale

Researchers developed a multi-tier labeling system combining physics-based rules, Kalman filtering, and machine learning to detect orbital anomalies across thousands of LEO satellites. The approach generated 8.6M labeled training sequences from 232M historical records, enabling a Transformer model to achieve 55.4% maneuver recall and 62.8% decay recall—addressing a critical gap in space situational awareness infrastructure.