y0news

#theoretical-cs News & Analysis

13 articles tagged with #theoretical-cs. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · 6d ago · 6/10

Exact Unlearning in Reinforcement Learning

Researchers present a framework for exact unlearning in reinforcement learning that enables efficient removal of user data upon request, with computational costs only a ρ√ln T fraction of full retraining. The work establishes both an algorithm achieving near-optimal regret bounds for tabular MDPs and matching lower bounds, advancing the theoretical foundation for privacy-preserving machine learning systems.

AI · Neutral · arXiv – CS AI · 6d ago · 6/10

A Unified Framework for Locality in Scalable MARL

Researchers present a unified mathematical framework for certifying locality in scalable multi-agent reinforcement learning (MARL) systems by decomposing the state-transition matrix into environment and policy sensitivity components. The approach uses spectral radius analysis to weaken prior Dobrushin bounds and applies temperature-scaled softmax policies to control locality, enabling exponentially decaying truncation bias in networked agent systems.
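As a rough illustration of why temperature matters here, the sketch below implements a generic temperature-scaled softmax policy and measures how much the action probabilities move when the logits change. It is not the paper's construction: `softmax_policy` and `max_prob_shift` are hypothetical helper names, and the logits stand in for whatever local information an agent's policy depends on.

```python
import math

def softmax_policy(logits, temperature):
    """Action probabilities proportional to exp(logit / temperature)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def max_prob_shift(logits_a, logits_b, temperature):
    """Largest per-action probability change between two logit vectors."""
    pa = softmax_policy(logits_a, temperature)
    pb = softmax_policy(logits_b, temperature)
    return max(abs(x - y) for x, y in zip(pa, pb))

if __name__ == "__main__":
    # The same logit perturbation moves the policy less at higher temperature.
    for temperature in (1.0, 10.0):
        print(temperature, max_prob_shift([1.0, 0.0], [0.0, 1.0], temperature))
```

A higher temperature flattens the policy, so the same change in inputs produces a smaller change in behavior. That is the intuition behind using temperature as a knob on policy sensitivity.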

AI · Neutral · arXiv – CS AI · Jun 2 · 6/10

MINTS: Minimalist Thompson Sampling

Researchers introduce MINTS (Minimalist Thompson Sampling), a Bayesian framework that simplifies sequential decision-making under uncertainty by placing priors only on optimal parameters while eliminating unnecessary variables through profile likelihood. The approach achieves near-optimal regret bounds for multi-armed bandits and automatically adapts to structural constraints, matching classical performance benchmarks.
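For context, the classical baseline that MINTS is compared against can be sketched in a few lines. The code below is standard Beta-Bernoulli Thompson sampling for a multi-armed bandit, not the MINTS procedure itself; the function name `thompson_bernoulli`, the uniform Beta(1, 1) prior, and the arm means in the usage example are assumptions chosen for illustration.

```python
import random

def thompson_bernoulli(true_means, rounds, seed=0):
    """Run classical Thompson sampling and return the pull count per arm."""
    rng = random.Random(seed)
    k = len(true_means)
    successes = [0] * k
    failures = [0] * k
    pulls = [0] * k
    for _ in range(rounds):
        # Sample a plausible mean for each arm from its Beta posterior
        # (Beta(1, 1) prior, updated with observed successes and failures).
        samples = [
            rng.betavariate(successes[i] + 1, failures[i] + 1)
            for i in range(k)
        ]
        # Play the arm whose sampled mean is largest.
        arm = max(range(k), key=lambda i: samples[i])
        reward = 1 if rng.random() < true_means[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls

if __name__ == "__main__":
    print(thompson_bernoulli([0.2, 0.8], rounds=2000))
```

Classical Thompson sampling keeps a posterior over every arm's mean. According to the summary, MINTS instead places the prior only on the optimal parameters.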

AI · Neutral · arXiv – CS AI · Jun 2 · 6/10

Algebraic anti-unification

Researchers have developed an algebraic (semantic) theory of anti-unification that extends abstraction and generalization from syntactic term-based systems to arbitrary algebras. This theoretical computer science advancement moves anti-unification beyond equational theories and establishes foundational properties compatible with homomorphisms and isomorphisms, with computability analysis for finite algebras.

AI · Neutral · arXiv – CS AI · May 29 · 6/10

The Sample Complexity of Multiclass and Sparse Contextual Bandits

Researchers present optimal algorithms for sparse contextual bandits that achieve sample complexity of Õ((s/ε² + |A|/ε)log|Π|/δ), closing a gap from prior work that had exponential dependence on action set size. The results apply to multiclass classification and combinatorial semi-bandits through information-theoretic and algorithmic approaches.

AI · Neutral · arXiv – CS AI · May 29 · 5/10

On Language Generation in the Limit with Bounded Memory

This theoretical computer science paper investigates language generation under bounded memory constraints, extending classical learning theory to a practical setting where algorithms cannot retain complete historical information. The research characterizes when language generation remains possible with various memory limitations and reveals that bounded memory affects different learning tasks—generation, density optimization, and identification—in fundamentally different ways.

AI · Neutral · arXiv – CS AI · May 29 · 5/10

Approximate Proportionality in Online Fair Division

Researchers resolve a gap in online fair division theory by proving that proportionality up to one good (PROP1) cannot be approximated by standard greedy algorithms against adaptive adversaries, but can be achieved through randomized allocation or learning-augmented approaches with predictions.

🏢 Meta
AI · Neutral · arXiv – CS AI · May 28 · 6/10

Stochastic Gradient Descent with Momentum is Algorithmically Stable

Researchers have demonstrated that Stochastic Gradient Descent with Momentum (SGDM), a fundamental optimization algorithm in machine learning, maintains strong generalization properties through algorithmic stability analysis. The study resolves a longstanding conjecture that momentum, while accelerating training, might harm generalization performance, providing tight stability bounds applicable to both Polyak's and Nesterov's momentum schemes.
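The two momentum schemes named above differ only in where the gradient is evaluated. The sketch below uses one common textbook formulation of each update; the paper may parameterize them differently. `momentum_step` is a hypothetical helper, and the deterministic gradient of f(w) = w²/2 stands in for the minibatch gradient that SGD would supply.

```python
def momentum_step(w, v, grad_fn, lr, mu, nesterov=False):
    """One momentum update on a scalar parameter w with velocity v.

    Polyak (heavy ball): gradient taken at the current point w.
    Nesterov: gradient taken at the look-ahead point w + mu * v.
    """
    point = w + mu * v if nesterov else w
    v_new = mu * v - lr * grad_fn(point)
    return w + v_new, v_new

def run(grad_fn, w0, steps, lr=0.1, mu=0.9, nesterov=False):
    """Iterate momentum_step from w0 with zero initial velocity."""
    w, v = w0, 0.0
    for _ in range(steps):
        w, v = momentum_step(w, v, grad_fn, lr, mu, nesterov)
    return w

if __name__ == "__main__":
    quadratic_grad = lambda w: w  # gradient of f(w) = 0.5 * w**2
    print(run(quadratic_grad, 1.0, 200, nesterov=False))
    print(run(quadratic_grad, 1.0, 200, nesterov=True))
```

In actual SGDM, `grad_fn` would return the gradient on a randomly sampled example or minibatch. Stability analysis asks how much the final `w` changes when one training example is replaced.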

AI · Neutral · arXiv – CS AI · May 12 · 5/10

Multi-Armed Bandits With Best-Action Queries

Researchers resolve an open problem in multi-armed bandit theory by characterizing how best-action oracle queries improve learning algorithms in the realistic bandit-feedback model. They prove that benefits depend critically on reward structure: correlated stochastic rewards cannot achieve the theoretical gains seen in full-feedback settings, while i.i.d. stochastic rewards maintain near-optimal improvements with logarithmic precision.

AI · Neutral · arXiv – CS AI · May 12 · 6/10

Personalized Alignment Revisited: The Necessity and Sufficiency of User Diversity

This theoretical computer science paper establishes formal conditions for efficient personalized alignment in large language models, proving that user diversity—specifically whether user-specific parameters span latent reward directions—is both necessary and sufficient for optimal statistical efficiency. The research provides rigorous mathematical foundations for adapting AI systems to heterogeneous user preferences.

AI · Neutral · arXiv – CS AI · May 11 · 6/10

The Endogeneity of Miscalibration: Impossibility and Escape in Scored Reporting

A theoretical paper demonstrates that principals using standard scoring rules to oversee strategic AI agents face an inherent impossibility: achieving both honest reporting and accurate calibration simultaneously. The research identifies step-function approval thresholds as the only mechanism that preserves calibration while maintaining incentive compatibility, with specific equivalence properties under the Brier score.
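As background on the Brier score mentioned above, the sketch below checks numerically that, when the score is the agent's only incentive, the expected Brier loss is minimized by reporting the true probability. It does not reproduce the paper's impossibility result, which concerns agents that also care about approval. The function names and the grid search are illustrative assumptions.

```python
def expected_brier(report, true_prob):
    """Expected Brier loss of reporting `report` for a binary event.

    The event occurs with probability `true_prob`; the loss is
    (report - outcome)**2, averaged over the two outcomes.
    """
    return true_prob * (1 - report) ** 2 + (1 - true_prob) * report ** 2

def best_report(true_prob, grid_size=100):
    """Grid-search the report in [0, 1] that minimizes expected Brier loss."""
    candidates = [i / grid_size for i in range(grid_size + 1)]
    return min(candidates, key=lambda q: expected_brier(q, true_prob))

if __name__ == "__main__":
    for p in (0.1, 0.5, 0.7):
        print(p, best_report(p))
```

Algebraically, the expected loss equals (report − p)² + p(1 − p), so the minimizer is the honest report p. That is what makes the Brier score a proper scoring rule.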

AI · Neutral · arXiv – CS AI · May 11 · 6/10

A Resilience Framework for Bi-Criteria Combinatorial Optimization with Bandit Feedback

Researchers introduce a resilience framework for bi-criteria combinatorial optimization under noisy conditions, extending bandit feedback algorithms from single-objective to multi-objective settings. The framework achieves sublinear regret bounds without requiring structural assumptions like linearity or submodularity, with potential applications to constrained optimization problems in machine learning and algorithmic decision-making.

AI · Neutral · arXiv – CS AI · May 11 · 6/10

Spectral Filtering for Complex Linear Dynamical Systems

Researchers introduce a spectral filtering method for learning complex-valued linear dynamical systems with sector-bounded spectrum, achieving dimension-free regret bounds for sequence prediction. The approach uses Slepian basis functions and demonstrates that learning efficiency depends on an effective dimension independent of state space size, with applications to signal processing and quantum systems.