#semantic-embeddings News & Analysis

5 articles tagged with #semantic-embeddings. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles

AIBullisharXiv – CS AI · Jun 97/10

🧠

Beyond Item IDs: Scaling Short-Form-Video Recommendation via Semantic-Native Long Sequence Modeling

Researchers present a production-deployed recommendation system that scales short-form video suggestions to billion-user scale by replacing traditional Video IDs with semantic-native representations and introducing a compression transformer to reduce computational complexity. The framework achieves order-of-magnitude improvements in memory efficiency and enables longer user behavior sequences, delivering measurable gains in user engagement and content consumption metrics.

AIBullisharXiv – CS AI · Jun 27/10

🧠

SENSE: Semantic Embedding Navigation with Soft-gated Evaluation for Retrieval-based Speculative Decoding

SENSE is a new retrieval-based speculative decoding method that accelerates LLM inference by using semantic embeddings instead of lexical matching to retrieve candidate tokens. The approach achieves up to 3.26x speedup while maintaining generation quality, outperforming existing methods on LLaMA and Qwen models.

AIBullisharXiv – CS AI · May 77/10

🧠

Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models

Researchers introduce SemGrad, a gradient-based uncertainty quantification method for large language models that operates in semantic space rather than parameter space, eliminating the computational overhead of sampling-based approaches. The method measures output stability under semantically equivalent input perturbations to gauge LLM confidence, addressing the critical challenge of hallucinations in free-form text generation.

AIBullisharXiv – CS AI · Jun 16/10

🧠

Breaking Information Cocoons: A Hyperbolic Framework for Balancing Exploration and Exploitation in Recommender Systems

Researchers propose HERec, a hyperbolic-geometry-based recommender system framework that balances content exploration and exploitation while mitigating information cocoons. The system combines semantic-enhanced hierarchical mechanisms with automatic clustering to improve diversity by 11.39% and utility by 5.49% over existing approaches.

AINeutralarXiv – CS AI · May 286/10

🧠

Semantic Flow Regularization: Teaching LLMs to Generate Diverse Yet Coherent Responses

Researchers propose Semantic Flow Regularization (SFR), a novel training technique that addresses the problem of large language models generating repetitive, low-diversity responses when fine-tuned for specific styles or personas. SFR uses conditional flow matching to preserve output diversity while maintaining coherence, demonstrating improvements across dialogue systems and code generation tasks without adding inference costs.