y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#semantic-search News & Analysis

11 articles tagged with #semantic-search. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

11 articles
AINeutralarXiv โ€“ CS AI ยท Mar 267/10
๐Ÿง 

An In-Depth Study of Filter-Agnostic Vector Search on a PostgreSQL Database System: [Experiments and Analysis]

Researchers conducted the first comprehensive study of filter-agnostic vector search algorithms in a production PostgreSQL database system, revealing that real-world performance differs significantly from isolated library testing. The study found that system-level overheads often outweigh theoretical algorithmic benefits, with clustering-based approaches like ScaNN often outperforming graph-based methods like NaviX/ACORN in practice.

AIBullisharXiv โ€“ CS AI ยท Mar 46/102
๐Ÿง 

ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

ScaleDoc is a new system that enables efficient semantic analysis of large document collections using LLMs by combining offline document representation with lightweight online filtering. The system achieves 2x speedup and reduces expensive LLM calls by up to 85% through contrastive learning and adaptive cascade mechanisms.

AINeutralarXiv โ€“ CS AI ยท 2d ago6/10
๐Ÿง 

X-SYS: A Reference Architecture for Interactive Explanation Systems

Researchers introduce X-SYS, a reference architecture for building interactive explanation systems that operationalize explainable AI (XAI) across production environments. The framework addresses the gap between XAI algorithms and deployable systems by organizing around four quality attributes (scalability, traceability, responsiveness, adaptability) and five service components, with SemanticLens as a concrete implementation for vision-language models.

AINeutralarXiv โ€“ CS AI ยท 2d ago6/10
๐Ÿง 

Domain-Specific Data Generation Framework for RAG Adaptation

RAGen is a new framework for generating domain-specific training data to improve Retrieval-Augmented Generation (RAG) systems. The system creates question-answer-context triples using semantic chunking, concept extraction, and Bloom's Taxonomy principles, enabling faster adaptation of LLMs to specialized domains like scientific research and enterprise knowledge bases.

AIBullisharXiv โ€“ CS AI ยท Mar 266/10
๐Ÿง 

MDKeyChunker: Single-Call LLM Enrichment with Rolling Keys and Key-Based Restructuring for High-Accuracy RAG

Researchers introduce MDKeyChunker, a three-stage pipeline that improves RAG (Retrieval-Augmented Generation) systems by using structure-aware chunking of Markdown documents, single-call LLM enrichment, and semantic key-based restructuring. The system achieves superior retrieval performance with Recall@5=1.000 using BM25 over structural chunks, significantly improving upon traditional fixed-size chunking methods.

๐Ÿข OpenAI
AIBullisharXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

Learning Retrieval Models with Sparse Autoencoders

Researchers introduce SPLARE, a new method that uses sparse autoencoders (SAEs) to improve learned sparse retrieval in language models. The technique outperforms existing vocabulary-based approaches in multilingual and out-of-domain settings, with SPLARE-7B achieving top results on multilingual retrieval benchmarks.

AIBullisharXiv โ€“ CS AI ยท Mar 26/1014
๐Ÿง 

WisPaper: Your AI Scholar Search Engine

WisPaper is a new AI-powered academic search system that combines semantic search capabilities with automated paper validation and organization tools. The system achieved 22.26% recall on TaxoBench and 93.70% validation accuracy, addressing key limitations in current academic search engines by integrating discovery, organization, and monitoring workflows.

AIBullisharXiv โ€“ CS AI ยท Feb 276/106
๐Ÿง 

Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

Apple's App Store search team successfully implemented LLM-generated textual relevance labels to augment their ranking system, addressing data scarcity issues. A fine-tuned specialized model outperformed larger pre-trained models, generating millions of labels that improved search relevance. This resulted in a statistically significant 0.24% increase in conversion rates in worldwide A/B testing.

AIBullishOpenAI News ยท Jan 256/108
๐Ÿง 

Introducing text and code embeddings

OpenAI has launched a new embeddings endpoint in their API that enables developers to perform natural language and code tasks including semantic search, clustering, topic modeling, and classification. This represents a significant expansion of OpenAI's API capabilities for AI-powered applications.

AIBullisharXiv โ€“ CS AI ยท Feb 274/106
๐Ÿง 

ULTRA:Urdu Language Transformer-based Recommendation Architecture

Researchers developed ULTRA, a new AI architecture specifically designed for semantic content recommendation in Urdu, a low-resource language. The system uses a dual-embedding approach with query-length aware routing to improve news retrieval, achieving over 90% precision gains compared to existing methods.