βBack to feed
π§ AIβͺ NeutralImportance 6/10
Not All Queries Need Rewriting: When Prompt-Only LLM Refinement Helps and Hurts Dense Retrieval
π€AI Summary
Research reveals that LLM query rewriting in RAG systems shows highly domain-dependent performance, degrading retrieval effectiveness by 9% in financial domains while improving it by 5.1% in scientific contexts. The study identifies that effectiveness depends on whether rewriting improves or worsens lexical alignment between queries and domain-specific terminology.
Key Takeaways
- βLLM query rewriting performance varies dramatically across domains, with 9% degradation in finance and 5.1% improvement in scientific retrieval tasks.
- βRewriting degrades performance when it replaces domain-specific terms in already well-matched queries, reducing lexical alignment.
- βImprovements occur when rewriting shifts queries toward corpus-preferred terminology and resolves inconsistent nomenclature.
- β95% of all rewrites involve lexical substitution, with effectiveness dependent on the direction rather than presence of substitution.
- βDomain-adaptive post-training is recommended as a safer strategy than prompt-only rewriting in well-optimized verticals.
Read Original βvia arXiv β CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains β you keep full control of your keys.
Related Articles