AIBullisharXiv โ CS AI ยท 5h ago
๐ง
SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
Researchers introduce SHE (Stepwise Hybrid Examination), a new reinforcement learning framework that improves AI-powered e-commerce search relevance prediction. The framework addresses limitations in existing training methods by using step-level rewards and hybrid verification to enhance both accuracy and interpretability of search results.