AIBullisharXiv โ CS AI ยท Mar 56/10
๐ง
SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
Researchers introduce SHE (Stepwise Hybrid Examination), a new reinforcement learning framework that improves AI-powered e-commerce search relevance prediction. The framework addresses limitations in existing training methods by using step-level rewards and hybrid verification to enhance both accuracy and interpretability of search results.