AI · Bullish · arXiv – CS AI · 17h ago · 7/10
SpecFuse: Ensembling Large Language Models via Next-Segment Prediction
Researchers introduce SpecEM, a training-free framework for ensembling large language models that dynamically adjusts each model's contribution based on its real-time performance. The framework draws on speculative decoding principles and an online feedback mechanism to improve collaboration among different LLMs, and it shows consistent performance improvements across multiple benchmark datasets.
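
To make the idea of performance-based weighting concrete, here is a minimal Python sketch of a draft, verify, and feedback loop. It is an illustration under stated assumptions, not the method from the paper: the `Proposer` and `Scorer` interfaces, the `ensemble_generate` function, the additive weight update, and the `lr` parameter are all hypothetical, and real LLM calls are replaced by plain callables.

```python
from typing import Callable, Dict, Tuple

# Hypothetical interfaces (assumptions for illustration only):
# a Proposer maps the text so far to a candidate next segment;
# a Scorer maps (text so far, candidate segment) to a score in [0, 1].
Proposer = Callable[[str], str]
Scorer = Callable[[str, str], float]


def ensemble_generate(
    proposers: Dict[str, Proposer],
    scorers: Dict[str, Scorer],
    prompt: str,
    rounds: int = 3,
    lr: float = 0.5,
) -> Tuple[str, Dict[str, float]]:
    """Generate text segment by segment, reweighting models as we go."""
    names = list(proposers)
    # Start with uniform weights over the models.
    weights = {n: 1.0 / len(names) for n in names}
    text = prompt
    for _ in range(rounds):
        # Draft: every model proposes a candidate next segment.
        candidates = {n: proposers[n](text) for n in names}
        # Verify: every model scores every candidate; votes are weighted.
        totals = {
            author: sum(weights[j] * scorers[j](text, seg) for j in names)
            for author, seg in candidates.items()
        }
        winner = max(names, key=lambda n: totals[n])
        text += candidates[winner]
        # Feedback (assumed rule): reward the model whose segment won,
        # then renormalize so the weights sum to 1.
        weights[winner] += lr
        norm = sum(weights.values())
        weights = {n: w / norm for n, w in weights.items()}
    return text, weights
```

In this sketch, a model whose segments keep winning the weighted vote gains influence over later rounds, which is one simple way to read "adjusts each model's contribution based on real-time performance". No model parameters are updated, which matches the training-free framing in the summary.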