AI × CryptoBullisharXiv – CS AI · 7h ago6/10
🤖
PoQ-Judge: A Multi-Architecture Evaluation Framework for Cost-Aware Proof-of-Quality in Decentralized LLM Inference
PoQ-Judge introduces a reference-free quality evaluation framework for decentralized LLM inference networks using lightweight judge models trained on UltraFeedback and GPT-labeled data. The framework achieves 0.747 Pearson correlation with ground-truth benchmarks while reducing evaluation costs by 72.7% through cascade evaluation, addressing a critical infrastructure need for decentralized AI systems.