AI · Bullish · arXiv – CS AI · 7h ago · 7/10
TriEval: A Resource-Efficient Pipeline for LLM Bias, Toxicity, and Truthfulness Assessment
TriEval is an open-source pipeline that evaluates large language models for bias, toxicity, and truthfulness in a single run while requiring minimal computational resources. It runs on a standard laptop with no GPU cluster, which makes rigorous LLM safety testing accessible to researchers with limited budgets. The evaluation reveals significant performance differences between open-source and closed-source models.
Claude · Llama