AI · Neutral · arXiv – CS AI · 7h ago · 6/10
InFerActive: Interactive Tree-Based Exploration of LLM Sampling for Safety Evaluation
InFerActive is an interactive system that improves how AI safety evaluators assess large language models by visualizing sampling results as navigable trees rather than static spreadsheets. The tool uses breadth-first sampling to achieve equivalent harmful-response coverage with up to 5x fewer samples, significantly improving evaluation efficiency according to controlled user studies.
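To make the idea of breadth-first exploration of a sampling tree concrete, here is a minimal sketch in Python. It is not the InFerActive implementation, which the summary does not describe in detail. The toy next-token table `TOY_MODEL`, the `Node` class, and the `min_prob` and `max_depth` parameters are all hypothetical names introduced for illustration. The sketch expands a tree of token prefixes level by level and tracks the cumulative probability of each path, so shared prefixes are stored once instead of once per sampled response.

```python
from collections import deque
from dataclasses import dataclass, field

# Hypothetical stand-in for an LLM: maps a token prefix to next-token probabilities.
TOY_MODEL = {
    (): {"Sure": 0.6, "Sorry": 0.4},
    ("Sure",): {"here": 0.9, "but": 0.1},
    ("Sorry",): {"no": 1.0},
}


@dataclass
class Node:
    prefix: tuple          # tokens generated so far
    prob: float            # cumulative probability of this path
    children: list = field(default_factory=list)


def next_token_probs(prefix):
    """Return the toy next-token distribution (empty dict means end of response)."""
    return TOY_MODEL.get(prefix, {})


def expand_breadth_first(min_prob=0.05, max_depth=3):
    """Expand the response tree level by level, pruning low-probability paths."""
    root = Node(prefix=(), prob=1.0)
    queue = deque([root])
    while queue:
        node = queue.popleft()          # FIFO order gives breadth-first traversal
        if len(node.prefix) >= max_depth:
            continue
        for token, p in next_token_probs(node.prefix).items():
            path_prob = node.prob * p
            if path_prob < min_prob:    # assumed pruning rule, for illustration only
                continue
            child = Node(node.prefix + (token,), path_prob)
            node.children.append(child)
            queue.append(child)
    return root


def leaves(node):
    """Collect the complete responses (leaf nodes) of the tree."""
    if not node.children:
        return [node]
    return [leaf for child in node.children for leaf in leaves(child)]


if __name__ == "__main__":
    for leaf in leaves(expand_breadth_first()):
        print(" ".join(leaf.prefix), round(leaf.prob, 2))
```

In a tree like this, an evaluator can see that every response beginning with "Sure" shares one branch and how much probability mass that branch carries, rather than reading each sampled response as a separate spreadsheet row.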