y0news
#text-generation
3 articles
AI · Bullish · arXiv – CS AI · 6h ago

Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

Researchers introduce Autorubric, an open-source Python framework that standardizes rubric-based LLM evaluation of generated text. It consolidates previously scattered evaluation techniques into a single tool with configurable criteria, multi-judge ensembles, bias mitigation, and reliability metrics, and the authors evaluate it across three benchmarks.

AI · Bullish · arXiv – CS AI · 6h ago

MetaState: Persistent Working Memory for Discrete Diffusion Language Models

Researchers introduce MetaState, a recurrent augmentation for discrete diffusion language models (dLLMs) that adds persistent working memory to improve text generation quality. It targets the 'Information Island' problem, in which intermediate representations are discarded between denoising steps, and improves accuracy on LLaDA-8B and Dream-7B with minimal parameter overhead.

AI · Neutral · arXiv – CS AI · 6h ago

Texterial: A Text-as-Material Interaction Paradigm for LLM-Mediated Writing

Researchers introduce Texterial, a new interaction paradigm that reimagines text as a malleable material that can be sculpted like clay or cultivated like plants in AI-assisted writing tools. The study presents two technical probes demonstrating gestural text refinement and serendipitous idea growth, expanding the design space for LLM-mediated writing interfaces.