arXiv – CS AI · 10h ago
Cards Against LLMs: Benchmarking Humor Alignment in Large Language Models
Researchers benchmarked five frontier LLMs against human players in games of Cards Against Humanity, finding that while the models exceed random-baseline performance, their humor preferences align poorly with humans' but strongly with one another's. The findings suggest that LLM humor judgment may reflect systematic biases and structural artifacts rather than genuine understanding of human preferences.