#korean-nlp News & Analysis

2 articles tagged with #korean-nlp. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AINeutralarXiv – CS AI · May 287/10

🧠

KVoiceBench, KOpenAudioBench, and KMMAU: Agent-Driven Korean Speech Benchmarks for Evaluating SpeechLMs

Researchers introduce three new Korean speech benchmarks (KVoiceBench, KOpenAudioBench, and KMMAU) totaling 12,345 samples to evaluate multilingual speech language models, addressing the gap in non-English evaluation. The study reveals significant performance disparities between English and Korean across eight SpeechLMs, exposing weaknesses invisible to English-only testing.

AINeutralarXiv – CS AI · May 296/10

🧠

Obfuscation Rules for Detecting and Detoxifying Korean Toxicity

Researchers introduce KOTOX, the first Korean-language dataset for detecting and neutralizing obfuscated toxic content in language models. The dataset addresses a critical gap by providing paired examples of normal, toxic, and obfuscated text, leveraging Korean's unique linguistic properties like agglutination and orthographic variation that enable easy toxicity disguise.