AI · Bearish · arXiv CS AI · 7h ago · 7/10
When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection
Researchers introduce the first benchmark for detecting machine-generated text that imitates personal writing styles, revealing that state-of-the-art detectors fail significantly when LLMs personalize their output. The study identifies a 'feature-inversion trap' where detection features become unreliable in personalized contexts and proposes a method to predict detector performance degradation with 85% accuracy.