AIBearisharXiv – CS AI · 4d ago7/10
🧠Researchers demonstrate a practical attack called Bias-Inversion Rewriting Attack (BIRA) that defeats LLM watermarking schemes with over 99% success rate while maintaining semantic quality. The findings expose fundamental vulnerabilities in current watermarking detection methods, which are widely considered essential for identifying AI-generated content.
AIBearisharXiv – CS AI · 5d ago7/10
🧠A comprehensive listening study of 1,768 participants reveals that while humans remain similarly accurate at detecting fake audio (71.2%), they have significantly eroded trust in authentic speech, with real sample detection dropping from 72.7% to 64.1% compared to 2021 baselines. Modern commercial and language model-generated deepfakes pose the greatest challenge to human perception, though ML detectors maintain >94.5% accuracy across all conditions.
AIBearisharXiv – CS AI · May 97/10
🧠Researchers introduce RobustSora, a benchmark dataset of 6,500 videos designed to isolate how AI-generated video detectors rely on watermarks versus actual generation artifacts. Testing across ten detection models reveals that watermark manipulation causes accuracy drops of up to 14 percentage points, demonstrating that current detectors are vulnerable to watermark-removal attacks and may not detect authentic AI-generated content when watermarks are absent.
🧠 Sora
AIBullishTechCrunch – AI · Apr 217/10
🧠YouTube is expanding its AI-powered likeness detection tool to help celebrities and their representatives identify and remove deepfake content featuring their likenesses. This extension of the platform's existing detection technology represents a significant step in addressing the growing problem of non-consensual synthetic media.
AIBearisharXiv – CS AI · Apr 207/10
🧠Researchers introduced CONVEX, a dataset of 150K+ multimodal misinformation posts, revealing that AI-generated content spreads faster than authentic media but relies on passive engagement rather than active discussion. Detection systems show declining performance against evolving generative models, signaling a critical gap in identifying synthetic media at scale.
AIBearisharXiv – CS AI · Apr 147/10
🧠Researchers reveal a significant gap between laboratory performance and real-world reliability in AI-generated media detectors, demonstrating that models achieving 99% accuracy in controlled settings experience substantial degradation when subjected to platform-specific transformations like compression and resizing. The study introduces a platform-aware adversarial evaluation framework showing detectors become vulnerable to realistic attack scenarios, highlighting critical security risks in current AI detection benchmarks.
AIBearisharXiv – CS AI · Apr 147/10
🧠A comprehensive study analyzing over 40,000 news articles finds substantial increases in LLM-generated content across major, local, and college news outlets, with advanced AI detectors identifying widespread adoption especially in local and college media. The research reveals LLMs are primarily used for article introductions while conclusions remain manually written, producing more uniform writing styles with higher readability but lower formality that raises concerns about journalistic integrity.
AINeutralarXiv – CS AI · Apr 77/10
🧠Researchers developed a new AI-generated video detection framework using a large-scale dataset of 140K videos from 15 generators and the Qwen2.5-VL Vision Transformer. The method operates at native resolution to preserve high-frequency forgery artifacts typically lost in preprocessing, achieving superior performance in detecting synthetic media.
AINeutralarXiv – CS AI · Apr 67/10
🧠Researchers introduce SAGA, a comprehensive framework for identifying the specific AI models used to generate synthetic videos, moving beyond simple real/fake detection. The system provides multi-level attribution across authenticity, generation method, model version, and development team using only 0.5% of labeled training data.
AINeutralArs Technica – AI · Mar 267/10
🧠Google is launching Gemini 3.1 Flash Live, a new conversational audio AI system being integrated into search, Gemini platform, and developer tools. The advancement in AI conversational capabilities could make it increasingly difficult for users to distinguish between human and AI interactions.
🧠 Gemini
AI × CryptoBullisharXiv – CS AI · Mar 177/10
🤖Researchers developed an AI framework to detect rug pull scams in BSC meme tokens by analyzing wash-trading patterns. The system achieved 90.98% AUC accuracy and can provide early warnings with an average lead time of 3.8 hours, though it currently functions better as a high-precision screener than an automatic alarm system.
AINeutralarXiv – CS AI · Mar 57/10
🧠Researchers have conducted the first theoretical analysis of Google's SynthID-Text watermarking system, revealing vulnerabilities in its detection methods and proposing attacks that can break the system. The study identifies weaknesses in the mean score detection approach and demonstrates that the Bayesian score offers better robustness, while establishing optimal parameters for watermark detection.
AIBullisharXiv – CS AI · Mar 46/102
🧠Researchers developed a multimodal multi-agent ransomware analysis framework using AutoGen that combines static, dynamic, and network data sources for improved ransomware detection. The system achieved 0.936 Macro-F1 score for family classification and demonstrated stable convergence over 100 epochs with a final composite score of 0.88.
AIBearisharXiv – CS AI · Mar 46/102
🧠Researchers developed a method to detect AI-generated content at scale and found that 6.5-16.9% of peer reviews at major AI conferences after ChatGPT's release were substantially modified by LLMs. The study reveals concerning patterns where AI-generated reviews correlate with lower reviewer confidence, last-minute submissions, and reduced engagement in rebuttals.
AINeutralarXiv – CS AI · Mar 37/102
🧠Researchers developed a new algorithm called Learn-to-Distance (L2D) that can detect AI-generated text from models like GPT, Claude, and Gemini with significantly improved accuracy. The method uses adaptive distance learning between original and rewritten text, achieving 54.3% to 75.4% relative improvements over existing detection methods across extensive testing.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers have developed TELL, an AI-generated text detector that prioritizes explainability by showing users the specific linguistic markers indicating AI or human authorship rather than just providing an opaque numerical score. The system achieves competitive detection performance (AUROC 0.927) while generating human-evaluated explanations with a 72.3% mean win-rate across quality metrics, fundamentally reframing detection as a human-centric interpretability problem.
AINeutralDecrypt – AI · 5d ago6/10
🧠YouTube is implementing more prominent AI content labels and automatic detection systems to help viewers identify AI-generated videos, while Google simultaneously pushes its video remix tools. This move reflects growing pressure on platforms to address transparency concerns around synthetic media as AI generation tools become more accessible.
AINeutralTechCrunch – AI · 5d ago6/10
🧠YouTube is implementing automatic detection and labeling of videos containing significant photorealistic AI-generated content, shifting from a creator-disclosure model to platform-enforced transparency. The company is also making AI content labels more visually prominent to help users identify manipulated media.
AINeutralThe Verge – AI · 5d ago6/10
🧠YouTube is making AI-generated content labels more prominent and visible to users by relocating them directly below video players instead of hiding them in expanded descriptions. The platform is also implementing automatic detection and labeling of AI-generated content across Shorts and long-form videos, marking a significant shift in content transparency following Google's broader AI verification initiatives announced at I/O.
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers introduce Social Gaze Consistency as a novel method to detect AI-generated images by analyzing the coherence of eye direction and head-eye alignment between people. The technique achieves meaningful improvements in detection accuracy across multiple vision models, suggesting that high-level semantic features offer advantages over traditional low-level artifact detection as generative models become more sophisticated.
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers have developed READER, a compact AI text detector with only 1.5B parameters that outperforms much larger language models and existing detection systems. READER combines classification with explainable reasoning, providing both AI/human verdicts and structured rationales for its decisions, addressing critical limitations in current detection methods that fail under distribution shifts.
🧠 GPT-5🧠 Gemini
AINeutralarXiv – CS AI · 5d ago6/10
🧠A new study demonstrates that pooled benchmarks for detecting AI-generated academic text systematically misrepresent AI adoption across countries and research fields by ignoring contextual stylistic variations. Using country-field-specific benchmarks instead provides more accurate measurements and reveals that previous estimates substantially over- or underestimated AI use depending on geographic and disciplinary context.
AINeutralThe Verge – AI · 5d ago6/10
🧠Analysis suggests Pope Leo XIV may have used AI to write portions of his encyclical on AI's dangers, with detection tools indicating 40-100% of certain paragraphs were AI-generated. The finding raises questions about authenticity and irony, as the document warns against AI's impact while potentially being partially authored by AI systems.
🏢 Anthropic🧠 Claude
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce MELD, an advanced AI-generated text detector that uses multi-task learning to improve robustness against adversarial attacks, transfer across unseen models and domains, and maintain low false-positive rates. The detector outperforms most open-source competitors and matches leading commercial systems on public benchmarks.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers propose IntraGuard, a defense framework that embeds hidden safeguards into PDF manuscripts to detect when AI chatbots are used to generate peer reviews instead of human experts. The system achieves 84% success rate in disrupting AI-generated reviews while maintaining transparency for legitimate human reviewers, addressing growing concerns about academic integrity as LLMs proliferate.