y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#adversarial-attacks News & Analysis

57 articles tagged with #adversarial-attacks. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

57 articles
AIBearisharXiv โ€“ CS AI ยท Mar 37/106
๐Ÿง 

Turning Black Box into White Box: Dataset Distillation Leaks

Researchers discovered that dataset distillation, a technique for compressing large datasets into smaller synthetic ones, has serious privacy vulnerabilities. The study introduces an Information Revelation Attack (IRA) that can extract sensitive information from synthetic datasets, including predicting the distillation algorithm, model architecture, and recovering original training samples.

AIBearisharXiv โ€“ CS AI ยท Mar 36/107
๐Ÿง 

Hide&Seek: Remove Image Watermarks with Negligible Cost via Pixel-wise Reconstruction

Researchers have developed HIDE&SEEK (HS), a new attack method that can effectively remove watermarks from machine-generated images while maintaining visual quality. This research exposes vulnerabilities in current state-of-the-art proactive image watermarking defenses, highlighting the ongoing arms race between watermarking protection and removal techniques.

AIBearisharXiv โ€“ CS AI ยท Mar 36/103
๐Ÿง 

JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models

Researchers introduced JALMBench, a comprehensive benchmark to evaluate jailbreak vulnerabilities in Large Audio Language Models (LALMs), comprising over 245,000 audio samples and 11,000 text samples. The study reveals that LALMs face significant safety risks from jailbreak attacks, with text-based safety measures only partially transferring to audio inputs, highlighting the need for specialized defense mechanisms.

AINeutralOpenAI News ยท Aug 226/106
๐Ÿง 

Testing robustness against unforeseen adversaries

Researchers have developed a new method to evaluate neural network classifiers' ability to defend against previously unseen adversarial attacks. The approach introduces the UAR (Unforeseen Attack Robustness) metric to assess model performance against unanticipated threats and emphasizes testing across diverse attack scenarios.

AINeutralarXiv โ€“ CS AI ยท Mar 125/10
๐Ÿง 

Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks

Researchers developed a multi-layer ensemble defense system to protect AI-powered Network Intrusion Detection Systems (NIDS) from adversarial attacks. The solution combines stacking classifiers with autoencoder validation and adversarial training, demonstrating improved resilience against GAN and FGSM-generated attacks on security datasets.

AINeutralarXiv โ€“ CS AI ยท Mar 24/106
๐Ÿง 

Concept-based Adversarial Attack: a Probabilistic Perspective

Researchers propose a new concept-based adversarial attack framework that targets entire concept distributions rather than single images, generating diverse adversarial examples while preserving the original concept identity. The method creates adversarial images with variations in pose, viewpoint, or background that can still mislead classifiers while remaining recognizable as instances of the original category.

AINeutralOpenAI News ยท Feb 81/106
๐Ÿง 

Adversarial attacks on neural network policies

The article appears to have no content provided, with only a title about adversarial attacks on neural network policies. Without the actual article body, no meaningful analysis of the research or its implications can be performed.

โ† PrevPage 3 of 3