#classification News & Analysis

38 articles tagged with #classification. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

38 articles

AIBullisharXiv – CS AI · Mar 127/10

🧠

Taking Shortcuts for Categorical VQA Using Super Neurons

Researchers introduce Super Neurons (SNs), a new method that probes raw activations in Vision Language Models to improve classification performance while achieving up to 5.10x speedup. Unlike Sparse Attention Vectors, SNs can identify discriminative neurons in shallow layers, enabling extreme early exiting from the first layer at the first generated token.

AINeutralarXiv – CS AI · Mar 57/10

🧠

Difficult Examples Hurt Unsupervised Contrastive Learning: A Theoretical Perspective

New research reveals that difficult training examples, which are crucial for supervised learning, actually hurt performance in unsupervised contrastive learning. The study provides theoretical framework and empirical evidence showing that removing these difficult examples can improve downstream classification tasks.

AIBullisharXiv – CS AI · Mar 46/103

🧠

LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model

Researchers developed LLM-MLFFN, a new framework combining large language models with multi-level feature fusion to classify autonomous vehicle driving behaviors. The system achieves over 94% accuracy on the Waymo dataset by integrating numerical driving data with semantic features extracted through LLMs.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Ratio Utility and Cost Analysis for Privacy Preserving Subspace Projection

Researchers present RUCA, a privacy-preserving data projection method that addresses the utility-privacy trade-off in machine learning by using compressive techniques to simultaneously maximize classification performance while minimizing private information inference. The approach demonstrates superior performance over existing methods on Census and Human Activity Recognition datasets, offering flexible control over privacy requirements.

AINeutralarXiv – CS AI · Jun 235/10

🧠

Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study

Researchers developed and compared machine learning models to automatically classify cryopathy syndromes from laboratory data, addressing clinical challenges caused by overlapping diagnostic patterns and rare diagnoses. A soft-voting ensemble combining Random Forest and Gradient Boosted Trees achieved the best performance, with tree-based methods substantially outperforming neural networks for this medical classification task.

AINeutralarXiv – CS AI · Jun 236/10

🧠

DSSCNet: A Transfer Learning Framework for Cross-Corpus Dysarthric Speech Severity Classification

Researchers introduce DSSCNet, a deep learning framework using transfer learning to improve dysarthric speech severity classification across different datasets. The model achieves 75.80% accuracy on TORGO and 68.25% on UA-Speech corpora, demonstrating significant improvements in speaker-independent assessment and cross-corpus generalization for assistive speech technologies.

AINeutralarXiv – CS AI · Jun 235/10

🧠

From numerical proportions to analogical proportions between probabilities

This academic paper extends analogical proportion theory from numerical and vector-based representations to probabilistic settings, investigating whether probability distributions associated with analogically proportional profiles maintain proportional relationships. The research bridges formal logic with statistical inference, potentially enabling more sophisticated classification methods that operate on probabilistic data.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Dealing with Annotator Disagreement in Hate Speech Classification

Researchers address the overlooked problem of annotator disagreement in hate speech classification, demonstrating that traditional approaches discarding non-consensus samples produce inflated performance metrics. The study establishes new state-of-the-art results for Turkish tweet classification by properly modeling disagreement as a valuable signal rather than noise, using aggregation methods and perceived hate speech strength scores to build more robust detection systems.

AINeutralarXiv – CS AI · Jun 95/10

🧠

A Hierarchical Feature Engineering Framework for Automated Classification of Phonotraumatic and Non-Phonotraumatic Vocal Hyperfunction

Researchers developed a hierarchical feature engineering framework to classify vocal hyperfunction subtypes using non-invasive neck-surface acceleration monitoring. The machine learning approach achieved 89.1% AUC for phonotraumatic cases and 72.8% for non-phonotraumatic cases, with coupling features proving crucial for distinguishing both conditions from healthy controls.

AINeutralarXiv – CS AI · Jun 95/10

🧠

Quantum-Enhanced Similarity Measures for Polarimetric Materials Classification

Researchers present a quantum-classical hybrid system for material classification using polarimetric data, employing quantum SWAP-test circuits to measure similarity between high-dimensional embeddings. The approach achieves competitive accuracy on 23 materials while demonstrating potential for open-set discrimination, positioning it as a practical near-term quantum computing application.

AINeutralarXiv – CS AI · Jun 46/10

🧠

Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching

Researchers present a hybrid content moderation system for livestreams that combines supervised classification with multimodal similarity matching, achieving 67-76% recall at 80% precision. The production-deployed framework reduces user views of unwanted content by 6-8%, demonstrating scalable AI-driven moderation for user-generated video platforms.

AIBearisharXiv – CS AI · Jun 36/10

🧠

Effect of Demographic Bias on Skin Lesion Classification

Researchers evaluated demographic bias in skin lesion classification models, finding that sex biases stem primarily from data imbalances while age biases consistently favor younger populations regardless of training distribution. Multi-task and adversarial learning strategies showed limited effectiveness in male-majority datasets, highlighting the need for targeted bias mitigation approaches in medical AI systems.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Unsupervised Cognition

Researchers propose a novel unsupervised learning approach inspired by cognition models that uses primitive-based, hierarchical representations instead of traditional clustering methods. The method demonstrates superior performance on classification tasks, including cancer type classification and small/incomplete datasets, while exhibiting cognition-like properties that outperform existing supervised and unsupervised algorithms.

AINeutralarXiv – CS AI · May 285/10

🧠

Improving Requirements Classification with SMOTE-Tomek Preprocessing

Researchers applied SMOTE-Tomek preprocessing to address class imbalance in requirements engineering classification, achieving 76.16% accuracy with logistic regression compared to a 58.31% baseline. The technique combines synthetic minority oversampling with Tomek link removal and stratified K-fold validation on the PROMISE dataset of 969 categorized requirements.

AIBullisharXiv – CS AI · May 276/10

🧠

ECSEL: Explainable Classification via Signomial Equation Learning

Researchers introduced ECSEL, an explainable classification method that learns symbolic equations to create interpretable machine learning models. The approach outperforms competing symbolic regression methods on benchmarks while maintaining computational efficiency and classification accuracy comparable to traditional ML models.

AINeutralarXiv – CS AI · May 126/10

🧠

Relational Retrieval: Leveraging Known-Novel Interactions for Generalized Category Discovery

Researchers propose Relational Pattern Consistency (RPC), a machine learning framework for Generalized Category Discovery that bridges labeled and unlabeled data through bidirectional knowledge transfer. The method uses One-vs-All classifiers and relational pattern matching to simultaneously preserve known categories and discover novel ones, achieving state-of-the-art results on multiple benchmarks.

AIBullisharXiv – CS AI · May 126/10

🧠

The Silent Vote: Improving Zero-Shot LLM Reliability by Aggregating Semantic Neighborhoods

Researchers propose Semantic Softmax, a novel inference-time method that improves zero-shot LLM classification by recovering probability mass lost during constrained decoding. The approach aggregates scores from semantic synonyms, reducing calibration errors and boosting accuracy on emotion and toxicity detection tasks.

AINeutralarXiv – CS AI · May 116/10

🧠

Approximation-Free Differentiable Oblique Decision Trees

Researchers introduce DTSemNet, a novel neural network representation of oblique decision trees that enables approximation-free gradient-based training for both classification and regression tasks. The approach eliminates reliance on softening or quantized gradients, achieving superior performance on benchmark datasets and expanding decision tree applicability to reinforcement learning environments.

AINeutralarXiv – CS AI · May 115/10

🧠

N\"urnberg NLP at PsyDefDetect: Multi-Axis Voter Ensembles for Psychological Defence Mechanism Classification

Nürnberg NLP's ensemble approach for detecting psychological defence mechanisms achieved first place in the PsyDefDetect shared task by leveraging nine independent voters across different model architectures and training methods. The strategy prioritizes error independence over single-model strength, addressing the inherent ambiguity in classifying overlapping psychological categories.

AIBullisharXiv – CS AI · May 76/10

🧠

Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment

Researchers introduce DistPFN, a test-time adjustment method that improves TabPFN's vulnerability to label shift—a common problem where machine learning models overfit to majority classes. The solution rescales predicted probabilities without requiring architectural changes or retraining, demonstrating significant improvements across 250+ datasets while maintaining performance in standard settings.

AINeutralarXiv – CS AI · Apr 136/10

🧠

Structured Exploration and Exploitation of Label Functions for Automated Data Annotation

Researchers introduce EXPONA, an automated framework for generating label functions that improve weak label quality in machine learning datasets. The system balances exploration across surface, structural, and semantic levels with reliability filtering, achieving up to 98.9% label coverage and 46% downstream performance improvements across diverse classification tasks.

AINeutralarXiv – CS AI · Apr 136/10

🧠

Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

Researchers have developed RandSymKL, a debiasing technique for Bangla language models that mitigates gender bias in classification tasks like sentiment analysis and hate speech detection. The study introduces four manually annotated benchmark datasets with gender-perturbation testing and demonstrates that the approach effectively reduces bias while maintaining competitive accuracy compared to existing methods.

AIBullisharXiv – CS AI · Mar 276/10

🧠

Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification

Researchers developed lightweight generative AI models for creating synthetic network traffic data to address privacy concerns and data scarcity in network traffic classification. The models achieved up to 87% F1-score when classifiers were trained solely on synthetic data, with transformer-based approaches providing the best balance of accuracy and computational efficiency.

AINeutralarXiv – CS AI · Mar 55/10

🧠

Zono-Conformal Prediction: Zonotope-Based Uncertainty Quantification for Regression and Classification Tasks

Researchers introduce zono-conformal prediction, a new uncertainty quantification method for machine learning that uses zonotope-based prediction sets instead of traditional intervals. The approach is more computationally efficient and less conservative than existing conformal prediction methods while maintaining statistical coverage guarantees for both regression and classification tasks.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning

Researchers found that fine-tuning large language models with explanations attached to labels significantly improves classification accuracy compared to label-only training. Surprisingly, even random token sequences that mimic explanation structure provide similar benefits, suggesting the improvement comes from increased token budget and regularization rather than semantic meaning.

Page 1 of 2Next →