907 articles tagged with #research. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers evaluated five Multimodal Large Language Models (MLLMs) on their ability to reason about social norms in both text and image scenarios. GPT-4o performed best overall, while all models showed superior performance with text-based norm reasoning compared to image-based scenarios.
๐ง GPT-4
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Research study examines how parents want to moderate their children's interactions with GenAI chatbots, revealing gaps in current parental control tools. The study used LLM-generated scenarios to identify that parents need more granular, personalized controls at the conversation level rather than broad content filtering.
AINeutralarXiv โ CS AI ยท Mar 53/10
๐ง Researchers developed a novel neural network architecture for classifying cuneiform tablet metadata using point-cloud representations. The convolution-inspired approach outperformed existing transformer-based methods like Point-BERT by gradually down-scaling point clouds while integrating local and global information.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose TFWaveFormer, a novel Transformer architecture that combines temporal-frequency analysis with multi-resolution wavelet decomposition for dynamic link prediction. The framework achieves state-of-the-art performance on benchmark datasets by better capturing complex multi-scale temporal dynamics in applications like social networks and financial modeling.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose DQE-CIR, a new method for composed image retrieval that improves AI's ability to find images based on reference images and text modifications. The approach addresses limitations in current contrastive learning frameworks by using learnable attribute weights and target relative negative sampling to create more distinctive query embeddings.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers introduced MOO, a large-scale synthetic dataset of 1,000 cattle individuals captured from 128 viewpoints to improve animal re-identification across different viewing angles. The dataset addresses critical challenges in aerial-ground re-identification by providing precise angular annotations and demonstrates effective transfer to real-world applications.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers have released MuRAL, a new dataset containing over 21 hours of multi-resident smart home sensor data with natural language annotations for training AI models. The dataset aims to improve Large Language Models' ability to understand human activities in complex smart home environments, though current LLMs still struggle with key tasks like resident identification and activity prediction.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose RLJP, a new framework for Legal Judgment Prediction that combines first-order logic rules with large language models to improve AI-based legal decision making. The system uses a three-stage approach including Confusion-aware Contrastive Learning to dynamically optimize judgment rules and showed superior performance on public datasets.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers have developed HAMLET, a hierarchical multi-agent AI framework that creates immersive, interactive theatrical experiences using large language models. The system generates narrative blueprints from simple topics and enables AI actors to perform with adaptive reasoning, emotional states, and physical interactions with scene props.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers developed a new self-supervised Inductive Logic Programming approach called Poker that can learn recursive logic programs without requiring expert-crafted negative examples or problem-specific background theories. The system automatically generates and labels new training examples during learning, showing improved performance over existing methods when negative examples are unavailable.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers present AutoQD, a new AI method that automatically discovers diverse behavioral policies without requiring hand-crafted descriptors. The approach uses mathematical embeddings of policy occupancy measures to enable Quality-Diversity optimization algorithms to find varied high-performing solutions in reinforcement learning tasks.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers have developed Q-SVMPC, a new Model Predictive Control method that combines reinforcement learning with Stein variational inference to improve trajectory optimization. The approach addresses limitations in existing MPC methods that often converge to single solutions, instead maintaining diverse solution paths for better performance in robotics applications.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose Co-Evolutionary Alignment (CoEA), a new recommendation system method that uses dual large language models to balance relevant and novel content suggestions. The system addresses traditional recommendation bias through dynamic optimization that considers both long-term group identity and short-term individual preferences.
AIBullisharXiv โ CS AI ยท Mar 54/10
๐ง Researchers introduced LadderSym, a new Transformer-based AI method for detecting music practice errors that significantly outperforms existing approaches. The system uses multimodal processing of audio and symbolic music scores, more than doubling accuracy for detecting missed notes and improving extra note detection by 14.4 points.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers have released MuSaG, the first German multimodal sarcasm detection dataset featuring 33 minutes of annotated television content with text, audio, and video data. The study reveals a significant gap between human sarcasm detection (which relies heavily on audio cues) and current AI models (which perform best with text).
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers introduce CareMedEval, a new dataset with 534 questions based on 37 scientific articles to evaluate large language models' ability to perform critical appraisal in biomedical contexts. Testing reveals current AI models struggle with this specialized reasoning task, achieving only 0.5 exact match rates even with advanced prompting techniques.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose a novel framework for 3D object reconstruction from multi-view images that simultaneously optimizes mesh geometry and appearance through Gaussian-guided rendering. The unified approach addresses limitations of existing methods that separate geometry and appearance optimization, enabling better downstream editing tasks like relighting and shape deformation.
AIBullisharXiv โ CS AI ยท Mar 44/103
๐ง Researchers have developed a new framework that combines Large Language Models with structured reasoning to analyze debates more transparently. The system extracts arguments from text, maps their relationships, and uses quantitative methods to determine argument strengths, addressing LLMs' limitations in explicit reasoning.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Researchers present a framework for social planners to strategically reveal positive and negative role models to influence agent behavior in social networks. The study addresses optimization challenges when disclosure budgets are limited and proposes algorithms to maximize social welfare while maintaining fairness across different groups.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Researchers have developed AnchorDrive, a two-stage AI framework that combines large language models with diffusion models to generate realistic safety-critical scenarios for autonomous driving systems. The system uses LLMs for controllable scenario generation based on natural language instructions, then employs diffusion models to create realistic driving trajectories.
AIBullisharXiv โ CS AI ยท Mar 44/102
๐ง Researchers developed FEAST, a new AI framework that improves food classification accuracy for Europe's FoodEx2 system by 12-38% on rare food categories. The system uses retrieval-augmented learning to better classify complex food descriptions into standardized codes used for food safety monitoring across Europe.
AINeutralarXiv โ CS AI ยท Mar 44/102
๐ง Researchers developed CDD (Contamination Detection via output Distribution) to identify data contamination in small language models by measuring output peakedness. The study found that CDD only works when fine-tuning produces verbatim memorization, failing at chance level with parameter-efficient methods like low-rank adaptation that avoid memorization.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Research paper compares three sinusoidal models for speech and audio signal processing: standard Sinusoidal Model (SM), Exponentially Damped Sinusoidal Model (EDSM), and extended adaptive Quasi-Harmonic Model (eaQHM). The study finds eaQHM performs better for medium-to-large window analysis while EDSM excels with smaller analysis windows, suggesting future research should combine both approaches.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Researchers propose GLEAN, a new evaluation protocol for testing small AI models on tabular reasoning tasks while addressing contamination and hardware constraints. The framework reveals distinct error patterns between different models and provides diagnostic tools for more reliable evaluation under limited computational resources.
AINeutralarXiv โ CS AI ยท Mar 44/102
๐ง Researchers conducted a benchmark study comparing graph neural networks (GNNs) against traditional methods for classifying neurons in C. elegans worms. The study found that attention-based GNNs significantly outperformed baseline methods when using spatial and connection features, validating the effectiveness of graph-based approaches for biological neural network analysis.