AINeutralarXiv – CS AI · 3d ago6/10
🧠A research study examines how humans decide to trust and rely on AI systems in collaborative question-answering tasks, identifying two distinct reliance patterns: delegation (autonomous AI action) and adoption (evaluating AI suggestions). The findings reveal humans make suboptimal trust decisions, both under-utilizing correct AI suggestions and over-relying on misleading AI outputs, with confirmation bias playing a significant role in trust calibration failures.
AINeutralarXiv – CS AI · 3d ago6/10
🧠A qualitative study of 24 employees across IT, healthcare, and service sectors reveals that AI adoption in workplaces produces divergent impacts on job satisfaction depending on occupational domain. While IT and healthcare workers expect improved working conditions but diminished sense of purpose due to AI automating their core tasks, service workers anticipate enhanced social status from AI integration despite no improvement in hours worked.
AINeutralarXiv – CS AI · 3d ago5/10
🧠Researchers propose a machine learning framework for optimally assigning prediction tasks to heterogeneous agents (humans or AI systems) subject to capacity constraints. The work develops explore-exploit algorithms that learn agent expertise and adapt assignments dynamically, demonstrating improvements over baseline approaches across tabular, image, and text tasks.
AINeutralarXiv – CS AI · 3d ago5/10
🧠A PhD study of 90 participants compared human-like spoken embodied conversational agents versus text-based agents in a mobile educational game about UK currency. Results showed statistically significant user preference for highly human-like agents, with implications for designing collaborative human-agent systems in educational contexts.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce a hybrid prediction market combining algorithmic agents and human experts to forecast scientific replicability, demonstrating that collaborative approaches outperform either humans or AI alone. The system trains AI on historical replication data while humans contribute domain expertise through real-time trading, producing more accurate replication forecasts than single-modality baselines.
AINeutralarXiv – CS AI · 4d ago6/10
🧠A mixed-methods study of 76 sound designers and 20 industry professionals reveals a significant gap between AI tools currently available and what creative audio practitioners actually need. Current AI excels in fast-consumption media but lacks the narrative sophistication for high-end film and immersive audio work, with professionals favoring task-specific assistive tools over generative systems.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers present a novel method for controlling music generation in the MusicGen transformer by using activation steering techniques applied at inference time. The approach enables precise genre control through linear probes that manipulate the model's residual stream, demonstrating how interpretable AI behaviors can enhance collaborative music creation.
AINeutralarXiv – CS AI · May 126/10
🧠PLACO presents a multi-stage framework for optimizing human-AI team performance in classification tasks by combining human and model outputs through Bayesian probability methods. The research addresses how to effectively leverage both human judgment and AI predictions when neither alone achieves desired performance levels.
AIBullisharXiv – CS AI · May 126/10
🧠A study demonstrates that interactive dialogue between physicians and large language models significantly improves diagnostic accuracy in emergency medicine, with residents showing a 12.5% improvement on hard cases and standardized metrics confirming medium effect sizes across 52 clinical scenarios.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers establish formal mathematical bounds for when human-AI teams outperform individuals, proving complementarity occurs only when error correlation between humans and AI falls below a critical threshold. The framework explains why 70% of real-world human-AI collaborations fail to achieve synergy and provides predictive formulas validated against human datasets.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers present a modular, provenance-aware pipeline that converts handwritten archival tables into Knowledge Graphs while maintaining transparency through intermediate inspection points. The approach combines table structure recognition, handwriting recognition, and semantic interpretation while tracking data lineage to ensure all extracted information remains traceable to its source, addressing the opacity problem in end-to-end AI systems.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce SCALAR, an Actor-Critic-Judge framework that systematically evaluates how AI agents improve through human feedback on theoretical physics problems. The study reveals that multi-turn dialogue consistently outperforms single attempts, but the effectiveness of different feedback strategies depends heavily on the specific pairing of AI models used, with asymmetric model pairs benefiting most from structured critique.
AIBullisharXiv – CS AI · May 116/10
🧠MPD²-Router is a machine learning framework that improves glaucoma screening by intelligently routing difficult cases between AI systems and human experts based on availability, uncertainty, and image quality. The system achieves better clinical outcomes than AI-alone approaches while maintaining balanced expert utilization across multiple international datasets.
AINeutralarXiv – CS AI · May 116/10
🧠VIDEE is a new system that enables entry-level data analysts to perform advanced text analytics using intelligent AI agents without specialized NLP knowledge. The platform combines human-in-the-loop decision-making with LLM-powered execution and evaluation, demonstrated through quantitative experiments and user studies showing effectiveness across experience levels.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers conducted a user study with 11 expert mathematicians using AlphaEvolve, an AI coding agent, to explore how humans effectively collaborate with AI systems for scientific discovery. The study identified a cyclical workflow called 'intentmaking'—where users iteratively define and refine experimental goals through system interaction—paired with traditional sensemaking, suggesting AI tools should function as collaborative instruments rather than black-box assistants.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce InciteResearch, a multi-agent AI framework that helps researchers transform vague, implicit research ideas into structured, actionable questions through Socratic questioning. The framework achieves significant improvements over baselines on TF-Bench, a new benchmark for tacit-to-explicit research assistance, demonstrating AI's potential as a thinking tool rather than just an execution automator.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers demonstrate that AI-generated audio description drafts significantly improve accessibility content creation for blind and low-vision audiences, but only when draft quality exceeds a minimum threshold. High-quality AI drafts cut completion time by over 50% and reduced cognitive load, while low-quality baseline drafts provided minimal benefit, establishing content-dependent quality standards as crucial for effective human-AI collaboration.
AINeutralarXiv – CS AI · May 96/10
🧠PersonaTeaming introduces a persona-driven approach to red-teaming generative AI systems, combining automated adversarial prompt generation with human-in-the-loop collaboration. The method outperforms existing automated approaches while enabling security researchers to leverage diverse perspectives and backgrounds to uncover AI model vulnerabilities more effectively.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce Open-Universe Assistance Games (OU-AGs), a framework enabling LLM-based agents to infer and align with human preferences through open-ended dialogue. The GOOD method extracts evolving goals from natural language interactions using probabilistic inference, demonstrating improved user intent alignment across shopping, robotics, and coding domains without requiring large offline datasets.
AINeutralarXiv – CS AI · May 46/10
🧠A research position paper argues that agentic AI systems should incorporate Bayesian decision theory at their orchestration layer to improve decision-making under uncertainty. Rather than making LLMs themselves Bayesian, the framework proposes applying Bayesian principles to the control systems that coordinate multiple LLMs and tools, enabling better belief maintenance and resource allocation.
AINeutralarXiv – CS AI · May 46/10
🧠A research study examines how generative AI is transforming product development through 'vibe coding'—a workflow where teams express design intent in natural language and AI generates functional prototypes. While the approach accelerates iteration and lowers barriers to participation, researchers found significant challenges including code unreliability, integration issues, and concerns about over-reliance on AI, alongside emerging tensions around team responsibility and ownership.
AIBullishFortune Crypto · May 16/10
🧠American professionals like Natalie Blythe are shifting from AI anxiety to pragmatic adoption, discovering genuine productivity gains rather than existential threats. The article highlights how early skepticism about AI transforms into confidence when users experience concrete efficiency improvements in their workflows.
AINeutralarXiv – CS AI · May 16/10
🧠Researchers present a conceptual framework for understanding human-AI decision-making relationships across five configurations—from pure human leadership to fully automated systems. The framework emphasizes that leaders often misrecognize where actual decision-shaping authority lies, risking ineffective oversight and suboptimal outcomes.
AINeutralarXiv – CS AI · May 16/10
🧠Pragmos is a research prototype that combines Large Language Models with human expertise to create business process models through interactive, iterative workflows. Rather than fully automating process modeling, the system decomposes complex tasks into manageable steps with explicit documentation, complementing LLM reasoning with specialized tools to ensure sound and comprehensible outputs.
AIBullishGoogle DeepMind Blog · Apr 306/10
🧠Researchers are developing AI co-clinician systems designed to augment healthcare delivery by partnering artificial intelligence with medical professionals. This initiative explores how AI can enhance clinical decision-making and patient care workflows through collaborative human-AI models rather than full automation.