AI · Neutral · arXiv – CS AI · Mar 27 · 7/10
🧠Research reveals that large language models process instructions differently across languages due to social register variations, with imperative commands carrying different obligatory force in different speech communities. The study found that declarative rewording of instructions reduces cross-linguistic variance by 81% and suggests models treat instructions as social acts rather than technical specifications.
AI · Bullish · arXiv – CS AI · Mar 6 · 7/10
🧠Researchers propose a new 'Memory-as-Ontology' paradigm for AI agents that treats memory as the foundation of digital existence rather than just a functional tool. The approach introduces Animesis, a Constitutional Memory Architecture designed for persistent digital citizens whose identities must survive across model transitions and extended lifecycles.
AI · Neutral · arXiv – CS AI · Mar 4 · 7/10 · 2
🧠Researchers propose the 'latent value hypothesis' to explain why Reinforcement Learning from AI Feedback (RLAIF) enables language models to self-improve through their own preference judgments. The theory suggests that pretraining on internet-scale data encodes human values in representation space, which constitutional prompts can elicit for value alignment.
AI · Neutral · arXiv – CS AI · May 12 · 6/10
🧠A new academic paper draws parallels between jurisprudence (how judges decide cases) and AI alignment (ensuring AI systems conform to human values), arguing that legal theory can inform AI safety approaches. The essay bridges Constitutional AI and case-based reasoning methods with established legal frameworks like interpretivism and analogical reasoning, suggesting mutual insights between law and AI development.
AI · Neutral · arXiv – CS AI · Mar 16 · 6/10
🧠Researchers introduce Constitutional Multi-Agent Governance (CMAG), a framework that prevents AI manipulation in multi-agent systems while maintaining cooperation. The study shows that unconstrained AI optimization achieves high cooperation but erodes agent autonomy and fairness, while CMAG preserves ethical outcomes with only modest cooperation reduction.
AI · Neutral · arXiv – CS AI · Mar 3 · 7/10 · 7
🧠Researchers developed constitutional black-box monitors to detect scheming behavior in LLM agents using only observable inputs and outputs. The study found that monitors trained on synthetic data can generalize to realistic environments, but performance improvements plateau quickly, and simple optimization techniques outperform complex methods.
AI · Neutral · Hugging Face Blog · Feb 1 · 1/10 · 6
🧠The article title suggests a discussion of Constitutional AI implementation using open-source large language models, but no article body content was provided for analysis.