#inference-control News & Analysis

4 articles tagged with #inference-control. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles

AIBullisharXiv – CS AI · Jun 47/10

🧠

Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers

Researchers discovered that language model reasoning behavior is primarily controlled by specific token patterns rather than high-level instructions, leading to the development of Mid-Think, a training-free prompting technique that achieves intermediate-budget reasoning with better accuracy-efficiency tradeoffs and improves RL training performance for models like Qwen3-8B.

AINeutralarXiv – CS AI · Mar 57/10

🧠

Inference-Time Toxicity Mitigation in Protein Language Models

Researchers developed Logit Diff Amplification (LDA) as an inference-time safety mechanism for protein language models to prevent toxic protein generation. The method reduces predicted toxicity rates while maintaining biological plausibility and structural viability, addressing dual-use safety concerns in AI-driven protein design.

AINeutralarXiv – CS AI · May 16/10

🧠

Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations

Researchers propose VEROIC, a framework for optimizing inference costs in black-box LLM services by dynamically deciding when to allocate additional computation. The system uses partially observable reliability signals to balance response quality against computational expenses, achieving better cost-efficiency trade-offs than existing approaches.

AIBullisharXiv – CS AI · Mar 176/10

🧠

From Refusal Tokens to Refusal Control: Discovering and Steering Category-Specific Refusal Directions

Researchers developed a method to control AI safety refusal behavior using categorical refusal tokens in Llama 3 8B, enabling fine-grained control over when models refuse harmful versus benign requests. The technique uses steering vectors that can be applied during inference without additional training, improving both safety and reducing over-refusal of harmless prompts.

🧠 Llama