y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#multi-tenant-systems News & Analysis

2 articles tagged with #multi-tenant-systems. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AIBullisharXiv – CS AI · 10h ago7/10
🧠

SPECTRE: Hybrid Ordinary-Parallel Speculative Serving for Resource-Efficient LLM Inference

SPECTRE is a new LLM serving framework that improves inference efficiency by repurposing underutilized smaller models as remote drafters for heavily-loaded large models through parallel speculative decoding. The system achieves up to 2.28× speedup on large models like Qwen3-235B while maintaining minimal interference to smaller models' native workloads.

AIBullisharXiv – CS AI · 10h ago7/10
🧠

MARLaaS: Multi-Tenant Asynchronous Reinforcement Learning as a Service

Researchers introduce MARLaaS, a system enabling cost-effective concurrent reinforcement learning fine-tuning for large language models across multiple users through shared base models and asynchronous architecture. The approach achieves 4.3x better accelerator utilization and 85% reduction in training time while maintaining single-task performance quality.