AIBullisharXiv โ CS AI ยท 15h ago6/10
๐ง
StreamWise: Serving Multi-Modal Generation in Real-Time at Scale
Researchers introduce StreamWise, a system for real-time multi-modal content generation that can produce 10-minute podcast videos with sub-second startup delays. The system dynamically manages quality and resources across LLMs, text-to-speech, and video generation, costing under $25 for basic generation or $45 for high-quality real-time streaming.