AIBullisharXiv โ CS AI ยท Feb 276/107
๐ง
GetBatch: Distributed Multi-Object Retrieval for ML Data Loading
Researchers introduce GetBatch, a new object store API that optimizes machine learning data loading by replacing thousands of individual GET requests with a single batch operation. The system achieves up to 15x throughput improvement for small objects and reduces batch retrieval latency by 2x in production ML training workloads.