y0news
#distributed-storage
2 articles
AI · Bullish · arXiv – CS AI · Feb 27 · 6/10 · 7

GetBatch: Distributed Multi-Object Retrieval for ML Data Loading

Researchers introduce GetBatch, an object store API that speeds up machine learning data loading by replacing thousands of individual GET requests with a single batch operation. The system achieves up to 15x higher throughput for small objects and a 2x reduction in batch retrieval latency in production ML training workloads.
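
To illustrate the idea of collapsing many GET requests into one batch call, here is a minimal sketch using a toy in-memory store. The class `InMemoryObjectStore`, its `get_batch` method, and the `round_trips` counter are hypothetical names chosen for this illustration; they are not the GetBatch API described in the paper, whose actual interface is not given in this summary.

```python
from typing import Dict, List


class InMemoryObjectStore:
    """Toy stand-in for an object store (hypothetical, for illustration only).

    It counts simulated request round trips so the two access patterns
    can be compared.
    """

    def __init__(self, objects: Dict[str, bytes]) -> None:
        self._objects = dict(objects)
        self.round_trips = 0

    def get(self, key: str) -> bytes:
        # Classic pattern: one request (one round trip) per object.
        self.round_trips += 1
        return self._objects[key]

    def get_batch(self, keys: List[str]) -> List[bytes]:
        # Assumed batch pattern: one request returns every object,
        # in the same order as the requested keys.
        self.round_trips += 1
        return [self._objects[key] for key in keys]


def load_individually(store: InMemoryObjectStore, keys: List[str]) -> List[bytes]:
    """Fetch a training batch with one GET per sample."""
    return [store.get(key) for key in keys]


def load_batched(store: InMemoryObjectStore, keys: List[str]) -> List[bytes]:
    """Fetch the same training batch with a single batch request."""
    return store.get_batch(keys)
```

In this toy model, loading N samples costs N round trips with `load_individually` and one with `load_batched`. Any real speedup depends on object size, network latency, and server-side behavior, none of which are modeled here.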

General · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 6

Optimized Disaster Recovery for Distributed Storage Systems: Lightweight Metadata Architectures to Overcome Cryptographic Hashing Bottleneck

Researchers propose a metadata-driven architecture for distributed storage systems that avoids cryptographic hashing bottlenecks during disaster recovery. The framework assigns a unique identifier to each data block at ingestion time, enabling faster recovery without the cryptographic overhead that currently slows failover.
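
The general idea of identifying blocks by an ingestion-time identifier rather than by a content hash can be sketched as follows. This is an assumption-laden illustration, not the paper's design: `BlockCatalog`, `ingest`, and `blocks_to_recover` are hypothetical names, and a simple monotonically increasing counter stands in for whatever identifier scheme the authors actually use.

```python
import itertools
from typing import Dict, Set


class BlockCatalog:
    """Toy metadata catalog (hypothetical, for illustration only).

    Each block receives a unique identifier when it is ingested, so later
    comparisons can use the identifiers instead of rehashing block contents.
    """

    def __init__(self) -> None:
        # Assumed ID scheme: a monotonically increasing counter starting at 1.
        self._next_id = itertools.count(1)
        self.blocks: Dict[int, bytes] = {}

    def ingest(self, data: bytes) -> int:
        """Store a block and return the identifier assigned to it."""
        block_id = next(self._next_id)
        self.blocks[block_id] = data
        return block_id


def blocks_to_recover(primary_ids: Set[int], replica_ids: Set[int]) -> Set[int]:
    """Return the IDs present on the primary but missing from the replica.

    This is a set difference over metadata only; no block content is read
    or hashed during the comparison.
    """
    return primary_ids - replica_ids
```

For example, if a primary holds blocks 1, 2, and 3 and a replica holds only 1 and 3, `blocks_to_recover` returns `{2}` without touching block data. Note that identifiers assigned this way say nothing about content integrity, which is a trade-off a real system would need to address separately.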