WritingCoreWeaveCoreWeavepublished May 20, 2026seen 6d

AI Storage Without Limits: Exploring the Latest CoreWeave AI Object Storage Expansions

Open original ↗

Captured source

source ↗

AI Storage Without Limits: Exploring the Latest CoreWeave AI Object Storage Expansions | Blog

Announcement

Announcement

Webinar

Announcement

Podcast

Announcement

GTC 2026

Announcement

CoreWeave brings up the industry’s first NVIDIA Vera Rubin NVL72 deployment.

Read more

Products

Data and storage

Infrastructure control

Runtime acceleration

Model and agent development

Mission control

Solutions

Pricing

Resources

About us

Contact us Login

Contact us Login

Clear

Reframing the AI Storage Challenge Every iteration of AI innovation runs headlong into the same challenge—data. Models get faster, more capable, and more compute-intensive, but the infrastructure supporting them has not kept pace. Across clouds, regions, and tiers, data remains fragmented, slow, and costly to move. AI teams are forced to duplicate datasets, manage unpredictable egress bills, and engineer brittle caching systems, all just to keep their GPUs from idling. The result? Innovation throttled by storage systems that were never designed for the demands of AI. At CoreWeave, we believe data mobility should be as dynamic as the AI workloads it powers. That belief led us to build CoreWeave AI Object Storage ,  our industry-leading, fully managed storage service purpose-built for AI. Powered by our Local Object Transport Accelerator (LOTA) technology, CoreWeave AI Object Storage eliminates the friction of moving data between regions, clouds, and tiers. It combines simplicity, scalability, and transparency, offering throughput of up to 7 GB/s per GPU , zero fees (egress, ingress, or request) , and automated, usage-based billing levels that cut existing customers’ costs by more than 75%. Whether you’re training, fine-tuning, or serving models across global environments,  CoreWeave AI Object Storage keeps your GPUs busy, your data accessible everywhere, and your innovation in motion. ‍ How CoreWeave AI Object Storage Delivers Maximum Performance for AI Workloads CoreWeave AI Object Storage is a fully managed object storage service built specifically to maximize throughput for AI workloads. Unlike general-purpose storage, it was engineered for the unique demands of AI, using a distributed architecture that separates compute from storage while preserving ultra-low-latency data access. Data is distributed across GPU nodes, enabling highly parallelized reads and writes. This design allows it to deliver up to 7 GB/s per GPU , performance unmatched in the industry. With the ability to scale to hundreds of thousands of GPUs, enterprise-grade durability (11 nines), built-in observability with Prometheus and Grafana, and full S3 compatibility, CoreWeave AI Object Storage accelerates large language model training, post-training, and reinforcement learning workflows, reducing both time and costs while enabling faster iteration and innovation.

CoreWeave AI Object Storage’s performance is powered by Local Object Transport Accelerator ( LOTA) , a proxy service that runs on the nodes of GPU clusters, acting as an S3 endpoint and also creating a local cache across the disks on GPU nodes. While traditional storage often requires customers to build separate caching layers that add operational overhead and new points of failure, LOTA is an industry-leading, AI-specific caching technology embedded directly into the storage service. It intelligently stages frequently accessed objects close to the compute, across regions and clouds, ensuring that GPUs are continuously fed with the data they need during both training and fine-tuning. Thanks to LOTA, the unmatched performance benefits from our industry-leading throughput scale linearly even as AI workloads grow . While traditional object storage is limited to the availability zone or regional level,  CoreWeave AI Object Storage sustains peak performance across any number of GPU nodes.

Another key feature of CoreWeave AI Object Storage is its automated, usage-based billing, which automatically assigns data to lower-cost pricing levels based on access frequency: Hot: Accessed in the last 7 days Warm: Accessed in the last 7-30 days Cold: Not accessed in 30+ days

This approach helps customers save time and money while maximizing ROI from purpose-built storage. With our new transparent, usage-based billing levels, CoreWeave AI Object Storage reduces our existing customer storage costs by more than 75% for typical AI workloads . Customers gain the flexibility to align storage costs with workload demands, without compromising the high performance they’ve come to expect from CoreWeave AI Object Storage. The benefits compound: maximum GPU utilization, simplified infrastructure, and predictable economics thanks to transparent pricing without hidden fees. By combining performance innovations like LOTA with unmatched throughput and resilience, CoreWeave AI Object Storage ensures that data never becomes the bottleneck, empowering AI teams to scale with confidence and accelerate every stage of innovation.

Enhancing CoreWeave AI Object Storage  with Cross-Region and Multi-Cloud Flexibility The newest features announced for CoreWeave AI Object Storage enable CoreWeave customers to use data anywhere in our regions, in other clouds, and on premises. Until now, most organizations have been forced to replicate datasets into each region where workloads run. This practice increases costs dramatically while introducing the risk of data divergence. CoreWeave AI Object Storage eliminates that burden. Now, a single dataset can be accessed seamlessly from anywhere in the world with local disk performance wherever AI workloads run. This capability is further enabled by CoreWeave’s multi-cloud-friendly, purpose-built for AI networking backbone, which combines private interconnects, direct cloud peering, and cross-region networking capable of up to 400 gigabits per second. Whether a workload is running in New York or London, developers can count on the same high-throughput access profile without needing to engineer complex replication strategies or manage massive data sprawl. The advantages of this flexibility are significant. Rather than balancing multiple inconsistent copies of data, teams can work from a single source of truth, ensuring integrity, reducing costs, and simplifying workflows. For AI developers responsible for global-scale deployments, this feature eliminates one of the toughest challenges in infrastructure design: data portability. And since LOTA technology is already powering…

Excerpt shown — open the source for the full document.

Notability

notability 3.0/10

Routine infrastructure announcement