WritingReplicateReplicatepublished May 16, 2025seen 5d

NVIDIA H100 GPUs are here

Open original ↗

Captured source

source ↗
published May 16, 2025seen 5dcaptured 3dhttp 200method plain

NVIDIA H100 GPUs are here – Replicate blog

Replicate Blog

NVIDIA H100 GPUs are here

Posted May 16, 2025 by zeke

You can now run NVIDIA H100 GPUs on Replicate.

You can also now use 2x, 4x, and 8x configurations of A100s and L40S GPUs. These were previously only available in deployments , but now you can use them for regular models and training runs.

If you’ve been waiting to speed up your model or try something more powerful, now’s a good time.

H100 pricing

1x H100s are now available to everyone.

2x, 4x, and 8x H100s are currently reserved for committed spend contracts.

Email us at team@replicate.com if you want access.

Hardware Price (per sec) Price (per hour) GPU GPU RAM CPU RAM H100 $0.001525 $5.49 1x 80GB 13x 72GB 2x H100 $0.003050 $10.98 2x 160GB – – 4x H100 $0.006100 $21.96 4x 320GB – – 8x H100 $0.012200 $43.92 8x 640GB – –

A100 pricing (2x, 4x, 8x)

These multi-GPU setups for A100s are now available for models (they were already available for deployments):

Hardware Price (per sec) Price (per hour) GPU GPU RAM CPU RAM 2x A100 (80GB) $0.002800 $10.08 2x 160GB 20x 288GB 4x A100 (80GB) $0.005600 $20.16 4x 320GB 40x 576GB 8x A100 (80GB) $0.011200 $40.32 8x 640GB 80x 960GB

See the full hardware pricing list for more details.

L40S pricing (2x, 4x, 8x)

These multi-GPU setups for L40S GPUs are now available for models (they were already available for deployments):

Hardware Price (per sec) Price (per hour) GPU GPU RAM CPU RAM 2x L40S $0.001950 $7.02 2x 96GB 20x 144GB 4x L40S $0.003900 $14.04 4x 192GB 40x 288GB 8x L40S $0.007800 $28.08 8x 384GB 80x 576GB

See the full hardware pricing list for more details.

Creating a new model using an H100 GPU

You can create a new model on the web or using the HTTP API .

Here’s a cURL command to create a new model that uses an H100 GPU:

Copy

curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN " \ -H 'Content-Type: application/json' \ -d '{"owner": "my-username", "name": "my-model", "description": "An example model", "visibility": "private", "hardware": "gpu-h100"}' \ https://api.replicate.com/v1/models

Listing available hardware via API

Here’s a cURL command to list available hardware for your account:

Copy

curl -s -X GET \ -H "Authorization: Bearer $REPLICATE_API_TOKEN " \ https://api.replicate.com/v1/hardware

This command outputs a list of all the hardware options available to you, and the names of the SKUs you can use in the hardware field when creating a new model via API:

Copy

[ { "sku" : "cpu" , "name" : "CPU" }, { "sku" : "gpu-a100-large" , "name" : "Nvidia A100 (80GB) GPU" }, { "sku" : "gpu-a100-large-2x" , "name" : "2x Nvidia A100 (80GB) GPU" }, { "sku" : "gpu-a100-large-4x" , "name" : "4x Nvidia A100 (80GB) GPU" }, { "sku" : "gpu-a100-large-8x" , "name" : "8x Nvidia A100 (80GB) GPU" }, { "sku" : "gpu-h100" , "name" : "Nvidia H100 GPU" }, { "sku" : "gpu-l40s" , "name" : "Nvidia L40S GPU" }, { "sku" : "gpu-l40s-2x" , "name" : "2x Nvidia L40S GPU" }, { "sku" : "gpu-l40s-4x" , "name" : "4x Nvidia L40S GPU" }, { "sku" : "gpu-l40s-8x" , "name" : "8x Nvidia L40S GPU" }, { "sku" : "gpu-t4" , "name" : "Nvidia T4 GPU" } ]

Updating your deployments

If you’re using a deployment , you can update the hardware configuration to use H100s or any of these new multi-GPU setups.

You can edit your deployment configuration on the web or use the HTTP API .

If you’re not sure how to best configure your deployments, email us at support@replicate.com .

Next: Run 30,000+ LoRAs on Hugging Face with Replicate

Notability

notability 6.0/10

Significant hardware availability announcement, moderate impact