WritingReplicateReplicatepublished Sep 6, 2023seen 5d

Fine-tuned models now boot in less than one second

Open original ↗

Captured source

source ↗
published Sep 6, 2023seen 5dcaptured 3dhttp 200method plain

Fine-tuned models now boot in less than one second – Replicate blog

Replicate Blog

Fine-tuned models now boot in less than one second

Posted September 6, 2023 by andreasjansson

You can fine-tune language models like Llama 2 or image models like SDXL with your own data on Replicate. If you don’t make any requests to your fine-tuned model for a while, it can take some time to start again. This is called a cold boot, and can be as slow as a few minutes for large models.

We’ve made some dramatic improvements to cold boots for fine-tuned models. They now boot in less than one second .

It works on these models:

meta/llama-2-7b-chat

meta/llama-2-13b-chat

meta/llama-2-70b-chat

meta/llama-2-7b

meta/llama-2-13b

meta/llama-2-70b

stability-ai/sdxl

For now, it’s available only for new fine-tuned models created starting today. We’re also working on a more cold boot improvements for all models. Stay tuned.

To get started, check out these guides:

Fine-tune Llama 2

Fine-tune SDXL

Let’s go. 🚀

Next: Painting with words: a history of text-to-image AI