What does this repo signal mean?

DeepInfra published deepinfra/cog-llama-2 (Python). This repository signal exposes tooling, eval, infrastructure, or model-adjacent work before it may appear in a launch post. High-signal details: repo deepinfra/cog-llama-2 · language Python · Routine Cog container for Llama 2.. onlylabs links this event to 1 captured evidence page and 6 related repo signals.

DeepInfra Repo: deepinfra/cog-llama-2

Captured source

source ↗

GitHub/github.com/deepinfra/cog-llama-2

deepinfra/cog-llama-2 repository metadata

Source ↗

published Aug 1, 2023seen Jun 5captured Jun 11http 200method plain

deepinfra/cog-llama-2

Description: A cog for running llama-2 using llama.cpp server

Language: Python

Stars: 0

Forks: 0

Open issues: 0

Created: 2023-08-01T09:33:38Z

Pushed: 2023-08-01T09:47:41Z

Default branch: main

Fork: no

Archived: no

README: Setup =====

download cog: https://github.com/replicate/cog/releases/
download a llama.cpp quantization from https://huggingface.co/TheBloke/Llama-2-70B-Chat-GGML/tree/main, place in weights/
tweak predict.py MODEL variable to match your weights
try sample inference with cog predict -i prompt="What came first, the chicken or the egg?"
make sure to put the right image in cog.yaml so the container name is ready to push out-of-the box
once you're ready run cog build, test it one more time with

docker run --gpus all --rm -it -p 5000:5000 IMAGE_NAME
# in another terminal, after it starts
curl -X POST http://127.0.0.1:5000/predictions \
--data '{"input": {"prompt": "Hello"}}' \
-H 'Content-Type: application/json' \
| python -m json.tool

(optional) push image with docker push IMAGE_NAME

Notes =====

the EXTRA flags at the top of predict.py are for 70B (and 35B) llama-2 models. For 13B and under, drop -gqa and -eps, i.e leave -ngl.
the setup function starts the llama server and waits for it to become available
for predictions we query the HTTP server
for cog build --separate-weights you might need a more recent cog 0.8.4+ with some kinks fixed

Notability

notability 3.0/10

Routine Cog container for Llama 2.