What does this fork signal mean?

Baseten forked basetenlabs/mpt-7b-base-truss (forked from tuhins/mpt-7b-instruct-truss). This fork signal points to upstream code the lab may be inspecting, patching, or building on. High-signal details: repo basetenlabs/mpt-7b-base-truss · parent tuhins/mpt-7b-instruct-truss. onlylabs links this event to 1 captured evidence page and 6 related fork signals.

Baseten Fork: basetenlabs/mpt-7b-base-truss

Captured source

source ↗

GitHub/github.com/basetenlabs/mpt-7b-base-truss

basetenlabs/mpt-7b-base-truss repository metadata

Source ↗

published May 23, 2023seen 5dcaptured 8hhttp 200method plain

basetenlabs/mpt-7b-base-truss

Description: A deployment "truss" for the MPT-7B Base model from MosaicML

Language: Python

License: Apache-2.0

Stars: 0

Forks: 1

Open issues: 2

Created: 2023-05-23T21:01:44Z

Pushed: 2023-07-23T16:16:34Z

Default branch: main

Fork: yes

Parent repository: tuhins/mpt-7b-instruct-truss

Archived: yes

README:

MPT-7B Base Truss

This is a Truss for MPT-7B using the mosaicml/mpt-7b family of models from the transformers library. This README will walk you through how to deploy the base version on Baseten.

Truss

Truss is an open-source model serving framework developed by Baseten. It allows you to develop and deploy machine learning models onto Baseten (and other platforms like AWS or GCP. Using Truss, you can develop a GPU model using live-reload, package models and their associated code, create Docker containers and deploy on Baseten.

Deploying MPT-7B

To deploy the MPT-7B Base Truss, you'll need to follow these steps:

1. __Prerequisites__: Make sure you have a Baseten account and API key. You can sign up for a Baseten account here.

2. __Install Truss and the Baseten Python client__: If you haven't already, install the Baseten Python client and Truss in your development environment using:

pip install --upgrade baseten truss

3. __Load the MPT-7B Base Truss__: Assuming you've cloned this repo, spin up an IPython shell and load the Truss into memory:

import truss
mpt_truss = truss.load("path/to/mpt_truss")

4. __Log in to Baseten__: Log in to your Baseten account using your API key (key found here):

import baseten

baseten.login("PASTE_API_KEY_HERE")

5. __Deploy the MPT-7B Base Truss__: Deploy MPT-7B Base to Baseten with the following command:

baseten.deploy(mpt_truss)

Once your Truss is deployed, you can start using MPT-7B Base through the Baseten platform! Navigate to the Baseten UI to watch the model build and deploy and invoke it via the REST API.

MPT-7B API documentation

This section provides an overview of the MPT-7B Base API, its parameters, and how to use it. The API consists of a single route named predict, which you can invoke to generate text completions based on the provided parameters.

API route: `predict`

The predict route is the primary method for generating images based on a given set of parameters. It takes several parameters:

__prompt__: The input text prompt for the LLM
__max_tokens__: (optional) The maximum number of tokens that will be generated
__temperature__: (optional) A temperature of 0 means the response is more deterministic. A temperature of greater than zero results in increasing variation in the completion.
__top_k__: (optional) Controls how model picks the next token from the top k tokens in its list, sorted by probability.
__top_p__: (optional) Controls how the model picks from the top tokens based on the sum of their probabilities.

Example usage

You can use the baseten model package to invoke your model from Python

import baseten
# You can retrieve your deployed model ID from the UI
model = baseten.deployed_model_version_id('YOUR_MODEL_ID')

request = {
"prompt" : "Today I inspected the engine mounting equipment. I found a problem in one of the brackets so"
"temperature": 0.75,
"max_tokens": 200
}

response = model.predict(request)

You can also invoke your model via a REST API

curl -X POST " https://app.baseten.co/models/YOUR_MODEL_ID/predict" \
-H "Content-Type: application/json" \
-H 'Authorization: Api-Key {YOUR_API_KEY}' \
-d '{
"prompt" : "Today I inspected the engine mounting equipment. I found a problem in one of the brackets so" \
"temperature": 0.75, \
"max_tokens": 200 \
}'