What does this writing signal mean?

OpenAI Writing: Introducing ChatGPT and Whisper APIs

Captured source

source ↗

openai.com/openai.com/index/introducing-chatgpt-and-whisper-apis

Introducing ChatGPT and Whisper APIs

Source ↗

published Apr 24, 2024seen 6dcaptured 2dhttp 200method exa

Introducing ChatGPT and Whisper APIs | OpenAI

Updated: April 24, 2024

Introducing APIs for GPT‑3.5 Turbo and Whisper

Developers can now integrate GPT‑3.5 Turbo and Whisper models into their apps and products through our API.

Loading…

Update on April 24, 2024: The ChatGPT API name has been discontinued. Mentions of the ChatGPT API in this blog refer to the GPT‑3.5 Turbo API.

ChatGPT and Whisper models are now available on our API, giving developers access to cutting-edge language (not just chat!) and speech-to-text capabilities. Through a series of system-wide optimizations, we’ve achieved 90% cost reduction for ChatGPT since December; we’re now passing through those savings to API users. Developers can now use our open-source Whisper large-v2 model in the API with much faster and cost-effective results. GPT‑3.5 API users can expect continuous model improvements and the option to choose dedicated capacity for deeper control over the models. We’ve also listened closely to feedback from our developers and refined our API terms of service to better meet their needs.

Early users of GPT-3.5 Turbo and Whisper APIs

Snap Inc⁠., the creator of Snapchat, introduced My AI for Snapchat+ this week. The experimental feature is running on GPT‑3.5 API. My AI offers Snapchatters a friendly, customizable chatbot at their fingertips that offers recommendations, and can even write a haiku for friends in seconds. Snapchat, where communication and messaging is a daily behavior, has 750 million monthly Snapchatters:

My AI for Snapchat+

Quizlet⁠ is a global learning platform with more than 60 million students using it to study, practice and master whatever they’re learning. Quizlet has worked with OpenAI for the last three years, leveraging GPT‑3 across multiple use cases, including vocabulary learning and practice tests. With the launch of GPT‑3.5 API, Quizlet is introducing Q-Chat, a fully-adaptive AI tutor that engages students with adaptive questions based on relevant study materials delivered through a fun chat experience:

Quizlet Q-Chat

Instacart⁠ is augmenting the Instacart app to enable customers to ask about food and get inspirational, shoppable answers. This uses ChatGPT alongside Instacart’s own AI and product data from their 75,000+ retail partner store locations to help customers discover ideas for open-ended shopping goals, such as “How do I make great fish tacos?” or “What’s a healthy lunch for my kids?” Instacart plans to launch “Ask Instacart” later this year:

Instacart’s Ask Instacart

Shop⁠, Shopify’s consumer app, is used by 100 million shoppers to find and engage with the products and brands they love. GPT‑3.5 API is used to power Shop’s new shopping assistant. When shoppers search for products, the shopping assistant makes personalized recommendations based on their requests. Shop’s new AI-powered shopping assistant will streamline in-app shopping by scanning millions of products to quickly find what buyers are looking for—or help them discover something new:

Shopify’s Shop app

Speak⁠ is an AI-powered language learning app focused on building the best path to spoken fluency. They’re the fastest-growing English app in South Korea, and are already using the Whisper API to power a new AI speaking companion product, and rapidly bring it to the rest of the globe. Whisper’s human-level accuracy for language learners of every level unlocks true open-ended conversational practice and highly accurate feedback:

The Speak app

GPT-3.5 API

Model: The ChatGPT model family we are releasing today,gpt-3.5-turbo, is the same model used in the ChatGPT product. It is priced at $0.002 per 1k tokens, which is 10x cheaper than our existing GPT‑3.5 models. It’s also our best model for many non-chat use cases—we’ve seen early testers migrate fromtext-davinci-003 togpt-3.5-turbo with only a small amount of adjustment needed to their prompts.

API: Traditionally, GPT models consume unstructured text, which is represented to the model as a sequence of “tokens.” ChatGPT models instead consume a sequence of messages together with metadata. (For the curious: under the hood, the input is still rendered to the model as a sequence of “tokens” for the model to consume; the raw format used by the model is a new format called Chat Markup Language⁠(“ChatML”).)

We’ve created a new endpoint to interact with our ChatGPT models:

Bash

1curl https://api.openai.com/v1/chat/completions \2 -H "Authorization: Bearer $OPENAI_API_KEY" \3 -H "Content-Type: application/json" \4 -d '{5 "model": "gpt-3.5-turbo",6 "messages": [{"role": "user", "content": "What is the OpenAI mission?"}] 7 }'

To learn more about the GPT‑3.5 API, visit our Chat guide⁠.

ChatGPT upgrades

We are constantly improving our ChatGPT models, and want to make these enhancements available to developers as well. Developers who use thegpt-3.5-turbo model will always get our recommended stable model, while still having the flexibility to opt for a specific model version. For example, today we’re releasinggpt-3.5-turbo-0301, which will be supported through at least June 1st, and we’ll updategpt-3.5-turbo to a new stable release in April. The models page⁠ will provide switchover updates.

Dedicated instances

We are also now offering dedicated instances for users who want deeper control over the specific model version and system performance. By default, requests are run on compute infrastructure shared with other users, who pay per request. Our API runs on Azure, and with dedicated instances, developers will pay by time period for an allocation of compute infrastructure that’s reserved for serving their requests.

Developers get full control over the instance’s load (higher load improves throughput but makes each request slower), the option to enable features such as longer context limits, and the ability to pin the model snapshot.

Dedicated instances can make economic sense for developers running beyond ~450M tokens per day. Additionally, it enables directly optimizing a developer’s workload against hardware performance, which can dramatically reduce costs relative to shared infrastructure. For dedicated instance inquiries, contact us⁠.

Whisper API

Whisper⁠, the speech-to-text model we open-sourced in September 2022, has received immense praise from the developer community but can also be hard to run. We’ve now made the large-v2 model available through our API, which gives convenient on-demand…

Excerpt shown — open the source for the full document.