OpenAI releases new language model InstructGPT-3.5

[ad_1]


summary
Summary

OpenAI introduces “gpt-3.5-turbo-instruct”, a new instruction language model that is as efficient as the chat-optimized GPT-3.5 Turbo.

OpenAI is introducing “gpt-3.5-turbo-instruct” as a replacement for the existing Instruct models, as well as text-ada-001, text-babbage-001, text-curie-001, and the three text-davinci models that will be retired on January 4, 2024.

The cost and performance of “gpt-3.5-turbo-instruct” is the same as the other GPT-3.5 models with 4K context windows. The cutoff date for training data is September 2021.

Image: OpenAI

OpenAI says that gpt-3.5-turbo-instruct was trained “similarly” to previous Instruct models. The company does not provide details or benchmarks for the new Instruct model, instead referring to the January 2022 announcement of InstructGPT, which in turn was the basis for GPT-3.5.

Ad

Ad

OpenAI’s general statement is that GPT-4 follows complex instructions better than GPT-3.5 and produces higher quality than GPT-3.5, which in turn is significantly faster and cheaper.

Gpt-3.5-turbo-instruct is not a chat model, unlike GPT-3.5. Instead of conversations, it is optimized for directly answering questions or completing text. OpenAI claims that it is as fast as GPT-3.5-turbo.

The following graphic shows an OpenAI-designed differentiation between the Instruct and Chat models. The difference can have implications for how prompts need to be written.

Image: via Twitter, Adam.GPT (OpenAI)

OpenAI’s Logan Kilpatrick, who is responsible for developer relations, calls the new Instruct model a stopgap solution for the transition to 3.5 Turbo. It is not a “long-term solution,” he said.

Customers who have fine-tuned models in use will need to re-tune based on the new model versions. The fine-tuning feature is available for GPT-3.5, with GPT-4 scheduled for release later this year.

Recommendation

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top