Training
Learning path
Develop Generative AI solutions with Azure OpenAI Service - Training
Develop Generative AI solutions with Azure OpenAI Service
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Azure OpenAI Service is powered by a diverse set of models with different capabilities and price points. Model availability varies by region and cloud. For Azure Government model availability, please refer to Azure Government OpenAI Service.
Models | Description |
---|---|
GPT-4o & GPT-4 Turbo | The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
GPT-4 | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
GPT-3.5 | A set of models that improve on GPT-3 and can understand and generate natural language and code. |
Embeddings | A set of models that can convert text into numerical vector form to facilitate text similarity. |
DALL-E | A series of models that can generate original images from natural language. |
Whisper | A series of models in preview that can transcribe and translate speech to text. |
Text to speech (Preview) | A series of models in preview that can synthesize text to speech. |
GPT-4o integrates text and images in a single model, enabling it to handle multiple data types simultaneously. This multimodal approach enhances accuracy and responsiveness in human-computer interactions. GPT-4o matches GPT-4 Turbo in English text and coding tasks while offering superior performance in non-English languages and vision tasks, setting new benchmarks for AI capabilities.
Existing Azure OpenAI customers can test out the NEW GPT-4o mini model in the Azure OpenAI Studio Early Access Playground (Preview).
To test the latest model:
Note
The GPT-4o mini early access playground is currently only available for resources in West US3 and East US, and is limited to 10 requests every five minutes per subscription. Azure OpenAI content filters are enabled at the default configuration and cannot be modified. GPT-4o mini is a preview model and is currently not available for deployment/direct API access.
GPT-4o is available for standard and global-standard model deployment.
You need to create or use an existing resource in a supported standard or global standard region where the model is available.
When your resource is created, you can deploy the GPT-4o model. If you are performing a programmatic deployment, the model name is gpt-4o
, and the version is 2024-05-13
.
GPT-4 Turbo is a large multimodal model (accepting text or image inputs and generating text) that can solve difficult problems with greater accuracy than any of OpenAI's previous models. Like GPT-3.5 Turbo, and older GPT-4 models GPT-4 Turbo is optimized for chat and works well for traditional completions tasks.
The latest GA release of GPT-4 Turbo is:
gpt-4
Version: turbo-2024-04-09
This is the replacement for the following preview models:
gpt-4
Version: 1106-Preview
gpt-4
Version: 0125-Preview
gpt-4
Version: vision-preview
0409
turbo model supports JSON mode and function calling for all inference requests.turbo-2024-04-09
currently doesn't support the use of JSON mode and function calling when making inference requests with image (vision) input. Text based input requests (requests without image_url
and inline images) do support JSON mode and function calling.gpt-4
Version: turbo-2024-04-09
. This includes Optical Character Recognition (OCR), object grounding, video prompts, and improved handling of your data with images.gpt-4
Version: turbo-2024-04-09
is available for both standard and provisioned deployments. Currently the provisioned version of this model doesn't support image/vision inference requests. Provisioned deployments of this model only accept text input. Standard model deployments accept both text and image/vision inference requests.For information on model regional availability consult the model matrix for standard, and provisioned deployments.
To deploy the GA model from the Studio UI, select GPT-4
and then choose the turbo-2024-04-09
version from the dropdown menu. The default quota for the gpt-4-turbo-2024-04-09
model will be the same as current quota for GPT-4-Turbo. See the regional quota limits.
GPT-4 is the predecessor to GPT-4 Turbo. Both the GPT-4 and GPT-4 Turbo models have a base model name of gpt-4
. You can distinguish between the GPT-4 and Turbo models by examining the model version.
gpt-4
Version 0314
gpt-4
Version 0613
gpt-4-32k
Version 0613
You can see the token context length supported by each model in the model summary table.
See model versions to learn about how Azure OpenAI Service handles model version upgrades, and working with models to learn how to view and configure the model version settings of your GPT-4 deployments.
Model ID | Description | Max Request (tokens) | Training Data (up to) |
---|---|---|---|
gpt-4o (2024-05-13) GPT-4o (Omni) |
Latest GA model - Text, image processing - JSON Mode - parallel function calling - Enhanced accuracy and responsiveness - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision - Superior performance in non-English languages and in vision tasks - Does not support enhancements |
Input: 128,000 Output: 4,096 |
Oct 2023 |
gpt-4 (turbo-2024-04-09) GPT-4 Turbo with Vision |
New GA model - Replacement for all previous GPT-4 preview models ( vision-preview , 1106-Preview , 0125-Preview ). - Feature availability is currently different depending on method of input, and deployment type. - Does not support enhancements. |
Input: 128,000 Output: 4,096 |
Dec 2023 |
gpt-4 (0125-Preview)*GPT-4 Turbo Preview |
Preview Model -Replaces 1106-Preview - Better code generation performance - Reduces cases where the model doesn't complete a task - JSON Mode - parallel function calling - reproducible output (preview) |
Input: 128,000 Output: 4,096 |
Dec 2023 |
gpt-4 (vision-preview)GPT-4 Turbo with Vision Preview |
Preview model - Accepts text and image input. - Supports enhancements - JSON Mode - parallel function calling - reproducible output (preview) |
Input: 128,000 Output: 4,096 |
Apr 2023 |
gpt-4 (1106-Preview)GPT-4 Turbo Preview |
Preview Model - JSON Mode - parallel function calling - reproducible output (preview) |
Input: 128,000 Output: 4,096 |
Apr 2023 |
gpt-4-32k (0613) |
Older GA model - Basic function calling with tools |
32,768 | Sep 2021 |
gpt-4 (0613) |
Older GA model - Basic function calling with tools |
8,192 | Sep 2021 |
gpt-4-32k (0314) |
Older GA model - Retirement information |
32,768 | Sep 2021 |
gpt-4 (0314) |
Older GA model - Retirement information |
8,192 | Sep 2021 |
Caution
We don't recommend using preview models in production. We will upgrade all deployments of preview models to either future preview versions or to the latest stable/GA version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
turbo-2024-04-09
is the latest GA release and replaces 0125-Preview
, 1106-preview
, and vision-preview
.Important
gpt-4
versions 1106-Preview, 0125-Preview, and vision-preview will be upgraded with a stable version of gpt-4
in the future. Deployments of gpt-4
versions 1106-Preview, 0125-Preview, and vision-preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released. For each deployment, a model version upgrade takes place with no interruption in service for API calls. Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of gpt-4
versions 1106-Preview, 0125-Preview, and vision-preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region. See Azure OpenAI model retirements and deprecations for more information on the timing of the upgrade.GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well. GPT-3.5 Turbo is available for use with the Chat Completions API. GPT-3.5 Turbo Instruct has similar capabilities to text-davinci-003
using the Completions API instead of the Chat Completions API. We recommend using GPT-3.5 Turbo and GPT-3.5 Turbo Instruct over legacy GPT-3.5 and GPT-3 models.
Model ID | Description | Max Request (tokens) | Training Data (up to) |
---|---|---|---|
gpt-35-turbo (0125) NEW |
Latest GA Model - JSON Mode - parallel function calling - reproducible output (preview) - Higher accuracy at responding in requested formats. - Fix for a bug which caused a text encoding issue for non-English language function calls. |
Input: 16,385 Output: 4,096 |
Sep 2021 |
gpt-35-turbo (1106) |
Older GA Model - JSON Mode - parallel function calling - reproducible output (preview) |
Input: 16,385 Output: 4,096 |
Sep 2021 |
gpt-35-turbo-instruct (0914) |
Completions endpoint only - Replacement for legacy completions models |
4,097 | Sep 2021 |
gpt-35-turbo-16k (0613) |
Older GA Model - Basic function calling with tools |
16,384 | Sep 2021 |
gpt-35-turbo (0613) |
Older GA Model - Basic function calling with tools |
4,096 | Sep 2021 |
gpt-35-turbo 1 (0301) |
Older GA Model - Retirement information |
4,096 | Sep 2021 |
To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions API check out our in-depth how-to.
1 This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
text-embedding-3-large
is the latest and most capable embedding model. Upgrading between embeddings models is not possible. In order to move from using text-embedding-ada-002
to text-embedding-3-large
you would need to generate new embeddings.
text-embedding-3-large
text-embedding-3-small
text-embedding-ada-002
In testing, OpenAI reports both the large and small third generation embeddings models offer better average multi-language retrieval performance with the MIRACL benchmark while still maintaining performance for English tasks with the MTEB benchmark.
Evaluation Benchmark | text-embedding-ada-002 |
text-embedding-3-small |
text-embedding-3-large |
---|---|---|---|
MIRACL average | 31.4 | 44.0 | 54.9 |
MTEB average | 61.0 | 62.3 | 64.6 |
The third generation embeddings models support reducing the size of the embedding via a new dimensions
parameter. Typically larger embeddings are more expensive from a compute, memory, and storage perspective. Being able to adjust the number of dimensions allows more control over overall cost and performance. The dimensions
parameter is not supported in all versions of the OpenAI 1.x Python library, to take advantage of this parameter we recommend upgrading to the latest version: pip install openai --upgrade
.
OpenAI's MTEB benchmark testing found that even when the third generation model's dimensions are reduced to less than text-embeddings-ada-002
1,536 dimensions performance remains slightly better.
The DALL-E models generate images from text prompts that the user provides. DALL-E 3 is generally available for use with the REST APIs. DALL-E 2 and DALL-E 3 with client SDKs are in preview.
The Whisper models can be used for speech to text.
You can also use the Whisper model via Azure AI Speech batch transcription API. Check out What is the Whisper model? to learn more about when to use Azure AI Speech vs. Azure OpenAI Service.
The OpenAI text to speech models, currently in preview, can be used to synthesize text to speech.
You can also use the OpenAI text to speech voices via Azure AI Speech. To learn more, see OpenAI text to speech voices via Azure OpenAI Service or via Azure AI Speech guide.
Note
This article primarily covers model/region availability that applies to all Azure OpenAI customers with deployment types of Standard. Some select customers have access to model/region combinations that are not listed in the unified table below. For more information on Provisioned deployments, see our Provisioned guidance.
Region | gpt-4, 0613 | gpt-4, 1106-Preview | gpt-4, 0125-Preview | gpt-4, vision-preview | gpt-4, turbo-2024-04-09 | gpt-4o, 2024-05-13 | gpt-4-32k, 0613 | gpt-35-turbo, 0301 | gpt-35-turbo, 0613 | gpt-35-turbo, 1106 | gpt-35-turbo, 0125 | gpt-35-turbo-16k, 0613 | gpt-35-turbo-instruct, 0914 | text-embedding-ada-002, 1 | text-embedding-ada-002, 2 | text-embedding-3-small, 1 | text-embedding-3-large, 1 | dall-e-2, 2.0 | dall-e-3, 3.0 | babbage-002, 1 | davinci-002, 1 | tts, 001 | tts-hd, 001 | whisper, 001 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
australiaeast | ✅ | ✅ | - | ✅ | - | - | ✅ | - | ✅ | ✅ | - | ✅ | - | - | ✅ | - | - | - | ✅ | - | - | - | - | - |
brazilsouth | - | - | - | - | - | - | - | - | - | - | - | - | - | - | ✅ | - | - | - | - | - | - | - | - | - |
canadaeast | ✅ | ✅ | - | - | - | - | ✅ | - | ✅ | ✅ | ✅ | ✅ | - | - | ✅ | ✅ | ✅ | - | - | - | - | - | - | - |
eastus | - | - | ✅ | - | - | ✅ | - | ✅ | ✅ | - | - | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | - | - | - | - | - |
eastus2 | - | ✅ | - | - | ✅ | ✅ | - | - | ✅ | - | - | ✅ | - | - | ✅ | ✅ | ✅ | - | - | - | - | - | - | ✅ |
francecentral | ✅ | ✅ | - | - | - | - | ✅ | ✅ | ✅ | ✅ | - | ✅ | - | - | ✅ | - | ✅ | - | - | - | - | - | - | - |
japaneast | - | - | - | ✅ | - | - | - | - | ✅ | - | - | ✅ | - | - | ✅ | - | ✅ | - | - | - | - | - | - | - |
northcentralus | - | - | ✅ | - | - | ✅ | - | - | ✅ | - | ✅ | ✅ | - | - | ✅ | - | - | - | - | ✅ | ✅ | ✅ | ✅ | ✅ |
norwayeast | - | ✅ | - | - | - | - | - | - | - | - | - | - | - | - | ✅ | - | - | - | - | - | - | - | - | ✅ |
southafricanorth | - | - | - | - | - | - | - | - | - | - | - | - | - | - | ✅ | - | - | - | - | - | - | - | - | - |
southcentralus | - | - | ✅ | - | - | ✅ | - | ✅ | - | - | ✅ | - | - | ✅ | ✅ | - | - | - | - | - | - | - | - | - |
southindia | - | ✅ | - | - | - | - | - | - | - | ✅ | - | - | - | - | ✅ | - | ✅ | - | - | - | - | - | - | ✅ |
swedencentral | ✅ | ✅ | - | ✅ | ✅ | ✅ | ✅ | - | ✅ | ✅ | - | ✅ | ✅ | - | ✅ | - | ✅ | - | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
switzerlandnorth | ✅ | - | - | ✅ | - | - | ✅ | - | ✅ | - | - | ✅ | - | - | ✅ | - | - | - | - | - | - | - | - | - |
uksouth | - | ✅ | ✅ | - | - | - | - | ✅ | ✅ | ✅ | - | ✅ | - | - | ✅ | - | ✅ | - | - | - | - | - | - | - |
westeurope | - | - | - | - | - | - | - | ✅ | - | - | - | - | - | - | ✅ | - | - | - | - | - | - | - | - | ✅ |
westus | - | ✅ | - | ✅ | - | ✅ | - | - | - | ✅ | - | - | - | - | ✅ | - | - | - | - | - | - | - | - | - |
westus3 | - | ✅ | - | - | - | ✅ | - | - | - | - | - | - | - | - | ✅ | - | ✅ | - | - | - | - | - | - | - |
This table doesn't include global standard model deployment regional availability for GPT-4o, or fine-tuning regional availability information. Consult the dedicated global standard deployment section and the fine-tuning section for this information.
Region | GPT-4 | GPT-4-32K | GPT-4-Turbo | GPT-4-Turbo-V | gpt-4o | gpt-4o - GlobalStandard | GPT-35-Turbo | GPT-35-Turbo-Instruct | Text-Embedding-Ada-002 | text-embedding-3-small | text-embedding-3-large | Babbage-002 | Babbage-002 - finetune | Davinci-002 | Davinci-002 - finetune | GPT-35-Turbo - finetune | GPT-35-Turbo-1106 - finetune | GPT-4 - finetune | GPT-35-Turbo-0125 - finetune |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
australiaeast | 40 K | 80 K | 80 K | 30 K | - | 450 K 30 M |
300 K | - | 350 K | - | - | - | - | - | - | - | - | - | - |
brazilsouth | - | - | - | - | - | 450 K 30 M |
- | - | 350 K | - | - | - | - | - | - | - | - | - | - |
canadaeast | 40 K | 80 K | 80 K | - | - | 450 K 30 M |
300 K | - | 350 K | 350 K | 350 K | - | - | - | - | - | - | - | - |
eastus | - | - | 80 K | - | 150 K 1 M |
450 K 30 M |
240 K | 240 K | 240 K | 350 K | 350 K | - | - | - | - | - | - | - | - |
eastus2 | - | - | 80 K | - | 150 K 1 M |
450 K 30 M |
300 K | - | 350 K | 350 K | 350 K | - | - | - | - | 250 K | 250 K | - | 250 K |
francecentral | 20 K | 60 K | 80 K | - | - | 450 K 30 M |
240 K | - | 240 K | - | 350 K | - | - | - | - | - | - | - | - |
germanywestcentral | - | - | - | - | - | 450 K 30 M |
- | - | - | - | - | - | - | - | - | - | - | - | - |
japaneast | - | - | - | 30 K | - | 450 K 30 M |
300 K | - | 350 K | - | 350 K | - | - | - | - | - | - | - | - |
koreacentral | - | - | - | - | - | 450 K 30 M |
- | - | - | - | - | - | - | - | - | - | - | - | - |
northcentralus | - | - | 80 K | - | 150 K 1 M |
450 K 30 M |
300 K | - | 350 K | - | - | 240 K | 250 K | 240 K | 250 K | 250 K | 250 K | 100 K | 250 K |
norwayeast | - | - | 150 K | - | - | 450 K 30 M |
- | - | 350 K | - | - | - | - | - | - | - | - | - | - |
polandcentral | - | - | - | - | - | 450 K 30 M |
- | - | - | - | - | - | - | - | - | - | - | - | - |
southafricanorth | - | - | - | - | - | 450 K 30 M |
- | - | 350 K | - | - | - | - | - | - | - | - | - | - |
southcentralus | - | - | 80 K | - | 150 K 1 M |
450 K 30 M |
240 K | - | 240 K | - | - | - | - | - | - | - | - | - | - |
southindia | - | - | 150 K | - | - | 450 K 30 M |
300 K | - | 350 K | - | 350 K | - | - | - | - | - | - | - | - |
swedencentral | 40 K | 80 K | 150 K | 30 K | 150 K 1 M |
450 K 30 M |
300 K | 240 K | 350 K | - | 350 K | 240 K | 250 K | 240 K | 250 K | 250 K | 250 K | 100 K | 250 K |
switzerlandnorth | 40 K | 80 K | - | 30 K | - | 450 K 30 M |
300 K | - | 350 K | - | - | - | - | - | - | - | - | - | - |
switzerlandwest | - | - | - | - | - | - | - | - | - | - | - | - | 250 K | - | 250 K | 250 K | 250 K | - | 250 K |
uksouth | - | - | 80 K | - | - | 450 K 30 M |
240 K | - | 350 K | - | 350 K | - | - | - | - | - | - | - | - |
westeurope | - | - | - | - | - | 450 K 30 M |
240 K | - | 240 K | - | - | - | - | - | - | - | - | - | - |
westus | - | - | 80 K | 30 K | 150 K 1 M |
450 K 30 M |
300 K | - | 350 K | - | - | - | - | - | - | - | - | - | - |
westus3 | - | - | 80 K | - | 150 K 1 M |
450 K 30 M |
- | - | 350 K | - | 350 K | - | - | - | - | - | - | - | - |
Region | gpt-4, 0613 | gpt-4, 1106-Preview | gpt-4, 0125-Preview | gpt-4, turbo-2024-04-09 | gpt-4o, 2024-05-13 | gpt-4-32k, 0613 | gpt-35-turbo, 1106 | gpt-35-turbo, 0125 |
---|---|---|---|---|---|---|---|---|
australiaeast | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
brazilsouth | ✅ | ✅ | ✅ | - | ✅ | ✅ | ✅ | - |
canadacentral | ✅ | - | - | - | - | ✅ | - | ✅ |
canadaeast | ✅ | ✅ | - | ✅ | ✅ | - | ✅ | - |
eastus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
eastus2 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
francecentral | ✅ | ✅ | ✅ | - | ✅ | ✅ | - | ✅ |
germanywestcentral | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | - |
japaneast | - | ✅ | ✅ | ✅ | ✅ | - | - | ✅ |
koreacentral | ✅ | - | - | ✅ | ✅ | ✅ | ✅ | - |
northcentralus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
norwayeast | ✅ | - | ✅ | - | - | ✅ | - | - |
polandcentral | ✅ | ✅ | ✅ | - | - | ✅ | ✅ | ✅ |
southafricanorth | ✅ | ✅ | - | - | - | ✅ | ✅ | - |
southcentralus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
southindia | ✅ | ✅ | ✅ | - | ✅ | ✅ | ✅ | ✅ |
swedencentral | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
switzerlandnorth | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
switzerlandwest | - | - | - | - | - | - | - | ✅ |
uksouth | ✅ | ✅ | ✅ | ✅ | - | ✅ | ✅ | ✅ |
westus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
westus3 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
Note
The provisioned version of gpt-4
Version: turbo-2024-04-09
is currently limited to text only.
You need to speak with your Microsoft sales/account team to acquire provisioned throughput. If you don't have a sales/account team, unfortunately at this time, you cannot purchase provisioned throughput.
For more information on Provisioned deployments, see our Provisioned guidance.
Supported models:
gpt-4o
Version: 2024-05-13
Supported regions:
Region | gpt-4, 0613 | gpt-4, 1106-Preview | gpt-4, 0125-Preview | gpt-4, vision-preview | gpt-4, turbo-2024-04-09 | gpt-4o, 2024-05-13 | gpt-4-32k, 0613 |
---|---|---|---|---|---|---|---|
australiaeast | ✅ | ✅ | - | ✅ | - | - | ✅ |
canadaeast | ✅ | ✅ | - | - | - | - | ✅ |
eastus | - | - | ✅ | - | - | ✅ | - |
eastus2 | - | ✅ | - | - | ✅ | ✅ | - |
francecentral | ✅ | ✅ | - | - | - | - | ✅ |
japaneast | - | - | - | ✅ | - | - | - |
northcentralus | - | - | ✅ | - | - | ✅ | - |
norwayeast | - | ✅ | - | - | - | - | - |
southcentralus | - | - | ✅ | - | - | ✅ | - |
southindia | - | ✅ | - | - | - | - | - |
swedencentral | ✅ | ✅ | - | ✅ | ✅ | ✅ | ✅ |
switzerlandnorth | ✅ | - | - | ✅ | - | - | ✅ |
uksouth | - | ✅ | ✅ | - | - | - | - |
westus | - | ✅ | - | ✅ | - | ✅ | - |
westus3 | - | ✅ | - | - | - | ✅ | - |
In addition to the regions above which are available to all Azure OpenAI customers, some select pre-existing customers have been granted access to versions of GPT-4 in additional regions:
Model | Region |
---|---|
gpt-4 (0314) gpt-4-32k (0314) |
East US France Central South Central US UK South |
gpt-4 (0613) gpt-4-32k (0613) |
East US East US 2 Japan East UK South |
Important
The NEW gpt-35-turbo (0125)
model has various improvements, including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
GPT-3.5 Turbo is used with the Chat Completion API. GPT-3.5 Turbo version 0301 can also be used with the Completions API, though this is not recommended. GPT-3.5 Turbo versions 0613 and 1106 only support the Chat Completions API.
GPT-3.5 Turbo version 0301 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
See model versions to learn about how Azure OpenAI Service handles model version upgrades, and working with models to learn how to view and configure the model version settings of your GPT-3.5 Turbo deployments.
Region | gpt-35-turbo, 0301 | gpt-35-turbo, 0613 | gpt-35-turbo, 1106 | gpt-35-turbo, 0125 | gpt-35-turbo-16k, 0613 | gpt-35-turbo-instruct, 0914 |
---|---|---|---|---|---|---|
australiaeast | - | ✅ | ✅ | - | ✅ | - |
canadaeast | - | ✅ | ✅ | ✅ | ✅ | - |
eastus | ✅ | ✅ | - | - | ✅ | ✅ |
eastus2 | - | ✅ | - | - | ✅ | - |
francecentral | ✅ | ✅ | ✅ | - | ✅ | - |
japaneast | - | ✅ | - | - | ✅ | - |
northcentralus | - | ✅ | - | ✅ | ✅ | - |
southcentralus | ✅ | - | - | ✅ | - | - |
southindia | - | - | ✅ | - | - | - |
swedencentral | - | ✅ | ✅ | - | ✅ | ✅ |
switzerlandnorth | - | ✅ | - | - | ✅ | - |
uksouth | ✅ | ✅ | ✅ | - | ✅ | - |
westeurope | ✅ | - | - | - | - | - |
westus | - | - | ✅ | - | - | - |
These models can only be used with Embedding API requests.
Note
text-embedding-3-large
is the latest and most capable embedding model. Upgrading between embedding models is not possible. In order to migrate from using text-embedding-ada-002
to text-embedding-3-large
you would need to generate new embeddings.
Model ID | Max Request (tokens) | Output Dimensions | Training Data (up-to) |
---|---|---|---|
text-embedding-ada-002 (version 2) |
8,191 | 1,536 | Sep 2021 |
text-embedding-ada-002 (version 1) |
2,046 | 1,536 | Sep 2021 |
text-embedding-3-large |
8,191 | 3,072 | Sep 2021 |
text-embedding-3-small |
8,191 | 1,536 | Sep 2021 |
Note
When sending an array of inputs for embedding, the max number of input items in the array per call to the embedding endpoint is 2048.
Region | text-embedding-ada-002, 1 | text-embedding-ada-002, 2 | text-embedding-3-small, 1 | text-embedding-3-large, 1 |
---|---|---|---|---|
australiaeast | - | ✅ | - | - |
brazilsouth | - | ✅ | - | - |
canadaeast | - | ✅ | ✅ | ✅ |
eastus | ✅ | ✅ | ✅ | ✅ |
eastus2 | - | ✅ | ✅ | ✅ |
francecentral | - | ✅ | - | ✅ |
japaneast | - | ✅ | - | ✅ |
northcentralus | - | ✅ | - | - |
norwayeast | - | ✅ | - | - |
southafricanorth | - | ✅ | - | - |
southcentralus | ✅ | ✅ | - | - |
southindia | - | ✅ | - | ✅ |
swedencentral | - | ✅ | - | ✅ |
switzerlandnorth | - | ✅ | - | - |
uksouth | - | ✅ | - | ✅ |
westeurope | - | ✅ | - | - |
westus | - | ✅ | - | - |
westus3 | - | ✅ | - | ✅ |
Model ID | Feature Availability | Max Request (characters) |
---|---|---|
dalle2 (preview) | East US | 1,000 |
dall-e-3 | East US, Australia East, Sweden Central | 4,000 |
babbage-002
and davinci-002
are not trained to follow instructions. Querying these base models should only be done as a point of reference to a fine-tuned version to evaluate the progress of your training.
gpt-35-turbo
- fine-tuning of this model is limited to a subset of regions, and is not available in every region the base model is available.
Model ID | Fine-Tuning Regions | Max Request (tokens) | Training Data (up to) |
---|---|---|---|
babbage-002 |
North Central US Sweden Central Switzerland West |
16,384 | Sep 2021 |
davinci-002 |
North Central US Sweden Central Switzerland West |
16,384 | Sep 2021 |
gpt-35-turbo (0613) |
East US2 North Central US Sweden Central Switzerland West |
4,096 | Sep 2021 |
gpt-35-turbo (1106) |
East US2 North Central US Sweden Central Switzerland West |
Input: 16,385 Output: 4,096 |
Sep 2021 |
gpt-35-turbo (0125) |
East US2 North Central US Sweden Central Switzerland West |
16,385 | Sep 2021 |
gpt-4 (0613) 1 |
North Central US Sweden Central |
8192 | Sep 2021 |
1 GPT-4 fine-tuning is currently in public preview. See our GPT-4 fine-tuning safety evaluation guidance for more information.
Model ID | Model Availability | Max Request (audio file size) |
---|---|---|
whisper |
East US 2 North Central US Norway East South India Sweden Central West Europe |
25 MB |
Model ID | Model Availability |
---|---|
tts-1 |
North Central US Sweden Central |
tts-1-hd |
North Central US Sweden Central |
For Assistants you need a combination of a supported model, and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see provisioned throughput. The listed models and regions can be used with both Assistants v1 and v2.
Region | gpt-35-turbo (0613) |
gpt-35-turbo (1106) |
fine tuned gpt-3.5-turbo-0125 |
gpt-4 (0613) |
gpt-4 (1106) |
gpt-4 (0125) |
gpt-4o (2024-05-13) |
---|---|---|---|---|---|---|---|
Australia East | ✅ | ✅ | ✅ | ✅ | |||
East US | ✅ | ✅ | ✅ | ||||
East US 2 | ✅ | ✅ | ✅ | ✅ | ✅ | ||
France Central | ✅ | ✅ | ✅ | ✅ | |||
Japan East | ✅ | ||||||
Norway East | ✅ | ||||||
Sweden Central | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | |
UK South | ✅ | ✅ | ✅ | ✅ | |||
West US | ✅ | ✅ | ✅ | ||||
West US 3 | ✅ | ✅ |
For the latest information on model retirements, refer to the model retirement guide.
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for
Training
Learning path
Develop Generative AI solutions with Azure OpenAI Service - Training
Develop Generative AI solutions with Azure OpenAI Service
Documentation
Understanding Azure OpenAI Service deployment types - Azure AI services
Learn how to use Azure OpenAI deployment types | Global-Standard | Standard | Provisioned.
Azure OpenAI Service quotas and limits - Azure AI services
Quick reference, detailed description, and best practices on the quotas and limits for the OpenAI service in Azure AI services.
Azure OpenAI Service model retirements - Azure OpenAI
Learn about the model deprecations and retirements in Azure OpenAI.