Skip to content

Commit

Permalink
Merge pull request #103 from MicrosoftDocs/main
Browse files Browse the repository at this point in the history
[Publishing] [Scheduled Publish] 9/03/2024 PM Publish
  • Loading branch information
Albertyang0 authored Sep 3, 2024
2 parents 7aa6b87 + 413694e commit 7d4d00a
Show file tree
Hide file tree
Showing 12 changed files with 840 additions and 68 deletions.
44 changes: 25 additions & 19 deletions articles/ai-services/content-safety/overview.md
Original file line number Diff line number Diff line change
Expand Up @@ -140,25 +140,31 @@ For more information, see [Language support](/azure/ai-services/content-safety/l

To use the Content Safety APIs, you must create your Azure AI Content Safety resource in the supported regions. Currently, the Content Safety features are available in the following Azure regions:

|Region | Moderation APIs<br>(text and image) | Prompt Shields<br>(preview) | Protected material<br>detection (preview) | Groundedness<br>detection (preview) | Custom categories<br>(rapid) (preview) | Custom categories<br>(standard) | Blocklists |
|---|---|---|---|---|---|---|--|
| East US ||||||||
| East US 2 || | ||| ||
| West US | | | | || | |
| West US 2 || | | || ||
| Central US || | | | | ||
| North Central US || | | || ||
| South Central US || | | || ||
| Canada East || | | || ||
| Switzerland North || | | ||||
| Sweden Central || | ||| ||
| UK South || | | || ||
| France Central || | | || ||
| West Europe |||| || ||
| Japan East || | | || ||
| Australia East|| | | ||||
| USGov Arizona || | | | | | |
| USGov Virginia || | | | | | |
| Region | Moderation APIs (text and image) | Prompt Shields | Protected material detection for Text | Groundedness detection (preview) | Custom categories (rapid) (preview) | Custom categories (standard) | Blocklists |
|---------------------|----------------------------------|----------------|--------------------------------------|-----------------------------------|------------------------------------|-------------------------------|------------|
| East US ||||||||
| East US 2 || |||| ||
| West US | ||| || | |
| West US 2 |||| || ||
| West US 3 |||| || ||
| Poland Central |||| || ||
| South East Asia || || || ||
| Central US || || || ||
| North Central US || || || ||
| South Central US || || || ||
| Canada East |||| || ||
| Switzerland North |||| ||||
| Sweden Central |||||| ||
| UK South || || || ||
| France Central |||| || ||
| West Europe |||| || ||
| Japan East |||| || ||
| Australia East |||| || ||
| South India || || ||||
| USGov Arizona |||| | | | |
| USGov Virginia |||| | | | |



Feel free to [contact us](mailto:contentsafetysupport@microsoft.com) if you need other regions for your business.

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -59,7 +59,7 @@ The following limits are observed for the conversational language understanding.

|Item|Lower Limit| Upper Limit |
| --- | --- | --- |
|Number of utterances per project | 1 | 25,000|
|Number of utterances per project | 1 | 50,000|
|Utterance length in characters (authoring) | 1 | 500 |
|Utterance length in characters (prediction) | 1 | 1000 |
|Number of intents per project | 1 | 500|
Expand Down
33 changes: 23 additions & 10 deletions articles/ai-services/openai/concepts/model-retirements.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
description: Learn about the model deprecations and retirements in Azure OpenAI.
ms.service: azure-ai-openai
ms.topic: conceptual
ms.date: 08/22/2024
ms.date: 09/03/2024
ms.custom:
manager: nitinme
author: mrbullwinkle
Expand Down Expand Up @@ -89,17 +89,17 @@ For information on the model upgrade process, see [How to upgrade to a new model
These models are currently available for use in Azure OpenAI Service.

| Model | Version | Retirement date | Suggested replacement |
| Model | Version | Retirement date | Suggested replacements |
| ---- | ---- | ---- | --- |
| `gpt-35-turbo` | 0301 | No earlier than November 1, 2024 | `gpt-4o-mini` |
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | November 1, 2024 | `gpt-4o-mini` |
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 | `gpt-4o-mini` |
| `gpt-35-turbo` | 0301 | No earlier than November 1, 2024<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | November 1, 2024 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 | `gpt-4o-mini` |
| `gpt-4`<br>`gpt-4-32k` | 0314 | **Deprecation:** November 1, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
| `gpt-4`<br>`gpt-4-32k` | 0613 | **Deprecation:** November 1, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o` |
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
| `gpt-4`<br>`gpt-4-32k` | 0314 | **Deprecation:** November 15, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
| `gpt-4`<br>`gpt-4-32k` | 0613 | **Deprecation:** November 15, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o` |
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 | |
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
Expand All @@ -111,6 +111,14 @@ These models are currently available for use in Azure OpenAI Service.
> [!IMPORTANT]
> Vision enhancements preview features including Optical Character Recognition (OCR), object grounding, video prompts will be retired and no longer available once `gpt-4` Version: `vision-preview` is upgraded to `turbo-2024-04-09`. If you are currently relying on any of these preview features, this automatic model upgrade will be a breaking change.
## Default model versions

| Model | Current default version | New default version | Default upgrade date |
|---|---|---|---|
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024.|



## Deprecated models

These models were deprecated on July 6, 2023 and were retired on June 14, 2024. These models are no longer available for new deployments. Deployments created before July 6, 2023 remain available to customers until June 14, 2024. We recommend customers migrate their applications to deployments of replacement models before the June 14, 2024 retirement.
Expand Down Expand Up @@ -147,8 +155,13 @@ If you're an existing customer looking for information about these models, see [
| code-search-babbage-code-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
| code-search-babbage-text-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |


## Retirement and deprecation history

## September 3, 2024

* Updated tables to include information on `gpt-35-turbo` default version upgrades. Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024.|

### August 22, 2024

* Updated `gpt-35-turbo` (0301) retirement date to no earlier than November 1, 2024.
Expand Down
35 changes: 15 additions & 20 deletions articles/ai-services/openai/concepts/models.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
description: Learn about the different model capabilities that are available with Azure OpenAI.
ms.service: azure-ai-openai
ms.topic: conceptual
ms.date: 08/14/2024
ms.date: 08/28/2024
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
manager: nitinme
author: mrbullwinkle #ChrisHMSFT
Expand All @@ -26,24 +26,6 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |

## Early access playground (preview)

On August 6, 2024, OpenAI [announced](https://openai.com/index/introducing-structured-outputs-in-the-api/) the latest version of their flagship GPT-4o model version `2024-08-06`. GPT-4o `2024-08-06` has all the capabilities of the previous version as well as:

* An enhanced ability to support complex structured outputs.
* Max output tokens have been increased from 4,096 to 16,384.

Azure customers can test out GPT-4o `2024-08-06` today in the new AI Studio early access playground (preview).

Unlike the previous early access playground, the AI Studio early access playground (preview) does not require you to have a resource in a specific region.

> [!NOTE]
> Prompts and completions made through the early access playground (preview) may be processed in any Azure OpenAI region, and are currently subject to a 10 request per minute per Azure subscription limit. This limit may change in the future.
>
> Azure OpenAI Service abuse monitoring is enabled for all early access playground users even if approved for modification; default content filters are enabled and cannot be modified.
To test out GPT-4o `2024-08-06`, sign-in to the Azure AI early access playground (preview) using this [link](https://aka.ms/oai/docs/earlyaccessplayground).

## GPT-4o and GPT-4 Turbo

GPT-4o integrates text and images in a single model, enabling it to handle multiple data types simultaneously. This multimodal approach enhances accuracy and responsiveness in human-computer interactions. GPT-4o matches GPT-4 Turbo in English text and coding tasks while offering superior performance in non-English languages and vision tasks, setting new benchmarks for AI capabilities.
Expand All @@ -56,6 +38,7 @@ You need to [create](../how-to/create-resource.md) or use an existing resource i

When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o models. If you are performing a programmatic deployment, the **model** names are:

- `gpt-4o` **Version** `2024-08-06`
- `gpt-4o`, **Version** `2024-05-13`
- `gpt-4o-mini` **Version** `2024-07-18`

Expand Down Expand Up @@ -83,8 +66,9 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope

| Model ID | Description | Max Request (tokens) | Training Data (up to) |
| --- | :--- |:--- |:---: |
|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> -Replaces 1106-Preview <br>- Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
Expand Down Expand Up @@ -188,6 +172,17 @@ For more information on Provisioned deployments, see our [Provisioned guidance](

### Global standard model availability

`gpt-4o` **Version:** `2024-08-06`

**Supported regions:**
- eastus
- eastus2
- northcentralus
- southcentralus
- swedencentral
- westus
- westus3

`gpt-4o` **Version:** `2024-05-13`

**Supported regions:**
Expand Down
Loading

0 comments on commit 7d4d00a

Please sign in to comment.