Skip to content

Commit

Permalink
Merge pull request #252 from MicrosoftDocs/main
Browse files Browse the repository at this point in the history
9/12/2024 AM Publish
  • Loading branch information
Taojunshen authored Sep 12, 2024
2 parents d6a900d + dc90186 commit 162d4f1
Show file tree
Hide file tree
Showing 58 changed files with 206 additions and 146 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -2,15 +2,16 @@
title: Audio processing - Speech service
titleSuffix: Azure AI services
description: An overview of audio processing and capabilities of the Microsoft Audio Stack.
author: hasyashah
manager: nitinme
ms.service: azure-ai-speech
ms.topic: overview
ms.date: 8/20/2024
ms.author: hasshah
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: hasshah
---

# Audio processing
# Audio processing with the Microsoft Audio Stack

The Microsoft Audio Stack is a set of enhancements optimized for speech processing scenarios. This includes examples like keyword recognition and speech recognition. It consists of various enhancements/components that operate on the input audio signal:

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,13 +2,13 @@
title: Use the Microsoft Audio Stack (MAS) - Speech service
titleSuffix: Azure AI services
description: An overview of the features, capabilities, and restrictions for audio processing using the Speech Software Development Kit (SDK).
author: hasyashah
manager: nitinme
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 8/20/2024
ms.author: hasshah
ms.devlang: cpp
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: hasshah
ms.custom: devx-track-csharp
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ author: eric-urban
manager: nitinme
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 1/18/2024
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
14 changes: 6 additions & 8 deletions articles/ai-services/speech-service/batch-synthesis.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ author: eric-urban
manager: nitinme
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 1/18/2024
ms.date: 9/12/2024
ms.author: eur
---

Expand Down Expand Up @@ -35,8 +35,6 @@ You can use the following REST API operations for batch synthesis:
| [List batch synthesis](#list-batch-synthesis) | `GET` | texttospeech/batchsyntheses |
| [Delete batch synthesis](#delete-batch-synthesis) | `DELETE` | texttospeech/batchsyntheses/YourSynthesisId |

<!-- | [Get operation for status monitor](#get-operation) | `GET` | texttospeech/operations/YourOperationId | -->

For code samples, see [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch-synthesis).

## Create batch synthesis
Expand All @@ -48,9 +46,9 @@ To submit a batch synthesis request, construct the HTTP PUT request path and bod
- Optionally you can set the `description`, `timeToLiveInHours`, and other properties. For more information, see [batch synthesis properties](batch-synthesis-properties.md).

> [!NOTE]
> The maximum JSON payload size that will be accepted is 2 megabytes.
> The maximum JSON payload size that's accepted is 2 megabytes.
Set the required `YourSynthesisId` in path. The `YourSynthesisId` have to be unique. It must be 3-64 long, contains only numbers, letters, hyphens, underscores and dots, starts and ends with a letter or number.
Set the required `YourSynthesisId` in path. The `YourSynthesisId` must be unique. It must be 3-64 long, contains only numbers, letters, hyphens, underscores and dots, starts and ends with a letter or number.

Make an HTTP PUT request using the URI as shown in the following example. Replace `YourSpeechKey` with your Speech resource key, replace `YourSpeechRegion` with your Speech resource region, and set the request body properties as previously described.

Expand Down Expand Up @@ -229,9 +227,9 @@ The response headers include `HTTP/1.1 204 No Content` if the delete request was

## Batch synthesis results

After you [get a batch synthesis job](#get-batch-synthesis) with `status` of "Succeeded", you can download the audio output results. Use the URL from the `outputs.result` property of the [get batch synthesis](#get-batch-synthesis) response.
After you [get a batch synthesis job](#get-batch-synthesis) with `status` of "Succeeded", you can download the audio output results. Use the URL from the `outputs.result` property of the [batch synthesis GET](#get-batch-synthesis) response.

To get the batch synthesis results file, make an HTTP GET request using the URI as shown in the following example. Replace `YourOutputsResultUrl` with the URL from the `outputs.result` property of the [get batch synthesis](#get-batch-synthesis) response. Replace `YourSpeechKey` with your Speech resource key.
To get the batch synthesis results file, make an HTTP GET request using the URI as shown in the following example. Replace `YourOutputsResultUrl` with the URL from the `outputs.result` property of the [batch synthesis GET](#get-batch-synthesis) response. Replace `YourSpeechKey` with your Speech resource key.

```azurecli-interactive
curl -v -X GET "YourOutputsResultUrl" -H "Ocp-Apim-Subscription-Key: YourSpeechKey" > results.zip
Expand Down Expand Up @@ -331,7 +329,7 @@ HTTP 200 OK indicates that the request was successful.

### HTTP 201 Created

HTTP 201 Created indicates that the create batch synthesis request (via HTTP PUT) was successful.
HTTP 201 Created indicates that the batch synthesis create request (via HTTP PUT) was successful.

### HTTP 204 error

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,9 +7,10 @@ author: eric-urban
ms.author: eur
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 7/16/2024
ms.date: 9/12/2024
ms.devlang: csharp
ms.custom: devx-track-csharp, devx-track-azurecli
# Customer intent: As a user who implements audio transcription, I want to learn how to locate audio files for batch transcription.
---

# Locate audio files for batch transcription
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ author: eric-urban
ms.author: eur
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 8/14/2024
ms.date: 9/12/2024
zone_pivot_groups: speech-cli-rest
ms.custom: devx-track-csharp
# Customer intent: As a user who implements audio transcription, I want create transcriptions in bulk so that I don't have to submit audio content repeatedly.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ author: eric-urban
ms.author: eur
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 7/16/2024
ms.date: 9/12/2024
zone_pivot_groups: speech-cli-rest
ms.custom: devx-track-csharp
---
Expand Down
2 changes: 1 addition & 1 deletion articles/ai-services/speech-service/batch-transcription.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ author: eric-urban
ms.author: eur
ms.service: azure-ai-speech
ms.topic: overview
ms.date: 7/16/2024
ms.date: 9/12/2024
ms.devlang: csharp
ms.custom: devx-track-csharp
---
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,13 @@
title: Use Bring your own storage (BYOS) Speech resource for Speech to text
titleSuffix: Azure AI services
description: Learn how to use Bring your own storage (BYOS) Speech resource with Speech to text.
author: alexeyo26
manager: nitinme
ms.service: azure-ai-speech
ms.topic: how-to
ms.date: 4/15/2024
ms.author: alexeyo
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: alexeyo
---

# Use the Bring your own storage (BYOS) Speech resource for speech to text
Expand All @@ -18,9 +19,9 @@ Bring your own storage (BYOS) can be used in the following speech to text scenar
- Real-time transcription with audio and transcription results logging enabled
- Custom speech

One Speech resource to storage account pairing can be used for all scenarios simultaneously.
One pair of a Speech resource and storage account can be used for all scenarios simultaneously.

This article explains in depth how to use a BYOS-enabled Speech resource in all speech to text scenarios. The article implies, that you have [a fully configured BYOS-enabled Speech resource and associated Storage account](bring-your-own-storage-speech-resource.md).
This article explains in depth how to use a BYOS-enabled Speech resource in all speech to text scenarios. The article implies that you have [a fully configured BYOS-enabled Speech resource and associated Storage account](bring-your-own-storage-speech-resource.md).

## Data storage

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,13 +2,14 @@
title: Set up the Bring your own storage (BYOS) Speech resource
titleSuffix: Azure AI services
description: Learn how to set up Bring your own storage (BYOS) Speech resource.
author: alexeyo26
manager: nitinme
ms.service: azure-ai-speech
ms.custom: devx-track-azurepowershell
ms.topic: how-to
ms.date: 1/18/2024
ms.author: alexeyo
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: alexeyo
---

# Set up the Bring your own storage (BYOS) Speech resource
Expand Down
9 changes: 5 additions & 4 deletions articles/ai-services/speech-service/call-center-overview.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,15 @@
title: Azure AI services for Call Center Overview
titleSuffix: Azure AI services
description: Azure AI services for Language and Speech can help you realize partial or full automation of telephony-based customer interactions, and provide accessibility across multiple channels.
author: goergenj
ms.author: jagoerge
ms.service: azure-ai-speech
ms.topic: overview
ms.date: 1/18/2024
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: jagoerge
---

# Call Center Overview
# Call center overview

Azure AI Language and Azure AI Speech can help you realize partial or full automation of telephony-based customer interactions, and provide accessibility across multiple channels. With the Language and Speech services, you can further analyze call center transcriptions, extract and redact conversation (PII), summarize the transcription, and detect the sentiment.

Expand Down
5 changes: 3 additions & 2 deletions articles/ai-services/speech-service/call-center-quickstart.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,13 @@
title: "Post-call transcription and analytics quickstart - Speech service"
titleSuffix: Azure AI services
description: In this quickstart, you perform sentiment analysis and conversation summarization of call center transcriptions.
author: eric-urban
manager: nitinme
ms.service: azure-ai-speech
ms.topic: quickstart
ms.date: 1/18/2024
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: jagoerge
ms.devlang: csharp
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,27 +2,28 @@
title: Call Center Telephony Integration - Speech service
titleSuffix: Azure AI services
description: A common scenario for speech to text is transcribing large volumes of telephony data that come from various systems, such as interactive voice response (IVR) in real-time. This requires an integration with the Telephony System used.
author: goergenj
ms.author: jagoerge
ms.service: azure-ai-speech
ms.topic: conceptual
ms.date: 1/18/2024
ms.date: 9/12/2024
author: eric-urban
ms.author: eur
ms.reviewer: jagoerge
ms.custom: template-concept
---

# Telephony Integration
# Telephony integration

To support real-time scenarios, like Virtual Agent and Agent Assist in Call Centers, an integration with the Call Centers telephony system is required.
To support real-time scenarios, like Virtual Agent and Agent Assist in call centers, an integration with the call center's telephony system is required.

Typically, integration with the Speech service is handled by a telephony client connected to the customers SIP/RTP processor, for example, to a Session Border Controller (SBC).

Usually the telephony client handles the incoming audio stream from the SIP/RTP processor, the conversion to PCM and connects the streams using continuous recognition. It also triages the processing of the results, for example, analysis of speech transcripts for Agent Assist or connect with a dialog processing engine (for example, Azure Botframework or Power Virtual Agent) for Virtual Agent.
Usually the telephony client handles the incoming audio stream from the SIP/RTP processor, the conversion to PCM and connects the streams using continuous recognition. It also triages the processing of the results. For example, analysis of speech transcripts for Agent Assist or connect with a dialog processing engine (for example, Azure Botframework or Power Virtual Agent) for Virtual Agent.

For easier integration the Speech service also supports ALAW in WAV container and MULAW in WAV container for audio streaming. To build this integration, we recommend using the [Speech SDK](./speech-sdk.md).
For easier integration the Speech service also supports "ALAW in WAV container" and "MULAW in WAV container" for audio streaming. To build this integration, we recommend using the [Speech SDK](./speech-sdk.md).

## Azure Communication Services

[Azure Communication Services](/azure/communication-services/overview) call automation APIs provide telephony integration. real-time event triggers to perform actions based on custom business logic specific to their domain. Within the Call Automation APIs developers can use simple AI powered APIs, which can be used to play personalized greeting messages, recognize conversational voice inputs to gather information on contextual questions to drive a more self-service model with customers, use sentiment analysis to improve customer service overall. These content specific APIs are orchestrated through Azure AI services with support for customization of AI models without developers needing to terminate media streams on their services and streaming back to Azure for AI functionality. For more information, see [Azure Communication Services](/azure/communication-services/concepts/call-automation/azure-communication-services-azure-cognitive-services-integration?context=/azure/ai-services/speech-service/context/context).
[Azure Communication Services](/azure/communication-services/overview) calls automation APIs provide telephony integration. real-time event triggers to perform actions based on custom business logic specific to their domain. Within the call automation APIs developers can use simple AI powered APIs, which can be used to play personalized greeting messages, recognize conversational voice inputs to gather information on contextual questions to drive a more self-service model with customers, use sentiment analysis to improve customer service overall. These content specific APIs are orchestrated through Azure AI services with support for customization of AI models without developers needing to terminate media streams on their services and streaming back to Azure for AI functionality. For more information, see [Azure Communication Services](/azure/communication-services/concepts/call-automation/azure-communication-services-azure-cognitive-services-integration?context=/azure/ai-services/speech-service/context/context).

## Next steps

Expand Down
2 changes: 1 addition & 1 deletion articles/ai-services/speech-service/captioning-concepts.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ manager: nitinme
ms.service: azure-ai-speech
ms.custom: devx-track-extended-java, devx-track-go, devx-track-js, devx-track-python
ms.topic: conceptual
ms.date: 2/16/2024
ms.date: 9/12/2024
ms.author: eur
zone_pivot_groups: programming-languages-speech-sdk-cli
---
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ author: eric-urban
manager: nitinme
ms.service: azure-ai-speech
ms.topic: quickstart
ms.date: 2/16/2024
ms.date: 9/12/2024
ms.author: eur
ms.devlang: cpp
ms.custom: devx-track-extended-java, devx-track-go, devx-track-js, devx-track-python
Expand Down
2 changes: 1 addition & 1 deletion articles/ai-services/speech-service/custom-commands.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ author: trrwilson
manager: nitinme
ms.service: azure-ai-speech
ms.topic: overview
ms.date: 1/18/2024
ms.date: 9/12/2024
ms.author: travisw
ms.custom: cogserv-non-critical-speech
---
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ author: eric-urban
manager: nitinme
ms.service: azure-ai-speech
ms.topic: quickstart
ms.date: 1/18/2024
ms.date: 9/12/2024
ms.author: eur
ms.devlang: csharp
ms.custom: devx-track-csharp, devx-track-extended-java, devx-track-go, devx-track-js, devx-track-python
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 02/12/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 03/13/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 03/13/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 10/03/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 10/03/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 02/12/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 03/13/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 02/12/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 02/12/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 02/12/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
author: eric-urban
ms.service: azure-ai-speech
ms.topic: include
ms.date: 03/15/2022
ms.date: 9/12/2024
ms.author: eur
---

Expand Down
Loading

0 comments on commit 162d4f1

Please sign in to comment.