Chat + Vision using Azure OpenAI (Python)

This repository includes a Python app that uses Azure OpenAI to generate responses to user messages and uploaded images.

The project includes all the infrastructure and configuration needed to provision Azure OpenAI resources and deploy the app to Azure Container Apps using the Azure Developer CLI. By default, the app will use managed identity to authenticate with Azure OpenAI, and it will deploy a GPT-4o model with the GlobalStandard SKU.

We recommend first going through the deploying steps before running this app locally, since the local app needs credentials for Azure OpenAI to work properly.

Features
Architecture diagram
Getting started
Deploying
Development server
Costs
Security guidelines
Resources

Features

A Python Quart that uses the openai package to generate responses to user messages with uploaded image files.
A basic HTML/JS frontend that streams responses from the backend using JSON Lines over a ReadableStream.
Speech input and output buttons that use the free built-in browser APIs.
Bicep files for provisioning Azure resources, including Azure OpenAI, Azure Container Apps, Azure Container Registry, Azure Log Analytics, and RBAC roles.
Support for using GitHub models during development.

Architecture diagram

Getting started

You have a few options for getting started with this template. The quickest way to get started is GitHub Codespaces, since it will setup all the tools for you, but you can also set it up locally.

GitHub Codespaces

You can run this template virtually by using GitHub Codespaces. The button will open a web-based VS Code instance in your browser:

Open the template (this may take several minutes):
Open a terminal window
Continue with the deploying steps

VS Code Dev Containers

A related option is VS Code Dev Containers, which will open the project in your local VS Code using the Dev Containers extension:

Start Docker Desktop (install it if not already installed)
Open the project:
In the VS Code window that opens, once the project files show up (this may take several minutes), open a terminal window.
Continue with the deploying steps

Local environment

If you're not using one of the above options for opening the project, then you'll need to:

Make sure the following tools are installed:

Download the project code:

azd init -t openai-chat-vision-quickstart

Open the project folder
Create a Python virtual environment and activate it.
Install required Python packages:
```
pip install -r requirements-dev.txt
```
Install the app as an editable package:
```
python -m pip install -e src
```
Continue with the deploying steps.

Deploying

Once you've opened the project in Codespaces, in Dev Containers, or locally, you can deploy it to Azure.

Azure account setup

Sign up for a free Azure account and create an Azure Subscription.
Check that you have the necessary permissions:
- Your Azure account must have Microsoft.Authorization/roleAssignments/write permissions, such as Role Based Access Control Administrator, User Access Administrator, or Owner. If you don't have subscription-level permissions, you must be granted RBAC for an existing resource group and deploy to that existing group.
- Your Azure account also needs Microsoft.Resources/deployments/write permissions on the subscription level.

Deploying with azd

Login to Azure:
```
azd auth login
```
Provision and deploy all the resources:
```
azd up
```
It will prompt you to provide an azd environment name (like "chat-app"), select a subscription from your Azure account, and select a location where OpenAI is available (like "francecentral"). Then it will provision the resources in your account and deploy the latest code. If you get an error or timeout with deployment, changing the location can help, as there may be availability constraints for the OpenAI resource.
When azd has finished deploying, you'll see an endpoint URI in the command output. Visit that URI, and you should see the chat app! 🎉
When you've made any changes to the app code, you can just run:
```
azd deploy
```

Continuous deployment with GitHub Actions

This project includes a Github workflow for deploying the resources to Azure on every push to main. That workflow requires several Azure-related authentication secrets to be stored as Github action secrets. To set that up, run:

azd pipeline config

Development server

In order to run this app, you need to either have an Azure OpenAI account deployed (from the deploying steps) or use a model from GitHub models.

If you already deployed the app using azd up, then a .env file was created with the necessary environment variables, and you can skip to step 3.
To use the app with GitHub models, either copy .env.sample into a .env file or start from the created .env file. Change OPENAI_HOST to "github" in the .env file.

You'll need a GITHUB_TOKEN environment variable that stores a GitHub personal access token. If you're running this inside a GitHub Codespace, the token will be automatically available. If not, generate a new personal access token and run this command to set the GITHUB_TOKEN environment variable:
```
export GITHUB_TOKEN="<your-github-token-goes-here>"
```
Start the development server:
```
python -m quart --app src.quartapp run --port 50505 --reload
```
This will start the app on port 50505, and you can access it at http://localhost:50505.

Guidance

Costs

Pricing varies per region and usage, so it isn't possible to predict exact costs for your usage. The majority of the Azure resources used in this infrastructure are on usage-based pricing tiers. However, Azure Container Registry has a fixed cost per registry per day.

You can try the Azure pricing calculator for the resources:

Azure OpenAI Service: S0 tier, GPT-4o model. Pricing is based on token count. Pricing
Azure Container App: Consumption tier with 0.5 CPU, 1GiB memory/storage. Pricing is based on resource allocation, and each month allows for a certain amount of free usage. Pricing
Azure Container Registry: Basic tier. Pricing
Log analytics: Pay-as-you-go tier. Costs based on data ingested. Pricing

⚠️ To avoid unnecessary costs, remember to take down your app if it's no longer in use, either by deleting the resource group in the Portal or running azd down.

Security guidelines

This template uses Managed Identity for authenticating to the Azure OpenAI service.

Additionally, we have added a GitHub Action that scans the infrastructure-as-code files and generates a report containing any detected issues. To ensure continued best practices in your own repository, we recommend that anyone creating solutions based on our templates ensure that the Github secret scanning setting is enabled.

You may want to consider additional security measures, such as:

Protecting the Azure Container Apps instance with a firewall and/or Virtual Network.

Resources

About this app:

Get started with multimodal vision chat apps using Azure OpenAI: The Microsoft Learn Quickstart article for this sample, walks through both deployment and the relevant code for working with images in chat.
Video: Using vision models with Python: A live stream recording that steps through the Python notebook and app code.
Blog post: Add speech input/output to your app: Explains the speech buttons used in this app.

Related samples and docs:

OpenAI Chat Application Quickstart: Similar to this project, but without the vision and image uploads.
OpenAI Chat Application with Microsoft Entra Authentication - MSAL SDK: Similar to this project, but adds user authentication with Microsoft Entra using the Microsoft Graph SDK and built-in authentication feature of Azure Container Apps.
OpenAI Chat Application with Microsoft Entra Authentication - Built-in Auth: Similar to this project, but adds user authentication with Microsoft Entra using the Microsoft Graph SDK and MSAL SDK.
RAG chat with Azure AI Search + Python: A more advanced chat app that uses Azure AI Search to ground responses in domain knowledge. Includes user authentication with Microsoft Entra as well as data access controls.
Develop Python apps that use Azure AI services

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
docs		docs
infra		infra
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.env.sample		.env.sample
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE.md		LICENSE.md
README.md		README.md
SECURITY.md		SECURITY.md
azure.yaml		azure.yaml
docker-compose.yaml		docker-compose.yaml
pyproject.toml		pyproject.toml
readme_diagram.png		readme_diagram.png
requirements-dev.txt		requirements-dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chat + Vision using Azure OpenAI (Python)

Features

Architecture diagram

Getting started

GitHub Codespaces

VS Code Dev Containers

Local environment

Deploying

Azure account setup

Deploying with azd

Continuous deployment with GitHub Actions

Development server

Guidance

Costs

Security guidelines

Resources

About

Releases

Contributors 4

Languages

License

Azure-Samples/openai-chat-vision-quickstart

Folders and files

Latest commit

History

Repository files navigation

Chat + Vision using Azure OpenAI (Python)

Features

Architecture diagram

Getting started

GitHub Codespaces

VS Code Dev Containers

Local environment

Deploying

Azure account setup

Deploying with azd

Continuous deployment with GitHub Actions

Development server

Guidance

Costs

Security guidelines

Resources

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Contributors 4

Languages