This repository contains a collection of Python scripts that use the openai Python package to demonstrate how to call the OpenAI Chat Completions API. In increasing order of complexity, the scripts are:
- `chat.py`: A simple script that demonstrates how to use the OpenAI API to generate chat completions.
- `chat_stream.py`: Adds `stream=True` to the API call to return a generator that streams the completion as it is being generated (see the sketch after this list).
- `chat_history.py`: Adds a back-and-forth chat interface using `input()` which keeps track of past messages and sends them with each chat completion call.
- `chat_history_stream.py`: The same idea, but with `stream=True` enabled.
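As a rough sketch of the pattern these scripts follow (the model name below is an example, and the real scripts read their host and model settings from the `.env` file described later):

```python
# Minimal sketch of a chat completion plus a streamed variant, assuming
# OPENAI_API_KEY is set in the environment. The model name is an example.
from openai import OpenAI

client = OpenAI()

# Plain completion: wait for the full response, then print it.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a haiku about Python."}],
)
print(response.choices[0].message.content)

# Streamed completion: stream=True returns an iterator of chunks, so tokens
# can be printed as they arrive instead of waiting for the full answer.
stream = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a haiku about Python."}],
    stream=True,
)
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
print()
```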
Plus these scripts to demonstrate additional features:
- `chat_safety.py`: The simple script with exception handling for Azure AI Content Safety filter errors.
- `chat_async.py`: Uses the async clients to make asynchronous calls, including an example of sending off multiple requests at once using `asyncio.gather` (see the sketch after this list).
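A minimal sketch of the `asyncio.gather` pattern, assuming `OPENAI_API_KEY` is set and using an example model name:

```python
# Sketch of sending several chat completion requests concurrently with the
# async client and asyncio.gather.
import asyncio
from openai import AsyncOpenAI

async def main():
    client = AsyncOpenAI()
    prompts = ["Name a Python web framework.", "Name a Python testing library."]
    tasks = [
        client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
        )
        for prompt in prompts
    ]
    # Send all requests at once and wait for all responses.
    responses = await asyncio.gather(*tasks)
    for response in responses:
        print(response.choices[0].message.content)

asyncio.run(main())
```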
These scripts demonstrate using the Chat Completions API "tools" (a.k.a. function calling) feature, which lets the model decide when to call developer-defined functions and return structured arguments instead of (or before) a natural language answer.
In all of these examples, a list of functions is declared in the `tools` parameter. The model may respond with `message.tool_calls` containing one or more tool calls. Each tool call includes the function `name` and a JSON string of `arguments` that match the declared schema. Your application is responsible for: (1) detecting tool calls, (2) executing the corresponding local or external logic, and (3) optionally sending the tool result back to the model for a final answer.
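A rough sketch of steps (1) and (2), assuming `OPENAI_API_KEY` is set; the `lookup_weather` schema below is illustrative rather than the exact one the scripts declare:

```python
# Sketch of declaring a tool and detecting a tool call in the response.
import json
from openai import OpenAI

client = OpenAI()
tools = [
    {
        "type": "function",
        "function": {
            "name": "lookup_weather",
            "description": "Look up the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string", "description": "City name"}},
                "required": ["city"],
            },
        },
    }
]
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Is it raining in Seattle?"}],
    tools=tools,
)
message = response.choices[0].message
if message.tool_calls:
    for tool_call in message.tool_calls:
        # The arguments come back as a JSON string matching the declared schema.
        print(tool_call.function.name, json.loads(tool_call.function.arguments))
else:
    print(message.content)
```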
Scripts (in increasing order of capability):
- `function_calling_basic.py`: Declares a single `lookup_weather` function and prompts the model. It prints the tool call (if any) or falls back to the model's normal content. No actual function execution occurs.
- `function_calling_call.py`: Executes the `lookup_weather` function if the model requests it, by parsing the returned arguments JSON and calling the local Python function.
- `function_calling_extended.py`: Shows a full round-trip: after executing the function, it appends a `tool` role message containing the function result and asks the model again so it can incorporate real data into a final user-facing response (see the sketch after this list).
- `function_calling_multiple.py`: Exposes multiple functions (`lookup_weather`, `lookup_movies`) so you can see how the model chooses among them and how multiple tool calls could be returned.
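A sketch of the full round-trip described above; the `lookup_weather` schema and its stub implementation are illustrative, not the repository's exact code:

```python
# Sketch of the tool round-trip: execute the requested function locally, send
# the result back in a "tool" message, then ask the model for a final answer.
import json
from openai import OpenAI

def lookup_weather(city: str) -> dict:
    # Stand-in for a real weather lookup.
    return {"city": city, "forecast": "rainy", "temperature_c": 12}

client = OpenAI()
tools = [{
    "type": "function",
    "function": {
        "name": "lookup_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]
messages = [{"role": "user", "content": "Should I bring an umbrella in Seattle?"}]
response = client.chat.completions.create(model="gpt-4o-mini", messages=messages, tools=tools)
message = response.choices[0].message

if message.tool_calls:
    messages.append(message)  # keep the assistant message that requested the tool call
    for tool_call in message.tool_calls:
        args = json.loads(tool_call.function.arguments)
        result = lookup_weather(**args)
        messages.append({
            "role": "tool",
            "tool_call_id": tool_call.id,
            "content": json.dumps(result),
        })
    # Ask again so the model can incorporate the real data into its answer.
    final = client.chat.completions.create(model="gpt-4o-mini", messages=messages, tools=tools)
    print(final.choices[0].message.content)
else:
    print(message.content)
```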
You must use a model that supports function calling (such as the defaults `gpt-4o`, `gpt-4o-mini`, etc.). Some local or older models may not support the `tools` parameter.
These scripts demonstrate how to use the OpenAI API for Retrieval-Augmented Generation (RAG) tasks, where the model retrieves relevant information from a source and uses it to generate a response.
First install the RAG dependencies:

```
python -m pip install -r requirements-rag.txt
```
Then run the scripts (in order of increasing complexity):
- `rag_csv.py`: Retrieves matching results from a CSV file and uses them to answer the user's question.
- `rag_multiturn.py`: The same idea, but with a back-and-forth chat interface using `input()` which keeps track of past messages and sends them with each chat completion call.
- `rag_queryrewrite.py`: Adds a query rewriting step to the RAG process, where the user's question is rewritten to improve the retrieval results.
- `rag_documents_ingestion.py`: Ingests PDFs by using pymupdf to convert them to markdown, then using Langchain to split the markdown into chunks, then using OpenAI to embed the chunks, and finally storing them in a local JSON file.
- `rag_documents_flow.py`: A RAG flow that retrieves matching results from the local JSON file created by `rag_documents_ingestion.py`.
- `rag_documents_hybrid.py`: A RAG flow that implements hybrid retrieval with both vector and keyword search, merges the results with Reciprocal Rank Fusion (RRF), and re-ranks them with a cross-encoder model (see the sketch after this list).
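For reference, a minimal sketch of the Reciprocal Rank Fusion merging step; the function and the `k` constant shown here are illustrative, not necessarily the repository's exact implementation:

```python
# Sketch of Reciprocal Rank Fusion (RRF): each document's fused score is the
# sum of 1 / (k + rank) over every ranked list it appears in.
def reciprocal_rank_fusion(ranked_lists: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    scores: dict[str, float] = {}
    for ranking in ranked_lists:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Example: merge a vector-search ranking with a keyword-search ranking.
vector_results = ["doc3", "doc1", "doc2"]
keyword_results = ["doc1", "doc4", "doc3"]
print(reciprocal_rank_fusion([vector_results, keyword_results]))
```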
These scripts demonstrate how to use the OpenAI API to generate structured responses using Pydantic data models:
- `structured_outputs_basic.py`: Basic example extracting simple event information using a Pydantic model (see the sketch after this list).
- `structured_outputs_description.py`: Uses additional descriptions in Pydantic model fields to clarify to the model how to format the response.
- `structured_outputs_enum.py`: Uses enumerations (Enums) to restrict possible values in structured responses.
- `structured_outputs_function_calling.py`: Demonstrates how to use functions defined with Pydantic for automatic function calling based on user queries.
- `structured_outputs_nested.py`: Uses nested Pydantic models to handle more complex structured responses, such as events with participants having multiple attributes.
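A minimal sketch of the structured outputs pattern, assuming a recent version of the openai SDK (which provides the `parse()` helper) and an illustrative `CalendarEvent` model:

```python
# Sketch of extracting structured data into a Pydantic model.
from pydantic import BaseModel
from openai import OpenAI

class CalendarEvent(BaseModel):
    name: str
    date: str
    participants: list[str]

client = OpenAI()
completion = client.beta.chat.completions.parse(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "Extract the event information."},
        {"role": "user", "content": "Alice and Bob are going to a science fair on Friday."},
    ],
    response_format=CalendarEvent,  # the SDK converts the Pydantic model to a JSON schema
)
event = completion.choices[0].message.parsed  # a CalendarEvent instance
print(event)
```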
If you open this repository in a Dev Container or GitHub Codespaces, everything will be set up for you. If not, follow these steps:
- Set up a Python virtual environment and activate it.
- Install the required packages:

  ```
  python -m pip install -r requirements.txt
  ```
These scripts can be run against an Azure OpenAI account, OpenAI.com, a local Ollama server, or GitHub Models, depending on the environment variables you set. All the scripts read their environment variables from a `.env` file, and an example `.env.sample` file is provided. Host-specific instructions are below.
If you open this repository in GitHub Codespaces, you can run the scripts for free using GitHub Models without any additional steps, as your `GITHUB_TOKEN` is already configured in the Codespaces environment.
If you want to run the scripts locally, you need to set the `GITHUB_TOKEN` environment variable to a GitHub personal access token (PAT). You can create a PAT by following these steps:
- Go to your GitHub account settings.
- Click on "Developer settings" in the left sidebar.
- Click on "Personal access tokens" in the left sidebar.
- Click on "Tokens (classic)" or "Fine-grained tokens" depending on your preference.
- Click on "Generate new token".
- Give your token a name and select the scopes you want to grant. For this project, you don't need any specific scopes.
- Click on "Generate token".
- Copy the generated token.
- Set the `GITHUB_TOKEN` environment variable in your terminal or IDE:

  ```
  export GITHUB_TOKEN=your_personal_access_token
  ```
- Optionally, you can use a model other than "gpt-4o" by setting the `GITHUB_MODEL` environment variable. Use a model that supports function calling, such as: `gpt-4o`, `gpt-4o-mini`, `o3-mini`, `AI21-Jamba-1.5-Large`, `AI21-Jamba-1.5-Mini`, `Codestral-2501`, `Cohere-command-r`, `Ministral-3B`, `Mistral-Large-2411`, `Mistral-Nemo`, `Mistral-small`
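For reference, a rough sketch of pointing the OpenAI client at GitHub Models with these variables; the base URL below is the GitHub Models inference endpoint at the time of writing, so check the GitHub Models documentation if it has changed:

```python
# Sketch of calling GitHub Models through the OpenAI-compatible endpoint,
# authenticating with the GITHUB_TOKEN set above.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://models.inference.ai.azure.com",
    api_key=os.environ["GITHUB_TOKEN"],
)
response = client.chat.completions.create(
    model=os.environ.get("GITHUB_MODEL", "gpt-4o"),
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```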
You can run all examples in this repository using GitHub Models. If you want to run the examples using models from Azure OpenAI instead, you need to provision the Azure AI resources, which will incur costs.
This project includes infrastructure as code (IaC) to provision Azure OpenAI deployments of "gpt-4o" and "text-embedding-3-large". The IaC is defined in the `infra` directory and uses the Azure Developer CLI to provision the resources.
- Make sure the Azure Developer CLI (azd) is installed.
- Login to Azure:

  ```
  azd auth login
  ```

  For GitHub Codespaces users, if the previous command fails, try:

  ```
  azd auth login --use-device-code
  ```

- Provision the OpenAI account:

  ```
  azd provision
  ```

  It will prompt you to provide an `azd` environment name (like "agents-demos"), select a subscription from your Azure account, and select a location. Then it will provision the resources in your account.

- Once the resources are provisioned, you should now see a local `.env` file with all the environment variables needed to run the scripts.
- To delete the resources, run:

  ```
  azd down
  ```
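For reference, a rough sketch of connecting to the provisioned resource with keyless (Microsoft Entra ID) authentication, assuming the azure-identity package is installed; the `AZURE_OPENAI_ENDPOINT` and `AZURE_OPENAI_DEPLOYMENT` variable names are placeholders here, so check the generated `.env` file for the names the scripts actually use:

```python
# Sketch of a keyless Azure OpenAI client using DefaultAzureCredential.
import os
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
from openai import AzureOpenAI

token_provider = get_bearer_token_provider(
    DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default"
)
client = AzureOpenAI(
    api_version="2024-06-01",
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    azure_ad_token_provider=token_provider,
)
response = client.chat.completions.create(
    model=os.environ["AZURE_OPENAI_DEPLOYMENT"],  # deployment name, e.g. "gpt-4o"
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```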
To run the scripts against an OpenAI.com account:

- Create a `.env` file by copying the `.env.sample` file:

  ```
  cp .env.sample .env
  ```

- Update the `.env` file with your OpenAI API key and desired model name:

  ```
  API_HOST=openai
  OPENAI_API_KEY=your_openai_api_key
  OPENAI_MODEL=gpt-4o-mini
  ```
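For reference, a minimal sketch of using that configuration from a standalone script, assuming the python-dotenv package is available to load the `.env` file:

```python
# Sketch of loading the .env file and calling OpenAI.com with those settings.
import os
from dotenv import load_dotenv
from openai import OpenAI

load_dotenv()  # reads OPENAI_API_KEY and OPENAI_MODEL from the .env file
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
response = client.chat.completions.create(
    model=os.environ.get("OPENAI_MODEL", "gpt-4o-mini"),
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```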
To run the scripts against a local Ollama server:

- Install Ollama and follow the instructions to set it up on your local machine.
- Pull a model, for example:

  ```
  ollama pull llama3.1
  ```

- Create a `.env` file by copying the `.env.sample` file:

  ```
  cp .env.sample .env
  ```

- Update the `.env` file with your Ollama endpoint and model name (any model you've pulled):

  ```
  API_HOST=ollama
  OLLAMA_ENDPOINT=http://localhost:11434/v1
  OLLAMA_MODEL=llama3.1
  ```
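For reference, a rough sketch of pointing the OpenAI client at the local Ollama server, which exposes an OpenAI-compatible endpoint (the API key only needs to be a non-empty placeholder string, since Ollama does not check it):

```python
# Sketch of calling a locally running Ollama model through its
# OpenAI-compatible endpoint.
import os
from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("OLLAMA_ENDPOINT", "http://localhost:11434/v1"),
    api_key="nokeyneeded",  # placeholder; Ollama ignores the value
)
response = client.chat.completions.create(
    model=os.environ.get("OLLAMA_MODEL", "llama3.1"),
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```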