πŸ€– OpenAI-Compatible API Mimic

Badges: Python Β· FastAPI Β· OpenAI Compatible Β· License: MIT Β· Docker Β· GitHub stars

Drop-in replacement for the OpenAI API that works with any backend

A modular proxy service that presents the OpenAI API interface in front of any compatible backend, transforming requests and responses so that applications written against the OpenAI API work unchanged. Well suited to AI teams running custom LLM deployments, private models, or alternative providers.

🌟 Features

  • Complete OpenAI API Compatibility: Compatible with OpenAI's API version 2024-10-21 and later, including support for:

    • βœ… Chat completions with streaming
    • βœ… Function/tool calling with structured outputs
    • βœ… JSON mode responses
    • βœ… Multimodal content (vision models)
    • βœ… Text-to-speech and speech-to-text
    • βœ… Image generation (DALL-E)
    • βœ… Support for the latest OpenAI models (GPT-4o, GPT-4 Turbo, etc.)
    • βœ… Third-generation embeddings models
    • βœ… Full model discovery API
  • Seamless Framework Integration: Works with any client that supports the OpenAI API:

    • βœ… Official OpenAI SDKs (Python, Node.js, etc.)
    • βœ… LangChain & LangChain.js
    • βœ… LlamaIndex
    • βœ… Semantic Kernel
    • βœ… Any other OpenAI-compatible framework
  • Enterprise-Ready:

    • βœ… Modular, maintainable code architecture
    • βœ… Comprehensive logging
    • βœ… Error handling with automatic token refresh
    • βœ… Easy configuration through environment variables
    • βœ… Docker containerization for simple deployment
    • βœ… Built on FastAPI for high performance

πŸ“‹ Table of Contents

  • 🌟 Features
  • πŸš€ Getting Started
  • πŸ“‚ Project Structure
  • πŸ“š API Documentation
  • 🧩 Usage Examples
  • πŸ—ΊοΈ Roadmap
  • ❓ FAQ
  • 🀝 Contributing
  • πŸ“„ License

πŸš€ Getting Started

Prerequisites

  • Python 3.10 or higher
  • Docker (optional, for containerized deployment)

Configuration

Create a .env file in the root directory with the following variables:

TOKEN_URL=your_url_to_get_token
CHAT_API_URL=your_url_of_base_api
EMBEDDING_API_URL=your_url_of_base_api
AUTH_CODE=your_authorization_code
VERIFY_SSL=False
PORT=8000
LOG_LEVEL=INFO
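
The repository reads these values in src/config/settings.py. As a rough sketch of how such settings might be loaded with pydantic-settings (the field names are assumed from the variables above; the actual implementation may differ):

from pydantic_settings import BaseSettings, SettingsConfigDict

class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env")

    # Field names mirror the .env variables above (hypothetical sketch);
    # pydantic-settings matches environment variables case-insensitively.
    token_url: str = ""
    chat_api_url: str = ""
    embedding_api_url: str = ""
    auth_code: str = ""
    verify_ssl: bool = False
    port: int = 8000
    log_level: str = "INFO"

settings = Settings()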

Installation

  1. Clone the repository:

    git clone https://github.com/yctimlin/OpenAI-Compatible-API-Mimic.git
    cd OpenAI-Compatible-API-Mimic
  2. Install dependencies:

    pip install -r requirements.txt

    Or use our setup script:

    ./setup.sh

Running Locally

Start the API server:

python main.py

The API will be available at http://localhost:8000 with interactive documentation at http://localhost:8000/docs.
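
As a quick smoke test, you can hit the health check endpoint from Python (a sketch using the requests library, which you may need to install separately):

import requests

# GET / is the health check endpoint (see API Endpoints below)
resp = requests.get("http://localhost:8000/")
print(resp.status_code, resp.text)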

Docker Deployment

  1. Build the Docker image:

    docker build -t openai-api-mimic .
  2. Run the container:

    docker run -p 8000:8000 --env-file .env openai-api-mimic

πŸ“‚ Project Structure

openai-api-mimic/
β”œβ”€β”€ src/                    # Source code directory
β”‚   β”œβ”€β”€ api/                # API route definitions
β”‚   β”‚   β”œβ”€β”€ chat.py         # Chat completions endpoint
β”‚   β”‚   β”œβ”€β”€ embeddings.py   # Embeddings endpoint
β”‚   β”‚   └── models.py       # Models listing endpoint
β”‚   β”œβ”€β”€ config/             # Configuration management
β”‚   β”‚   └── settings.py     # Application settings
β”‚   β”œβ”€β”€ models/             # Pydantic schema definitions
β”‚   β”‚   └── schema.py       # Request/response schemas
β”‚   β”œβ”€β”€ utils/              # Utility functions
β”‚   β”‚   β”œβ”€β”€ api.py          # API interaction utilities
β”‚   β”‚   └── models.py       # Model handling utilities
β”‚   β”œβ”€β”€ app.py              # FastAPI application instance
β”‚   └── __init__.py         # Package initialization
β”œβ”€β”€ main.py                 # Application entry point
β”œβ”€β”€ requirements.txt        # Python dependencies
β”œβ”€β”€ Dockerfile              # Container definition
β”œβ”€β”€ .env.example            # Example environment variables
└── README.md               # Project documentation

πŸ“š API Documentation

Once the server is running, you can access the interactive API documentation at:

  • Swagger UI: http://localhost:8000/docs
  • ReDoc: http://localhost:8000/redoc

API Endpoints

  • GET /
    Health check endpoint

  • GET /v1/models
    List all available models

  • GET /v1/models/{model_id}
    Get information about a specific model

  • POST /v1/chat/completions
    Create a chat completion

  • POST /v1/embeddings
    Create embeddings from input text
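
Because these routes follow OpenAI's wire format, you can exercise them with plain HTTP as well as with an SDK. A minimal sketch using requests (the model name and key are placeholders):

import requests

BASE = "http://localhost:8000/v1"
HEADERS = {"Authorization": "Bearer dummy-key"}

# List the available models
models = requests.get(f"{BASE}/models", headers=HEADERS).json()
print([m["id"] for m in models["data"]])

# Create a chat completion from a raw JSON payload
payload = {
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Say hello."}],
}
resp = requests.post(f"{BASE}/chat/completions", headers=HEADERS, json=payload)
print(resp.json()["choices"][0]["message"]["content"])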

🧩 Usage Examples

Python with OpenAI SDK

import openai

client = openai.OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy-key"  # API key is not validated but required by the SDK
)

# Chat completion
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"}
    ],
    temperature=0.7
)

print(response.choices[0].message.content)

# Streaming example
for chunk in client.chat.completions.create(
    model="gpt-4-turbo",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Write a short poem about AI."}
    ],
    stream=True
):
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

# Embeddings
embedding_response = client.embeddings.create(
    model="text-embedding-3-small",
    input="The quick brown fox jumps over the lazy dog"
)

print(f"Embedding dimension: {len(embedding_response.data[0].embedding)}")

LangChain Integration

from langchain_openai import ChatOpenAI  # requires the langchain-openai package
from langchain_core.messages import HumanMessage, SystemMessage

llm = ChatOpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy-key",
    model="gpt-4o"
)

messages = [
    SystemMessage(content="You are a helpful assistant."),
    HumanMessage(content="Explain how APIs work in simple terms.")
]

response = llm.invoke(messages)
print(response.content)
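
The features list also mentions LlamaIndex. One way to point it at this proxy is the OpenAILike wrapper from the llama-index-llms-openai-like package (a sketch, not tested against this project):

from llama_index.llms.openai_like import OpenAILike

llm = OpenAILike(
    model="gpt-4o",
    api_base="http://localhost:8000/v1",
    api_key="dummy-key",
    is_chat_model=True,  # send requests to /chat/completions
)

print(llm.complete("Explain embeddings in one sentence.").text)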

πŸ‘₯ Community & Support

πŸ—ΊοΈ Roadmap

  • Support for Azure OpenAI API compatibility
  • Additional logging and monitoring options
  • Helm chart for Kubernetes deployment
  • Authentication and rate limiting
  • Request/response validation middleware
  • Support for DALL-E 3 and other image generation endpoints

❓ FAQ

How does this differ from the OpenAI API?

This project creates a compatibility layer between your API backend and OpenAI's API format. It allows you to maintain your own backend implementation while providing a standard OpenAI-compatible interface for client applications.

Can I use this with any LLM provider?

Yes, as long as your backend can fulfill the API's requirements, which typically means providing chat completions and embeddings. You'll need to implement the appropriate transformations in the backend utilities.
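
As an illustration only, such a transformation might map a hypothetical backend reply onto the OpenAI chat completion shape roughly like this (the backend field names here are invented):

import time
import uuid

def to_openai_chat_response(backend_reply: dict, model: str) -> dict:
    # backend_reply is a hypothetical shape, e.g. {"answer": "...", "tokens": 42}
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": model,
        "choices": [{
            "index": 0,
            "message": {"role": "assistant", "content": backend_reply["answer"]},
            "finish_reason": "stop",
        }],
        "usage": {
            "prompt_tokens": 0,  # fill in if the backend reports usage
            "completion_tokens": backend_reply.get("tokens", 0),
            "total_tokens": backend_reply.get("tokens", 0),
        },
    }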

Is this suitable for production use?

Yes, the architecture follows best practices for production environments, including proper error handling, logging, and configuration. However, you should implement appropriate authentication and rate limiting for your specific use case.
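
For example, a minimal API key check can be added as a FastAPI dependency (a sketch, not part of this repository; VALID_KEYS is a placeholder):

from fastapi import Depends, FastAPI, Header, HTTPException

VALID_KEYS = {"my-secret-key"}  # placeholder; load from configuration in practice

async def require_api_key(authorization: str = Header(default="")):
    # Expect the OpenAI-style "Authorization: Bearer <key>" header
    token = authorization.removeprefix("Bearer ").strip()
    if token not in VALID_KEYS:
        raise HTTPException(status_code=401, detail="Invalid API key")

app = FastAPI(dependencies=[Depends(require_api_key)])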

🀝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch: git checkout -b feature/amazing-feature
  3. Commit your changes: git commit -m 'Add some amazing feature'
  4. Push to the branch: git push origin feature/amazing-feature
  5. Open a Pull Request

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

If you find this project helpful, please consider giving it a star ⭐

Built with ❀️ for the AI developer community
