Docker Nginx OpenAI API Cache Reverse proxy

This project is a simple Docker Nginx project that serves as a cache for the OpenAI API.

nginx here is preconfigured to work on OpenAI API.

Features:

Works with any client that allows you to configure the server address (as it acts as a reverse proxy)
Caches the response of the support endpoints. The key of cache is built from the request uri and body
Returns an "X-Cache-Status" header indicating whether the response was served from cache or not

Supported endpoints:

POST /v1/chat/completions
POST /v1/completions
POST /v1/edits
POST /v1/embeddings
POST /v1/moderations
POST /v1/answers

endpoints deprecated by OpenAI :

POST /v1/engines/*/chat/completions
POST /v1/engines/*/completions
POST /v1/engines/*/edits
POST /v1/engines/*/embeddings
POST /v1/engines/*/moderations
POST /v1/engines/*/answers

Getting Started

Prerequisites

Docker
Docker compose

Installation

Clone the repository:

git clone https://github.com/larrylin17/docker-nginx-openai-api-cache.git
cd docker-nginx-openai-api-cache

Generate SSL certificate and key

mkdir conf/ssl

sudo openssl req -x509 -nodes -days 365 -newkey rsa:2048 -keyout conf/ssl/nginx-selfsigned.key -out conf/ssl/nginx-selfsigned.crt

Start the container:

docker-compose up -d

or

docker run -d -p 80:80 -p 443:443 -v ./conf:/etc/nginx -v ./server_cache:/server_cache:rw --name nginx-openai-api --restart always nginx:latest

Test the server

Set your credentials:

OPENAI_API_KEY="...."

Run this 2 times or more:

curl -s -o /dev/null -w "%{http_code}" http://localhost:81/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
  "model": "gpt-3.5-turbo",
  "messages": [
    {
      "role": "user",
      "content": "Hello there !"
    }
  ],
  "temperature": 0,
  "max_tokens": 228,
  "top_p": 1,
  "frequency_penalty": 0,
  "presence_penalty": 0
}'

Check the logs

docker-compose logs

The last lines should show something like this

openai-cache-proxy  | 172.28.0.1 - - [29/Feb/2024:19:59:49 +0000] "POST /v1/chat/completions HTTP/1.1" 200 494 "-" "curl/7.80.0" Cache: MISS
openai-cache-proxy  | 172.28.0.1 - - [29/Feb/2024:19:59:52 +0000] "POST /v1/chat/completions HTTP/1.1" 200 494 "-" "curl/7.80.0" Cache: HIT

Stop the container:

docker-compose down

Usage

Set your client's API server address to http://localhost:81/v1 Once the containers are running, you can use the OpenAI API through the cache by sending requests to the supported URIs.

URIs that are supported will be forwarded, unless they are cached. URIs that are not supported will be forwarded normally.

Configuration

The cache is configured using the nginx.conf. You can modify this file to change the cache settings or add additional URIs.

Contributing

Contributions are welcome! Please submit a pull request or open an issue if you encounter any problems or have suggestions for improvements.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
conf		conf
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Docker Nginx OpenAI API Cache Reverse proxy

Features:

Supported endpoints:

Getting Started

Prerequisites

Installation

Usage

Configuration

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

larrylin17/docker-nginx-openai-api-cache

Folders and files

Latest commit

History

Repository files navigation

Docker Nginx OpenAI API Cache Reverse proxy

Features:

Supported endpoints:

Getting Started

Prerequisites

Installation

Usage

Configuration

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages