A proxy for Ollama to easily disable thinking output for Home Assistant integrations.
- Proxies `/api/chat` requests to the Ollama server, automatically setting `think=false` to disable thinking output.
- Streams responses back to the client.
- Supports fetching tags via `/api/tags`.
- Simple health check endpoint at `/`.
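Concretely, the forwarding logic fits in a single FastAPI module. The following is a minimal sketch of the behavior described above, not the project's actual code: it assumes Ollama listens on its default address (`localhost:11434`), and uses the `proxy:app` module/app names from the `uvicorn` command below.

```python
# proxy.py - a minimal sketch, not the project's actual code.
import httpx
from fastapi import FastAPI, Request
from fastapi.responses import StreamingResponse

OLLAMA_HOST = "http://localhost:11434"  # assumed Ollama default; adjust to your setup

app = FastAPI()


@app.post("/api/chat")
async def chat(request: Request):
    payload = await request.json()
    payload["think"] = False  # force-disable thinking output

    async def stream():
        # Forward the request and relay Ollama's NDJSON stream chunk by chunk.
        async with httpx.AsyncClient(timeout=None) as client:
            async with client.stream(
                "POST", f"{OLLAMA_HOST}/api/chat", json=payload
            ) as resp:
                async for chunk in resp.aiter_bytes():
                    yield chunk

    return StreamingResponse(stream(), media_type="application/x-ndjson")


@app.get("/api/tags")
async def tags():
    # Pass the tag list through unchanged.
    async with httpx.AsyncClient() as client:
        resp = await client.get(f"{OLLAMA_HOST}/api/tags")
        return resp.json()


@app.get("/")
async def health():
    return {"status": "ok"}
```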
- Python 3.9+
- FastAPI, httpx, uvicorn
- Install dependencies:

  ```bash
  pip install fastapi httpx uvicorn
  ```

- Configure the Ollama host URL in your code (`OLLAMA_HOST`).

- Run the proxy server:

  ```bash
  uvicorn proxy:app --host 0.0.0.0 --port 11435
  ```
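Once the server is up, a quick hypothetical smoke test (assuming the host and port from the command above) is to hit the health and tags endpoints:

```python
import httpx

BASE = "http://localhost:11435"  # host/port from the uvicorn command above

print(httpx.get(f"{BASE}/").json())          # health check
print(httpx.get(f"{BASE}/api/tags").json())  # models known to the Ollama server
```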
- `POST /api/chat`: Forward chat requests to Ollama with thinking output disabled.
- `GET /api/tags`: Retrieve available tags.
- `GET /`: Health check.
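As an illustration of the chat endpoint, the hypothetical client below streams a reply through the proxy; the model name is a placeholder for whatever is pulled on your Ollama server. Because the proxy overrides `think` before forwarding, the reply contains no thinking output regardless of what the client sends:

```python
import json

import httpx

payload = {
    "model": "qwen3",  # placeholder; use any model available on your Ollama server
    "messages": [{"role": "user", "content": "Is anyone home?"}],
}

# Stream the NDJSON reply through the proxy and print the assistant text.
with httpx.stream(
    "POST", "http://localhost:11435/api/chat", json=payload, timeout=None
) as resp:
    for line in resp.iter_lines():
        if not line:
            continue
        chunk = json.loads(line)
        print(chunk.get("message", {}).get("content", ""), end="", flush=True)
```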