GreyFox – Free self-hosted AI proxy, token quotas, and local cache GreyFox Community Edition, a free self-hosted AI traffic proxy, was released by Skillful Fox Studio. The Docker-based tool allows teams to control LLM token usage, enforce per-user quotas, cache responses, and monitor AI traffic locally without a cloud control plane. It supports up to five managed users and provides an OpenAI-compatible endpoint. GreyFox Community Edition is a self-hosted AI traffic proxy and local operator console for teams that want to control LLM token usage, enforce per-user limits, reuse exact cached responses, and keep AI traffic visibility inside their own infrastructure. GreyFox runs as a local Docker box. No GreyFox-hosted control plane is required. - OpenAI-compatible proxy endpoint at /v1/chat/completions - Local Admin UI served from the same container - Per-user token quota enforcement with X-App-User-Id - Mock mode for zero-cost onboarding and demos - Provider mode for OpenAI-compatible upstream APIs - Exact response cache for repeated non-streaming requests - Local SQLite storage for settings, users, logs, cache, and metrics - Traffic history, token analytics, manual cost calculator, and safe maintenance tools - Up to 5 active managed users - Token monitoring is the authoritative usage signal - Cost estimates are manual and informational only - No hosted GreyFox cloud control plane - No automatic update checks or automatic container updates - No request detail drawer, exports, deeper diagnostics, or live traffic metrics - Docker Desktop or Docker Engine with Docker Compose - One available host port, default 8080 - A Provider API key only if you want to use live provider mode You do not need Node.js, npm, Angular, Nx, or source code to run the Community Edition release. Create a compose.yaml file: services: greyfox: image: ghcr.io/skillful-fox-studio/grey-fox-community:0.1.0 container name: greyfox-community environment: OPENAI BASE URL: ${OPENAI BASE URL:-https://api.openai.com/v1} GREYFOX DB PATH: ${GREYFOX DB PATH:-data/greyfox.db} PORT: 3000 GREYFOX STATIC ROOT: /app/public/admin-ui ports: - "${GREYFOX HTTP PORT:-8080}:3000" volumes: - greyfox-data:/app/data restart: unless-stopped volumes: greyfox-data: Start GreyFox: docker compose up -d Open the Admin UI: http://localhost:8080 Health check: curl http://localhost:8080/api/health Expected response: {"status":"ok","service":"proxy-api"} GreyFox is a proxy layer. It does not install browser extensions, intercept your personal ChatGPT usage, or automatically capture traffic from unrelated applications. Your AI application must send its provider requests to GreyFox instead of sending them directly to the upstream provider. Typical direct setup: Your application | | HTTPS request with provider API key v OpenAI-compatible provider GreyFox setup: Your application | | OpenAI-compatible request | Base URL: http://