Stop Flying Blind with Coding Agents: Inspect Claude Code and Codex Requests with ccglass A developer built ccglass, an open-source local observability tool that captures and displays AI coding agent requests and responses. The tool runs as a local proxy, allowing users to inspect model traffic from agents like Claude Code and Codex without TLS decryption or client modification. It provides a web dashboard showing tool calls, token usage, and latency to help debug agent behavior. AI coding agents are getting good enough that they no longer feel like autocomplete. Tools like Claude Code, Codex, OpenCode, Cursor, Cline, and other agentic coding systems can read files, modify code, run commands, call tools, inspect errors, and continue working across multiple turns. That is useful. It is also increasingly opaque. When you ask an agent to fix a bug, you usually see the final answer and maybe a few tool calls in the terminal. But you often do not know: For small experiments, guessing is fine. For real work, guessing is a bad debugging strategy. That is why I built ccglass . GitHub: https://github.com/jianshuo/ccglass https://github.com/jianshuo/ccglass ccglass is a local observability tool for AI coding agents. It runs a local proxy, captures model requests and responses, and shows them in a web dashboard. The goal is simple: make it easy to see what tools like Claude Code, Codex, DeepSeek-TUI, Kimi, OpenCode, Ollama, OpenRouter, and other agent clients are actually sending to the model. ccglass can show: It is not another coding agent. It is a visibility layer for the agents you already use. General-purpose tools like Charles, mitmproxy, or Proxyman are great, but AI coding agents can be awkward to inspect with traditional proxy setups. Some clients do not reliably honor HTTP PROXY / HTTPS PROXY . Some have their own networking behavior. Some use provider-specific base URL settings. Patching the client is fragile. ccglass takes a more targeted approach. It starts a local proxy, then launches or configures the target agent with the right base URL environment variable, such as: ANTHROPIC BASE URL OPENAI BASE URL The agent sends model traffic to the local proxy. ccglass logs it, forwards it to the real upstream API, and renders the result in a dashboard. That means you can inspect LLM traffic without installing a CA certificate, decrypting TLS, or modifying the client source code. Install ccglass with npm: npm install -g ccglass Then run: ccglass You can also start a specific client directly: ccglass claude ccglass codex ccglass opencode ccglass deepseek ccglass kimi For example: ccglass codex When it starts, ccglass prints a local dashboard URL: dashboard: http://127.0.0.1:57633 Open that URL and you can watch requests appear as the agent works. Sometimes an agent makes a bad decision because it did not see the context you expected. With ccglass, you can inspect the actual messages sent to the model instead of guessing from the terminal output. You can answer questions like: Coding agents are only as good as their tool loop. ccglass lets you inspect the tools shown to the model, the tool call selected by the model, the arguments passed to the tool, and the result that was fed back into the next request. That is useful when an agent: Long-running agent sessions can burn tokens quickly. ccglass shows token usage, cache usage, estimated cost, and latency per request and per session. That makes it easier to spot: If you use local gateways, OpenAI-compatible endpoints, Anthropic-compatible endpoints, OpenRouter, Ollama, Bedrock, Vertex, or internal proxies, request shape matters. ccglass helps you compare what the client sent with what the upstream expected. This is especially useful when debugging: base url configurationYou can export captured requests for deeper inspection or bug reports: ccglass export