llm 0.32a2 Release of llm 0.32a2, highlighting that most reasoning-capable OpenAI models now use the `/v1/responses` endpoint instead of `/v1/chat/completions`, enabling interleaved reasoning across tool calls for GPT-5 class models. Users can now view summarized reasoning tokens in a different color when running prompts, with the option to hide them using the `-R` or `--hide-reasoning` flags. Release: llm 0.32a2 A bunch of useful stuff in this LLM alpha, but the most important detail is this one: Most reasoning-capable OpenAI models now use the /v1/responses endpoint instead of/v1/chat/completions . This enables interleaved reasoning across tool calls for GPT-5 class models. 1435 This means you can now see the summarized reasoning tokens when you run prompts against an OpenAI model, displayed in a different color to standard error. Use the -R or --hide-reasoning flags if you don't want to see that. Tags: llm, projects, openai, generative-ai, annotated-release-notes, ai, llms