How LLM Function Calling Actually Works — From Tokens to Tool Orchestration
An engineer explains how LLM function calling works under the hood, contrasting plain text, JSON mode, and schema-constrained function calling. The post details how constrained decoding enforces schema compliance at the …
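To make the idea of constrained decoding concrete, here is a minimal sketch. It assumes one common approach: at each step, tokens that would break the schema are masked out before the next token is picked. The vocabulary, the `fake_logits` stand-in for a model, and the two-string "schema" in `ALLOWED` are all hypothetical and chosen for illustration. A real implementation would typically compile a JSON Schema into a grammar or state machine rather than enumerate valid outputs.

```python
import math

# Hypothetical toy vocabulary; real tokenizers have tens of thousands of entries.
VOCAB = ['{"unit":', '"celsius"', '"fahrenheit"', '"kelvin"', '}', 'Sure!', '<eos>']

# Hypothetical "schema": the complete outputs we are willing to accept.
ALLOWED = ['{"unit":"celsius"}', '{"unit":"fahrenheit"}']

# Fixed scores standing in for a model that prefers chatty or invalid tokens.
SCORES = {
    'Sure!': 5.0, '"kelvin"': 4.0, '"fahrenheit"': 2.0, '{"unit":': 1.0,
    '"celsius"': 0.5, '}': 0.2, '<eos>': 0.1,
}


def fake_logits(prefix):
    """Stand-in for a model forward pass; ignores the prefix for simplicity."""
    return [SCORES[token] for token in VOCAB]


def is_allowed(prefix, token):
    """True if emitting `token` keeps the output on a path to a valid string."""
    if token == '<eos>':
        # Stopping is only valid once the output is a complete allowed string.
        return prefix in ALLOWED
    candidate = prefix + token
    return any(target.startswith(candidate) for target in ALLOWED)


def decode(constrained, max_steps=10):
    """Greedy decoding, optionally masking tokens that violate the schema."""
    out = ''
    for _ in range(max_steps):
        logits = fake_logits(out)
        if constrained:
            # Disallowed tokens get -inf so they can never be selected.
            logits = [
                score if is_allowed(out, token) else -math.inf
                for score, token in zip(logits, VOCAB)
            ]
        best = max(range(len(VOCAB)), key=lambda i: logits[i])
        if logits[best] == -math.inf:
            raise ValueError('no valid continuation from ' + repr(out))
        if VOCAB[best] == '<eos>':
            break
        out += VOCAB[best]
    return out


if __name__ == '__main__':
    print(decode(constrained=False)[:20])  # starts with the chatty token
    print(decode(constrained=True))
```

In this toy setup the unconstrained run keeps choosing the highest-scoring token (`Sure!`) and never produces valid JSON, while the constrained run can only ever emit one of the two allowed objects. The model's preferences still matter inside the valid set: it picks `"fahrenheit"` over `"celsius"` because that token scores higher among the permitted options.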