04:00
2026-06-12
arxiv.org
large-language-models
ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs
Researchers have developed ToolSense, an open-source diagnostic framework that automatically generates three benchmarks to audit whether large language models truly understand their parametric tool knโฆ