04:00
2026-06-04
arxiv.org
large-language-models
Can I Take Another Dose? Evaluating LLM Decision-Making Under Temporal Uncertainty in OTC Dosing QA
Researchers introduced DOSEBENCH, a benchmark of 81 over-the-counter dosing scenarios for adult acetaminophen and ibuprofen, to evaluate large language models' ability to answer safe dosing questions.…