04:00
2026-06-18
arxiv.org
large-language-models
DeFAb: A Verifiable Benchmark for Defeasible Abduction in Foundation Models
Researchers introduced DeFAb, a benchmark for defeasible abduction in foundation models, converting knowledge bases into 372,648+ logically verifiable instances. Frontier language models achieved at mโฆ