cd /news/large-language-models/truth-or-sophistry-lofa-a-benchmark-… · home topics large-language-models article
[ARTICLE · art-45921] src=arxiv.org ↗ pub= topic=large-language-models verified=true sentiment=· neutral

Truth or Sophistry? LoFa: A Benchmark for LLM Robustness Against Logical Fallacies

Researchers introduced LoFa, a benchmark for evaluating large language models' robustness against logical fallacies, using a multi-agent pipeline and a multi-round debate framework. Experiments revealed varying vulnerability profiles across models, with the metric LFR@k quantifying resistance to fallacious attacks.

read1 min views1 publishedJul 1, 2026

arXiv:2606.31039v1 Announce Type: new Abstract: Large Language Models (LLMs) exhibit strong semantic capabilities, yet their resilience to manipulative linguistic patterns such as logical fallacies remains underexplored. Prior work has primarily examined whether LLMs can identify or classify fallacies, leaving their robustness against fallacious persuasion insufficiently studied. To address this gap, we introduce LoFa (Logical Fallacy), a comprehensive benchmark for evaluating LLM robustness against fallacies. LoFa is constructed through a multi-agent pipeline that pairs factual questions with fallacious arguments, and is accompanied by a multi-round debate framework for assessing model resilience under sustained adversarial persuasion. To disentangle fallacy robustness from a model's inherent knowledge limitations, we further propose Logical Fallacy Resistance at k (LFR@k), a metric that quantifies resistance to fallacious attacks. Experiments show that LLMs exhibit varying levels of robustness across different fallacy types, revealing distinct vulnerability profiles among models.

── more in #large-language-models 4 stories · sorted by recency
── more on @lofa 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/truth-or-sophistry-l…] indexed:0 read:1min 2026-07-01 ·