cd /news/large-language-models/something-just-like-trust-toxicity-r… · home topics large-language-models article
[ARTICLE · art-40510] src=aclanthology.org ↗ pub= topic=large-language-models verified=true sentiment=· neutral

♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target

Researchers introduced TRuST, a large-scale dataset with ~300k annotations for toxicity recognition, unifying prior resources through a synthesized definition. Benchmarking showed fine-tuned PLMs outperform LLMs on toxicity detection, target identification, and toxic word identification, while reasoning models did not reliably improve performance.

read1 min views1 publishedJun 22, 2026
♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target
Image: Aclanthology (auto-discovered)
Abstract

Toxic language includes content that is offensive, abusive, or that promotes harm. Progress in preventing toxic output from large language models (LLMs) is hampered by inconsistent definitions of toxicity. We introduce TRuST, a large-scale dataset that unifies and expands prior resources through a carefully synthesized definition of toxicity, and corresponding annotation scheme. It consists of ∼300k annotations, with high-quality human annotation on ∼11k. To ensure high-quality, we designed a rigorous, multi-stage human annotation process, and evaluated the diversity of the annotators. Then we benchmarked state-of-the-art LLMs and pre-trained models on three tasks: toxicity detection, identification of the target group, and of toxic words. Our results indicate that fine-tuned PLMs outperform LLMs on the three tasks, and that current reasoning models do not reliably improve performance. TRuST constitutes one of the most comprehensive resources for evaluating and mitigating LLM toxicity, and other research in socially-aware and safer language technologies.- Anthology ID:

- 2026.findings-acl.1854
- Volume:
[Findings of the Association for Computational Linguistics: ACL 2026](/volumes/2026.findings-acl/)- Month:
  • July
  • Year:
  • 2026
  • Address:
  • San Diego, California, United States
- Editors:
[Maria Liakata](/people/maria-liakata/),[Viviane P. Moreira](/people/viviane-p-moreira/unverified/),[Jiajun Zhang](/people/jiajun-zhang/unverified/),[David Jurgens](/people/david-jurgens/)- Venue:
[Findings](/venues/findings/)- SIG:
- Publisher:
  • Association for Computational Linguistics
- Note:
- Pages:
  • 37231–37251
- Language:
- URL:
[https://aclanthology.org/2026.findings-acl.1854/](https://aclanthology.org/2026.findings-acl.1854/)- DOI:
- Cite (ACL):
[♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target](https://aclanthology.org/2026.findings-acl.1854/)(Atıl et al., Findings 2026)- PDF:
[https://aclanthology.org/2026.findings-acl.1854.pdf](https://aclanthology.org/2026.findings-acl.1854.pdf)
── more in #large-language-models 4 stories · sorted by recency
── more on @trust 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/something-just-like-…] indexed:0 read:1min 2026-06-22 ·