♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target

wpnews.pro


[ARTICLE · art-40510] src=aclanthology.org ↗ pub=2026-06-22T00:00Z topic=large-language-models verified=true sentiment=· neutral


Researchers introduced TRuST, a large-scale dataset with ~300k annotations for toxicity recognition that unifies prior resources under a synthesized definition of toxicity. In benchmarks, fine-tuned pre-trained language models (PLMs) outperformed LLMs on all three tasks (toxicity detection, target-group identification, and toxic-word identification), while reasoning models did not reliably improve performance.

Published Jun 22, 2026


Abstract

Toxic language includes content that is offensive, abusive, or that promotes harm. Progress in preventing toxic output from large language models (LLMs) is hampered by inconsistent definitions of toxicity. We introduce TRuST, a large-scale dataset that unifies and expands prior resources through a carefully synthesized definition of toxicity and a corresponding annotation scheme. It consists of ~300k annotations, with high-quality human annotation on ~11k. To ensure high quality, we designed a rigorous, multi-stage human annotation process and evaluated the diversity of the annotators. We then benchmarked state-of-the-art LLMs and pre-trained language models (PLMs) on three tasks: toxicity detection, identification of the target group, and identification of toxic words. Our results indicate that fine-tuned PLMs outperform LLMs on the three tasks, and that current reasoning models do not reliably improve performance. TRuST constitutes one of the most comprehensive resources for evaluating and mitigating LLM toxicity, and for other research in socially aware and safer language technologies.

- Anthology ID: 2026.findings-acl.1854
- Volume: [Findings of the Association for Computational Linguistics: ACL 2026](/volumes/2026.findings-acl/)
- Month: July
- Year: 2026
- Address: San Diego, California, United States
- Editors: [Maria Liakata](/people/maria-liakata/), [Viviane P. Moreira](/people/viviane-p-moreira/unverified/), [Jiajun Zhang](/people/jiajun-zhang/unverified/), [David Jurgens](/people/david-jurgens/)
- Venue: [Findings](/venues/findings/)
- Publisher: Association for Computational Linguistics
- Pages: 37231–37251
- URL: [https://aclanthology.org/2026.findings-acl.1854/](https://aclanthology.org/2026.findings-acl.1854/)
- Cite (ACL): Berk Atıl, Namrata Sureddy, and Rebecca J. Passonneau. 2026. ♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target. In Findings of the Association for Computational Linguistics: ACL 2026, pages 37231–37251, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal): [♪ Something Just Like TRuST ♪ *: Toxicity Recognition of Span and Target](https://aclanthology.org/2026.findings-acl.1854/) (Atıl et al., Findings 2026)
- PDF: [https://aclanthology.org/2026.findings-acl.1854.pdf](https://aclanthology.org/2026.findings-acl.1854.pdf)
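
The abstract names three benchmark tasks: toxicity detection, identification of the target group, and identification of toxic words. As a rough illustration of how predictions on such tasks could be scored, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `Annotation` record, its field names, the placeholder labels, and the choice of accuracy and token-level F1 are hypothetical and are not taken from the TRuST paper, whose actual schema and metrics are not described in this summary.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Annotation:
    """Hypothetical record; field names are illustrative, not TRuST's schema."""
    text: str
    is_toxic: bool                                  # task 1: toxicity detection
    target_group: Optional[str] = None              # task 2: targeted group, if any
    toxic_tokens: set = field(default_factory=set)  # task 3: indices of toxic tokens


def accuracy(gold: list, pred: list) -> float:
    """Fraction of aligned positions where the prediction equals the gold label."""
    if not gold:
        return 0.0
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def token_f1(gold: set, pred: set) -> float:
    """F1 over token indices; two empty sets count as a perfect match."""
    if not gold and not pred:
        return 1.0
    overlap = len(gold & pred)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


def score(gold: list, pred: list) -> dict:
    """Score aligned gold and predicted annotations on the three tasks."""
    n = len(gold)
    span_scores = [token_f1(g.toxic_tokens, p.toxic_tokens) for g, p in zip(gold, pred)]
    return {
        "detection_acc": accuracy([g.is_toxic for g in gold], [p.is_toxic for p in pred]),
        "target_acc": accuracy([g.target_group for g in gold], [p.target_group for p in pred]),
        "span_f1": sum(span_scores) / n if n else 0.0,
    }


# Placeholder data only; the texts and group labels are invented stand-ins.
gold = [
    Annotation("placeholder text a", True, "group_x", {2, 3}),
    Annotation("placeholder text b", False),
]
pred = [
    Annotation("placeholder text a", True, "group_y", {3}),
    Annotation("placeholder text b", False),
]

result = score(gold, pred)
print(result)
```

In this toy run the detector gets both toxicity labels right, gets one of the two target labels wrong, and recovers one of the two toxic tokens in the first example, so the three scores diverge. That is the kind of per-task breakdown a three-task benchmark makes visible.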



