cd /news/large-language-models/keyphrase-generative-representation-… · home topics large-language-models article
[ARTICLE · art-16065] src=arxiv.org pub= topic=large-language-models verified=true sentiment=↑ positive

Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies

Researchers analyzed 703,975 de-identified Kids Help Phone conversations from 2018-2023 and expanded the platform's 19-label issue taxonomy into a 39-label hierarchical schema. They introduced Keyphrase Generative Representation (KGR), a constrained large language model that generated conversation-specific keyphrases, achieving 81% accuracy in reflecting content and surfacing identity-linked themes such as immigration problems and caregiver burden. The KGR-supported topic-retrieval workflow increased accuracy from 0.25 to 0.70 over manual analysis, marking a shift toward hybrid, interpretable representations that capture emerging and culturally grounded patterns of youth distress beyond static taxonomies.

read1 min publishedMay 28, 2026

arXiv:2605.27546v1 Announce Type: new Abstract: Crisis Responders (CRs) rapidly assess thousands of youth SMS conversations each year to identify mental health concerns and guide support. Yet youth distress is increasingly expressed through evolving and context-specific language that often does not fit fixed-label taxonomies. This work analyzed 703,975 de-identified Kids Help Phone conversations (2018-2023) and expanded KHP's 19-label issue taxonomy into a 39-label hierarchical schema. We then introduce Keyphrase Generative Representation (KGR), a constrained LLM generating concise, conversation-specific keyphrases, evaluated across 129 conversations and 387 expert annotations. The expanded taxonomy achieved expert consensus reliability, with an accuracy of 0.96, and expert review found that 81% of keyphrases accurately reflected content and 74% improved clarity. KGR surfaced identity-linked themes absent from the fixed taxonomy, including immigration problems and caregiver burden, and supported a topic-retrieval workflow that increased accuracy from 0.25 to 0.70 (+0.45) over the manual analyst process. KGR marks a shift toward hybrid, interpretable generative representations that extend crisis response beyond static taxonomies to surface emerging and culturally grounded patterns of youth distress.

── more in #large-language-models 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/keyphrase-generative…] indexed:0 read:1min 2026-05-28 ·