Dream at SemEval-2026 Task 13: SALSA for Single-Pass Machine-Generated Code Detection

wpnews.pro

cd /news/large-language-models/dream-at-semeval-2026-task-13-salsa-… · home › topics › large-language-models › article

[ARTICLE · art-38770] src=arxiv.org ↗ pub=2026-06-25T04:00Z topic=large-language-models verified=true sentiment=↑ positive

Dream at SemEval-2026 Task 13: SALSA for Single-Pass Machine-Generated Code Detection

Researchers propose SALSA, a single-pass autoregressive LLM structured classification method, for detecting machine-generated code in SemEval-2026 Task 13. Their system achieves an out-of-distribution F1 score of 0.789, significantly outperforming the CodeBERT baseline of 0.305.

read1 min views1 publishedJun 25, 2026

arXiv:2606.25102v1 Announce Type: new
Abstract: Large language models have transformed code generation, raising concerns around authorship, assessment integrity, and software trust. SemEval-2026 Task 13 Subtask A operationalizes detection as binary classification over code snippets, with a particular emphasis on out-of-distribution (OOD) generalization across unseen programming languages and application domains. We propose a SALSA-style formulation, Single-pass Autoregressive LLM Structured Classification, that maps each class to a dedicated output token and trains the model to emit a single-token label in a structured response. Rather than engineering hand-crafted features or decision rules, this formulation delegates the authorship decision to the model. To improve OOD robustness, we combine balanced sampling across languages with parameter-efficient fine-tuning and conservative training (low learning rate, single epoch) to avoid overfitting to the training domain. Our best system achieves OOD $F_1 = 0.789$ on the official leaderboard, substantially outperforming the CodeBERT baseline ($F_1 = 0.305$).

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/dream-at-semeval-2026-ta…

Read original on arxiv.org → arxiv.org/abs/2606.25102

mentioned entities

SemEval-2026

CodeBERT

SALSA

metadata

slugdream-at-semeval-2026-task-13-salsa-for-single-pass-machine-generated-code

topic#large-language-models

secondary2 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevChinese models are sometimes bet…

next →Meta Pauses Employee Spyware Aft…

── more in #large-language-models 4 stories · sorted by recency

arxiv.org · 25 Jun · #large-language-models

Graph-Based Phonetic Error Correction of Noisy ASR

arxiv.org · 25 Jun · #large-language-models

Dustin: Draft-Augmented Sparse Verification for Efficient Long-Context Generation with Speculative Decoding

arxiv.org · 28 May · #large-language-models

Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering

arxiv.org · 25 Jun · #large-language-models

Perfect Detection, Failed Control: The Geometry of Knowing vs. Steering in Language Models

── more on @semeval-2026 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 24 Jun · #ai-policy

An AI startup is suing the US government for taking away Anthropic's new model

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required