cd /news/large-language-models/datasets-and-the-right-models · home topics large-language-models article
[ARTICLE · art-33382] src=discuss.huggingface.co ↗ pub= topic=large-language-models verified=true sentiment=· neutral

Datasets and the right models

A developer is seeking advice on which model excels at selection-with-abstention tasks, where the model must either pick a candidate with a confidence tier or abstain from outputting anything. The user has tried Gwen and Mistral but found them unsuitable for this specific structured output task, which involves fine-tuning a small instruct model via LoRA on a small corpus.

read1 min views1 publishedJun 18, 2026

Hey I was hoping for advice on which model can handle the sort of data sets I want to FT on. I’d like to give my model a corpus of information and train them to say nothing when they lack confidence. I’ve been using Gwen for this, and tried our Mistral to leave it takes date of another kind. I can’t even train it on what I need it to know. So which model excels at It’s a selection-with-abstention tasks, not generation. Each row gives the model a cue (a conversation turn) plus a numbered candidate slate of informative chunks, and the target output is a short structured string that either picks an index with a confidence tier or abstains:

│ -— surface candidate 3, confident │ -

— surface it, hedged │

│ - — abstain, surface nothing │

│ │

│ Format is completion-style SFT on a small instruct model (LoRA), small corpus (a few hundred real items, augmented).

── more in #large-language-models 4 stories · sorted by recency
── more on @gwen 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/datasets-and-the-rig…] indexed:0 read:1min 2026-06-18 ·