Hey, I was hoping for advice on which model can handle the sort of datasets I want to fine-tune (FT) on. I'd like to give my model a corpus of information and train it to say nothing when it lacks confidence. I've been using Qwen for this, and tried out Mistral only to learn it takes data of another kind. I can't even train it on what I need it to know. So which model excels at this? It's a selection-with-abstention task, not generation. Each row gives the model a cue (a conversation turn) plus a numbered candidate slate of informative chunks, and the target output is a short structured string that either picks an index with a confidence tier or abstains:
- surface candidate 3, confident
- surface it, hedged
- abstain, surface nothing

Format is completion-style SFT on a small instruct model (LoRA), small corpus (a few hundred real items, augmented).
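
To make the row shape concrete, here is a rough Python sketch of how one training example might be assembled. Everything specific in it is a made-up placeholder for illustration, not the real format: the `build_row` helper, the `prompt`/`completion` field names, the prompt wording, and the target strings (`3|high`, `3|low`, `none`).

```python
import json

# Hypothetical target strings, invented for illustration; the real ones may differ.
TARGETS = {"confident": "{idx}|high", "hedged": "{idx}|low", "abstain": "none"}


def build_row(cue, candidates, pick=None, tier="abstain"):
    """Return one prompt/completion pair for completion-style SFT."""
    # Number the candidate slate starting at 1 so the target index matches it.
    slate = "\n".join(f"{i}. {c}" for i, c in enumerate(candidates, start=1))
    prompt = f"Cue: {cue}\nCandidates:\n{slate}\nAnswer: "
    if tier == "abstain":
        completion = TARGETS["abstain"]
    else:
        if pick is None or not 1 <= pick <= len(candidates):
            raise ValueError("pick must be a valid 1-based candidate index")
        completion = TARGETS[tier].format(idx=pick)
    return {"prompt": prompt, "completion": completion}


row = build_row(
    "When is the release?",
    ["Pricing notes", "Team bios", "Release is set for May"],
    pick=3,
    tier="confident",
)
print(json.dumps(row))
```

Keeping the completion this short means the loss falls on only a handful of tokens per row, so whichever model is suggested, the share of abstain rows versus pick rows in the augmented set probably matters as much as the base model.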