Fine-tuning a multimodal large language model for clinician-grade autism behavioral scoring from short home videos

wpnews.pro

cd /news/large-language-models/fine-tuning-a-multimodal-large-langu… · home › topics › large-language-models › article

[ARTICLE · art-42891] src=arxiv.org ↗ pub=2026-06-29T04:00Z topic=large-language-models verified=true sentiment=↑ positive

Fine-tuning a multimodal large language model for clinician-grade autism behavioral scoring from short home videos

Researchers fine-tuned Gemini 2.5 Pro on 400 clinician-rated home videos to score 30 behavioral features for autism assessment, improving inter-rater reliability with clinicians by 40% and boosting zero-shot ASD diagnosis F1 by 53%. The model achieved 77% accuracy and 86% AUC on held-out children, matching or exceeding clinician performance, suggesting scalable autism screening from short home videos.

read1 min views1 publishedJun 29, 2026

arXiv:2606.27484v1 Announce Type: new Abstract: Autism spectrum disorder (ASD) affects 1 in 31 US children, yet median age at diagnosis exceeds four years. Artificial intelligence pipelines that provide quantified diagnosis using easy to access observational data (e.g., home videos) could help with earlier diagnosis, and timely delivery of early treatments. We fine-tuned Gemini 2.5 Pro on 400 clinician-rated home videos with low-rank adaptation, training only on 30 behavioral features previously validated to produce reliable predictions when passed to various ML models. On 99 held-out children (49 ASD, 50 neurotypical), inter-rater reliability with clinicians (per-feature weighted Cohen's kappa) improved by 40% (p<0.001), with 27 of 28 evaluable features improving. As an emergent zero-shot capability, direct ASD diagnosis F1 improved by 53% (p<0.001), matching or exceeding clinician outcomes. Classifier-assisted pipelines using fine-tuned LLM-derived behavioral features matched clinician-scored inputs across all tested pathways and achieved 77% accuracy (95% CI: 68-85%) and an AUC of 86% (95% CI: 78-92%). Fine-tuned multimodal LLMs can serve as scalable behavioral feature extractors for use in autism assessment and diagnosis.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/fine-tuning-a-multimodal…

Read original on arxiv.org → arxiv.org/abs/2606.27484

mentioned entities

Gemini 2.5 Pro

ASD

autism spectrum disorder

metadata

slugfine-tuning-a-multimodal-large-language-model-for-clinician-grade-autism-scoring

topic#large-language-models

secondary3 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevv0.5.6

── more in #large-language-models 4 stories · sorted by recency

github.com · 29 Jun · #large-language-models

PCB-Bench: Benchmarking LLMs for PCB Placement and Routing (ICLR 2026)

axios.com · 29 Jun · #large-language-models

Anthropic Claude Fable 5, on track to return soon (possibly this week)

arxiv.org · 29 Jun · #large-language-models

Tessellating The Earth

arxiv.org · 29 Jun · #large-language-models

DMV-Bench: Diagnosing Long-Horizon Multimodal Agents' Visual Memory with Incidental Cue Injection

── more on @gemini 2.5 pro 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 Jun · #ai-agents

OpenCode v1.17: Session Snapshots Undo Your AI Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required