𝗪𝗵𝗮𝘁 𝗶𝗳 𝐫𝐞𝐥𝐢𝐚𝐛𝐥𝐲 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗻𝗴 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮 𝘀𝗰𝗶𝗲𝗻𝗰𝗲 𝐭𝐚𝐬𝐤𝐬 𝘄𝗮𝘀 𝐟𝐢𝐧𝐚𝐥𝐥𝐲 𝘄𝗶𝘁𝗵𝗶𝗻 𝗿𝗲𝗮𝗰𝗵?!

wpnews.pro

cd /news/machine-learning/what-if-reliably-automating-your-dat… · home › topics › machine-learning › article

[ARTICLE · art-35670] src=dev.to ↗ pub=2026-06-21T15:26Z topic=machine-learning verified=true sentiment=↑ positive

𝗪𝗵𝗮𝘁 𝗶𝗳 𝐫𝐞𝐥𝐢𝐚𝐛𝐥𝐲 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗻𝗴 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮 𝘀𝗰𝗶𝗲𝗻𝗰𝗲 𝐭𝐚𝐬𝐤𝐬 𝘄𝗮𝘀 𝐟𝐢𝐧𝐚𝐥𝐥𝐲 𝘄𝗶𝘁𝗵𝗶𝗻 𝗿𝗲𝗮𝗰𝗵?!

A developer built OpenDataSci, an autonomous agent for data science and machine learning tasks, and tested it on a Kaggle competition. Without any guidance, the agent achieved an AUC of 0.95, placing in the top 30% out of over 3,000 teams. The tool automates the entire DS/ML lifecycle, from EDA to final evaluation, and works with major LLM providers.

read2 min views1 publishedJun 21, 2026

We all know the grind of working with data, even with AI tools: every experiment starts with re-explaining everything, every iteration needs you to prompt, wait, review, correct, and repeat. And the moment you close the session, everything learned is gone.

It makes us the bottleneck, and this hinders human-AI collaboration...

So I built 𝐎𝐩𝐞𝐧𝐃𝐚𝐭𝐚𝐒𝐜𝐢, an autonomous agent purpose-built for DS/ML, and tested it on Kaggle. I enrolled in a recent competition, ran the agent with no hints, no guidance, while ironing my shirts.

In one shot, it landed AUC 0.95, a top-30% finish out of 3K+ teams and 36K+ submissions using hashtag#Anthropic's Claude Sonnet 4.6. (More on this in README)

The top-1 outperformed this agent by merely 0.004, but at the cost of massive manual effort even while using popular AI tools. The needed a dozen model families, deep learning, 400-feature notebooks, AutoML sweeps across many libraries, and 186 models ensembled carefully. Essentially a few weeks worth of effort and time!!

OpenDataSci abstracts away all the complexity and has so much to offer for DS/ML automation:

→ Owns the entire development lifecycle from EDA to final evaluation

→ Plans, codes, and executes autonomously in a secure local sandbox

→ Self-reviews and corrects before anything reaches you

→ Remembers your data across sessions, gets smarter each run

→ Runs parallel experiments and ensembles

→ Has advanced context management for token efficiency and quality

→ Ships with predefined skills for DS/ML, so it knows how to do things right

→ Bring your own knowledge: out-of-the-box support for custom skills → Works with any major LLM provider (hashtag#Anthropic, hashtag#OpenAI, hashtag#Bedrock, hashtag#VertexAI, hashtag#Ollama, hashtag#vLLM, and any OpenAI-compatible server).

This and so much more!! You set the goal. It does the work. No data science knowledge required.

🔗 [https://github.com/f4roukb/open-data-sci](https://github.com/f4roukb/open-data-sci)

📦 pip install open-data-sci

Spin it up on your data and see what it achieves!

source & further reading

dev.to — original article 🏃🏿‍♂️ FREE AT DAWN: Survive the Shortest Night in this Gemini-Powered Juneteenth Runner I built OpenCode Nexus: A multi-agent workflow plugin for OpenCode that prevents context loss and enforces two-stage code reviews. Codex 5.4 vs 5.5 pricing and quality

~/api · this article 200

$curl api.wpnews.pro/v1/news/what-if-reliably-automat…

Read original on dev.to → dev.to/f4roukb/-55n5

mentioned entities

OpenDataSci

Kaggle

Anthropic

Claude Sonnet

OpenAI

Bedrock

VertexAI

Ollama

metadata

slugwhat-if-reliably-automating-your-data-science-tasks-was-finally-within-reach

topic#machine-learning

secondary3 topics

sentimentpositive

canonicaldev.to

navigation

← prevShow HN: VS Code agent optimized…

next →NVK Adds Experimental DLSS Suppo…

── more in #machine-learning 4 stories · sorted by recency

smartdino.dev · 21 Jun · #machine-learning

Show HN: VS Code agent optimized for affordable coding plans

dev.to · 21 Jun · #machine-learning

I Let Claude Analyze 500 Stocks—Here's What It Picked

dev.to · 21 Jun · #machine-learning

A notebook remembered more about me

letsdatascience.com · 21 Jun · #machine-learning

Parent Helps School Write First AI Policy

── more on @opendatasci 3 stories trending now

wpnews · 20 Jun · #ai-agents

Amazon Bedrock AgentCore Memory: Build AI Agents That Remember

wpnews · 20 Jun · #artificial-intelligence

Microsoft is rewriting the economics of enterprise AI and the bill shock is just getting started

wpnews · 21 Jun · #large-language-models

Anthropic faces a class action lawsuit accusing it of selling Claude Max subscribers far less than advertised

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required