
Show HN: ML condenses billions of logs into a tiny snapshot your LLM can debug

Kaushik launched Rocketgraph, an open-source tool that uses machine learning to condense millions of log lines into a few hundred patterns with anomaly scores, enabling LLMs to debug production issues without manual querying. The tool addresses the gap in AI-native observability by fingerprinting logs and scoring anomalies based on frequency and text similarity.

Published Jun 17, 2026

Hi HN, I'm Kaushik, and I built Rocketgraph. I believe that while other spaces have caught up to the AI wave, the observability space is still lagging behind, using the same tools and dashboards that we built to analyse logs from human-written code. But now code is written and debugged by AI, so we need to rethink observability for a world where the observer itself is an AI.

The problem I run into is that when an alert fires, I have to manually check the Grafana dashboards and write LogQL queries, which is pretty much like grepping. But production usually breaks because of a schema mismatch, a DB connection issue, or a log line I haven't seen before that's buried under millions of other log lines. Even worse, sometimes the alert never fires, and I don't know when to grep.

Rocketgraph fixes that. It turns your logs into patterns by fingerprinting them, then uses ML to give each pattern an anomaly score based on features like frequency, text similarity and others. This usually condenses a million logs into 200-300 patterns with anomaly scores and feature vectors that your LLM can easily analyse without being sent the entire firehose. It runs at specific points in time, so it works like online anomaly detection over logs.
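To make the fingerprint-then-score idea concrete, here is a minimal Python sketch. It is not Rocketgraph's implementation: the masking rules, the names `fingerprint` and `summarize`, and the score formula are all assumptions made for illustration, and a plain heuristic stands in for the ML model. It masks variable tokens so similar lines collapse into one pattern, then scores each pattern on how much its frequency moved against a baseline window and how unlike any known pattern it is.

```python
import re
from collections import Counter
from difflib import SequenceMatcher
from math import log

# Hypothetical masking rules: replace variable tokens with placeholders.
# Order matters: specific shapes (UUID, IP, hex) run before the generic number rule.
_MASKS = [
    (re.compile(r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b", re.I), "<UUID>"),
    (re.compile(r"\b\d{1,3}(?:\.\d{1,3}){3}\b"), "<IP>"),
    (re.compile(r"\b0x[0-9a-f]+\b", re.I), "<HEX>"),
    (re.compile(r"\d+"), "<NUM>"),
]


def fingerprint(line: str) -> str:
    """Collapse a raw log line into a pattern by masking variable parts."""
    for pattern, placeholder in _MASKS:
        line = pattern.sub(placeholder, line)
    return " ".join(line.split())  # normalise whitespace


def summarize(lines, baseline):
    """Return one row per pattern in `lines`, most anomalous first.

    `baseline` is a Counter of pattern counts from an earlier, healthy window.
    """
    counts = Counter(fingerprint(line) for line in lines)
    total = sum(counts.values())
    base_total = sum(baseline.values())
    rows = []
    for pattern, count in counts.items():
        # Frequency feature: absolute log-ratio of the pattern's share now
        # versus in the baseline, with add-one smoothing for unseen patterns.
        cur_share = (count + 1) / (total + 1)
        base_share = (baseline.get(pattern, 0) + 1) / (base_total + 1)
        freq_shift = abs(log(cur_share / base_share))
        # Text-similarity feature: 1 minus the best match against known patterns.
        best_match = max(
            (SequenceMatcher(None, pattern, known).ratio() for known in baseline),
            default=0.0,
        )
        novelty = 1.0 - best_match
        rows.append({
            "pattern": pattern,
            "count": count,
            "freq_shift": round(freq_shift, 3),
            "novelty": round(novelty, 3),
            # Assumed combination: an unweighted sum, chosen only for simplicity.
            "score": round(freq_shift + novelty, 3),
        })
    rows.sort(key=lambda row: row["score"], reverse=True)
    return rows


if __name__ == "__main__":
    healthy = ["GET /users/1 200 in 12ms", "GET /users/2 200 in 15ms"] * 50
    baseline = Counter(fingerprint(line) for line in healthy)
    window = ["GET /users/7 200 in 11ms"] * 50 + ["ERROR db connection to 10.0.0.5 refused"]
    for row in summarize(window, baseline):
        print(row)
```

The rows returned by `summarize` are the kind of compact snapshot you could hand to an LLM in place of the raw lines. A real system would presumably use a learned model and richer features than this two-feature sum.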

Some companies do anomaly detection on metrics, but Rocketgraph does it for logs.

Other approaches in this space bolt an AI on top of existing Grafana dashboards, but that's the same thing as manually grepping with extra steps.

Please check out the example setups to host it locally and run it on your log files. Let me know what you guys think!

Comments URL: [https://news.ycombinator.com/item?id=48578324](https://news.ycombinator.com/item?id=48578324)

Points: 1
