#
Staying on Top of ML Research
With ~10,000 new papers on Arxiv every month, staying current in your specific niche is nearly impossible through manual browsing.
#
The Automation
I built an Arxiv scraper on Apify that:
Keyword search: Define the topics you care about (e.g., "diffusion models", "LLM alignment", "RLHF") #
Scheduled runs: Set it to check daily or hourly #
Structured output: Returns paper title, authors, abstract, arXiv URL, PDF link, and categories #
Easy integration: JSON output works with any webhook, Slack bot, or Notion database
#
Example use: Slack Bot
#
Why This Matters
Researchers and engineers waste hours browsing Arxiv. An automated pipeline means:
- Zero missed papers in your niche
- Daily digest delivered to your preferred platform
- Easy collaboration with teams (shared paper feeds)
Try it on the [Apify Store](https://apify.com/technicaldost/arxiv-paper-scraper) — free tier available.