I built a $0.0005 screenshot cropper that saves AI agents 95% on vision LLM costs A developer built a stateless pay-per-use API that crops browser screenshots to specific elements, reducing vision LLM costs by 95%. The API uses the x402 payment protocol, charging $0.0005 USDC per crop on Base L2, and eliminates the need for API keys or subscriptions. If you're building AI agents that work with browser screenshots, you already know the pain. You take a full 1920×1080 screenshot, pass it to GPT-4o or Claude, and watch your token bill climb — while the model downscales the image anyway and blurs the exact text you needed it to read. There's a better way. Vision LLMs are expensive for two reasons when you feed them full screenshots: But your agent already knows where to look. Browser automation tools like Playwright and Puppeteer give you getBoundingClientRect — the exact pixel coordinates of any element on screen. So why are you sending the whole screenshot? I built a stateless pay-per-use API that takes a screenshot and pixel coordinates, and returns just the cropped element as a lossless PNG — ready to pass directly to your vision LLM. POST /crop { "image": "