TLDR: Following my recent blog post and Hacker News post (https://news.ycombinator.com/item?id=48528029). where I ran the desktop app on my M1 Max. This time, I’m using the self-hosted version, running in Docker, with an NVIDIA RTX 4090 (24 GB of VRAM).
The content is also fundamentally more demanding: long podcast episodes with at least two faces in every frame, coding tutorials packed with on-screen text, and screen recordings. GoPro footage is mostly wide outdoor shots.
But NVIDIA was much faster than my M1 Max.
The longest video was a livestream of 3h 12m indexed in 1h 52m (4,612 frames analyzed).
You can directly see the processing jobs results in JSON format here: https://gist.github.com/IliasHad/fd64e4d331e90e57d61e95f64e8...
Comments URL: [https://news.ycombinator.com/item?id=48731129](https://news.ycombinator.com/item?id=48731129)
Points: 1