WHY IT MATTERS

As AI benchmarks become increasingly saturated with coding tasks, math problems, and synthetic evaluations, a growing number of researchers are asking a different question: how should we measure performance on real-world knowledge work?

https://x.com/trytrata/status/2062962521892598174

Trata, a startup from Y Combinator's Winter 2025 batch, believes the answer may lie in the workflows used by professional investors. This week, the company released Hedge Bench, a benchmark desig...