14:45
2026-05-12
arize.com
large-language-models
Models got an order of magnitude better at following instructions in one year
New research shows frontier AI models have improved nearly tenfold in their ability to follow instructions over the past year, according to data from the IFScale benchmark. A year ago, models began loโฆ