olmo-eval: An evaluation workbench for the model development loop
The Allen Institute for AI released olmo-eval, an open-source evaluation workbench designed to support the iterative process of developing large language models. The tool builds on the institute's ear…