# BEAVER: Enterprise benchmark for LLM Text-to-SQL from private data warehouses

> Source: <https://beaverbench.github.io/>
> Published: 2026-06-15 01:20:49+00:00

Please send an email to peterbc@mit.edu, along with your method name, a brief description of the method, and, optionally, a link to your paper or codebase. We will follow up with detailed instructions.

| Rank | Submission Date | Method | Model | Execution Accuracy |
|---|

If you find our data, code, or the paper helpful, please cite the paper:

```
@article{chen2024beaver,
  title={BEAVER: an enterprise benchmark for text-to-sql},
  author={Chen, Peter Baile and Yang, Devin and Li, Weiyue and Wenz, Fabian and Zhang, Yi and Tatbul, Nesime and Cafarella, Michael and Demiralp, {\c{C}}a{\u{g}}atay and Stonebraker, Michael},
  journal={arXiv preprint arXiv:2409.02038},
  year={2024}
}
```

BEAVER is a large-scale enterprise text-to-SQL dataset containing 9128 queries spanning 812 tables across 19 diverse domains. Of these, 7978 queries are publicly released, while the remaining portion is held out as a private test set. Queries and databases were collected from *private* organizations.

To facilitate fine-grained evaluation and analysis, we provide

Representative BEAVER tasks with question, SQL, and subtask annotations.

If you find our data, code, or the paper helpful, please cite the paper:

```
article{chen2024beaver,
  title={BEAVER: an enterprise benchmark for text-to-sql},
  author={Chen, Peter Baile and Yang, Devin and Li, Weiyue and Wenz, Fabian and Zhang, Yi and Tatbul, Nesime and Cafarella, Michael and Demiralp, {\c{C}}a{\u{g}}atay and Stonebraker, Michael},
  journal={arXiv preprint arXiv:2409.02038},
  year={2024}
}
```


