Spider2-DBT is a benchmark for evaluating AI agents on dbt (data build tool) data transformation tasks. It consists of 68 real-world dbt projects using DuckDB databases, covering various domains like ...
This project is a reference implementation of a batch-oriented analytics pipeline designed to reflect how data transformations evolve from simple scripts to structured data platform workflows. In many ...