dbt

Transform data in your data warehouse

TLDR

Debug the dbt project and the connection to the database

$ dbt debug

Run all models of the project

$ dbt run

Run all tests of example_model

$ dbt test --select example_model

Build (load seeds, run models, snapshots, and tests associated with) example_model and its downstream dependents

$ dbt build --select example_model+

Build all models, except the ones with the tag not_now

$ dbt build --exclude "tag:not_now"

Build all models with tags one and two

$ dbt build --select "tag:one,tag:two"

Build all models with tags one or two

$ dbt build --select "tag:one tag:two"

SYNOPSIS

dbt [global flags...] <command> [<args>...]
Common commands: debug, deps, docs-generate, docs-serve, ls, parse, run, seed, snapshot, sql, test

--version
    Display current dbt version

--help
    Show help message

--log-level DEBUG|INFO|WARN|ERROR
    Set logging verbosity

--project-dir PATH
    Specify dbt project directory (default: current)

--profiles-dir PATH
    Specify profiles.yml directory (default: ~/.dbt)

--target TARGET
    Specify target config in profiles.yml

--threads INTEGER
    Number of worker threads to use (default: 4)

--vars '{key: value}'
    Set runtime variables for models

--select PATH|TAG|MODEL
    Select specific resources to run

--exclude PATH|TAG|MODEL
    Exclude specific resources

--models MODEL
    DEPRECATED; use --select

--full-refresh
    Run full refresh on incremental models

--state ARTIFACT_PATH
    Compare against prior state for change detection

DESCRIPTION

dbt (data build tool) is a command-line tool that enables analytics engineers to transform data in warehouses more effectively. It treats data transformation as code, allowing SQL models to be version-controlled, tested, documented, and scheduled.

dbt compiles and executes SQL SELECT statements defined in modular .sql files, building a dependency graph (DAG) of models, sources, seeds, snapshots, and tests. It supports incremental models for efficiency, generic testing, singular testing, and auto-generated documentation.

Integrates with warehouses like Snowflake, BigQuery, Postgres, Redshift via adapters. Requires Python and installation via pip. Ideal for production data pipelines with CI/CD integration.

Usage involves creating a dbt project with dbt init, defining models in models/ dir, and running commands like dbt run or dbt test.

dbt

Transform data in your data warehouse

TLDR

SYNOPSIS

PARAMETERS

DESCRIPTION

CAVEATS

INSTALLATION

PROJECT SETUP

KEY SUBCOMMANDS

HISTORY

SEE ALSO