Skip to content

moritzkoerber/local-data-stack

Repository files navigation

Local data stack

This project demos how to integrate DuckDB and Delta Lake with dbt, orchestrated by Dagster. All data is stored and processed locally/on-premise. The whole pipeline is partitioned by date range and scheduled to run daily. The stack queries stock exchange (Xetra) data through an API and upserts the result into a Delta table (= bronze layer). The table serves as a source for dbt, which does a layered incremental load into a DuckDB database: first into silver, then into gold. Finally, the gold table is queried with DuckDB to create a line chart in Plotly. Overview:

Overview

How to run

Simply run $ uv run dg dev to launch the dagster webserver.

About

This project demos how to integrate DuckDB and Delta Lake with dbt, orchestrated by Dagster.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages