Data Observability
Data observability is the ability to understand the health of data flowing through your pipeline at any point in time. Without it, data issues are discovered by business users rather than engineers.
Data tooling including dbt, ClickHouse, and Power BI
View all tagsData observability is the ability to understand the health of data flowing through your pipeline at any point in time. Without it, data issues are discovered by business users rather than engineers.
A well-organized dbt project enables team collaboration, consistent conventions, and maintainable transformation pipelines. This document defines the standard folder structure and configuration patterns.
Data tests are the primary mechanism for ensuring correctness and catching regressions in a dbt project. This document defines a tiered testing strategy across all model layers.
This document covers ClickHouse installation, initial configuration, and the first steps for setting up an analytical environment.
Incremental refresh replaces full dataset imports with targeted refreshes of recent data only. It is the most impactful optimization for large Power BI datasets.