Build the
platform under
the platform — CI/CD for data
The DevOps spine for a 15-engineer dbt + Snowflake team: trunk-based Git, dbt tests in GitHub Actions with PR-scoped schemas, blue/green deploys via atomic view swaps, Snowflake time-travel rollback, and Terraform-managed warehouses + roles + grants.
The platform-engineer system-design round — when an interviewer asks how you’d ship dbt safely for a team of 15, this is the project that lets you walk the whole pipeline without hand-waving any layer.
- Trunk-based Git workflow: pre-commit (SQLFluff + dbt compile + secret scan), CODEOWNERS, branch protection
- GitHub Actions dbt CI with generic + custom + singular tests, PR-scoped CI schemas, and Slim CI on state:modified+
- Blue/green deploys on Snowflake: BLUE/GREEN schemas, atomic view swap inside BEGIN/COMMIT, DEPLOY_STATE audit
- Time-travel rollback playbook: <60s view re-swap + Snowflake AT/BEFORE for point-in-time recovery
- Terraform-managed Snowflake: warehouses, role hierarchy (LOADER / TRANSFORMER / REPORTER), grants across dev/staging/prod
- Multi-stage Docker image + freshness SLA checks + SEV runbooks for the on-call rotation
dbt build, and basic SQL DDL. Snowflake account helpful but not required — Part 1 walks setup. Docker + Terraform get introduced from scratch in Part 4.The #1 skill gap between data engineer and platform engineer.
Most data teams still ship dbt by hand — `dbt run --target prod` from someone's laptop, hope for the best. Mature orgs (Netflix, Stripe, Airbnb, dbt Labs) treat data infra like application infra: PRs, automated tests, blue/green, rollback. This is the project that proves you can build that bar.
PRs, not laptop pushes
Trunk-based flow with pre-commit, CODEOWNERS, and required CI checks. Every change is reviewed and tested before main.
Slim CI keeps PRs <5 min
Run only the dbt models touched by the PR via state:modified+. Manifest caching cuts a 45-min full-build to a 4-min selective run.
Zero-downtime deploys
Atomic view swap inside BEGIN/COMMIT. Production reads stay pointed at BLUE while GREEN builds, then re-point in one transaction.
Rollback you can trust
Time-travel via Snowflake AT/BEFORE plus a re-swap of production views — sub-minute recovery, audited in DEPLOY_STATE.
Module 01 is free. The rest unlocks with PRO.
Try Module 01 — set up the Git foundation, configure dev/staging/prod profiles, install pre-commit, validate environment parity. About 4 hours. If it clicks, upgrade to unlock the CI, blue/green, and Terraform modules.
DataOps: CI/CD & Infrastructure as Code
This curriculum is the foundation for the project — not a sales add-on. PRO subscribers get full access to every module.
Three sprints. Three checkpoints. One production-grade pipeline.
Each phase ends with a tagged commit and a runnable artifact. No ambiguity about where you are.
Trunk-based Git with pre-commit + branch protection (Module 01). dbt CI on GitHub Actions with PR-scoped schemas + Slim CI cutting full-build time to a selective <5-min run (Module 02).
- ✓Pre-commit + CODEOWNERS + dev/staging/prod profiles
- ✓dbt CI workflow with PR-scoped schemas
- ✓Slim CI on state:modified+ with manifest cache
BLUE/GREEN schema pattern with atomic view swap inside BEGIN/COMMIT. Pre-cutover checks. Time-travel rollback + DEPLOY_STATE audit. Orchestrator chains the whole pipeline.
- ✓BlueGreenDeployer + DEPLOY_STATE table
- ✓Pre-cutover validation framework
- ✓Time-travel rollback + orchestrator.py
Snowflake-as-code via Terraform: warehouses, RBAC role hierarchy, grants per env. Multi-stage Docker image. Freshness SLA + row-count anomaly checks. SEV runbooks + deployment audit model.
- ✓terraform/ for warehouses + roles + grants
- ✓Multi-stage Dockerfile + docker-compose
- ✓Freshness SLA + SEV runbooks + audit model
One repo. dbt + Snowflake + GitHub Actions + Terraform.
The starter kit ships the full PipelineOps Inc repo — dbt project skeleton, GitHub Actions workflows, blue/green Python orchestrator, Terraform modules, sample CSVs with intentional data-quality issues, and the seed schema for Snowflake.
What lives in the repo
Everything you need to build the platform under the platform — dbt, CI workflows, deploy scripts, IaC, and seed data with deliberate quality bugs so your CI gates catch real failures.
- models/ + tests/ — staging/intermediate/marts dbt project with generic, custom, and singular tests
- .github/workflows/ — dbt-ci.yml, dbt-slim-ci.yml, env-parity-check.yml, schema cleanup, manifest cache
- deploy/ — blue_green.py, pre_cutover_checks.py, rollback.py, backfill.py, orchestrator.py
- terraform/ — warehouses.tf, roles.tf, grants per dev/staging/prod with remote state backend
- Dockerfile + docker-compose.yml — multi-stage build (Python 3.11 slim) + local-parity stack
- runbooks/ + scripts/ — SEV templates + check_freshness.py + chaos_test.py skeletons
CI/CD Data Platform Starter Kit
Pre-built repo with dbt project skeleton, GitHub Actions workflows, blue/green orchestrator, Terraform modules, SEV runbook templates, and 14,800 rows of seed data with intentional QA issues.
The same dbt project — but with the safety net a 15-engineer team needs.
Most teams ship dbt by hand: dbt run --target prod from a laptop, no review, no rollback. The patterns in this project — state:modified+, atomic view swaps, time travel, Terraform-managed RBAC — are what unlocks safe parallelism without breaking the warehouse.
dbt build --target prod on merge, audit-loggeddbt build in PR-scoped CI_PR_N schema, cleaned on closeAT/BEFORE time travel — sub-minute, recorded in DEPLOY_STATEBLUE/GREEN schemas; production reads see only the active oneReal review from senior engineers who shipped this stack.
Submit your repo, get line-by-line feedback within 48 hours. The kind of review that's quietly worth thousands of dollars in time-to-staff.
4 reviews / month
Submit a repo, a PR, or a Terraform plan. Reviewer is matched to your domain — dbt + GitHub Actions + Snowflake for this project. Async, comments inline, average turnaround 31 hours.
2 office hours / month
Live 30-min sessions with a senior platform engineer. Architecture questions, walk a tricky deploy migration, mock a system-design interview on CI/CD for data. Group sessions also available.
One subscription. 15+ projects, all curriculum, code review.
PRO is built for engineers who want production-grade builds and feedback loops — not more tutorials.
Pick this if your dbt project is load-bearing for someone else’s revenue.
Data engineers going senior+
You ship dbt every day but deploys are still hand-rolled. This is the project that turns 'I can write models' into 'I can run a 15-engineer dbt team safely.'
Analytics engineers ready to own delivery
You can build the marts. Now you want to own the pipeline that ships them — PRs, CI, blue/green — without waiting for a platform team to build it for you.
Platform engineers entering data
You've shipped CI/CD for app code. Data is a new shape — atomic view swaps instead of containers, time travel instead of revert. This translates the patterns.
Eng managers and tech leads
You're the one writing the SLA and the SEV runbook. This gives you the reference architecture and the language to argue for it in your roadmap reviews.
Going deeper? Three tracks back this project.
CI/CD is the spine of this project. These three curriculums let you go deeper on the layers that matter most — the dbt models you're shipping, the freshness checks you're alerting on, and the governance posture that proves the platform is safe.
Quick answers.
Ready to build the platform under the platform?
Start with Module 01 — free, no card. About 4 hours. By the end you'll have trunk-based Git, pre-commit hooks, dev/staging/prod dbt profiles, and a parity-validated environment foundation. If it clicks, upgrade to unlock CI, blue/green, and Terraform modules.