Writing

Data Engineering
in the open

Deep dives on pipelines, cloud architecture, data modeling, and the future of data infrastructure. Written from the trenches.

All Pipelines Snowflake Apache Multicloud ML & AI Architecture
✍️

First articles coming soon

I'm writing about what I build — pipelines, cloud architecture, data engineering patterns. Subscribe to get notified when the first post drops.

Coming up

Building production Airflow DAGs that don't break at 3am

Error handling, alerting, and retry strategies for real-world data pipelines.

Snowflake cost optimization: the patterns nobody tells you

How to cut your Snowflake bill by 40% without sacrificing performance.

Terraform patterns for cross-cloud data infrastructure

IaC strategies that work across AWS, Azure, and GCP without going insane.

From notebook to production PySpark ML pipeline

The gap between a Jupyter notebook and a real ML pipeline — and how to cross it.