Stefan Zhelev
Data Professional
phone
WhatsApp
PDF

Data Transformation

Transforms raw warehouse data into tested, documented analytics models using SQL and software-engineering practices.

image

Objective

Transform raw warehouse data into clean, tested, documented models using SQL and software-engineering practices — the T of ELT.

Open Source Alternatives

dbt Core — 9 / 10

The de facto standard. SQL-first, every modern warehouse supports it, every data engineer reads it, and the patterns (sources, tests, snapshots, models, exposures) generalise across projects. Real complaints: incremental story has rough edges, Jinja templating is divisive, and the macro ecosystem can feel hacky.

SQLMesh — 8 / 10

The most interesting upstart. Virtual environments (no full refresh on dev), actual semantic understanding of SQL, column-level lineage by construction, and incremental models that work correctly. Smaller community and ecosystem than dbt; technically the strongest challenger.

Dataform (OSS) — 7 / 10

Open-source dbt-like for BigQuery. Tightly tied to GCP; less momentum since the Google acquisition.

Apache Spark SQL — 6 / 10

Code-heavy transformations on lakes. Different audience (engineers, not analytics engineers) and a much heavier operational model.

Managed SaaS Alternatives

dbt Cloud — 9 / 10

Managed dbt with hosted IDE, CI integration, scheduler, semantic layer, and metrics. The commercial extension of dbt Core. Premium pricing.

Coalesce — 7 / 10

Visual + code transformation for Snowflake and Databricks. UI-centric for analysts who don’t write much SQL by hand. Paid SaaS; smaller community.

Materialize — 6 / 10

Streaming SQL transformations — different problem (incremental materialized views over live data). Comparable only when real-time matters more than batch.

Dataform (Google Cloud) — 7 / 10

Managed BigQuery-only transformation as a Google Cloud service. Convenient inside GCP; outside it, irrelevant.

Scoring summary

Tool Score Type Best for
dbt Core 9 OSS Warehouse SQL transformation
dbt Cloud 9 SaaS Managed dbt with hosted IDE + scheduler
SQLMesh 8 OSS Teams hitting dbt’s incremental limits
Coalesce 7 SaaS UI-driven transformation on Snowflake/Databricks
Dataform 7 OSS / SaaS BigQuery-only teams
Materialize 6 SaaS Streaming SQL (different problem)
Spark SQL 6 OSS Lake-native engineering teams

Top in this category

Top OSS pick: dbt Core. Top managed pick: dbt Cloud.

dbt is the unambiguous category top. SQLMesh is the only credible technical challenger and worth watching. No revision needed.

Work Experience

Epic Data Operations 7 months
Octopyth Data Engineering and Operations 1 year 11 months
MiFinity Business Intellignece Manager (1 direct report) 7 months
Nexo Senior Data Engineer (2 direct reports) 1 year 10 months
Rank Interactive Senior Data Analyst 1 year 8 months
IBM Predictive Analytics and Reporting 1 year 1 month
Hewlett-Packard Service Level Management and Reporting 6 years 2 months