project · 2019-2021

DIME 2.0, Cloud-native data quality platform

Lead engineer on an enterprise data-quality platform with a pluggable connector framework for profiling, validation, and reconciliation across heterogeneous sources. Adopted across multiple internal customer engagements.

DIME 2.0 is a cloud-native distributed data-quality platform for enterprise data lakes. As lead engineer I designed the pluggable connector framework, each backend (SQL Server, BigQuery, Azure Synapse, Postgres, Delta Lake, S3 / ADLS) is a thin adapter implementing a common Connector interface; profiling, validation, and reconciliation rules run via the same orchestrator regardless of source.

What it does

Design highlights

Stack

Python · PySpark · Apache Airflow · Azure / AWS · pluggable connector framework.

← all projects