Medallion Architecture¶
Overview¶
The Medallion Architecture organizes data pipelines into progressive layers to ensure quality, reliability, and efficiency in analytics.
Layers¶
-
Bronze (Raw)
Raw ingested data from source systems. Minimal transformation, only cleaned for parsing errors. -
Silver (Cleaned / Curated)
Data is cleaned, deduplicated, and enriched. Suitable for basic analytics and reporting. -
Gold (Business-ready / Aggregated)
Fully transformed data optimized for dashboards, ML, and advanced analytics.
Principles¶
- Incremental processing: Each layer adds value without overwriting prior stages.
- Traceability: Easy to trace data lineage from Gold → Silver → Bronze.
- Reusability: Models at Silver and Gold layers can be shared across multiple labs.
Benefits¶
- Reduces data quality issues downstream.
- Enables faster experimentation and analytics.
- Facilitates modular ETL development in a multi-lab environment.
Visual Diagram¶
/filters:no_upscale()/articles/rethinking-medallion-architecture/en/resources/66figure-2-1737975898553.jpg)