Built an end-to-end production-style data engineering pipeline using Databricks, PySpark, SQL, Delta Lake, and AWS S3. Implemented Medallion Architecture (Bronze, Silver, Gold) with incremental data loads, automated job scheduling, and a Databricks SQL dashboard for business reporting. Key implementations: Incremental ingestion from AWS S3 Data cleaning and standardization (cities, customer names, prices, duplicates) Monthly sales aggregation and upserts using Spark SQL Automated pipeline execution using Databricks Jobs Leadership-ready Databricks SQL dashboard for revenue analytics GitHub: https://github.com/mkyz108/databricks-merger-sales-pipeline