Cloud Migration - Adobe
Engineered the migration of terabyte-scale data workloads from legacy on-premises infrastructure (Cloudera, Hive, Oozie) to an AWS cloud architecture. Optimized the migrated data pipelines for cost and performance.
AWS EMR · S3 · Hive · PySpark · AWS Glue · Terraform · SQL
Marketing Technology Platform - JCPenney
Built the Marketing Technology data platform from scratch, replacing an expensive outsourced solution. Ingested data from multiple marketing sources into a single unified platform.
PySpark · AWS EMR · Airflow · Redshift · Python · SQL · Terraform
CDC Data Platform - Banco Inter
Worked as a Data Engineer on projects at Banco Inter, Brazil's leading digital bank, migrating SQL procedures to PySpark to scale processing. Built robust data pipelines with EMR, S3, PySpark, Kafka, Delta Lake, and Airflow.
Kafka · Delta Lake · PySpark · Airflow · Spark Streaming · AWS EMR