I am a Data Engineer with seven years of experience in data analytics, now specializing in building robust, scalable data pipelines and automating complex processes.
My current focus is on designing progressive ETL/ELT architectures, using serverless services, and ensuring data quality and reliability. I build end-to-end projects that turn raw data into business value, applying software engineering best practices to data ecosystems.

Tech stack:
- Languages: Python, SQL
- Cloud: AWS (S3, EC2, Lambda, Glue, Athena, SQS, SNS, DynamoDB, EMR, Redshift, IAM), GCP (BigQuery)
- Processing & Orchestration: Airflow, Spark (PySpark), Apache Iceberg
- Data Quality & Validation: Pydantic, Pandera
- Infrastructure & Deployment: Docker, Terraform
- Databases & Storage: Relational (SQL), NoSQL, Data Warehousing
- Data Viz & Applications: Power BI, Tableau, Streamlit, Plotly/Dash

Currently working on:
- Developing a progressive portfolio of serverless data pipelines (from extraction to consumption).
- Implementing modern data lakehouse architectures using Apache Iceberg and AWS analytics services.
- Applying advanced data quality and validation checks within automated workflows.

Education:
- MBA in Data Science and Analytics - Universidade de São Paulo (USP)
- Bachelor's Degree in Science & Technology - Universidade Federal do ABC (UFABC)

