Skip to content
View gdcur's full-sized avatar

Block or report gdcur

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gdcur/README.md

Gianfranco De Curtis - Senior Data Engineer

20+ years building production-grade data pipelines, lakehouse architectures, and analytics platforms across energy, finance and enterprise domains.

Most of my work lives at the intersection of complex data sources, cloud-native AWS infrastructure, and the unglamorous but critical work of making data actually trustworthy in production.

What I build

  • End-to-end ETL/ELT pipelines for structured and semi-structured data
  • Lakehouse architectures on AWS (S3, Glue, Athena, Lambda, Delta Lake)
  • Schema evolution and temporal data models built for production reality
  • Data quality and reconciliation frameworks at scale
  • Workflow automation with Python, Airflow, and Docker

Active projects

ercot-plan-ranker — A transparent, runnable pipeline that simulates and ranks Texas electricity plans against realistic usage profiles and weather scenarios. Portfolio-friendly starting point for a production-style lakehouse. Roadmap includes dbt, Airflow, Postgres Bronze/Silver/Gold layers.

xml-drift-lakehouse (work in progress) — A generic open-source toolkit for ingesting, parsing, and modeling XML-based data sources into lakehouse architectures. Designed to be portable across industries.

Stack

Python, SQL, AWS (Glue, Athena, Lambda, S3, DynamoDB, Fargate), Delta Lake, dbt, Docker, Airflow, PySpark

Currently

Open to senior data engineering opportunities in the Dallas-Fort Worth area.

LinkedIn

Pinned Loading

  1. ercot-plan-ranker ercot-plan-ranker Public

    ERCOT plan comparison demo: simulate electricity bill costs from 15-minute usage and weather scenarios, then rank plans (MVP, extensible lakehouse roadmap).

    Python 1

  2. xml-drift-lakehouse xml-drift-lakehouse Public

    Schema-on-read toolkit for ingesting XML sources with structural drift into a lakehouse architecture.

    Python 1