A production-leaning Python ETL pipeline built as a portfolio project for Data Engineering roles. It extracts the most recent NYC Yellow Taxi monthly dataset, validates and transforms it, and persists ...
    Portal Inmobiliario (live scrape)
            │   Scrapling: stealthy HTTP, adaptive selectors
       extract.py     ← 5 pages × 48 listings = ~240 properties
            │
       transform.py   ← Polars: clean, validate, enrich
            │            - parse price ...