Pipeline ETL local (Python + pandas) que ingesta órdenes desde una API mock (JSON local), genera capa raw y curated particionada por fecha, con soporte incremental e idempotencia. etl-test/ ├─ ...
Welcome to the Data Warehouse and Analytics Project repository! 🚀 This project demonstrates a comprehensive data warehousing and analytics solution, from building a data warehouse to generating ...
BlazingSQL builds on RAPIDS to distribute SQL query execution across GPU clusters, delivering the ETL for an all-GPU data science workflow. BlazingSQL is a GPU-accelerated SQL engine built on top of ...