I'm an Environmental Engineer turned Data Engineer with a strong foundation in building efficient and scalable data architectures. My expertise includes data manipulation, transformation, and integration using Python
(pandas, NumPy) and SQL
. I have extensive experience in designing and managing data architectures for both Online Transaction Processing (OLTP) systems
, which handle real-time transaction data, and Online Analytical Processing (OLAP) systems
, optimized for complex data analysis. Additionally, I have developed and optimized ETL pipelines
for diverse data sources, implementing best practices for data processing. My technical stack includes cloud data management with AWS services
such as S3, Lambda, and RDS, where I design and manage data storage solutions to support analytics and reporting. I am passionate about creating impactful data solutions that drive decision-making and contribute to sustainable outcomes.
β»οΈ In my free time, I actively participate in various technology communities, including serving as an organizer for the Google Developers Group (GDG). I also enjoy engaging in data-centric hackathons, where I leverage my environmental engineering background to extract meaningful insights from data.
π¦ I'm currently learning Rust
ETL pipeline to analyze environmental crime data in Colombia, integrating data extraction from a public API, data transformation with pandas, and loading into PostgreSQL and AWS S3. Includes logging, error handling, and unit tests for reliability and scalability.
Simulates a real-time Smart City data pipeline with Kafka, Apache Spark, and S3. Streams and processes vehicle, GPS, weather, traffic, and emergency data with Dockerized components and Parquet storage for efficient, scalable data engineering.
This Rust-based command-line app records, analyzes, and assesses risk levels from data entries and geolocation info. Using DuckDB for storage, it calculates and stores monthly risk statuses, leveraging asynchronous requests and JSON handling for comprehensive risk analysis.
This project combines web scraping, PDF processing, and Natural Language Processing (NLP) to extract and analyze IPCC climate reports. It automates downloading PDFs, processes file validation, and applies NLP for data insights.
- Talking and sharing in LinkedIn