Passionate about transforming raw data into actionable insights and building scalable data solutions that drive business value.
- π Currently working on: Data Engineering & Analytics projects
- πΌ Role: Senior Data Engineer & Data Analyst
- π± Specializing in: Data Architecture, ETL Pipelines, Analytics & Visualization
- π― Focus areas: Data Quality, Governance, Cloud Data Solutions
- π Location: UTC -03:00
- π¬ Ask me about: Python, SQL, Data Warehousing, Cloud Platforms
- π« Reach me: LinkedIn
βοΈ Enterprise Data Warehouse
Snowflake-based security analytics data warehouse for IT department of a construction and infrastructure solutions company, processing 10M+ records daily with 98% automation rate and significant operational cost savings.
Highlights:
- 3-layer architecture (Bronze/Silver/Gold) with dimensional modeling
- 550+ database objects (141 raw tables, 32 dimensions, 23 facts, 148 views)
- Real-time Snowpipe ingestion + batch processing
- SCD Type 2 for historical tracking
- Power BI and Streamlit dashboards
Tech Stack: Snowflake, Python 3.13, dbt, SQL, Power BI, Streamlit Industry: Construction & Infrastructure (IT Security Analytics) License: MIT
π©οΈ GCP Data Lake Platform
Enterprise-scale data lake on Google Cloud Platform consolidating 4 heterogeneous source systems with automated ETL pipelines and real-time ingestion.
Highlights:
- 7 production Airflow DAGs orchestrating multi-source integration
- Bronze-Silver medallion architecture with Apache Iceberg
- Near real-time CDC with 30-minute refresh cycles
- 1.45 TB data volume with projected 3.98 TB growth
- API integration (Salesforce REST), batch processing, and streaming
Tech Stack: GCP, BigQuery, Cloud Composer (Airflow), Dataflow, Apache Beam, Python, SQL Industry: Logistics & Supply Chain License: MIT
Data analytics and BI platform tracking global cloud storage migration for 130+ million users with real-time executive dashboards.
Highlights:
- 130M+ users tracked across multiple regions
- Tableau & Power BI executive dashboards
- MySQL, Hive, Databricks integration
- Comprehensive data quality framework
- Sub-second query performance on 130M+ records
Tech Stack: Tableau, Power BI, MySQL, Apache Hive, Databricks, Spark SQL, Python Industry: Technology & Cloud Services License: MIT
SQL Server to Databricks migration expertise across multiple financial services engagements with complex stored procedure translation.
Highlights:
- Multiple 4-month consulting engagements
- T-SQL to Spark SQL/PySpark translation
- 10-100x performance improvements
- SCD Type 2 with Delta Lake
- Cursor elimination and vectorization
Tech Stack: Databricks, SQL Server, PySpark, Delta Lake, Spark SQL Industry: Financial Services (Banking, Insurance) License: MIT
Internship experience in revenue management and pricing analytics with focus on price elasticity modeling and A/B testing.
Highlights:
- Price elasticity analysis and demand forecasting
- Promotional campaign ROI measurement
- A/B testing and pricing experiments
- Revenue forecasting with 95%+ accuracy
- Statistical analysis and market research
Tech Stack: Excel, SQL, Tableau, Python, Statistical Analysis Industry: Media & Entertainment License: MIT
π¦ Data Engineering
Comprehensive collection of data engineering projects, POCs, and POKs using Python, Spark, SQL, and cloud platforms. Demonstrates end-to-end pipeline development and best practices.
Tech Stack: Python, Apache Spark, SQL, Cloud Platforms Updated: October 2025
ποΈ Data Architecture
Expertise in designing and optimizing data architectures for diverse business needs. Scalable solutions and architectural patterns.
Tech Stack: System Design, Data Modeling, Architecture Patterns License: MIT
βοΈ Cloud Data Architecture
Cloud-based data solutions and architectures demonstrating modern cloud-native approaches to data platform design.
Tech Stack: AWS, Azure, GCP, Infrastructure as Code License: MIT
π Data Analysis
Demonstrates skills in data manipulation, statistical analysis, visualization, and deriving actionable insights from complex datasets.
Tech Stack: Python, Pandas, Visualization Tools License: MIT
π¬ Data Science
Expertise in leveraging data to derive insights, build predictive models, and solve complex business problems using machine learning.
Tech Stack: Python, Scikit-learn, Machine Learning License: MIT
π Data Governance
Data governance frameworks, quality assurance, and regulatory compliance projects. Focus on data quality, lineage, and stewardship.
Tech Stack: Data Quality Tools, Governance Frameworks License: MIT
π Data Integration
Data integration solutions across various platforms and technologies. ETL/ELT processes and real-time data synchronization.
Tech Stack: Integration Tools, ETL/ELT Frameworks License: MIT
ποΈ Database Administration β 2
Database design, optimization, security, and management across various database systems. Performance tuning and best practices.
Tech Stack: PostgreSQL, MySQL, Database Management License: MIT
Ecological economics tool exploring the concept of metabolic rift through data analysis. Calculating nutrient flows, ecological degradation, and resource depletion in agricultural systems.
Focus: Nutrient Flow Analysis (NPK), Soil Health, Ecological Footprint, Environmental Justice Tech Stack: Python, Pandas, Data Visualization Status: Personal Project License: MIT
π IoT & Industry 5.0
Exploring Internet of Things and Industry 5.0 concepts including automation, sensors, and smart manufacturing.
Learning Focus: IoT Platforms, 5G, Edge Computing, Industrial Automation, Smart Sensors Status: Learning & Experimenting License: MIT
πΉοΈ Game Development
Exploring game development concepts, mechanics, and engines. Learning journey in creating interactive experiences and game logic.
Learning Focus: Unity, C#, Game Design Patterns, Physics, AI Status: Learning & Experimenting License: MIT
Applying data analytics to espresso extraction optimization. Understanding TDS, extraction yield, pressure profiling, and brew parameters for perfect shots.
Focus: Extraction Metrics, Pressure Profiling, Water Chemistry, Systematic Dialing-In Tech Stack: Python, Pandas, Data Analysis, Statistical Modeling Status: Personal Project License: MIT
- 6+ years in Data Engineering & Analytics
- Specialized in building scalable data pipelines and ETL/ELT solutions
- Expert in data warehouse design and cloud data platforms
- Strong background in data governance, quality assurance, and compliance
- Proficient in data visualization and business analytics tools
- Experience with big data technologies and distributed systems
skills = {
"Data Engineering": ["ETL/ELT", "Data Pipelines", "Data Warehousing", "Data Modeling"],
"Analytics": ["Statistical Analysis", "Data Visualization", "Business Analytics", "Reporting"],
"Cloud": ["AWS", "Azure", "GCP", "Cloud Architecture", "Serverless"],
"Programming": ["Python", "SQL", "PySpark", "Bash", "PowerShell"],
"Tools": ["Airflow", "dbt", "Spark", "Kafka", "Docker", "Kubernetes"],
"Databases": ["PostgreSQL", "MySQL", "MongoDB", "Redis", "Snowflake", "BigQuery", "Hive", "Databricks"],
"Governance": ["Data Quality", "Data Lineage", "Metadata Management", "Compliance"]
}π "Data engineering for the planet: Technical skills need to be applied to ecological and humanitarian challenges."
Thanks for visiting my profile! βοΈ Feel free to explore my repositories and don't hesitate to reach out for collaborations or discussions about data engineering, analytics, and exploring how technology can serve people and planet.
