Lead Data Integration Engineer
14+ years of experience building scalable data platforms across startups and global enterprises. Specializing in cloud data architecture, real-time streaming pipelines, and Data Mesh design — with hands-on leadership from IKEA and BYJU'S to IMMO Digital and Reveleer.
I'm a Lead Data Integration Engineer based in Chennai, India, passionate about creating scalable data solutions with 14 years of experience across individual contributor and team lead roles. My background spans global organizations in Europe and India, delivering innovative solutions to complex data challenges.
Currently at Reveleer, I design ETL/ELT pipelines using DBT, Apache Airflow, and Snowflake in a healthcare big data environment, enforcing data modeling standards and CI/CD practices. Previously at IMMO Digital Solutions, I led a team of 6 Senior Engineers building a Data Mesh platform with real-time Kafka streaming and Snowflake from the ground up using Terraform.
Beyond my day job, I actively build open-source data tools — including NSE stock analytics and a full lakehouse architecture — and enjoy combining data engineering with financial market analysis.
Open-source modern data lakehouse built with Apache Iceberg, Trino, dbt, Apache Airflow, and MinIO. Demonstrates a production-grade data platform with orchestrated ELT pipelines, object storage, and a distributed SQL query engine — all containerized with Docker Compose.
Financial data ETL platform for NSE (National Stock Exchange) market data. Automates data collection, processing, and analysis of Indian stock market data with custom analytics and visualization — combining data engineering expertise with financial market insights.
Engineered Snowflake from the ground up using Terraform modules. Built a full Kafka/MSK streaming architecture with connectors to Snowflake, formulated data contracts, and managed Kafka topic creation and access control for a Series B UK real-estate scale-up.
Designed and implemented scalable ETL/ELT pipelines for healthcare big data at Reveleer using DBT, Apache Airflow, and Snowflake. Powers near-real-time data marts with Snowflake Streams, Tasks, and Materialized Views integrated with Tableau for stakeholder reporting.
Achieved a landmark performance optimization at TCS (client: IKEA Sweden) — tuned an Oracle batch process from a 7-day runtime down to 20 hours, a 91% reduction. Deep expertise in PL/SQL, query optimization, and large-scale batch architecture.
Streamlined Snowflake operational costs through warehouse right-sizing, clustering key design, query profile analysis, and intelligent use of Snowflake-native features to reduce compute spend while maintaining SLA performance for analytics teams.
AI-powered healthcare technology company — value-based care workflows, data & analytics
UK-based Series B scale-up — next-gen residential real estate portfolios, 100+ global team
India's leading edtech company — personalized online learning for students
Data Mesh Architecture — Developed data contracts and streaming architecture for a platform rooted in Data Mesh concepts at IMMO Digital (Feb 2024)
Snowflake Cost Reduction — Streamlined costs associated with Snowflake operations through query and warehouse optimization (Jun 2021)
Oracle Batch: 7 days → 20 hours — Achieved a 91% performance improvement in Oracle batch processing for IKEA (Apr 2018)
Verified credential on Credly — click to view the full badge and issuing organization
Verify on Credly3× certified in Apache Airflow — orchestration, DAG development, and advanced patterns
3 Badges EarnedOpen to discussing data engineering, cloud architecture, team leadership, or the Indian stock market. Reach out — I'd love to connect!
📧 ksugaan@zohomail.in | 📞 +91 97896 34295 | 📍 Chennai, Tamil Nadu 600126