About Me
Hello! I'm Harshul Nihalani.
I'm a passionate Data Engineer based in India, with 3 years of hands-on experience building robust, scalable data pipelines and transforming raw data into valuable insights. My journey in data engineering has been driven by a fascination with the power of data to solve complex business problems.
Throughout my career, I've specialized in managing full-scale data pipelines in Microsoft Fabric, processing data with Python and PySpark, and writing SQL across MySQL, PostgreSQL, and Snowflake. I'm proficient in orchestration tools like Apache Airflow and Dagster, and I bring data to life through Power BI dashboards and reports.
My Technical Toolbox
Data Processing
Databases
Orchestration & Platforms
Visualization & Tools
My Journey
Senior Data Engineer
Leading data infrastructure projects, managing full-scale data pipelines in Microsoft Fabric, and architecting ETL solutions using Python, PySpark, and modern orchestration tools. Delivering business intelligence through Power BI dashboards.
Data Engineer
Built and maintained data pipelines, worked extensively with SQL databases (MySQL, PostgreSQL, Snowflake), and implemented automation workflows using Apache Airflow and Dagster for data orchestration.
Junior Data Engineer
Started my data engineering journey, learning Python and PySpark for data processing, building ETL pipelines, and contributing to data warehouse development projects.
Graduated with a BCA from Presidency University
Completed my Bachelor of Computer Applications with a final CGPA of 8.9, building a strong foundation in computer science and data structures.
Completed 12th Grade at DPS Udaipur
Completed my higher secondary education with a score of 90%, setting the stage for my journey in technology and data.
My Work
Real-Time ETL Pipeline
Built a scalable real-time ETL pipeline using Microsoft Fabric and PySpark to process 10M+ daily transactions, reducing data latency from hours to minutes.
Enterprise Data Warehouse
Designed and implemented a multi-tiered data warehouse on Snowflake, integrating data from 15+ sources using Python and SQL for centralized analytics.
Automated Data Orchestration
Developed complex Airflow DAGs to orchestrate 50+ data workflows, achieving 99.5% pipeline reliability and reducing manual intervention by 80%.
Executive Analytics Dashboard
Created interactive Power BI dashboards connecting to PostgreSQL and Snowflake, providing real-time insights to C-level executives and reducing report generation time by 90%.
Real-Time Streaming Analytics
Implemented a real-time data streaming solution using Dagster for event-driven pipelines, processing 500K events per second with sub-second latency.
ML Feature Engineering Pipeline
Built an end-to-end feature engineering pipeline in Python and PySpark, serving ML models with automated data validation and quality checks across MySQL and MongoDB.
What My Clients Say
"Harshul is an exceptional developer with a keen eye for detail. He delivered our project on time and exceeded our expectations with his creativity and technical skill."
Jane Doe
CEO, Tech Solutions Inc.
"Working with Harshul was a fantastic experience. His communication was clear, and he was always ready to go the extra mile to ensure the final product was perfect."
John Smith
Marketing Head, Creative Co.
"The e-commerce platform Harshul built for us is robust, fast, and easy to manage. His expertise was invaluable throughout the entire process."
Emily White
Founder, Fashion Forward
Let's Connect
I'm currently available for freelance work and open to new opportunities. Whether you have a question or just want to say hi, my inbox is always open.