Summary

Results-driven Data Engineer with 3+ years of experience designing and optimizing ETL pipelines, real-time streaming solutions, and cloud-based big data architectures on AWS and Azure. Proven track record of reducing data latency by 60%, improving data accuracy by 25%, and enhancing system reliability across high-volume datasets. Adept at Python, PySpark, and Spark Streaming, with a passion for delivering scalable, high-performance analytics platforms.

Skills

  • Python
  • Scala
  • SQL
  • MySQL
  • Hive
  • SQLite
  • Apache Spark
  • PySpark
  • Hadoop
  • Spark Streaming
  • ETL Development
  • Data Modeling
  • AWS
  • Azure
  • Snowflake
  • Git

Experience

Carelon Global Solutions - Data Engineer

Aug 2023 - Present | Gurugram, India

  • Automated AWS Step Functions workflows, cutting manual intervention by 70% and accelerating data delivery by 30%
  • Built a data-quality framework (cleansing, validation, anomaly detection) to boost dataset accuracy by 25%
  • Engineered CDC pipelines for Redshift and Hive, achieving 99.8% real-time consistency on high-volume data
  • Migrated a data lake to an optimized Redshift warehouse via AWS EMR, slashing query times by 60%
  • Developed serverless Lambda functions to modularize processing and reduce operational overhead by 45%

Celebal Technologies - Data Engineer Intern

Jun 2022 – Jul 2022 | Jaipur, India

  • Optimized Azure SQL queries and schemas, reducing execution time by 40% and increasing throughput.
  • Designed ETL pipelines in Azure Data Factory, improving integration reliability by 35%.
  • Managed Azure MySQL clusters with high availability (99.9% uptime) and horizontal scaling.
  • Architected Azure Blob Storage solutions for structured/unstructured data, reducing storage costs by 30%.

Projects

Network Traffic Analyzer

Developed a cross-platform packet capture and ETL tool using Python's socket library for TCP/UDP/ICMP traffic. Designed a zero-install Tkinter-based UI for real-time monitoring and automated archival. Enabled security teams to analyze traffic without additional software, cutting setup time by 80%.

Tech Stack: Python, SQLite, Tkinter, Linux

Project Demo

Education

Bachelor of Technology, Computer Science

Poornima Group of Institutions, Jaipur

2019 - 2023

Certifications

  • Microsoft Azure AI Fundamentals (AI-900)
  • Udacity AI Programming with Python Nanodegree

Contact Me