Summary
Results-driven Data Engineer with 3+ years of experience designing and optimizing ETL pipelines, real-time streaming solutions, and cloud-based big data architectures on AWS and Azure. Proven track record of reducing data latency by 60%, improving data accuracy by 25%, and enhancing system reliability across high-volume datasets. Adept at Python, PySpark, and Spark Streaming, with a passion for delivering scalable, high-performance analytics platforms.
Skills
- Python
- Scala
- SQL
- MySQL
- Hive
- SQLite
- Apache Spark
- PySpark
- Hadoop
- Spark Streaming
- ETL Development
- Data Modeling
- AWS
- Azure
- Snowflake
- Git
Experience
Carelon Global Solutions - Data Engineer
Aug 2023 - Present | Gurugram, India
- Automated AWS Step Functions workflows, cutting manual intervention by 70% and accelerating data delivery by 30%
- Built a data-quality framework (cleansing, validation, anomaly detection) to boost dataset accuracy by 25%
- Engineered CDC pipelines for Redshift and Hive, achieving 99.8% real-time consistency on high-volume data
- Migrated a data lake to an optimized Redshift warehouse via AWS EMR, slashing query times by 60%
- Developed serverless Lambda functions to modularize processing and reduce operational overhead by 45%
Celebal Technologies - Data Engineer Intern
Jun 2022 – Jul 2022 | Jaipur, India
- Optimized Azure SQL queries and schemas, reducing execution time by 40% and increasing throughput.
- Designed ETL pipelines in Azure Data Factory, improving integration reliability by 35%.
- Managed Azure MySQL clusters with high availability (99.9% uptime) and horizontal scaling.
- Architected Azure Blob Storage solutions for structured/unstructured data, reducing storage costs by 30%.
Projects
Network Traffic Analyzer
Developed a cross-platform packet capture and ETL tool using Python's socket library for TCP/UDP/ICMP traffic. Designed a zero-install Tkinter-based UI for real-time monitoring and automated archival. Enabled security teams to analyze traffic without additional software, cutting setup time by 80%.
Tech Stack: Python, SQLite, Tkinter, Linux
Education
Bachelor of Technology, Computer Science
Poornima Group of Institutions, Jaipur
2019 - 2023
Certifications
- Microsoft Azure AI Fundamentals (AI-900)
- Udacity AI Programming with Python Nanodegree