Data Engineer | Azure Databricks | PySpark | US Healthcare Domain
I am a highly motivated, results-driven Data Engineer with over 3.5 years of experience in the US healthcare domain. I have a proven track record of using PySpark, Python, and SQL for complex data transformations, along with advanced proficiency in Azure Databricks, Azure Synapse, Azure Data Lake, Azure Data Factory, and Airflow.
I hold the Microsoft Azure Data Engineer Associate and Databricks Data Engineer Associate certifications, and I excel in designing and implementing scalable, efficient, and robust ETL solutions that drive business value.
- Programming Languages: Python, SQL, PySpark, Scala, Shell Scripting
- Big Data Technologies: Apache Spark, Hadoop, Hive
- Cloud Platforms: Azure Databricks, Azure Synapse, Azure Data Lake, Azure Data Factory, Snowflake
- Orchestration Tools: Airflow, Control-M, ASG Zena
- DevOps Tools: Git, Jenkins, UrbanCode Deploy, Azure DevOps (ADO)
- Project Management: Jira, Agile Methodologies
- Domain Expertise: US Healthcare
- Microsoft Certified:
  - Azure Data Engineer Associate (DP-203)
  - Azure Data Fundamentals (DP-900)
- Databricks Certified:
  - Data Engineer Associate
  - Lakehouse Fundamentals
- Infosys Certifications:
  - MySQL Associate
  - Spark Professional
- Project: Nexus for Health (Healthcare Domain)
  - Developed and optimized ETL pipelines using Azure Databricks and Azure Data Lake.
  - Utilized PySpark and SQL for complex data transformations.
  - Implemented Delta Lake solutions for efficient data storage and retrieval.
  - Collaborated with cross-functional teams to align deliverables with business objectives.
  - Contributed to design documentation and participated in Agile sprints.
- Project: Data Analytics Platform Migration (Healthcare Domain)
  - Developed and optimized PySpark scripts for data migration to the Azure cloud.
  - Migrated 100+ TB of healthcare data while preserving data integrity and consistency.
  - Reduced query processing time by 30% using Azure Synapse.
  - Implemented monitoring with Azure Monitor and Log Analytics.
  - Collaborated with data scientists to support advanced data modeling and analytics.
- Insta Awards: Award of Appreciation by Infosys DNA (Sep 2021, Mar 2022)
- EY GDS User Recognition Award
Bachelor of Engineering in Computer Science
Shri Ram Group of Institutions, Jabalpur, MP - GPA: 8.0
- Email: mobashshir2mau@gmail.com
- Location: New Delhi, India - 110025