🌱 I'm a Data and Analytics Engineer with a robust foundation in Computer Science Engineering. With 5 years of professional experience, I specialize in Python, PySpark, SQL, AWS, GCP, Kafka, OOPS, core Machine Learning and DE System Design. I currently serve as Senior Data Scientist at Definitive Healthcare.
🌱 My professional work includes developing
▶️ Enterprise ETL website using Python, PySpark
▶️ Developing E2E customer segmentation models using Python and Machine Learning
🌱 I am well versed in
▶️ Python, PySpark, SQL, OOPS, System Design
▶️ Data Modelling, ETL, Batch & Streaming data pipelines, Kafka, Data Orchestration - Airflow
▶️ Cloud Services - S3, Redshift, EMR, EC2, Glue, GCS, Bigquery, Dataflow
▶️ Tools - Git, Bitbucket, VSCode, PYCharm, RStudio
🌱 In the realm of Data Analysis, I have engaged in case studies and exploratory data analysis (EDA) using Python, SQL, and data visualization using Tableau.
🌱 I have built robust data pipelines both in batch and stream data processing. Worked with multiple Databases and types of data.
🌱 Beyond the world of data and laptops, I have a passion for fitness. During my free time, I enjoy cooking and playing the violin.