Data Scientist, Health Science Researcher, and Bioinformatician
Data Scientist and Health Science Research expert with over 8 years of experience in data analytics, leveraging both structured and unstructured data sources. Proficient in applying machine learning, statistical modeling, and data science techniques to solve complex problems. Expertise in creating data-driven insights for executive-level decision-making, especially in healthcare settings.
Skill | Technologies/Tools |
---|---|
Database Management | MySQL, PostgreSQL, MongoDB, NoSQL |
Data Engineering | ETL/ELT, data modeling, schema design, data integration |
Machine Learning | Scikit-learn, TensorFlow, PyTorch, Neural Networks, Ensemble methods, XGBoost, LightGBM |
Statistics & Analytics | Regression, Bayesian/Frequentist inference, causal inference, experimental designs |
Natural Language Processing | Sentiment analysis, text classification, spaCy, NLTK, Hugging Face, BERT, GPT, Word2Vec |
Visualization & Dashboards | Plotly, Matplotlib, ggplot2, Altair, R Shiny, Dash, Power BI, Tableau |
Cloud Computing & Big Data | AWS EC2, S3, Azure, Google Cloud, Hadoop, Spark |
Version Control & DevOps | Git, Docker, CI/CD pipelines |
Programming | Python, R, SQL, NoSQL, Unix Shell, VBScript |
Degree | Institution | Dates | Details |
---|---|---|---|
Master of Data Science | University of British Columbia | Sep 2022 – Jun 2023 | Overall GPA: 95%. Recipient of the UBC MDS domestic scholarship ($10,000). |
Master of Science (Stem Cell Biology) | McGill University | Sep 2017 – Aug 2020 | Overall GPA: 4.0/4.0. Recipient of Quebec FRQS training scholarship ($35,000). |
Bachelor of Science (Honours Immunology) | McGill University | Sep 2013 – Jun 2017 | |
Project | Description | Link | Language/Package | Theme |
---|---|---|---|---|
BC Forest and Wildfire Dashboard | Developed an interactive dashboard using Python, Dash, Plotly, and GeoPandas to analyze forest and wildfire GIS data. | GitHub Link | Python Dash, GIS | Geospatial |
Gene Differential Expression Explorer | Created an R Shiny App to analyze RNASeq data using the EdgeR package for differential gene expression analysis. | GitHub Link | R Shiny | Bioinformatics |
Vancouver Housing Market Dashboard | Built a dashboard in R Shiny to explore property values in Vancouver, integrating filters to allow dynamic exploration. | GitHub Link | R Shiny | Real Estate Market Analysis |
Employee Retention Dashboard | Created a Python-based web application to visualize employee turnover and satisfaction metrics from simulated HR data. | GitHub Link | Python Dash, Exploratory Data Analysis | People Data |
Credit Card Default Prediction | Built a machine learning model using LightGBM to predict customer credit defaults, achieving 90% accuracy. | GitHub Link | Python, Machine Learning | Finance |
Spaceship Titanic Classification | Developed a model using LightGBM and XGBoost, achieving 80.5% accuracy in the Spaceship Titanic Kaggle competition. | Kaggle Link | Python, Machine Learning | Classification |
Breast Cancer Recurrence Prediction | Developed a Naive Bayes model for predicting breast cancer recurrence, using SHAP feature importance analysis. | Kaggle Link | Python, Machine Learning | Health |
Body Mass Index Calculator | Designed a Python and R package to calculate BMI, project BMI trajectory, estimate ideal calorie intake, and suggest exercise plans to achieve targeted weight goals. | R Package Link, Python Package Link | Python, R, Package Development | Health |
Position | Organization | Dates | Responsibilities |
---|---|---|---|
Team Lead – UBC Capstone (Autozen.com) | Autozen.com | May 2023 – Jun 2023 | Led a data science team to enhance a car valuation model, increasing accuracy by 7%. Delivered executive-level presentations and built interactive dashboards. |
Quantitative Data Analyst / Research Asst. | McGill University | Sep 2016 – Aug 2022 | Led quantitative analysis of structured genomic and health data. Integrated data from multiple sources. Published papers and mentored junior data scientists. |
Quantitative Data Analyst / Summer Student | BC Children’s Hospital | May 2015 – Aug 2022 | Led quantitative data analysis for pathogen detection, built a 21,000-species genome database, and automated data workflows. |
Position | Organization | Dates | Responsibilities |
---|---|---|---|
Senior Advisor / Co-president / VP Internal | Student Research Initiative Club | 2016 – 2022 | Led executive team to organize bi-annual events, mentored undergraduates, and established procedures for smooth leadership transitions. |
COVID-19 Vaccine Clinic Volunteer | Montreal General Hospital | Apr 2021 – May 2021 | Assisted with non-medical tasks during the COVID-19 pandemic, serving over 300 people weekly. |
Friendly Visitor | Montreal Chinese Hospital | 2017 – 2020 | Provided companionship to elders, assisting them with physiotherapy and enhancing their well-being. |
Basketball Coach | Churchill Secondary School | 2022 – 2023 | Coached basketball team, developed strategies, and led practices. |
Student Representative | UBC MDS Program | Oct 2022 – Nov 2022 | Served as student representative for the UBC Master of Data Science program. |
Intramural Basketball Team Captain | McGill | 2014 – 2022 | Led and coached intramural basketball teams, improving team performance and strategy. |
- UBC Master of Data Science Domestic Scholarship ($10,000) – 2022
- Quebec Master's Training Award ($35,000) – 2018
- McGill Physiology Excellence Award ($7,500) – 2017
- WHMIS Certified
- First Aid in the Workplace