hcwang24/README.md

WELCOME TO MY PAGE 👋

Data Scientist, Health Science Researcher, and Bioinformatician

Objective

Data scientist and health science researcher with over 8 years of experience in data analytics across structured and unstructured data sources. Proficient in applying machine learning, statistical modeling, and data science techniques to complex problems, with expertise in turning data into insights for executive-level decision-making, especially in healthcare settings.

Core Skills

| Skill | Technologies/Tools |
| --- | --- |
| Database Management | MySQL, PostgreSQL, MongoDB, NoSQL |
| Data Engineering | ETL/ELT, data modeling, schema design, data integration |
| Machine Learning | Scikit-learn, TensorFlow, PyTorch, Neural Networks, Ensemble methods, XGBoost, LightGBM |
| Statistics & Analytics | Regression, Bayesian/Frequentist inference, causal inference, experimental designs |
| Natural Language Processing | Sentiment analysis, text classification, spaCy, NLTK, Hugging Face, BERT, GPT, Word2Vec |
| Visualization & Dashboards | Plotly, Matplotlib, ggplot2, Altair, R Shiny, Dash, Power BI, Tableau |
| Cloud Computing & Big Data | AWS EC2, S3, Azure, Google Cloud, Hadoop, Spark |
| Version Control & DevOps | Git, Docker, CI/CD pipelines |
| Programming | Python, R, SQL, NoSQL, Unix Shell, VBScript |

Education

| Degree | Institution | Dates | Details |
| --- | --- | --- | --- |
| Master of Data Science | University of British Columbia | Sep 2022 – Jun 2023 | Overall GPA: 95%. Recipient of the UBC MDS domestic scholarship ($10,000). |
| Master of Science (Stem Cell Biology) | McGill University | Sep 2017 – Aug 2020 | Overall GPA: 4.0/4.0. Recipient of Quebec FRQS training scholarship ($35,000). |
| Bachelor of Science (Honours Immunology) | McGill University | Sep 2013 – Jun 2017 | |

Data Science Projects

| Project | Description | Link | Language/Package | Theme |
| --- | --- | --- | --- | --- |
| BC Forest and Wildfire Dashboard | Developed an interactive dashboard using Python, Dash, Plotly, and GeoPandas to analyze forest and wildfire GIS data. | GitHub Link | Python Dash, GIS | Geospatial |
| Gene Differential Expression Explorer | Created an R Shiny app to analyze RNA-seq data using the edgeR package for differential gene expression analysis. | GitHub Link | R Shiny | Bioinformatics |
| Vancouver Housing Market Dashboard | Built a dashboard in R Shiny to explore property values in Vancouver, with filters for dynamic exploration. | GitHub Link | R Shiny | Real Estate Market Analysis |
| Employee Retention Dashboard | Created a Python-based web application to visualize employee turnover and satisfaction metrics from simulated HR data. | GitHub Link | Python Dash, Exploratory Data Analysis | People Data |
| Credit Card Default Prediction | Built a machine learning model using LightGBM to predict customer credit defaults, achieving 90% accuracy. | GitHub Link | Python, Machine Learning | Finance |
| Spaceship Titanic Classification | Developed a model using LightGBM and XGBoost, achieving 80.5% accuracy in the Spaceship Titanic Kaggle competition. | Kaggle Link | Python, Machine Learning | Classification |
| Breast Cancer Recurrence Prediction | Developed a Naive Bayes model for predicting breast cancer recurrence, with SHAP feature importance analysis. | Kaggle Link | Python, Machine Learning | Health |
| Body Mass Index Calculator | Designed a Python and R package to calculate BMI, project BMI trajectory, estimate ideal calorie intake, and suggest exercise plans to achieve targeted weight goals. | R Package Link, Python Package Link | Python, R, Package Development | Health |

Work Experience

| Position | Organization | Dates | Responsibilities |
| --- | --- | --- | --- |
| Team Lead – UBC Capstone (Autozen.com) | Autozen.com | May 2023 – Jun 2023 | Led a data science team to enhance the car valuation model, increasing accuracy by 7%. Delivered executive-level presentations and built interactive dashboards. |
| Quantitative Data Analyst / Research Asst. | McGill University | Sep 2016 – Aug 2022 | Led quantitative analysis of structured genomic and health data. Integrated data from multiple sources. Published papers and mentored junior data scientists. |
| Quantitative Data Analyst / Summer Student | BC Children’s Hospital | May 2015 – Aug 2022 | Led quantitative data analysis for pathogen detection, built a 21,000-species genome database, and automated data workflows. |

Volunteer Experience

| Position | Organization | Dates | Responsibilities |
| --- | --- | --- | --- |
| Senior Advisor / Co-president / VP Internal | Student Research Initiative Club | 2016 – 2022 | Led executive team to organize bi-annual events, mentored undergraduates, and established procedures for smooth leadership transitions. |
| COVID-19 Vaccine Clinic Volunteer | Montreal General Hospital | Apr 2021 – May 2021 | Assisted with non-medical tasks during the COVID-19 pandemic, serving over 300 people weekly. |
| Friendly Visitor | Montreal Chinese Hospital | 2017 – 2020 | Provided companionship to elders, assisting them with physiotherapy and enhancing their well-being. |
| Basketball Coach | Churchill Secondary School | 2022 – 2023 | Coached basketball team, developed strategies, and led practices. |
| Student Representative | UBC MDS Program | Oct 2022 – Nov 2022 | Served as student representative for the UBC Master of Data Science program. |
| Intramural Basketball Team Captain | McGill | 2014 – 2022 | Led and coached intramural basketball teams, improving team performance and strategy. |

Awards & Certifications

  • UBC Master of Data Science Domestic Scholarship ($10,000) – 2022
  • Quebec Master's Training Award ($35,000) – 2018
  • McGill Physiology Excellence Award ($7,500) – 2017
  • W.H.M.I.S Certified
  • First Aid in Workplace

Pinned

  1. GeneDifferentialExplorer Public

    A Shiny app that automates the analysis pipeline for RNA-seq data using the edgeR package.

    R 1

  2. BC_Forest_WildFire_Dashboard Public

    Interactive dashboard for exploring and analyzing wildfire incidents and forest data in British Columbia.

    Jupyter Notebook

  3. van_houses Public

    Forked from UBC-MDS/van_houses

    Our R-based dashboard application gives an easy way for people to explore housing prices in Vancouver city.

    R 1

  4. employee_retention Public

    An interactive Employee Retention Dashboard that visualizes simulated data to analyze turnover trends and employee satisfaction.

    Python 1

  5. bioseq Public

    A test example of building a Python application for processing DNA sequences.

    Python

  6. breast_cancer_predictor Public

    Forked from ttimbers/breast_cancer_predictor

    Demo of a data analysis project for DSCI 522 (Data Science workflows) at UBC

    R