DSCI 310 Group 16 project on Predicting Win Rate of Tennis Players done using R.
=================
- Authors/Contributors:
- Summary of Project
- Important Project Rules and Regulations
- Detailed Analysis
- Dependencies required
- How to reproduce this project's Analysis.
- Licenses
=================
- Ammar Bagharib
- Miles Brodie
- Sammy Gulipalli
The goal of this project is to predict the win rate of tennis players using machine learning techniques. The motivation behind this project is to explore the relationship between various player statistics and their win rate, and to develop a model that can accurately predict a player's win rate based on these statistics. The analysis has been carried out in R using various packages from the tidyverse and tidymodels ecosystems.
The detailed analysis files can be found here
The following are dependencies used within this project, and that which is necessary to reproduce an identical analysis.
Package Name | Version |
---|---|
R | 4.1.3 |
data.table | 1.12.8 |
GGally | 2.1.0 |
here | 1.0.1 |
kknn | 1.3.1 |
tidymodels | 0.1.1 |
tidyverse | 1.3.0 |
knitr | 1.4.1 |
see Dockerfile
-
Sign up/ Log in a Docker account.
-
Install Docker in your computer.
Fork this project's repository on GitHub and then clone the fork to your local machine. For more details on forking see the GitHub Documentation. Type in the following command in your Terminal.
git clone https://github.com/mjbrodie/dsci-310-group-16
To keep your fork up to date with changes in this repo, you can use the fetch upstream button on GitHub. More details regarding fork syncing, e.g., syncing from your terminal instead of directly on Github can be found here.
Then, fire up your Terminal on your local machine, and using Docker, follow these steps to reproduce this analysis:
Type in the following command in Terminal.
docker pull mjbrodie/dsci-310-group-project:latest
Run the Docker appliation on your local machine. Then, run the following in your terminal.
docker run --platform linux/amd64 -p 8787:8787 -e DISABLE_AUTH=true mjbrodie/dsci-310-group-project:latest
Run your preferred browser, (we used Chrome) and visit http://localhost:8787
Once done, within the terminal in the Rstudio, run:
make all
It should take max 10 minutes to run, be patient! Once done, you should see the following message:
Output created: Predicting_Win_Rate_of_Tennis_Players.html
There'll be many new files created, you can explore them within the data
and Analysis
folder on your Rstudio environment.
If you'd like to restart your workflow, within the terminal in the Rstudio, run:
make clean
All the best!!
- MIT license for project analysis (completed in Rmarkdown)