Biomedical engineering - miniproject - data exploration
Welcome to the Data Exploration miniproject. You will learn how to clean and prepare data and build a pipeline that can be used for further exploration. The goal is that by completing this lab, you understand the first stage of data processing pipeline.
-
(Optional step) Fork the repository to your githab account. It allows you to upload the changes to your own github
-
Clean all the previous copies of this repo (it could be downloaded by other students).
cd ls -all rm -rf bme-*
-
Clone the repository to your local PC
Go to your home directory, and clone the repository. In case you made a copy (fork) provide your own URL
cd git clone https://github.com/tstokrk/bme-data-exp.git cd bme-data-exp
-
(Optional step) Create a new conda environment.
If you're using mini conda this step help you tune the the necessary Python packages.
All necessary packages are listed in requirements.yml file, that defines dataexp env.
Before you create the new dataexp env using conda, check if there is no different env with the same name on your PC. Remove all env that can create a conflict with the new one
conda env list conda env remove --name dataexp
Create the new env using the package list from the provided file
conda env create -f requirements.yml source activate dataexp
-
Open the Jupyter Notebook and follow the instructions
jupyter notebook dataexp.ipynb
-
(Optional step) Commit and push all the changes to your own github repo
git commit -m "My update.." git push origin master