In this assignment, we will work with a small dataset of diabetes patients taken from here.

|     | AGE | SEX | BMI  | BP   | S1  | S2    | S3   | S4  | S5     | S6  | Y   |
| --- | --- | --- | ---- | ---- | --- | ----- | ---- | --- | ------ | --- | --- |
| 0   | 59  | 2   | 32.1 | 101. | 157 | 93.2  | 38.0 | 4.  | 4.8598 | 87  | 151 |
| 1   | 48  | 1   | 21.6 | 87.0 | 183 | 103.2 | 70.  | 3.  | 3.8918 | 69  | 75  |
| 2   | 72  | 2   | 30.5 | 93.0 | 156 | 93.6  | 41.0 | 4.0 | 4.     | 85  | 141 |
| ... | ... | ... | ...  | ...  | ... | ...   | ...  | ... | ...    | ... | ... |

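
To get a feel for the first task, here is a minimal sketch that computes the mean and variance of a few columns using only the three sample rows shown above. It uses Python's standard `statistics` module so that it runs anywhere; the names `sample` and `summary` are illustrative and not part of the notebook. In the notebook itself, with the full dataset in a pandas DataFrame, `DataFrame.mean()` and `DataFrame.var()` would give the same kind of summary.

```python
from statistics import mean, variance

# A subset of columns from the three sample rows in the table above.
# The real assignment works on the full dataset loaded in the notebook.
sample = {
    "AGE": [59, 48, 72],
    "SEX": [2, 1, 2],
    "BMI": [32.1, 21.6, 30.5],
    "BP": [101.0, 87.0, 93.0],
    "Y": [151, 75, 141],
}

# Per-column mean and sample variance (divides by n - 1, like pandas' default).
summary = {
    name: {"mean": mean(values), "variance": variance(values)}
    for name, values in sample.items()
}

for name, stats in summary.items():
    print(f"{name}: mean={stats['mean']:.2f}, variance={stats['variance']:.2f}")
```

With only three rows these numbers say nothing about the population; the point is only to show the shape of the computation.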
- Open the assignment notebook in a Jupyter Notebook environment
- Complete all tasks listed in the notebook, namely:
[ ] Compute the mean and variance of every variable
[ ] Plot boxplots for BMI, BP and Y depending on gender
[ ] What is the distribution of the Age, Sex, BMI and Y variables?
[ ] Test the correlation between different variables and disease progression (Y)
[ ] Test the hypothesis that the degree of diabetes progression is different between men and women
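
The last task asks for a hypothesis test on the difference in Y between the two sexes. One possible approach, shown here only as a sketch, is an exact permutation test on the difference in group means. This is a technique chosen for illustration, not necessarily the one the notebook expects; a two-sample t-test such as `scipy.stats.ttest_ind` is a common alternative. The function name `exact_permutation_pvalue` is hypothetical, and the demo uses only the Y values from the three sample rows above, split by the SEX code (the table does not state which code corresponds to which sex).

```python
from itertools import combinations
from statistics import mean


def exact_permutation_pvalue(group_a, group_b):
    """Two-sided exact permutation test for a difference in group means."""
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))
    extreme = 0
    total = 0
    # Enumerate every way to relabel n_a of the pooled values as group A.
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Count relabelings at least as extreme as the observed split
        # (small tolerance guards against floating-point ties).
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Y values from the three sample rows, grouped by SEX code 1 and 2.
y_sex1 = [75]
y_sex2 = [151, 141]
p = exact_permutation_pvalue(y_sex1, y_sex2)
print(f"p-value: {p:.3f}")
```

Exhaustive enumeration is only feasible for very small groups. On the full dataset one would instead sample random relabelings or use a library test, and then interpret the resulting p-value against a chosen significance level.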

| Exemplary | Adequate | Needs Improvement |
| --- | --- | --- |
| All required tasks are complete, graphically illustrated and explained | Most of the tasks are complete, explanations or takeaways from graphs and/or obtained values are missing | Only basic tasks such as computation of mean/variance and basic plots are complete, no conclusions are made from the data |