GitHub - Andre3002/cmu-week3-process-overview: CMU week 3 - whole process overview, data split, validation and model scoring

Very important to note that when looking at the python examples 5a and 6a, the dataset is in a very information-dense format

Column 1 called "data" is a column of array data. Each row in this column contains [5.1, 3.2, 1.4, 0.2] coresponding to (sepal length, sepal width, petal length, petal width)

Column 2 is the target variable encoded to 0, 1, or 2 based on if the plant type is setosa, versicolor or virginica

Column 4 contains just 3 rows, representing the list of unique target names (setosa, versicolor or virginica)

Column 6 has 4 rows, which are the list of feature names

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
Use Case Teams - Foundations of Data Science Module.xlsx		Use Case Teams - Foundations of Data Science Module.xlsx
cmu_ds_t5.pdf		cmu_ds_t5.pdf
cmu_ds_t6.pdf		cmu_ds_t6.pdf
session_5a_datascience_cmu.ipynb		session_5a_datascience_cmu.ipynb
session_6a_datascience_cmu.ipynb		session_6a_datascience_cmu.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

Andre3002/cmu-week3-process-overview

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages