From fdb7242a664aaa0291ee5c24c4ae1e0400068d3c Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:19:16 -0600
Subject: [PATCH 01/14] Update 4- Practice - Answer Key.ipynb

---
 AWS/4- Practice - Answer Key.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/4- Practice - Answer Key.ipynb b/AWS/4- Practice - Answer Key.ipynb
index 716b722..7b1a459 100644
--- a/AWS/4- Practice - Answer Key.ipynb
+++ b/AWS/4- Practice - Answer Key.ipynb
@@ -99,7 +99,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From 5c024c27fd75ecc5f62925ad82406d3d1942b54b Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:24:22 -0600
Subject: [PATCH 02/14] Update README.md

fixing california employment link
---
 AWS/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/README.md b/AWS/README.md
index 5746808..5514fea 100644
--- a/AWS/README.md
+++ b/AWS/README.md
@@ -80,7 +80,7 @@ All original data from this module was originally sourced from the following sit
 - [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)
 - [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)
-- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)
+- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)
 - [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)
 We subsequently picked only certain variables of interest, cleaned and created a composite dataset for the years 2020 and 2021 from the sources listed above. **We manipulated the variable named "Unemployment_rate" by using the 2020 rates in both the 2020 and 2021 Datasets**. We then separated these datasets into training, validation, and testing sets for each of these years to streamline the tutorials. Finally, we stored them in our group's [SFSU GitHub repository](https://github.com/MarcMachineLearning/Introduction-to-Machine-Learning/tree/main/Datasets).

From b0610db3b436f5d1dc2713853b4bace73ce154cd Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:37:21 -0600
Subject: [PATCH 03/14] Update README.md

fix link to data license
---
 AWS/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/README.md b/AWS/README.md
index 5514fea..6fb8bfe 100644
--- a/AWS/README.md
+++ b/AWS/README.md
@@ -93,7 +93,7 @@ We subsequently picked only certain variables of interest, cleaned and created a
 ## **License for Data**
-Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about/#license).
+Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about/#License).
 ![Creative commons license](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png)

From 384e1d759a9f45349c8a0d0136e88d52ed163c42 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:39:47 -0600
Subject: [PATCH 04/14] Update README.md

remove space in link to GCP module
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 07c5f24..f975687 100644
--- a/README.md
+++ b/README.md
@@ -17,7 +17,7 @@
 This module is geared towards beginners and does not require prior knowledge on a specific scientific discipline. The module is divided into three Jupyter notebooks as outlined at the beginning of this document. In addition to the notebooks mentioned, there are videos containing brief explanations about basic concepts in machine learning and what the code does in each step of the notebook. Below is an outline of the videos contained in each notebook with their respective links. These videos are already attached to the notebook.
-This module offers two computing pathways: [AWS (Amazon Web Services)](https://github.com/NIGMS/Introduction-to-Data-Science-for-Biology/tree/master/AWS) or [GCP (Google Cloud Platform)](https://github.com/NIGMS/Introduction-to-Data-Science-for-Biology/tree/master/Google%20Cloud). Users can choose their preferred cloud service to run the Jupyter notebooks, ensuring flexibilty and accessibilty based on their existing infrastructure or familairty. Detailed instructions for setting up and using either AWS or GCP for this module are provided within their corresponding folders within this repository.
+This module offers two computing pathways: [AWS (Amazon Web Services)](https://github.com/NIGMS/Introduction-to-Data-Science-for-Biology/tree/master/AWS) or [GCP (Google Cloud Platform)](https://github.com/NIGMS/Introduction-to-Data-Science-for-Biology/tree/master/GoogleCloud). Users can choose their preferred cloud service to run the Jupyter notebooks, ensuring flexibilty and accessibilty based on their existing infrastructure or familairty. Detailed instructions for setting up and using either AWS or GCP for this module are provided within their corresponding folders within this repository.
 ### 1- Introduction To Machine Learning: Decision Trees (10 video clips)

From 42f8ca18a59ac1012f6309e3a8d3adc92b910fe4 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:44:48 -0600
Subject: [PATCH 05/14] Update README.md

fix data license link
---
 AWS/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/README.md b/AWS/README.md
index 6fb8bfe..4075754 100644
--- a/AWS/README.md
+++ b/AWS/README.md
@@ -93,7 +93,7 @@ We subsequently picked only certain variables of interest, cleaned and created a
 ## **License for Data**
-Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about/#License).
+Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about).
 ![Creative commons license](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png)

From 2c42cc5b7106db30009cc7c0d6502990f01f8c6c Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:45:13 -0600
Subject: [PATCH 06/14] Update README.md

fix data license link
---
 GoogleCloud/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/GoogleCloud/README.md b/GoogleCloud/README.md
index 78a36c6..a13a17e 100644
--- a/GoogleCloud/README.md
+++ b/GoogleCloud/README.md
@@ -95,7 +95,7 @@ We subsequently picked only certain variables of interest, cleaned and created a
 ## **License for Data**
-Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about/#license).
+Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about).
 ![Creative commons license](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png)

From cc171a0475174017fe339b82a7da9f7f3b4773ee Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:54:30 -0600
Subject: [PATCH 07/14] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index f975687..d77d18e 100644
--- a/README.md
+++ b/README.md
@@ -96,7 +96,7 @@ All original data from this module was originally sourced from the following sit
 ## **License for Data**
-Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about/#license).
+Text and materials are licensed under a Creative Commons CC-BY-NC-SA license. The license allows you to copy, remix and redistribute any of our publicly available materials, under the condition that you attribute the work (details in the license) and do not make profits from it. More information is available [here](https://tilburgsciencehub.com/about).
 ![Creative commons license](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png)

From 6960b3a7af558712300f29bc27a348d7699f8252 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:58:37 -0600
Subject: [PATCH 08/14] Update 3- Practice.ipynb

---
 GoogleCloud/3- Practice.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/GoogleCloud/3- Practice.ipynb b/GoogleCloud/3- Practice.ipynb
index d3867e4..790b11b 100644
--- a/GoogleCloud/3- Practice.ipynb
+++ b/GoogleCloud/3- Practice.ipynb
@@ -98,7 +98,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From c65e12f12733df59127485a013f0377c7a7ecc6e Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 15:59:15 -0600
Subject: [PATCH 09/14] Update 3- Practice.ipynb

---
 AWS/3- Practice.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/3- Practice.ipynb b/AWS/3- Practice.ipynb
index d3867e4..790b11b 100644
--- a/AWS/3- Practice.ipynb
+++ b/AWS/3- Practice.ipynb
@@ -98,7 +98,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From 3a6e5858a55850acd822f5112f2e152836d09a73 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 16:01:33 -0600
Subject: [PATCH 10/14] Update 1- Intro to Machine Learning Decision Trees.ipynb

---
 AWS/1- Intro to Machine Learning Decision Trees.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/AWS/1- Intro to Machine Learning Decision Trees.ipynb b/AWS/1- Intro to Machine Learning Decision Trees.ipynb
index 06a29cc..462bb37 100644
--- a/AWS/1- Intro to Machine Learning Decision Trees.ipynb
+++ b/AWS/1- Intro to Machine Learning Decision Trees.ipynb
@@ -383,7 +383,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From 209c4e50881f9ec23d9b73d6a0cf596be20104a3 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 16:04:10 -0600
Subject: [PATCH 11/14] Update 1- Intro to Machine Learning Decision Trees.ipynb

---
 GoogleCloud/1- Intro to Machine Learning Decision Trees.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/GoogleCloud/1- Intro to Machine Learning Decision Trees.ipynb b/GoogleCloud/1- Intro to Machine Learning Decision Trees.ipynb
index e8a9982..587fe31 100644
--- a/GoogleCloud/1- Intro to Machine Learning Decision Trees.ipynb
+++ b/GoogleCloud/1- Intro to Machine Learning Decision Trees.ipynb
@@ -1148,7 +1148,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From 5af0dfbf15bc0a2fd50de9da5800e39a742f83b8 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 16:05:35 -0600
Subject: [PATCH 12/14] Update 4- Practice - Answer Key.ipynb

---
 GoogleCloud/4- Practice - Answer Key.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/GoogleCloud/4- Practice - Answer Key.ipynb b/GoogleCloud/4- Practice - Answer Key.ipynb
index f988d93..1d4f82e 100644
--- a/GoogleCloud/4- Practice - Answer Key.ipynb
+++ b/GoogleCloud/4- Practice - Answer Key.ipynb
@@ -99,7 +99,7 @@
 "\n",
 "- [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)\n",
 "- [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)\n",
- "- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)\n",
+ "- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)\n",
 "- [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)\n",
 "\n",
 "***Libraries/Packages***\n",

From 6796ada23bcbddfddf68656ea21575837aa1a8a4 Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 16:09:02 -0600
Subject: [PATCH 13/14] Update README.md

fix link to data source that was broken in most notebooks.
---
 GoogleCloud/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/GoogleCloud/README.md b/GoogleCloud/README.md
index a13a17e..c37d7f1 100644
--- a/GoogleCloud/README.md
+++ b/GoogleCloud/README.md
@@ -82,7 +82,7 @@ All original data from this module was originally sourced from the following sit
 - [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)
 - [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)
-- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)
+- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)
 - [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)
 We subsequently picked only certain variables of interest, cleaned and created a composite dataset for the years 2020 and 2021 from the sources listed above. **We manipulated the variable named "Unemployment_rate" by using the 2020 rates in both the 2020 and 2021 Datasets**. We then separated these datasets into training, validation, and testing sets for each of these years to streamline the tutorials. Finally, we stored them in our group's [SFSU GitHub repository](https://github.com/MarcMachineLearning/Introduction-to-Machine-Learning/tree/main/Datasets).

From e344326d5681a734c877d9d5fb08cdad6f3dd53c Mon Sep 17 00:00:00 2001
From: RossCampbellNIH2
Date: Wed, 6 Nov 2024 16:12:48 -0600
Subject: [PATCH 14/14] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index d77d18e..1ea95a8 100644
--- a/README.md
+++ b/README.md
@@ -84,7 +84,7 @@ All original data from this module was originally sourced from the following sit
 - [COVID cases data (California Health and Human Services Agency)](https://data.chhs.ca.gov/dataset/covid-19-time-series-metrics-by-county-and-state/resource/046cdd2b-31e5-4d34-9ed3-b48cdbc4be7a)
 - [COVID vaccination data (Los Angeles Times)](https://github.com/datadesk/california-coronavirus-data)
-- [Unemployment data (California Employment Development Dept.)](https://data.edd.ca.gov/Labor-Force-and-Unemployment-Rates/Local-Area-Unemployment-StatisticsdecisionLAUS-/e6gw-gvii)
+- [Unemployment data (California Employment Development Dept.)](https://labormarketinfo.edd.ca.gov/data/unemployment-and-labor-force.html)
 - [Election data (Harvard University)](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ)