forked from rstudio-education/stat545
-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.Rmd
143 lines (104 loc) · 7.74 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
---
title: "STAT 545"
subtitle: "Data wrangling, exploration, and analysis with R"
author:
- Jenny Bryan
- The STAT 545 TAs
knit: "bookdown::render_book"
site: bookdown::bookdown_site
url: 'https\://stat545.com/'
github-repo: rstudio-education/stat545
twitter-handle: jennybryan
cover-image: assets/stat545-logo.png
documentclass: book
bibliography: [book.bib, packages.bib]
biblio-style: apalike
link-citations: yes
description: "STAT 545: Data wrangling, exploration, and analysis with R."
favicon: assets/favicon.ico
---
# Welcome to STAT 545 {-}
```{r include = FALSE}
source("common.R")
htmltools::tagList(rmarkdown::html_dependency_font_awesome())
library(tidyverse)
library(devtools)
library(DT)
```
Learn how to:
* explore, groom, visualize, and analyze data,
* make all of that reproducible, reusable, and shareable,
* using R.
This site is about everything that comes up during data analysis **except for statistical modelling and inference**. This might strike you as strange, given R's statistical roots. First, let me assure you we believe that modelling and inference are important. But the world already offers a lot of great resources for doing statistics with R.
The design of STAT 545 was motivated by the need to provide more balance in applied statistical training. Data analysts spend a considerable amount of time on project organization, data cleaning and preparation, and communication. These activities can have a profound effect on the quality and credibility of an analysis. Yet these skills are rarely taught, despite how important and necessary they are. STAT 545 aims to address this gap.
## History and future {-}
These materials originated in the STAT 545 course at the University of British Columbia:
> "The STAT 545 course became notable as an early example of a data science course taught in a statistics program. It is also notable for its focus on teaching using modern R packages, Git and GitHub, its extensive sharing of teaching materials openly online, and its strong emphasis on practical data cleaning, exploration, and visualization skills, rather than algorithms and theory."
>
> --- [Wikipedia](https://en.wikipedia.org/wiki/Jenny_Bryan)
The main author, Jenny Bryan ([jennybryan.org](https://jennybryan.org)), developed this version of STAT 545 as a professor at UBC. She has since joined [RStudio](https://www.rstudio.com/) as a Software Engineer, on the [tidyverse](https://www.tidyverse.org/) and [r-lib](https://github.com/r-lib/) teams and is an adjunct professor at UBC. In September 2019, we (amicably) created separate spaces for the ongoing maintenance of this content and the continued offerings of STAT 545 at UBC (<https://stat545.stat.ubc.ca>), which is alive and well.
We plan to continue maintaining these resources, as they are still used in [STAT 545 at UBC](https://stat545.stat.ubc.ca) and by people teaching themselves R. Some topics have since been developed more fully elsewhere and we may link out to those resources. For example, the Git and GitHub content of STAT 545 eventually grew into its own website: [happygitwithr.com](https://happygitwithr.com). Some material has been retired, but is archived in the [repository of the old website](https://github.com/STAT545-UBC/STAT545-UBC-original-website). Finally, the new website has URLs that are more human-friendly; we believe we created the necessary redirects, so we don't break other people's links. If you think we've missed one, please let us know [in an issue](https://github.com/rstudio-education/stat545/issues).
## Other contributors {-}
Several STAT 545 TAs were instrumental in the development of these materials and members of the RStudio Education Team ported the [original website](https://github.com/STAT545-UBC/STAT545-UBC-original-website) into the modern and more maintainable framework we enjoy today:
* TAs who contributed content: [Dean Attali](http://deanattali.com/) (web applications with Shiny), [Julia Gustavsen](https://www.juliagustavsen.com) (Shiny), [Shaun Jackman](http://sjackman.ca) (automating workflows), [Bernhard Konrad](https://twitter.com/BernhardKonrad) (system setup, package development, the shell), [Gloria Li](https://twitter.com/LuolanLi) (regular expressions), [Andrew MacDonald](https://twitter.com/polesasunder) (getting data from the web), [Kieran Samuk](https://ksamuk.github.io) (regular expressions)
* RStudio: Alison Hill (<https://alison.rbind.io>) and intern Grace Lawley (<https://grace.rbind.io>) lead the heroic effort to port a vintage R Markdown website into bookdown. Intern Desirée De Leon (<https://desiree.rbind.io>) contributed design expertise.
## Colophon {-}
This book was written in [bookdown](http://bookdown.org/) inside [RStudio](http://www.rstudio.com/ide/). The website [stat545.com](https://stat545.com) is hosted with [Netlify](https://www.netlify.com), and automatically updated after every commit by [Travis-CI](https://travis-ci.org). The complete source is available from [GitHub](https://github.com/rstudio-education/stat545).
The [STAT 545 logo](assets/stat545-logo.png) and the book style was designed by [Desirée De Leon](https://desiree.rbind.io/).
This version of the book was built with:
```{r message = FALSE, warning = FALSE, echo = FALSE}
# needed because new_session is set to true in _bookdown.yml
all_pkgs <- renv::dependencies(path = "DESCRIPTION") %>%
pull(Package)
session <- devtools::session_info(pkgs = all_pkgs)
session$platform
```
Along with these packages:
```{r message = FALSE, warning = FALSE, echo = FALSE}
session$packages %>%
select(package, loadedversion, date, source) %>%
DT::datatable(rownames = FALSE,
class = 'cell-border stripe',
filter = list(position = 'top'),
options = list(pageLength = nrow(session$packages),
autoWidth = TRUE,
bInfo = FALSE,
paging = FALSE))
```
## License {-}
This work is licensed under a [Creative Commons Attribution-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-sa/4.0/).
<center>
<i class="fab fa-creative-commons fa-2x"></i><i class="fab fa-creative-commons-by fa-2x"></i><i class="fab fa-creative-commons-sa fa-2x"></i>
</center>
*This is a human-readable summary of (and not a substitute for) the license.
Please see <https://creativecommons.org/licenses/by-sa/4.0/legalcode> for the full legal text.*
**You are free to:**
- **Share**---copy and redistribute the material in any medium or
format
- **Remix**---remix, transform, and build upon the material for any
purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the
license terms.
**Under the following terms:**
- **Attribution**---You must give appropriate credit, provide a link
to the license, and indicate if changes were made. You may do so in
any reasonable manner, but not in any way that suggests the licensor
endorses you or your use.
- **ShareAlike**---If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
- **No additional restrictions**---You may not apply legal terms or
technological measures that legally restrict others from doing
anything the license permits.
**Notices:**
You do not have to comply with the license for elements of the
material in the public domain or where your use is permitted by an
applicable exception or limitation.
No warranties are given. The license may not give you all of the
permissions necessary for your intended use. For example, other rights
such as publicity, privacy, or moral rights may limit how you use the
material.
```{r include = FALSE}
# automatically create a bib database for R packages
knitr::write_bib(c(
.packages(), 'bookdown', 'knitr', 'rmarkdown'
), 'packages.bib')
```