---
title: "Descriptive assessment of datasets"
author: "Jerid Francom"
date-modified: today
format: html
bibliography: [bibliography.bib, packages.bib]
biblio-style: apalike
link-citations: true
---
<!-- [ ] incomplete: depends on data approach (see README) -->
## Overview
- What is the name of the data source?
- Where did it come from?
- What is the sampling frame?
- What are you trying to learn about the data?
## Setup
```{r}
#| label: setup
#| message: false
# Load packages
library(readr) # read in data
library(dplyr) # data manipulation
library(skimr) # descriptive statistics
library(knitr) # tables
library(ggplot2) # plotting
```
```{r}
#| label: read-dataset
#| message: false
# Read in dataset
masc_df <- read_csv("data/derived/data.csv")
```
```{r}
#| label: subset-dataset
# Subset dataset
masc_df <-
  masc_df |>
  select(-description, -domain)
```
- How many variables are included?
- What are the variable types?
- How many observations are included?
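
The three questions above can be answered directly from the data frame. The following is a minimal sketch that uses a small stand-in tibble, `toy_df`, in place of `masc_df`. Its columns (`doc_id`, `modality`, `num_tokens`) are hypothetical and chosen only for illustration, since the actual columns of the dataset are not shown here.

```{r}
#| label: dataset-structure-sketch
#| message: false
library(dplyr) # provides tibble() and glimpse()

# Hypothetical stand-in for masc_df (column names are assumptions)
toy_df <- tibble(
  doc_id = c("a", "b", "c"),
  modality = c("written", "spoken", "written"),
  num_tokens = c(120L, 85L, 240L)
)

# How many variables are included?
n_vars <- ncol(toy_df)

# How many observations are included?
n_obs <- nrow(toy_df)

# What are the variable types? (first class of each column)
var_types <- vapply(toy_df, function(x) class(x)[1], character(1))

n_vars
n_obs
var_types

# Compact overview of all three at once
glimpse(toy_df)
```

Swapping `toy_df` for `masc_df` should give the corresponding answers for the real dataset.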
## Assessment
```{r}
#| label: statistical-summary
# Statistical summary
skim(masc_df)
```
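
Beyond the overall summary, individual variables can be assessed with a frequency table and a plot, which is one possible use for the `knitr` and `ggplot2` packages loaded in the setup. This is a sketch under stated assumptions: `toy_df` and its columns (`modality`, `num_tokens`) are hypothetical stand-ins, not the actual variables in `masc_df`.

```{r}
#| label: assessment-sketch
#| message: false
library(dplyr)   # count(), tibble()
library(knitr)   # kable()
library(ggplot2) # plotting

# Hypothetical stand-in for masc_df (column names are assumptions)
toy_df <- tibble(
  modality = c("written", "spoken", "written", "written"),
  num_tokens = c(120L, 85L, 240L, 60L)
)

# Frequency table for a categorical variable, most frequent level first
modality_counts <-
  toy_df |>
  count(modality, sort = TRUE)

kable(modality_counts, caption = "Observations per modality (toy data)")

# Distribution of a numeric variable
token_plot <-
  ggplot(toy_df, aes(x = num_tokens)) +
  geom_histogram(bins = 5) +
  labs(x = "Number of tokens", y = "Count")

token_plot
```

With the real dataset, the categorical and numeric variables reported by `skim()` would take the place of the hypothetical columns used here.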
## Self-assessment
## References