forked from hadley/r4ds
-
Notifications
You must be signed in to change notification settings - Fork 0
/
transform.Rmd
35 lines (21 loc) · 1.87 KB
/
transform.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
# (PART) Transform {.unnumbered}
# Introduction {#data-types-intro}
In this part of the book, you'll learn about various types of data the columns of a data frame can contain and how to transform them.
The transformations you might want to apply to a column vary depending on the type of data you're working with, for example if you have text strings you might want to extract or remove certain pieces while if you have numerical data, you might want to rescale them.
You've already learned a little about data wrangling in the previous part.
Now we'll focus on new skills for specific types of data you will frequently encounter in practice.
<!--# TO DO: Add a diagram? -->
This part of the book proceeds as follows:
- In Chapter \@ref(tibbles), you'll learn about the variant of the data frame that we use in this book: the **tibble**. You'll learn what makes them different from regular data frames, and how you can construct them "by hand".
- Chapter \@ref(relational-data) will give you tools for working with multiple interrelated datasets.
- Chapter \@ref(logicals-numbers) ...
<!--# TO DO: Something about logicals and numbers -->
- Chapter \@ref(vector-tools) ...
<!--# TO DO: Something about general vector tools -->
- Chapter \@ref(missing-values)...
<!--# TO DO: Something about missing values -->
- Chapter \@ref(strings) will give you tools for working with strings and introduce regular expressions, a powerful tool for manipulating strings.
- Chapter \@ref(factors) will introduce factors -- how R stores categorical data.
They are used when a variable has a fixed set of possible values, or when you want to use a non-alphabetical ordering of a string.
- Chapter \@ref(dates-and-times) will give you the key tools for working with dates and date-times.
- Chapter \@ref(column-wise) will give you tools for performing the same operation on multiple columns.