r4ds/wrangle.Rmd

36 lines
1.4 KiB
Plaintext
Raw Normal View History

2016-04-27 15:04:29 +08:00
# (PART) Wrangle {-}
2016-04-21 21:01:34 +08:00
2016-07-22 22:15:55 +08:00
# Introduction {#wrangle-intro}
2016-02-12 06:31:34 +08:00
2016-07-19 22:39:00 +08:00
In this part of the book, you'll learn about data wrangling, the art of getting your data into R in a useful form. Data wrangling encompasses three main pieces:
2016-03-25 22:39:49 +08:00
2016-07-19 22:39:00 +08:00
```{r echo = FALSE, out.width = "75%"}
knitr::include_graphics("diagrams/data-science-wrangle.png")
2016-03-25 22:39:49 +08:00
```
2016-07-25 03:53:59 +08:00
Data wrangling is import because it allows you to work with your own data. You'll learn:
* In [tibbles], you'll learn about the variant of the data frame that we use
in this book: the __tibble__. You'll learn what makes them different
from regular data frames, and how you can construct them "by hand".
* In [data import], you'll learn the art of data import: how to get your
data from disk and into R. We'll focus on plain-text rectangular
2016-07-25 03:53:59 +08:00
formats, but will give you pointers to packages that help with other
types of data.
* In [tidy data], you'll learn about tidy data, a consistent way of storing
your data that makes transformation, visualisation, and modelling easier.
Data wrangling also encompasses data transformation. You've already learned the basics, and now you'll learn new skills for specific types of data:
* [Dates and times] will give you the key tools for working with
2016-07-27 05:01:19 +08:00
dates, and date-times.
2016-07-25 03:53:59 +08:00
* [Strings] will introduce regular expressions, a powerful tool for
manipulating strings.
* [Relational data] will give you tools for working with multiple
interrelated datasets.
2016-03-25 22:39:49 +08:00