Tweaking wrangle intro

This commit is contained in:
hadley 2016-07-24 14:53:59 -05:00
parent fecd77847a
commit b28578fd8d
1 changed files with 20 additions and 15 deletions

View File

@ -8,14 +8,21 @@ In this part of the book, you'll learn about data wrangling, the art of getting
knitr::include_graphics("diagrams/data-science-wrangle.png") knitr::include_graphics("diagrams/data-science-wrangle.png")
``` ```
* In [data import], you'll learn the art of data import: how to get your data Data wrangling is import because it allows you to work with your own data. You'll learn:
off of disk and into R.
* In [tidy data], you'll learn about tidy data, a consistent way of storing your * In [tibbles], you'll learn about the variant of the data frame that we use
data that makes transformation, visualiation, and modelling easier. in this book: the __tibble__. You'll learn what makes them different
from regular data frames, and how you can construct them "by hand".
* You've already learned the basics of data transformation. In this part of the * In [data import], you'll learn the art of data import: how to get your
book we'll dive deeper into tools useful for specific types of data: data off of disk and into R. We'll focus on plain-text rectangular
formats, but will give you pointers to packages that help with other
types of data.
* In [tidy data], you'll learn about tidy data, a consistent way of storing
your data that makes transformation, visualisation, and modelling easier.
Data wrangling also encompasses data transformation. You've already learned the basics, and now you'll learn new skills for specific types of data:
* [Dates and times] will give you the key tools for working with * [Dates and times] will give you the key tools for working with
dates, and date times. dates, and date times.
@ -26,5 +33,3 @@ knitr::include_graphics("diagrams/data-science-wrangle.png")
* [Relational data] will give you tools for working with multiple * [Relational data] will give you tools for working with multiple
interrelated datasets. interrelated datasets.
Before we get to those chapters we'll take a brief discussion to discuss the "tibble" in more detail, in [tibbles].