diff --git a/arrow.qmd b/arrow.qmd index 36f3e21..002b2da 100644 --- a/arrow.qmd +++ b/arrow.qmd @@ -69,8 +69,8 @@ curl::multi_download( ## Opening a dataset Let's start by taking a look at the data. -At 9GB, this file is large enough that we probably don't want to load the whole thing into memory. -A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 Gb. +At 9 GB, this file is large enough that we probably don't want to load the whole thing into memory. +A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 GB. This means we want to avoid `read_csv()` and instead use the `arrow::open_dataset()`: ```{r open-dataset}