From b95d8462835a9364dcc0259394de65569d123fbb Mon Sep 17 00:00:00 2001 From: Kevin Tappe <77916431+tappek@users.noreply.github.com> Date: Thu, 24 Aug 2023 15:26:20 +0200 Subject: [PATCH] Update arrow.qmd (#1568) --- arrow.qmd | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/arrow.qmd b/arrow.qmd index 36f3e21..002b2da 100644 --- a/arrow.qmd +++ b/arrow.qmd @@ -69,8 +69,8 @@ curl::multi_download( ## Opening a dataset Let's start by taking a look at the data. -At 9GB, this file is large enough that we probably don't want to load the whole thing into memory. -A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 Gb. +At 9 GB, this file is large enough that we probably don't want to load the whole thing into memory. +A good rule of thumb is that you usually want at least twice as much memory as the size of the data, and many laptops top out at 16 GB. This means we want to avoid `read_csv()` and instead use the `arrow::open_dataset()`: ```{r open-dataset}