diff --git a/arrow.qmd b/arrow.qmd index b639551..33db21c 100644 --- a/arrow.qmd +++ b/arrow.qmd @@ -140,7 +140,7 @@ This means that: This means that they're organized column-by-column, much like R's data frame. This typically leads to better performance for data analysis tasks compared to CSV files, which are organized row-by-row. -- Parquet files are "chunked", which makes it possible to work on different parts of the file at the same time, and, if you're lucky, to skip some chunks all together. +- Parquet files are "chunked", which makes it possible to work on different parts of the file at the same time, and, if you're lucky, to skip some chunks altogether. There's one primary disadvantage to parquet files: they are no longer "human readable", i.e. if you look at a parquet file using `readr::read_file()`, you'll just see a bunch of gibberish.