diff --git a/arrow.qmd b/arrow.qmd index bb7fdcb..e203384 100644 --- a/arrow.qmd +++ b/arrow.qmd @@ -27,6 +27,8 @@ In many cases, the choice is made for you, as in the data is already in a databa But if you're starting with your own data (perhaps CSV files), you can either load it into a database or convert it to parquet. In general, it's hard to know what will work best, so in the early stages of your analysis we'd encourage you to try both and pick the one that works the best for you. +(A big thanks to Danielle Navarro who contributed the initial version of this chapter.) + ### Prerequisites In this chapter, we'll continue to use the tidyverse, particularly dplyr, but we'll pair it with the arrow package which is designed specifically for working with large data.