From ae15acf2fcfa68c4fb349d73299493580b930e65 Mon Sep 17 00:00:00 2001 From: Hadley Wickham Date: Tue, 6 Dec 2022 16:15:54 -0600 Subject: [PATCH] Save --- webscraping.qmd | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/webscraping.qmd b/webscraping.qmd index a40e39a..7871e14 100644 --- a/webscraping.qmd +++ b/webscraping.qmd @@ -9,7 +9,8 @@ status("polishing") This vignette introduces you to the basics of web scraping with [rvest](https://rvest.tidyverse.org). Web scraping is a very useful tool for extracting data from web pages. -Some websites will offer an API, a set of structured HTTP requests that return data as JSON, which you handle using the techniques from @sec-rectangling. Where possible, you should use the API, because typically it will give you more reliably data. +Some websites will offer an API, a set of structured HTTP requests that return data as JSON, which you handle using the techniques from @sec-rectangling. +Where possible, you should use the API, because typically it will give you more reliably data. Unfortunately however, programming with web APIs is out of scope for this book, and we instead teaching scraping, a technique that works whether or not a site provides an API. In this chapter, we'll first discuss the ethics and legalities of scraping before we dive into the basics of HTML.