diff --git a/EDA.Rmd b/EDA.Rmd index 7805fd2..7a45c12 100644 --- a/EDA.Rmd +++ b/EDA.Rmd @@ -111,7 +111,7 @@ diamonds %>% count(cut_width(carat, 0.5)) A histogram divides the x axis into equally spaced bins and then uses the height of bar to display the number of observations that fall in each bin. In the graph above, the tallest bar shows that almost 30,000 observations have a $carat$ value between 0.25 and 0.75, which are the left and right edges of the bar. -You can set the width of the intervals in a histogram with the `binwidth` argument, which is measured in the units of the $x$ variable. You should always explore a variety of binwidths when working with histograms, as different binwidths can reveal different patterns. For example, here is how the graph above looks when we zoom into just the diamonds with a binwidth of less than three and choose a smaller binwidth. +You can set the width of the intervals in a histogram with the `binwidth` argument, which is measured in the units of the $x$ variable. You should always explore a variety of binwidths when working with histograms, as different binwidths can reveal different patterns. For example, here is how the graph above looks when we zoom into just the diamonds with a carat of less than three and choose a smaller binwidth. ```{r} smaller <- diamonds %>% filter(carat < 3)