diff --git a/data-visualize.Rmd b/data-visualize.Rmd index dd20078..0c0dd54 100644 --- a/data-visualize.Rmd +++ b/data-visualize.Rmd @@ -771,7 +771,7 @@ ggplot(data = mpg) + geom_point(mapping = aes(x = displ, y = hwy)) ``` -The values of `hwy` and `displ` are rounded so the points appear on a grid and many points overlap each other. +The underlying values of `hwy` and `displ` are rounded so the points appear on a grid and many points overlap each other. This problem is known as **overplotting**. This arrangement makes it hard to see where the mass of the data is. Are the data points spread equally throughout the graph, or is there one special combination of `hwy` and `displ` that contains 109 values?