a missing word inserted (#1298)

* a missing word corrected

* CLT versus LLN

* summary statistics instead of summary

* Update data-transform.qmd

---------

Co-authored-by: Mine Cetinkaya-Rundel <cetinkaya.mine@gmail.com>
This commit is contained in:
Ercan Karadas 2023-02-23 23:49:43 -05:00 committed by GitHub
parent df90abbde5
commit e827725d8e
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 2 additions and 2 deletions

View File

@ -699,9 +699,9 @@ ggplot(delays, aes(x = n, y = delay)) +
```
Not surprisingly, there is much greater variation in the average delay when there are few flights for a given plane.
The shape of this plot is very characteristic: whenever you plot a mean (or other summary) vs. group size, you'll see that the variation decreases as the sample size increases[^data-transform-4].
The shape of this plot is very characteristic: whenever you plot a mean (or other summary statistics) vs. group size, you'll see that the variation decreases as the sample size increases[^data-transform-4].
[^data-transform-4]: \*cough\* the central limit theorem \*cough\*.
[^data-transform-4]: \*cough\* the law of large numbers \*cough\*.
When looking at this sort of plot, it's often useful to filter out the groups with the smallest numbers of observations, so you can see more of the pattern and less of the extreme variation in the smallest groups: