A dataset demonstrating the utility of visualization. These 12 datasets are equal in standard measures: mean, standard deviation, and Pearson's correlation.
Format
A data frame with 1846 rows and 3 variables:
- dataset: indicates which dataset the data are from 
- x: x-values 
- y: y-values 
References
Matejka, J., & Fitzmaurice, G. (2017). Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. CHI 2017 Conference proceedings: ACM SIGCHI Conference on Human Factors in Computing Systems. Retrieved from https://www.research.autodesk.com/publications/same-stats-different-graphs/. #nolint
Examples
if (require(ggplot2)) {
  ggplot(datasaurus_dozen, aes(x = x, y = y, colour = dataset)) +
    geom_point() +
    theme_void() +
    theme(legend.position = "none") +
    facet_wrap(~dataset, ncol = 3)
}
 # Base R plots
state = par("mar", "mfrow")
# plot
par(mfrow = c(5, 3), mar = c(1, 2, 2, 1))
sets = sort(unique(datasaurus_dozen$dataset))
for (s in sets) {
  df = datasaurus_dozen[datasaurus_dozen$dataset == s, ]
  plot(df$x, df$y, pch = 16)
  title(s)
}
#reset settings
par(state)
# Base R plots
state = par("mar", "mfrow")
# plot
par(mfrow = c(5, 3), mar = c(1, 2, 2, 1))
sets = sort(unique(datasaurus_dozen$dataset))
for (s in sets) {
  df = datasaurus_dozen[datasaurus_dozen$dataset == s, ]
  plot(df$x, df$y, pch = 16)
  title(s)
}
#reset settings
par(state)
 
