I’m excited about this penguins data set which has just been made publicly available. This will be much more fun for student projects than the old standard iris data set.
The data is from a published study on Antarctic penguins. It offers great opportunities for regression analysis, cluster analysis, etc. Here are two sample charts from the Github Readme:
What’s a culmen you may ask? They’ve illustrated that nicely:
Links and Credits
The data set is available at Github here: https://github.com/allisonhorst/penguins
The data was used in the published study freely available here:
Gorman KB, Williams TD, Fraser WR (2014) Ecological Sexual Dimorphism and Environmental Variability within a Community of Antarctic Penguins (Genus Pygoscelis). PLoS ONE 9(3): e90081. doi:10.1371/journal.pone.0090081