palmerpenguins
The palmerpenguins data contains size measurements for three penguin species observed on three islands in the Palmer Archipelago, Antarctica.
https://github.com/allisonhorst/palmerpenguins
Category: Biosphere
Sub Category: Avian Monitoring and Analysis
Keywords from Contributors
dynamic ecology data-manipulation grammar
Last synced: about 13 hours ago
JSON representation
Repository metadata
A great intro dataset for data exploration & visualization (alternative to iris).
- Host: GitHub
- URL: https://github.com/allisonhorst/palmerpenguins
- Owner: allisonhorst
- License: cc0-1.0
- Created: 2020-06-05T14:57:15.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2024-09-19T22:20:38.000Z (about 1 year ago)
- Last Synced: 2025-10-26T06:40:07.847Z (9 days ago)
- Language: R
- Homepage: https://allisonhorst.github.io/palmerpenguins/
- Size: 43.3 MB
- Stars: 978
- Watchers: 22
- Forks: 240
- Open Issues: 17
- Releases: 1
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE.md
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "75%",
warning = FALSE,
message = FALSE,
fig.retina = 2,
fig.align = 'center'
)
library(tidyverse)
library(palmerpenguins)
```
# palmerpenguins
[](https://doi.org/10.5281/zenodo.3960218)
[](https://cran.r-project.org/package=palmerpenguins)
The goal of palmerpenguins is to provide a great dataset for data exploration & visualization, as an alternative to `iris`.
```{r flipper-bill, echo=FALSE}
flipper_bill <- ggplot(data = penguins,
aes(x = flipper_length_mm,
y = bill_length_mm)) +
geom_point(aes(color = species,
shape = species),
size = 3,
alpha = 0.8) +
geom_smooth(method = "lm", se = FALSE, aes(color = species)) +
theme_minimal() +
scale_color_manual(values = c("darkorange","purple","cyan4")) +
labs(title = "Flipper and bill length",
subtitle = "Dimensions for Adelie, Chinstrap and Gentoo Penguins at Palmer Station LTER",
x = "Flipper length (mm)",
y = "Bill length (mm)",
color = "Penguin species",
shape = "Penguin species") +
theme(legend.position = c(0.85, 0.15),
plot.title.position = "plot",
plot.caption = element_text(hjust = 0, face= "italic"),
plot.caption.position = "plot")
flipper_bill
```
## Installation
You can install the released version of palmerpenguins from [CRAN](https://CRAN.R-project.org) with:
``` r
install.packages("palmerpenguins")
```
To install the development version from [GitHub](https://github.com/) use:
``` r
# install.packages("remotes")
remotes::install_github("allisonhorst/palmerpenguins")
```
## About the data
Data were collected and made available by [Dr. Kristen Gorman](https://www.uaf.edu/cfos/people/faculty/detail/kristen-gorman.php) and the [Palmer Station, Antarctica LTER](https://pallter.marine.rutgers.edu/), a member of the [Long Term Ecological Research Network](https://lternet.edu/).
The palmerpenguins package contains two datasets.
```{r showdata}
library(palmerpenguins)
data(package = 'palmerpenguins')
```
One is called `penguins`, and is a simplified version of the raw data; see `?penguins` for more info:
```{r example-penguins}
head(penguins)
```
The second dataset is `penguins_raw`, and contains all the variables and original names as downloaded; see `?penguins_raw` for more info.
```{r example-penguins-raw}
head(penguins_raw)
```
Both datasets contain data for `r nrow(penguins)` penguins. There are `r length(unique(penguins$species))` different species of penguins in this dataset, collected from `r length(unique(penguins$island))` islands in the Palmer Archipelago, Antarctica.
```{r}
str(penguins)
```
We gratefully acknowledge Palmer Station LTER and the US LTER Network. Special thanks to Marty Downs (Director, LTER Network Office) for help regarding the data license & use.
## Examples
You can find these and more code examples for exploring palmerpenguins in `vignette("examples")`.
Penguins are fun to summarize! For example:
```{r example, warning=FALSE, message=FALSE}
library(tidyverse)
penguins %>%
count(species)
penguins %>%
group_by(species) %>%
summarize(across(where(is.numeric), mean, na.rm = TRUE))
```
Penguins are fun to visualize! For example:
```{r mass-flipper, warning = FALSE, message = FALSE, echo = FALSE, out.width='75%', fig.retina=2}
mass_flipper <- ggplot(data = penguins,
aes(x = flipper_length_mm,
y = body_mass_g)) +
geom_point(aes(color = species,
shape = species),
size = 3,
alpha = 0.8) +
theme_minimal() +
scale_color_manual(values = c("darkorange","purple","cyan4")) +
labs(title = "Penguin size, Palmer Station LTER",
subtitle = "Flipper length and body mass for Adelie, Chinstrap, and Gentoo Penguins",
x = "Flipper length (mm)",
y = "Body mass (g)",
color = "Penguin species",
shape = "Penguin species") +
theme(legend.position = c(0.2, 0.7),
plot.title.position = "plot",
plot.caption = element_text(hjust = 0, face= "italic"),
plot.caption.position = "plot")
mass_flipper
```
```{r flipper-hist, echo=FALSE}
flipper_hist <- ggplot(data = penguins, aes(x = flipper_length_mm)) +
geom_histogram(aes(fill = species),
alpha = 0.5,
position = "identity") +
scale_fill_manual(values = c("darkorange","purple","cyan4")) +
theme_minimal() +
labs(x = "Flipper length (mm)",
y = "Frequency",
title = "Penguin flipper lengths")
flipper_hist
```
## Artwork
You can download palmerpenguins art (useful for teaching with the data) in `vignette("art")`. If you use this artwork, please cite with: "Artwork by @allison_horst".
### Meet the Palmer penguins
```{r, echo = FALSE, fig.align='center'}
knitr::include_graphics("man/figures/lter_penguins.png", dpi = 300)
```
### Bill dimensions
The culmen is the upper ridge of a bird's bill. In the simplified `penguins` data, culmen length and depth are renamed as variables `bill_length_mm` and `bill_depth_mm` to be more intuitive.
For this penguin data, the culmen (bill) length and depth are measured as shown below (thanks Kristen Gorman for clarifying!):
```{r, echo = FALSE, fig.align='center'}
knitr::include_graphics("man/figures/culmen_depth.png", dpi = 300)
```
## License
Data are available by [CC-0](https://creativecommons.org/share-your-work/public-domain/cc0/) license in accordance with the [Palmer Station LTER Data Policy](https://pallter.marine.rutgers.edu/data/) and the [LTER Data Access Policy for Type I data](https://lternet.edu/data-access-policy/).
## Citation
To cite the palmerpenguins package, please use:
```{r}
citation("palmerpenguins")
```
## Additional data use information
Anyone interested in publishing the data should contact [Dr. Kristen Gorman](https://www.uaf.edu/cfos/people/faculty/detail/kristen-gorman.php) about analysis and working together on any final products. From Gorman et al. (2014): "Individuals interested in using these data are expected to follow the US LTER Network’s Data Access Policy, Requirements and Use Agreement: https://lternet.edu/data-access-policy/."
## References
**Data originally published in:**
+ Gorman KB, Williams TD, Fraser WR (2014). Ecological sexual dimorphism and environmental variability within a community of Antarctic penguins (genus *Pygoscelis*). PLoS ONE 9(3):e90081. https://doi.org/10.1371/journal.pone.0090081
**Data citations:**
Adélie penguins:
+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Adélie penguins (*Pygoscelis adeliae*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 5. Environmental Data Initiative. https://doi.org/10.6073/pasta/98b16d7d563f265cb52372c8ca99e60f (Accessed 2020-06-08).
Gentoo penguins:
+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Gentoo penguin (*Pygoscelis papua*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 5. Environmental Data Initiative. https://doi.org/10.6073/pasta/7fca67fb28d56ee2ffa3d9370ebda689 (Accessed 2020-06-08).
Chinstrap penguins:
+ Palmer Station Antarctica LTER and K. Gorman, 2020. Structural size measurements and isotopic signatures of foraging among adult male and female Chinstrap penguin (*Pygoscelis antarcticus*) nesting along the Palmer Archipelago near Palmer Station, 2007-2009 ver 6. Environmental Data Initiative. https://doi.org/10.6073/pasta/c14dfcfada8ea13a17536e73eb6fbe9e (Accessed 2020-06-08).
Owner metadata
- Name: Allison Horst
- Login: allisonhorst
- Email:
- Kind: user
- Description: Teaching | Data Science | Art | Science Communication
- Website: https://www.allisonhorst.com/
- Location:
- Twitter: allison_horst
- Company: Observable
- Icon url: https://avatars.githubusercontent.com/u/25372780?u=9e6fa4d2211577613e0b7b95d4d0643f11319ecc&v=4
- Repositories: 183
- Last ynced at: 2025-10-11T17:32:13.821Z
- Profile URL: https://github.com/allisonhorst
GitHub Events
Total
- Issues event: 4
- Watch event: 77
- Issue comment event: 3
- Fork event: 33
Last Year
- Issues event: 4
- Watch event: 69
- Issue comment event: 3
- Fork event: 32
Committers metadata
Last synced: 12 days ago
Total Commits: 192
Total Committers: 7
Avg Commits per committer: 27.429
Development Distribution Score (DDS): 0.203
Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Commits | |
|---|---|---|
| Allison Horst | a****t@u****u | 153 |
| Alison Presmanes Hill | a****l@g****m | 31 |
| Hadley Wickham | h****m@g****m | 4 |
| deanerik | 3****k | 1 |
| Trang Le | g****r@g****m | 1 |
| Jennifer (Jenny) Bryan | j****n@g****m | 1 |
| Jannik Buhr | 1****r | 1 |
Committer domains:
- ucsb.edu: 1
Issue and Pull Request metadata
Last synced: about 1 month ago
Total issues: 30
Total pull requests: 72
Average time to close issues: 4 months
Average time to close pull requests: about 18 hours
Total issue authors: 29
Total pull request authors: 12
Average comments per issue: 1.97
Average comments per pull request: 0.24
Merged pull request: 66
Bot issues: 0
Bot pull requests: 0
Past year issues: 3
Past year pull requests: 1
Past year average time to close issues: N/A
Past year average time to close pull requests: about 4 hours
Past year issue authors: 3
Past year pull request authors: 1
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.0
Past year merged pull request: 1
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- allisonhorst (2)
- MonkmanMH (1)
- jhk0530 (1)
- john-sandall (1)
- RaphaelS1 (1)
- friendly (1)
- davidhodge931 (1)
- markvanderloo (1)
- Bassei (1)
- amrrs (1)
- ttimbers (1)
- gupton1944 (1)
- moon-jong (1)
- karaesmen (1)
- charliejhadley (1)
Top Pull Request Authors
- allisonhorst (33)
- apreshill (26)
- hadley (4)
- deanerik (2)
- gungorMetehan (2)
- jennybc (1)
- koliharesh (1)
- brunj7 (1)
- jmbuhr (1)
- trangdata (1)
- ttimbers (1)
- friendly (1)
Top Issue Labels
- CRAN :penguin: (1)
Top Pull Request Labels
Package metadata
- Total packages: 2
-
Total downloads:
- cran: 77,664 last-month
- Total docker downloads: 48,971
- Total dependent packages: 39 (may contain duplicates)
- Total dependent repositories: 151 (may contain duplicates)
- Total versions: 4
- Total maintainers: 1
cran.r-project.org: palmerpenguins
Palmer Archipelago (Antarctica) Penguin Data
- Homepage: https://allisonhorst.github.io/palmerpenguins/
- Documentation: http://cran.r-project.org/web/packages/palmerpenguins/palmerpenguins.pdf
- Licenses: CC0
- Latest release: 0.1.1 (published about 3 years ago)
- Last Synced: 2025-10-30T08:52:17.421Z (5 days ago)
- Versions: 2
- Dependent Packages: 38
- Dependent Repositories: 149
- Downloads: 77,664 Last month
- Docker Downloads: 48,971
-
Rankings:
- Forks count: 0.315%
- Stargazers count: 0.347%
- Downloads: 1.648%
- Dependent repos count: 1.661%
- Average: 2.411%
- Dependent packages count: 2.642%
- Docker downloads count: 7.855%
- Maintainers (1)
conda-forge.org: r-palmerpenguins
- Homepage: https://allisonhorst.github.io/palmerpenguins/
- Licenses: CC0-1.0
- Latest release: 0.1.1 (published about 3 years ago)
- Last Synced: 2025-10-30T08:52:19.655Z (5 days ago)
- Versions: 2
- Dependent Packages: 1
- Dependent Repositories: 2
-
Rankings:
- Forks count: 14.045%
- Stargazers count: 14.589%
- Average: 19.412%
- Dependent repos count: 20.06%
- Dependent packages count: 28.954%
Dependencies
- R >= 2.10 depends
- dplyr * suggests
- ggplot2 * suggests
- knitr * suggests
- recipes * suggests
- rmarkdown * suggests
- tibble * suggests
- tidyr * suggests
- JamesIves/github-pages-deploy-action 4.1.4 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
Score: 20.59991785692029