lterdatasampler
Provide the scientific community, policy makers, and society with the knowledge and predictive understanding necessary to conserve, protect, and manage the nation's ecosystems, their biodiversity, and the services they provide.
https://github.com/lter/lterdatasampler
Category: Biosphere
Sub Category: Conservation and Restoration
Keywords
data-science ecology lter-science r r-package
Keywords from Contributors
standards
Last synced: about 5 hours ago
JSON representation
Repository metadata
LTER data samples to teach environmental data science
- Host: GitHub
- URL: https://github.com/lter/lterdatasampler
- Owner: lter
- Created: 2021-06-16T16:59:13.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2023-10-03T18:53:21.000Z (about 2 years ago)
- Last Synced: 2025-10-22T04:40:10.850Z (12 days ago)
- Topics: data-science, ecology, lter-science, r, r-package
- Language: R
- Homepage: https://lter.github.io/lterdatasampler/
- Size: 44.9 MB
- Stars: 51
- Watchers: 2
- Forks: 7
- Open Issues: 16
- Releases: 1
-
Metadata Files:
- Readme: README.Rmd
README.Rmd
---
output: github_document
editor_options:
markdown:
wrap: 72
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "75%",
warning = FALSE,
message = FALSE,
fig.retina = 2,
fig.align = 'center'
)
library(gt)
library(tidyverse)
```
[](https://github.com/lter/lterdatasampler/actions)
[](https://github.com/lter/lterdatasampler/actions)
[](https://cran.r-project.org/package=lterdatasampler)
[](http://www.r-pkg.org/pkg/lterdatasampler)
# lterdatasampler
The mission of the [Long Term Ecological Research program (LTER)
Network](https://lternet.edu/) is to “*provide the scientific community,
policy makers, and society with the knowledge and predictive
understanding necessary to conserve, protect, and manage the nation’s
ecosystems, their biodiversity, and the services they provide.*” A
specific goal of the LTER is [education and
training](https://lternet.edu/education-and-training/) - “*to promote
training, teaching, and learning about long-term ecological research and
the Earth’s ecosystems, and to educate a new generation of scientists.*”
The goal of this package is to provide a sampler to gather feedback from
the community of what will be a larger package containing 28 datasets -
one from each of the existing [US LTER
sites](https://lternet.edu/site/). Those datasets are subsets of the
original data and have been updated - sometimes substantially - from the
raw data. They are aimed to be useful for teaching and training in
environmental data science. **This content is thus not suitable for
research and should only be used for teaching purposes**.
We encourage you to explore existing LTER [teaching and training
initiatives](https://lternet.edu/education-and-training/), and the
**many** other available LTER datasets which can be accessed via the
[Environmental Data
Initiative](https://edirepository.org/). Please contact
cited researchers directly to discuss using data for research purposes
or in publication.
## Installation
You can install the CRAN version of `lterdatasampler` with:
``` r
install.packages("lterdatasampler")
```
You can install the development version of `lterdatasampler` from
GitHub with:
``` r
# install.packages("remotes")
remotes::install_github("lter/lterdatasampler")
```
## The dataset samples
Dataset samples currently included in the package are summarized below;
see individual Articles for data and source details. Note: the three
letter prefix for each dataset indicates the LTER site (see full list of
[site abbreviations](https://lternet.edu/site/)).
- [`and_vertebrates`](https://lter.github.io/lterdatasampler/reference/and_vertebrates.html):
Records for aquatic vertebrates (cutthroat trout and salamanders) in
Mack Creek, Andrews Experimental Forest, Oregon (1987 - present)
- [`arc_weather`](https://lter.github.io/lterdatasampler/reference/arc_weather.html):
Daily meteorological (e.g. air temperature, precipitation) records
from Toolik Field Station, Alaska (1988 - present)
- [`hbr_maples`](https://lter.github.io/lterdatasampler/reference/hbr_maples.html):
Sugar maple seedlings at Hubbard Brook Experimental Forest (New
Hampshire) in calcium-treated and reference watersheds in August
2003 and June 2004
- [`knz_bison`](https://lter.github.io/lterdatasampler/reference/knz_bison.html):
Bison masses recorded for the herd at Konza Prairie Biological
Station LTER
- [`luq_streamchem`](https://lter.github.io/lterdatasampler/reference/luq_streamchem.html):
stream chemistry data for the Quebrada Sonadora (QS) location part of the Luqillo tropical
forest LTER site
- [`ntl_icecover`](https://lter.github.io/lterdatasampler/reference/ntl_icecover.html):
Ice freeze and thaw dates for Madison, Wisconsin Area lakes (1853 -
2019), North Temperate Lakes LTER
- [`ntl_airtemp`](https://lter.github.io/lterdatasampler/reference/ntl_airtemp.html):
Daily average air temperature data for Madison, Wisconsin (1869 -
2019), North Temperate Lakes LTER
- [`nwt_pikas`](https://lter.github.io/lterdatasampler/reference/nwt_pikas.html):
Pika observations for habitat and stress analysis at Niwot Ridge
LTER, Colorado
- [`pie_crab`](https://lter.github.io/lterdatasampler/reference/pie_crab.html):
Fiddler crab body size recorded summer 2016 in salt marshes from
Florida to Massachusetts including Plum Island Ecosystem LTER,
Virginia Coast LTER, and NOAA’s National Estuarine Research Reserve
System
## Which data sample should I use?
These data samples are selected because they have features we feel are
commonly useful in introductory environmental data science and
statistics courses.
In the table below, we list some introductory methods / skills, then
share which data samples in this package we think are well-suited to use
when teaching or learning them! It is not comprehensive - there are
*many* different analyses & skills that these data samples would
facilitate. Here we highlight a few that we think would be commonly
useful
```{r, echo = FALSE}
# Create the table contents
table_contents <- tribble(
~method, ~datasample, ~data_description, ~link,
"Linear relationships", "`pie_crab`", "Model the relationship between fiddler crab size and latitude using `pie_crab` , while learning about Bergmann's Rule!", "https://lter.github.io/lterdatasampler/articles/pie_crab_vignette.html",
"Linear relationships", "`ntl_icecover`", "Investigate the relationship between winter temperatures and ice cover duration for Wisconsin lakes using `ntl_icecover`", "https://lter.github.io/lterdatasampler/articles/ntl_icecover_vignette.html",
"Linear relationships", "`hbr_maples`", "Explore seedling height-mass relationships for sugar maples using `hbr_maples`", "https://lter.github.io/lterdatasampler/articles/hbr_maples_vignette.html",
"Non-linear relationships", "`knz_bison`", "Model the relationship between bison age and mass for male and female bison using `knz_bison`, for example estimating parameters in the Gompertz model", "https://lter.github.io/lterdatasampler/articles/knz_bison_vignette.html",
"Non-linear relationships", "`and_vertebrates`", "Model the length-mass relationships for cutthroat trout and salamanders in Mack Creek, Oregon", "https://lter.github.io/lterdatasampler/articles/and_vertebrates_vignette.html",
"Time series analysis", "`arc_weather`", "Explore seasonality, wrangling dates, or practice forecasting using daily meteorological records from Toolik Station, Alaska", "https://lter.github.io/lterdatasampler/articles/arc_weather_vignette.html",
"Time series analysis", "`luq_streamchem`", "Investigate the impact of a hurricane on stream water chemistry", "https://lter.github.io/lterdatasampler/articles/luq_streamchem_vignette.html",
"Spatial data introduction", "`nwt_pikas`", "Introduce basics of spatial data (e.g. CRS, projections) and tools for working with spatial data by visualizing pika locations at Niwot Ridge in the Colorado Rockies", "https://lter.github.io/lterdatasampler/articles/nwt_pikas_vignette.html",
"Comparing groups", "`hbr_maples`", "Compare sugar maple seedling heights in previously calcium-treated versus untreated watersheds using `hbr_maples`, using the exercise as an opportunity to think about acid rain and soil acidification", "https://lter.github.io/lterdatasampler/articles/hbr_maples_vignette.html",
"Comparing groups", "`and_vertebrates`", "Explore differences in size and abundance of cutthroat trout and salamanders in old growth versus previously clear cut forest sections (2 groups) or in different conditions (> 2 groups, e.g. pool, cascade, riffle) of Mack Creek, Oregon", "https://lter.github.io/lterdatasampler/articles/and_vertebrates_vignette.html"
) %>%
mutate(full_link = sprintf('%s', link, datasample),
full_link = map(full_link, gt::html))
table_contents %>%
select(method, full_link, data_description) %>%
gt(groupname_col = "method") %>%
tab_header(
title = "Recommended data samples for introducing selected topics",
) %>%
cols_label(method = "Topic", full_link = "Data sample", data_description = "For example you could:") %>%
tab_options(row_group.as_column = TRUE) %>%
tab_style(
style = "vertical-align:middle",
locations = cells_row_groups()
)
```
## How to provide feedback
The best way to provide feedback on this package is to open an
[issue](https://github.com/lter/lterdatasampler/issues) and assign the
`feedback` label. Thank you!
## Acknowledgements
Thank you to the amazing students who contributed to this project: *Sam
Guo, Adhitya Logan, Lia Ran, Sophia Sternberg, Karen Zhao* as part of
their [UCSB Data Science capstone
project](https://ucsb-ds-capstone-2021.github.io/projects/nceas/update3.html).
Thank you also go to their Course Advisor Prof. Sang-yun Oh.
People / organizations who supported this project:
- LTER Network Office
- LTER Information Managers
- LTER Education Committee
- All the LTER Researchers and Site PIs
- Cyber-infrastructures:
[EDI](https://edirepository.org/) and
[DataONE](https://www.dataone.org/)
We gratefully acknowledge all authors and contributors of the
[`roxygen2`](https://roxygen2.r-lib.org/),
[`usethis`](https://usethis.r-lib.org/),
[`pkgdown`](https://pkgdown.r-lib.org/),
[`devtools`](https://devtools.r-lib.org/),
[`tidyverse`](https://www.tidyverse.org/) and
[`metajam`](https://github.com/NCEAS/metajam/) packages. This website
relies heavily on themes created by Dr. Desirée DeLeon and Dr. Alison
Hill.
Owner metadata
- Name: Long Term Ecological Research Network
- Login: lter
- Email:
- Kind: organization
- Description: Projects in this organization support ecological research, coordination, and communication by and related to the LTER Network.
- Website: http://www.lternet.edu
- Location: United States
- Twitter: uslter
- Company:
- Icon url: https://avatars.githubusercontent.com/u/5386476?v=4
- Repositories: 68
- Last ynced at: 2023-07-31T13:54:30.185Z
- Profile URL: https://github.com/lter
GitHub Events
Total
- Watch event: 4
- Fork event: 1
Last Year
- Watch event: 4
- Fork event: 1
Committers metadata
Last synced: 7 days ago
Total Commits: 179
Total Committers: 5
Avg Commits per committer: 35.8
Development Distribution Score (DDS): 0.285
Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Commits | |
|---|---|---|
| Julien Brun | b****n@g****m | 128 |
| allisonhorst | a****t@u****u | 41 |
| adhil0 | 5****0 | 5 |
| Allison Horst | a****n@o****m | 4 |
| GitHub Actions | a****s@g****m | 1 |
Committer domains:
- github.com: 1
- observablehq.com: 1
- ucsb.edu: 1
Issue and Pull Request metadata
Last synced: 21 days ago
Total issues: 42
Total pull requests: 46
Average time to close issues: 3 months
Average time to close pull requests: 10 days
Total issue authors: 8
Total pull request authors: 6
Average comments per issue: 1.12
Average comments per pull request: 0.24
Merged pull request: 41
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- brunj7 (20)
- allisonhorst (15)
- KGraysonUR (2)
- atn38 (1)
- njlyon0 (1)
- mobb (1)
- marierivers (1)
- atheobold (1)
Top Pull Request Authors
- allisonhorst (22)
- brunj7 (19)
- adhil0 (2)
- colebrookson (1)
- syoh (1)
- kbodwin (1)
Top Issue Labels
- enhancement (4)
- feedback (3)
- documentation (2)
- data suggestion (2)
- data (2)
- bug (1)
Top Pull Request Labels
Package metadata
- Total packages: 1
-
Total downloads:
- cran: 1,318 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 1
- Total maintainers: 1
cran.r-project.org: lterdatasampler
Educational Dataset Examples from the Long Term Ecological Research Program
- Homepage: https://github.com/lter/lterdatasampler
- Documentation: http://cran.r-project.org/web/packages/lterdatasampler/lterdatasampler.pdf
- Licenses: CC0
- Latest release: 0.1.1 (published about 2 years ago)
- Last Synced: 2025-10-30T08:48:55.938Z (4 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 1,318 Last month
-
Rankings:
- Stargazers count: 7.606%
- Forks count: 10.985%
- Dependent packages count: 28.348%
- Average: 34.325%
- Dependent repos count: 36.932%
- Downloads: 87.752%
- Maintainers (1)
Dependencies
- R >= 2.10 depends
- broom * suggests
- changepoint * suggests
- feasts * suggests
- ggmap * suggests
- gt * suggests
- here * suggests
- janitor * suggests
- knitr * suggests
- leaflet * suggests
- lubridate * suggests
- metajam * suggests
- patchwork * suggests
- rmarkdown * suggests
- sf * suggests
- tidyverse * suggests
- tsibble * suggests
- usethis * suggests
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/upload-artifact main composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
- actions/cache v1 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc master composite
- r-lib/actions/setup-r master composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
Score: 12.99875968454238