EMMA
The core repository for environmental data processing in the Ecological Monitoring and Management Application EMMA.io.
https://github.com/adamwilsonlab/emma_envdata
Category: Biosphere
Sub Category: Conservation and Restoration
Last synced: about 2 hours ago
JSON representation
Repository metadata
Environmental data processing for www.emma.eco
- Host: GitHub
- URL: https://github.com/adamwilsonlab/emma_envdata
- Owner: AdamWilsonLab
- License: other
- Created: 2021-10-25T17:49:20.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2025-04-15T18:45:34.000Z (13 days ago)
- Last Synced: 2025-04-17T14:12:00.111Z (11 days ago)
- Language: R
- Homepage:
- Size: 75.2 MB
- Stars: 7
- Watchers: 3
- Forks: 2
- Open Issues: 0
- Releases: 21
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE
README.Rmd
--- output: github_document editor_options: chunk_output_type: console --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) library(tidyverse) ```  # Ecological Monitoring and Management Application (EMMA) This is the core repository for environmental data processing in the Ecological Monitoring and Management Application [EMMA.io](EMMA.io). ## File structure The most important files are: ```{r, eval = FALSE} ├── _targets.R (data processing workflow and dependency management) ├── R/ ├──── [data_processing_functions] ├── data/ ├──── manual_download (files behind firewalls that must be manually downloaded) ├──── raw_data (raw data files downloaded by the workflow) ├──── processed_data (data processed and stored by the workflow) └── Readme.Rmd (this file) ``` Files generated by the workflow are stored in the targets-runs branch. The final output of the workflow is a set of parquet files stored as Github releases with the tag "current". # Workflow structure ```{r, echo=F, results="hide"} targets::tar_glimpse() %>% htmlwidgets::saveWidget(file = "img/network.html") #webshot::install_phantomjs() webshot::webshot("img/network.html", "img/network.png") ``` # Workflow Notes ## Runtime and frequency Github places some constrains on actions, including memory limits and run time limits. To prevent this workflow from taking too long to run (and thereby losing all progress), there are a few key parameters that can be changed. In the _targets.R file, the argument "max_layers" controls the maximum number of layers that rgee will attempt to download in one action run. When initially setting up the repo, it may be necessary to lower this value and increase the frequency that the targets workflow is run (by adjusting the cron parameters in targets.yaml). Github also limits the rates of requests, and so the file release_data.R includes a call to Sys.sleep that can be adjusted to slow down/speed up the process of pushing data to a Github release. # Data notes * MODIS NDVI values have been transformed to save space. To restore them to the original values (between -1 and 1), divide by 100 and subtract 1. * Untransformed NDVI = (transformed NDVI / 100) - 1 * Raw MODIS fire dates (tag:raw_fire_modis): values are either 0 (no fire) or the day of the year a fire was observed (1 through 366). * Processed MODIS fire dates (tag: processed_fire_dates: values are either 0 (no fire) or the UNIX date (days since 1 Jan. 1970) a fire was observed. ## Data layers * Continuous Heat-Insolation Load Index (CHILI; ALOS) * Multi-Scale Topographic Position Index (MTPI; compares elevation to surroundings; ALOS) * Topographic Diversity (represents the variety of temperature, moisture conditions; ALOS ) * Mean annual air temperature (CHELSA Bio1) * Mean diurnal air temperature range (CHELSA Bio2) * Isothermality (ratio of diurnal variation to annual variation in temperatures; CHELSA Bio3) * Temperature seasonality(std. deviation of the monthly mean temperatures; CHELSA Bio4) * Mean daily maximum air temperature of the warmest month (CHELSA Bio5) * Mean daily minimum air temperature of the coldest month (CHELSA Bio6) * Annual range of air temperature (CHELSA Bio7) * Mean daily mean air temperatures of the wettest quarter (CHELSA Bio8) * Mean daily mean air temperatures of the driest quarter (CHELSA Bio9) * Mean daily mean air temperatures of the warmest quarter (CHELSA Bio10) * Mean daily mean air temperatures of the coldest quarter (CHELSA Bio11) * Annual precipitation amount (CHELSA Bio12) * Precipitation amount of the wettest month (CHELSA Bio13) * Precipitation amount of the driest month (CHELSA Bio14) * Precipitation seasonality (CV of the monthly precipitation estimates; CHELSA Bio15) * Mean monthly precipitation amount of the wettest quarter (CHELSA Bio16) * Mean monthly precipitation amount of the driest quarter (CHELSA Bio17) * Mean monthly precipitation amount of the warmest quarter (CHELSA Bio18) * Mean monthly precipitation amount of the coldest quarter (CHELSA Bio19) * January (mid dry season) precipitation (CHELSA) * July (mid wet season) precipitation (CHELSA) * Interannual variability in cloud frequency (MODCF) * Intraannual variability in cloud frequency (MODCF) * Mean annual cloud frequency (MODCF) * Cloud frequency seasonality concentration (sum(monthly concentration vectors); MODCF) * Elevation (NASA DEM) * Soil electrical conductivity (soil_EC_mS_m, Cramer et al. 2019) * Soil extractable K (soil_Ext_K_cmol_kg, Cramer et al. 2019) * Soil extractable NA (soil_Ext_Na_cmol_kg, Cramer et al. 2019) * Soil extractable P(soil_Ext_P_mg_kg, Cramer et al. 2019) * Soil pH (Cramer et al. 2019) * Total soil C (Cramer et al. 2019) * Total soil N (Cramer et al. 2019) * Time since fire (generated from MODIS active fire products and CapeNature fire polygons) ## Setting up the repo * This repo requires github credentials. To store those securely... * Credentials are decrypted with the function decryp_secret.sh # Extras * Call `targets::tar_renv(extras = character(0))` to write a `_packages.R` file to expose hidden dependencies. * Call `renv::init()` to initialize the `renv` lockfile `renv.lock` or `renv::snapshot()` to update it. * Commit `renv.lock` to your Git repository. # DOI [](https://zenodo.org/badge/latestdoi/421127852)
Owner metadata
- Name: Adam Wilson Lab
- Login: AdamWilsonLab
- Email: [email protected]
- Kind: organization
- Description: Global Change Ecology
- Website: github.com/adamwilsonlab
- Location: Buffalo, NY
- Twitter: adamwilsonlab
- Company:
- Icon url: https://avatars.githubusercontent.com/u/8514984?v=4
- Repositories: 32
- Last ynced at: 2024-03-27T13:24:01.925Z
- Profile URL: https://github.com/AdamWilsonLab
GitHub Events
Total
- Push event: 169
Last Year
- Push event: 169
Committers metadata
Last synced: 6 days ago
Total Commits: 993
Total Committers: 3
Avg Commits per committer: 331.0
Development Distribution Score (DDS): 0.152
Commits in past year: 85
Committers in past year: 2
Avg Commits per committer in past year: 42.5
Development Distribution Score (DDS) in past year: 0.424
Name | Commits | |
---|---|---|
Brian | b****r@g****m | 842 |
Adam M. Wilson | a****w@b****u | 150 |
Jasper Slingsby | j****y@g****m | 1 |
Committer domains:
- buffalo.edu: 1
Issue and Pull Request metadata
Last synced: about 2 hours ago
Total issues: 0
Total pull requests: 8
Average time to close issues: N/A
Average time to close pull requests: about 3 hours
Total issue authors: 0
Total pull request authors: 2
Average comments per issue: 0
Average comments per pull request: 0.0
Merged pull request: 8
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
Top Pull Request Authors
- adammwilson (6)
- bmaitner (2)
Top Issue Labels
Top Pull Request Labels
Score: 3.044522437723423