A curated list of open technology projects to sustain a stable climate, energy supply, biodiversity and natural resources.

TADA

Help developers compile and evaluate water quality data for samples collected from surface water monitoring sites on streams and lakes.
https://github.com/usepa/epatada

Category: Natural Resources
Sub Category: Water Supply and Quality

Keywords

ow

Last synced: about 9 hours ago

Repository metadata

This R package can be used to compile and evaluate Water Quality Portal (WQP) data for samples collected from surface water monitoring sites on streams and lakes. It can be used to create applications that support water quality programs and help states, tribes, and other stakeholders efficiently analyze the data.

README.md

Welcome to EPATADA: Tools for Automated Data Analysis!

Tools for Automated Data Analysis, or TADA, is being developed to help States, Tribes (i.e., Tribal Nations, Pueblos, Bands, Rancherias, Communities, Colonies, Towns, Indians, Villages), federal partners, and any other Water Quality Portal (WQP) users (e.g., researchers) efficiently compile and evaluate WQP data collected from water quality monitoring sites. TADA is both a stand-alone R package and a building block to support development of R Shiny applications such as TADAShiny.

  • How to use TADA:

  • How to Contribute

    • We encourage stakeholders to test the functionality and provide feedback. Moreover, open source software provides an avenue for water quality data originators and users to develop and share code, and we welcome your contributions! We hope to build a collaborative community dedicated to this effort where TADA users and contributors can discover, share and build the functionality over time.
  • More information on how TADA leverages the WQX QAQC Service

  • More about the TADA Project

TADA poster created for the 2025 National Water Quality Monitoring Conference

Installation

You must have R and RStudio installed to use the TADA R Package (see instructions below if needed). Our team is actively developing TADA, so we highly recommend that you update the TADA R Package and all of its dependency libraries each time you use the package. You can install or update the TADA R Package and all dependencies by running:

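# Install the remotes helper package if it is not already available
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}

# Install or update EPATADA and its dependencies from the develop branch
remotes::install_github("USEPA/EPATADA", ref = "develop", dependencies = TRUE, force = TRUE)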
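
Because the package changes frequently, it can help to confirm which version is installed before starting an analysis. The following is a minimal sketch using only base R. It assumes the installed package is named `EPATADA` (matching the repository name), and `installed_version` is a hypothetical helper, not part of TADA.

```r
# Hypothetical helper (not part of TADA): return the installed version of a
# package as a string, or NA if the package is not installed.
installed_version <- function(pkg) {
  if (requireNamespace(pkg, quietly = TRUE)) {
    as.character(utils::packageVersion(pkg))
  } else {
    NA_character_
  }
}

# Assumption: the package installs under the name "EPATADA".
tada_version <- installed_version("EPATADA")
if (is.na(tada_version)) {
  message("EPATADA is not installed; run the install command above.")
} else {
  message("EPATADA version: ", tada_version)
}
```

Comparing the reported version before and after an update is a simple way to see whether the update actually changed anything.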

The TADA R Shiny application can be run on the web (no R or RStudio installation required) or within RStudio. We recommend running it within RStudio if possible to avoid timeout issues and to make sure you have the most up-to-date version (the web version is on a slightly lagged update schedule). Run the following code within RStudio to install or update, and then run, the most recent version of the TADA R Shiny application:

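# Install the remotes helper package if it is not already available
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}

# Install or update TADAShiny and its dependencies from the develop branch
remotes::install_github("USEPA/TADAShiny", ref = "develop", dependencies = TRUE, force = TRUE)

# Launch the Shiny application
TADAShiny::run_app()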

Recommended Citation:

Mullin, C.A., Marler, H., Greif, J., Hinman, E., Wong, K., Healy, K., Willi, K., Brousil, M., Bousquin, J., 2025, EPATADA: Tools for Automated Data Analysis, https://usepa.github.io/EPATADA/

Water Quality Portal

In 2012, the U.S. Geological Survey (USGS), the U.S. Environmental Protection Agency (USEPA), and the National Water Quality Monitoring Council deployed the WQP to combine and serve water-quality data from numerous sources in a standardized format. The WQP holds over 420 million water quality sample results from over 1,000 federal, state, tribal, and other partners, and is the nation's largest single point of access for water-quality data. Participating organizations submit their data to the WQP using the EPA's Water Quality Exchange (WQX), a framework designed to map their data holdings to a common data structure.
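
To make the idea of a single point of access concrete, the sketch below builds a query URL for the WQP web services using only base R. The endpoint path and the parameter names (`statecode`, `characteristicName`, `mimeType`) are assumptions based on the public WQP web service conventions rather than on this README, and `build_wqp_url` is a hypothetical helper. In practice, the dataRetrieval package listed among this project's imports would normally handle this step.

```r
# Hypothetical helper: assemble a WQP query URL from a named list of parameters.
# Assumption: the Result search endpoint below and the parameter names used in
# the example follow the public WQP web service conventions.
build_wqp_url <- function(params,
                          base = "https://www.waterqualitydata.us/data/Result/search") {
  # Percent-encode each value so characters such as ":" are safe in a URL
  encoded <- vapply(
    params,
    function(v) utils::URLencode(as.character(v), reserved = TRUE),
    character(1)
  )
  paste0(base, "?", paste(names(params), encoded, sep = "=", collapse = "&"))
}

# Example: pH results for one state (FIPS-style state code), returned as CSV
wqp_url <- build_wqp_url(list(
  statecode = "US:49",
  characteristicName = "pH",
  mimeType = "csv"
))
print(wqp_url)
```

The sketch only constructs the URL; it does not contact the service, so it can be run offline to inspect the query before downloading anything.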

Install R and RStudio

  1. To download R, go to https://cran.r-project.org/ and, in the first box of the menu titled "Download and Install R", click the link for your computer's operating system.
  2. Clicking your operating system takes you to a new page, which looks slightly different for PCs and Macs.
  3. Download the program by clicking the appropriate link for your system, then click through the installer windows, accepting all defaults.
  4. Next, go to https://posit.co/download/rstudio-desktop/, scroll down a little, and click the button to download RStudio.
  5. Again, download the installer, click through the prompts, and accept the defaults.

Open-Source Code Policy

Effective August 8, 2016, the OMB Mandate: M-16-21; Federal Source Code Policy: Achieving Efficiency, Transparency, and Innovation through Reusable and Open Source Software applies to new custom-developed code created or procured by EPA consistent with the scope and applicability requirements of Office of Management and Budget's (OMB's) Federal Source Code Policy. In general, it states that all new custom-developed code by Federal Agencies should be made available and reusable as open-source code.

The EPA-specific implementation of OMB Mandate M-16-21 is addressed in the System Life Cycle Management Procedure. EPA has chosen to use GitHub as its version control system as well as its inventory of open-source code projects. EPA uses GitHub to inventory its custom-developed, open-source code and generate the necessary metadata file that is then posted to code.gov for broad reuse in compliance with OMB Mandate M-16-21.

If you have any questions or want to read more, check out the EPA Open Source Project Repo and EPA's Interim Open Source Software Policy.

License

All contributions to this project will be released under the CC0-1.0 public domain dedication. By submitting a pull request or issue, you are agreeing to comply with this waiver of copyright interest.

Disclaimer

This United States Environmental Protection Agency (EPA) GitHub project code is provided on an "as is" basis and the user assumes responsibility for its use. EPA has relinquished control of the information and no longer has responsibility to protect the integrity, confidentiality, or availability of the information. Any reference to specific commercial products, processes, or services by service mark, trademark, manufacturer, or otherwise, does not constitute or imply their endorsement, recommendation or favoring by EPA. The EPA seal and logo shall not be used in any manner to imply endorsement of any commercial product or activity by EPA or the United States Government.

Contact

If you have any questions, please reach out to the TADA Team at mywaterway@epa.gov.


Committers metadata

Last synced: 7 days ago

Total Commits: 2,231
Total Committers: 19
Avg Commits per committer: 117.421
Development Distribution Score (DDS): 0.622

Commits in past year: 1,004
Committers in past year: 8
Avg Commits per committer in past year: 125.5
Development Distribution Score (DDS) in past year: 0.434

Name | Email | Commits
hillarymarler | M****y@e****v | 844
cristinamullin | 4****n | 804
Elise H | 1****n | 262
Kenny Wong | W****y@e****v | 120
Katie/Ryn Willi (she/her) | 4****i | 41
Katie Healy | h****n@e****v | 39
Matt Brousil | 3****l | 35
Bousquin | B****n@e****v | 26
Greif | G****b@e****v | 22
Renae Myers | r****s@e****m | 18
mthawley | t****e@e****v | 4
Brad Cooper | c****1 | 4
DeCicco | l****o@u****v | 3
Weber | W****c@e****v | 2
Zachary M. Smith | 1****7 | 2
elisehinman | 1****n | 2
David Blodgett | d****t@u****v | 1
Florian Rupprecht | 3****0 | 1
jesseboormanpadgett | b****e@e****v | 1
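
The all-time summary figures above appear to follow directly from the per-committer counts. The sketch below reproduces them in base R, assuming the Development Distribution Score is defined as one minus the top committer's share of all commits. That definition is an assumption, not something stated on this page, although it is consistent with the all-time numbers listed.

```r
# Commit counts copied from the committer list above
commits <- c(844, 804, 262, 120, 41, 39, 35, 26, 22, 18,
             4, 4, 3, 2, 2, 2, 1, 1, 1)

total_commits <- sum(commits)      # total commits across all committers
n_committers <- length(commits)    # number of distinct committers
avg_per_committer <- total_commits / n_committers

# Assumed definition: DDS = 1 - (commits by top committer / total commits)
dds <- 1 - max(commits) / total_commits

cat("Total commits:", total_commits, "\n")                     # 2231
cat("Total committers:", n_committers, "\n")                   # 19
cat("Avg commits per committer:", round(avg_per_committer, 3), "\n")  # 117.421
cat("DDS:", round(dds, 3), "\n")                               # 0.622
```

Under this definition, a score near 0 means one person wrote almost all commits, and a score near 1 means the work is spread across many committers.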


Issue and Pull Request metadata

Last synced: 1 day ago

Total issues: 236
Total pull requests: 178
Average time to close issues: 6 months
Average time to close pull requests: 6 days
Total issue authors: 13
Total pull request authors: 9
Average comments per issue: 1.56
Average comments per pull request: 1.26
Merged pull requests: 156
Bot issues: 0
Bot pull requests: 0

Past year issues: 70
Past year pull requests: 79
Past year average time to close issues: about 2 months
Past year average time to close pull requests: 10 days
Past year issue authors: 9
Past year pull request authors: 7
Past year average comments per issue: 1.67
Past year average comments per pull request: 2.34
Past year merged pull requests: 58
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/usepa/epatada

Top Issue Authors

  • cristinamullin (155)
  • jakegreif (36)
  • hillarymarler (23)
  • ehinman (8)
  • wokenny13 (6)
  • mariamval (1)
  • elisehinman (1)
  • edhinman (1)
  • jesseboormanpadgett (1)
  • mhweber (1)
  • cefergus (1)
  • ycwwang (1)
  • kathryn-willi (1)

Top Pull Request Authors

  • cristinamullin (73)
  • hillarymarler (41)
  • jakegreif (40)
  • wokenny13 (10)
  • kathryn-willi (4)
  • mthawley (4)
  • renaemyers (3)
  • dblodgett-usgs (2)
  • mbrousil (1)

Top Issue Labels

  • Future Improvement (52)
  • Module 1 (48)
  • Top Priority (44)
  • MVP (34)
  • bug (28)
  • Good First Issue (23)
  • ResultFlagsIndependent.R (19)
  • Tables&Figures (18)
  • Package Management (18)
  • DataDiscoveryRetrieval.R (17)
  • Usability (17)
  • Ref: WQX QAQC Char Validation (16)
  • Transformations.R (13)
  • Module 2 (12)
  • ROSSyndicate (12)
  • ResultFlagsDependent.R (10)
  • HOLD - waiting for WQX 3.0 (10)
  • More Research Needed (10)
  • ERG Discussion (9)
  • TADAWorkingGroupTopic (9)
  • Module 3 (8)
  • QAQC (8)
  • testthat (7)
  • Utilities.R (6)
  • Ref: Harmonization (6)
  • TADA Team Discussion (5)
  • hacktoberfest (5)
  • Helpful User Feedback (4)
  • Ref: WQX MeasureUnit Domain List (4)
  • Criteria-Methodologies-Module (4)

Top Pull Request Labels

  • ResultFlagsDependent.R (6)
  • Transformations.R (4)
  • Ref: WQX MeasureUnit Domain List (2)
  • ResultFlagsIndependent.R (2)
  • ROSSyndicate (2)
  • Future Improvement (1)
  • More Research Needed (1)
  • bug (1)
  • Module 1 (1)
  • Package Management (1)

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • RColorBrewer * imports
  • Rcpp * imports
  • data.table * imports
  • dataRetrieval * imports
  • dplyr * imports
  • ggplot2 * imports
  • grDevices * imports
  • magrittr * imports
  • plyr * imports
  • remotes * imports
  • stats * imports
  • stringr * imports
  • utils * imports
  • devtools * suggests
  • knitr * suggests
  • rmarkdown * suggests
  • testthat * suggests
  • usethis * suggests
.github/workflows/R-CMD-check.yaml actions
  • actions/checkout v3 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/test-coverage.yaml actions
  • actions/cache v1 composite
  • actions/checkout v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
.github/workflows/pkgdown.yaml actions
  • JamesIves/github-pages-deploy-action v4 composite
  • actions/checkout v3 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
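
Since the package imports a fair number of other packages, a quick local check can show which of them are not yet available before installing. This is a minimal base R sketch; `missing_packages` is a hypothetical helper, and the package names are copied from the imports entries listed above.

```r
# Package names copied from the "imports" entries of the DESCRIPTION listing
imports <- c("RColorBrewer", "Rcpp", "data.table", "dataRetrieval", "dplyr",
             "ggplot2", "grDevices", "magrittr", "plyr", "remotes", "stats",
             "stringr", "utils")

# Hypothetical helper: return the subset of packages that cannot be loaded
missing_packages <- function(pkgs) {
  available <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!unname(available)]
}

not_installed <- missing_packages(imports)
if (length(not_installed) == 0) {
  message("All listed imports are available.")
} else {
  message("Missing packages: ", paste(not_installed, collapse = ", "))
}
```

Installing with `dependencies = TRUE`, as in the installation commands earlier in this document, should normally resolve anything this check reports.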

Score: 7.796469243086057