taxizedb

Automatically converts supported taxonomic databases into SQLite for unified querying, with support for the National Center for Biotechnology Information Taxonomy, the Integrated Taxonomic Information System, World Flora Online, the Catalogue of Life, the Global Biodiversity Information Facility, and Wikidata.
https://github.com/ropensci/taxizedb

Category: Sustainable Development
Sub Category: Taxonomy and Ontology

Keywords

itis r r-package rstats taxize taxonomic-databases taxonomy

Keywords from Contributors

biodiversity biology darwincore nomenclature ecology conservation cycle routes data-access weather

Last synced: about 11 hours ago
JSON representation

Repository metadata

Tools for Working with Taxonomic SQL Databases

README.Rmd

          ---
output: github_document
editor_options: 
  chunk_output_type: console
---


taxizedb
========

```{r echo=FALSE}
knitr::opts_chunk$set(
  warning = FALSE,
  message = FALSE,
  collapse = TRUE,
  comment = "#>"
)
```

[![status](https://www.repostatus.org/badges/latest/active.svg)](https://www.repostatus.org/#active)
[![cran checks](https://badges.cranchecks.info/worst/taxizedb.svg)](https://badges.cranchecks.info/worst/taxizedb.svg)
[![R-check](https://github.com/ropensci/taxizedb/workflows/R-check/badge.svg)](https://github.com/ropensci/taxizedb/actions)
[![codecov](https://codecov.io/gh/ropensci/taxizedb/branch/master/graph/badge.svg)](https://app.codecov.io/gh/ropensci/taxizedb)
[![rstudio mirror downloads](https://cranlogs.r-pkg.org/badges/taxizedb)](https://github.com/r-hub/cranlogs.app)
[![Total Downloads](https://cranlogs.r-pkg.org/badges/grand-total/taxizedb?color=blue)](https://cran.r-project.org/package=taxizedb)
[![cran version](https://www.r-pkg.org/badges/version/taxizedb)](https://cran.r-project.org/package=taxizedb)
[![DOI](https://zenodo.org/badge/53961466.svg)](https://zenodo.org/badge/latestdoi/53961466)

`taxizedb` - Tools for Working with Taxonomic Databases

Docs: 

`taxizedb` is an R package for interacting with taxonomic databases. Its functionality can be divided in two parts: 1. You can download the databases to your platform 2. You can query the downloaded databases to retrieve taxonomic information.

This two step approach is different from tools which interact with web services for each query, and has a number of advantages:

* Once you download a database you can work with it offline
* Once you download a database querying it is super fast
* As long as you store your database files all the queries in your analysis will be fully reproducible

## Data sources

When you download a database with `taxizedb` it will automatically convert it to SQLite and then all query functions will interact with this SQLite database. However, not all taxonomic databases are publicly available, or can be converted to SQLite. The following databases are supported:

- [NCBI Taxonomy](https://www.ncbi.nlm.nih.gov/taxonomy)
- [ITIS](https://itis.gov/)
- [World Flora Online (WFO)](https://www.worldfloraonline.org/)
- [Catalogue of Life (COL)](https://www.catalogueoflife.org/)
- [Global Biodiversity Information Facility (GBIF)](https://www.gbif.org/)
- [Wikidata](https://zenodo.org/records/1213477)

Get in touch [in the issues](https://github.com/ropensci/taxizedb/issues) with
any ideas on new data sources.

## Data sources - legacy support

[The Plant List (TPL)](https://en.wikipedia.org/wiki/The_Plant_List) has been replaced by World Flora Online. The website seems to be down so `taxizedb` can no longer facilitate new downloads. However, already downloaded database files can still be queried using `taxizedb` functions, for reproducibility.

## Package API

This package for each data sources performs the following tasks:

* Downloaded taxonomic databases `db_download_*`
* Create `dplyr` SQL backend via `dbplyr::src_dbi` - `src_*` 
* Query and get data back into a data.frame - `sql_collect`
* Manage cached database files - `tdb_cache`
* Retrieve immediate descendents of a taxon - `children`
* Retrieve the taxonomic hierarchies from local database - `classification`
* Retrieve all taxa descending from a vector of taxa - `downstream`
* Convert species names to taxon IDs - `name2taxid`
* Convert taxon IDs to species names - `taxid2name`
* Convert taxon IDs to ranks - `taxid2rank`

You can use the `src` connections with `dplyr`, etc. to do operations downstream. Or use the database connection to do raw SQL queries.

## Installation

CRAN version

```{r eval=FALSE}
install.packages("taxizedb")
```

dev version

```{r eval=FALSE}
remotes::install_github("ropensci/taxizedb")
```

## Citation

To cite taxizedb in publications use:

* Chamberlain S, Arendsee Z, Stirling T (2023). taxizedb: Tools for Working with 'Taxonomic' Databases. R package version 0.3.1. 

## Meta

* Please [report any issues, bugs or feature requests](https://github.com/ropensci/taxizedb/issues).
* License: MIT
* Get citation information for `taxizedb` in R with `citation(package = 'taxizedb')`
* Please note that this package is released with a [Contributor Code of Conduct](https://ropensci.org/code-of-conduct). By contributing to this project, you agree to abide by its terms.

[![ropensci](https://ropensci.org/public_images/github_footer.png)](https://ropensci.org)

        

Owner metadata


GitHub Events

Total
Last Year

Committers metadata

Last synced: 2 days ago

Total Commits: 290
Total Committers: 9
Avg Commits per committer: 32.222
Development Distribution Score (DDS): 0.521

Commits in past year: 22
Committers in past year: 1
Avg Commits per committer in past year: 22.0
Development Distribution Score (DDS) in past year: 0.0

Name Email Commits
Scott Chamberlain m****s@g****m 139
Tamas Stirling s****s@g****m 79
Zebulun Arendsee a****e@i****u 46
Carl Boettiger c****g@g****m 16
T D James t****1 4
Maëlle Salmon m****n@y****e 3
Rekyt m****e@e****r 1
Gaopeng Li l****c@g****m 1
rOpenSci Bot m****t@g****m 1

Committer domains:


Issue and Pull Request metadata

Last synced: 4 months ago

Total issues: 66
Total pull requests: 40
Average time to close issues: 11 months
Average time to close pull requests: 9 days
Total issue authors: 23
Total pull request authors: 9
Average comments per issue: 3.21
Average comments per pull request: 2.2
Merged pull request: 32
Bot issues: 0
Bot pull requests: 0

Past year issues: 5
Past year pull requests: 14
Past year average time to close issues: 5 months
Past year average time to close pull requests: about 9 hours
Past year issue authors: 2
Past year pull request authors: 1
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.21
Past year merged pull request: 13
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/ropensci/taxizedb

Top Issue Authors

  • sckott (28)
  • stitam (11)
  • KaiAragaki (4)
  • arendsee (3)
  • cboettig (2)
  • kwymangrothem (1)
  • sagesteppe (1)
  • dlebauer (1)
  • pederengelstad (1)
  • GossypiumH (1)
  • gpli (1)
  • andzandz11 (1)
  • brunobrr (1)
  • NgAMB (1)
  • twest820 (1)

Top Pull Request Authors

  • stitam (22)
  • arendsee (7)
  • sckott (3)
  • KaiAragaki (2)
  • cboettig (2)
  • kylebuscaglia (1)
  • Rekyt (1)
  • gpli (1)
  • tdjames1 (1)

Top Issue Labels

  • bug (3)
  • data-source (2)
  • feature (1)

Top Pull Request Labels


Package metadata

proxy.golang.org: github.com/ropensci/taxizedb

cran.r-project.org: taxizedb

Offline Access to Taxonomic Databases

  • Homepage: https://docs.ropensci.org/taxizedb/
  • Documentation: http://cran.r-project.org/web/packages/taxizedb/taxizedb.pdf
  • Licenses: MIT + file LICENSE
  • Latest release: 0.3.2 (published 11 months ago)
  • Last Synced: 2026-05-09T13:08:13.677Z (3 days ago)
  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 10
  • Downloads: 316 Last month
  • Docker Downloads: 88,618
  • Rankings:
    • Docker downloads count: 0.044%
    • Dependent repos count: 9.238%
    • Forks count: 9.642%
    • Stargazers count: 9.709%
    • Average: 14.936%
    • Dependent packages count: 28.668%
    • Downloads: 32.316%
  • Maintainers (1)
conda-forge.org: r-taxizedb

  • Homepage: https://ropensci.github.io/taxizedb/, https://github.com/ropensci/taxizedb
  • Licenses: MIT
  • Latest release: 0.3.0 (published over 5 years ago)
  • Last Synced: 2026-03-22T00:22:35.818Z (about 2 months ago)
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 30,246 Total
  • Rankings:
    • Dependent repos count: 34.025%
    • Stargazers count: 43.389%
    • Average: 44.073%
    • Forks count: 47.702%
    • Dependent packages count: 51.175%

Dependencies

DESCRIPTION cran
  • DBI >= 0.6 imports
  • RSQLite >= 1.1.2 imports
  • curl >= 2.4 imports
  • dbplyr >= 1.0.0 imports
  • dplyr >= 0.7.0 imports
  • hoardr >= 0.1.0 imports
  • magrittr >= 1.5 imports
  • readr >= 1.1.1 imports
  • rlang * imports
  • tibble * imports
  • taxize * suggests
  • testthat * suggests
.github/workflows/R-check.yaml actions
  • actions/cache v2 composite
  • actions/checkout v2 composite
  • actions/upload-artifact v2 composite
  • r-lib/actions/setup-pandoc v1 composite
  • r-lib/actions/setup-r v1 composite
.github/workflows/build-docs.yml actions
  • actions/cache v1 composite
  • actions/checkout v2 composite
  • r-lib/actions/setup-pandoc master composite
  • r-lib/actions/setup-r master composite
.circleci/Dockerfile docker
  • rocker/tidyverse latest build
docker-compose.yml docker
  • mariadb latest
  • postgres latest

Score: 17.797738500031542