text2sdg
Multi-system analysis package that identifies SDGs in text, opening up the opportunity to monitor any type of text-based data.
https://github.com/dwulff/text2sdg
Category: Sustainable Development
Sub Category: Sustainable Development Goals
Keywords
natural-language-processing sustainability sustainable-development sustainable-development-goals
Last synced: about 18 hours ago
JSON representation
Repository metadata
Detect UN Sustainable Development Goals in Text
- Host: GitHub
- URL: https://github.com/dwulff/text2sdg
- Owner: dwulff
- Created: 2021-04-28T23:26:35.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2024-09-08T19:43:46.000Z (8 months ago)
- Last Synced: 2025-04-17T22:07:24.156Z (9 days ago)
- Topics: natural-language-processing, sustainability, sustainable-development, sustainable-development-goals
- Language: R
- Homepage: https://text2sdg.io
- Size: 20.9 MB
- Stars: 19
- Watchers: 2
- Forks: 4
- Open Issues: 3
- Releases: 0
-
Metadata Files:
- Readme: README.md
- Changelog: NEWS.md
README.md
text2sdg
The United Nations’ Sustainable Development Goals (SDGs) have become an important guideline for higher-education and research institutions to monitor and plan their contributions to social, economic, and environmental transformations.
The text2sdg
package is the first open-source, multi-system analysis package that identifies SDGs in text, opening up the opportunity to monitor any type of text-based data, including scientific output and corporate publications.
General Information
The text2sdg
package is developed by Dirk U. Wulff and Dominik S. Meier, with contributions from Rui Mata and the Center for Cognitive and Decision Sciences. It is published under the GNU General Public License.
An overview of the package can be accessed
online or from within R using ?text2sdg
.
Installation
The current stable version is available on CRAN and can be installed via install.packages("text2sdg")
.
The latest development version on GitHub can be installed via devtools::install_github("dwulff/text2sdg")
. Note that this requires prior installation of the devtools
package.
Usage
To identify SDGs in a series of documents, the user can choose between two approaches, an individual systems approach implementing six individual query systems and an ensemble approach powered by machine learning that integrates these systems. It is recommended to use the more accurate and bias-free ensemble approach (see Wulff, Meier, & Mata, 2024).
# vector of texts
texts = c("This is text 1", "This is text 2")
# individual systems approach
hits = detect_sdg_systems(texts)
# ensemble approach
hits = detect_sdg(texts)
For a complete tutorial on the use of the package, visit
this page or call vignette("text2sdg")
from within R.
Citation
If you use the text2sdg
package for published work, we kindly ask that you cite the package as follows:
Meier, D. S., Mata, R., & Wulff, D. U. (2021). text2sdg: An open-source solution to monitoring sustainable development goals from text. arXiv. https://arxiv.org/abs/2110.05856
Depending on the use of the package, also consider referencing the following article:
Wulff, D. U., Meier, D. S., & Mata, R. (2024). Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals. Sustainability Science. https://doi.org/10.1007/s11625-024-01516-3
Owner metadata
- Name: Dirk Wulff
- Login: dwulff
- Email:
- Kind: user
- Description: Cognitive, decision, and data science at University of Basel and MPI for Human Development, Berlin.
- Website: dirkwulff.org
- Location: Basel, Switzerland
- Twitter:
- Company: Center for Cognitive Decision Science, University of Basel
- Icon url: https://avatars.githubusercontent.com/u/10883558?u=228c85f8cd992e050e6cd26be90c31ec342eff3e&v=4
- Repositories: 6
- Last ynced at: 2023-03-10T11:35:55.124Z
- Profile URL: https://github.com/dwulff
GitHub Events
Total
- Issues event: 2
- Watch event: 3
Last Year
- Issues event: 2
- Watch event: 3
Committers metadata
Last synced: 6 days ago
Total Commits: 271
Total Committers: 4
Avg Commits per committer: 67.75
Development Distribution Score (DDS): 0.554
Commits in past year: 20
Committers in past year: 2
Avg Commits per committer in past year: 10.0
Development Distribution Score (DDS) in past year: 0.4
Name | Commits | |
---|---|---|
Dirk Wulff | d****f | 121 |
Dominik Meier | d****r@o****m | 119 |
Dominik Meier | d****r@u****h | 28 |
Dirk Wulff | d****f@M****l | 3 |
Committer domains:
- unibas.ch: 1
Issue and Pull Request metadata
Last synced: 2 days ago
Total issues: 5
Total pull requests: 1
Average time to close issues: 3 days
Average time to close pull requests: 3 days
Total issue authors: 4
Total pull request authors: 1
Average comments per issue: 0.8
Average comments per pull request: 0.0
Merged pull request: 1
Bot issues: 0
Bot pull requests: 0
Past year issues: 3
Past year pull requests: 0
Past year average time to close issues: 3 days
Past year average time to close pull requests: N/A
Past year issue authors: 2
Past year pull request authors: 0
Past year average comments per issue: 0.67
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- krlmlr (2)
- DataStrategist (1)
- robertrentzsch (1)
- grlju (1)
Top Pull Request Authors
- grlju (1)
Top Issue Labels
- enhancement (1)
- bug (1)
Top Pull Request Labels
Package metadata
- Total packages: 1
-
Total downloads:
- cran: 1,409 last-month
- Total docker downloads: 48
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 9
- Total maintainers: 1
cran.r-project.org: text2sdg
Detecting UN Sustainable Development Goals in Text
- Homepage: https://github.com/dwulff/text2sdg
- Documentation: http://cran.r-project.org/web/packages/text2sdg/text2sdg.pdf
- Licenses: GPL-3
- Latest release: 1.1.1 (published about 2 years ago)
- Last Synced: 2025-04-25T12:30:56.210Z (2 days ago)
- Versions: 9
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 1,409 Last month
- Docker Downloads: 48
-
Rankings:
- Forks count: 17.77%
- Stargazers count: 18.704%
- Downloads: 24.859%
- Average: 25.317%
- Dependent packages count: 29.797%
- Dependent repos count: 35.455%
- Maintainers (1)
Dependencies
- R >= 3.5.0 depends
- corpustools >= 0.4.2 imports
- dplyr * imports
- ggplot2 * imports
- lifecycle * imports
- magrittr * imports
- stringr * imports
- tibble * imports
- tidyr * imports
- knitr * suggests
- rmarkdown * suggests
- testthat * suggests
Score: 11.762843363000991