tidytext: Text Mining and Analysis Using Tidy Data Principles in R - Published in JOSS (2016)
https://github.com/juliasilge/tidytext
Keywords
natural-language-processing r text-mining tidy-data tidyverse
Keywords from Contributors
setup ggplot-extension parsing strings package-creation fwf data-manipulation regular-expression plot rmarkdown
Last synced: 2 months ago
Acceptance Criteria
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
Repository metadata
Text mining using tidy tools :sparkles::page_facing_up::sparkles:
- Host: GitHub
- URL: https://github.com/juliasilge/tidytext
- Owner: juliasilge
- License: other
- Created: 2016-03-31T18:51:39.000Z (almost 10 years ago)
- Default Branch: main
- Last Pushed: 2025-07-25T18:08:19.000Z (5 months ago)
- Last Synced: 2025-10-16T22:15:23.246Z (3 months ago)
- Topics: natural-language-processing, r, text-mining, tidy-data, tidyverse
- Language: R
- Homepage: https://juliasilge.github.io/tidytext/
- Size: 130 MB
- Stars: 1,196
- Watchers: 63
- Forks: 182
- Open Issues: 9
- Releases: 0
- Metadata Files:
  - Readme: README.Rmd
  - Changelog: NEWS.md
  - License: LICENSE
GitHub Events
Total
- Issues event: 5
- Watch event: 28
- Delete event: 1
- Issue comment event: 20
- Push event: 6
- Pull request event: 3
- Fork event: 4
- Create event: 2
Last Year
- Issues event: 5
- Watch event: 26
- Delete event: 1
- Issue comment event: 20
- Push event: 6
- Pull request event: 3
- Fork event: 3
- Create event: 2
Committers metadata
Last synced: 2 months ago
Total Commits: 716
Total Committers: 33
Avg Commits per committer: 21.697
Development Distribution Score (DDS): 0.268
Commits in past year: 3
Committers in past year: 1
Avg Commits per committer in past year: 3.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Email | Commits |
|---|---|---|
| Julia Silge | j****e@g****m | 524 |
| Dave Robinson | d****n@s****m | 54 |
| dgrtwo | d****o@p****u | 38 |
| Colin | c****n@t****r | 23 |
| Julia Silge | j****e@s****m | 17 |
| Oliver Keyes | i****s@g****m | 7 |
| Kenneth Benoit | k****t@l****k | 7 |
| Emil Hvitfeldt | e****t@g****m | 6 |
| Timothy Mastny | t****y@g****m | 6 |
| Jeff Erickson | j****f@e****o | 3 |
| Jim Hester | j****r@g****m | 3 |
| kanishkamisra | m****e@g****m | 3 |
| David Robinson | a****d@g****m | 2 |
| Lionel Henry | l****y@g****m | 2 |
| Luis de Sousa | l****d@s****a | 2 |
| aedobbyn | a****1@g****m | 2 |
| seankross | s****0@g****m | 1 |
| olivroy | 5****y | 1 |
| jonmcalder | j****r@g****m | 1 |
| Y. Yu | 5****e | 1 |
| Vincent Arel-Bundock | v****k@u****a | 1 |
| Seth Berry | s****y@n****u | 1 |
| Ramnath Vaidyanathan | r****a@g****m | 1 |
| Michael Chirico | m****4@g****m | 1 |
| Lincoln Mullen | l****n@l****m | 1 |
| Jonathan Völkle | 3****e | 1 |
| Jenny Bryan | j****n@g****m | 1 |
| James Keirstead | j****d@g****m | 1 |
| Erwan Le Pennec | l****c@g****m | 1 |
| Dave Childers | c****e@g****m | 1 |
| and 3 more... | ||
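
The committer statistics above can be reproduced from the commit counts. The sketch below assumes that the Development Distribution Score is defined as one minus the top committer's share of all commits; this page does not state the formula, but that definition matches the reported values. The function name is hypothetical and chosen for illustration only.

```python
def development_distribution_score(top_committer_commits: int, total_commits: int) -> float:
    """Assumed definition: 1 - (commits by the top committer / total commits)."""
    if total_commits <= 0:
        # No commits in the window: treat the score as 0.0 (an assumption).
        return 0.0
    return 1 - top_committer_commits / total_commits


# All-time figures from the listing: 716 commits, 33 committers,
# and 524 commits attributed to the top committer identity.
all_time_dds = development_distribution_score(524, 716)
avg_commits = 716 / 33

# Past year: 3 commits, all by a single committer.
past_year_dds = development_distribution_score(3, 3)

print(round(all_time_dds, 3))   # 0.268
print(round(avg_commits, 3))    # 21.697
print(round(past_year_dds, 3))  # 0.0
```

Under this assumed definition, a score near 0 means one committer accounts for nearly all commits, which is consistent with the past-year value of 0.0 for a single active committer.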
Committer domains:
- stackoverflow.com: 2
- theathletic.com: 1
- lincolnmullen.com: 1
- nd.edu: 1
- umontreal.ca: 1
- syeop.co.za: 1
- erick.so: 1
- lse.ac.uk: 1
- thinkr.fr: 1
- princeton.edu: 1
Issue and Pull Request metadata
Last synced: 4 months ago
Total issues: 93
Total pull requests: 21
Average time to close issues: about 1 month
Average time to close pull requests: 4 days
Total issue authors: 61
Total pull request authors: 10
Average comments per issue: 3.78
Average comments per pull request: 2.19
Merged pull request: 18
Bot issues: 0
Bot pull requests: 0
Past year issues: 3
Past year pull requests: 3
Past year average time to close issues: N/A
Past year average time to close pull requests: 19 minutes
Past year issue authors: 3
Past year pull request authors: 1
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.0
Past year merged pull request: 1
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- juliasilge (16)
- dgrtwo (8)
- TheOne000 (5)
- nabsiddiqui (3)
- petereckley (3)
- MichaelChirico (2)
- Ironholds (2)
- kjmobile (1)
- yli74 (1)
- jirkalewandowski (1)
- 1danjordan (1)
- kbenoit (1)
- ariespirgel (1)
- dan-reznik (1)
- twedl (1)
Top Pull Request Authors
- juliasilge (10)
- kbenoit (3)
- olivroy (2)
- seankross (1)
- AmeliaMN (1)
- jimhester (1)
- jonathanvoelkle (1)
- arfon (1)
- davechilders (1)
- MichaelChirico (1)
Top Issue Labels
- feature (2)
Top Pull Request Labels
Package metadata
- Total packages: 3
- Total downloads:
  - cran: 68,363 last month
- Total docker downloads: 142,547
- Total dependent packages: 71 (may contain duplicates)
- Total dependent repositories: 195 (may contain duplicates)
- Total versions: 51
- Total maintainers: 1
cran.r-project.org: tidytext
Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools
- Homepage: https://juliasilge.github.io/tidytext/
- Documentation: http://cran.r-project.org/web/packages/tidytext/tidytext.pdf
- Licenses: MIT + file LICENSE
- Latest release: 0.4.3 (published 5 months ago)
- Last Synced: 2025-10-26T03:11:20.161Z (2 months ago)
- Versions: 15
- Dependent Packages: 66
- Dependent Repositories: 194
- Downloads: 68,363 Last month
- Docker Downloads: 142,547
- Rankings:
  - Stargazers count: 0.221%
  - Forks count: 0.307%
  - Average: 1.271%
  - Dependent repos count: 1.333%
  - Dependent packages count: 1.428%
  - Downloads: 2.038%
  - Docker downloads count: 2.296%
- Maintainers (1)
proxy.golang.org: github.com/juliasilge/tidytext
- Homepage:
- Documentation: https://pkg.go.dev/github.com/juliasilge/tidytext#section-documentation
- Licenses: other
- Latest release: v0.4.3 (published 5 months ago)
- Last Synced: 2025-10-26T03:10:52.563Z (2 months ago)
- Versions: 22
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
  - Dependent packages count: 5.459%
  - Average: 5.642%
  - Dependent repos count: 5.825%
conda-forge.org: r-tidytext
- Homepage: http://github.com/juliasilge/tidytext
- Licenses: MIT
- Latest release: 0.3.4 (published over 3 years ago)
- Last Synced: 2025-10-22T01:10:02.037Z (2 months ago)
- Versions: 14
- Dependent Packages: 5
- Dependent Repositories: 1
- Rankings:
  - Dependent packages count: 10.412%
  - Stargazers count: 11.874%
  - Forks count: 12.972%
  - Average: 14.84%
  - Dependent repos count: 24.103%
Dependencies
- R >= 2.10 depends
- Matrix * imports
- dplyr * imports
- generics * imports
- hunspell * imports
- janeaustenr * imports
- lifecycle * imports
- methods * imports
- purrr >= 0.1.1 imports
- rlang >= 0.4.10 imports
- stringr * imports
- tibble * imports
- tokenizers * imports
- vctrs * imports
- NLP * suggests
- broom * suggests
- covr * suggests
- data.table * suggests
- ggplot2 * suggests
- knitr * suggests
- mallet * suggests
- quanteda * suggests
- readr * suggests
- reshape2 * suggests
- rmarkdown * suggests
- scales * suggests
- stm * suggests
- stopwords * suggests
- testthat >= 2.1.0 suggests
- textdata * suggests
- tidyr * suggests
- tm * suggests
- topicmodels * suggests
- vdiffr * suggests
- wordcloud * suggests
- actions/checkout v2 composite
- r-lib/actions/check-r-package v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- actions/checkout v2 composite
- r-lib/actions/check-r-package v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- dessant/lock-threads v2 composite
- JamesIves/github-pages-deploy-action 4.1.4 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- actions/checkout v2 composite
- r-lib/actions/pr-fetch v2 composite
- r-lib/actions/pr-push v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
Score: 22.85154942002146