OpenRefine-ecology
Data Cleaning with OpenRefine for Ecologists.
https://github.com/datacarpentry/OpenRefine-ecology-lesson
Category: Sustainable Development
Sub Category: Education
Keywords
carpentries data-carpentry data-cleaning data-management ecology english lesson open-educational-resources openrefine stable
Keywords from Contributors
spreadsheet data-wrangling carpentries-incubator workshop beta carpentries-lab metagenomics puerto-rico tibfds sustainable-software
Last synced: about 20 hours ago
JSON representation
Repository metadata
Data Cleaning with OpenRefine for Ecologists
- Host: GitHub
- URL: https://github.com/datacarpentry/OpenRefine-ecology-lesson
- Owner: datacarpentry
- License: other
- Created: 2015-04-04T12:32:31.000Z (about 10 years ago)
- Default Branch: main
- Last Pushed: 2025-05-27T00:32:44.000Z (11 days ago)
- Last Synced: 2025-05-31T10:38:14.881Z (7 days ago)
- Topics: carpentries, data-carpentry, data-cleaning, data-management, ecology, english, lesson, open-educational-resources, openrefine, stable
- Homepage: https://datacarpentry.org/OpenRefine-ecology-lesson/
- Size: 19.2 MB
- Stars: 27
- Watchers: 16
- Forks: 113
- Open Issues: 17
- Releases: 2
-
Metadata Files:
- Readme: README.md
- Changelog: NEWS.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.md
- Code of conduct: CODE_OF_CONDUCT.md
- Citation: CITATION
- Authors: AUTHORS
- Zenodo: .zenodo.json
README.md
OpenRefine-ecology
Data Cleaning with OpenRefine for Ecologists Lesson for Data Carpentry
OpenRefine Version
The current version has been tested with OpenRefine 3.7.2 on May 2023.
Data set notes
- This data set is derived from The Portal Project Long-term desert ecology project data. This data file was downloaded and then modified specifically for use with OpenRefine.
- Taxon names were put back into the file.
- The number of rows was reduced to simplify the reconciliation and URL parsing exercises.
- These modifications were made in order to illustrate some features of Open Refine.
- Errors were added to the taxon names (
scientificName
field), to demonstrate OpenRefine's ability to find likely mis-entered data. - These errors can be found using clustering algorithms on the
scientificName
column, showing the power of the algorithms to find discrepancies quickly and making it simple to fix all issues found.
- Errors were added to the taxon names (
Contributing
We welcome all contributions to improve the lesson! Maintainers will do their best to help you if you have any questions, concerns, or experience any difficulties along the way.
We'd like to ask you to familiarize yourself with our Contribution Guide.
Please see the current list of issues for ideas for contributing to this repository. For making your contribution, we use the GitHub flow, which is nicely explained in the chapter Contributing to a Project in Pro Git by Scott Chacon.
Look for the tag . This indicates that the maintainers will welcome a pull request fixing this issue.
Maintainers
Current Maintainers
- Luis J. Villanueva ([email protected])
Past Authors and Maintainers
- Abigail Cabunoc
- Aleksandra Nenadic
- April M. Wright
- Betty Rozum
- Bill Mills
- Brian Yandell
- C. Titus Brown
- Cam Macdonell
- Dan Mazur
- Debbie Paul
- Erin Becker
- Francois Michonneau
- Gabriel A. Devenyi
- Greg Wilson
- Hilmar Lapp
- Hugo Tavares
- Ian Carroll
- James Allen
- James Mickley
- Jeffrey W. Hollister
- Jon Pipitone
- Jonah Duckles
- Kari L. Jordan
- Lisa Zilinski
- Maxim Belkin
- Michael Hansen
- Nick Young
- Piotr Banaszkiewicz
- Raniere Silva
- Ross Dickson
- Ryan E. Johnson
- Rémi Emonet
- Timothée Poisot
- Tracy Teal
- W. Trevor King
- Zack Brym
- dlstrong
- evanwill
- trelogan
See the Authors page for details.
Citation (CITATION)
Please cite as: Deborah Paul and Cam Macdonell (eds): "Data Carpentry: Data Cleaning with OpenRefine Ecology lesson." Version 2017.04.0, April 2017, http://www.datacarpentry.org/OpenRefine-ecology-lesson/, FIXME: Add Zenodo DOI.
Owner metadata
- Name: Data Carpentry
- Login: datacarpentry
- Email: [email protected]
- Kind: organization
- Description: Workshops teaching scientists basic skills for retrieving, viewing, managing, and manipulating data in an open and reproducible way.
- Website: https://datacarpentry.org/
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/6666450?v=4
- Repositories: 89
- Last ynced at: 2023-03-13T13:55:49.004Z
- Profile URL: https://github.com/datacarpentry
GitHub Events
Total
- Issues event: 2
- Watch event: 2
- Delete event: 6
- Issue comment event: 20
- Push event: 42
- Pull request review event: 1
- Pull request event: 14
- Fork event: 3
- Create event: 9
Last Year
- Issues event: 2
- Watch event: 2
- Delete event: 6
- Issue comment event: 20
- Push event: 42
- Pull request review event: 1
- Pull request event: 14
- Fork event: 3
- Create event: 9
Committers metadata
Last synced: over 1 year ago
Total Commits: 400
Total Committers: 66
Avg Commits per committer: 6.061
Development Distribution Score (DDS): 0.698
Commits in past year: 80
Committers in past year: 8
Avg Commits per committer in past year: 10.0
Development Distribution Score (DDS) in past year: 0.175
Name | Commits | |
---|---|---|
Luis J. Villanueva | v****l@s****u | 121 |
Erin Becker | e****r@g****m | 104 |
Debbie Paul | d****l@f****u | 23 |
Tracy Teal | t****t@i****g | 20 |
Francois Michonneau | f****u@g****m | 16 |
Dorothea Salo | 2****o | 7 |
Brian Yandell | b****l@b****t | 6 |
maneesha sane | a****m@g****m | 5 |
Kari L. Jordan | k****n@m****m | 5 |
Dan Mazur | Q****n | 4 |
Zhian N. Kamvar | z****r@g****m | 4 |
JoshuaDull | j****l@g****m | 4 |
Jeanine Finn | j****n@u****u | 4 |
Anelda van der Walt | a****a | 3 |
David LeBauer | d****r@g****m | 3 |
marijane white | m****e | 3 |
Nicola Soranzo | n****o@g****m | 3 |
Paul R. Pival | p****l@u****a | 3 |
Kari L. Jordan | k****n@c****g | 3 |
Paul R. Pival | p****l@g****m | 3 |
Toby Hodges | t****s@g****m | 2 |
kdmclean | 3****n | 2 |
Jeffrey W. Hollister | j****r@g****m | 2 |
carmi cronje | c****e | 2 |
Ross Dickson | r****n@d****a | 2 |
Trevor Keller | t****r@n****v | 2 |
Ben Companjen | b****n@c****e | 2 |
C. Titus Brown | t****s@i****g | 2 |
Aleksandra Nenadic | a****c@m****k | 2 |
Phillip Doehle | d****e@o****u | 2 |
and 36 more... |
Committer domains:
- carpentries.org: 3
- idyll.org: 2
- manchester.ac.uk: 2
- si.edu: 1
- fsu.edu: 1
- brians-mbp.attlocal.net: 1
- me.com: 1
- utexas.edu: 1
- ucalgary.ca: 1
- dal.ca: 1
- nist.gov: 1
- companjen.name: 1
- okstate.edu: 1
- ucdavis.edu: 1
- duckles.org: 1
- nyu.edu: 1
- wisc.edu: 1
- austin.utexas.edu: 1
- umd.edu: 1
- illinois.edu: 1
- rgaiacs.com: 1
- uta.edu: 1
Issue and Pull Request metadata
Last synced: 2 days ago
Total issues: 76
Total pull requests: 63
Average time to close issues: about 3 years
Average time to close pull requests: 6 months
Total issue authors: 42
Total pull request authors: 28
Average comments per issue: 2.2
Average comments per pull request: 0.98
Merged pull request: 49
Bot issues: 0
Bot pull requests: 0
Past year issues: 1
Past year pull requests: 19
Past year average time to close issues: N/A
Past year average time to close pull requests: 2 days
Past year issue authors: 1
Past year pull request authors: 5
Past year average comments per issue: 0.0
Past year average comments per pull request: 1.26
Past year merged pull request: 10
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- ErinBecker (11)
- maneesha (10)
- tracykteal (8)
- villanueval (5)
- cmacdonell (2)
- anenadic (2)
- tobyhodges (2)
- cengel (2)
- jifar (1)
- nmwolf (1)
- debpaul (1)
- kta65 (1)
- bscheffler (1)
- MikeTrizna (1)
- mkweskin (1)
Top Pull Request Authors
- villanueval (11)
- carpentries-bot (9)
- jas58 (5)
- marijane (5)
- tajuakins (4)
- maneesha (4)
- tobyhodges (3)
- ppival (2)
- Stonepeople (1)
- sstevens2 (1)
- kdmclean (1)
- jellenf (1)
- lyndamk (1)
- jwscutt (1)
- aprovoNYU (1)
Top Issue Labels
- type:enhancement (22)
- type:clarification (16)
- good first issue (6)
- type:template and tools (3)
- help wanted (3)
- status:in progress (2)
- type:instructor guide (1)
- status:refer to cac (1)
- openrefine-3.5.0 (1)
- type:formatting (1)
- type:discussion (1)
- openrefine-3.4.1 (1)
Top Pull Request Labels
- type: template and tools (9)
- type:enhancement (2)
- status:in progress (2)
- type:template and tools (1)
- status:waiting for response (1)
- type:clarification (1)
Dependencies
- actions/upload-artifact v3 composite
- actions/checkout v3 composite
- carpentries/actions/check-valid-pr main composite
- carpentries/actions/comment-diff main composite
- carpentries/actions/download-workflow-artifact main composite
- carpentries/actions/download-workflow-artifact main composite
- carpentries/actions/remove-branch main composite
- carpentries/actions/check-valid-pr main composite
- carpentries/actions/comment-diff main composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- carpentries/actions/check-valid-pr main composite
- carpentries/actions/setup-lesson-deps main composite
- carpentries/actions/setup-sandpaper main composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- actions/checkout v3 composite
- carpentries/actions/setup-lesson-deps main composite
- carpentries/actions/setup-sandpaper main composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- actions/checkout v3 composite
- carpentries/actions/check-valid-credentials main composite
- carpentries/actions/update-lockfile main composite
- carpentries/create-pull-request main composite
- r-lib/actions/setup-r v2 composite
- actions/checkout v3 composite
- carpentries/actions/check-valid-credentials main composite
- carpentries/actions/update-workflows main composite
- carpentries/create-pull-request main composite
Score: 7.973844375944687