Carob

Create reproducible workflows that reshape primary agricultural research data from experiments and surveys into a standard format, and to aggregate individual data sets into larger collections that can be used in further research.
https://github.com/carob-data/carob

Category: Consumption
Sub Category: Agriculture and Nutrition

Last synced: about 22 hours ago
JSON representation

Repository metadata

Carob: standardizing agricultural research data

README.md

Carob

Carob creates reproducible workflows that standardize primary agricultural research data from experiments and surveys. Standardization includes the use of a common file format, variable names, units and accepted values according to the terminag standard. Standardized data sets are aggregated into larger collections that can be used in further research. We do this by writing an R script for each individual dataset. See the website for more information.

Carob is an open access Extract, Transform, and Load (ETL) framework supported by CGIAR to support predictive analytics (machine learning, artifical intelligence) and other types of data analysis.

Contributions are welcome from anyone, and they can be made via pull-requests. Feel free to improve these scripts, or provide new ones. See the instructions on how to write a Carob script described here. You can also raise an issues. A good place to discover new data sets is the Gardian website or our to-do list.

Get the data

Standardized data can be downloaded from carob-data.org (data with a CC license only), or with R package caramba.

You can also compile your own version by cloning this repo and running

remotes::install_github("carob-data/carobiner")
ff <- carobiner::make_carob(path)

where path is the folder of the cloned repo (e.g. "d:/github/carob")


Owner metadata


GitHub Events

Total
Last Year

Committers metadata

Last synced: 3 days ago

Total Commits: 2,813
Total Committers: 29
Avg Commits per committer: 97.0
Development Distribution Score (DDS): 0.434

Commits in past year: 606
Committers in past year: 7
Avg Commits per committer in past year: 86.571
Development Distribution Score (DDS) in past year: 0.495

Name Email Commits
rhijmans r****s@g****m 1591
cedric Ngakou c****u@a****g 423
egbendito e****o@g****m 225
Cliffoe08 b****8@g****m 121
Mitchelle Njukuya m****a@g****m 78
Shumi01 x****a@g****m 56
efyrouwa e****g@g****m 49
Henry Juarez h****o@g****m 49
mukami3juma r****8@g****m 42
Fredy f****e@e****w 39
asila m****a@g****m 29
smkuhlani s****a@g****m 27
Fredy Chimire F****e@e****w 15
Fredy Chimire f****x@g****m 13
Andrew Sila a****a@c****g 10
Cirad Zim y****u@e****m 8
SasoA 4****A 7
unknown m****e@g****m 7
DZUDA b****a@c****g 6
NjoguM M****9@g****m 4
GacheriNturibi g****i@g****m 4
egbendito g****4 2
Céline Aubert a****e@y****m 2
Muthoni Gachoki g****i@g****m 1
pontesprates p****s@u****u 1
Siyabusa Mkuhlani s****a@S****l 1
Layal Atassi 3****i 1
J-MJohnson y****n@g****m 1
sgichu 1****u 1

Committer domains:


Issue and Pull Request metadata

Last synced: 3 days ago

Total issues: 9
Total pull requests: 277
Average time to close issues: about 20 hours
Average time to close pull requests: about 16 hours
Total issue authors: 2
Total pull request authors: 11
Average comments per issue: 0.33
Average comments per pull request: 0.84
Merged pull request: 230
Bot issues: 0
Bot pull requests: 0

Past year issues: 7
Past year pull requests: 205
Past year average time to close issues: about 20 hours
Past year average time to close pull requests: about 18 hours
Past year issue authors: 2
Past year pull request authors: 7
Past year average comments per issue: 0.43
Past year average comments per pull request: 0.8
Past year merged pull request: 161
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/carob-data/carob

Top Issue Authors

  • rhijmans (7)
  • cedricngakou (2)

Top Pull Request Authors

  • cedricngakou (177)
  • Cliffoe08 (49)
  • henryjuarez (15)
  • mitchynjukuya (11)
  • egbendito (9)
  • Shumi01 (6)
  • 19950606 (4)
  • rhijmans (3)
  • GacheriNturibi (1)
  • Muthono19 (1)
  • smkuhlani (1)

Top Issue Labels

Top Pull Request Labels

Score: 6.699500340161678