Learning from Crowds with Crowd-Kit
Learning from Crowds with Crowd-Kit - Published in JOSS (2024)
https://github.com/toloka/crowd-kit
Keywords
aggregations annotation crowd crowdsourcing data-mining data-science labeling python quality-control toloka truth-inference
Keywords from Contributors
distributed transforms measur cloud-native dbms distributed-database archiving projection animals optimize
Last synced: 2 months ago
JSON representation
Acceptance Criteria
- Revelant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
Repository metadata
Control the quality of your labeled data with the Python tools you already know.
- Host: GitHub
- URL: https://github.com/toloka/crowd-kit
- Owner: Toloka
- License: other
- Created: 2021-03-01T23:02:08.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2025-10-13T11:23:01.000Z (3 months ago)
- Last Synced: 2025-10-26T00:33:47.654Z (2 months ago)
- Topics: aggregations, annotation, crowd, crowdsourcing, data-mining, data-science, labeling, python, quality-control, toloka, truth-inference
- Language: Python
- Homepage: https://crowd-kit.readthedocs.io/
- Size: 1.42 MB
- Stars: 233
- Watchers: 11
- Forks: 19
- Open Issues: 2
- Releases: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Citation: CITATION.cff
- Codeowners: .github/CODEOWNERS
- Authors: AUTHORS
Owner metadata
- Name: Toloka
- Login: Toloka
- Email: github@toloka.ai
- Kind: organization
- Description: Data labeling platform for ML
- Website: https://toloka.ai
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/76212487?v=4
- Repositories: 11
- Last ynced at: 2023-03-03T20:29:24.191Z
- Profile URL: https://github.com/Toloka
GitHub Events
Total
- Issues event: 5
- Watch event: 25
- Issue comment event: 10
- Push event: 11
- Pull request review event: 5
- Pull request event: 14
- Fork event: 3
- Create event: 3
Last Year
- Issues event: 5
- Watch event: 25
- Issue comment event: 10
- Push event: 11
- Pull request review event: 5
- Pull request event: 14
- Fork event: 3
- Create event: 3
Committers metadata
Last synced: 2 months ago
Total Commits: 376
Total Committers: 28
Avg Commits per committer: 13.429
Development Distribution Score (DDS): 0.614
Commits in past year: 20
Committers in past year: 4
Avg Commits per committer in past year: 5.0
Development Distribution Score (DDS) in past year: 0.4
| Name | Commits | |
|---|---|---|
| Dmitry Ustalov | d****v@g****m | 145 |
| Nikita Pavlichenko | p****v@p****u | 37 |
| Sukhorosov Aleksey | a****v@g****m | 26 |
| Denis Fraltsov | 8****t | 22 |
| Mathew Shen | d****r@g****m | 21 |
| Vladimir Losev | m****k@g****m | 19 |
| Alisa | a****a@g****m | 19 |
| Natalia | n****6@y****u | 15 |
| Stepan Nosov | n****8@y****u | 15 |
| Daniil Likhobaba | l****p@p****u | 12 |
| dependabot[bot] | 4****] | 10 |
| pavlichenko | p****o@y****u | 7 |
| DrhF | m****w@g****m | 5 |
| Evgeny Tulin | t****v@y****u | 3 |
| Alexander Vnuchkov | a****v@y****u | 3 |
| vlad-mois | v****s@y****u | 3 |
| Tahar Allouche | t****o@g****m | 2 |
| mr-fedulow | m****w@y****u | 2 |
| shadchin | s****n@y****u | 1 |
| Daniil Likhobaba | l****p@y****u | 1 |
| btseytlin | b****n@y****u | 1 |
| dsamuylov | d****v@y****u | 1 |
| gilyazev-yu | g****u@y****u | 1 |
| arcadia-devtools | a****s@y****u | 1 |
| Pavel Gein | p****n@y****u | 1 |
| Iulian Giliazev | g****a@p****u | 1 |
| Artem Grigorev | o****j@t****i | 1 |
| Aleksandr Dremov | d****e@g****m | 1 |
Committer domains:
- yandex-team.ru: 12
- phystech.edu: 3
- yandex.ru: 2
- toloka.ai: 1
Issue and Pull Request metadata
Last synced: 3 months ago
Total issues: 26
Total pull requests: 127
Average time to close issues: about 1 month
Average time to close pull requests: 9 days
Total issue authors: 15
Total pull request authors: 18
Average comments per issue: 2.27
Average comments per pull request: 1.44
Merged pull request: 115
Bot issues: 0
Bot pull requests: 13
Past year issues: 5
Past year pull requests: 15
Past year average time to close issues: 14 days
Past year average time to close pull requests: 17 days
Past year issue authors: 4
Past year pull request authors: 4
Past year average comments per issue: 1.4
Past year average comments per pull request: 0.6
Past year merged pull request: 13
Past year bot issues: 0
Past year bot pull requests: 5
Top Issue Authors
- shenxiangzhuang (6)
- LydiaMak (3)
- jcklie (3)
- pilot7747 (2)
- Senarect (2)
- ahundt (1)
- alexdremov (1)
- TanVD (1)
- takumi1001 (1)
- Mind-the-Cap (1)
- vikasraykar (1)
- Marceau-h (1)
- taharallouche (1)
- johann-petrak (1)
- amine-boukriba (1)
Top Pull Request Authors
- shenxiangzhuang (29)
- dustalov (26)
- pilot7747 (17)
- dependabot[bot] (13)
- Losik (5)
- Natalyl3 (5)
- aliskin (4)
- Senarect (4)
- taharallouche (4)
- alexdrydew (4)
- DrhF (3)
- varfolomeii (3)
- alexandervnuchkov (3)
- denaxen (2)
- ortemij (2)
Top Issue Labels
- bug (9)
- documentation (8)
- enhancement (7)
- good first issue (2)
Top Pull Request Labels
- dependencies (13)
- enhancement (9)
- documentation (7)
- github_actions (5)
- good first issue (3)
- bug (1)
- duplicate (1)
Package metadata
- Total packages: 1
-
Total downloads:
- pypi: 5,565 last-month
- Total dependent packages: 3
- Total dependent repositories: 5
- Total versions: 24
- Total maintainers: 2
pypi.org: crowd-kit
Computational Quality Control for Crowdsourcing
- Homepage: https://github.com/Toloka/crowd-kit
- Documentation: https://crowd-kit.readthedocs.io/
- Licenses: Apache-2.0
- Latest release: 1.4.2 (published 3 months ago)
- Last Synced: 2025-10-26T00:34:17.159Z (2 months ago)
- Versions: 24
- Dependent Packages: 3
- Dependent Repositories: 5
- Downloads: 5,565 Last month
-
Rankings:
- Dependent packages count: 2.328%
- Downloads: 4.424%
- Stargazers count: 5.338%
- Average: 5.665%
- Dependent repos count: 6.66%
- Forks count: 9.574%
- Maintainers (2)
Dependencies
- attrs *
- nltk *
- numpy *
- pandas *
- scikit-learn *
- tqdm *
- transformers *
- actions/checkout v3 composite
- actions/setup-python v4 composite
- peter-evans/create-pull-request v4 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/setup-python v4 composite
- actions/upload-artifact v3 composite
- citation-file-format/cffconvert-github-action 2.0.0 composite
- build * develop
- codecov * develop
- flake8 * develop
- ipywidgets * develop
- mypy * develop
- notebook * develop
- pytest * develop
- stubmaker * develop
- twine * develop
- crowd-kit *
Score: 17.41783762064768