Learning from Crowds with Crowd-Kit - Published in JOSS (2024)
https://github.com/toloka/crowd-kit

Keywords

aggregations annotation crowd crowdsourcing data-mining data-science labeling python quality-control toloka truth-inference

Keywords from Contributors

distributed transforms measur cloud-native dbms distributed-database archiving projection animals optimize

Last synced: 2 months ago

Repository metadata

Control the quality of your labeled data with the Python tools you already know.


Committers metadata

Last synced: 2 months ago

Total Commits: 376
Total Committers: 28
Avg Commits per committer: 13.429
Development Distribution Score (DDS): 0.614

Commits in past year: 20
Committers in past year: 4
Avg Commits per committer in past year: 5.0
Development Distribution Score (DDS) in past year: 0.4
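The averages and scores above can be reproduced from the per-committer commit counts listed in the table that follows. This is a minimal sketch, assuming that DDS is defined as one minus the share of commits made by the most active committer; this page does not state that definition, and the function name is hypothetical.

```python
# Commit counts copied from the committer table on this page (28 rows).
commits = [145, 37, 26, 22, 21, 19, 19, 15, 15, 12, 10, 7, 5, 3, 3, 3, 2, 2] + [1] * 10


def development_distribution_score(counts):
    """Assumed definition: 1 - (commits by the top committer / total commits)."""
    return 1 - max(counts) / sum(counts)


total = sum(commits)                           # total commits
committers = len(commits)                      # total committers
avg = total / committers                       # average commits per committer
dds = development_distribution_score(commits)  # score under the assumed definition

print(total, committers, round(avg, 3), round(dds, 3))  # 376 28 13.429 0.614
```

Under the same assumption, the past-year score of 0.4 over 20 commits would correspond to the most active committer authoring 12 of them.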

Name                Email          Commits
Dmitry Ustalov      d****v@g****m  145
Nikita Pavlichenko  p****v@p****u  37
Sukhorosov Aleksey  a****v@g****m  26
Denis Fraltsov      8****t         22
Mathew Shen         d****r@g****m  21
Vladimir Losev      m****k@g****m  19
Alisa               a****a@g****m  19
Natalia             n****6@y****u  15
Stepan Nosov        n****8@y****u  15
Daniil Likhobaba    l****p@p****u  12
dependabot[bot]     4****]         10
pavlichenko         p****o@y****u  7
DrhF                m****w@g****m  5
Evgeny Tulin        t****v@y****u  3
Alexander Vnuchkov  a****v@y****u  3
vlad-mois           v****s@y****u  3
Tahar Allouche      t****o@g****m  2
mr-fedulow          m****w@y****u  2
shadchin            s****n@y****u  1
Daniil Likhobaba    l****p@y****u  1
btseytlin           b****n@y****u  1
dsamuylov           d****v@y****u  1
gilyazev-yu         g****u@y****u  1
arcadia-devtools    a****s@y****u  1
Pavel Gein          p****n@y****u  1
Iulian Giliazev     g****a@p****u  1
Artem Grigorev      o****j@t****i  1
Aleksandr Dremov    d****e@g****m  1

Issue and Pull Request metadata

Last synced: 3 months ago

Total issues: 26
Total pull requests: 127
Average time to close issues: about 1 month
Average time to close pull requests: 9 days
Total issue authors: 15
Total pull request authors: 18
Average comments per issue: 2.27
Average comments per pull request: 1.44
Merged pull requests: 115
Bot issues: 0
Bot pull requests: 13

Past year issues: 5
Past year pull requests: 15
Past year average time to close issues: 14 days
Past year average time to close pull requests: 17 days
Past year issue authors: 4
Past year pull request authors: 4
Past year average comments per issue: 1.4
Past year average comments per pull request: 0.6
Past year merged pull requests: 13
Past year bot issues: 0
Past year bot pull requests: 5

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/toloka/crowd-kit

Top Issue Authors

  • shenxiangzhuang (6)
  • LydiaMak (3)
  • jcklie (3)
  • pilot7747 (2)
  • Senarect (2)
  • ahundt (1)
  • alexdremov (1)
  • TanVD (1)
  • takumi1001 (1)
  • Mind-the-Cap (1)
  • vikasraykar (1)
  • Marceau-h (1)
  • taharallouche (1)
  • johann-petrak (1)
  • amine-boukriba (1)

Top Pull Request Authors

  • shenxiangzhuang (29)
  • dustalov (26)
  • pilot7747 (17)
  • dependabot[bot] (13)
  • Losik (5)
  • Natalyl3 (5)
  • aliskin (4)
  • Senarect (4)
  • taharallouche (4)
  • alexdrydew (4)
  • DrhF (3)
  • varfolomeii (3)
  • alexandervnuchkov (3)
  • denaxen (2)
  • ortemij (2)

Top Issue Labels

  • bug (9)
  • documentation (8)
  • enhancement (7)
  • good first issue (2)

Top Pull Request Labels

  • dependencies (13)
  • enhancement (9)
  • documentation (7)
  • github_actions (5)
  • good first issue (3)
  • bug (1)
  • duplicate (1)

Package metadata

pypi.org: crowd-kit

Computational Quality Control for Crowdsourcing

  • Homepage: https://github.com/Toloka/crowd-kit
  • Documentation: https://crowd-kit.readthedocs.io/
  • Licenses: Apache-2.0
  • Latest release: 1.4.2 (published 3 months ago)
  • Last Synced: 2025-10-26T00:34:17.159Z (2 months ago)
  • Versions: 24
  • Dependent Packages: 3
  • Dependent Repositories: 5
  • Downloads: 5,565 last month
  • Rankings:
    • Dependent packages count: 2.328%
    • Downloads: 4.424%
    • Stargazers count: 5.338%
    • Average: 5.665%
    • Dependent repos count: 6.66%
    • Forks count: 9.574%
  • Maintainers (2)
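
The "Average" ranking listed above is consistent with a plain arithmetic mean of the five individual ranking percentages. The sketch below checks that arithmetic; treating the average as an unweighted mean is an assumption, and the dictionary keys are illustrative names rather than fields of any API.

```python
# Ranking percentages copied from the list above.
rankings = {
    "dependent_packages_count": 2.328,
    "downloads": 4.424,
    "stargazers_count": 5.338,
    "dependent_repos_count": 6.66,
    "forks_count": 9.574,
}

# Unweighted mean of the five rankings (assumed to be how "Average" is derived).
average = sum(rankings.values()) / len(rankings)

print(round(average, 3))  # 5.665
```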

Dependencies

setup.py pypi
  • attrs *
  • nltk *
  • numpy *
  • pandas *
  • scikit-learn *
  • tqdm *
  • transformers *
.github/workflows/release.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • peter-evans/create-pull-request v4 composite
.github/workflows/tests.yml actions
  • actions/checkout v3 composite
  • actions/setup-node v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
  • citation-file-format/cffconvert-github-action 2.0.0 composite
Pipfile pypi
  • build * develop
  • codecov * develop
  • flake8 * develop
  • ipywidgets * develop
  • mypy * develop
  • notebook * develop
  • pytest * develop
  • stubmaker * develop
  • twine * develop
  • crowd-kit *
pyproject.toml pypi

Score: 17.41783762064768