etl

A compute graph for loading and transforming Our World in Data's data.
https://github.com/owid/etl

Category: Sustainable Development
Sub Category: Data Catalogs and Interfaces

Keywords from Contributors

coronavirus covid covid-19 sars-cov-2 co2-emissions greenhouse-gas-emissions

Last synced: 37 minutes ago
JSON representation

Repository metadata

A compute graph for loading and transforming OWID's data


Owner metadata


GitHub Events

Total
Last Year

Committers metadata

Last synced: 5 days ago

Total Commits: 13,570
Total Committers: 35
Avg Commits per committer: 387.714
Development Distribution Score (DDS): 0.534

Commits in past year: 3,307
Committers in past year: 16
Avg Commits per committer in past year: 206.688
Development Distribution Score (DDS) in past year: 0.436

Name Email Commits
owidbot t****h@o****g 6330
Pablo Rosado p****o@g****m 1962
Mojmir Vinkler m****r@g****m 1371
Lucas Rodés-Guirao l****s 1012
Fiona Spooner f****a@o****g 641
Pablo Arriagada p****p@g****m 569
veronikasamborska1994 3****4 509
Lars Yencken l****s@y****g 403
analytics a****s@o****g 219
Edouard Mathieu e****t@p****e 175
lucas rg l****g@k****e 78
Daniel Bachler d****l@d****e 64
Tuna Acisu 5****4 57
dependabot[bot] 4****] 42
Tuna Acisu t****u@o****m 40
Marcel Gerber m****9@g****m 19
Bobbie Macdonald b****d@g****m 16
Fiona Spooner f****r@f****e 12
Sophia Mersmann s****1@g****m 10
Billy Cox w****5@g****m 8
Pablo Rosado p****o@i****s 8
Martin Račák r****i 7
Bastian Herre b****n@o****g 2
Copilot 1****t 2
Ike Saunders 1****u 2
Matthieu Bergel m****u@o****g 2
William Cox w****x@c****u 2
tristannew t****w@y****m 1
Toni Sharpe t****3@g****m 1
Tuna Acisu t****u@g****l 1
and 5 more...

Committer domains:


Issue and Pull Request metadata

Last synced: 6 days ago

Total issues: 361
Total pull requests: 3,589
Average time to close issues: about 2 months
Average time to close pull requests: 6 days
Total issue authors: 20
Total pull request authors: 23
Average comments per issue: 2.16
Average comments per pull request: 1.05
Merged pull request: 2,512
Bot issues: 0
Bot pull requests: 111

Past year issues: 105
Past year pull requests: 1,491
Past year average time to close issues: 26 days
Past year average time to close pull requests: 4 days
Past year issue authors: 11
Past year pull request authors: 16
Past year average comments per issue: 1.51
Past year average comments per pull request: 1.19
Past year merged pull request: 965
Past year bot issues: 0
Past year bot pull requests: 3

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/owid/etl

Top Issue Authors

  • Marigold (98)
  • pabloarosado (84)
  • lucasrodes (66)
  • larsyencken (55)
  • spoonerf (32)
  • paarriagadap (10)
  • ikesau (2)
  • antea04 (2)
  • marcelgerber (1)
  • exploreriii (1)
  • jorisvandenbossche (1)
  • heliodoro8243 (1)
  • veronikasamborska1994 (1)
  • edomt (1)
  • danyx23 (1)

Top Pull Request Authors

  • lucasrodes (898)
  • Marigold (648)
  • pabloarosado (571)
  • spoonerf (466)
  • veronikasamborska1994 (405)
  • paarriagadap (268)
  • dependabot[bot] (111)
  • antea04 (86)
  • owidbot (43)
  • edomt (21)
  • danyx23 (17)
  • larsyencken (17)
  • sophiamersmann (10)
  • marcelgerber (9)
  • rakyi (5)

Top Issue Labels

  • needs triage (87)
  • priority 3 - nice to have (70)
  • wontfix (70)
  • priority 2 - important (59)
  • wizard (33)
  • bug (26)
  • metadata (21)
  • enhancement (11)
  • needs discussion (11)
  • pinned (9)
  • correctness (9)
  • data (9)
  • devex (7)
  • documentation (6)
  • priority 3 - low (4)
  • priority 1 - essential (4)
  • walkthrough (4)
  • maintenance (4)
  • workflow (3)
  • priority 2 - medium (3)
  • dependencies (2)
  • help wanted (2)
  • ci (2)
  • needs proposal (1)
  • ops (1)
  • tracking (1)
  • catalog (1)

Top Pull Request Labels

  • dependencies (111)
  • wontfix (38)
  • wizard (23)
  • data (18)
  • codex (11)
  • merge-schedule-failed (10)
  • enhancement (8)
  • bug (6)
  • documentation (6)
  • staging (5)
  • priority 3 - nice to have (3)
  • javascript (3)
  • devex (3)
  • metadata (3)
  • staging-bake (2)
  • pinned (2)
  • coderabbit (2)
  • external (1)
  • priority 2 - important (1)
  • needs triage (1)
  • catalog (1)
  • stale (1)
  • maintenance (1)

Dependencies

.github/workflows/project-automations.yml actions
  • andymckay/labeler e6c4322d0397f3240f0e7e30a33b5c5df2d39e90 composite
  • owid/actions/assign-priority main composite
  • owid/actions/set-project-status main composite
poetry.lock pypi
  • 300 dependencies
.github/workflows/auto-author-assign.yml actions
  • toshimaru/auto-author-assign v1.6.2 composite
etl/steps/archive/garden/papers/2022-11-04/riley_2005/meta.yml cpan
etl/steps/data/garden/demography/2022-12-08/population/meta.yml cpan
etl/steps/data/garden/demography/2023-03-31/population/meta.yml cpan
etl/steps/data/garden/papers/2022-11-03/zijdeman_et_al_2015/meta.yml cpan
etl/steps/data/garden/papers/2023-02-03/riley_2005/meta.yml cpan
lib/walden/ingests/papers/2022-11-01/riley_2005/meta.yml cpan
lib/walden/ingests/papers/2022-11-01/zijdeman_et_al_2015/meta.yml cpan
lib/catalog/poetry.lock pypi
  • appnope 0.1.3
  • argh 0.28.1
  • asttokens 2.2.1
  • attrs 23.1.0
  • backcall 0.2.0
  • black 23.7.0
  • boto3 1.28.8
  • botocore 1.31.8
  • certifi 2023.5.7
  • charset-normalizer 3.2.0
  • click 8.1.6
  • colorama 0.4.6
  • coverage 7.2.7
  • dataclasses-json 0.5.7
  • decorator 5.1.1
  • dynamic-yaml 1.3.4
  • exceptiongroup 1.1.2
  • executing 1.2.0
  • flake8 6.1.0
  • idna 3.4
  • importlib-resources 6.0.0
  • iniconfig 2.0.0
  • ipdb 0.13.13
  • ipython 8.12.2
  • isort 5.12.0
  • jedi 0.18.2
  • jmespath 1.0.1
  • jsonschema 4.18.4
  • jsonschema-specifications 2023.7.1
  • marshmallow 3.20.1
  • marshmallow-enum 1.5.1
  • matplotlib-inline 0.1.6
  • mccabe 0.7.0
  • mypy-extensions 1.0.0
  • nodeenv 1.8.0
  • numpy 1.24.4
  • owid-repack 0.1.2
  • packaging 23.1
  • pandas 1.5.3
  • pandas-stubs 1.2.0.62
  • parso 0.8.3
  • pathspec 0.11.1
  • pexpect 4.8.0
  • pickleshare 0.7.5
  • pkgutil-resolve-name 1.3.10
  • platformdirs 3.9.1
  • pluggy 1.2.0
  • prompt-toolkit 3.0.39
  • ptyprocess 0.7.0
  • pure-eval 0.2.2
  • pyarrow 12.0.1
  • pycodestyle 2.11.0
  • pyflakes 3.1.0
  • pygments 2.15.1
  • pyright 1.1.288
  • pytest 7.4.0
  • pytest-cov 4.1.0
  • python-dateutil 2.8.2
  • pytz 2023.3
  • pyyaml 6.0.1
  • referencing 0.30.0
  • requests 2.31.0
  • rpds-py 0.9.2
  • s3transfer 0.6.1
  • setuptools 68.0.0
  • six 1.16.0
  • stack-data 0.6.2
  • structlog 23.1.0
  • tomli 2.0.1
  • traitlets 5.9.0
  • typing-extensions 4.7.1
  • typing-inspect 0.9.0
  • unidecode 1.3.6
  • urllib3 1.26.16
  • watchdog 3.0.0
  • wcwidth 0.2.6
  • zipp 3.16.2
lib/catalog/pyproject.toml pypi
  • PyYAML >=5.4.1
  • Unidecode >=1.3.4
  • boto3 >=1.21.13
  • dataclasses-json >=0.5.6
  • dynamic-yaml ^1.3.4
  • ipdb >=0.13.9
  • jsonschema >=3.2.0
  • owid-repack >=0.1.1
  • pandas >=1.3.3,<2.0
  • pyarrow >=10.0.1
  • pytest-cov >=2.12.1
  • python >=3.8.1
  • requests >=2.26.0
  • structlog >=21.5.0
lib/datautils/poetry.lock pypi
  • 139 dependencies
lib/datautils/pyproject.toml pypi
  • boto3 >=1.21.16
  • colorama >=0.4.4
  • gdown >=4.5.2
  • gsheets >=0.6.1
  • owid-catalog *
  • pandas >=1.3.3
  • pydrive2 >=1.15.0
  • python >=3.8.1,<4.0
  • structlog >=21.5.0
  • urllib3 <2
lib/repack/poetry.lock pypi
  • attrs 22.2.0
  • black 22.12.0
  • click 8.1.3
  • colorama 0.4.6
  • exceptiongroup 1.0.4
  • flake8 6.1.0
  • iniconfig 1.1.1
  • isort 5.11.4
  • mccabe 0.7.0
  • mypy-extensions 0.4.3
  • nodeenv 1.7.0
  • numpy 1.24.0
  • packaging 22.0
  • pandas 1.5.2
  • pathspec 0.10.3
  • platformdirs 2.6.0
  • pluggy 1.0.0
  • pycodestyle 2.11.0
  • pyflakes 3.1.0
  • pyright 1.1.285
  • pytest 7.2.0
  • python-dateutil 2.8.2
  • pytz 2022.7
  • setuptools 65.6.3
  • six 1.16.0
  • tomli 2.0.1
  • typing-extensions 4.4.0
lib/repack/pyproject.toml pypi
  • numpy >=1.24.0
  • pandas >=1.5.2
  • python >=3.8.1
lib/walden/poetry.lock pypi
  • 172 dependencies
lib/walden/pyproject.toml pypi
  • PyYAML >=5.4.1 develop
  • argh >=0.26.2 develop
  • black >=22.3.0 develop
  • flake8 >=3.9.2 develop
  • isort >=5.10.1 develop
  • jupyter >=1.0.0 develop
  • jupyter_nbextensions_configurator >=0.4.1 develop
  • jupytext >=1.13.7 develop
  • pyright >=1.1.278 develop
  • pytest >=6.2.4 develop
  • requests-mock >=1.9.3 develop
  • tqdm >=4.62.3 develop
  • types-PyYAML >=6.0.5 develop
  • types-requests >=2.25.2 develop
  • watchdog >=2.1.3 develop
  • beautifulsoup4 >=4.11.1
  • boto3 >=1.17.112
  • click >=8.0.1
  • dataclasses-json >=0.5.4
  • jsonschema >=3.2.0
  • openpyxl >=3.0.9
  • owid-datautils *
  • owid-repack >=0.1.1
  • pandas >=1.3.4
  • pyrsistent >=0.19.1
  • python ^3.8.1
  • requests >=2.26.0
  • rich >=12.1.0
  • sh >=1.14.2
  • structlog >=21.5.0
pyproject.toml pypi
  • PyPDF2 >=2.11.1
  • PyYAML >=5.4.1
  • SPARQLWrapper >=1.8.5
  • Unidecode >=1.3.2
  • bugsnag >=4.2.1
  • click >=8.0.1
  • dvc >=2.58.2,<3.0.0
  • frictionless ^4.40.8
  • fsspec 2022.11.0
  • gitpython ^3.1.30
  • jupyterlab >=3.1.13
  • mysqlclient 2.1.1
  • numpy >=1.22.1
  • odfpy ^1.4.1
  • openai ^0.27.7
  • openpyxl >=3.0.9
  • owid-catalog *
  • owid-datautils *
  • owid-repack *
  • pandas 1.5.2
  • papermill >=2.3.3
  • pdfplumber ^0.9.0
  • pydantic >=1.9.0
  • pyhumps ^3.8.0
  • python ^3.10
  • python-dotenv >=0.19.0
  • questionary ^2.0.0
  • rapidfuzz ^2.13.7
  • regex >=2022.1.18
  • rich >=12.1.0
  • rich-click >=1.5.1
  • ruamel.yaml >=0.17.21
  • sh 1.14.3
  • simplejson >=3.17.6
  • sqlmodel ^0.0.6
  • st-pages ^0.4.4
  • streamlit ^1.26.0
  • streamlit-ace ^0.1.1
  • streamlit-extras ^0.3.0
  • structlog >=21.5.0
  • tenacity >=8.0.1
  • typing-extensions ^4.7.1
  • walden *
  • wbgapi ^1.0.12
  • wikipedia >=1.4.0
  • world-bank-data ^0.1.3
  • xlrd >=2.0.1
dag/environment.yml conda

Score: 8.685246776412487