Zarr

Provides an efficient, scalable, and flexible way to store and access large, multi-dimensional arrays, the core data format used in climate models and observational datasets.
https://github.com/zarr-developers/zarr-python

Category: Climate Change
Sub Category: Climate Data Standards

Keywords

compressed ndimensional-arrays python zarr

Keywords from Contributors

pydata closember data-catalog distributed-computing notebook qt spec-0 cython alignment flexible

Last synced: about 14 hours ago
JSON representation

Repository metadata

An implementation of chunked, compressed, N-dimensional arrays for Python.

README.md

Zarr

What is it?

Zarr is a Python package providing an implementation of compressed, chunked, N-dimensional arrays, designed for use in parallel computing. See the documentation for more information.

Main Features

  • Create N-dimensional arrays with any NumPy dtype.
  • Chunk arrays along any dimension.
  • Compress and/or filter chunks using any NumCodecs codec.
  • Store arrays in memory, on disk, inside a zip file, on S3, etc...
  • Read an array concurrently from multiple threads or processes.
  • Write to an array concurrently from multiple threads or processes.
  • Organize arrays into hierarchies via groups.

Where to get it

Zarr can be installed from PyPI using pip:

pip install zarr

or via conda:

conda install -c conda-forge zarr

For more details, including how to install from source, see the installation documentation.


Owner metadata


GitHub Events

Total
Last Year

Committers metadata

Last synced: 4 days ago

Total Commits: 2,353
Total Committers: 125
Avg Commits per committer: 18.824
Development Distribution Score (DDS): 0.644

Commits in past year: 493
Committers in past year: 43
Avg Commits per committer in past year: 11.465
Development Distribution Score (DDS) in past year: 0.828

Name Email Commits
Alistair Miles a****o@g****m 838
dependabot[bot] 4****] 256
John Kirkham k****j@j****g 144
Joe Hamman j****e@e****o 123
Dimitri Papadopoulos Orfanos 3****s 122
David Stansby d****y@g****m 116
Davis Bennett d****t@g****m 74
Josh Moore j****e@d****k 72
Martin Durant m****t@u****a 65
Matthias Bussonnier b****s@g****m 53
pre-commit-ci[bot] 6****] 46
Tom Augspurger t****r@g****m 34
Norman Rzepka c****e@n****m 28
Deepak Cherian d****n 28
Sanket Verma s****5@g****m 20
Hannes Spitz 4****1 19
Andrew Fulton a****9@g****m 17
Tarik Onalan o****t 17
James Bourbeau j****u 16
Max Jones 1****s 15
Gregory Lee g****7@g****m 15
Ryan Abernathey r****y@g****m 15
Joseph Hamman j****n@u****u 13
Mads R. B. Kristensen m****k@g****m 12
Zain Patel 3****2 11
Saransh Chopra s****1@g****m 9
Nathan Zimmerman n****n@g****m 8
Matthew Rocklin m****n@g****m 8
AWA BRANDON AWA 5****N 8
Shikhar Goenka s****a@s****m 6
and 95 more...

Committer domains:


Issue and Pull Request metadata

Last synced: 1 day ago

Total issues: 774
Total pull requests: 1,148
Average time to close issues: 8 months
Average time to close pull requests: about 2 months
Total issue authors: 209
Total pull request authors: 119
Average comments per issue: 3.96
Average comments per pull request: 2.74
Merged pull request: 809
Bot issues: 7
Bot pull requests: 244

Past year issues: 460
Past year pull requests: 678
Past year average time to close issues: about 1 month
Past year average time to close pull requests: 9 days
Past year issue authors: 107
Past year pull request authors: 64
Past year average comments per issue: 3.43
Past year average comments per pull request: 2.28
Past year merged pull request: 473
Past year bot issues: 6
Past year bot pull requests: 94

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/zarr-developers/zarr-python

Top Issue Authors

  • d-v-b (124)
  • jhamman (102)
  • dstansby (56)
  • TomAugspurger (28)
  • rabernat (24)
  • jakirkham (22)
  • dcherian (16)
  • ilan-gold (14)
  • alimanfoo (13)
  • joshmoore (13)
  • MSanKeys963 (13)
  • tomwhite (11)
  • QuLogic (10)
  • TomNicholas (9)
  • DimitriPapadopoulos (8)

Top Pull Request Authors

  • dependabot[bot] (200)
  • dstansby (165)
  • jhamman (148)
  • DimitriPapadopoulos (111)
  • d-v-b (98)
  • pre-commit-ci[bot] (44)
  • dcherian (37)
  • TomAugspurger (32)
  • normanrz (31)
  • brokkoli71 (26)
  • maxrjones (23)
  • MSanKeys963 (20)
  • joshmoore (15)
  • rabernat (13)
  • moradology (12)

Top Issue Labels

  • bug (296)
  • V3 (101)
  • enhancement (87)
  • documentation (69)
  • help wanted (68)
  • V2 (37)
  • good-first-issue (28)
  • performance (18)
  • tests (18)
  • design discussion (8)
  • maintenance (5)
  • types (4)
  • chore (3)
  • platform-specific (1)
  • dependencies (1)
  • N5 (1)

Top Pull Request Labels

  • needs release notes (211)
  • dependencies (200)
  • V3 (159)
  • python (135)
  • documentation (51)
  • github_actions (49)
  • types (25)
  • V2 (22)
  • stale (16)
  • downstream (5)
  • tests (4)
  • help wanted (3)
  • bug (3)
  • enhancement (2)
  • maintenance (2)
  • performance (2)
  • release notes done (1)
  • good-first-issue (1)
  • platform-specific (1)

Package metadata

pypi.org: zarr

An implementation of chunked, compressed, N-dimensional arrays for Python

  • Homepage: https://github.com/zarr-developers/zarr-python
  • Documentation: https://zarr.readthedocs.io/
  • Licenses: MIT License
  • Latest release: 3.0.8 (published 27 days ago)
  • Last Synced: 2025-06-14T21:00:56.377Z (about 20 hours ago)
  • Versions: 106
  • Dependent Packages: 450
  • Dependent Repositories: 626
  • Downloads: 1,379,048 Last month
  • Docker Downloads: 3,817,058
  • Rankings:
    • Dependent packages count: 0.071%
    • Downloads: 0.508%
    • Dependent repos count: 0.557%
    • Docker downloads count: 0.685%
    • Average: 1.17%
    • Stargazers count: 1.869%
    • Forks count: 3.332%
  • Maintainers (4)
conda-forge.org: zarr

  • Homepage: https://github.com/zarr-developers/zarr-python
  • Licenses: MIT
  • Latest release: 2.13.3 (published over 2 years ago)
  • Last Synced: 2025-06-14T02:32:11.241Z (1 day ago)
  • Versions: 33
  • Dependent Packages: 69
  • Dependent Repositories: 1,165
  • Rankings:
    • Dependent repos count: 0.634%
    • Dependent packages count: 1.138%
    • Average: 6.444%
    • Stargazers count: 11.929%
    • Forks count: 12.074%
proxy.golang.org: github.com/zarr-developers/zarr-python

anaconda.org: zarr

Zarr is a format for the storage of chunked, compressed, N-dimensional arrays. These documents describe the Zarr format and its Python implementation.

  • Homepage: https://github.com/zarr-developers/zarr-python
  • Licenses: MIT
  • Latest release: 3.0.5 (published 3 months ago)
  • Last Synced: 2025-06-14T02:32:20.008Z (1 day ago)
  • Versions: 11
  • Dependent Packages: 2
  • Dependent Repositories: 1,165
  • Rankings:
    • Dependent repos count: 3.895%
    • Dependent packages count: 13.462%
    • Average: 15.455%
    • Forks count: 22.11%
    • Stargazers count: 22.355%

Score: 27.953483532429857