reformatters
Reformat weather datasets into Zarr.
https://github.com/dynamical-org/reformatters
Category: Atmosphere
Sub Category: Meteorological Observation and Forecast
Last synced: about 23 hours ago
JSON representation
Repository metadata
Reformat weather datasets into Zarr
- Host: GitHub
- URL: https://github.com/dynamical-org/reformatters
- Owner: dynamical-org
- License: bsd-3-clause
- Created: 2024-09-18T01:28:17.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2026-03-05T01:34:21.000Z (16 days ago)
- Last Synced: 2026-03-05T01:42:04.025Z (16 days ago)
- Language: Python
- Homepage: https://dynamical.org/catalog
- Size: 72.5 MB
- Stars: 26
- Watchers: 5
- Forks: 6
- Open Issues: 24
- Releases: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Agents: AGENTS.md
README.md
dynamical.org reformatters
Reformat weather datasets into zarr.
Browse the datasets produced by this repo at https://dynamical.org/catalog/.
- See AGENTS.md for an overview of the approach and this repository.
- Integrate a new dataset to be reformatted.
- Add a new variable to an existing dataset.
Local development
We use
uvto manage dependencies and python environmentsrufffor linting and formattingtyfor type checkingpytestfor testingprekto automatically lint and format as you git commit
Setup
- Install uv
- Run
uv run prek installto setup the git hooks - If you use VSCode, you may want to install the extensions (ruff) it will recommend when you open this folder
Running locally
uv run main --help- list all datasetsuv run main <DATASET_ID> update-templateuv run main <DATASET_ID> backfill-local <INIT_TIME_END>
Development commands
- Add dependency:
uv add <package> [--dev]. Use--devto add a development only dependency. - Lint:
uv run ruff check [--fix] - Type check:
uv run ty check - Format:
uv run ruff format - Tests:
- Run tests in parallel on all available cores:
uv run pytest - Run tests serially:
uv run pytest -n 0
- Run tests in parallel on all available cores:
Deploying to the cloud
To reformat a large archive we parallelize work across multiple cloud servers.
We use
dockerto package the code and dependencieskubernetesindexed jobs to run work in parallel
Setup
- Install
dockerandkubectl. Make suredockercan be found at/usr/bin/dockerandkubectlat/usr/bin/kubectl. - Setup a docker image repository and export the
DOCKER_REPOSITORYenvironment variable in your local shell. e.g.export DOCKER_REPOSITORY=container.registry/<project-id>/reformatters/main. Follow your registry's instructions to allow your docker to authenticate and push images to the registry. - Setup a kubernetes cluster and configure kubectl to point to your cluster. e.g.
aws eks update-kubeconfig --region <region> --name <cluster-name>,gcloud container clusters get-credentials <cluster-name> --region <region> --project <project>, etc. - Create a kubectl secret containing a single json encoded value to be passed to fsspec
storage_optionsor splatted as keyword arguments to an icechunk storage openerkubectl create secret generic your-destination-storage-options-key --from-literal=contents='{"key": "...", "secret": "..."}'. Seestorage.py.
Owner metadata
- Name: dynamical-org
- Login: dynamical-org
- Email:
- Kind: organization
- Description:
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/156974863?v=4
- Repositories: 1
- Last ynced at: 2024-01-20T21:33:46.041Z
- Profile URL: https://github.com/dynamical-org
Committers metadata
Last synced: 19 days ago
Total Commits: 606
Total Committers: 11
Avg Commits per committer: 55.091
Development Distribution Score (DDS): 0.375
Commits in past year: 419
Committers in past year: 10
Avg Commits per committer in past year: 41.9
Development Distribution Score (DDS) in past year: 0.353
| Name | Commits | |
|---|---|---|
| Alden Keefe Sampson | a****n@g****m | 379 |
| Alex | m****r | 71 |
| Sam Neubardt | s****m@u****h | 56 |
| Margo Crawford | m****o@u****h | 42 |
| Jack Kelly | j****k@O****g | 20 |
| Dan Katz | d****5@g****m | 11 |
| Tony Cannistra | t****a@g****m | 8 |
| Lauren Gulland | l****n@u****h | 7 |
| Copilot | 1****t | 7 |
| dependabot[bot] | 4****] | 4 |
| Marshall | m****t@g****m | 1 |
Committer domains:
Issue and Pull Request metadata
Last synced: 18 days ago
Total issues: 8
Total pull requests: 223
Average time to close issues: N/A
Average time to close pull requests: 1 day
Total issue authors: 4
Total pull request authors: 8
Average comments per issue: 1.88
Average comments per pull request: 0.28
Merged pull request: 155
Bot issues: 0
Bot pull requests: 0
Past year issues: 7
Past year pull requests: 165
Past year average time to close issues: N/A
Past year average time to close pull requests: 1 day
Past year issue authors: 3
Past year pull request authors: 6
Past year average comments per issue: 2.14
Past year average comments per pull request: 0.22
Past year merged pull request: 105
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- aldenks (3)
- JackKelly (3)
- briannapagan (1)
- samn (1)
Top Pull Request Authors
- aldenks (113)
- mosegontar (43)
- samn (36)
- dan-katz (10)
- margocrawf (9)
- acannistra (7)
- JackKelly (3)
- mrshll (2)
Top Issue Labels
- enhancement (1)
Top Pull Request Labels
Score: 6.309918278226517