A curated list of open technology projects to sustain a stable climate, energy supply, biodiversity and natural resources.

Mesogeos

A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean.
https://github.com/orion-ai-lab/mesogeos

Category: Biosphere
Sub Category: Wildfire

Keywords

datacube deep-learning forests mediterranean wildfire-forecasting zarr

Last synced: about 3 hours ago
JSON representation

Repository metadata

A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean. Deep Learning models for wildfire modeling, e.g. danger forecasting, burned area prediction, etc

README.md

Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean

🆕 2023-09: Accepted at Neurips 2023 Datasets and Benchmarks Track

This is the official code repository of the mesogeos dataset.

Pre-print describing the paper.

This repo contains code for the following:

  • Creation of the Mesogeos datacube.
  • Extraction of machine learning datasets for different tracks.
  • Training and evaluation of machine learning models for these tracks.

Authors: Spyros Kondylatos (1, 2), Ioannis Prapas (1, 2), Gustau Camps-Valls (2), Ioannis Papoutsis (1)

(1) Orion Lab, IAASARS, National Observatory of Athens

(2) Image & Signal Processing Group, Universitat de València

Table of Contents

Data repository

You can access the data using this Drive link. This link contains the mesogeos datacube (mesogeos_cube.zarr/), the extracted datasets for the machine learning tracks (ml_tracks/), as well as notebooks showing how to access the mesogeos cubes (notebooks/).

Accessing the mesogeos cube

The mesogeos cube is publicly accessible in the following places:

Option 1: Access from Google Colab

notebooks/1_Exploring_Mesogeos.ipynb shows how to open Mesogeos directly in google colab
colab_link

Option 2: Download from Google Colab

Rclone may be the best option to download the dataset from google drive. See this issue.

Datacube Generation

Find the code to generate a datacube like mesogeos in datacube_creation.

Machine Learning Tracks

Track A: Wildfire Danger Forecasting

This track defines wildfire danger forecasting as a binary classification problem.

More details in Track A

Track B: Final Burned Area Prediction

This track is about predicting the final burned area of a wildfire given the ignition point and the conditions of the fire drivers at the first day of the fire in a neighborhood around the ignition point.

More details in Track B

Datacube Details

Mesogeos is meant to be used to develop models for wildfire modeling in the Mediterranean.
It contains variables related to the ignition and spread of wildfire for the years 2006 to 2022 at a daily 1km x 1km grid.

The datacube contains the following variables:

Vriables in the cube:

Variable Units Description
aspect ° aspect
burned areas unitless rasterized burned polygons. 0 when no burned area occurs in that cell, 1 if it does for the day of interest
curvature rad curvature
d2m K day's maximum 2 metres dewpoint temperature
dem m elevation
ignition_points hectares rasterized fire ignitions. It contains the final hectares of the burned area resulted from the fire
lai unitless leaf area index
lc_agriculture % fraction of agriculture in the pixel. 1st Jan of each year has the values of the year
lc_forest % fraction of forest in the pixel. 1st Jan of each year has the values of the year
lc_grassland % fraction of grassland in the pixel. 1st Jan of each year has the values of the year
lc_settlement % fraction of settlement in the pixel. 1st Jan of each year has the values of the year
lc_shrubland % fraction of shrubland in the pixel. 1st Jan of each year has the values of the year
lc_sparse_veagetation % fraction of sparse vegetation in the pixel. 1st Jan of each year has the values of the year
lc_water_bodies % fraction of water bodies in the pixel. 1st Jan of each year has the values of the year
lc_wetland % fraction of wetland in the pixel. 1st Jan of each year has the values of the year
lst_day K day's land surface temperature
lst_night K nights' land surface temperature
ndvi unitless normalized difference vegetation index
population people/km^2 population count per year. 1st Jan of each year has the values of the year
rh %/100 day's minimum relative humidity
roads_distance km distance from the nearest road
slope rad slope
smi unitless soil moisture index
sp Pa day's maximum surface pressure
ssrd J/m^2 day's average surface solar radiation downwards
t2m K day's maximum 2 metres temperature
tp m day's total precipitation
wind_speed m/s day's maximum wind speed

An example of some variables for a day in the cube:
image

Datacube Metadata

  • Temporal Extent: (2006-04-01, 2022-09-29)
  • Spatial Extent: (-10.72, 30.07, 36.74, 47.7), i.e. the wider Mediterranean region.
  • Coordinate Reference System: EPSG:4326

DOI

Datacube Citation

Spyros Kondylatos, Ioannis Prapas, Gustau Camps-Valls, & Ioannis Papoutsis. (2023). 
Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean. 
Zenodo. https://doi.org/10.5281/zenodo.7473331

Contributing

We welcome new contributions for new models and new machine learning tracks!

New Model: To contribute a new model for an existing track, your code has to be (i) open, (ii) reproducible (we should be able to easily run your code and get the reported results) and (iii) use the same dataset split defined for the track.
After we verify your results, you get to add your model and name to the leaderboard.
Check the current leaderboards.

Submit a new issue containing a link to your code.

New ML Track: To contribute a new track, submit a new issue.

We recommend at minimum:

  1. a dataset extraction process that samples from mesogeos,
  2. a description of the task,
  3. a baseline model,
  4. appropriate metrics.

License

Creative Commons Attribution v4

Citation

@inproceedings{
kondylatos2023mesogeos,
title={Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean},
author={Spyros Kondylatos and Ioannis Prapas and Gustau Camps-Valls and Ioannis Papoutsis},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
year={2023},
url={https://openreview.net/forum?id=VH1vxapUTs}
}

Acknowledgements

This work has received funding from the European Union’s Horizon 2020 Research and Innovation Projects DeepCube and TREEADS, under Grant Agreement Numbers 101004188 and 101036926353 respectively


Owner metadata


GitHub Events

Total
Last Year

Committers metadata

Last synced: 6 days ago

Total Commits: 54
Total Committers: 3
Avg Commits per committer: 18.0
Development Distribution Score (DDS): 0.407

Commits in past year: 4
Committers in past year: 1
Avg Commits per committer in past year: 4.0
Development Distribution Score (DDS) in past year: 0.0

Name Email Commits
iprapas 2****s 32
skondylatos 7****s 15
Spyros Kondylatos s****s@S****l 7

Committer domains:


Issue and Pull Request metadata

Last synced: 1 day ago

Total issues: 3
Total pull requests: 1
Average time to close issues: 4 months
Average time to close pull requests: 1 minute
Total issue authors: 3
Total pull request authors: 1
Average comments per issue: 2.33
Average comments per pull request: 0.0
Merged pull request: 1
Bot issues: 0
Bot pull requests: 0

Past year issues: 2
Past year pull requests: 0
Past year average time to close issues: 3 months
Past year average time to close pull requests: N/A
Past year issue authors: 2
Past year pull request authors: 0
Past year average comments per issue: 1.5
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/orion-ai-lab/mesogeos

Top Issue Authors

  • Anna0509 (1)
  • farzadazma (1)
  • yhw03 (1)

Top Pull Request Authors

  • iprapas (1)

Top Issue Labels

Top Pull Request Labels

Score: 5.030437921392435