Mesogeos
A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean.
https://github.com/orion-ai-lab/mesogeos
Category: Biosphere
Sub Category: Wildfire
Keywords
datacube deep-learning forests mediterranean wildfire-forecasting zarr
Last synced: about 3 hours ago
JSON representation
Repository metadata
A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean. Deep Learning models for wildfire modeling, e.g. danger forecasting, burned area prediction, etc
- Host: GitHub
- URL: https://github.com/orion-ai-lab/mesogeos
- Owner: Orion-AI-Lab
- Created: 2022-10-12T12:48:08.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-01-13T14:27:37.000Z (3 months ago)
- Last Synced: 2025-04-17T20:37:06.895Z (10 days ago)
- Topics: datacube, deep-learning, forests, mediterranean, wildfire-forecasting, zarr
- Language: Jupyter Notebook
- Homepage: https://orion-ai-lab.github.io/mesogeos/
- Size: 12.4 MB
- Stars: 50
- Watchers: 3
- Forks: 11
- Open Issues: 1
- Releases: 3
-
Metadata Files:
- Readme: README.md
README.md
Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean
🆕 2023-09: Accepted at Neurips 2023 Datasets and Benchmarks Track
This is the official code repository of the mesogeos dataset.
Pre-print describing the paper.
This repo contains code for the following:
- Creation of the Mesogeos datacube.
- Extraction of machine learning datasets for different tracks.
- Training and evaluation of machine learning models for these tracks.
Authors: Spyros Kondylatos (1, 2), Ioannis Prapas (1, 2), Gustau Camps-Valls (2), Ioannis Papoutsis (1)
(1) Orion Lab, IAASARS, National Observatory of Athens
(2) Image & Signal Processing Group, Universitat de València
Table of Contents
- Downloading the data
- Datacube Generation
- Machine Learning Tracks
- Track A: Wildfire Danger Forecasting
- Track B: Final Burned Area Prediction
- Contributing
- Datacube Details
- Citation
- License
- Acknowledgements
Data repository
You can access the data using this Drive link. This link contains the mesogeos datacube (mesogeos_cube.zarr/
), the extracted datasets for the machine learning tracks (ml_tracks/
), as well as notebooks showing how to access the mesogeos cubes (notebooks/
).
Accessing the mesogeos cube
The mesogeos cube is publicly accessible in the following places:
-
Google Drive folder: https://drive.google.com/drive/folders/1aRXQXVvw6hz0eYgtJDoixjPQO-_bRK z9
-
S3 Storage in OVH(This option isn't supported anymore due to supporting project termination)
Option 1: Access from Google Colab
notebooks/1_Exploring_Mesogeos.ipynb shows how to open Mesogeos directly in google colab
Option 2: Download from Google Colab
Rclone may be the best option to download the dataset from google drive. See this issue.
Datacube Generation
Find the code to generate a datacube like mesogeos in datacube_creation.
Machine Learning Tracks
Track A: Wildfire Danger Forecasting
This track defines wildfire danger forecasting as a binary classification problem.
More details in Track A
Track B: Final Burned Area Prediction
This track is about predicting the final burned area of a wildfire given the ignition point and the conditions of the fire drivers at the first day of the fire in a neighborhood around the ignition point.
More details in Track B
Datacube Details
Mesogeos is meant to be used to develop models for wildfire modeling in the Mediterranean.
It contains variables related to the ignition and spread of wildfire for the years 2006 to 2022 at a daily 1km x 1km grid.
The datacube contains the following variables:
- satellite data from MODIS (Land Surface Temperature (https://lpdaac.usgs.gov/products/mod11a1v061/), Normalized Vegetation Index (https://lpdaac.usgs.gov/products/mod13a2v061/), Leaf Area Index (https://lpdaac.usgs.gov/products/mod15a2hv061/))
- weather variables from ERA5-Land (max daily temperature, max daily dewpoint temperature, min daily relative humidity,
max daily wind speed, max daily surface pressure, mean daily surface solar radiation downwards) (https://cds.climate.copernicus.eu/cdsapp#!/dataset/10.24381/cds.e2161bac?tab=overview) - soil moisture index from JRC European Drought Observatory (https://edo.jrc.ec.europa.eu/edov2/home.static.html)
- population count (https://hub.worldpop.org/geodata/listing?id=64) & distance to roads (https://hub.worldpop.org/geodata/listing?id=33) from worldpop.org
- land cover from Copernicus Climate Change Service (https://cds.climate.copernicus.eu/cdsapp#!/dataset/satellite-land-cover?tab=overview)
- elevation, aspect, slope and curvature from Copernicus EU-DEM (https://land.copernicus.eu/imagery-in-situ/eu-dem/eu-dem-v1.1?tab=download)
- burned areas and ignition points from EFFIS (https://effis.jrc.ec.europa.eu/applications/data-and-services)
Vriables in the cube:
Variable | Units | Description |
---|---|---|
aspect | ° | aspect |
burned areas | unitless | rasterized burned polygons. 0 when no burned area occurs in that cell, 1 if it does for the day of interest |
curvature | rad | curvature |
d2m | K | day's maximum 2 metres dewpoint temperature |
dem | m | elevation |
ignition_points | hectares | rasterized fire ignitions. It contains the final hectares of the burned area resulted from the fire |
lai | unitless | leaf area index |
lc_agriculture | % | fraction of agriculture in the pixel. 1st Jan of each year has the values of the year |
lc_forest | % | fraction of forest in the pixel. 1st Jan of each year has the values of the year |
lc_grassland | % | fraction of grassland in the pixel. 1st Jan of each year has the values of the year |
lc_settlement | % | fraction of settlement in the pixel. 1st Jan of each year has the values of the year |
lc_shrubland | % | fraction of shrubland in the pixel. 1st Jan of each year has the values of the year |
lc_sparse_veagetation | % | fraction of sparse vegetation in the pixel. 1st Jan of each year has the values of the year |
lc_water_bodies | % | fraction of water bodies in the pixel. 1st Jan of each year has the values of the year |
lc_wetland | % | fraction of wetland in the pixel. 1st Jan of each year has the values of the year |
lst_day | K | day's land surface temperature |
lst_night | K | nights' land surface temperature |
ndvi | unitless | normalized difference vegetation index |
population | people/km^2 | population count per year. 1st Jan of each year has the values of the year |
rh | %/100 | day's minimum relative humidity |
roads_distance | km | distance from the nearest road |
slope | rad | slope |
smi | unitless | soil moisture index |
sp | Pa | day's maximum surface pressure |
ssrd | J/m^2 | day's average surface solar radiation downwards |
t2m | K | day's maximum 2 metres temperature |
tp | m | day's total precipitation |
wind_speed | m/s | day's maximum wind speed |
An example of some variables for a day in the cube:
Datacube Metadata
- Temporal Extent:
(2006-04-01, 2022-09-29)
- Spatial Extent:
(-10.72, 30.07, 36.74, 47.7)
, i.e. the wider Mediterranean region. - Coordinate Reference System:
EPSG:4326
Datacube Citation
Spyros Kondylatos, Ioannis Prapas, Gustau Camps-Valls, & Ioannis Papoutsis. (2023).
Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean.
Zenodo. https://doi.org/10.5281/zenodo.7473331
Contributing
We welcome new contributions for new models and new machine learning tracks!
New Model: To contribute a new model for an existing track, your code has to be (i) open, (ii) reproducible (we should be able to easily run your code and get the reported results) and (iii) use the same dataset split defined for the track.
After we verify your results, you get to add your model and name to the leaderboard.
Check the current leaderboards.
Submit a new issue containing a link to your code.
New ML Track: To contribute a new track, submit a new issue.
We recommend at minimum:
- a dataset extraction process that samples from mesogeos,
- a description of the task,
- a baseline model,
- appropriate metrics.
License
Creative Commons Attribution v4
Citation
@inproceedings{
kondylatos2023mesogeos,
title={Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean},
author={Spyros Kondylatos and Ioannis Prapas and Gustau Camps-Valls and Ioannis Papoutsis},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
year={2023},
url={https://openreview.net/forum?id=VH1vxapUTs}
}
Acknowledgements
This work has received funding from the European Union’s Horizon 2020 Research and Innovation Projects DeepCube and TREEADS, under Grant Agreement Numbers 101004188 and 101036926353 respectively
Owner metadata
- Name: Orion Lab
- Login: Orion-AI-Lab
- Email: [email protected]
- Kind: organization
- Description: Orion Lab research group: Deep Learning in Earth Observation at the National Observatory of Athens
- Website:
- Location: Greece
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/94176283?v=4
- Repositories: 5
- Last ynced at: 2023-03-10T08:16:05.630Z
- Profile URL: https://github.com/Orion-AI-Lab
GitHub Events
Total
- Issues event: 2
- Watch event: 7
- Issue comment event: 1
- Push event: 2
- Fork event: 2
Last Year
- Issues event: 2
- Watch event: 7
- Issue comment event: 1
- Push event: 2
- Fork event: 2
Committers metadata
Last synced: 6 days ago
Total Commits: 54
Total Committers: 3
Avg Commits per committer: 18.0
Development Distribution Score (DDS): 0.407
Commits in past year: 4
Committers in past year: 1
Avg Commits per committer in past year: 4.0
Development Distribution Score (DDS) in past year: 0.0
Name | Commits | |
---|---|---|
iprapas | 2****s | 32 |
skondylatos | 7****s | 15 |
Spyros Kondylatos | s****s@S****l | 7 |
Committer domains:
Issue and Pull Request metadata
Last synced: 1 day ago
Total issues: 3
Total pull requests: 1
Average time to close issues: 4 months
Average time to close pull requests: 1 minute
Total issue authors: 3
Total pull request authors: 1
Average comments per issue: 2.33
Average comments per pull request: 0.0
Merged pull request: 1
Bot issues: 0
Bot pull requests: 0
Past year issues: 2
Past year pull requests: 0
Past year average time to close issues: 3 months
Past year average time to close pull requests: N/A
Past year issue authors: 2
Past year pull request authors: 0
Past year average comments per issue: 1.5
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- Anna0509 (1)
- farzadazma (1)
- yhw03 (1)
Top Pull Request Authors
- iprapas (1)
Top Issue Labels
Top Pull Request Labels
Score: 5.030437921392435