DeepTreeAttention

Hyperspectral Image Classification with Attention Aided CNNs.
https://github.com/weecology/DeepTreeAttention

Category: Biosphere
Sub Category: Forest Remote Sensing

Keywords from Contributors

transforms optimize archiving measur generic compose observation animals conversion projection

Last synced: about 1 hour ago
JSON representation

Repository metadata

Implementation of Hang et al. 2020 "Hyperspectral Image Classification with Attention Aided CNNs" for tree species prediction

Host: GitHub
URL: https://github.com/weecology/DeepTreeAttention
Owner: weecology
License: mit
Created: 2020-06-01T14:14:29.000Z (over 5 years ago)
Default Branch: main
Last Pushed: 2024-03-27T22:46:04.000Z (over 1 year ago)
Last Synced: 2025-12-20T06:52:14.732Z (6 days ago)
Language: Python
Size: 314 MB
Stars: 133
Watchers: 8
Forks: 37
Open Issues: 12
Releases: 6
Metadata Files:
- Readme: README.md
- License: LICENSE

DeepTreeAttention

Tree Species Prediction for the National Ecological Observatory Network (NEON)

Implementation of Hang et al. 2020 Hyperspectral Image Classification with Attention Aided CNNs for tree species prediction.

Model Architecture

Project Organization

├── LICENSE
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
│
├── environment.yml   <- Conda requirements
│
├── setup.py           <- makes project pip installable (pip install -e .) so src can be imported
├── src                <- Source code for use in this project.
│   ├── Models         <- Model Architectures

Workflow

There are three main parts to this project, a 1) data module, a 2) model module, and 3) a trainer module. Usually the data_module is created to hold the train and test split and keep track of data generation reproducibility. Then a model architecture is created and pass to the model module along with the data module. Finally the model module is passed to the trainer.

#1) 
data_module = data.TreeData(csv_file="data/raw/neon_vst_data_2021.csv", regenerate=False, client=client)

#2)
model = <create a pytorch NN.module>
m = main.TreeModel(model=model, bands=data_module.config["bands"], classes=data_module.num_classes,label_dict=data_module.species_label_dict)

#3
trainer = Trainer()
trainer.fit(m, datamodule=data_module)

Pytorch Lightning Data Module (data.TreeData)

This repo contains a pytorch lightning data module for reproducibility. The goal of the project is to make it easy to share with others within our research group, but we welcome contributions from outside the community. While all data is public, it is VERY large (>20TB) and cannot be easily shared. If you want to reproduce this work, you will need to download the majority of NEON's camera, HSI and CHM data and change the paths in the config file. For the 'raw' NEON tree stem data see data/raw/neon_vst_2021.csv. The data module starts from this state, which are x,y locations for each tree. It then performs the following actions as an end-to-end workflow.

Filters the data to represent trees over 3m with sufficient number of training samples
Extract the LiDAR derived canopy height and compares it to the field measured height. Trees that are below the canopy are excluded based on the min_CHM_diff parameter in the config.
Splits the training and test x,y data such that field plots are either in training or test.
For each x,y stem location the crown is predicted by the tree detection algorithm (DeepForest - https://deepforest.readthedocs.io/).
Crops of each tree crown are created and divided into pixel windows for pixel-level prediction.

This workflow does not need to be run on every experiment. If you are satisifed with the current train/test split and data generation process, set regenerate=False

data_module = data.TreeData(csv_file="data/raw/neon_vst_data_2021.csv", regenerate=False)
data_module.setup()

Pytorch Lightning Training Module (data.TreeModel)

Training is handled by the TreeModel class which loads a model from the models folder, reads the config file and runs the training. The evaluation metrics and images are computed and put of the comet dashboard

m = main.TreeModel(model=Hang2020.vanilla_CNN, bands=data_module.config["bands"], classes=data_module.num_classes,label_dict=data_module.species_label_dict)

trainer = Trainer(
    gpus=data_module.config["gpus"],
    fast_dev_run=data_module.config["fast_dev_run"],
    max_epochs=data_module.config["epochs"],
    accelerator=data_module.config["accelerator"],
    logger=comet_logger)
   
trainer.fit(m, datamodule=data_module)

Alive/Dead Filtering

As part of the prediction pipeline, RGB crops are scored as either 'Alive', meanining they have leaves during presumed leaf-on season, or 'Dead', meaning they do not have leaves.
To finetune the resent50 model, see src/models/dead.py. The classified data for the Alive/Dead crops can be found in data/raw/dead_train and dead/raw/dead_test.

Dev Guide

In general, major changes or improvements should be made on a new git branch. Only core improvements should be made on the main branch. If a change leads to higher scores, please create a pull request. Any pull requests are expected to have pytest unit tests (see tests/) that cover major use cases.

Model Architectures

The TreeModel class takes in a create model function

m = main.TreeModel(model=Hang2020.vanilla_CNN)

Any model can be specified provided it follows the following input and output arguments

class myModel(Module):
    """
    Model description
    """
    def __init__(self, bands, classes):
        super(myModel, self).__init__()
        <define model architecture here>

    def forward(self, x):
        <forward method for computing loss goes here>
        class_scores = F.softmax(x)
        
        return class_scores

Extending the model

To create a model that takes in new inputs, I strongly recommend sub-classing the existing TreeData and TreeModel classes. For an example, see the MetadataModel in models/metadata.py

#Subclass of the training model
class MetadataModel(main.TreeModel):
    """Subclass the core model and update the training loop to take two inputs"""
    def __init__(self, model, sites,classes, label_dict, config):
        super(MetadataModel,self).__init__(model=model,classes=classes,label_dict=label_dict, config=config)  
    
    def training_step(self, batch, batch_idx):
        """Train on a loaded dataset
        """
        #allow for empty data if data augmentation is generated
        inputs, y = batch
        images = inputs["HSI"]
        metadata = inputs["site"]
        y_hat = self.model.forward(images, metadata)
        loss = F.cross_entropy(y_hat, y)    
        
        return loss

Getting Started (UF - collaboration)

This section is meant solely for members of the idtrees group who have access to the data.

Fork this repo and install the conda environment.

conda env create -f=environment.yml
conda activate DeepTreeAttention

Update the config.yml

Currently, only members of the ewhite group have permissions to the raw NEON data.

For example:

rgb_sensor_pool: /orange/ewhite/NeonData/*/DP3.30010.001/**/Camera/**/*.tif

This is not a problem, just set

regenerate: False

and it will bypass these steps and use the existing train/test split (e.g. data/processed/train.csv)

You will need to set the correct crop directories

crop_dir: /blue/ewhite/b.weinstein/DeepTreeAttention/crops/

To wherever the crops are saved. This is currently

/orange/idtrees-collab/DeepTreeAttention/crops/

I highly recommend making a comet login. Change

#Comet dashboard
comet_workspace: bw4sz

to your usename and add a .comet.config file to authenticate.

Submit a job

Submit a SLURM job

sbatch SLURM/experiment.sh

Look at the comet repo for results

The metrics tab has the Micro and Macro Accuracy.

Owner metadata

Name: Weecology
Login: weecology
Email:
Kind: organization
Description:
Website: http://weecology.org
Location:
Twitter:
Company:
Icon url: https://avatars.githubusercontent.com/u/1156696?v=4
Repositories: 93
Last ynced at: 2023-03-11T03:45:49.249Z
Profile URL: https://github.com/weecology

GitHub Events

Total

Issues event: 1
Watch event: 12
Issue comment event: 1
Fork event: 2

Last Year

Issues event: 1
Watch event: 9
Issue comment event: 1
Fork event: 1

Committers metadata

Last synced: 4 days ago

Total Commits: 3,067
Total Committers: 5
Avg Commits per committer: 613.4
Development Distribution Score (DDS): 0.482

Commits in past year: 0
Committers in past year: 0
Avg Commits per committer in past year: 0.0
Development Distribution Score (DDS) in past year: 0.0

Name	Email	Commits
Ben Weinstein	b**n@B**l	1588
bw4sz	b**0@g**m	1474
dependabot[bot]	4****]	2
Ethan White (he/him)	e**n@w**g	2
Ben Weinstein	b**n@B**n	1

Committer domains:

bens-mbp.lan: 1
weecology.org: 1

Issue and Pull Request metadata

Last synced: 6 days ago

Total issues: 154
Total pull requests: 23
Average time to close issues: 3 months
Average time to close pull requests: 7 days
Total issue authors: 3
Total pull request authors: 5
Average comments per issue: 0.53
Average comments per pull request: 0.04
Merged pull request: 18
Bot issues: 0
Bot pull requests: 3

Past year issues: 1
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 1
Past year pull request authors: 0
Past year average comments per issue: 1.0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/weecology/DeepTreeAttention

Top Issue Authors

bw4sz (150)
mgwein (2)
SebastianKyle (1)

Top Pull Request Authors

bw4sz (16)
dependabot[bot] (3)
ethanwhite (2)
henrykironde (1)
MarconiS (1)

Top Issue Labels

Top Pull Request Labels

dependencies (3)

Package metadata

Total packages: 2
Total downloads: unknown
Total dependent packages: 0 (may contain duplicates)
Total dependent repositories: 0 (may contain duplicates)
Total versions: 8

proxy.golang.org: github.com/weecology/DeepTreeAttention

Homepage:
Documentation: https://pkg.go.dev/github.com/weecology/DeepTreeAttention#section-documentation
Licenses: mit
Latest release: v0.0.5 (published almost 5 years ago)
Last Synced: 2025-12-23T21:31:32.839Z (2 days ago)
Versions: 4
Dependent Packages: 0
Dependent Repositories: 0
Rankings:
- Dependent packages count: 5.395%
- Average: 5.576%
- Dependent repos count: 5.758%

proxy.golang.org: github.com/weecology/deeptreeattention

Homepage:
Documentation: https://pkg.go.dev/github.com/weecology/deeptreeattention#section-documentation
Licenses: mit
Latest release: v0.0.5 (published almost 5 years ago)
Last Synced: 2025-12-23T21:31:31.964Z (2 days ago)
Versions: 4
Dependent Packages: 0
Dependent Repositories: 0
Rankings:
- Dependent packages count: 5.395%
- Average: 5.576%
- Dependent repos count: 5.758%

Dependencies

.github/workflows/pytest.yml actions

actions/cache v1 composite
actions/checkout v2 composite
conda-incubator/setup-miniconda v2 composite

requirements.txt pypi

PyYAML *
Shapely *
comet_ml *
dask *
dask_jobqueue *
deepforest *
descartes *
distributed *
geopandas *
h5py *
matplotlib *
numpy *
pandas *
pytest *
pytorch_lightning *
rasterio *
rasterstats *
scikit_learn *
setuptools *
skimage *
torch *
torchmetrics *
torchvision *

setup.py pypi

environment.yml conda

bokeh
descartes
h5py
matplotlib
numpydoc
pip
pytest
pytorch
pyyaml
recommonmark
scikit-learn
sphinx
sphinx_rtd_theme
torchvision
twine
yapf

Score: -Infinity

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Sustainable Technology

DeepTreeAttention

Keywords from Contributors

Repository metadata

README.md

DeepTreeAttention

Model Architecture

Project Organization

Workflow

Pytorch Lightning Data Module (data.TreeData)

Pytorch Lightning Training Module (data.TreeModel)

Alive/Dead Filtering

Dev Guide

Model Architectures

Extending the model

Getting Started (UF - collaboration)

Owner metadata

GitHub Events

Total

Last Year

Committers metadata

Committer domains:

Issue and Pull Request metadata

Top Issue Authors

Top Pull Request Authors

Top Issue Labels

Top Pull Request Labels

Package metadata

proxy.golang.org: github.com/weecology/DeepTreeAttention

proxy.golang.org: github.com/weecology/deeptreeattention

Dependencies