TACO
Trash Annotations in Context Dataset Toolkit.
https://github.com/pedropro/TACO
Category: Industrial Ecology
Sub Category: Circular Economy and Waste
Keywords
dataset deep-learning garbage litter mask-rcnn object-detection trash
Last synced: about 2 hours ago
JSON representation
Repository metadata
🌮 Trash Annotations in Context Dataset Toolkit
- Host: GitHub
- URL: https://github.com/pedropro/TACO
- Owner: pedropro
- License: mit
- Created: 2019-06-08T22:19:04.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2024-06-16T18:57:16.000Z (10 months ago)
- Last Synced: 2025-04-20T09:03:39.500Z (7 days ago)
- Topics: dataset, deep-learning, garbage, litter, mask-rcnn, object-detection, trash
- Language: Jupyter Notebook
- Homepage: http://tacodataset.org
- Size: 98 MB
- Stars: 639
- Watchers: 26
- Forks: 212
- Open Issues: 39
- Releases: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
README.md
TACO is a growing image dataset of waste in the wild. It contains images of litter taken under
diverse environments: woods, roads and beaches. These images are manually labeled and segmented
according to a hierarchical taxonomy to train and evaluate object detection algorithms. Currently,
images are hosted on Flickr and we have a server that is collecting more images and
annotations @ tacodataset.org
For convenience, annotations are provided in COCO format. Check the metadata here:
http://cocodataset.org/#format-data
TACO is still relatively small, but it is growing. Stay tuned!
Publications
For more details check our paper: https://arxiv.org/abs/2003.06975
If you use this dataset and API in a publication, please cite us using: Â
@article{taco2020,
title={TACO: Trash Annotations in Context for Litter Detection},
author={Pedro F Proença and Pedro Simões},
journal={arXiv preprint arXiv:2003.06975},
year={2020}
}
News
December 20, 2019 - Added more 785 images and 2642 litter segmentations.
November 20, 2019 - TACO is officially open for new annotations: http://tacodataset.org/annotate
Getting started
Requirements
To install the required python packages simply type
pip3 install -r requirements.txt
Additionaly, to use demo.pynb
, you will also need coco python api. You can get this using
pip3 install git+https://github.com/philferriere/cocoapi.git#subdirectory=PythonAPI
Download
To download the dataset images simply issue
python3 download.py
Our API contains a jupyter notebook demo.pynb
to inspect the dataset and visualize annotations.
Unlabeled data
A list of URLs for both unlabeled and labeled images is now also provided in data/all_image_urls.csv
.
Each image contains one URL for each original image (second column) and one URL for a VGA-resized version (first column)
for images hosted by Flickr. If you decide to annotate these images using other tools, please make them public and contact us so we can keep track.
Unofficial data
Annotations submitted via our website are added weekly to data/annotations_unofficial.json
. These have not yet been been reviewed by us -- some may be inaccurate or have poor segmentations.
You can use the same command to download the respective images:
python3 download.py --dataset_path ./data/annotations_unofficial.json
Trash Detection
The implementation of Mask R-CNN by Matterport is included in /detector
with a few modifications. Requirements are the same. Before using this, the dataset needs to be split. You can either donwload our weights and splits or generate these from scratch using the split_dataset.py
script to generate
N random train, val, test subsets. For example, run this inside the directory detector
:
python3 split_dataset.py --dataset_dir ../data
For further usage instructions, check detector/detector.py
.
As you can see here, most of the original classes of TACO have very few annotations, therefore these must be either left out or merged together. Depending on the problem, detector/taco_config
contains several class maps to target classes, which maintain the most dominant classes, e.g., Can, Bottles and Plastic bags. Feel free to make your own classes.
Owner metadata
- Name: Pedro F. Proença
- Login: pedropro
- Email:
- Kind: user
- Description:
- Website: https://pedropro.github.io/
- Location: Los Angeles
- Twitter:
- Company: NASA - JPL
- Icon url: https://avatars.githubusercontent.com/u/10153872?u=396916742c81f37c878f27df2a6b300dce4c1fd6&v=4
- Repositories: 9
- Last ynced at: 2024-06-11T15:35:13.295Z
- Profile URL: https://github.com/pedropro
GitHub Events
Total
- Watch event: 50
- Issue comment event: 2
- Fork event: 12
Last Year
- Watch event: 50
- Issue comment event: 2
- Fork event: 12
Committers metadata
Last synced: 6 days ago
Total Commits: 185
Total Committers: 3
Avg Commits per committer: 61.667
Development Distribution Score (DDS): 0.022
Commits in past year: 1
Committers in past year: 1
Avg Commits per committer in past year: 1.0
Development Distribution Score (DDS) in past year: 0.0
Name | Commits | |
---|---|---|
pedropro | p****a@g****m | 181 |
BeardMix | m****t@g****m | 3 |
cvwiz | p****a@j****v | 1 |
Committer domains:
- jpl.nasa.gov: 1
Issue and Pull Request metadata
Last synced: 1 day ago
Total issues: 48
Total pull requests: 9
Average time to close issues: about 1 month
Average time to close pull requests: 2 days
Total issue authors: 44
Total pull request authors: 9
Average comments per issue: 1.52
Average comments per pull request: 0.44
Merged pull request: 1
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 1
Past year average time to close issues: N/A
Past year average time to close pull requests: 1 minute
Past year issue authors: 0
Past year pull request authors: 1
Past year average comments per issue: 0
Past year average comments per pull request: 1.0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- rgaufman (2)
- ysig (2)
- JTDeve (2)
- pedropro (2)
- capjamesg (1)
- James0730 (1)
- shier1 (1)
- tjiagoM (1)
- StefaanJoos (1)
- reshmaram2000 (1)
- kegsay (1)
- LorenzoMonti (1)
- Zesky665 (1)
- brianchap (1)
- bernardo-dev (1)
Top Pull Request Authors
- DomMcOyle (1)
- xeviknal (1)
- hern4ndes (1)
- ShivamShrirao (1)
- joshdabosh (1)
- psimoesSsimoes (1)
- phideltaee (1)
- Beardmix (1)
- CleanPegasus (1)
Top Issue Labels
- duplicate (1)
Top Pull Request Labels
Dependencies
- Cython *
- graphviz *
- jupyter *
- matplotlib *
- numpy *
- pandas *
- pillow *
- requests *
- seaborn *
Score: 7.617759576608505