https://github.com/spectrochempy/spectrochempy
SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
chemistry
data-analysis
datasets
ftir
ftir-data-analysis
infrared
nmr
nmr-data
nmr-spectroscopy
processing
python
raman
raman-spectra
raman-spectroscopy
spectroscopy
uv-vis
Added: over 1 year ago - Last Synced: 11 months ago
- Created: May 13, 2020
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 1945
- Committers: 9
- Issues: 65
- Pull Requests: 153
- Owner: spectrochempy
- Stars: 106
- Forks: 22
- Packages: 1
- Downloads: 835

https://github.com/roapi/roapi
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
analytics
arrow
blob-storage
cloud-native
columnar
datafusion
datasets
delta-lake
graphql
in-memory-database
parquet
query
query-frontends
rest-api
rust
s3
sql
static-datasets
Added: over 1 year ago - Last Synced: 11 months ago
- Created: December 11, 2020

https://github.com/meirelesff/siconvr
An R package to fetch data from Plataforma +Brasil (Siconv)
brazil
datasets
plataforma-brasil
public-data
siconv
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 18, 2021
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: R
- Commits: 31
- Committers: 3
- Issues: 6
- Pull Requests: 7
- Owner: meirelesff
- Stars: 10
- Forks: 2
- Packages: 1
- Downloads: 211

https://github.com/HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
annotation
annotation-tool
annotations
boundingbox
computer-vision
data-labeling
dataset
datasets
deep-learning
image-annotation
image-classification
image-labeling
image-labelling-tool
label-studio
labeling
labeling-tool
mlops
semantic-segmentation
text-annotation
yolo
Added: over 1 year ago - Last Synced: 11 months ago
- Created: June 19, 2019
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: JavaScript
- Commits: 3307
- Committers: 127
- Issues: 306
- Pull Requests: 587
- Owner: HumanSignal
- Stars: 16367
- Forks: 2018
- Packages: 0

https://github.com/allenai/smashed
SMASHED is a toolkit for applying transformations to samples in datasets, such as field extraction, tokenization, prompting, batching, and more. It supports datasets from Hugging Face, torchdata iterables, or simple lists of dictionaries.
dataset
datasets
dict
huggingface
in-context-learning
mappers
natural-language-processing
nlp
pipeline
prefix
prefix-tuning
preprocessing
prompting
pytorch
text
torchdata
transformer
transformers
Added: over 1 year ago - Last Synced: 11 months ago
- Created: July 21, 2022
