https://github.com/spectrochempy/spectrochempy

SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
chemistry data-analysis datasets ftir ftir-data-analysis infrared nmr nmr-data nmr-spectroscopy processing python raman raman-spectra raman-spectroscopy spectroscopy uv-vis
Added: over 1 year ago - Last Synced: 11 months ago - Created: May 13, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 1,945
  • Committers: 9
  • Issues: 65
  • Pull Requests: 153
  • Owner: spectrochempy
  • Stars: 106
  • Forks: 22
  • Packages: 1
  • Downloads: 835
https://github.com/roapi/roapi

Create full-fledged APIs for slowly moving datasets without writing a single line of code.
analytics arrow blob-storage cloud-native columnar datafusion datasets delta-lake graphql in-memory-database parquet query query-frontends rest-api rust s3 sql static-datasets
Added: over 1 year ago - Last Synced: 11 months ago - Created: December 11, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Rust
  • Commits: 252
  • Committers: 34
  • Issues: 107
  • Pull Requests: 85
  • Owner: roapi
  • Stars: 3,100
  • Forks: 172
  • Packages: 3
  • Downloads: 2,420
https://github.com/meirelesff/siconvr

An R package to fetch data from Plataforma +Brasil (Siconv)
brazil datasets plataforma-brasil public-data siconv
Added: over 1 year ago - Last Synced: 11 months ago - Created: March 18, 2021

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: R
  • Commits: 31
  • Committers: 3
  • Issues: 6
  • Pull Requests: 7
  • Owner: meirelesff
  • Stars: 10
  • Forks: 2
  • Packages: 1
  • Downloads: 211
https://github.com/HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with a standardized output format
annotation annotation-tool annotations boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Added: over 1 year ago - Last Synced: 11 months ago - Created: June 19, 2019

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: JavaScript
  • Commits: 3,307
  • Committers: 127
  • Issues: 306
  • Pull Requests: 587
  • Owner: HumanSignal
  • Stars: 16,367
  • Forks: 2,018
  • Packages: 0
https://github.com/allenai/smashed

SMASHED is a toolkit for applying transformations to samples in datasets, such as field extraction, tokenization, prompting, batching, and more. It supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.
dataset datasets dict huggingface in-context-learning mappers natural-language-processing nlp pipeline prefix prefix-tuning preprocessing prompting pytorch text torchdata transformer transformers
Added: over 1 year ago - Last Synced: 11 months ago - Created: July 21, 2022

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 145
  • Committers: 6
  • Issues: 1
  • Pull Requests: 65
  • Owner: allenai
  • Stars: 30
  • Forks: 3
  • Packages: 1
  • Downloads: 11,380