https://github.com/giacbrd/smartpipeline

A framework for rapid development of robust data pipelines following a simple design pattern
data-analysis data-analytics data-mining data-pipelines data-processing data-science dataops design-patterns etl machine-learning mlops pipeline pipeline-framework pipelines reproducibility task-queue workflow
Added: over 1 year ago - Last Synced: 11 months ago - Created: September 03, 2018

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 275
  • Committers: 3
  • Issues: 0
  • Pull Requests: 3
  • Owner: giacbrd
  • Stars: 22
  • Forks: 2
  • Packages: 1
  • Downloads: 56
https://github.com/kfultz07/go-dataframe

A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
data-analysis data-analytics data-processing data-science dataframe go golang pandas
Added: over 1 year ago - Last Synced: 11 months ago - Created: January 03, 2022

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Go
  • Commits: 130
  • Committers: 2
  • Issues: 0
  • Pull Requests: 2
  • Owner: kfultz07
  • Stars: 71
  • Forks: 6
  • Packages: 1
https://github.com/helmholtz-analytics/heat/

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
array-api data-analytics data-processing data-science distributed gpu hpc machine-learning massive-datasets mpi mpi4py multi-gpu multi-node-cluster numpy parallelism python pytorch tensors
Added: over 1 year ago - Last Synced: 11 months ago - Created: May 17, 2018

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 4590
  • Committers: 62
  • Issues: 218
  • Pull Requests: 354