https://github.com/noaa-mdl/grib2io-interp
Interpolation component for grib2io interfacing to the NCEPLIBS-ip library
atmospheric-science
data-science
f2py
fortran
grib2
interpolation
interpolation-methods
meteorology
ncep
nceplibs
nceplibs-ip
numpy
weather
weather-data
Added: over 1 year ago - Last Synced: 11 months ago
- Created: June 06, 2023

https://github.com/time-series-machine-learning/tsml-eval
Evaluation tools for time series machine learning algorithms.
benchmarking
data-science
evaluation
machine-learning
python
time-series
Added: over 1 year ago - Last Synced: 11 months ago
- Created: June 30, 2022
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Jupyter Notebook
- Commits: 448
- Committers: 15
- Issues: 13
- Pull Requests: 112
- Owner: time-series-machine-learning
- Stars: 20
- Forks: 8
- Packages: 1
- Downloads: 235

https://github.com/oracle-samples/oci-data-science-ai-samples
This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.
ai
conda
data-science
data-science-notebooks
deep-learning
jupyter-notebook
machine-learning
oci
oracle-cloud-infrastructure
python
Added: over 1 year ago - Last Synced: 11 months ago
- Created: May 06, 2021
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Jupyter Notebook
- Commits: 1116
- Committers: 84
- Issues: 9
- Pull Requests: 158
- Owner: oracle-samples
- Stars: 151
- Forks: 155
- Packages: 1
- Downloads: 46

https://github.com/bbva/mercury-dataschema
Utility package that, given a Pandas DataFrame, uses the DataSchema class to auto-infer feature types and automatically calculate different statistics depending on those types.
analytics
data
data-cleaning
data-processing
data-science
feature-engineering
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 09, 2023

https://github.com/infuseai/piperider
Code review for data in dbt
code-review
continuous-integration
data-exploration
data-observability
data-pipeline
data-profiler
data-profiling
data-quality
data-reliability
data-science
data-testing
data-visualization
dbt
dbt-metrics
eda
exploratory-data-analysis
pull-requests
python
reporting
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 31, 2022

https://github.com/finos/jupyterlab_templates
Support for Jupyter notebook templates in JupyterLab
data-science
dataviz
jupyter
jupyterlab
jupyterlab-extension
machine-learning
notebook
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 17, 2018

https://github.com/olavolav/uniplot
Lightweight plotting to the terminal. 4x resolution via Unicode.
data-analysis
data-science
plot
python
Added: over 1 year ago - Last Synced: 11 months ago
- Created: August 15, 2020

https://github.com/streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
apache-pulsar
apache-spark
batch-processing
data-processing
data-science
flink
spark
spark-sql
stream-processing
structured-streaming
Added: over 1 year ago - Last Synced: 11 months ago
- Created: July 01, 2019
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Scala
- Commits: 189
- Committers: 22
- Issues: 93
- Pull Requests: 142
- Owner: streamnative
- Stars: 109
- Forks: 48
- Packages: 2

https://github.com/picterra/picterra-python
Picterra Python API Client
data-science
earth-observation
geospatial-analysis
geospatial-intelligence
machine-learning
Added: over 1 year ago - Last Synced: 11 months ago
- Created: February 12, 2020

https://github.com/joshuawe/plots_and_graphs
Visualize Machine Learning Metrics
classification
data-science
machine-learning
matplotlib
performance-metrics
performance-visualization
plot
python
regression
visualizer
Added: over 1 year ago - Last Synced: 11 months ago
- Created: October 05, 2023
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Jupyter Notebook
- Commits: 85
- Committers: 3
- Issues: 31
- Pull Requests: 11
- Owner: joshuawe
- Stars: 3
- Forks: 0
- Packages: 1
- Downloads: 29

https://github.com/bytewax/bytewax
Python Stream Processing
data-engineering
data-processing
data-science
dataflow
machine-learning
python
rust
stream-processing
streaming-data
Added: over 1 year ago - Last Synced: 11 months ago
- Created: February 04, 2022

https://github.com/bluebrain/nexus-forge
Building and Using Knowledge Graphs made easy
data-management
data-science
json-ld
knowledge-engineering
knowledge-graph
knowledgegraph
rdf
shacl
Added: over 1 year ago - Last Synced: 11 months ago
- Created: May 25, 2020

https://github.com/insitro/redun
Yet another redundant workflow engine
aws
bioinformatics
data-engineering
data-science
docker
etl
gcp
ml
python
workflow-engine
Added: over 1 year ago - Last Synced: 11 months ago
- Created: November 04, 2021

https://github.com/hariketsheth/carefact---envisioning-farms-for-future
Agriculture is a very significant contributor to the Indian economy. Farmers' most prevalent problem is that they are unable to make informed decisions on which crops are supported in their areas, as well as on market and profit prices, which leads to lower productivity and lower profit margins.
analysis
chatbot
collaborate
data-science
farming
future
github
github-codespaces
github-pages
platform-engineering
twilio
typeform
Added: over 1 year ago - Last Synced: 11 months ago
- Created: April 14, 2023
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: JavaScript
- Commits: 34
- Committers: 1
- Issues: 3
- Pull Requests: 2
- Owner: hariketsheth
- Stars: 6
- Forks: 1
- Packages: 0

https://github.com/hisqkq/projet-de-visualisation-m1
Dash application for producing interactive visualizations of electrical energy data in metropolitan France (excluding Corsica)
dash
dashboard
data-science
data-visualization
database
energy
energy-consumption
mongodb
plotly
Added: over 1 year ago - Last Synced: 11 months ago
- Created: September 26, 2023

https://github.com/firstnet-systems-uk/customer-analysis-tableau-dashboard
Explore Tableau's power in crafting a comprehensive customer analysis dashboard. Learn step-by-step chart creation and formatting for insightful data visualization.
dashboards
data-science
dataanalysis
datascience
datavisualization
tableau
Added: over 1 year ago - Last Synced: 11 months ago
- Created: January 07, 2024
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: HTML
- Commits: 6
- Committers: 2
- Issues: 0
- Pull Requests: 1
- Owner: FirstNet-Systems-UK
- Stars: 1
- Forks: 1
- Packages: 0

https://github.com/lsys/forestplot
A Python package to make publication-ready but customizable coefficient plots.
coefficientplot
data-science
data-visualization
dataviz
forestplot
matplotlib
python
visualization
Added: over 1 year ago - Last Synced: 11 months ago
- Created: July 03, 2022
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Jupyter Notebook
- Commits: 168
- Committers: 3
- Issues: 55
- Pull Requests: 57
- Owner: LSYS
- Stars: 102
- Forks: 9
- Packages: 2
- Downloads: 711

https://github.com/giacbrd/smartpipeline
A framework for rapid development of robust data pipelines following a simple design pattern
data-analysis
data-analytics
data-mining
data-pipelines
data-processing
data-science
dataops
design-patterns
etl
machine-learning
mlops
pipeline
pipeline-framework
pipelines
reproducibility
task-queue
workflow
Added: over 1 year ago - Last Synced: 11 months ago
- Created: September 03, 2018

https://github.com/svenkreiss/pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
apache-spark
data-processing
data-science
python
Added: over 1 year ago - Last Synced: 11 months ago
- Created: May 09, 2015
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 1454
- Committers: 10
- Issues: 20
- Pull Requests: 80
- Owner: svenkreiss
- Stars: 260
- Forks: 44
- Packages: 2
- Downloads: 11,565

https://github.com/kfultz07/go-dataframe
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
data-analysis
data-analytics
data-processing
data-science
dataframe
go
golang
pandas
Added: over 1 year ago - Last Synced: 11 months ago
- Created: January 03, 2022

https://github.com/skuschel/generatorpipeline
Parallelize your data-processing pipelines with just a decorator.
data-processing
data-science
hacktoberfest
python
Added: over 1 year ago - Last Synced: 11 months ago
- Created: November 21, 2019

https://github.com/epigen/mr.pareto
MR. PARETO - Modules & Recipes for Pragmatic Augmentation of Research Efficiency Towards Optimum
automation
bioinformatics
biomedical
data-science
framework
snakemake
workflows
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 28, 2022
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Commits: 37
- Committers: 2
- Issues: 2
- Pull Requests: 0
- Owner: epigen
- Stars: 3
- Forks: 0
- Packages: 0

https://github.com/ccbest/geostructures
A lightweight implementation of shapes drawn across a geo-temporal plane.
data-science
geopython
geospatial
geospatial-analysis
mapping
python
Added: over 1 year ago - Last Synced: 11 months ago
- Created: August 16, 2023

https://github.com/thecoderpinar/earthquake-explorer
Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.
data-analysis
data-science
data-visualization
earthquake-analysis
exploratory-data-analysis
geospatial-analysis
interactive-maps
jupyter-notebook
machine-learning
python
seismic-data
Added: over 1 year ago - Last Synced: 11 months ago
- Created: October 05, 2023
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: HTML
- Commits: 4
- Committers: 1
- Issues: 1
- Pull Requests: 0
- Owner: ThecoderPinar
- Stars: 3
- Forks: 1
- Packages: 0

https://github.com/wenjiedu/pypots
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation, classification, clustering, forecasting, & anomaly detection on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values
classification
clustering
data-mining
data-science
deep-learning
forecasting
healthcare
imputation
incomplete
industrial
interpolation
machine-learning
missing-values
missingness
neural-network
partially-observed-time-series
pytorch
science-research
time-series
time-series-analysis
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 29, 2022

https://github.com/swanhubx/swanlab
SwanLab: track and visualize all the pieces of your machine learning pipeline.
data-science
deep-learning
fastapi
jax
machine-learning
mlops
model-versioning
python
pytorch
tensorboard
tensorflow
tracking
transformers
visualization
Added: over 1 year ago - Last Synced: 11 months ago
- Created: November 24, 2023

https://github.com/tensorwarp/bitfusion
Deep Learning research project for ideas
artificial-intelligence
cpp20
cuda-aware-mpi
cuda-programming
data-science
deep-learning
hpc-systems
inter-process-communication
linear-algebra
mpi
multi-gpu-inference
nvidia
scala-native
Added: over 1 year ago - Last Synced: 11 months ago
- Created: September 06, 2023
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: C++
- Commits: 171
- Committers: 1
- Issues: 0
- Pull Requests: 2
- Owner: TensorWarp
- Stars: 0
- Forks: 0
- Packages: 0

https://github.com/enlite-ai/maze
Maze Applied Reinforcement Learning Framework
applied-machine-learning
automation
data-science
decision-making
deep-learning
distributed
documentation
framework
machine-learning
monitoring
optimization
python
reinforcement-learning
simulation
Added: over 1 year ago - Last Synced: 11 months ago
- Created: February 11, 2021

https://github.com/dimi-lab/dimi-lab.github.io
Data science and Informatics for Multiomics Integration
bioinformatics
data-science
dataanalysis
multiomics
spatialomics
Added: over 1 year ago - Last Synced: 11 months ago
- Created: October 27, 2023

https://github.com/welthungerhilfe/cgm-ml-archive
Child Growth Monitor Machine Learning
ai
computer-vision
data-science
deep-learning
depth-camera
machine-learning
malnutrition
python
sdg
sdgs
tensorflow2
Added: over 1 year ago - Last Synced: 11 months ago
- Created: June 04, 2018
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Jupyter Notebook
- Commits: 724
- Committers: 39
- Issues: 7
- Pull Requests: 93
- Owner: Welthungerhilfe
- Stars: 55
- Forks: 35
- Packages: 0

https://github.com/helmholtz-analytics/heat/
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
array-api
data-analytics
data-processing
data-science
distributed
gpu
hpc
machine-learning
massive-datasets
mpi
mpi4py
multi-gpu
multi-node-cluster
numpy
parallelism
python
pytorch
tensors
Added: over 1 year ago - Last Synced: 11 months ago
- Created: May 17, 2018
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 4590
- Committers: 62
- Issues: 218
- Pull Requests: 354
- Owner: helmholtz-analytics
- Stars: 193
- Forks: 56
- Packages: 1

https://github.com/collab-uniba/pynblint
Pynblint is a linter for Python Jupyter notebooks.
best-practices
computational-notebooks
data-science
guidelines
jupyter-notebook
linter
machine-learning
python
quality-assurance
static-analysis
static-analyzer
Added: over 1 year ago - Last Synced: 11 months ago
- Created: March 20, 2021
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 312
- Committers: 7
- Issues: 49
- Pull Requests: 47
- Owner: collab-uniba
- Stars: 35
- Forks: 1
- Packages: 1
- Downloads: 102

https://github.com/mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
bagging
data-science
dataflow-programming
ensemble-learning
machine-learning
mlr3
pipelines
preprocessing
r
r-package
stacking
Added: over 1 year ago - Last Synced: 11 months ago
- Created: October 10, 2017

https://github.com/gradio-app/gradio
Build and share delightful machine learning apps, all in Python. Star to support our work!
data-analysis
data-science
data-visualization
deep-learning
deploy
gradio
gradio-interface
hacktoberfest
interface
machine-learning
models
python
python-notebook
ui
ui-components
Added: 11 months ago - Last Synced: 11 months ago
- Created: December 19, 2018
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 4204
- Committers: 153
- Issues: 3264
- Pull Requests: 2306
- Owner: gradio-app
- Stars: 29581
- Forks: 2196
- Packages: 17
- Downloads: 11,141,053

https://github.com/smups/rustronomy
rustronomy - an astronomy data analysis toolkit written in Rust
astronomy
data-science
physics
rust
rust-lang
rust-library
science
Added: about 1 year ago - Last Synced: 11 months ago
- Created: November 13, 2021
