https://github.com/streamnative/pulsar-spark

Spark Connector to read and write with Pulsar
apache-pulsar apache-spark batch-processing data-processing data-science flink spark spark-sql stream-processing structured-streaming
Added: over 1 year ago - Last Synced: about 1 year ago - Created: July 01, 2019

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Scala
  • Commits: 189
  • Committers: 22
  • Issues: 93
  • Pull Requests: 142
https://github.com/apache/incubator-wayang

Apache Wayang(incubating) is the first cross-platform data processing system.
apache big-data cross-platform data-management-platform data-processing distributed-system hadoop java jdbc middleware open-source performance scala spark
Added: over 1 year ago - Last Synced: about 1 year ago - Created: December 16, 2020

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Java
  • Commits: 1685
  • Committers: 45
  • Issues: 491
  • Pull Requests: 454
  • Owner: apache
  • Stars: 170
  • Forks: 71
  • Packages: 38
https://github.com/absaoss/spline

Data Lineage Tracking And Visualization Solution
bigdata hadoop lineage scala spark tracking visualization
Added: over 1 year ago - Last Synced: about 1 year ago - Created: May 30, 2017

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Scala
  • Commits: 1091
  • Committers: 60
  • Issues: 506
  • Pull Requests: 388
  • Owner: AbsaOSS
  • Stars: 578
  • Forks: 152
  • Packages: 32