https://github.com/svenkreiss/pysparkling

A pure Python implementation of Apache Spark's RDD and DStream interfaces.
apache-spark data-processing data-science python
Added: over 1 year ago - Last Synced: 11 months ago - Created: May 09, 2015

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 1454
  • Committers: 10
  • Issues: 20
  • Pull Requests: 80
  • Owner: svenkreiss
  • Stars: 260
  • Forks: 44
  • Packages: 2
  • Downloads: 11,565