PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.

Statistics on incubator-predictionio

Number of watchers on Github 11124
Number of open issues 71
Average time to close an issue 13 days
Main language Scala
Average time to merge a PR 7 days
Open pull requests 41+
Closed pull requests 129+
Last commit about 1 month ago
Repo Created about 5 years ago
Repo Last Updated about 1 month ago
Size 34 MB
Homepage https://predictio...
Organization / Author apache
Latest Release v0.9.6

Apache PredictionIO


Apache PredictionIO is an open source machine learning framework for developers, data scientists, and end users. It supports event collection, deployment of algorithms, evaluation, and querying of predictive results via REST APIs. It is built on scalable open source services such as Hadoop, HBase (and other databases), Elasticsearch, and Spark, and it implements what is called a Lambda Architecture.
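As a rough illustration of the event collection and REST querying described above, the following minimal sketch builds (but does not send) the two kinds of HTTP requests involved. The endpoint paths `/events.json` and `/queries.json`, the `accessKey` query parameter, the ports, and the payload fields are assumptions based on a typical PredictionIO setup and are not stated on this page; the helper names `build_event_request` and `build_query_request` are hypothetical, and the query body in particular depends on the engine template in use.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_event_request(base_url, access_key, user_id, item_id):
    """Build a POST request recording that a user viewed an item.

    Assumed: an event server reachable at base_url that accepts JSON
    events at /events.json and authenticates via an accessKey parameter.
    """
    payload = {
        "event": "view",
        "entityType": "user",
        "entityId": user_id,
        "targetEntityType": "item",
        "targetEntityId": item_id,
    }
    url = f"{base_url}/events.json?{urlencode({'accessKey': access_key})}"
    return Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_query_request(base_url, user_id, num):
    """Build a POST request asking a deployed engine for predictions.

    Assumed: a deployed engine at base_url that accepts JSON queries at
    /queries.json; the field names here suit a recommendation-style engine.
    """
    payload = {"user": user_id, "num": num}
    return Request(
        f"{base_url}/queries.json",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    event_req = build_event_request("https://localhost:7070", "MY_KEY", "u1", "i1")
    query_req = build_query_request("https://localhost:8000", "u1", 4)
    print(event_req.get_method(), event_req.full_url)
    print(query_req.get_method(), query_req.full_url)
```

To actually send a request, pass it to `urllib.request.urlopen`. Keeping construction separate from sending makes it easy to inspect the URL and payload before pointing the code at a real server.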

To get started, check out!

A few installation options are available.

Quick Start

Bugs and Feature Requests

Use Apache JIRA to report bugs or request new features.


Documentation, included in this repo in the docs/manual directory, is built with Middleman and publicly hosted at

Interested in helping with our documentation? Read Contributing Documentation.


Keep track of development and community news.


Read the Contribute Code page.

You can also list your projects on the Community Project page.


Apache PredictionIO is licensed under the Apache License 2.0.

incubator-predictionio open issues
  • over 1 year [ERROR] [ESSequences] [pio_meta][3] Index failed for [sequences#apps]
  • over 1 year Would be cool to add the badge
  • over 1 year FATAL: role "pio" does not exist
  • over 1 year pio new app fails on macOS
  • over 1 year Unresolved Dependency
  • over 1 year Can't run ./bin/
  • over 1 year & pio-class fails on parsing RELEASE file
  • over 1 year Vagrant don`t work
  • over 1 year 404 error getting API docs from link
  • over 1 year Bind fail when running pio
  • over 1 year postgres and non-tiny data
  • over 1 year Fail to run "pio deploy" in cluster
  • over 1 year io.prediction#core_2.10;0.10.0-SNAPSHOT: not found
  • over 1 year PredictionIO install script doesn't work!
  • over 1 year How can I add items without the entityType value of 'item' in PHP SDK
  • over 1 year pio build returns with 'No subject alternative DNS name matching found.'
  • over 1 year Website related problems
  • over 1 year pio train exception
  • over 1 year Why is the MaxNumberOfEventsPerBatchRequest is 50?
  • over 1 year Make TLS/https optional
  • over 1 year pio build —— error
  • almost 2 years The link to the newsletter is broken in the
  • almost 2 years Creating new app fails/hangs
  • almost 2 years Installation script doesn't work
  • almost 2 years The "Installation" section points to old SF PredictionIO
  • almost 2 years down
  • almost 2 years pio train fails because of ( Cannot run program "/home/tigra/PredictionIO/vendors/spark-1.6.0/bin/spark-submit": error=2, No such file or directory)
  • almost 2 years X-ray Image Pattern Recognition
  • almost 2 years Release to maven with scala 2.11.X
  • almost 2 years The log4j.prorperties included in the --files argument has no effect as it is right now
incubator-predictionio open pull requests
  • [PIO-1] Make SSL and authKey param authentication optional
  • Make SSL and authKey param authentication optional
  • Merge pull request #1 from PredictionIO/develop
  • update SimilarProduct return-item-properties example to v0.3.2
  • ES 2.x
  • Update
  • [PIO-35] Add integration tests for official templates
  • *Added classification template based on Lingpipe ( algorithm
  • Update document
  • [PIO-30] Set up a cross build for Scala 2.10 (Spark 1.6.2) and Scala …
  • [PIO-34]Update templates.yaml
  • [PIO-28] Console refactor
  • Templates link fixed.
  • Updated the file and fix the issue #254
  • [PIO-40] Remove docs/manual/obsolete/*
  • [PIO-42] : Negative Test Cases for Engine.train
  • update list of docker installations
  • Fix GNU's readlink with -f option not working on a Mac and BSD based systems
  • Update the Recommendation Quickstart
  • updated templates.yaml
  • [PIO-48] scala-local-helloword Sample Not working
  • [PIO-47] Eliminate enginemanifest for stateless build
  • Add git to Dockerfile
  • [PIO-65] Cache downloaded jars in Travis build
  • Fix docs: pio template is no longer supported.
  • Update
  • Elasticsearch basic HTTP authentication
  • [PIO-61] Add S3 Model Data Repository
  • Update linux distro url
  • [PIO-59] Use /dev/urandom to create access keys.
  • [PIO-56] Adding embedded elasticsearch and mocked configuration for tests
  • [PIO-127] Update release instructions in
  • bump up hbase client version and make it configurable
  • add S3 storage provider docs
  • [PIO-138] Fix batchpredict for custom PersistentModel
  • [PIO-137] Create a connection object at a worker to delete events
  • [PIO-136] Add CleanupFunctions for Python
  • pio batchpredict error
  • Update org.template references to org.example
  • Update
  • [PIO-155] Fix 'Topic Labelling with Wikipedia' Template Link
incubator-predictionio latest release notes
v0.9.6 PredictionIO 0.9.6

Thanks to the community for the continued support of PredictionIO! This release would not have been possible without the community's participation.

Breaking Changes

  • Starting from 0.9.6, Java 8 is a hard requirement
  • HTTPS is enforced on all REST endpoints

If HTTPS is going to break your current deployment, please wait for the upcoming patch release.


  • Default Spark version has been bumped to 1.6.0
  • Fixes regarding manipulating data sources with JDBC interface
  • Security fixes
  • Code style cleanup and fixes
  • Many documentation and script fixes