Open source projects by cloudera

cloudera/spark-timeseries

A library for time series analysis on Apache Spark

☕Scala   ★546 stars   ⚠61 open issues   ⚭10 contributors   ☯almost 2 years old  

cloudera/hue

Let’s Big Data. Hue is an open source Web interface for analyzing data with Hadoop and Spark.

☕Python   ★2087 stars   ⚠38 open issues   ⚭83 contributors   ☯over 6 years old  

cloudera/spark-dataflow

Provides an Apache Spark backend for executing Dataflow pipelines.

☕Java   ★315 stars   ⚠4 open issues   ⚭7 contributors   ☯about 2 years old  

cloudera/ibis

Productivity-centric Python data analysis framework for SQL systems and the Hadoop platform. Co-founded by the creator of pandas

☕Python   ★790 stars   ⚠224 open issues   ⚭9 contributors   ☯almost 2 years old  

cloudera/oryx

(Retired) version 1 of simple real-time large-scale machine learning infrastructure.

☕Java   ★1404 stars   ⚠1 open issues   ⚭7 contributors   ☯over 3 years old  

cloudera/kudu

Apache Kudu. Mirrored from https://github.com/apache/kudu

☕C++   ★568 stars   ⚠1 open issues   ⚭22 contributors   ☯over 2 years old  

cloudera/Impala

Real-time Query for Hadoop

☕C++   ★1738 stars   ⚠42 open issues   ⚭51 contributors   ☯over 4 years old  

cloudera/livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere

☕Scala   ★349 stars   ⚠17 open issues   ⚭3 contributors   ☯over 1 year old  

cloudera/cdh-twitter-example

Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive

☕Java   ★238 stars   ⚠13 open issues   ⚭4 contributors   ☯over 4 years old  

cloudera/search

☕Java   ★72 stars   ⚠1 open issues   ⚭4 contributors   ☯almost 4 years old  

cloudera/sqoop

Sqoop has moved to Apache!

☕Java   ★160 stars   ⚠2 open issues   ⚭10 contributors   ☯almost 7 years old  

cloudera/cdk

Cloudera Development Kit

☕Java   ★203 stars   ⚠7 open issues   ⚭11 contributors   ☯almost 4 years old  

cloudera/impyla

Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)

☕Python   ★257 stars   ⚠59 open issues   ⚭15 contributors   ☯almost 3 years old  

cloudera/cm_ext

Cloudera Manager Extensibility Tools and Documentation.

☕Java   ★53 stars   ⚠8 open issues   ⚭2 contributors   ☯about 3 years old  

cloudera/ibis-notebooks

IPython notebooks and learning materials for Ibis

☕Python   ★50 stars   ⚠2 open issues   ⚭2 contributors   ☯almost 2 years old  

cloudera/kitten

The fast and fun way to write YARN applications.

☕Java   ★114 stars   ⚠2 open issues   ⚭2 contributors   ☯over 4 years old  

cloudera/spark

Mirror of Apache Spark

☕Scala   ★36 stars   ⚠0 open issues   ⚭135 contributors   ☯about 3 years old  

cloudera/cm_api

Cloudera Manager API Client

☕Java   ★119 stars   ⚠28 open issues   ⚭18 contributors   ☯almost 5 years old  

cloudera/impala-udf-samples

Sample UDF and UDAs for Impala.

☕C++   ★27 stars   ⚠3 open issues   ⚭5 contributors   ☯over 3 years old  

cloudera/flume

WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.

☕Java   ★873 stars   ⚠6 open issues   ⚭20 contributors   ☯over 6 years old  

cloudera/cdh-package

☕Groovy   ★23 stars   ⚠1 open issues   ⚭3 contributors   ☯over 4 years old  

cloudera/hadoop-common

Mirror of Apache Hadoop common

☕Java   ★49 stars   ⚠2 open issues   ⚭15 contributors   ☯about 6 years old  

cloudera/sentry

Access Server

☕Java   ★25 stars   ⚠1 open issues   ⚭5 contributors   ☯about 4 years old  

cloudera/python-ngrams

☕Python   ★74 stars   ⚠0 open issues   ⚭1 contributors   ☯about 4 years old  

cloudera/hdfs-nfs-proxy

☕Java   ★43 stars   ⚠11 open issues   ⚭1 contributors   ☯over 4 years old  

cloudera/parquet-examples

Example programs and scripts for accessing parquet files

☕Java   ★35 stars   ⚠1 open issues   ⚭1 contributors   ☯about 3 years old  

cloudera/impala-tpcds-kit

TPC-DS Kit for Impala

☕Shell   ★74 stars   ⚠10 open issues   ⚭6 contributors   ☯about 3 years old  

cloudera/htrace

☕Java   ★137 stars   ⚠8 open issues   ⚭9 contributors   ☯over 4 years old  

cloudera/hbase

Mirror of Apache Hadoop HBase

☕Java   ★23 stars   ⚠0 open issues   ⚭7 contributors   ☯about 6 years old  

cloudera/madlibport

Madlib port for Cloudera Impala

☕C++   ★27 stars   ⚠0 open issues   ⚭4 contributors   ☯over 3 years old  

cloudera/hive

Mirror of Apache Hive

☕Java   ★34 stars   ⚠0 open issues   ⚭2 contributors   ☯about 6 years old  

cloudera/director-scripts

Cloudera Director sample code

☕Shell   ★23 stars   ⚠3 open issues   ⚭8 contributors   ☯about 2 years old  

cloudera/accumulo

CDH specific changes and backports on top of Apache Accumulo

   ★0 stars   ⚠0 open issues   ⚭58 contributors   ☯7 months old  

cloudera/accumulo-upgrade-test

Testing for Apache Accumulo upgrades

☕Java   ★0 stars   ⚠0 open issues   ⚭3 contributors   ☯almost 2 years old  

cloudera/ades

An analysis of adverse drug event data using Hadoop, R, and Gephi

☕Java   ★35 stars   ⚠0 open issues   ⚭2 contributors   ☯over 5 years old  

cloudera/alfredo

Alfredo, Java HTTP SPNEGO

☕Java   ★29 stars   ⚠2 open issues   ⚭1 contributors   ☯about 6 years old  

cloudera/art

vector drawing for buttons, icons, widgets and all that stuff

☕JavaScript   ★1 stars   ⚠0 open issues   ⚭6 contributors   ☯about 7 years old  

cloudera/art-widgets

☕JavaScript   ★8 stars   ⚠0 open issues   ⚭5 contributors   ☯almost 7 years old  

cloudera/avro

Mirror of Apache Avro

☕Java   ★6 stars   ⚠0 open issues   ⚭4 contributors   ☯about 6 years old  

cloudera/behavior

Auto-instantiates widgets/classes based on parsed, declarative HTML.

☕JavaScript   ★2 stars   ⚠0 open issues   ⚭5 contributors   ☯about 6 years old  

cloudera/bigtop

Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a community around the packaging and interoperability testing of Hadoop-related projects. This includes testing at various levels (packaging, platform, runtime, upgrade, etc...) developed by a community with a focus on the system as a whole, rather than individual projects.

☕Groovy   ★49 stars   ⚠1 open issues   ⚭2 contributors   ☯over 5 years old  

cloudera/blog-eclipse

☕Perl   ★3 stars   ⚠0 open issues   ⚭1 contributors   ☯almost 4 years old  

cloudera/branchreduce

Distributed branch-and-bound on Hadoop YARN.

☕Java   ★10 stars   ⚠0 open issues   ⚭1 contributors   ☯over 4 years old  

cloudera/catapult

Catapult - fork used for embedding in Kudu

☕HTML   ★0 stars   ⚠0 open issues   ⚭152 contributors   ☯5 months old  

cloudera/cdh-maven-archetype

Cloudera Maven Archetypes

☕Java   ★5 stars   ⚠0 open issues   ⚭2 contributors   ☯over 5 years old  

cloudera/cdk-examples

Cloudera Development Kit Examples

☕Java   ★56 stars   ⚠3 open issues   ⚭5 contributors   ☯almost 4 years old  

cloudera/cdk-examples-integration-tests

☕Java   ★1 stars   ⚠0 open issues   ⚭2 contributors   ☯over 3 years old  

cloudera/clientcide

The Clientcide Javascript Libraries

☕JavaScript   ★1 stars   ⚠0 open issues   ⚭4 contributors   ☯about 7 years old  

cloudera/cloudera-playbook

Cloudera deployment automation with Ansible

☕Python   ★5 stars   ⚠5 open issues   ⚭1 contributors   ☯4 months old  

cloudera/cloudera-training

Training exercises for Cloudera's Distribution for Hadoop

☕CSS   ★76 stars   ⚠1 open issues   ⚭0 contributors   ☯almost 8 years old  

cloudera/clusterdock

☕Python   ★24 stars   ⚠6 open issues   ⚭0 contributors   ☯7 months old  

cloudera/cm_charting_scrapbook

A collection of useful Cloudera Manager charts

   ★12 stars   ⚠0 open issues   ⚭2 contributors   ☯almost 4 years old  

cloudera/cm_csds

A collection of Custom Service Descriptors

☕Shell   ★18 stars   ⚠1 open issues   ⚭1 contributors   ☯about 3 years old  

cloudera/collections-generic

☕Java   ★0 stars   ⚠0 open issues   ⚭2 contributors   ☯about 1 year old  

cloudera/crepo

cloudera repo management tool

☕Python   ★26 stars   ⚠5 open issues   ⚭3 contributors   ☯over 7 years old  

cloudera/crunch

Crunch is an Apache TLP now, and lives at http://crunch.apache.org/

☕Java   ★316 stars   ⚠0 open issues   ⚭8 contributors   ☯over 5 years old  

cloudera/datafu

☕Java   ★5 stars   ⚠0 open issues   ⚭4 contributors   ☯almost 4 years old  

cloudera/director-aws-plugin

Cloudera Director - Amazon Web Services integration

☕Java   ★6 stars   ⚠0 open issues   ⚭3 contributors   ☯almost 2 years old  

cloudera/director-azure-plugin

Cloudera Director - Microsoft Azure Integration

☕Java   ★1 stars   ⚠0 open issues   ⚭3 contributors   ☯8 months old  

cloudera/director-byon-plugin-example

BYON (Bring-Your-Own-Nodes) plugin for Cloudera Director

☕Java   ★1 stars   ⚠0 open issues   ⚭2 contributors   ☯almost 2 years old