How Kudu Enables Fast Analytics on Fast Data

From our partner, Cloudera’s website:

Kudu is a columnar storage manager developed for the Hadoop platform that runs on commodity hardware, is horizontally scalable, and supports highly available operation.

Kudu shares the common technical properties of Hadoop ecosystem applications and targets support for families of applications that are difficult  to implement on current generation Hadoop storage technologies.


Kudu’s design sets it apart. Some of Kudu’s benefits include:

  • Fast processing of OLAP workloads.
  • Integration with MapReduce, Spark and other Hadoop ecosystem components.
  • Tight integration with Impala, making it a good, mutable alternative to using HDFS with Parquet.
  • Strong but flexible consistency model, allowing you to choose consistency requirements on a per-request basis, including the option for strict serialized consistency.
  • Strong performance for running sequential and random workloads simultaneously.
  • Easy to administer and manage with Cloudera Manager.
  • High availability. Tablet Servers and Master use the Raft consensus algorithm, which ensures availability even if f replicas fail, given 2f+1 available replicas. Reads can be serviced by read-only follower tablets, even in the event of a leader tablet failure.
  • Structured data model.

To read more about Kudu and find out how to install it, go to our partner Cloudera’s website.