[Keynote]The role of the Distribution in the Apache Hadoop Ecosystem
133
Views
Length: 27:46
Cloudera Inc, HDFS/MapReduce/HBase Commiter, Todd Lipcon
In the past several years, Apache Hadoop has enjoyed considerable success due to its ability to scalably and reliably store and process vast quantities of data. HDFS and MapReduce are the two…
Cloudera Inc, HDFS/MapReduce/HBase Commiter, Todd Lipcon
In the past several years, Apache Hadoop has enjoyed considerable success due to its ability to scalably and reliably store and process vast quantities of data. HDFS and MapReduce are the two core components of this software, but the real power of Hadoop comes from the larger ecosystem of open source projects built on and around this core: projects like Apache Hive, Pig, Flume, Oozie, HBase, Avro, and more. In this talk, Todd will introduce CDH, Cloudera's entirely open source distribution that integrates all of these components in a single product, and explain why it is the easiest and most popular way to deploy Hadoop in critical enterprise environments world-wide.
More
Be the first to leave a comment for this video!