Hadoop Framework
Hadoop Framework
Hadoop Framework
ETL
Raw Data
RDBMS
Reports
History
GFS
GFS
MapReduce
src: http://blog.sqlauthority.com
Scalable
Fault tolerant
Main components:
HDFS
src: http://sundar5.wordpress.com/2010/03/19/hadoop-basic/
HDFS
Hadoop Architecture
Hadoop Architecture
Hadoop Architecture
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Flume
is
for
integrating
large
volume of log data.
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Apache Sqoop
Apache OOZIE
Oozie is a workfow
scheduler system to
manage Apache
Hadoop jobs.
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Scalable Machine Learning
Library
Apache Flume
Apache Sqoop
Apache OOZIE
Related Tools
Apache Pig
Apache Mahout
Apache Hive
Apache ZooKeeper
Apache HBase
Apache Flume
Zookeeper allows distributed
processes to coordinate with
Apache
eachSqoop
other through a shared
hierarchical name space of
data registers.
Apache OOZIE
references
http://blog.sqlauthority.com