A Definitive Information to Hadoop-Associated Frameworks and Instruments
This ebook is a sensible information on utilizing the Apache Hadoop initiatives together with MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout and Apache Solr. From organising the surroundings to working pattern functions every chapter is a sensible tutorial on utilizing a Apache Hadoop ecosystem mission. Whereas a number of books on Apache Hadoop can be found, most are based mostly on the principle initiatives MapReduce and HDFS and none discusses the opposite Apache Hadoop ecosystem initiatives and the way these all work collectively as a cohesive large knowledge growth platform.
What you’ll study
How one can arrange surroundings in Linux for Hadoop initiatives utilizing Cloudera Hadoop Distribution CDH 5.
How one can run a MapReduce job
How one can retailer knowledge with Apache Hive, Apache HBase
How one can index knowledge in HDFS with Apache Solr
How one can develop a Kafka messaging system
How one can develop a Mahout Person Recommender System
How one can stream Logs to HDFS with Apache Flume
How one can switch knowledge from MySQL database to Hive, HDFS and HBase with Sqoop
How create a Hive desk over Apache Solr
Edition: 1st Edition
ISBN: 1484221982
Posted on: 11/18/2016
Format: Pdf
Page Count: 421 Pages
Author: Deepak Vohra,: --------------------