Practical Hadoop Ecosystem

A Definitive Guide to Hadoop-Related Frameworks and Tools

Nonfiction, Computers, Database Management, General Computing
Cover of the book Practical Hadoop Ecosystem by Deepak Vohra, Apress
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Deepak Vohra ISBN: 9781484221990
Publisher: Apress Publication: September 30, 2016
Imprint: Apress Language: English
Author: Deepak Vohra
ISBN: 9781484221990
Publisher: Apress
Publication: September 30, 2016
Imprint: Apress
Language: English

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.

What You Will Learn:

  • Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5

  • Run a MapReduce job

  • Store data with Apache Hive, and Apache HBase

  • Index data in HDFS with Apache Solr

  • Develop a Kafka messaging system

  • Stream Logs to HDFS with Apache Flume

  • Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop

  • Create a Hive table over Apache Solr

  • Develop a Mahout User Recommender System

Who This Book Is For:

Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.

What You Will Learn:

Who This Book Is For:

Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.

More books from Apress

Cover of the book Winning Design! by Deepak Vohra
Cover of the book Practical LPIC-1 Linux Certification Study Guide by Deepak Vohra
Cover of the book Pro HTML5 with CSS, JavaScript, and Multimedia by Deepak Vohra
Cover of the book PHP 7 Zend Certification Study Guide by Deepak Vohra
Cover of the book Beginning PowerShell for SharePoint 2013 by Deepak Vohra
Cover of the book SAP MII by Deepak Vohra
Cover of the book BizTalk 2013 EDI for Supply Chain Management by Deepak Vohra
Cover of the book Scalable Big Data Architecture by Deepak Vohra
Cover of the book Build your own 2D Game Engine and Create Great Web Games by Deepak Vohra
Cover of the book Beginning ASP.NET MVC 4 by Deepak Vohra
Cover of the book Pro Couchbase Server by Deepak Vohra
Cover of the book Beginning Java EE 7 by Deepak Vohra
Cover of the book Python Data Analytics by Deepak Vohra
Cover of the book Practical Linux Infrastructure by Deepak Vohra
Cover of the book Pro Microsoft HDInsight by Deepak Vohra
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy