Getting Started with Kudu

Perform Fast Analytics on Fast Data

Nonfiction, Computers, Database Management, Information Storage & Retrievel, Data Processing
Cover of the book Getting Started with Kudu by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart ISBN: 9781491980200
Publisher: O'Reilly Media Publication: July 9, 2018
Imprint: O'Reilly Media Language: English
Author: Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
ISBN: 9781491980200
Publisher: O'Reilly Media
Publication: July 9, 2018
Imprint: O'Reilly Media
Language: English

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator—either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data. This practical guide shows you how.

Begun as an internal project at Cloudera, Kudu is an open source solution compatible with many data processing frameworks in the Hadoop environment. In this book, current and former solutions professionals from Cloudera provide use cases, examples, best practices, and sample code to help you get up to speed with Kudu.

  • Explore Kudu’s high-level design, including how it spreads data across servers
  • Fully administer a Kudu cluster, enable security, and add or remove nodes
  • Learn Kudu’s client-side APIs, including how to integrate Apache Impala, Spark, and other frameworks for data manipulation
  • Examine Kudu’s schema design, including basic concepts and primitives necessary to make your project successful
  • Explore case studies for using Kudu for real-time IoT analytics, predictive modeling, and in combination with another storage engine
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator—either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data. This practical guide shows you how.

Begun as an internal project at Cloudera, Kudu is an open source solution compatible with many data processing frameworks in the Hadoop environment. In this book, current and former solutions professionals from Cloudera provide use cases, examples, best practices, and sample code to help you get up to speed with Kudu.

More books from O'Reilly Media

Cover of the book Apache Sqoop Cookbook by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book View Updating and Relational Theory by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Bash Pocket Reference by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Accessibility Handbook by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book High Performance Responsive Design by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book iPad: The Missing Manual by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Positioning in CSS by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Building Polyfills by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book SQL in a Nutshell by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Google Maps Hacks by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Visualizing Streaming Data by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Think Bayes by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book QuickBooks 2009: The Missing Manual by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Transforms in CSS by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Cover of the book Office 2010: The Missing Manual by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy