Apache Spark 2.x Machine Learning Cookbook

Nonfiction, Computers, Advanced Computing, Theory, Artificial Intelligence, General Computing
Cover of the book Apache Spark 2.x Machine Learning Cookbook by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei, Packt Publishing
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei ISBN: 9781782174608
Publisher: Packt Publishing Publication: September 22, 2017
Imprint: Packt Publishing Language: English
Author: Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
ISBN: 9781782174608
Publisher: Packt Publishing
Publication: September 22, 2017
Imprint: Packt Publishing
Language: English

Simplify machine learning model implementations with Spark

About This Book

  • Solve the day-to-day problems of data science with Spark
  • This unique cookbook consists of exciting and intuitive numerical recipes
  • Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your data

Who This Book Is For

This book is for Scala developers with a fairly good exposure to and understanding of machine learning techniques, but lack practical implementations with Spark. A solid knowledge of machine learning algorithms is assumed, as well as hands-on experience of implementing ML algorithms with Scala. However, you do not need to be acquainted with the Spark ML libraries and ecosystem.

What You Will Learn

  • Get to know how Scala and Spark go hand-in-hand for developers when developing ML systems with Spark
  • Build a recommendation engine that scales with Spark
  • Find out how to build unsupervised clustering systems to classify data in Spark
  • Build machine learning systems with the Decision Tree and Ensemble models in Spark
  • Deal with the curse of high-dimensionality in big data using Spark
  • Implement Text analytics for Search Engines in Spark
  • Streaming Machine Learning System implementation using Spark

In Detail

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability, and optimization. Learning about algorithms enables a wide range of applications, from everyday tasks such as product recommendations and spam filtering to cutting edge applications such as self-driving cars and personalized medicine. You will gain hands-on experience of applying these principles using Apache Spark, a resilient cluster computing system well suited for large-scale machine learning tasks.

This book begins with a quick overview of setting up the necessary IDEs to facilitate the execution of code examples that will be covered in various chapters. It also highlights some key issues developers face while working with machine learning algorithms on the Spark platform. We progress by uncovering the various Spark APIs and the implementation of ML algorithms with developing classification systems, recommendation engines, text analytics, clustering, and learning systems. Toward the final chapters, we'll focus on building high-end applications and explain various unsupervised methodologies and challenges to tackle when implementing with big data ML systems.

Style and approach

This book is packed with intuitive recipes supported with line-by-line explanations to help you understand how to optimize your work flow and resolve problems when working with complex data modeling tasks and predictive algorithms. This is a valuable resource for data scientists and those working on large scale data projects.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Simplify machine learning model implementations with Spark

About This Book

Who This Book Is For

This book is for Scala developers with a fairly good exposure to and understanding of machine learning techniques, but lack practical implementations with Spark. A solid knowledge of machine learning algorithms is assumed, as well as hands-on experience of implementing ML algorithms with Scala. However, you do not need to be acquainted with the Spark ML libraries and ecosystem.

What You Will Learn

In Detail

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability, and optimization. Learning about algorithms enables a wide range of applications, from everyday tasks such as product recommendations and spam filtering to cutting edge applications such as self-driving cars and personalized medicine. You will gain hands-on experience of applying these principles using Apache Spark, a resilient cluster computing system well suited for large-scale machine learning tasks.

This book begins with a quick overview of setting up the necessary IDEs to facilitate the execution of code examples that will be covered in various chapters. It also highlights some key issues developers face while working with machine learning algorithms on the Spark platform. We progress by uncovering the various Spark APIs and the implementation of ML algorithms with developing classification systems, recommendation engines, text analytics, clustering, and learning systems. Toward the final chapters, we'll focus on building high-end applications and explain various unsupervised methodologies and challenges to tackle when implementing with big data ML systems.

Style and approach

This book is packed with intuitive recipes supported with line-by-line explanations to help you understand how to optimize your work flow and resolve problems when working with complex data modeling tasks and predictive algorithms. This is a valuable resource for data scientists and those working on large scale data projects.

More books from Packt Publishing

Cover of the book R Graph Essentials by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book IBM Lotus Notes 8.5 User Guide by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Learning LibGDX Game Development - Second Edition by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Mastering Apache Cassandra 3.x by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Unity 5: Learning C# by Developing Games by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Odoo Development Essentials by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Plone 3 Multimedia by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Google SketchUp for Game Design: Beginner's Guide by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Getting started with Intellij IDEA by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Learning Pixelmator by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Cocos2d-x Game Development Essentials by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Swift Data Structure and Algorithms by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Natural Language Processing with Java and LingPipe Cookbook by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book CakePHP Application Development by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Cover of the book Mastering Identity and Access Management with Microsoft Azure by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy