Big Data Processing Using Spark in Cloud

Nonfiction, Computers, Database Management, Networking & Communications, Computer Security, General Computing
Cover of the book Big Data Processing Using Spark in Cloud by , Springer Singapore
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: ISBN: 9789811305504
Publisher: Springer Singapore Publication: June 16, 2018
Imprint: Springer Language: English
Author:
ISBN: 9789811305504
Publisher: Springer Singapore
Publication: June 16, 2018
Imprint: Springer
Language: English

The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements big data’s immutable nature, and solves it with lazy evaluation, cacheable and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark data frames, machine learning using Mllib, graph analytics using Graph X and real-time processing with Apache Kafka, AWS Kenisis, and Azure Event Hub. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines the interaction with current big data tools, with Spark being the core processing layer for all types of data.

The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements big data’s immutable nature, and solves it with lazy evaluation, cacheable and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark data frames, machine learning using Mllib, graph analytics using Graph X and real-time processing with Apache Kafka, AWS Kenisis, and Azure Event Hub. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines the interaction with current big data tools, with Spark being the core processing layer for all types of data.

The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments.

More books from Springer Singapore

Cover of the book Principles of Transversality in Globalization and Education by
Cover of the book Severe Asthma by
Cover of the book Electronics, Communications and Networks V by
Cover of the book Socio-Economic Analysis of Arsenic Contamination of Groundwater in West Bengal by
Cover of the book Not-So-Simple Stellar Populations in Star Clusters by
Cover of the book Changing Spatial Elements in Chinese Socio-economic Five-year Plan: from Project Layout to Spatial Planning by
Cover of the book Microbial Biotechnology by
Cover of the book Conflict and Youth Rights in India by
Cover of the book Frontier Computing by
Cover of the book Proceedings of the International Conference on Microelectronics, Computing & Communication Systems by
Cover of the book South Asia in Global Power Rivalry by
Cover of the book Invasive Computing for Mapping Parallel Programs to Many-Core Architectures by
Cover of the book Strategic Marketing and Innovation for Indian MSMEs by
Cover of the book The Development of MOOCs in China by
Cover of the book Commerce and Strangers in Adam Smith by
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy