Subhashini Chellappan, Dharanitharan Ganesan

Practical Apache Spark

Using the Scala API

Fotogalerie

Zur Bildergalerie

Subhashini Chellappan, Dharanitharan Ganesan

Practical Apache Spark

Using the Scala API

Broschiertes Buch

Jetzt bewerten Jetzt bewerten

Weitere Ausgabe:
eBook, PDF

Andere Kunden interessierten sich auch für

Hien Luu
Beginning Apache Spark 2

46,99 €
Butch Quinto
Next-Generation Big Data

43,99 €
Elad Eldor
Kafka Troubleshooting in Production

29,99 €
Jaya Sinha
Hadoop Ecosystem, YARN, NoSQL, MongoDB, Spark & Scala Guide

40,99 €
Jay M. Patel
Getting Structured Data from the Internet

45,99 €
Butch Quinto
Next-Generation Machine Learning with Spark

38,99 €
Raju Kumar Mishra
PySpark Recipes

45,99 €

Produktbeschreibung

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You'll follow a learn-to-do-by-yourself approach to learning - learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure.
On completion, you'll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You'll also become familiar with machine learning algorithms with real-time usage.
What You Will LearnDiscover the functional programming features of Scala
Understand the completearchitecture of Spark and its componentsIntegrate Apache Spark with Hive and Kafka
Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
Work with different machine learning concepts and libraries using Spark's MLlib packages

Who This Book Is For
Developers and professionals who deal with batch and stream data processing.

Produktdetails

Produktdetails
Verlag: Apress / Springer, Berlin
Artikelnr. des Verlages: 978-1-4842-3651-2
First Edition
Seitenzahl: 296
Erscheinungstermin: 13. Dezember 2018
Englisch
Abmessung: 254mm x 178mm x 17mm
Gewicht: 562g
ISBN-13: 9781484236512
ISBN-10: 1484236513
Artikelnr.: 52070231

Herstellerkennzeichnung
Libri GmbH
Europaallee 1
36244 Bad Hersfeld
gpsr@libri.de

Produktdetails

Verlag: Apress / Springer, Berlin
Artikelnr. des Verlages: 978-1-4842-3651-2
First Edition
Seitenzahl: 296
Erscheinungstermin: 13. Dezember 2018
Englisch
Abmessung: 254mm x 178mm x 17mm
Gewicht: 562g
ISBN-13: 9781484236512
ISBN-10: 1484236513
Artikelnr.: 52070231

Herstellerkennzeichnung
Libri GmbH
Europaallee 1
36244 Bad Hersfeld
gpsr@libri.de

Autorenporträt

Subhashini Chellappan is a technology enthusiast with expertise in the big data and cloud space. She has rich experience in both academia and the software industry. Her areas of interest and expertise are centered on business intelligence, big data analytics and cloud computing. Dharanitharan Ganesan is a senior analyst with five years of experience in IT. He has a high level of exposure and experience in big data - Apache Hadoop, Apache Spark and various Hadoop ecosystem components. He has a proven track record of improving efficiency and productivity through the automation of various routine and administrative functions in business intelligence and big data technologies. His areas of interest and expertise are centered on machine learning algorithms, statistical modelling and predictive analysis.

Inhaltsangabe

1. Scala - Functional Programming Aspects. - 2. Single & Multi-node cluster setup. - 3. Introduction to Apache Spark and Spark Core. - 4. Spark SQL, Dataframes & Datasets. - 5. Introduction to Spark Streaming. - 6. Spark Structured Streaming. - 7. Spark Streaming with Kafka. - 8. Spark Machine Learning Library. - 9. Working with SparkR. - 10. Spark - Real time use case.

Inhaltsangabe