Data Science Solutions with Python

Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn

Tshepo Chris Nokeri

Paperback

Other edition:
eBook, PDF

Product description

Apply supervised and unsupervised learning to solve practical and real-world big data problems. This book teaches you how to engineer features, optimize hyperparameters, train and test models, develop pipelines, and automate the machine learning (ML) process.
The book covers an in-memory, distributed cluster computing framework known as PySpark, machine learning framework platforms known as scikit-learn, PySpark MLlib, H2O, and XGBoost, and a deep learning (DL) framework known as Keras.

The book starts off presenting supervised and unsupervised ML and DL models, and then it examines big data frameworks along with ML and DL frameworks. Author Tshepo Chris Nokeri considers a parametric model known as the Generalized Linear Model and a survival regression model known as the Cox Proportional Hazards model along with Accelerated Failure Time (AFT). Also presented are a binary classification model (logistic regression) and an ensemble model (Gradient Boosted Trees). The book introduces DL and an artificial neural network known as the Multilayer Perceptron (MLP) classifier. A way of performing cluster analysis using the K-Means model is covered. Dimension reduction techniques such as Principal Components Analysis and Linear Discriminant Analysis are explored. Finally, automated machine learning is unpacked.

This book is for intermediate-level data scientists and machine learning engineers who want to learn how to apply key big data frameworks and ML and DL frameworks. You will need prior knowledge of the basics of statistics, Python programming, probability theories, and predictive analytics.

What You Will Learn
Understand widespread supervised and unsupervised learning, including key dimension reduction techniques
Know the big data analytics layers such as data visualization, advanced statistics, predictive analytics, machine learning, and deep learning
Integrate big data frameworks with a hybrid of machine learning frameworks and deep learning frameworks
Design, build, test, and validate skilled machine learning models and deep learning models
Optimize model performance using data transformation, regularization, outlier remedying, hyperparameter optimization, and data split ratio alteration

Who This Book Is For
Data scientists and machine learning engineers with basic knowledge and understanding of Python programming, probability theories, and predictive analytics

Product details

Publisher: Apress / Springer, Berlin
Publisher's item no.: 978-1-4842-7761-4
1st ed.
Pages: 136
Publication date: October 26, 2021
Language: English
Dimensions: 254 mm x 178 mm x 8 mm
Weight: 271 g
ISBN-13: 9781484277614
ISBN-10: 1484277619
Item no.: 62514271

About the author

Tshepo Chris Nokeri harnesses advanced analytics and artificial intelligence to foster innovation and optimize business performance. In his functional work, he has delivered complex solutions to companies in the mining, petroleum, and manufacturing industries. He initially completed a bachelor's degree in information management. Afterward, he graduated with an Honours degree in business science at the University of the Witwatersrand on a TATA Prestigious Scholarship and a Wits Postgraduate Merit Award. He was unanimously awarded the Oxford University Press Prize.

Table of contents

Chapter 1: Understanding Machine Learning and Deep Learning
Chapter 2: Big Data Frameworks and ML and DL Frameworks
Chapter 3: The Parametric Method - Linear Regression
Chapter 4: Survival Regression Analysis
Chapter 5: The Non-Parametric Method - Classification
Chapter 6: Tree-based Modelling and Gradient Boosting
Chapter 7: Artificial Neural Networks
Chapter 8: Cluster Analysis using K-Means
Chapter 9: Dimension Reduction - Principal Components Analysis
Chapter 10: Automated Machine Learning

Reviews

"The book has a reader-centric style. Topics are covered briefly. ... The book can be considered as an introduction to various topics. Code listings and graphical results for different models are added benefits, which could enhance learning and exposure." (Jawwad Shamsi, Computing Reviews, June 29, 2022)
