Take a journey toward discovering, learning, and using Apache Spark
3.0. In this book, you will gain expertise on the powerful and
efficient distributed data processing engine inside of Apache
Spark; its user-friendly, comprehensive, and flexible programming
model for processing data in batch and streaming; and the scalable
machine learning algorithms and practical utilities to build
machine learning applications. Beginning Apache Spark 3 opens with
Spark concepts and architecture, the Spark unified stack, and the
different ways of interacting with Apache Spark. Next, it
offers an overview of Spark SQL before moving on to its advanced
features. It covers tips and techniques for dealing with
performance issues, followed by an overview of the structured
streaming processing engine. It concludes with a demonstration of
how to develop machine learning applications using Spark MLlib and
how to manage the machine learning development lifecycle. This book
is packed with practical examples and code snippets to help you
master concepts and features immediately after they are covered in
each section. After reading this book, you will have the knowledge
required to build your own big data pipelines, applications, and
machine learning applications.

What You Will Learn
Master the Spark unified data analytics engine and its various components, which work in tandem to provide a scalable, fault-tolerant, and performant data processing engine
Leverage the user-friendly and flexible programming model to perform simple to complex data analytics using DataFrames and Spark SQL
Develop machine learning applications using Spark MLlib
Manage the machine learning development lifecycle using MLflow

Who This Book Is For
Data scientists, data engineers, and software developers.

Develop applications for the big data landscape with Spark and
Hadoop. This book also explains the role of Spark in developing
scalable machine learning and analytics applications with cloud
technologies. Beginning Apache Spark 2 gives you an introduction to
Apache Spark and shows you how to work with it. Along the way,
you'll discover resilient distributed datasets (RDDs); use Spark
SQL for structured data; and learn stream processing and build
real-time applications with Spark Structured Streaming.
Furthermore, you'll learn the fundamentals of Spark ML for machine
learning and much more. After you read this book, you will have the
fundamentals to become proficient in using Apache Spark and know
when and how to apply it to your big data applications.

What You Will Learn
Understand the Spark unified data processing platform
Run Spark in Spark Shell or Databricks
Use and manipulate RDDs
Deal with structured data using Spark SQL through its operations and advanced functions
Build real-time applications using Spark Structured Streaming
Develop intelligent applications with the Spark Machine Learning library

Who This Book Is For
Programmers and developers active in big data, Hadoop, and Java but who are new to the Apache Spark platform.