0
Your cart

Your cart is empty

Books > Computing & IT > Computer programming

Buy Now

Big Data Processing with Apache Spark - Efficiently tackle large datasets and big data analysis with Spark and Python (Paperback) Loot Price: R921
Discovery Miles 9 210
Big Data Processing with Apache Spark - Efficiently tackle large datasets and big data analysis with Spark and Python...

Big Data Processing with Apache Spark - Efficiently tackle large datasets and big data analysis with Spark and Python (Paperback)

Manuel Ignacio Franco Galeano

 (sign in to rate)
Loot Price R921 Discovery Miles 9 210 | Repayment Terms: R86 pm x 12*

Bookmark and Share

Expected to ship within 18 - 22 working days

No need to spend hours ploughing through endless data - let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark Book DescriptionProcessing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by learning data processing fundamentals using Resilient Distributed Datasets (RDDs), SQL, Datasets, and Dataframes APIs. After grasping these fundamentals, you'll move on to using Spark Streaming APIs to consume data in real time from TCP sockets, and integrate Amazon Web Services (AWS) for stream consumption. By the end of this book, you'll not only have understood how to use machine learning extensions and structured streams but you'll also be able to apply Spark in your own upcoming big data projects. What you will learn Write your own Python programs that can interact with Spark Implement data stream consumption using Apache Spark Recognize common operations in Spark to process known data streams Integrate Spark streaming with Amazon Web Services (AWS) Create a collaborative filtering model with the movielens dataset Apply processed data streams to Spark machine learning APIs Who this book is forData Processing with Apache Spark is for you if you are a software engineer, architect, or IT professional who wants to explore distributed systems and big data analytics. Although you don't need any knowledge of Spark, prior experience of working with Python is recommended.

General

Imprint: Packt Publishing Limited
Country of origin: United Kingdom
Release date: October 2018
Authors: Manuel Ignacio Franco Galeano
Dimensions: 93 x 75mm (L x W)
Format: Paperback
Pages: 142
ISBN-13: 978-1-78980-881-0
Categories: Books > Computing & IT > Computer programming > General
Books > Computing & IT > Social & legal aspects of computing > Human-computer interaction
Books > Computing & IT > Applications of computing > Databases > Data capture & analysis
Promotions
LSN: 1-78980-881-2
Barcode: 9781789808810

Is the information for this product incomplete, wrong or inappropriate? Let us know about it.

Does this product have an incorrect or missing image? Send us a new image.

Is this product missing categories? Add more categories.

Review This Product

No reviews yet - be the first to create one!

You might also like..

Problem Solving with C++ - Global…
Walter Savitch Paperback R2,189 R1,762 Discovery Miles 17 620
Programming Logic & Design
Joyce Farrell Paperback R757 Discovery Miles 7 570
C++ Programming - Program Design…
D. Malik Paperback R1,646 R1,523 Discovery Miles 15 230
Program Construction - Calculating…
Roland Backhouse Paperback R2,460 Discovery Miles 24 600
Programming Logic & Design…
Joyce Farrell Paperback R1,256 R1,170 Discovery Miles 11 700
Hardware Accelerator Systems for…
Shiho Kim, Ganesh Chandra Deka Hardcover R3,950 Discovery Miles 39 500
Dark Silicon and Future On-chip Systems…
Suyel Namasudra, Hamid Sarbazi-Azad Hardcover R3,940 Discovery Miles 39 400
Temporal Data Mining via Unsupervised…
Yun Yang Paperback R1,173 Discovery Miles 11 730
Creativity in Computing and DataFlow…
Suyel Namasudra, Veljko Milutinovic Hardcover R4,204 Discovery Miles 42 040
News Search, Blogs and Feeds - A Toolkit
Lars Vage, Lars Iselid Paperback R1,332 Discovery Miles 13 320
Microcontroller Projects in C for the…
Dogan Ibrahim Paperback R1,455 Discovery Miles 14 550
Essential Java for Scientists and…
Brian Hahn, Katherine Malan Paperback R1,266 Discovery Miles 12 660

See more

Partners