PySpark Cookbook - Over 60 recipes for implementing big data processing and analytics using Apache Spark and Python (Paperback)

Denny Lee, Tomasz Drabas

Loot Price R1,193 Discovery Miles 11 930 | Repayment Terms: R112 pm x 12*

Expected to ship within 10 - 15 working days

Combine the power of Apache Spark and Python to build effective big data applications

Key Features

Perform effective data processing, machine learning, and analytics using PySpark
Overcome challenges in developing and deploying Spark solutions using Python
Explore recipes for efficiently combining Python and Apache Spark to process data

Book Description

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. The PySpark Cookbook presents effective and time-saving recipes for leveraging the power of Python and putting it to use in the Spark ecosystem. You'll start by learning the Apache Spark architecture and how to set up a Python environment for Spark. You'll then get familiar with the modules available in PySpark and start using them effortlessly. In addition to this, you'll discover how to abstract data with RDDs and DataFrames, and understand the streaming capabilities of PySpark. You'll then move on to using ML and MLlib in order to solve any problems related to the machine learning capabilities of PySpark and use GraphFrames to solve graph-processing problems. Finally, you will explore how to deploy your applications to the cloud using the spark-submit command. By the end of this book, you will be able to use the Python API for Apache Spark to solve any problems associated with building data-intensive applications.

What you will learn

Configure a local instance of PySpark in a virtual environment
Install and configure Jupyter in local and multi-node environments
Create DataFrames from JSON and a dictionary using pyspark.sql
Explore regression and clustering models available in the ML module
Use DataFrames to transform data used for modeling
Connect to PubNub and perform aggregations on streams

Who this book is for

The PySpark Cookbook is for you if you are a Python developer looking for hands-on recipes for using the Apache Spark 2.x ecosystem in the best possible way. A thorough understanding of Python (and some familiarity with Spark) will help you get the best out of the book.

General

Imprint: Packt Publishing Limited
Country of origin: United Kingdom
Release date: June 2018
Authors: Denny Lee • Tomasz Drabas
Dimensions: 93 x 75 x 23mm (L x W x T)
Format: Paperback
Pages: 330
ISBN-13: 978-1-78883-536-7
Categories: Books > Computing & IT > General theory of computing > General
Books > Computing & IT > Applications of computing > Databases > Data capture & analysis
LSN: 1-78883-536-0
Barcode: 9781788835367
