|
Showing 1 - 2 of
2 matches in All Departments
Get the definitive handbook for manipulating, processing, cleaning,
and crunching datasets in Python. Updated for Python 3.10 and
pandas 1.4, the third edition of this hands-on guide is packed with
practical case studies that show you how to solve a broad set of
data analysis problems effectively. You'll learn the latest
versions of pandas, NumPy, and Jupyter in the process. Written by
Wes McKinney, the creator of the Python pandas project, this book
is a practical, modern introduction to data science tools in
Python. It's ideal for analysts new to Python and for Python
programmers new to data science and scientific computing. Data
files and related material are available on GitHub. Use the Jupyter
notebook and IPython shell for exploratory computing Learn basic
and advanced features in NumPy Get started with data analysis tools
in the pandas library Use flexible tools to load, clean, transform,
merge, and reshape data Create informative visualizations with
matplotlib Apply the pandas groupby facility to slice, dice, and
summarize datasets Analyze and manipulate regular and irregular
time series data Learn how to solve real-world data analysis
problems with thorough, detailed examples
Process tabular data and build high-performance query engines on
modern CPUs and GPUs using Apache Arrow, a standardized
language-independent memory format, for optimal performance Key
Features Learn about Apache Arrow's data types and interoperability
with pandas and Parquet Work with Apache Arrow Flight RPC, Compute,
and Dataset APIs to produce and consume tabular data Reviewed,
contributed, and supported by Dremio, the co-creator of Apache
Arrow Book DescriptionApache Arrow is designed to accelerate
analytics and allow the exchange of data across big data systems
easily. In-Memory Analytics with Apache Arrow begins with a quick
overview of the Apache Arrow format, before moving on to helping
you to understand Arrow's versatility and benefits as you walk
through a variety of real-world use cases. You'll cover key tasks
such as enhancing data science workflows with Arrow, using Arrow
and Apache Parquet with Apache Spark and Jupyter for better
performance and hassle-free data translation, as well as working
with Perspective, an open source interactive graphical and tabular
analysis tool for browsers. As you advance, you'll explore the
different data interchange and storage formats and become
well-versed with the relationships between Arrow, Parquet, Feather,
Protobuf, Flatbuffers, JSON, and CSV. In addition to understanding
the basic structure of the Arrow Flight and Flight SQL protocols,
you'll learn about Dremio's usage of Apache Arrow to enhance SQL
analytics and discover how Arrow can be used in web-based browser
apps. Finally, you'll get to grips with the upcoming features of
Arrow to help you stay ahead of the curve. By the end of this book,
you will have all the building blocks to create useful, efficient,
and powerful analytical services and utilities with Apache Arrow.
What you will learn Use Apache Arrow libraries to access data files
both locally and in the cloud Understand the zero-copy elements of
the Apache Arrow format Improve read performance by memory-mapping
files with Apache Arrow Produce or consume Apache Arrow data
efficiently using a C API Use the Apache Arrow Compute APIs to
perform complex operations Create Arrow Flight servers and clients
for transferring data quickly Build the Arrow libraries locally and
contribute back to the community Who this book is forThis book is
for developers, data analysts, and data scientists looking to
explore the capabilities of Apache Arrow from the ground up. This
book will also be useful for any engineers who are working on
building utilities for data analytics and query engines, or
otherwise working with tabular data, regardless of the programming
language. Some familiarity with basic concepts of data analysis
will help you to get the most out of this book but isn't required.
Code examples are provided in the C++, Go, and Python programming
languages.
|
You may like...
Loot
Nadine Gordimer
Paperback
(2)
R205
R168
Discovery Miles 1 680
Ab Wheel
R209
R149
Discovery Miles 1 490
Loot
Nadine Gordimer
Paperback
(2)
R205
R168
Discovery Miles 1 680
|