|
Books > Computing & IT > Applications of computing > Databases > Data capture & analysis
Sharpen your data visualization skills with Tableau 10 Bootcamp.
About This Book * Make informed decisions using powerful
visualizations in Tableau * Learn effective data storytelling to
transform how your business uses ideas * Use this extensive
bootcamp that makes you an efficient Tableau user in a short span
of time Who This Book Is For This book caters to business, data,
and analytics professionals who want to build rich interactive
visualizations using Tableau Desktop. Familiarity with previous
versions of Tableau will be helpful, but not necessary. What You
Will Learn * Complete practical Tableau tasks with each chapter *
Build different types of charts in Tableau with ease * Extend data
using calculated fields and parameters * Prepare and refine data
for analysis * Create engaging and interactive dashboards * Present
data effectively using story points In Detail Tableau is a leading
visual analytics software that can uncover insights for better and
smarter decision-making. Tableau has an uncanny ability to beautify
your data, compared to other BI tools, which makes it an ideal
choice for performing fast and easy visual analysis. A military
camp style fast-paced learning book that builds your understanding
of Tableau 10 in no time. This day based learning guide contains
the best elements from two of our published books, Learning Tableau
10 - Second Edition and Tableau 10 Business Intelligence Cookbook,
and delivers practical, learning modules in manageable chunks. Each
chunk is delivered in a "day", and each "day" is a productive day.
Each day builds your competency in Tableau. You will increase your
competence in integrating analytics and forecasting to enhance data
analysis during the course of this Bootcamp. Each chapter presents
core concepts and key takeaways about a topic in Tableau and
provides a series of hands-on exercises. In addition to these
exercises, at the end of the chapter, you will find self-check
quizzes and extra drills to challenge you, to take what you learned
to the next level. To summarize, this book will equip you with
step-by-step instructions through rigorous tasks, practical
callouts, and various real-world examples and assignments to
reinforce your understanding of Tableau 10. Style and approach A
fast paced book filled with highly-effective real-world examples to
help you build new things and help you in solving problems in newer
and unseen ways.
A fast paced guide that will help you to create, read, update and
delete data using MongoDB Key Features Create secure databases with
MongoDB Manipulate and maintain your database Model and use data in
a No SQL environment with MongoDB Book DescriptionMongoDB has grown
to become the de facto NoSQL database with millions of users, from
small start-ups to Fortune 500 companies. It can solve problems
that are considered difficult, if not impossible, for aging RDBMS
technologies. Written for version 4 of MongoDB, this book is the
easiest way to get started with MongoDB. You will start by getting
a MongoDB installation up and running in a safe and secure manner.
You will learn how to perform mission-critical create, read,
update, and delete operations, and set up database security. You
will also learn about advanced features of MongoDB such as the
aggregation pipeline, replication, and sharding. You will learn how
to build a simple web application that uses MongoDB to respond to
AJAX queries, and see how to make use of the MongoDB programming
language driver for PHP. The examples incorporate new features
available in MongoDB version 4 where appropriate. What you will
learn Get a standard MongoDB database up and running quickly
Perform simple CRUD operations on the database using the MongoDB
command shell Set up a simple aggregation pipeline to return
subsets of data grouped, sorted, and filtered Safeguard your data
via replication and handle massive amounts of data via sharding
Publish data from a web form to the database using a program
language driver Explore the basic CRUD operations performed using
the PHP MongoDB driver Who this book is forWeb developers, IT
professionals and Database Administrators (DBAs) who want to learn
how to create and manage MongoDB databases.
Getting started with data science doesn't have to be an uphill
battle. This step-by-step guide is ideal for beginners who know a
little Python and are looking for a quick, fast-paced introduction.
Key Features Get up and running with the Jupyter ecosystem and some
example datasets Learn about key machine learning concepts like
SVM, KNN classifiers and Random Forests Discover how you can use
web scraping to gather and parse your own bespoke datasets Book
DescriptionGet to grips with the skills you need for entry-level
data science in this hands-on Python and Jupyter course. You'll
learn about some of the most commonly used libraries that are part
of the Anaconda distribution, and then explore machine learning
models with real datasets to give you the skills and exposure you
need for the real world. We'll finish up by showing you how easy it
can be to scrape and gather your own data from the open web, so
that you can apply your new skills in an actionable context. What
you will learn Get up and running with the Jupyter ecosystem and
some example datasets Learn about key machine learning concepts
like SVM, KNN classifiers, and Random Forests Plan a machine
learning classification strategy and train classification, models
Use validation curves and dimensionality reduction to tune and
enhance your models Discover how you can use web scraping to gather
and parse your own bespoke datasets Scrape tabular data from web
pages and transform them into Pandas DataFrames Create interactive,
web-friendly visualizations to clearly communicate your findings
Who this book is forThis book is ideal for professionals with a
variety of job descriptions across large range of industries, given
the rising popularity and accessibility of data science. You'll
need some prior experience with Python, with any prior work with
libraries like Pandas, Matplotlib and Pandas providing you a useful
head start.
Develop, deploy, and streamline your data science projects with the
most popular end-to-end platform, Anaconda Key Features -Use
Anaconda to find solutions for clustering, classification, and
linear regression -Analyze your data efficiently with the most
powerful data science stack -Use the Anaconda cloud to store,
share, and discover projects and libraries Book DescriptionAnaconda
is an open source platform that brings together the best tools for
data science professionals with more than 100 popular packages
supporting Python, Scala, and R languages. Hands-On Data Science
with Anaconda gets you started with Anaconda and demonstrates how
you can use it to perform data science operations in the real
world. The book begins with setting up the environment for Anaconda
platform in order to make it accessible for tools and frameworks
such as Jupyter, pandas, matplotlib, Python, R, Julia, and more.
You'll walk through package manager Conda, through which you can
automatically manage all packages including cross-language
dependencies, and work across Linux, macOS, and Windows. You'll
explore all the essentials of data science and linear algebra to
perform data science tasks using packages such as SciPy,
contrastive, scikit-learn, Rattle, and Rmixmod. Once you're
accustomed to all this, you'll start with operations in data
science such as cleaning, sorting, and data classification. You'll
move on to learning how to perform tasks such as clustering,
regression, prediction, and building machine learning models and
optimizing them. In addition to this, you'll learn how to visualize
data using the packages available for Julia, Python, and R. What
you will learn Perform cleaning, sorting, classification,
clustering, regression, and dataset modeling using Anaconda Use the
package manager conda and discover, install, and use functionally
efficient and scalable packages Get comfortable with heterogeneous
data exploration using multiple languages within a project Perform
distributed computing and use Anaconda Accelerate to optimize
computational powers Discover and share packages, notebooks, and
environments, and use shared project drives on Anaconda Cloud
Tackle advanced data prediction problems Who this book is
forHands-On Data Science with Anaconda is for you if you are a
developer who is looking for the best tools in the market to
perform data science. It's also ideal for data analysts and data
science professionals who want to improve the efficiency of their
data science applications by using the best libraries in multiple
languages. Basic programming knowledge with R or Python and
introductory knowledge of linear algebra is expected.
Become the master player of data exploration by creating
reproducible data processing pipelines, visualizations, and
prediction models for your applications. Key Features Get up and
running with the Jupyter ecosystem and some example datasets Learn
about key machine learning concepts such as SVM, KNN classifiers,
and Random Forests Discover how you can use web scraping to gather
and parse your own bespoke datasets Book DescriptionGetting started
with data science doesn't have to be an uphill battle. Applied Data
Science with Python and Jupyter is a step-by-step guide ideal for
beginners who know a little Python and are looking for a quick,
fast-paced introduction to these concepts. In this book, you'll
learn every aspect of the standard data workflow process, including
collecting, cleaning, investigating, visualizing, and modeling
data. You'll start with the basics of Jupyter, which will be the
backbone of the book. After familiarizing ourselves with its
standard features, you'll look at an example of it in practice with
our first analysis. In the next lesson, you dive right into
predictive analytics, where multiple classification algorithms are
implemented. Finally, the book ends by looking at data collection
techniques. You'll see how web data can be acquired with scraping
techniques and via APIs, and then briefly explore interactive
visualizations. What you will learn Get up and running with the
Jupyter ecosystem Identify potential areas of investigation and
perform exploratory data analysis Plan a machine learning
classification strategy and train classification models Use
validation curves and dimensionality reduction to tune and enhance
your models Scrape tabular data from web pages and transform it
into Pandas DataFrames Create interactive, web-friendly
visualizations to clearly communicate your findings Who this book
is forApplied Data Science with Python and Jupyter is ideal for
professionals with a variety of job descriptions across a large
range of industries, given the rising popularity and accessibility
of data science. You'll need some prior experience with Python,
with any prior work with libraries such as Pandas, Matplotlib, and
Pandas providing you a useful head start.
Build effective regression models in R to extract valuable insights
from real data Key Features Implement different regression analysis
techniques to solve common problems in data science - from data
exploration to dealing with missing values From Simple Linear
Regression to Logistic Regression - this book covers all regression
techniques and their implementation in R A complete guide to
building effective regression models in R and interpreting results
from them to make valuable predictions Book DescriptionRegression
analysis is a statistical process which enables prediction of
relationships between variables. The predictions are based on the
casual effect of one variable upon another. Regression techniques
for modeling and analyzing are employed on large set of data in
order to reveal hidden relationship among the variables. This book
will give you a rundown explaining what regression analysis is,
explaining you the process from scratch. The first few chapters
give an understanding of what the different types of learning are -
supervised and unsupervised, how these learnings differ from each
other. We then move to covering the supervised learning in details
covering the various aspects of regression analysis. The outline of
chapters are arranged in a way that gives a feel of all the steps
covered in a data science process - loading the training dataset,
handling missing values, EDA on the dataset, transformations and
feature engineering, model building, assessing the model fitting
and performance, and finally making predictions on unseen datasets.
Each chapter starts with explaining the theoretical concepts and
once the reader gets comfortable with the theory, we move to the
practical examples to support the understanding. The practical
examples are illustrated using R code including the different
packages in R such as R Stats, Caret and so on. Each chapter is a
mix of theory and practical examples. By the end of this book you
will know all the concepts and pain-points related to regression
analysis, and you will be able to implement your learning in your
projects. What you will learn Get started with the journey of data
science using Simple linear regression Deal with interaction,
collinearity and other problems using multiple linear regression
Understand diagnostics and what to do if the assumptions fail with
proper analysis Load your dataset, treat missing values, and plot
relationships with exploratory data analysis Develop a perfect
model keeping overfitting, under-fitting, and cross-validation into
consideration Deal with classification problems by applying
Logistic regression Explore other regression techniques - Decision
trees, Bagging, and Boosting techniques Learn by getting it all in
action with the help of a real world case study. Who this book is
forThis book is intended for budding data scientists and data
analysts who want to implement regression analysis techniques using
R. If you are interested in statistics, data science, machine
learning and wants to get an easy introduction to the topic, then
this book is what you need! Basic understanding of statistics and
math will help you to get the most out of the book. Some
programming experience with R will also be helpful
Gain useful insights from your data using popular data science
tools Key Features A one-stop guide to Python libraries such as
pandas and NumPy Comprehensive coverage of data science operations
such as data cleaning and data manipulation Choose scalable
learning algorithms for your data science tasks Book
DescriptionFully expanded and upgraded, the latest edition of
Python Data Science Essentials will help you succeed in data
science operations using the most common Python libraries. This
book offers up-to-date insight into the core of Python, including
the latest versions of the Jupyter Notebook, NumPy, pandas, and
scikit-learn. The book covers detailed examples and large hybrid
datasets to help you grasp essential statistical techniques for
data collection, data munging and analysis, visualization, and
reporting activities. You will also gain an understanding of
advanced data science topics such as machine learning algorithms,
distributed computing, tuning predictive models, and natural
language processing. Furthermore, You'll also be introduced to deep
learning and gradient boosting solutions such as XGBoost, LightGBM,
and CatBoost. By the end of the book, you will have gained a
complete overview of the principal machine learning algorithms,
graph analysis techniques, and all the visualization and deployment
instruments that make it easier to present your results to an
audience of both data science experts and business users What you
will learn Set up your data science toolbox on Windows, Mac, and
Linux Use the core machine learning methods offered by the
scikit-learn library Manipulate, fix, and explore data to solve
data science problems Learn advanced explorative and manipulative
techniques to solve data operations Optimize your machine learning
models for optimized performance Explore and cluster graphs, taking
advantage of interconnections and links in your data Who this book
is forIf you're a data science entrant, data analyst, or data
engineer, this book will help you get ready to tackle real-world
data science problems without wasting any time. Basic knowledge of
probability/statistics and Python coding experience will assist you
in understanding the concepts covered in this book.
A comprehensive guide to making machine data accessible across the
organization using advanced dashboards Key Features Enrich
machine-generated data and transform it into useful, meaningful
insights Perform search operations and configurations, build
dashboards, and manage logs Extend Splunk services with scripts and
advanced configurations to process optimal results Book
DescriptionSplunk is the leading platform that fosters an efficient
methodology and delivers ways to search, monitor, and analyze
growing amounts of big data. This book will allow you to implement
new services and utilize them to quickly and efficiently process
machine-generated big data. We introduce you to all the new
features, improvements, and offerings of Splunk 7. We cover the new
modules of Splunk: Splunk Cloud and the Machine Learning Toolkit to
ease data usage. Furthermore, you will learn to use search terms
effectively with Boolean and grouping operators. You will learn not
only how to modify your search to make your searches fast but also
how to use wildcards efficiently. Later you will learn how to use
stats to aggregate values, a chart to turn data, and a time chart
to show values over time; you'll also work with fields and chart
enhancements and learn how to create a data model with faster data
model acceleration. Once this is done, you will learn about XML
Dashboards, working with apps, building advanced dashboards,
configuring and extending Splunk, advanced deployments, and more.
Finally, we teach you how to use the Machine Learning Toolkit and
best practices and tips to help you implement Splunk services
effectively and efficiently. By the end of this book, you will have
learned about the Splunk software as a whole and implemented Splunk
services in your tasks at projects What you will learn Focus on the
new features of the latest version of Splunk Enterprise 7 Master
the new offerings in Splunk: Splunk Cloud and the Machine Learning
Toolkit Create efficient and effective searches within the
organization Master the use of Splunk tables, charts, and graph
enhancements Use Splunk data models and pivots with faster data
model acceleration Master all aspects of Splunk XML dashboards with
hands-on applications Create and deploy advanced Splunk dashboards
to share valuable business insights with peers Who this book is
forThis book is intended for data analysts, business analysts, and
IT administrators who want to make the best use of big data,
operational intelligence, log management, and monitoring within
their organization. Some knowledge of Splunk services will help you
get the most out of the book
Make sense of your data and predict the unpredictable About This
Book * A unique book that centers around develop six key practical
skills needed to develop and implement predictive analytics * Apply
the principles and techniques of predictive analytics to
effectively interpret big data * Solve real-world analytical
problems with the help of practical case studies and real-world
scenarios taken from the world of healthcare, marketing, and other
business domains Who This Book Is For This book is for those with a
mathematical/statistics background who wish to understand the
concepts, techniques, and implementation of predictive analytics to
resolve complex analytical issues. Basic familiarity with a
programming language of R is expected. What You Will Learn * Master
the core predictive analytics algorithm which are used today in
business * Learn to implement the six steps for a successful
analytics project * Classify the right algorithm for your
requirements * Use and apply predictive analytics to research
problems in healthcare * Implement predictive analytics to retain
and acquire your customers * Use text mining to understand
unstructured data * Develop models on your own PC or in
Spark/Hadoop environments * Implement predictive analytics products
for customers In Detail This is the go-to book for anyone
interested in the steps needed to develop predictive analytics
solutions with examples from the world of marketing, healthcare,
and retail. We'll get started with a brief history of predictive
analytics and learn about different roles and functions people play
within a predictive analytics project. Then, we will learn about
various ways of installing R along with their pros and cons,
combined with a step-by-step installation of RStudio, and a
description of the best practices for organizing your projects. On
completing the installation, we will begin to acquire the skills
necessary to input, clean, and prepare your data for modeling. We
will learn the six specific steps needed to implement and
successfully deploy a predictive model starting from asking the
right questions through model development and ending with deploying
your predictive model into production. We will learn why
collaboration is important and how agile iterative modeling cycles
can increase your chances of developing and deploying the best
successful model. We will continue your journey in the cloud by
extending your skill set by learning about Databricks and SparkR,
which allow you to develop predictive models on vast gigabytes of
data. Style and Approach This book takes a practical hands-on
approach wherein the algorithms will be explained with the help of
real-world use cases. It is written in a well-researched academic
style which is a great mix of theoretical and practical
information. Code examples are supplied for both theoretical
concepts as well as for the case studies. Key references and
summaries will be provided at the end of each chapter so that you
can explore those topics on their own.
Enhance your data analysis and predictive modeling skills using
popular Python tools Key Features Cover all fundamental libraries
for operation and manipulation of Python for data analysis
Implement real-world datasets to perform predictive analytics with
Python Access modern data analysis techniques and detailed code
with scikit-learn and SciPy Book DescriptionPython is one of the
most common and popular languages preferred by leading data
analysts and statisticians for working with massive datasets and
complex data visualizations. Become a Python Data Analyst
introduces Python's most essential tools and libraries necessary to
work with the data analysis process, right from preparing data to
performing simple statistical analyses and creating meaningful data
visualizations. In this book, we will cover Python libraries such
as NumPy, pandas, matplotlib, seaborn, SciPy, and scikit-learn, and
apply them in practical data analysis and statistics examples. As
you make your way through the chapters, you will learn to
efficiently use the Jupyter Notebook to operate and manipulate data
using NumPy and the pandas library. In the concluding chapters, you
will gain experience in building simple predictive models and
carrying out statistical computation and analysis using rich Python
tools and proven data analysis techniques. By the end of this book,
you will have hands-on experience performing data analysis with
Python. What you will learn Explore important Python libraries and
learn to install Anaconda distribution Understand the basics of
NumPy Produce informative and useful visualizations for analyzing
data Perform common statistical calculations Build predictive
models and understand the principles of predictive analytics Who
this book is forBecome a Python Data Analyst is for entry-level
data analysts, data engineers, and BI professionals who want to
make complete use of Python tools for performing efficient data
analysis. Prior knowledge of Python programming is necessary to
understand the concepts covered in this book
Learn to use IPython and Jupyter Notebook for your data analysis
and visualization work. Key Features Leverage the Jupyter Notebook
for interactive data science and visualization Become an expert in
high-performance computing and visualization for data analysis and
scientific modeling A comprehensive coverage of scientific
computing through many hands-on, example-driven recipes with
detailed, step-by-step explanations Book DescriptionPython is one
of the leading open source platforms for data science and numerical
computing. IPython and the associated Jupyter Notebook offer
efficient interfaces to Python for data analysis and interactive
visualization, and they constitute an ideal gateway to the
platform. IPython Interactive Computing and Visualization Cookbook,
Second Edition contains many ready-to-use, focused recipes for
high-performance scientific computing and data analysis, from the
latest IPython/Jupyter features to the most advanced tricks, to
help you write better and faster code. You will apply these
state-of-the-art methods to various real-world examples,
illustrating topics in applied mathematics, scientific modeling,
and machine learning. The first part of the book covers programming
techniques: code quality and reproducibility, code optimization,
high-performance computing through just-in-time compilation,
parallel computing, and graphics card programming. The second part
tackles data science, statistics, machine learning, signal and
image processing, dynamical systems, and pure and applied
mathematics. What you will learn Master all features of the Jupyter
Notebook Code better: write high-quality, readable, and well-tested
programs; profile and optimize your code; and conduct reproducible
interactive computing experiments Visualize data and create
interactive plots in the Jupyter Notebook Write blazingly fast
Python programs with NumPy, ctypes, Numba, Cython, OpenMP, GPU
programming (CUDA), parallel IPython, Dask, and more Analyze data
with Bayesian or frequentist statistics (Pandas, PyMC, and R), and
learn from actual data through machine learning (scikit-learn) Gain
valuable insights into signals, images, and sounds with SciPy,
scikit-image, and OpenCV Simulate deterministic and stochastic
dynamical systems in Python Familiarize yourself with math in
Python using SymPy and Sage: algebra, analysis, logic, graphs,
geometry, and probability theory Who this book is forThis book is
intended for anyone interested in numerical computing and data
science: students, researchers, teachers, engineers, analysts, and
hobbyists. A basic knowledge of Python/NumPy is recommended. Some
skills in mathematics will help you understand the theory behind
the computational methods.
Get to grips with the most popular Python packages that make data
analysis possible Key Features Explore the tools you need to become
a data analyst Discover practical examples to help you grasp data
processing concepts Walk through hierarchical indexing and grouping
for data analysis Book DescriptionPython, a multi-paradigm
programming language, has become the language of choice for data
scientists for visualization, data analysis, and machine learning.
Hands-On Data Analysis with NumPy and Pandas starts by guiding you
in setting up the right environment for data analysis with Python,
along with helping you install the correct Python distribution. In
addition to this, you will work with the Jupyter notebook and set
up a database. Once you have covered Jupyter, you will dig deep
into Python's NumPy package, a powerful extension with advanced
mathematical functions. You will then move on to creating NumPy
arrays and employing different array methods and functions. You
will explore Python's pandas extension which will help you get to
grips with data mining and learn to subset your data. Last but not
the least you will grasp how to manage your datasets by sorting and
ranking them. By the end of this book, you will have learned to
index and group your data for sophisticated data analysis and
manipulation. What you will learn Understand how to install and
manage Anaconda Read, sort, and map data using NumPy and pandas
Find out how to create and slice data arrays using NumPy Discover
how to subset your DataFrames using pandas Handle missing data in a
pandas DataFrame Explore hierarchical indexing and plotting with
pandas Who this book is forHands-On Data Analysis with NumPy and
Pandas is for you if you are a Python developer and want to take
your first steps into the world of data analysis. No previous
experience of data analysis is required to enjoy this book.
A solution-based guide to put your deep learning models into
production with the power of Apache Spark Key Features Discover
practical recipes for distributed deep learning with Apache Spark
Learn to use libraries such as Keras and TensorFlow Solve problems
in order to train your deep learning models on Apache Spark Book
DescriptionWith deep learning gaining rapid mainstream adoption in
modern-day industries, organizations are looking for ways to unite
popular big data tools with highly efficient deep learning
libraries. As a result, this will help deep learning models train
with higher efficiency and speed. With the help of the Apache Spark
Deep Learning Cookbook, you'll work through specific recipes to
generate outcomes for deep learning algorithms, without getting
bogged down in theory. From setting up Apache Spark for deep
learning to implementing types of neural net, this book tackles
both common and not so common problems to perform deep learning on
a distributed environment. In addition to this, you'll get access
to deep learning code within Spark that can be reused to answer
similar problems or tweaked to answer slightly different problems.
You will also learn how to stream and cluster your data with Spark.
Once you have got to grips with the basics, you'll explore how to
implement and deploy deep learning models, such as Convolutional
Neural Networks (CNN) and Recurrent Neural Networks (RNN) in Spark,
using popular libraries such as TensorFlow and Keras. By the end of
the book, you'll have the expertise to train and deploy efficient
deep learning models on Apache Spark. What you will learn Set up a
fully functional Spark environment Understand practical machine
learning and deep learning concepts Apply built-in machine learning
libraries within Spark Explore libraries that are compatible with
TensorFlow and Keras Explore NLP models such as Word2vec and TF-IDF
on Spark Organize dataframes for deep learning evaluation Apply
testing and training modeling to ensure accuracy Access readily
available code that may be reusable Who this book is forIf you're
looking for a practical and highly useful resource for implementing
efficiently distributed deep learning models with Apache Spark,
then the Apache Spark Deep Learning Cookbook is for you. Knowledge
of the core machine learning concepts and a basic understanding of
the Apache Spark framework is required to get the best out of this
book. Additionally, some programming knowledge in Python is a plus.
Enhance the power of NumPy and start boosting your scientific
computing capabilities Key Features Grasp all aspects of numerical
computing and understand NumPy Explore examples to learn
exploratory data analysis (EDA), regression, and clustering Access
NumPy libraries and use performance benchmarking to select the
right tool Book DescriptionNumPy is one of the most important
scientific computing libraries available for Python. Mastering
Numerical Computing with NumPy teaches you how to achieve expert
level competency to perform complex operations, with in-depth
coverage of advanced concepts. Beginning with NumPy's arrays and
functions, you will familiarize yourself with linear algebra
concepts to perform vector and matrix math operations. You will
thoroughly understand and practice data processing, exploratory
data analysis (EDA), and predictive modeling. You will then move on
to working on practical examples which will teach you how to use
NumPy statistics in order to explore US housing data and develop a
predictive model using simple and multiple linear regression
techniques. Once you have got to grips with the basics, you will
explore unsupervised learning and clustering algorithms, followed
by understanding how to write better NumPy code while keeping
advanced considerations in mind. The book also demonstrates the use
of different high-performance numerical computing libraries and
their relationship with NumPy. You will study how to benchmark the
performance of different configurations and choose the best for
your system. By the end of this book, you will have become an
expert in handling and performing complex data manipulations. What
you will learn Perform vector and matrix operations using NumPy
Perform exploratory data analysis (EDA) on US housing data Develop
a predictive model using simple and multiple linear regression
Understand unsupervised learning and clustering algorithms with
practical use cases Write better NumPy code and implement the
algorithms from scratch Perform benchmark tests to choose the best
configuration for your system Who this book is forMastering
Numerical Computing with NumPy is for you if you are a Python
programmer, data analyst, data engineer, or a data science
enthusiast, who wants to master the intricacies of NumPy and build
solutions for your numeric and scientific computational problems.
You are expected to have familiarity with mathematics to get the
most out of this book.
From data to actionable business insights using Amazon QuickSight!
About This Book * A practical hands-on guide to improving your
business with the power of BI and Quicksight * Immerse yourself
with an end-to-end journey for effective analytics using QuickSight
and related services * Packed with real-world examples with
Solution Architectures needed for a cloud-powered Business
Intelligence service Who This Book Is For This book is for Business
Intelligence architects, BI developers, Big Data architects, and IT
executives who are looking to modernize their business intelligence
architecture and deliver a fast, easy-to-use, cloud powered
business intelligence service. What You Will Learn * Steps to test
drive QuickSight and see how it fits in AWS big data eco system *
Load data from various sources such as S3, RDS, Redshift, Athena,
and SalesForce and visualize using QuickSight * Understand how to
prepare data using QuickSight without the need of an IT developer *
Build interactive charts, reports, dashboards, and storyboards
using QuickSight * Access QuickSight using the mobile application *
Architect and design for AWS Data Lake Solution, leveraging AWS
hosted services * Build a big data project with step-by-step
instructions for data collection, cataloguing, and analysis *
Secure your data used for QuickSight from S3, RedShift, and RDS
instances * Manage users, access controls, and SPICE capacity In
Detail Amazon QuickSight is the next-generation Business
Intelligence (BI) cloud service that can help you build interactive
visualizations on top of various data sources hosted on Amazon
Cloud Infrastructure. QuickSight delivers responsive insights into
big data and enables organizations to quickly democratize data
visualizations and scale to hundreds of users at a fraction of the
cost when compared to traditional BI tools. This book begins with
an introduction to Amazon QuickSight, feature differentiators from
traditional BI tools, and how it fits in the overall AWS big data
ecosystem. With practical examples, you will find tips and
techniques to load your data to AWS, prepare it, and finally
visualize it using QuickSight. You will learn how to build
interactive charts, reports, dashboards, and stories using
QuickSight and share with others using just your browser and mobile
app. The book also provides a blueprint to build a real-life big
data project on top of AWS Data Lake Solution and demonstrates how
to build a modern data lake on the cloud with governance, data
catalog, and analysis. It reviews the current product shortcomings,
features in the roadmap, and how to provide feedback to AWS. Grow
your profits, improve your products, and beat your competitors.
Style and approach This book takes a fast-paced, example-driven
approach to demonstrate the power of QuickSight to improve your
business' efficiency. Every chapter is accompanied with a use case
that shows the practical implementation of the step being
explained.
|
|