|
|
Books > Computing & IT > Computer software packages > Other software packages > Mathematical & statistical software
Build Machine Learning models with a sound statistical
understanding. About This Book * Learn about the statistics behind
powerful predictive models with p-value, ANOVA, and F- statistics.
* Implement statistical computations programmatically for
supervised and unsupervised learning through K-means clustering. *
Master the statistical aspect of Machine Learning with the help of
this example-rich guide to R and Python. Who This Book Is For This
book is intended for developers with little to no background in
statistics, who want to implement Machine Learning in their
systems. Some programming knowledge in R or Python will be useful.
What You Will Learn * Understand the Statistical and Machine
Learning fundamentals necessary to build models * Understand the
major differences and parallels between the statistical way and the
Machine Learning way to solve problems * Learn how to prepare data
and feed models by using the appropriate Machine Learning
algorithms from the more-than-adequate R and Python packages *
Analyze the results and tune the model appropriately to your own
predictive goals * Understand the concepts of required statistics
for Machine Learning * Introduce yourself to necessary fundamentals
required for building supervised & unsupervised deep learning
models * Learn reinforcement learning and its application in the
field of artificial intelligence domain In Detail Complex
statistics in Machine Learning worry a lot of developers. Knowing
statistics helps you build strong Machine Learning models that are
optimized for a given problem statement. This book will teach you
all it takes to perform complex statistical computations required
for Machine Learning. You will gain information on statistics
behind supervised learning, unsupervised learning, reinforcement
learning, and more. Understand the real-world examples that discuss
the statistical side of Machine Learning and familiarize yourself
with it. You will also design programs for performing tasks such as
model, parameter fitting, regression, classification, density
collection, and more. By the end of the book, you will have
mastered the required statistics for Machine Learning and will be
able to apply your new skills to any sort of industry problem.
Style and approach This practical, step-by-step guide will give you
an understanding of the Statistical and Machine Learning
fundamentals you'll need to build models.
Over 100 hands-on recipes to effectively solve real-world data
problems using the most popular R packages and techniques About
This Book * Gain insight into how data scientists collect, process,
analyze, and visualize data using some of the most popular R
packages * Understand how to apply useful data analysis techniques
in R for real-world applications * An easy-to-follow guide to make
the life of data scientist easier with the problems faced while
performing data analysis Who This Book Is For This book is for
those who are already familiar with the basic operation of R, but
want to learn how to efficiently and effectively analyze real-world
data problems using practical R packages. What You Will Learn * Get
to know the functional characteristics of R language * Extract,
transform, and load data from heterogeneous sources * Understand
how easily R can confront probability and statistics problems * Get
simple R instructions to quickly organize and manipulate large
datasets * Create professional data visualizations and interactive
reports * Predict user purchase behavior by adopting a
classification approach * Implement data mining techniques to
discover items that are frequently purchased together * Group
similar text documents by using various clustering methods In
Detail This cookbook offers a range of data analysis samples in
simple and straightforward R code, providing step-by-step resources
and time-saving methods to help you solve data problems
efficiently. The first section deals with how to create R functions
to avoid the unnecessary duplication of code. You will learn how to
prepare, process, and perform sophisticated ETL for heterogeneous
data sources with R packages. An example of data manipulation is
provided, illustrating how to use the "dplyr" and "data.table"
packages to efficiently process larger data structures. We also
focus on "ggplot2" and show you how to create advanced figures for
data exploration. In addition, you will learn how to build an
interactive report using the "ggvis" package. Later chapters offer
insight into time series analysis on financial data, while there is
detailed information on the hot topic of machine learning,
including data classification, regression, clustering, association
rule mining, and dimension reduction. By the end of this book, you
will understand how to resolve issues and will be able to
comfortably offer solutions to problems encountered while
performing data analysis. Style and approach This easy-to-follow
guide is full of hands-on examples of data analysis with R. Each
topic is fully explained beginning with the core concept, followed
by step-by-step practical examples, and concluding with detailed
explanations of each concept used.
Partial Least Squares (PLS) is a flexible statistical modeling
technique that applies to data of any shape. It models
relationships between inputs and outputs even when there are more
predictors than observations. Using JMP statistical discovery
software from SAS, Discovering Partial Least Squares with JMP
explores PLS and positions it within the more general context of
multivariate analysis. Ian Cox and Marie Gaudard use a "learning
through doing" style. This approach, coupled with the interactivity
that JMP itself provides, allows you to actively engage with the
content. Four complete case studies are presented, accompanied by
data tables that are available for download. The detailed "how to"
steps, together with the interpretation of the results, help to
make this book unique. Discovering Partial Least Squares with JMP
is of interest to professionals engaged in continuing development,
as well as to students and instructors in a formal academic
setting. The content aligns well with topics covered in
introductory courses on: psychometrics, customer relationship
management, market research, consumer research, environmental
studies, and chemometrics. The book can also function as a
supplement to courses in multivariate statistics, and to courses on
statistical methods in biology, ecology, chemistry, and genomics.
While the book is helpful and instructive to those who are using
JMP, a knowledge of JMP is not required, and little or no prior
statistical knowledge is necessary. By working through the
introductory chapters and the case studies, you gain a deeper
understanding of PLS and learn how to use JMP to perform PLS
analyses in real-world situations. This book motivates current and
potential users of JMP to extend their analytical repertoire by
embracing PLS. Dynamically interacting with JMP, you will develop
confidence as you explore underlying concepts and work through the
examples. The authors provide background and guidance to support
and empower you on this journey.
If you are planning to create data analysis and visualization tools
in the context of science, engineering, economics, or social
science, then this book is for you. With this book, you will become
a visualization expert, in a short time, using Mathematica.
Harness the power of RStudio to create web applications, R
packages, markdown reports and pretty data visualizations About
This Book * Discover the multi-functional use of RStudio to support
your daily work with R code * Learn to create stunning, meaningful,
and interactive graphs and learn to embed them into easy
communicable reports using multiple R packages * Develop your own R
packages and Shiny web apps to share your knowledge and collaborate
with others Who This Book Is For This book is aimed at R developers
and analysts who wish to do R statistical development while taking
advantage of RStudio's functionality to ease their development
efforts. R programming experience is assumed as well as being
comfortable with R's basic structures and a number of functions.
What You Will Learn * Discover the RStudio IDE and details about
the user interface * Communicate your insights with R Markdown in
static and interactive ways * Learn how to use different graphic
systems to visualize your data * Build interactive web applications
with the Shiny framework to present and share your results *
Understand the process of package development and assemble your own
R packages * Easily collaborate with other people on your projects
by using Git and GitHub * Manage the R environment for your
organization with RStudio and Shiny server * Apply your obtained
knowledge about RStudio and R development to create a real-world
dashboard solution In Detail RStudio helps you to manage small to
large projects by giving you a multi-functional integrated
development environment, combined with the power and flexibility of
the R programming language, which is becoming the bridge language
of data science for developers and analyst worldwide. Mastering the
use of RStudio will help you to solve real-world data problems.
This book begins by guiding you through the installation of RStudio
and explaining the user interface step by step. From there, the
next logical step is to use this knowledge to improve your data
analysis workflow. We will do this by building up our toolbox to
create interactive reports and graphs or even web applications with
Shiny. To collaborate with others, we will explore how to use Git
and GitHub with RStudio and how to build your own packages to
ensure top quality results. Finally, we put it all together in an
interactive dashboard written with R. Style and approach An
easy-to-follow guide full of hands-on examples to master RStudio.
Beginning from explaining the basics, each topic is explained with
a lot of details for every feature.
Load, wrangle, and analyze your data using the world's most
powerful statistical programming language About This Book * Load,
manipulate and analyze data from different sources * Gain a deeper
understanding of fundamentals of applied statistics * A practical
guide to performing data analysis in practice Who This Book Is For
Whether you are learning data analysis for the first time, or you
want to deepen the understanding you already have, this book will
prove to an invaluable resource. If you are looking for a book to
bring you all the way through the fundamentals to the application
of advanced and effective analytics methodologies, and have some
prior programming experience and a mathematical background, then
this is for you. What You Will Learn * Navigate the R environment *
Describe and visualize the behavior of data and relationships
between data * Gain a thorough understanding of statistical
reasoning and sampling * Employ hypothesis tests to draw inferences
from your data * Learn Bayesian methods for estimating parameters *
Perform regression to predict continuous variables * Apply powerful
classification methods to predict categorical data * Handle missing
data gracefully using multiple imputation * Identify and manage
problematic data points * Employ parallelization and Rcpp to scale
your analyses to larger data * Put best practices into effect to
make your job easier and facilitate reproducibility In Detail
Frequently the tool of choice for academics, R has spread deep into
the private sector and can be found in the production pipelines at
some of the most advanced and successful enterprises. The power and
domain-specificity of R allows the user to express complex
analytics easily, quickly, and succinctly. With over 7,000 user
contributed packages, it's easy to find support for the latest and
greatest algorithms and techniques. Starting with the basics of R
and statistical reasoning, Data Analysis with R dives into advanced
predictive analytics, showing how to apply those techniques to
real-world data though with real-world examples. Packed with
engaging problems and exercises, this book begins with a review of
R and its syntax. From there, get to grips with the fundamentals of
applied statistics and build on this knowledge to perform
sophisticated and powerful analytics. Solve the difficulties
relating to performing data analysis in practice and find solutions
to working with "messy data", large data, communicating results,
and facilitating reproducibility. This book is engineered to be an
invaluable resource through many stages of anyone's career as a
data analyst. Style and approach Learn data analysis using engaging
examples and fun exercises, and with a gentle and friendly but
comprehensive "learn-by-doing" approach.
|
You may like...
Systems Thinkers
Magnus Ramage, Karen Shipp
Paperback
R1,966
R1,858
Discovery Miles 18 580
|