|
|
Books > Computing & IT > Computer software packages > Other software packages > Mathematical & statistical software
Solve real-world statistical problems using the most popular R
packages and techniques Key Features Learn how to apply statistical
methods to your everyday research with handy recipes Foster your
analytical skills and interpret research across industries and
business verticals Perform t-tests, chi-squared tests, and
regression analysis using modern statistical techniques Book
DescriptionR is a popular programming language for developing
statistical software. This book will be a useful guide to solving
common and not-so-common challenges in statistics. With this book,
you'll be equipped to confidently perform essential statistical
procedures across your organization with the help of cutting-edge
statistical tools. You'll start by implementing data modeling, data
analysis, and machine learning to solve real-world problems. You'll
then understand how to work with nonparametric methods, mixed
effects models, and hidden Markov models. This book contains
recipes that will guide you in performing univariate and
multivariate hypothesis tests, several regression techniques, and
using robust techniques to minimize the impact of outliers in
data.You'll also learn how to use the caret package for performing
machine learning in R. Furthermore, this book will help you
understand how to interpret charts and plots to get insights for
better decision making. By the end of this book, you will be able
to apply your skills to statistical computations using R 3.5. You
will also become well-versed with a wide array of statistical
techniques in R that are extensively used in the data science
industry. What you will learn Become well versed with recipes that
will help you interpret plots with R Formulate advanced statistical
models in R to understand its concepts Perform Bayesian regression
to predict models and input missing data Use time series analysis
for modelling and forecasting temporal data Implement a range of
regression techniques for efficient data modelling Get to grips
with robust statistics and hidden Markov models Explore ANOVA
(Analysis of Variance) and perform hypothesis testing Who this book
is forIf you are a quantitative researcher, statistician, data
analyst, or data scientist looking to tackle various challenges in
statistics, this book is what you need! Proficiency in R
programming and basic knowledge of linear algebra is necessary to
follow along the recipes covered in this book.
Design clever algorithms that discover hidden patterns and draw
responses from unstructured, unlabeled data. Key Features Build
state-of-the-art algorithms that can solve your business' problems
Learn how to find hidden patterns in your data Revise key concepts
with hands-on exercises using real-world datasets Book
DescriptionStarting with the basics, Applied Unsupervised Learning
with R explains clustering methods, distribution analysis, data
encoders, and features of R that enable you to understand your data
better and get answers to your most pressing business questions.
This book begins with the most important and commonly used method
for unsupervised learning - clustering - and explains the three
main clustering algorithms - k-means, divisive, and agglomerative.
Following this, you'll study market basket analysis, kernel density
estimation, principal component analysis, and anomaly detection.
You'll be introduced to these methods using code written in R, with
further instructions on how to work with, edit, and improve R code.
To help you gain a practical understanding, the book also features
useful tips on applying these methods to real business problems,
including market segmentation and fraud detection. By working
through interesting activities, you'll explore data encoders and
latent variable models. By the end of this book, you will have a
better understanding of different anomaly detection methods, such
as outlier detection, Mahalanobis distances, and contextual and
collective anomaly detection. What you will learn Implement
clustering methods such as k-means, agglomerative, and divisive
Write code in R to analyze market segmentation and consumer
behavior Estimate distribution and probabilities of different
outcomes Implement dimension reduction using principal component
analysis Apply anomaly detection methods to identify fraud Design
algorithms with R and learn how to edit or improve code Who this
book is forApplied Unsupervised Learning with R is designed for
business professionals who want to learn about methods to
understand their data better, and developers who have an interest
in unsupervised learning. Although the book is for beginners, it
will be beneficial to have some basic, beginner-level familiarity
with R. This includes an understanding of how to open the R
console, how to read data, and how to create a loop. To easily
understand the concepts of this book, you should also know basic
mathematical concepts, including exponents, square roots, means,
and medians.
Applied Predictive Modeling covers the overall predictive modeling
process, beginning with the crucial steps of data preprocessing,
data splitting and foundations of model tuning. The text then
provides intuitive explanations of numerous common and modern
regression and classification techniques, always with an emphasis
on illustrating and solving real data problems. The text
illustrates all parts of the modeling process through many
hands-on, real-life examples, and every chapter contains extensive
R code for each step of the process. This multi-purpose text can be
used as an introduction to predictive models and the overall
modeling process, a practitioner's reference handbook, or as a text
for advanced undergraduate or graduate level predictive modeling
courses. To that end, each chapter contains problem sets to help
solidify the covered concepts and uses data available in the book's
R package. This text is intended for a broad audience as both an
introduction to predictive models as well as a guide to applying
them. Non-mathematical readers will appreciate the intuitive
explanations of the techniques while an emphasis on problem-solving
with real data across a wide variety of applications will aid
practitioners who wish to extend their expertise. Readers should
have knowledge of basic statistical ideas, such as correlation and
linear regression analysis. While the text is biased against
complex equations, a mathematical background is needed for advanced
topics.
Bayesian modeling with PyMC3 and exploratory analysis of Bayesian
models with ArviZ Key Features A step-by-step guide to conduct
Bayesian data analyses using PyMC3 and ArviZ A modern, practical
and computational approach to Bayesian statistical modeling A
tutorial for Bayesian analysis and best practices with the help of
sample problems and practice exercises. Book DescriptionThe second
edition of Bayesian Analysis with Python is an introduction to the
main concepts of applied Bayesian inference and its practical
implementation in Python using PyMC3, a state-of-the-art
probabilistic programming library, and ArviZ, a new library for
exploratory analysis of Bayesian models. The main concepts of
Bayesian statistics are covered using a practical and computational
approach. Synthetic and real data sets are used to introduce
several types of models, such as generalized linear models for
regression and classification, mixture models, hierarchical models,
and Gaussian processes, among others. By the end of the book, you
will have a working knowledge of probabilistic modeling and you
will be able to design and implement Bayesian models for your own
data science problems. After reading the book you will be better
prepared to delve into more advanced material or specialized
statistical modeling if you need to. What you will learn Build
probabilistic models using the Python library PyMC3 Analyze
probabilistic models with the help of ArviZ Acquire the skills
required to sanity check models and modify them if necessary
Understand the advantages and caveats of hierarchical models Find
out how different models can be used to answer different data
analysis questions Compare models and choose between alternative
ones Discover how different models are unified from a probabilistic
perspective Think probabilistically and benefit from the
flexibility of the Bayesian framework Who this book is forIf you
are a student, data scientist, researcher, or a developer looking
to get started with Bayesian data analysis and probabilistic
programming, this book is for you. The book is introductory so no
previous statistical knowledge is required, although some
experience in using Python and NumPy is expected.
Sentiment analysis research has been started long back and recently
it is one of the demanding research topics. Research activities on
Sentiment Analysis in natural language texts and other media are
gaining ground with full swing. But, till date, no concise set of
factors has been yet defined that really affects how writers'
sentiment i.e., broadly human sentiment is expressed, perceived,
recognized, processed, and interpreted in natural languages. The
existing reported solutions or the available systems are still far
from perfect or fail to meet the satisfaction level of the end
users. The reasons may be that there are dozens of conceptual rules
that govern sentiment and even there are possibly unlimited clues
that can convey these concepts from realization to practical
implementation. Therefore, the main aim of this book is to provide
a feasible research platform to our ambitious researchers towards
developing the practical solutions that will be indeed beneficial
for our society, business and future researches as well.
Learn, by example, the fundamentals of data analysis as well as
several intermediate to advanced methods and techniques ranging
from classification and regression to Bayesian methods and MCMC,
which can be put to immediate use. Key Features Analyze your data
using R - the most powerful statistical programming language Learn
how to implement applied statistics using practical use-cases Use
popular R packages to work with unstructured and structured data
Book DescriptionFrequently the tool of choice for academics, R has
spread deep into the private sector and can be found in the
production pipelines at some of the most advanced and successful
enterprises. The power and domain-specificity of R allows the user
to express complex analytics easily, quickly, and succinctly.
Starting with the basics of R and statistical reasoning, this book
dives into advanced predictive analytics, showing how to apply
those techniques to real-world data though with real-world
examples. Packed with engaging problems and exercises, this book
begins with a review of R and its syntax with packages like Rcpp,
ggplot2, and dplyr. From there, get to grips with the fundamentals
of applied statistics and build on this knowledge to perform
sophisticated and powerful analytics. Solve the difficulties
relating to performing data analysis in practice and find solutions
to working with messy data, large data, communicating results, and
facilitating reproducibility. This book is engineered to be an
invaluable resource through many stages of anyone's career as a
data analyst. What you will learn Gain a thorough understanding of
statistical reasoning and sampling theory Employ hypothesis testing
to draw inferences from your data Learn Bayesian methods for
estimating parameters Train regression, classification, and time
series models Handle missing data gracefully using multiple
imputation Identify and manage problematic data points Learn how to
scale your analyses to larger data with Rcpp, data.table, dplyr,
and parallelization Put best practices into effect to make your job
easier and facilitate reproducibility Who this book is forBudding
data scientists and data analysts who are new to the concept of
data analysis, or who want to build efficient analytical models in
R will find this book to be useful. No prior exposure to data
analysis is needed, although a fundamental understanding of the R
programming language is required to get the best out of this book.
Data Mining with R: Learning with Case Studies, Second Edition uses
practical examples to illustrate the power of R and data mining.
Providing an extensive update to the best-selling first edition,
this new edition is divided into two parts. The first part will
feature introductory material, including a new chapter that
provides an introduction to data mining, to complement the already
existing introduction to R. The second part includes case studies,
and the new edition strongly revises the R code of the case studies
making it more up-to-date with recent packages that have emerged in
R. The book does not assume any prior knowledge about R. Readers
who are new to R and data mining should be able to follow the case
studies, and they are designed to be self-contained so the reader
can start anywhere in the document. The book is accompanied by a
set of freely available R source files that can be obtained at the
book's web site. These files include all the code used in the case
studies, and they facilitate the "do-it-yourself" approach followed
in the book. Designed for users of data analysis tools, as well as
researchers and developers, the book should be useful for anyone
interested in entering the "world" of R and data mining. About the
Author Luis Torgo is an associate professor in the Department of
Computer Science at the University of Porto in Portugal. He teaches
Data Mining in R in the NYU Stern School of Business' MS in
Business Analytics program. An active researcher in machine
learning and data mining for more than 20 years, Dr. Torgo is also
a researcher in the Laboratory of Artificial Intelligence and Data
Analysis (LIAAD) of INESC Porto LA.
|
|