|
Books > Computing & IT > Computer software packages > Other software packages > Mathematical & statistical software
Learn, by example, the fundamentals of data analysis as well as
several intermediate to advanced methods and techniques ranging
from classification and regression to Bayesian methods and MCMC,
which can be put to immediate use. Key Features Analyze your data
using R - the most powerful statistical programming language Learn
how to implement applied statistics using practical use-cases Use
popular R packages to work with unstructured and structured data
Book DescriptionFrequently the tool of choice for academics, R has
spread deep into the private sector and can be found in the
production pipelines at some of the most advanced and successful
enterprises. The power and domain-specificity of R allows the user
to express complex analytics easily, quickly, and succinctly.
Starting with the basics of R and statistical reasoning, this book
dives into advanced predictive analytics, showing how to apply
those techniques to real-world data though with real-world
examples. Packed with engaging problems and exercises, this book
begins with a review of R and its syntax with packages like Rcpp,
ggplot2, and dplyr. From there, get to grips with the fundamentals
of applied statistics and build on this knowledge to perform
sophisticated and powerful analytics. Solve the difficulties
relating to performing data analysis in practice and find solutions
to working with messy data, large data, communicating results, and
facilitating reproducibility. This book is engineered to be an
invaluable resource through many stages of anyone's career as a
data analyst. What you will learn Gain a thorough understanding of
statistical reasoning and sampling theory Employ hypothesis testing
to draw inferences from your data Learn Bayesian methods for
estimating parameters Train regression, classification, and time
series models Handle missing data gracefully using multiple
imputation Identify and manage problematic data points Learn how to
scale your analyses to larger data with Rcpp, data.table, dplyr,
and parallelization Put best practices into effect to make your job
easier and facilitate reproducibility Who this book is forBudding
data scientists and data analysts who are new to the concept of
data analysis, or who want to build efficient analytical models in
R will find this book to be useful. No prior exposure to data
analysis is needed, although a fundamental understanding of the R
programming language is required to get the best out of this book.
A practical guide to obtaining, transforming, exploring, and
analyzing data using Python, MongoDB, and Apache Spark About This
Book * Learn to use various data analysis tools and algorithms to
classify, cluster, visualize, simulate, and forecast your data *
Apply Machine Learning algorithms to different kinds of data such
as social networks, time series, and images * A hands-on guide to
understanding the nature of data and how to turn it into insight
Who This Book Is For This book is for developers who want to
implement data analysis and data-driven algorithms in a practical
way. It is also suitable for those without a background in data
analysis or data processing. Basic knowledge of Python programming,
statistics, and linear algebra is assumed. What You Will Learn *
Acquire, format, and visualize your data * Build an
image-similarity search engine * Generate meaningful visualizations
anyone can understand * Get started with analyzing social network
graphs * Find out how to implement sentiment text analysis *
Install data analysis tools such as Pandas, MongoDB, and Apache
Spark * Get to grips with Apache Spark * Implement machine learning
algorithms such as classification or forecasting In Detail Beyond
buzzwords like Big Data or Data Science, there are a great
opportunities to innovate in many businesses using data analysis to
get data-driven products. Data analysis involves asking many
questions about data in order to discover insights and generate
value for a product or a service. This book explains the basic data
algorithms without the theoretical jargon, and you'll get hands-on
turning data into insights using machine learning techniques. We
will perform data-driven innovation processing for several types of
data such as text, Images, social network graphs, documents, and
time series, showing you how to implement large data processing
with MongoDB and Apache Spark. Style and approach This is a
hands-on guide to data analysis and data processing. The concrete
examples are explained with simple code and accessible data.
Implement machine learning, time-series analysis, algorithmic
trading and more About This Book * Understand the basics of R and
how they can be applied in various Quantitative Finance scenarios *
Learn various algorithmic trading techniques and ways to optimize
them using the tools available in R. * Contain different methods to
manage risk and explore trading using Machine Learning. Who This
Book Is For If you want to learn how to use R to build quantitative
finance models with ease, this book is for you. Analysts who want
to learn R to solve their quantitative finance problems will also
find this book useful. Some understanding of the basic financial
concepts will be useful, though prior knowledge of R is not
required. What You Will Learn * Get to know the basics of R and how
to use it in the field of Quantitative Finance * Understand data
processing and model building using R * Explore different types of
analytical techniques such as statistical analysis, time-series
analysis, predictive modeling, and econometric analysis * Build and
analyze quantitative finance models using real-world examples * How
real-life examples should be used to develop strategies *
Performance metrics to look into before deciding upon any model *
Deep dive into the vast world of machine-learning based trading *
Get to grips with algorithmic trading and different ways of
optimizing it * Learn about controlling risk parameters of
financial instruments In Detail The role of a quantitative analyst
is very challenging, yet lucrative, so there is a lot of
competition for the role in top-tier organizations and investment
banks. This book is your go-to resource if you want to equip
yourself with the skills required to tackle any real-world problem
in quantitative finance using the popular R programming language.
You'll start by getting an understanding of the basics of R and its
relevance in the field of quantitative finance. Once you've built
this foundation, we'll dive into the practicalities of building
financial models in R. This will help you have a fair understanding
of the topics as well as their implementation, as the authors have
presented some use cases along with examples that are easy to
understand and correlate. We'll also look at risk management and
optimization techniques for algorithmic trading. Finally, the book
will explain some advanced concepts, such as trading using machine
learning, optimizations, exotic options, and hedging. By the end of
this book, you will have a firm grasp of the techniques required to
implement basic quantitative finance models in R. Style and
approach This book introduces you to the essentials of quantitative
finance with the help of easy-to-understand, practical examples and
use cases in R. Each chapter presents a specific financial concept
in detail, backed with relevant theory and the implementation of a
real-life example.
Data Mining with R: Learning with Case Studies, Second Edition uses
practical examples to illustrate the power of R and data mining.
Providing an extensive update to the best-selling first edition,
this new edition is divided into two parts. The first part will
feature introductory material, including a new chapter that
provides an introduction to data mining, to complement the already
existing introduction to R. The second part includes case studies,
and the new edition strongly revises the R code of the case studies
making it more up-to-date with recent packages that have emerged in
R. The book does not assume any prior knowledge about R. Readers
who are new to R and data mining should be able to follow the case
studies, and they are designed to be self-contained so the reader
can start anywhere in the document. The book is accompanied by a
set of freely available R source files that can be obtained at the
book's web site. These files include all the code used in the case
studies, and they facilitate the "do-it-yourself" approach followed
in the book. Designed for users of data analysis tools, as well as
researchers and developers, the book should be useful for anyone
interested in entering the "world" of R and data mining. About the
Author Luis Torgo is an associate professor in the Department of
Computer Science at the University of Porto in Portugal. He teaches
Data Mining in R in the NYU Stern School of Business' MS in
Business Analytics program. An active researcher in machine
learning and data mining for more than 20 years, Dr. Torgo is also
a researcher in the Laboratory of Artificial Intelligence and Data
Analysis (LIAAD) of INESC Porto LA.
Build Machine Learning models with a sound statistical
understanding. About This Book * Learn about the statistics behind
powerful predictive models with p-value, ANOVA, and F- statistics.
* Implement statistical computations programmatically for
supervised and unsupervised learning through K-means clustering. *
Master the statistical aspect of Machine Learning with the help of
this example-rich guide to R and Python. Who This Book Is For This
book is intended for developers with little to no background in
statistics, who want to implement Machine Learning in their
systems. Some programming knowledge in R or Python will be useful.
What You Will Learn * Understand the Statistical and Machine
Learning fundamentals necessary to build models * Understand the
major differences and parallels between the statistical way and the
Machine Learning way to solve problems * Learn how to prepare data
and feed models by using the appropriate Machine Learning
algorithms from the more-than-adequate R and Python packages *
Analyze the results and tune the model appropriately to your own
predictive goals * Understand the concepts of required statistics
for Machine Learning * Introduce yourself to necessary fundamentals
required for building supervised & unsupervised deep learning
models * Learn reinforcement learning and its application in the
field of artificial intelligence domain In Detail Complex
statistics in Machine Learning worry a lot of developers. Knowing
statistics helps you build strong Machine Learning models that are
optimized for a given problem statement. This book will teach you
all it takes to perform complex statistical computations required
for Machine Learning. You will gain information on statistics
behind supervised learning, unsupervised learning, reinforcement
learning, and more. Understand the real-world examples that discuss
the statistical side of Machine Learning and familiarize yourself
with it. You will also design programs for performing tasks such as
model, parameter fitting, regression, classification, density
collection, and more. By the end of the book, you will have
mastered the required statistics for Machine Learning and will be
able to apply your new skills to any sort of industry problem.
Style and approach This practical, step-by-step guide will give you
an understanding of the Statistical and Machine Learning
fundamentals you'll need to build models.
Gain sharp insights into your data and solve real-world data
science problems with R-from data munging to modeling and
visualization About This Book * Handle your data with precision and
care for optimal business intelligence * Restructure and transform
your data to inform decision-making * Packed with practical advice
and tips to help you get to grips with data mining Who This Book Is
For If you are a data scientist or R developer who wants to explore
and optimize your use of R's advanced features and tools, this is
the book for you. A basic knowledge of R is required, along with an
understanding of database logic. What You Will Learn * Connect to
and load data from R's range of powerful databases * Successfully
fetch and parse structured and unstructured data * Transform and
restructure your data with efficient R packages * Define and build
complex statistical models with glm * Develop and train machine
learning algorithms * Visualize social networks and graph data *
Deploy supervised and unsupervised classification algorithms *
Discover how to visualize spatial data with R In Detail R is an
essential language for sharp and successful data analysis. Its
numerous features and ease of use make it a powerful way of mining,
managing, and interpreting large sets of data. In a world where
understanding big data has become key, by mastering R you will be
able to deal with your data effectively and efficiently. This book
will give you the guidance you need to build and develop your
knowledge and expertise. Bridging the gap between theory and
practice, this book will help you to understand and use data for a
competitive advantage. Beginning with taking you through essential
data mining and management tasks such as munging, fetching,
cleaning, and restructuring, the book then explores different model
designs and the core components of effective analysis. You will
then discover how to optimize your use of machine learning
algorithms for classification and recommendation systems beside the
traditional and more recent statistical methods. Style and approach
Covering the essential tasks and skills within data science,
Mastering Data Analysis provides you with solutions to the
challenges of data science. Each section gives you a theoretical
overview before demonstrating how to put the theory to work with
real-world use cases and hands-on examples.
This accessible and classroom-tested textbook/reference presents an
introduction to the fundamentals of the emerging and
interdisciplinary field of data science. The coverage spans key
concepts adopted from statistics and machine learning, useful
techniques for graph analysis and parallel programming, and the
practical application of data science for such tasks as building
recommender systems or performing sentiment analysis. Topics and
features: provides numerous practical case studies using real-world
data throughout the book; supports understanding through hands-on
experience of solving data science problems using Python; describes
techniques and tools for statistical analysis, machine learning,
graph analysis, and parallel programming; reviews a range of
applications of data science, including recommender systems and
sentiment analysis of text data; provides supplementary code
resources and data at an associated website.
|
|