|
Books > Computing & IT > Computer software packages > Other software packages > Mathematical & statistical software
A practical guide to obtaining, transforming, exploring, and
analyzing data using Python, MongoDB, and Apache Spark About This
Book * Learn to use various data analysis tools and algorithms to
classify, cluster, visualize, simulate, and forecast your data *
Apply Machine Learning algorithms to different kinds of data such
as social networks, time series, and images * A hands-on guide to
understanding the nature of data and how to turn it into insight
Who This Book Is For This book is for developers who want to
implement data analysis and data-driven algorithms in a practical
way. It is also suitable for those without a background in data
analysis or data processing. Basic knowledge of Python programming,
statistics, and linear algebra is assumed. What You Will Learn *
Acquire, format, and visualize your data * Build an
image-similarity search engine * Generate meaningful visualizations
anyone can understand * Get started with analyzing social network
graphs * Find out how to implement sentiment text analysis *
Install data analysis tools such as Pandas, MongoDB, and Apache
Spark * Get to grips with Apache Spark * Implement machine learning
algorithms such as classification or forecasting In Detail Beyond
buzzwords like Big Data or Data Science, there are a great
opportunities to innovate in many businesses using data analysis to
get data-driven products. Data analysis involves asking many
questions about data in order to discover insights and generate
value for a product or a service. This book explains the basic data
algorithms without the theoretical jargon, and you'll get hands-on
turning data into insights using machine learning techniques. We
will perform data-driven innovation processing for several types of
data such as text, Images, social network graphs, documents, and
time series, showing you how to implement large data processing
with MongoDB and Apache Spark. Style and approach This is a
hands-on guide to data analysis and data processing. The concrete
examples are explained with simple code and accessible data.
Over 100 hands-on recipes to effectively solve real-world data
problems using the most popular R packages and techniques About
This Book * Gain insight into how data scientists collect, process,
analyze, and visualize data using some of the most popular R
packages * Understand how to apply useful data analysis techniques
in R for real-world applications * An easy-to-follow guide to make
the life of data scientist easier with the problems faced while
performing data analysis Who This Book Is For This book is for
those who are already familiar with the basic operation of R, but
want to learn how to efficiently and effectively analyze real-world
data problems using practical R packages. What You Will Learn * Get
to know the functional characteristics of R language * Extract,
transform, and load data from heterogeneous sources * Understand
how easily R can confront probability and statistics problems * Get
simple R instructions to quickly organize and manipulate large
datasets * Create professional data visualizations and interactive
reports * Predict user purchase behavior by adopting a
classification approach * Implement data mining techniques to
discover items that are frequently purchased together * Group
similar text documents by using various clustering methods In
Detail This cookbook offers a range of data analysis samples in
simple and straightforward R code, providing step-by-step resources
and time-saving methods to help you solve data problems
efficiently. The first section deals with how to create R functions
to avoid the unnecessary duplication of code. You will learn how to
prepare, process, and perform sophisticated ETL for heterogeneous
data sources with R packages. An example of data manipulation is
provided, illustrating how to use the "dplyr" and "data.table"
packages to efficiently process larger data structures. We also
focus on "ggplot2" and show you how to create advanced figures for
data exploration. In addition, you will learn how to build an
interactive report using the "ggvis" package. Later chapters offer
insight into time series analysis on financial data, while there is
detailed information on the hot topic of machine learning,
including data classification, regression, clustering, association
rule mining, and dimension reduction. By the end of this book, you
will understand how to resolve issues and will be able to
comfortably offer solutions to problems encountered while
performing data analysis. Style and approach This easy-to-follow
guide is full of hands-on examples of data analysis with R. Each
topic is fully explained beginning with the core concept, followed
by step-by-step practical examples, and concluding with detailed
explanations of each concept used.
Implement machine learning, time-series analysis, algorithmic
trading and more About This Book * Understand the basics of R and
how they can be applied in various Quantitative Finance scenarios *
Learn various algorithmic trading techniques and ways to optimize
them using the tools available in R. * Contain different methods to
manage risk and explore trading using Machine Learning. Who This
Book Is For If you want to learn how to use R to build quantitative
finance models with ease, this book is for you. Analysts who want
to learn R to solve their quantitative finance problems will also
find this book useful. Some understanding of the basic financial
concepts will be useful, though prior knowledge of R is not
required. What You Will Learn * Get to know the basics of R and how
to use it in the field of Quantitative Finance * Understand data
processing and model building using R * Explore different types of
analytical techniques such as statistical analysis, time-series
analysis, predictive modeling, and econometric analysis * Build and
analyze quantitative finance models using real-world examples * How
real-life examples should be used to develop strategies *
Performance metrics to look into before deciding upon any model *
Deep dive into the vast world of machine-learning based trading *
Get to grips with algorithmic trading and different ways of
optimizing it * Learn about controlling risk parameters of
financial instruments In Detail The role of a quantitative analyst
is very challenging, yet lucrative, so there is a lot of
competition for the role in top-tier organizations and investment
banks. This book is your go-to resource if you want to equip
yourself with the skills required to tackle any real-world problem
in quantitative finance using the popular R programming language.
You'll start by getting an understanding of the basics of R and its
relevance in the field of quantitative finance. Once you've built
this foundation, we'll dive into the practicalities of building
financial models in R. This will help you have a fair understanding
of the topics as well as their implementation, as the authors have
presented some use cases along with examples that are easy to
understand and correlate. We'll also look at risk management and
optimization techniques for algorithmic trading. Finally, the book
will explain some advanced concepts, such as trading using machine
learning, optimizations, exotic options, and hedging. By the end of
this book, you will have a firm grasp of the techniques required to
implement basic quantitative finance models in R. Style and
approach This book introduces you to the essentials of quantitative
finance with the help of easy-to-understand, practical examples and
use cases in R. Each chapter presents a specific financial concept
in detail, backed with relevant theory and the implementation of a
real-life example.
This accessible and classroom-tested textbook/reference presents an
introduction to the fundamentals of the emerging and
interdisciplinary field of data science. The coverage spans key
concepts adopted from statistics and machine learning, useful
techniques for graph analysis and parallel programming, and the
practical application of data science for such tasks as building
recommender systems or performing sentiment analysis. Topics and
features: provides numerous practical case studies using real-world
data throughout the book; supports understanding through hands-on
experience of solving data science problems using Python; describes
techniques and tools for statistical analysis, machine learning,
graph analysis, and parallel programming; reviews a range of
applications of data science, including recommender systems and
sentiment analysis of text data; provides supplementary code
resources and data at an associated website.
Gain sharp insights into your data and solve real-world data
science problems with R-from data munging to modeling and
visualization About This Book * Handle your data with precision and
care for optimal business intelligence * Restructure and transform
your data to inform decision-making * Packed with practical advice
and tips to help you get to grips with data mining Who This Book Is
For If you are a data scientist or R developer who wants to explore
and optimize your use of R's advanced features and tools, this is
the book for you. A basic knowledge of R is required, along with an
understanding of database logic. What You Will Learn * Connect to
and load data from R's range of powerful databases * Successfully
fetch and parse structured and unstructured data * Transform and
restructure your data with efficient R packages * Define and build
complex statistical models with glm * Develop and train machine
learning algorithms * Visualize social networks and graph data *
Deploy supervised and unsupervised classification algorithms *
Discover how to visualize spatial data with R In Detail R is an
essential language for sharp and successful data analysis. Its
numerous features and ease of use make it a powerful way of mining,
managing, and interpreting large sets of data. In a world where
understanding big data has become key, by mastering R you will be
able to deal with your data effectively and efficiently. This book
will give you the guidance you need to build and develop your
knowledge and expertise. Bridging the gap between theory and
practice, this book will help you to understand and use data for a
competitive advantage. Beginning with taking you through essential
data mining and management tasks such as munging, fetching,
cleaning, and restructuring, the book then explores different model
designs and the core components of effective analysis. You will
then discover how to optimize your use of machine learning
algorithms for classification and recommendation systems beside the
traditional and more recent statistical methods. Style and approach
Covering the essential tasks and skills within data science,
Mastering Data Analysis provides you with solutions to the
challenges of data science. Each section gives you a theoretical
overview before demonstrating how to put the theory to work with
real-world use cases and hands-on examples.
Essential MATLAB for Engineers and Scientists, Sixth Edition,
provides a concise, balanced overview of MATLAB's functionality
that facilitates independent learning, with coverage of both the
fundamentals and applications. The essentials of MATLAB are
illustrated throughout, featuring complete coverage of the
software's windows and menus. Program design and algorithm
development are presented clearly and intuitively, along with many
examples from a wide range of familiar scientific and engineering
areas. This updated edition includes the latest MATLAB versions
through 2016a, and is an ideal book for a first course on MATLAB,
or for an engineering problem-solving course using MATLAB, as well
as a self-learning tutorial for professionals and students expected
to learn and apply MATLAB.
Learn and explore the fundamentals of data analysis with power of
Mathematica About This Book * Use the power of Mathematica to
analyze data in your applications * Discover the capabilities of
data classification and pattern recognition offered by Mathematica
* Use hundreds of algorithms for time series analysis to predict
the future Who This Book Is For The book is for those who want to
learn to use the power of Mathematica to analyze and process data.
Perhaps you are already familiar with data analysis but have never
used Mathematica, or you know Mathematica but you are new to data
analysis. With the help of this book, you will be able to quickly
catch up on the key points for a successful start. What You Will
Learn * Import data from different sources to Mathematica * Link
external libraries with programs written in Mathematica * Classify
data and partition them into clusters * Recognize faces, objects,
text, and barcodes * Use Mathematica functions for time series
analysis * Use algorithms for statistical data processing * Predict
the result based on the observations In Detail There are many
algorithms for data analysis and it's not always possible to
quickly choose the best one for each case. Implementation of the
algorithms takes a lot of time. With the help of Mathematica, you
can quickly get a result from the use of a particular method,
because this system contains almost all the known algorithms for
data analysis. If you are not a programmer but you need to analyze
data, this book will show you the capabilities of Mathematica when
just few strings of intelligible code help to solve huge tasks from
statistical issues to pattern recognition. If you're a programmer,
with the help of this book, you will learn how to use the library
of algorithms implemented in Mathematica in your programs, as well
as how to write algorithm testing procedure. With each chapter,
you'll be more immersed in the special world of Mathematica. Along
with intuitive queries for data processing, we will highlight the
nuances and features of this system, allowing you to build
effective analysis systems. With the help of this book, you will
learn how to optimize the computations by combining your libraries
with the Mathematica kernel. Style and approach This book takes a
step-by-step approach, accompanied by examples, so you get a better
understanding of the logic of writing algorithms for data analysis
in Mathematica. We provide a detailed explanation of all the
nuances of the Mathematica language, no matter what your level of
experience is.
Harness the power of RStudio to create web applications, R
packages, markdown reports and pretty data visualizations About
This Book * Discover the multi-functional use of RStudio to support
your daily work with R code * Learn to create stunning, meaningful,
and interactive graphs and learn to embed them into easy
communicable reports using multiple R packages * Develop your own R
packages and Shiny web apps to share your knowledge and collaborate
with others Who This Book Is For This book is aimed at R developers
and analysts who wish to do R statistical development while taking
advantage of RStudio's functionality to ease their development
efforts. R programming experience is assumed as well as being
comfortable with R's basic structures and a number of functions.
What You Will Learn * Discover the RStudio IDE and details about
the user interface * Communicate your insights with R Markdown in
static and interactive ways * Learn how to use different graphic
systems to visualize your data * Build interactive web applications
with the Shiny framework to present and share your results *
Understand the process of package development and assemble your own
R packages * Easily collaborate with other people on your projects
by using Git and GitHub * Manage the R environment for your
organization with RStudio and Shiny server * Apply your obtained
knowledge about RStudio and R development to create a real-world
dashboard solution In Detail RStudio helps you to manage small to
large projects by giving you a multi-functional integrated
development environment, combined with the power and flexibility of
the R programming language, which is becoming the bridge language
of data science for developers and analyst worldwide. Mastering the
use of RStudio will help you to solve real-world data problems.
This book begins by guiding you through the installation of RStudio
and explaining the user interface step by step. From there, the
next logical step is to use this knowledge to improve your data
analysis workflow. We will do this by building up our toolbox to
create interactive reports and graphs or even web applications with
Shiny. To collaborate with others, we will explore how to use Git
and GitHub with RStudio and how to build your own packages to
ensure top quality results. Finally, we put it all together in an
interactive dashboard written with R. Style and approach An
easy-to-follow guide full of hands-on examples to master RStudio.
Beginning from explaining the basics, each topic is explained with
a lot of details for every feature.
|
|