![]() |
![]() |
Your cart is empty |
||
Books > Computing & IT > Computer software packages > Other software packages > Mathematical & statistical software
This proceedings volume features top contributions in modern statistical methods from Statistics 2021 Canada, the 6th Annual Canadian Conference in Applied Statistics, held virtually on July 15-18, 2021. Papers are contributed from established and emerging scholars, covering cutting-edge and contemporary innovative techniques in statistics and data science. Major areas of contribution include Bayesian statistics; computational statistics; data science; semi-parametric regression; and stochastic methods in biology, crop science, ecology and engineering. It will be a valuable edited collection for graduate students, researchers, and practitioners in a wide array of applied statistical and data science methods.
This book provides an accessible introduction and practical guidelines to apply asymmetric multidimensional scaling, cluster analysis, and related methods to asymmetric one-mode two-way and three-way asymmetric data. A major objective of this book is to present to applied researchers a set of methods and algorithms for graphical representation and clustering of asymmetric relationships. Data frequently concern measurements of asymmetric relationships between pairs of objects from a given set (e.g., subjects, variables, attributes,...), collected in one or more matrices. Examples abound in many different fields such as psychology, sociology, marketing research, and linguistics and more recently several applications have appeared in technological areas including cybernetics, air traffic control, robotics, and network analysis. The capabilities of the presented algorithms are illustrated by carefully chosen examples and supported by extensive data analyses. A review of the specialized statistical software available for the applications is also provided. This monograph is highly recommended to readers who need a complete and up-to-date reference on methods for asymmetric proximity data analysis.
The book presents the fundamental concepts from asymptotic statistical inference theory, elaborating on some basic large sample optimality properties of estimators and some test procedures. The most desirable property of consistency of an estimator and its large sample distribution, with suitable normalization, are discussed, the focus being on the consistent and asymptotically normal (CAN) estimators. It is shown that for the probability models belonging to an exponential family and a Cramer family, the maximum likelihood estimators of the indexing parameters are CAN. The book describes some large sample test procedures, in particular, the most frequently used likelihood ratio test procedure. Various applications of the likelihood ratio test procedure are addressed, when the underlying probability model is a multinomial distribution. These include tests for the goodness of fit and tests for contingency tables. The book also discusses a score test and Wald's test, their relationship with the likelihood ratio test and Karl Pearson's chi-square test. An important finding is that, while testing any hypothesis about the parameters of a multinomial distribution, a score test statistic and Karl Pearson's chi-square test statistic are identical. Numerous illustrative examples of differing difficulty level are incorporated to clarify the concepts. For better assimilation of the notions, various exercises are included in each chapter. Solutions to almost all the exercises are given in the last chapter, to motivate students towards solving these exercises and to enable digestion of the underlying concepts. The concepts from asymptotic inference are crucial in modern statistics, but are difficult to grasp in view of their abstract nature. To overcome this difficulty, keeping up with the recent trend of using R software for statistical computations, the book uses it extensively, for illustrating the concepts, verifying the properties of estimators and carrying out various test procedures. The last section of the chapters presents R codes to reveal and visually demonstrate the hidden aspects of different concepts and procedures. Augmenting the theory with R software is a novel and a unique feature of the book. The book is designed primarily to serve as a text book for a one semester introductory course in asymptotic statistical inference, in a post-graduate program, such as Statistics, Bio-statistics or Econometrics. It will also provide sufficient background information for studying inference in stochastic processes. The book will cater to the need of a concise but clear and student-friendly book introducing, conceptually and computationally, basics of asymptotic inference.
This textbook provides a step-by-step introduction to the tools and principles of high-dimensional statistics. Each chapter is complemented by numerous exercises, many of them with detailed solutions, and computer labs in R that convey valuable practical insights. The book covers the theory and practice of high-dimensional linear regression, graphical models, and inference, ensuring readers have a smooth start in the field. It also offers suggestions for further reading. Given its scope, the textbook is intended for beginning graduate and advanced undergraduate students in statistics, biostatistics, and bioinformatics, though it will be equally useful to a broader audience.
Most people who have been exposed to Lean, Six Sigma, or other process improvement training have learned good data collection, plotting, and analysis methods, along with how to use Minitab statistical software to help with those tasks. However, the sample data files used in training are usually clean and simple and don't reflect what students actually encounter in the real world. The reader will learn an ideal format for Minitab data sets, and two ways for getting a data file formatted optimally for analysis using Minitab or other statistical software.
The richly illustrated Interactive Web-Based Data Visualization with R, plotly, and shiny focuses on the process of programming interactive web graphics for multidimensional data analysis. It is written for the data analyst who wants to leverage the capabilities of interactive web graphics without having to learn web programming. Through many R code examples, you will learn how to tap the extensive functionality of these tools to enhance the presentation and exploration of data. By mastering these concepts and tools, you will impress your colleagues with your ability to quickly generate more informative, engaging, and reproducible interactive graphics using free and open source software that you can share over email, export to pdf, and more. Key Features: Convert static ggplot2 graphics to an interactive web-based form Link, animate, and arrange multiple plots in standalone HTML from R Embed, modify, and respond to plotly graphics in a shiny app Learn best practices for visualizing continuous, discrete, and multivariate data Learn numerous ways to visualize geo-spatial data This book makes heavy use of plotly for graphical rendering, but you will also learn about other R packages that support different phases of a data science workflow, such as tidyr, dplyr, and tidyverse. Along the way, you will gain insight into best practices for visualization of high-dimensional data, statistical graphics, and graphical perception. The printed book is complemented by an interactive website where readers can view movies demonstrating the examples and interact with graphics.
This introductory textbook presents research methods and data analysis tools in non-technical language. It explains the research process and the basics of qualitative and quantitative data analysis, including procedures and methods, analysis, interpretation, and applications using hands-on data examples in QDA Miner Lite and IBM SPSS Statistics software. The book is divided into four parts that address study and research design; data collection, qualitative methods and surveys; statistical methods, including hypothesis testing, regression, cluster and factor analysis; and reporting. The intended audience is business and social science students learning scientific research methods, however, given its business context, the book will be equally useful for decision-makers in businesses and organizations.
This book presents a general method for deriving higher-order statistics of multivariate distributions with simple algorithms that allow for actual calculations. Multivariate nonlinear statistical models require the study of higher-order moments and cumulants. The main tool used for the definitions is the tensor derivative, leading to several useful expressions concerning Hermite polynomials, moments, cumulants, skewness, and kurtosis. A general test of multivariate skewness and kurtosis is obtained from this treatment. Exercises are provided for each chapter to help the readers understand the methods. Lastly, the book includes a comprehensive list of references, equipping readers to explore further on their own.
An authoritative introduction to the latest comparative methods in evolutionary biology Phylogenetic comparative methods are a suite of statistical approaches that enable biologists to analyze and better understand the evolutionary tree of life, and shed vital new light on patterns of divergence and common ancestry among all species on Earth. This textbook shows how to carry out phylogenetic comparative analyses in the R statistical computing environment. Liam Revell and Luke Harmon provide an incisive conceptual overview of each method along with worked examples using real data and challenge problems that encourage students to learn by doing. By working through this book, students will gain a solid foundation in these methods and develop the skills they need to interpret patterns in the tree of life. Covers every major method of modern phylogenetic comparative analysis in R Explains the basics of R and discusses topics such as trait evolution, diversification, trait-dependent diversification, biogeography, and visualization Features a wealth of exercises and challenge problems Serves as an invaluable resource for students and researchers, with applications in ecology, evolution, anthropology, disease transmission, conservation biology, and a host of other areas Written by two of today's leading developers of phylogenetic comparative methods
The book covers computational statistics, its methodologies and applications for IoT device. It includes the details in the areas of computational arithmetic and its influence on computational statistics, numerical algorithms in statistical application software, basics of computer systems, statistical techniques, linear algebra and its role in optimization techniques, evolution of optimization techniques, optimal utilization of computer resources, and statistical graphics role in data analysis. It also explores computational inferencing and computer model's role in design of experiments, Bayesian analysis, survival analysis and data mining in computational statistics.
The most crucial ability for machine learning and data science is mathematical logic for grasping their essence rather than relying on knowledge or experience. This textbook addresses the fundamentals of kernel methods for machine learning by considering relevant math problems and building Python programs. The book's main features are as follows: The content is written in an easy-to-follow and self-contained style. The book includes 100 exercises, which have been carefully selected and refined. As their solutions are provided in the main text, readers can solve all of the exercises by reading the book. The mathematical premises of kernels are proven and the correct conclusions are provided, helping readers to understand the nature of kernels. Source programs and running examples are presented to help readers acquire a deeper understanding of the mathematics used. Once readers have a basic understanding of the functional analysis topics covered in Chapter 2, the applications are discussed in the subsequent chapters. Here, no prior knowledge of mathematics is assumed. This book considers both the kernel for reproducing kernel Hilbert space (RKHS) and the kernel for the Gaussian process; a clear distinction is made between the two.
This book is a result of a workshop, the 8th of the successful TopoInVis workshop series, held in 2019 in Nykoeping, Sweden. The workshop regularly gathers some of the world's leading experts in this field. Thereby, it provides a forum for discussions on the latest advances in the field with a focus on finding practical solutions to open problems in topological data analysis for visualization. The contributions provide introductory and novel research articles including new concepts for the analysis of multivariate and time-dependent data, robust computational approaches for the extraction and approximations of topological structures with theoretical guarantees, and applications of topological scalar and vector field analysis for visualization. The applications span a wide range of scientific areas comprising climate science, material sciences, fluid dynamics, and astronomy. In addition, community efforts with respect to joint software development are reported and discussed.
This new edition to the classic book by ggplot2 creator Hadley Wickham highlights compatibility with knitr and RStudio. ggplot2 is a data visualization package for R that helps users create data graphics, including those that are multi-layered, with ease. With ggplot2, it's easy to: produce handsome, publication-quality plots with automatic legends created from the plot specification superimpose multiple layers (points, lines, maps, tiles, box plots) from different data sources with automatically adjusted common scales add customizable smoothers that use powerful modeling capabilities of R, such as loess, linear models, generalized additive models, and robust regression save any ggplot2 plot (or part thereof) for later modification or reuse create custom themes that capture in-house or journal style requirements and that can easily be applied to multiple plots approach a graph from a visual perspective, thinking about how each component of the data is represented on the final plot This book will be useful to everyone who has struggled with displaying data in an informative and attractive way. Some basic knowledge of R is necessary (e.g., importing data into R). ggplot2 is a mini-language specifically tailored for producing graphics, and you'll learn everything you need in the book. After reading this book you'll be able to produce graphics customized precisely for your problems, and you'll find it easy to get graphics out of your head and on to the screen or page.
The nonequilibrium behavior of nanoscopic and biological systems, which are typically strongly fluctuating, is a major focus of current research. Lately, much progress has been made in understanding such systems from a thermodynamic perspective. However, new theoretical challenges emerge when the fluctuating system is additionally subject to time delay, e.g. due to the presence of feedback loops. This thesis advances this young and vibrant research field in several directions. The first main contribution concerns the probabilistic description of time-delayed systems; e.g. by introducing a versatile approximation scheme for nonlinear delay systems. Second, it reveals that delay can induce intriguing thermodynamic properties such as anomalous (reversed) heat flow. More generally, the thesis shows how to treat the thermodynamics of non-Markovian systems by introducing auxiliary variables. It turns out that delayed feedback is inextricably linked to nonreciprocal coupling, information flow, and to net energy input on the fluctuating level.
This monograph uses the Julia language to guide the reader through an exploration of the fundamental concepts of probability and statistics, all with a view of mastering machine learning, data science, and artificial intelligence. The text does not require any prior statistical knowledge and only assumes a basic understanding of programming and mathematical notation. It is accessible to practitioners and researchers in data science, machine learning, bio-statistics, finance, or engineering who may wish to solidify their knowledge of probability and statistics. The book progresses through ten independent chapters starting with an introduction of Julia, and moving through basic probability, distributions, statistical inference, regression analysis, machine learning methods, and the use of Monte Carlo simulation for dynamic stochastic models. Ultimately this text introduces the Julia programming language as a computational tool, uniquely addressing end-users rather than developers. It makes heavy use of over 200 code examples to illustrate dozens of key statistical concepts. The Julia code, written in a simple format with parameters that can be easily modified, is also available for download from the book's associated GitHub repository online. See what co-creators of the Julia language are saying about the book: Professor Alan Edelman, MIT: With "Statistics with Julia", Yoni and Hayden have written an easy to read, well organized, modern introduction to statistics. The code may be looked at, and understood on the static pages of a book, or even better, when running live on a computer. Everything you need is here in one nicely written self-contained reference. Dr. Viral Shah, CEO of Julia Computing: Yoni and Hayden provide a modern way to learn statistics with the Julia programming language. This book has been perfected through iteration over several semesters in the classroom. It prepares the reader with two complementary skills - statistical reasoning with hands on experience and working with large datasets through training in Julia.
This book provides a complete and comprehensive guide to Pyomo (Python Optimization Modeling Objects) for beginning and advanced modelers, including students at the undergraduate and graduate levels, academic researchers, and practitioners. Using many examples to illustrate the different techniques useful for formulating models, this text beautifully elucidates the breadth of modeling capabilities that are supported by Pyomo and its handling of complex real-world applications. In the third edition, much of the material has been reorganized, new examples have been added, and a new chapter has been added describing how modelers can improve the performance of their models. The authors have also modified their recommended method for importing Pyomo. A big change in this edition is the emphasis of concrete models, which provide fewer restrictions on the specification and use of Pyomo models. Pyomo is an open source software package for formulating and solving large-scale optimization problems. The software extends the modeling approach supported by modern AML (Algebraic Modeling Language) tools. Pyomo is a flexible, extensible, and portable AML that is embedded in Python, a full-featured scripting language. Python is a powerful and dynamic programming language that has a very clear, readable syntax and intuitive object orientation. Pyomo includes Python classes for defining sparse sets, parameters, and variables, which can be used to formulate algebraic expressions that define objectives and constraints. Moreover, Pyomo can be used from a command-line interface and within Python's interactive command environment, which makes it easy to create Pyomo models, apply a variety of optimizers, and examine solutions.
A new paradigm for scientific discovery through computational tools now permeates every aspect of astronomical research. Computational astrophysics combines modern computational methods, novel hardware designs, advanced algorithms, original software implementations, and associated technologies to discover new phenomena and to make predictions in astronomy. The proceedings of IAU Symposium 362 summarizes ongoing developments in computational astrophysics through astronomers in diverse fields sharing their knowledge and approaches. It focuses on computational methods applied to speed up and broaden the scope of scientific studies, such as finding trends in observational data, high performance computing, automated search algorithms, and model predictability. Experts discuss a palette of challenging informational and technical developments, with the goal of coordinating their efforts and the improvement of techniques in pursuit of a wide range of astronomical studies, including fluid dynamics in star and galaxy evolution, exoplanets, gravitational waves, numerical relativity, data mining, and much more.
An easy-to-understand course book, based on the authentic lectures and detailed research, conducted by the authors themselves, on information optics, holography and MATLAB. This book is the first to highlight the incoherent optical system, provide up-to-date, novel digital holography techniques, and demonstrate MATLAB codes to accomplish tasks such as optical image processing and pattern recognition. This title is a comprehensive introduction to the basics of Fourier optics as well as optical image processing and digital holography. A step-by-step guide which details the vast majority of the derivations, without omitting essential steps, to facilitate a clear mathematical understanding. This book also features exercises at the end of each chapter, providing hands-on experience and consolidating understanding. An ideal companion for graduates and researchers involved in engineering and applied physics, as well as interested in the growing field of information optics.
This book covers applications of R to the general discipline of radiation dosimetry and to the specific areas of luminescence dosimetry, luminescence dating, and radiation protection dosimetry. It features more than 90 detailed worked examples of R code fully integrated into the text, with extensive annotations. The book shows how researchers can use available R packages to analyze their experimental data, and how to extract the various parameters describing mathematically the luminescence signals. In each chapter, the theory behind the subject is summarized, and references are given from the literature, so that researchers can look up the details of the theory and the relevant experiments. Several chapters are dedicated to Monte Carlo methods, which are used to simulate the luminescence processes during the irradiation, heating, and optical stimulation of solids, for a wide variety of materials. This book will be useful to those who use the tools of luminescence dosimetry, including physicists, geologists, archaeologists, and for all researchers who use radiation in their research.
This book shows you how to analyze data sets systematically and to use Excel 2019 to extract information from data almost effortlessly. Both are (not) an art! The statistical methods are presented and discussed using a single data set. This makes it clear how the methods build on each other and gradually more and more information can be extracted from the data. The Excel functions used are explained in detail - the procedure can be easily transferred to other data sets. Various didactic elements facilitate orientation and working with the book: At the checkpoints, the most important aspects from each chapter are briefly summarized. In the freak knowledge section, more advanced aspects are addressed to whet the appetite for more. All examples are calculated with hand and Excel. Numerous applications and solutions as well as further data sets are available on the author's internet platform. This book is a translation of the original German 2nd edition Statistik angewandt mit Excel by Franz Kronthaler, published by Springer-Verlag GmbH Germany, part of Springer Nature in 2021. The translation was done with the help of artificial intelligence (machine translation by the service DeepL.com). A subsequent human revision was done primarily in terms of content, so that the book will read stylistically differently from a conventional translation. Springer Nature works continuously to further the development of tools for the production of books and on the related technologies to support the authors.
BRIDGES THE GAP BETWEEN SAS AND R, ALLOWING USERS TRAINED IN ONE LANGUAGE TO EASILY LEARN THE OTHER SAS and R are widely-used, very different software environments. Prized for its statistical and graphical tools, R is an open-source programming language that is popular with statisticians and data miners who develop statistical software and analyze data. SAS (Statistical Analysis System) is the leading corporate software in analytics thanks to its faster data handling and smaller learning curve. SAS for R Users enables entry-level data scientists to take advantage of the best aspects of both tools by providing a cross-functional framework for users who already know R but may need to work with SAS. Those with knowledge of both R and SAS are of far greater value to employers, particularly in corporate settings. Using a clear, step-by-step approach, this book presents an analytics workflow that mirrors that of the everyday data scientist. This up-to-date guide is compatible with the latest R packages as well as SAS University Edition. Useful for anyone seeking employment in data science, this book: Instructs both practitioners and students fluent in one language seeking to learn the other Provides command-by-command translations of R to SAS and SAS to R Offers examples and applications in both R and SAS Presents step-by-step guidance on workflows, color illustrations, sample code, chapter quizzes, and more Includes sections on advanced methods and applications Designed for professionals, researchers, and students, SAS for R Users is a valuable resource for those with some knowledge of coding and basic statistics who wish to enter the realm of data science and business analytics.
Master the syntax for working with R's plotting functions in graphics and stats in this easy reference to formatting plots. The approach in Visualizing Data in R 4 toward the application of formatting in ggplot() will follow the structure of the formatting used by the plotting functions in graphics and stats. This book will take advantage of the new features added to R 4 where appropriate including a refreshed color palette for charts, Cairo graphics with more fonts/symbols, and improved performance from grid graphics including ggplot 2 rendering speed. Visualizing Data in R 4 starts with an introduction and then is split into two parts and six appendices. Part I covers the function plot() and the ancillary functions you can use with plot(). You'll also see the functions par() and layout(), providing for multiple plots on a page. Part II goes over the basics of using the functions qplot() and ggplot() in the package ggplot2. The default plots generated by the functions qplot() and ggplot() give more sophisticated-looking plots than the default plots done by plot() and are easier to use, but the function plot() is more flexible. Both plot() and ggplot() allow for many layers to a plot. The six appendices will cover plots for contingency tables, plots for continuous variables, plots for data with a limited number of values, functions that generate multiple plots, plots for time series analysis, and some miscellaneous plots. Some of the functions that will be in the appendices include functions that generate histograms, bar charts, pie charts, box plots, and heatmaps. What You Will Learn Use R to create informative graphics Master plot(), qplot(), and ggplot() Discover the canned graphics functions in stats and graphics Format plots generated by plot() and ggplot() Who This Book Is For Those in data science who use R. Some prior experience with R or data science is recommended.
Employ essential tools and functions of the MATLAB and Simulink packages, which are explained and demonstrated via interactive examples and case studies. This revised edition covers features from the latest MATLAB 2022b release, as well as other features that have been released since the first edition published. This book contains dozens of simulation models and solved problems via m-files/scripts and Simulink models which will help you to learn programming and modelling essentials. You'll become efficient with many of the built-in tools and functions of MATLAB/Simulink while solving engineering and scientific computing problems. Beginning MATLAB and Simulink, Second Edition explains various practical issues of programming and modelling in parallel by comparing MATLAB and Simulink. After studying and using this book, you'll be proficient at using MATLAB and Simulink and applying the source code and models from the book's examples as templates for your own projects in data science or engineering. What You Will Learn Master the programming and modelling essentials of MATLAB and Simulink Carry out data visualization with MATLAB Build a GUI and develop App with MATLAB Work with integration and numerical root finding methods Apply MATLAB to differential equations-based models and simulations Use MATLAB and Simulink for data science projects Who This Book Is ForEngineers, programmers, data scientists, and students majoring in engineering and scientific computing who are new to MATLAB and Simulink.
The main focus of this book is on presenting advances in fuzzy statistics, and on proposing a methodology for testing hypotheses in the fuzzy environment based on the estimation of fuzzy confidence intervals, a context in which not only the data but also the hypotheses are considered to be fuzzy. The proposed method for estimating these intervals is based on the likelihood method and employs the bootstrap technique. A new metric generalizing the signed distance measure is also developed. In turn, the book presents two conceptually diverse applications in which defended intervals play a role: one is a novel methodology for evaluating linguistic questionnaires developed at the global and individual levels; the other is an extension of the multi-ways analysis of variance to the space of fuzzy sets. To illustrate these approaches, the book presents several empirical and simulation-based studies with synthetic and real data sets. In closing, it presents a coherent R package called "FuzzySTs" which covers all the previously mentioned concepts with full documentation and selected use cases. Given its scope, the book will be of interest to all researchers whose work involves advanced fuzzy statistical methods.
Watch Andy talk about the new version of his book for R: click here Hot on the heels of the award-winning and best selling Discovering Statistics Using SPSS Third Edition, Andy Field has teamed up with Jeremy Miles (co-author of Discovering Statistics Using SAS) to write Discovering Statistics Using R. Keeping the uniquely humorous and self-depreciating style that has made students across the world fall in love with Andy Field's books, Discovering Statistics Using R takes students on a journey of statistical discovery using the freeware R, a free, flexible and dynamically changing software tool for data analysis that is becoming increasingly popular across the social and behavioral sciences throughout the world. The journey begins by explaining basic statistical and research concepts before a guided tour of the R software environment. Next the importance of exploring and graphing data will be discovered, before moving onto statistical tests that are the foundations of the rest of the book (for e.g. correlation and regression). Readers will then stride confidently into intermediate level analyses such as ANOVA, before ending their journey with advanced techniques such as MANOVA and multilevel models. Although there is enough theory to help the reader gain the necessary conceptual understanding of what they're doing, the emphasis is on applying what's learned to playful and real-world examples that should make the experience more fun than expected. Like its sister textbooks, Discovering Statistics Using R is written in an irreverent style and follows the same ground-breaking structure and pedagogical approach. The core material is augmented by a cast of characters to help the reader on their way, hundreds of examples, self-assessment tests to consolidate knowledge, and additional website material for those wanting to learn more (at www.sagepub.co.uk/fieldandmilesR). Given this book's accessibility, fun spirit, and use of bizarre real-w |
![]() ![]() You may like...
Mathematical Modeling for Smart…
Debabrata Samanta, Debabrata Singh
Hardcover
R12,400
Discovery Miles 124 000
SAS Text Analytics for Business…
Teresa Jade, Biljana Belamaric-Wilsey, …
Hardcover
R2,778
Discovery Miles 27 780
Portfolio and Investment Analysis with…
John B. Guerard, Ziwei Wang, …
Hardcover
R2,491
Discovery Miles 24 910
An Introduction to Creating Standardized…
Todd Case, Yuting Tian
Hardcover
R1,623
Discovery Miles 16 230
Implementing CDISC Using SAS - An…
Chris Holland, Jack Shostak
Hardcover
R1,830
Discovery Miles 18 300
|