0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (7)
  • R250 - R500 (60)
  • R500+ (1,250)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Databases > Data capture & analysis

Data Visualization - A Practical Introduction (Paperback): Kieran Healy Data Visualization - A Practical Introduction (Paperback)
Kieran Healy
R1,054 R977 Discovery Miles 9 770 Save R77 (7%) Ships in 12 - 17 working days

An accessible primer on how to create effective graphics from data This book provides students and researchers a hands-on introduction to the principles and practice of data visualization. It explains what makes some graphs succeed while others fail, how to make high-quality figures from data using powerful and reproducible methods, and how to think about data visualization in an honest and effective way. Data Visualization builds the reader's expertise in ggplot2, a versatile visualization library for the R programming language. Through a series of worked examples, this accessible primer then demonstrates how to create plots piece by piece, beginning with summaries of single variables and moving on to more complex graphics. Topics include plotting continuous and categorical variables; layering information on graphics; producing effective "small multiple" plots; grouping, summarizing, and transforming data for plotting; creating maps; working with the output of statistical models; and refining plots to make them more comprehensible. Effective graphics are essential to communicating ideas and a great way to better understand data. This book provides the practical skills students and practitioners need to visualize quantitative data and get the most out of their research findings. Provides hands-on instruction using R and ggplot2 Shows how the "tidyverse" of data analysis tools makes working with R easier and more consistent Includes a library of data sets, code, and functions

Introduction to Data Mining (Hardcover): Camila Thompson Introduction to Data Mining (Hardcover)
Camila Thompson
R3,817 R3,293 Discovery Miles 32 930 Save R524 (14%) Ships in 10 - 15 working days
Advanced Data Science and Analytics with Python (Hardcover): Jesus Rogel-Salazar Advanced Data Science and Analytics with Python (Hardcover)
Jesus Rogel-Salazar
R3,427 Discovery Miles 34 270 Ships in 12 - 17 working days

Advanced Data Science and Analytics with Python enables data scientists to continue developing their skills and apply them in business as well as academic settings. The subjects discussed in this book are complementary and a follow-up to the topics discussed in Data Science and Analytics with Python. The aim is to cover important advanced areas in data science using tools developed in Python such as SciKit-learn, Pandas, Numpy, Beautiful Soup, NLTK, NetworkX and others. The model development is supported by the use of frameworks such as Keras, TensorFlow and Core ML, as well as Swift for the development of iOS and MacOS applications. Features: Targets readers with a background in programming, who are interested in the tools used in data analytics and data science Uses Python throughout Presents tools, alongside solved examples, with steps that the reader can easily reproduce and adapt to their needs Focuses on the practical use of the tools rather than on lengthy explanations Provides the reader with the opportunity to use the book whenever needed rather than following a sequential path The book can be read independently from the previous volume and each of the chapters in this volume is sufficiently independent from the others, providing flexibility for the reader. Each of the topics addressed in the book tackles the data science workflow from a practical perspective, concentrating on the process and results obtained. The implementation and deployment of trained models are central to the book. Time series analysis, natural language processing, topic modelling, social network analysis, neural networks and deep learning are comprehensively covered. The book discusses the need to develop data products and addresses the subject of bringing models to their intended audiences - in this case, literally to the users' fingertips in the form of an iPhone app. About the Author Dr. Jesus Rogel-Salazar is a lead data scientist in the field, working for companies such as Tympa Health Technologies, Barclays, AKQA, IBM Data Science Studio and Dow Jones. He is a visiting researcher at the Department of Physics at Imperial College London, UK and a member of the School of Physics, Astronomy and Mathematics at the University of Hertfordshire, UK.

Clinical Audit and Epi Info (Paperback, 1st New edition): Antony Stewart, Jammi Rao Clinical Audit and Epi Info (Paperback, 1st New edition)
Antony Stewart, Jammi Rao
R1,439 Discovery Miles 14 390 Ships in 12 - 17 working days

This book is designed to enable and encourage health professionals and family support workers to include fathers in the process of their work. It focuses on the enormous potential value of accessing men at a time they are known to be particularly receptive - before and after the birth - within the context of providing solutions in the debate about problematic aspects of masculinity and fatherhood. It looks at how important the father's role is within the family environment and how fathers should be encouraged to take part in the upbringing of their children.

Real-Time Analytics - Techniques to Analyze and Visualize Streaming Data (Paperback): Byron Ellis Real-Time Analytics - Techniques to Analyze and Visualize Streaming Data (Paperback)
Byron Ellis
R1,142 R879 Discovery Miles 8 790 Save R263 (23%) Out of stock

Construct a robust end-to-end solution for analyzing and visualizing streaming data Real-time analytics is the hottest topic in data analytics today. In Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data, expert Byron Ellis teaches data analysts technologies to build an effective real-time analytics platform. This platform can then be used to make sense of the constantly changing data that is beginning to outpace traditional batch-based analysis platforms. The author is among a very few leading experts in the field. He has a prestigious background in research, development, analytics, real-time visualization, and Big Data streaming and is uniquely qualified to help you explore this revolutionary field. Moving from a description of the overall analytic architecture of real-time analytics to using specific tools to obtain targeted results, Real-Time Analytics leverages open source and modern commercial tools to construct robust, efficient systems that can provide real-time analysis in a cost-effective manner. The book includes: * A deep discussion of streaming data systems and architectures * Instructions for analyzing, storing, and delivering streaming data * Tips on aggregating data and working with sets * Information on data warehousing options and techniques Real-Time Analytics includes in-depth case studies for website analytics, Big Data, visualizing streaming and mobile data, and mining and visualizing operational data flows. The book's "recipe" layout lets readers quickly learn and implement different techniques. All of the code examples presented in the book, along with their related data sets, are available on the companion website.

Tableau Prep: Up and Running - Self Service Data Preparation for Better Analysis (Paperback): Carl Allchin Tableau Prep: Up and Running - Self Service Data Preparation for Better Analysis (Paperback)
Carl Allchin
R1,347 Discovery Miles 13 470 Ships in 12 - 17 working days

For self-service data preparation, Tableau Prep is relatively easy to use-as long as you know how to clean and organize your datasets. Carl Allchin, from The Information Lab in London, gets you up to speed on Tableau Prep through a series of practical lessons that include methods for preparing, cleaning, automating, organizing, and outputting your datasets. Based on Allchin's popular blog, Preppin' Data, this practical guide takes you step-by-step through Tableau Prep's fundamentals. Self-service data preparation reduces the time it takes to complete data projects and improves the quality of your analyses. Discover how Tableau Prep helps you access your data and turn it into valuable information. Know what to look for when you prepare data Learn which Tableau Prep functions to use when working with data fields Analyze the shape and profile of your dataset Output data for analysis and learn how Tableau Prep automates your workflow Learn how to clean your dataset using Tableau Prep functions Explore ways to use Tableau Prep techniques in real-world scenarios Make your data available to others by managing and documenting the output

Data Mining - Concepts and Techniques (Paperback, 4th edition): Jiawei Han, Jian Pei, Hanghang Tong Data Mining - Concepts and Techniques (Paperback, 4th edition)
Jiawei Han, Jian Pei, Hanghang Tong
R2,038 Discovery Miles 20 380 Ships in 12 - 17 working days

Data Mining: Concepts and Techniques, Fourth Edition introduces concepts, principles, and methods for mining patterns, knowledge, and models from various kinds of data for diverse applications. Specifically, it delves into the processes for uncovering patterns and knowledge from massive collections of data, known as knowledge discovery from data, or KDD. It focuses on the feasibility, usefulness, effectiveness, and scalability of data mining techniques for large data sets. After an introduction to the concept of data mining, the authors explain the methods for preprocessing, characterizing, and warehousing data. They then partition the data mining methods into several major tasks, introducing concepts and methods for mining frequent patterns, associations, and correlations for large data sets; data classificcation and model construction; cluster analysis; and outlier detection. Concepts and methods for deep learning are systematically introduced as one chapter. Finally, the book covers the trends, applications, and research frontiers in data mining.

Guide to Intelligent Data Science - How to Intelligently Make Use of Real Data (Paperback, 2nd ed. 2020): Michael R. Berthold,... Guide to Intelligent Data Science - How to Intelligently Make Use of Real Data (Paperback, 2nd ed. 2020)
Michael R. Berthold, Christian Borgelt, Frank Hoeppner, Frank Klawonn, Rosaria Silipo
R1,856 Discovery Miles 18 560 Ships in 10 - 15 working days

Making use of data is not anymore a niche project but central to almost every project. With access to massive compute resources and vast amounts of data, it seems at least in principle possible to solve any problem. However, successful data science projects result from the intelligent application of: human intuition in combination with computational power; sound background knowledge with computer-aided modelling; and critical reflection of the obtained insights and results. Substantially updating the previous edition, then entitled Guide to Intelligent Data Analysis, this core textbook continues to provide a hands-on instructional approach to many data science techniques, and explains how these are used to solve real world problems. The work balances the practical aspects of applying and using data science techniques with the theoretical and algorithmic underpinnings from mathematics and statistics. Major updates on techniques and subject coverage (including deep learning) are included. Topics and features: guides the reader through the process of data science, following the interdependent steps of project understanding, data understanding, data blending and transformation, modeling, as well as deployment and monitoring; includes numerous examples using the open source KNIME Analytics Platform, together with an introductory appendix; provides a review of the basics of classical statistics that support and justify many data analysis methods, and a glossary of statistical terms; integrates illustrations and case-study-style examples to support pedagogical exposition; supplies further tools and information at an associated website. This practical and systematic textbook/reference is a "need-to-have" tool for graduate and advanced undergraduate students and essential reading for all professionals who face data science problems. Moreover, it is a "need to use, need to keep" resource following one's exploration of the subject.

Basic Statistics for Social Research - Step-by-Step Calculations & Computer Techniques Using Minitab (Paperback): Duncan Cramer Basic Statistics for Social Research - Step-by-Step Calculations & Computer Techniques Using Minitab (Paperback)
Duncan Cramer
R1,298 Discovery Miles 12 980 Ships in 12 - 17 working days

This accessible introdution to statistics using the program Minitab explains when to apply and how to calculate and interpret a wide range of statistical procedures commonly used in the social sciences. Keeping statistical symbols and formulae to a minimum and using simple examples, this book:
* Assumes no prior knowledge of statistics or computing
* Includes a concise introduction to the program Minitab
* Describes a wider range of tests than other introductory texts
* Contains a comprehensive range of exercises with answers.
Basic Statistics for Social Research will prove an invaluable introductory statistics text for students, and a useful resource for graduates and professionals engaged in research in the social sciences.

Qualitative Computing: Using Software for Qualitative Data Analysis - Using Software for Qualitative Data Analysis (Hardcover,... Qualitative Computing: Using Software for Qualitative Data Analysis - Using Software for Qualitative Data Analysis (Hardcover, New Ed)
Mike Fisher
R4,126 Discovery Miles 41 260 Ships in 12 - 17 working days

As qualitative researchers incorporate computer assistance into their analytic approaches, important questions arise about the adoption of new technology. Is it worth learning computer-assisted methods? Will the pay-off be sufficient to justify the investment? Which programs are worth learning? What are the effects on the analysis process? This book complements the existing literature by giving a detailed account of the use of four major programs in analyzing the same data. Priority is given to the tasks of qualitative analysis rather than to program capability and the programs are treated as tools rather than as a discipline to be acquired. The key is not what the programs allow researcher to do, but whether the tasks that researchers need to undertake are facilitated by the software. Thus the study develops a user-centred approach to the adoption of computer-assisted qualitative data analysis. The author emphasises qualitative analysis as a creative craft, but one which must increasingly be subject to rigorous methodological scrutiny. The adoption of computer-aided methods offers opportunities, but also dangers and ultimately this book is about the scientific qualitative research. Written in a distinctive and succinct style, this book will be valuable to social science researchers and students interested in qualitative research and in the potential for computer-assisted analysis.

Quantitative Data Analysis with Minitab - A Guide for Social Scientists (Paperback): Alan Bryman, Duncan Cramer Quantitative Data Analysis with Minitab - A Guide for Social Scientists (Paperback)
Alan Bryman, Duncan Cramer
R1,301 Discovery Miles 13 010 Ships in 12 - 17 working days

Quantitative data analysis is now a compulsory component of most degree courses in the social sciences and students are increasingly reliant on computers for the analysis of data. Quantitative Data Analysis with Minitab explains statistical tests for Mac users using the same formulae free, non-technical approach as the very successful SPPS version. Students will learn a wide range of quantitative data analysis techniques and become familiar with how these techniques can be implemented through the latest version of Minitab. Techniques covered include univariate analysis (with frequency table, dispersion and histograms), bivariate (with contingency tables correlation, analysis of varience and non-parametric tests) and multivariate analysis (with multiple regression, path analysis, covarience and factor analysis). In addition the book covers issues such as sampling, statistical significance, conceptualization and measurement and the selection of appropriate tests. Each chapter concludes with a set of exercises. Social science students will be interested in this integrated, non-mathematical introduction to quantitative data anlysis and the Minitab package.

Intelligent Data Analysis for COVID-19 Pandemic (Hardcover, 1st ed. 2021): M. Niranjanamurthy, Siddhartha Bhattacharyya, Neeraj... Intelligent Data Analysis for COVID-19 Pandemic (Hardcover, 1st ed. 2021)
M. Niranjanamurthy, Siddhartha Bhattacharyya, Neeraj Kumar
R5,312 Discovery Miles 53 120 Ships in 10 - 15 working days

This book presents intelligent data analysis as a tool to fight against COVID-19 pandemic. The intelligent data analysis includes machine learning, natural language processing, and computer vision applications to teach computers to use big data-based models for pattern recognition, explanation, and prediction. These functions are discussed in detail in the book to recognize (diagnose), predict, and explain (treat) COVID-19 infections, and help manage socio-economic impacts. It also discusses primary warnings and alerts; tracking and prediction; data dashboards; diagnosis and prognosis; treatments and cures; and social control by the use of intelligent data analysis. It provides analysis reports, solutions using real-time data, and solution through web applications details.

Data Analysis and Rationality in a Complex World (Paperback, 1st ed. 2021): Theodore Chadjipadelis, Berthold Lausen, Angelos... Data Analysis and Rationality in a Complex World (Paperback, 1st ed. 2021)
Theodore Chadjipadelis, Berthold Lausen, Angelos Markos, Tae Rim Lee, Angela Montanari, …
R5,272 Discovery Miles 52 720 Ships in 10 - 15 working days

This volume presents the latest advances in statistics and data science, including theoretical, methodological and computational developments and practical applications related to classification and clustering, data gathering, exploratory and multivariate data analysis, statistical modeling, and knowledge discovery and seeking. It includes contributions on analyzing and interpreting large, complex and aggregated datasets, and highlights numerous applications in economics, finance, computer science, political science and education. It gathers a selection of peer-reviewed contributions presented at the 16th Conference of the International Federation of Classification Societies (IFCS 2019), which was organized by the Greek Society of Data Analysis and held in Thessaloniki, Greece, on August 26-29, 2019.

Data Intensive Industrial Asset Management - IoT-based Algorithms and Implementation (Paperback, 1st ed. 2020): Farhad Balali,... Data Intensive Industrial Asset Management - IoT-based Algorithms and Implementation (Paperback, 1st ed. 2020)
Farhad Balali, Jessie Nouri, Adel Nasiri, Tian Zhao
R2,957 Discovery Miles 29 570 Ships in 10 - 15 working days

This book presents a step by step Asset Health Management Optimization Approach Using Internet of Things (IoT). The authors provide a comprehensive study which includes the descriptive, diagnostic, predictive, and prescriptive analysis in detail. The presentation focuses on the challenges of the parameter selection, statistical data analysis, predictive algorithms, big data storage and selection, data pattern recognition, machine learning techniques, asset failure distribution estimation, reliability and availability enhancement, condition based maintenance policy, failure detection, data driven optimization algorithm, and a multi-objective optimization approach, all of which can significantly enhance the reliability and availability of the system.

Applied Data Science Using PySpark - Learn the End-to-End Predictive Model-Building Cycle (Paperback, 1st ed.): Ramcharan... Applied Data Science Using PySpark - Learn the End-to-End Predictive Model-Building Cycle (Paperback, 1st ed.)
Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla
R1,472 R1,169 Discovery Miles 11 690 Save R303 (21%) Ships in 10 - 15 working days

Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines. By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets. What You Will Learn Build an end-to-end predictive model Implement multiple variable selection techniques Operationalize models Master multiple algorithms and implementations Who This Book is For Data scientists and machine learning and deep learning engineers who want to learn and use PySpark for real-time analysis of streaming data.

Integrated Business Information Systems - A Holistic View of the Linked Business Process Chain ERP-SCM-CRM-BI-Big Data... Integrated Business Information Systems - A Holistic View of the Linked Business Process Chain ERP-SCM-CRM-BI-Big Data (Hardcover, 2nd ed. 2020)
Klaus-Dieter Gronwald
R2,452 Discovery Miles 24 520 Ships in 10 - 15 working days

Enterprise Resource Planning (ERP), Supply Chain Management (SCM), Customer Relationship Management (CRM), Business Intelligence (BI) and Big Data analytics (BDA) are business related tasks and processes, which are supported by standardized software solutions. The book explains that this requires business-oriented thinking and acting from IT specialists and data scientists. It is a good idea to let students experience this directly from the business perspective, for example as executives of a virtual company in a role-playing game. The second edition of the book has been completely revised, restructured and supplemented with actual topics such as blockchains in supply chains and the correlation between Big Data analytics, artificial intelligence and machine learning. The structure of the book is based on the gradual implementation and integration of the respective information systems from the business and management perspectives. Part I contains chapters with detailed descriptions of the topics supplemented by online tests and exercises. Part II introduces role play and the online gaming and simulation environment. Supplementary teaching material, presentations, templates, and video clips are available online in the gaming area. The gaming and business simulation Kdibisglobal.com, newly created for this book, now includes a beer division, a bottled water division, a soft drink division and a manufacturing division for barcode cash register scanner with their specific business processes and supply chains.

Practical User Research - Everything You Need to Know to Integrate User Research to Your Product Development (Paperback, 1st... Practical User Research - Everything You Need to Know to Integrate User Research to Your Product Development (Paperback, 1st ed.)
Emmanuelle Savarit
R1,771 R1,368 Discovery Miles 13 680 Save R403 (23%) Ships in 10 - 15 working days

Explore how User Research has been influenced over the years by a range of disciplines, such as HCI, usability, anthropology, cognitive psychology, ergonomics etc. This book aims to contribute to the User Research community and covers topics that will help UX professionals, students and stakeholders to gain a better understanding of what User Research is. Throughout the book you will acquire a practical skill set, ranging from how to get the research going, to building a case in order to receive the budget and resources needed. It will provide you with a clear account of how to organise your research, how to plan it, and how to manage stakeholders' expectations throughout the project. You'll see how to fit User Research into your organization and incorporate it through the different product development phases (Discovery, Alpha, Beta until Live), as well as how to grow a User Research team. Practical User Research reviews the methodologies used for User Research, looks at how to recruit participants along with how to collect and analyse data, finally focusing on how to interpret and present your findings. Cross-cultural research, accessibility and assisted digital research will also be discussed throughout this book. The final chapter gives you 10 project briefs, with which you will be able to apply your new skill set and put into practice what you have learnt. What You'll Learn Integrate user research into your business Apply user research to your product development cycle Review the appropriate processes necessary to carry out user research Take a pragmatic approach to user research, method by method Who This Book Is For Anyone that wants to understand more about user research.

Big Data 2.0 Processing Systems - A Systems Overview (Hardcover, 2nd ed. 2020): Sherif Sakr Big Data 2.0 Processing Systems - A Systems Overview (Hardcover, 2nd ed. 2020)
Sherif Sakr
R2,193 Discovery Miles 21 930 Ships in 10 - 15 working days

This book provides readers the "big picture" and a comprehensive survey of the domain of big data processing systems. For the past decade, the Hadoop framework has dominated the world of big data processing, yet recently academia and industry have started to recognize its limitations in several application domains and thus, it is now gradually being replaced by a collection of engines that are dedicated to specific verticals (e.g. structured data, graph data, and streaming data). The book explores this new wave of systems, which it refers to as Big Data 2.0 processing systems. After Chapter 1 presents the general background of the big data phenomena, Chapter 2 provides an overview of various general-purpose big data processing systems that allow their users to develop various big data processing jobs for different application domains. In turn, Chapter 3 examines various systems that have been introduced to support the SQL flavor on top of the Hadoop infrastructure and provide competing and scalable performance in the processing of large-scale structured data. Chapter 4 discusses several systems that have been designed to tackle the problem of large-scale graph processing, while the main focus of Chapter 5 is on several systems that have been designed to provide scalable solutions for processing big data streams, and on other sets of systems that have been introduced to support the development of data pipelines between various types of big data processing jobs and systems. Next, Chapter 6 focuses on covering the emerging frameworks and systems in the domain of scalable machine learning and deep learning processing. Lastly, Chapter 7 shares conclusions and an outlook on future research challenges. This new and considerably enlarged second edition not only contains the completely new chapter 6, but also offers a refreshed content for the state-of-the-art in all domains of big data processing over the last years. Overall, the book offers a valuable reference guide for professional, students, and researchers in the domain of big data processing systems. Further, its comprehensive content will hopefully encourage readers to pursue further research on the subject.

The Routledge Handbook of Accounting Information Systems (Hardcover, 2nd edition): Martin Quinn, Erik Strauss The Routledge Handbook of Accounting Information Systems (Hardcover, 2nd edition)
Martin Quinn, Erik Strauss
R5,978 Discovery Miles 59 780 Ships in 12 - 17 working days

Unique reference book covering the entire field of accounting information systems. Contributions from an international range of accounting and information systems experts. Includes coverage of contemporary themes such as big data, data security, cloud computing, IoT and blockchain.

Modern Data Mining Algorithms in C++ and CUDA C - Recent Developments in Feature Extraction and Selection Algorithms for Data... Modern Data Mining Algorithms in C++ and CUDA C - Recent Developments in Feature Extraction and Selection Algorithms for Data Science (Paperback, 1st ed.)
Timothy Masters
R1,749 R1,347 Discovery Miles 13 470 Save R402 (23%) Ships in 10 - 15 working days

Discover a variety of data-mining algorithms that are useful for selecting small sets of important features from among unwieldy masses of candidates, or extracting useful features from measured variables. As a serious data miner you will often be faced with thousands of candidate features for your prediction or classification application, with most of the features being of little or no value. You'll know that many of these features may be useful only in combination with certain other features while being practically worthless alone or in combination with most others. Some features may have enormous predictive power, but only within a small, specialized area of the feature space. The problems that plague modern data miners are endless. This book helps you solve this problem by presenting modern feature selection techniques and the code to implement them. Some of these techniques are: Forward selection component analysis Local feature selection Linking features and a target with a hidden Markov model Improvements on traditional stepwise selection Nominal-to-ordinal conversion All algorithms are intuitively justified and supported by the relevant equations and explanatory material. The author also presents and explains complete, highly commented source code. The example code is in C++ and CUDA C but Python or other code can be substituted; the algorithm is important, not the code that's used to write it. What You Will Learn Combine principal component analysis with forward and backward stepwise selection to identify a compact subset of a large collection of variables that captures the maximum possible variation within the entire set. Identify features that may have predictive power over only a small subset of the feature domain. Such features can be profitably used by modern predictive models but may be missed by other feature selection methods. Find an underlying hidden Markov model that controls the distributions of feature variables and the target simultaneously. The memory inherent in this method is especially valuable in high-noise applications such as prediction of financial markets. Improve traditional stepwise selection in three ways: examine a collection of 'best-so-far' feature sets; test candidate features for inclusion with cross validation to automatically and effectively limit model complexity; and at each step estimate the probability that our results so far could be just the product of random good luck. We also estimate the probability that the improvement obtained by adding a new variable could have been just good luck. Take a potentially valuable nominal variable (a category or class membership) that is unsuitable for input to a prediction model, and assign to each category a sensible numeric value that can be used as a model input. Who This Book Is For Intermediate to advanced data science programmers and analysts.

Data Intensive Industrial Asset Management - IoT-based Algorithms and Implementation (Hardcover, 1st ed. 2020): Farhad Balali,... Data Intensive Industrial Asset Management - IoT-based Algorithms and Implementation (Hardcover, 1st ed. 2020)
Farhad Balali, Jessie Nouri, Adel Nasiri, Tian Zhao
R2,979 Discovery Miles 29 790 Ships in 10 - 15 working days

This book presents a step by step Asset Health Management Optimization Approach Using Internet of Things (IoT). The authors provide a comprehensive study which includes the descriptive, diagnostic, predictive, and prescriptive analysis in detail. The presentation focuses on the challenges of the parameter selection, statistical data analysis, predictive algorithms, big data storage and selection, data pattern recognition, machine learning techniques, asset failure distribution estimation, reliability and availability enhancement, condition based maintenance policy, failure detection, data driven optimization algorithm, and a multi-objective optimization approach, all of which can significantly enhance the reliability and availability of the system.

The Palgrave Handbook of Survey Research (Paperback, Softcover reprint of the original 1st ed. 2018): David L. Vannette, Jon A.... The Palgrave Handbook of Survey Research (Paperback, Softcover reprint of the original 1st ed. 2018)
David L. Vannette, Jon A. Krosnick
R6,649 Discovery Miles 66 490 Ships in 10 - 15 working days

This handbook is a comprehensive reference guide for researchers, funding agencies and organizations engaged in survey research. Drawing on research from a world-class team of experts, this collection addresses the challenges facing survey-based data collection today as well as the potential opportunities presented by new approaches to survey research, including in the development of policy. It examines innovations in survey methodology and how survey scholars and practitioners should think about survey data in the context of the explosion of new digital sources of data. The Handbook is divided into four key sections: the challenges faced in conventional survey research; opportunities to expand data collection; methods of linking survey data with external sources; and, improving research transparency and data dissemination, with a focus on data curation, evaluating the usability of survey project websites, and the credibility of survey-based social science. Chapter 23 of this book is open access under a CC BY 4.0 license at link.springer.com.

Big Data and Visual Analytics (Paperback, Softcover reprint of the original 1st ed. 2017): Sang C Suh, Thomas Anthony Big Data and Visual Analytics (Paperback, Softcover reprint of the original 1st ed. 2017)
Sang C Suh, Thomas Anthony
R4,230 Discovery Miles 42 300 Ships in 10 - 15 working days

This book provides users with cutting edge methods and technologies in the area of big data and visual analytics, as well as an insight to the big data and data analytics research conducted by world-renowned researchers in this field. The authors present comprehensive educational resources on big data and visual analytics covering state-of-the art techniques on data analytics, data and information visualization, and visual analytics. Each chapter covers specific topics related to big data and data analytics as virtual data machine, security of big data, big data applications, high performance computing cluster, and big data implementation techniques. Every chapter includes a description of an unique contribution to the area of big data and visual analytics. This book is a valuable resource for researchers and professionals working in the area of big data, data analytics, and information visualization. Advanced-level students studying computer science will also find this book helpful as a secondary textbook or reference.

Applied Statistics with R - A Practical Guide for the Life Sciences (Paperback): Justin C. Touchon Applied Statistics with R - A Practical Guide for the Life Sciences (Paperback)
Justin C. Touchon
R1,429 Discovery Miles 14 290 Ships in 10 - 15 working days

The statistical analyses that students of the life-sciences are being expected to perform are becoming increasingly advanced. Whether at the undergraduate, graduate, or post-graduate level, this book provides the tools needed to properly analyze your data in an efficient, accessible, plainspoken, frank, and occasionally humorous manner, ensuring that readers come away with the knowledge of which analyses they should use and when they should use them. The book uses the statistical language R, which is the choice of ecologists worldwide and is rapidly becoming the 'go-to' stats program throughout the life-sciences. Furthermore, by using a single, real-world dataset throughout the book, readers are encouraged to become deeply familiar with an imperfect but realistic set of data. Indeed, early chapters are specifically designed to teach basic data manipulation skills and build good habits in preparation for learning more advanced analyses. This approach also demonstrates the importance of viewing data through different lenses, facilitating an easy and natural progression from linear and generalized linear models through to mixed effects versions of those same analyses. Readers will also learn advanced plotting and data-wrangling techniques, and gain an introduction to writing their own functions. Applied Statistics with R is suitable for senior undergraduate and graduate students, professional researchers, and practitioners throughout the life-sciences, whether in the fields of ecology, evolution, environmental studies, or computational biology.

Data Clustering in C++ - An Object-Oriented Approach (Paperback): Guojun Gan Data Clustering in C++ - An Object-Oriented Approach (Paperback)
Guojun Gan
R1,991 Discovery Miles 19 910 Ships in 12 - 17 working days

Data clustering is a highly interdisciplinary field, the goal of which is to divide a set of objects into homogeneous groups such that objects in the same group are similar and objects in different groups are quite distinct. Thousands of theoretical papers and a number of books on data clustering have been published over the past 50 years. However, few books exist to teach people how to implement data clustering algorithms. This book was written for anyone who wants to implement or improve their data clustering algorithms. Using object-oriented design and programming techniques, Data Clustering in C++ exploits the commonalities of all data clustering algorithms to create a flexible set of reusable classes that simplifies the implementation of any data clustering algorithm. Readers can follow the development of the base data clustering classes and several popular data clustering algorithms. Additional topics such as data pre-processing, data visualization, cluster visualization, and cluster interpretation are briefly covered. This book is divided into three parts-- Data Clustering and C++ Preliminaries: A review of basic concepts of data clustering, the unified modeling language, object-oriented programming in C++, and design patterns A C++ Data Clustering Framework: The development of data clustering base classes Data Clustering Algorithms: The implementation of several popular data clustering algorithms A key to learning a clustering algorithm is to implement and experiment the clustering algorithm. Complete listings of classes, examples, unit test cases, and GNU configuration files are included in the appendices of this book as well as in the downloadable resources. The only requirements to compile the code are a modern C++ compiler and the Boost C++ libraries.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Computer Intensive Statistical Methods…
J.S. Urban Hjorth Paperback R1,865 Discovery Miles 18 650
Multilevel Modeling - Methodological…
Steven P. Reise, Naihua Duan Hardcover R4,159 Discovery Miles 41 590
Data Visualization with Excel Dashboards…
D Kusleika Paperback R769 Discovery Miles 7 690
Fundamentals of Data Engineering - Plan…
Joe Reis Paperback R1,353 Discovery Miles 13 530
Technology for Success - Computer…
Mark Ciampa, Jill West, … Paperback  (1)
R1,273 R1,098 Discovery Miles 10 980
New Methods of Market Research and…
G. Scott Erickson Hardcover R2,900 Discovery Miles 29 000
Ethics of Data and Analytics - Concepts…
Kirsten Martin Paperback R1,799 Discovery Miles 17 990
SQL for Data Scientists - A Beginner's…
RMP Teat Paperback R862 Discovery Miles 8 620
Data Clustering in C++ - An…
Guojun Gan Hardcover R4,186 Discovery Miles 41 860
Functional Aesthetics for Data…
V Setlur Paperback R738 Discovery Miles 7 380

 

Partners