![]() |
Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
||
|
Books > Computing & IT > Applications of computing > Databases > Data capture & analysis
This book provides readers the "big picture" and a comprehensive survey of the domain of big data processing systems. For the past decade, the Hadoop framework has dominated the world of big data processing, yet recently academia and industry have started to recognize its limitations in several application domains and thus, it is now gradually being replaced by a collection of engines that are dedicated to specific verticals (e.g. structured data, graph data, and streaming data). The book explores this new wave of systems, which it refers to as Big Data 2.0 processing systems. After Chapter 1 presents the general background of the big data phenomena, Chapter 2 provides an overview of various general-purpose big data processing systems that allow their users to develop various big data processing jobs for different application domains. In turn, Chapter 3 examines various systems that have been introduced to support the SQL flavor on top of the Hadoop infrastructure and provide competing and scalable performance in the processing of large-scale structured data. Chapter 4 discusses several systems that have been designed to tackle the problem of large-scale graph processing, while the main focus of Chapter 5 is on several systems that have been designed to provide scalable solutions for processing big data streams, and on other sets of systems that have been introduced to support the development of data pipelines between various types of big data processing jobs and systems. Next, Chapter 6 focuses on covering the emerging frameworks and systems in the domain of scalable machine learning and deep learning processing. Lastly, Chapter 7 shares conclusions and an outlook on future research challenges. This new and considerably enlarged second edition not only contains the completely new chapter 6, but also offers a refreshed content for the state-of-the-art in all domains of big data processing over the last years. Overall, the book offers a valuable reference guide for professional, students, and researchers in the domain of big data processing systems. Further, its comprehensive content will hopefully encourage readers to pursue further research on the subject.
Big Data in Psychiatry and Neurology provides an up-to-date overview of achievements in the field of big data in Psychiatry and Medicine, including applications of big data methods to aging disorders (e.g., Alzheimer's disease and Parkinson's disease), mood disorders (e.g., major depressive disorder), and drug addiction. This book will help researchers, students and clinicians implement new methods for collecting big datasets from various patient populations. Further, it will demonstrate how to use several algorithms and machine learning methods to analyze big datasets, thus providing individualized treatment for psychiatric and neurological patients. As big data analytics is gaining traction in psychiatric research, it is an essential component in providing predictive models for both clinical practice and public health systems. As compared with traditional statistical methods that provide primarily average group-level results, big data analytics allows predictions and stratification of clinical outcomes at an individual subject level.
Making use of data is not anymore a niche project but central to almost every project. With access to massive compute resources and vast amounts of data, it seems at least in principle possible to solve any problem. However, successful data science projects result from the intelligent application of: human intuition in combination with computational power; sound background knowledge with computer-aided modelling; and critical reflection of the obtained insights and results. Substantially updating the previous edition, then entitled Guide to Intelligent Data Analysis, this core textbook continues to provide a hands-on instructional approach to many data science techniques, and explains how these are used to solve real world problems. The work balances the practical aspects of applying and using data science techniques with the theoretical and algorithmic underpinnings from mathematics and statistics. Major updates on techniques and subject coverage (including deep learning) are included. Topics and features: guides the reader through the process of data science, following the interdependent steps of project understanding, data understanding, data blending and transformation, modeling, as well as deployment and monitoring; includes numerous examples using the open source KNIME Analytics Platform, together with an introductory appendix; provides a review of the basics of classical statistics that support and justify many data analysis methods, and a glossary of statistical terms; integrates illustrations and case-study-style examples to support pedagogical exposition; supplies further tools and information at an associated website. This practical and systematic textbook/reference is a "need-to-have" tool for graduate and advanced undergraduate students and essential reading for all professionals who face data science problems. Moreover, it is a "need to use, need to keep" resource following one's exploration of the subject.
Knowledge Discovery in Big Data from Astronomy and Earth Observation: Astrogeoinformatics bridges the gap between astronomy and geoscience in the context of applications, techniques and key principles of big data. Machine learning and parallel computing are increasingly becoming cross-disciplinary as the phenomena of Big Data is becoming common place. This book provides insight into the common workflows and data science tools used for big data in astronomy and geoscience. After establishing similarity in data gathering, pre-processing and handling, the data science aspects are illustrated in the context of both fields. Software, hardware and algorithms of big data are addressed. Finally, the book offers insight into the emerging science which combines data and expertise from both fields in studying the effect of cosmos on the earth and its inhabitants.
E-health applications such as tele-medicine, tele-radiology, tele-ophthalmology, and tele-diagnosis are very promising and have immense potential to improve global healthcare. They can improve access, equity, and quality through the connection of healthcare facilities and healthcare professionals, diminishing geographical and physical barriers. One critical issue, however, is related to the security of data transmission and access to the technologies of medical information. Currently, medical-related identity theft costs billions of dollars each year and altered medical information can put a person's health at risk through misdiagnosis, delayed treatment or incorrect prescriptions. Yet, the use of hand-held devices for storing, accessing, and transmitting medical information is outpacing the privacy and security protections on those devices. Researchers are starting to develop some imperceptible marks to ensure the tamper-proofing, cost effective, and guaranteed originality of the medical records. However, the robustness, security and efficient image archiving and retrieval of medical data information against these cyberattacks is a challenging area for researchers in the field of e-health applications. Intelligent Data Security Solutions for e-Health Applications focuses on cutting-edge academic and industry-related research in this field, with particular emphasis on interdisciplinary approaches and novel techniques to provide security solutions for smart applications. The book provides an overview of cutting-edge security techniques and ideas to help graduate students, researchers, as well as IT professionals who want to understand the opportunities and challenges of using emerging techniques and algorithms for designing and developing more secure systems and methods for e-health applications.
This book highlights advanced applications of geospatial data analytics to address real-world issues in urban society. With a connected world, we are generating spatial at unprecedented rates which can be harnessed for insightful analytics which define the way we analyze past events and define the future directions. This book is an anthology of applications of spatial data and analytics performed on them for gaining insights which can be used for problem solving in an urban setting. Each chapter is contributed by spatially aware data scientists in the making who present spatial perspectives drawn on spatial big data. The book shall benefit mature researchers and student alike to discourse a variety of urban applications which display the use of machine learning algorithms on spatial big data for real-world problem solving.
With the advent of new technologies in big data science, the study of medical problems has made significant progress. Connecting medical studies and computational methods is crucial for the advancement of the medical industry. Big Data Analytics in HIV/AIDS Research provides emerging research on the development and implementation of computational techniques in big data analysis for biological and medical practices. While highlighting topics such as deep learning, management software, and molecular modeling, this publication explores the various applications of data analysis in clinical decision making. This book is a vital resource for medical practitioners, nurses, scientists, researchers, and students seeking current research on the connections between data analytics in the field of medicine.
Edge computing is quickly becoming an important technology throughout a number of fields as businesses and industries alike embrace the benefits it can have in their companies. The streamlining of data is crucial for the development and evolution of businesses in order to keep up with competition and improve functions overall. In order to appropriately utilize edge computing to its full potential, further study is required to examine the potential pitfalls and opportunities of this innovative technology. The Research Anthology on Edge Computing Protocols, Applications, and Integration establishes critical research on the current uses, innovations, and challenges of edge computing across disciplines. The text highlights the history of edge computing and how it has been adapted over time to improve industries. Covering a range of topics such as bandwidth, data centers, and security, this major reference work is ideal for industry professionals, computer scientists, engineers, practitioners, researchers, academicians, scholars, instructors, and students.
The entertainment industry has long been dominated by legendary screenwriter William Goldman's "Nobody-Knows-Anything" mantra, which argues that success is the result of managerial intuition and instinct. This book builds the case that combining such intuition with data analytics and rigorous scholarly knowledge provides a source of sustainable competitive advantage - the same recipe for success that is behind the rise of firms such as Netflix and Spotify, but has also fueled Disney's recent success. Unlocking a large repertoire of scientific studies by business scholars and entertainment economists, the authors identify essential factors, mechanisms, and methods that help a new entertainment product succeed. The book thus offers a timely alternative to "Nobody-Knows" decision-making in the digital era: while coupling a good idea with smart data analytics and entertainment theory cannot guarantee a hit, it systematically and substantially increases the probability of success in the entertainment industry. Entertainment Science is poised to inspire fresh new thinking among managers, students of entertainment, and scholars alike. Thorsten Hennig-Thurau and Mark B. Houston - two of our finest scholars in the area of entertainment marketing - have produced a definitive research-based compendium that cuts across various branches of the arts to explain the phenomena that provide consumption experiences to capture the hearts and minds of audiences. Morris B. Holbrook, W. T. Dillard Professor Emeritus of Marketing, Columbia University Entertainment Science is a must-read for everyone working in the entertainment industry today, where the impact of digital and the use of big data can't be ignored anymore. Hennig-Thurau and Houston are the scientific frontrunners of knowledge that the industry urgently needs. Michael Koelmel, media entrepreneur and Honorary Professor of Media Economics at University of Leipzig Entertainment Science's winning combination of creativity, theory, and data analytics offers managers in the creative industries and beyond a novel, compelling, and comprehensive approach to support their decision-making. This ground-breaking book marks the dawn of a new Golden Age of fruitful conversation between entertainment scholars, managers, and artists. Allegre Hadida, Associate Professor in Strategy, University of Cambridge
This book provides an introduction to spatial analyses concerning disaggregated (or micro) spatial data. Particular emphasis is put on spatial data compilation and the structuring of the connections between the observations. Descriptive analysis methods of spatial data are presented in order to identify and measure the spatial, global and local dependency. The authors then focus on autoregressive spatial models, to control the problem of spatial dependency between the residues of a basic linear statistical model, thereby contravening one of the basic hypotheses of the ordinary least squares approach. This book is a popularized reference for students looking to work with spatialized data, but who do not have the advanced statistical theoretical basics.
This book covers the relationship of recent technologies (such as Blockchain, IoT, and 5G) with the cloud computing as well as fog computing, and mobile edge computing. The relationship will not be limited to only architecture proposal, trends, and technical advancements. However, the book also explores the possibility of predictive analytics in cloud computing with respect to Blockchain, IoT, and 5G. The recent advancements in the internet-supported distributed computing i.e. cloud computing, has made it possible to process the bulk amount of data in a parallel and distributed. This has made it a lucrative technology to process the data generated from technologies such as Blockchain, IoT, and 5G. However, there are several issues a Cloud Service Provider (CSP) encounters, such as Blockchain security in cloud, IoT elasticity and scalability management in cloud, Service Level Agreement (SLA) compliances for 5G, Resource management, Load balancing, and Fault-tolerance. This edited book will discuss the aforementioned issues in connection with Blockchain, IoT, and 5G. Moreover, the book discusses how the cloud computing is not sufficient and one needs to use fog computing, and edge computing to efficiently process the data generated from IoT, and 5G. Moreover, the book shows how smart city, smart healthcare system, and smart communities are few of the most relevant IoT applications where fog computing plays a significant role. The book discusses the limitation of fog computing and the need for the edge computing to further reduce the network latency to process streaming data from IoT devices. The book also explores power of predictive analytics of Blockchain, IoT, and 5G data in cloud computing with its sister technologies. Since, the amount of resources increases day-by day, artificial intelligence (AI) tools are becoming more popular due to their capability which can be used in solving wide variety of issues, such as minimize the energy consumption of physical servers, optimize the service cost, improve the quality of experience, increase the service availability, efficiently handle the huge data flow, manages the large number of IoT devices, etc.
Model Management and Analytics for Large Scale Systems covers the use of models and related artefacts (such as metamodels and model transformations) as central elements for tackling the complexity of building systems and managing data. With their increased use across diverse settings, the complexity, size, multiplicity and variety of those artefacts has increased. Originally developed for software engineering, these approaches can now be used to simplify the analytics of large-scale models and automate complex data analysis processes. Those in the field of data science will gain novel insights on the topic of model analytics that go beyond both model-based development and data analytics. This book is aimed at both researchers and practitioners who are interested in model-based development and the analytics of large-scale models, ranging from big data management and analytics, to enterprise domains. The book could also be used in graduate courses on model development, data analytics and data management.
Big data is a field of research that is growing rapidly, and as the Covid-19 crisis has shown, health care is an area that could benefit greatly from its increased use and application. Big data, as derived partly from the internet of things and analysed according to specific algorithms, has a large and beneficial role to play in preventative medicine, in monitoring the health of specific groups, and in improving diagnostics. Big Data Analytics and Intelligence: A Perspective for Health Care focuses on various areas of health care, ranging from nutrition to cancer, and providing diverse perspectives on all of them. This book explores the entire life-cycle of big data, from information retrieval to analysis, and it shows how big data's applications can enhance, streamline and improve services for patients and health-care professionals. Each chapter focuses on a specific area of health care and how big data is applicable to it, with background and current examples provided.
Intelligent Data Analysis for Biomedical Applications: Challenges and Solutions presents specialized statistical, pattern recognition, machine learning, data abstraction and visualization tools for the analysis of data and discovery of mechanisms that create data. It provides computational methods and tools for intelligent data analysis, with an emphasis on problem-solving relating to automated data collection, such as computer-based patient records, data warehousing tools, intelligent alarming, effective and efficient monitoring, and more. This book provides useful references for educational institutions, industry professionals, researchers, scientists, engineers and practitioners interested in intelligent data analysis, knowledge discovery, and decision support in databases.
This volume provides a comprehensive introduction to mHealth technology and is accessible to technology-oriented researchers and practitioners with backgrounds in computer science, engineering, statistics, and applied mathematics. The contributing authors include leading researchers and practitioners in the mHealth field. The book offers an in-depth exploration of the three key elements of mHealth technology: the development of on-body sensors that can identify key health-related behaviors (sensors to markers), the use of analytic methods to predict current and future states of health and disease (markers to predictors), and the development of mobile interventions which can improve health outcomes (predictors to interventions). Chapters are organized into sections, with the first section devoted to mHealth applications, followed by three sections devoted to the above three key technology areas. Each chapter can be read independently, but the organization of the entire book provides a logical flow from the design of on-body sensing technology, through the analysis of time-varying sensor data, to interactions with a user which create opportunities to improve health outcomes. This volume is a valuable resource to spur the development of this growing field, and ideally suited for use as a textbook in an mHealth course.
Social Network Analytics: Computational Research Methods and Techniques focuses on various technical concepts and aspects of social network analysis. The book features the latest developments and findings in this emerging area of research. In addition, it includes a variety of applications from several domains, such as scientific research, and the business and industrial sectors. The technical aspects of analysis are covered in detail, including visualizing and modeling, network theory, mathematical models, the big data analytics of social networks, multidimensional scaling, and more. As analyzing social network data is rapidly gaining interest in the scientific research community because of the importance of the information and insights that can be culled from the wealth of data inherent in the various aspects of the network, this book provides insights on measuring the relationships and flows between people, groups, organizations, computers, URLs, and more.
Meta-Analytics: Consensus Approaches and System Patterns for Data Analysis presents an exhaustive set of patterns for data science to use on any machine learning based data analysis task. The book virtually ensures that at least one pattern will lead to better overall system behavior than the use of traditional analytics approaches. The book is 'meta' to analytics, covering general analytics in sufficient detail for readers to engage with, and understand, hybrid or meta- approaches. The book has relevance to machine translation, robotics, biological and social sciences, medical and healthcare informatics, economics, business and finance. Inn addition, the analytics within can be applied to predictive algorithms for everyone from police departments to sports analysts.
This textbook grew out of notes for the ECE143 Programming for Data Analysis class that the author has been teaching at University of California, San Diego, which is a requirement for both graduate and undergraduate degrees in Machine Learning and Data Science. This book is ideal for readers with some Python programming experience. The book covers key language concepts that must be understood to program effectively, especially for data analysis applications. Certain low-level language features are discussed in detail, especially Python memory management and data structures. Using Python effectively means taking advantage of its vast ecosystem. The book discusses Python package management and how to use third-party modules as well as how to structure your own Python modules. The section on object-oriented programming explains features of the language that facilitate common programming patterns. After developing the key Python language features, the book moves on to third-party modules that are foundational for effective data analysis, starting with Numpy. The book develops key Numpy concepts and discusses internal Numpy array data structures and memory usage. Then, the author moves onto Pandas and details its many features for data processing and alignment. Because strong visualizations are important for communicating data analysis, key modules such as Matplotlib are developed in detail, along with web-based options such as Bokeh, Holoviews, Altair, and Plotly. The text is sprinkled with many tricks-of-the-trade that help avoid common pitfalls. The author explains the internal logic embodied in the Python language so that readers can get into the Python mindset and make better design choices in their codes, which is especially helpful for newcomers to both Python and data analysis. To get the most out of this book, open a Python interpreter and type along with the many code samples.
The book proposes a systematic approach to big data collection, documentation and development of analytic procedures that foster collaboration on a large scale. This approach, designated as "data factoring" emphasizes the need to think of each individual dataset developed by an individual project as part of a broader data ecosystem, easily accessible and exploitable by parties not directly involved with data collection and documentation. Furthermore, data factoring uses and encourages pre-analytic operations that add value to big data sets, especially recombining and repurposing. The book proposes a research-development agenda that can undergird an ideal data factory approach. Several programmatic chapters discuss specialized issues involved in data factoring (documentation, meta-data specification, building flexible, yet comprehensive data ontologies, usability issues involved in collaborative tools, etc.). The book also presents case studies for data factoring and processing that can lead to building better scientific collaboration and data sharing strategies and tools. Finally, the book presents the teaching utility of data factoring and the ethical and privacy concerns related to it. Chapter 9 of this book is available open access under a CC BY 4.0 license at link.springer.com
Learn the basics of Data Science through an easy to understand conceptual framework and immediately practice using RapidMiner platform. Whether you are brand new to data science or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Science has become an essential tool to extract value from data for any organization that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, engineers, and analytics professionals and for anyone who works with data. You'll be able to: Gain the necessary knowledge of different data science techniques to extract value from data. Master the concepts and inner workings of 30 commonly used powerful data science algorithms. Implement step-by-step data science process using using RapidMiner, an open source GUI based data science platform Data Science techniques covered: Exploratory data analysis, Visualization, Decision trees, Rule induction, k-nearest neighbors, Naive Bayesian classifiers, Artificial neural networks, Deep learning, Support vector machines, Ensemble models, Random forests, Regression, Recommendation engines, Association analysis, K-Means and Density based clustering, Self organizing maps, Text mining, Time series forecasting, Anomaly detection, Feature selection and more...
As millions of people have been exposed to computing through the tremendous growth of microcomputers, there has developed an increasing appreciation of the history of data processing, which dates back many decades before the arrival of the computer. Stretching back to at least the 1860s, such early technologies as adding machines, punch cards, and the office appliance industry are now being recognized for their place in the history of the information processing industry. This work brings together a comprehensive list of sources that offer a general introduction to the literature of the industry. Divided into nine chapters covering topics and historical periods, the bibliography provides an annotated list of published materials describing both the history of the industry and significant items of general interest. Each chapter is introduced with a short review of historically important issues and comments on the literature, and contains contemporary publications as well as more recent material. To give the work a continuing usefulness, ongoing publications, such as computer magazines, are highlighted. Entries are grouped under nearly 100 subheadings, covering such material as contemporary descriptions of hardware and software of the past, seminal technical papers, industry surveys, programming languages, significant individuals and companies, and the role of Japan and microcomputing. All citations are annotated with a brief summary of either the work's contents or its historical importance, while two indexes provide both subject references and author citations. This bibliography will be an important reference source for courses in the history of data processing and business history, and auseful addition to public, college, and university libraries.
This book uses a mathematical approach to deriving the laws of science and technology, based upon the concept of Fisher information. The approach that follows from these ideas is called the principle of Extreme Physical Information (EPI). The authors show how to use EPI to determine the theoretical input/output laws of unknown systems. Will benefit readers whose math skill is at the level of an undergraduate science or engineering degree.
Dependence Analysis may be considered to be the second edition of the author's 1988 book, Dependence Analysis for Supercomputing. It is, however, a completely new work that subsumes the material of the 1988 publication. This book is the third volume in the series Loop Transformations for Restructuring Compilers. This series has been designed to provide a complete mathematical theory of transformations that can be used to automatically change a sequential program containing FORTRAN-like do loops into an equivalent parallel form. In Dependence Analysis, the author extends the model to a program consisting of do loops and assignment statements, where the loops need not be sequentially nested and are allowed to have arbitrary strides. In the context of such a program, the author studies, in detail, dependence between statements of the program caused by program variables that are elements of arrays. Dependence Analysis is directed toward graduate and undergraduate students, and professional writers of restructuring compilers. The prerequisite for the book consists of some knowledge of programming languages, and familiarity with calculus and graph theory. No knowledge of linear programming is required. |
You may like...
Machine Learning for Biometrics…
Partha Pratim Sarangi, Madhumita Panda, …
Paperback
R2,570
Discovery Miles 25 700
Intelligent Data Analysis for e-Learning…
Jorge Miguel, Santi Caballe, …
Paperback
Big Data - Concepts, Methodologies…
Information Reso Management Association
Hardcover
R17,613
Discovery Miles 176 130
Challenges and Applications of Data…
V. Sathiyamoorthi, Atilla Elci
Hardcover
R6,734
Discovery Miles 67 340
Cognitive and Soft Computing Techniques…
Akash Kumar Bhoi, Victor Hugo Costa de Albuquerque, …
Paperback
R2,583
Discovery Miles 25 830
Big Data Analytics for Sustainable…
Anandakumar Haldorai, Arulmurugan Ramu
Hardcover
R6,696
Discovery Miles 66 960
Handbook of Big Data Analytics, Volume 1…
Vadlamani Ravi, Aswani Kumar Cherukuri
Hardcover
|