![]() |
Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
||
|
Books > Computing & IT > Applications of computing > Databases > Data mining
Spectral Feature Selection for Data Mining introduces a novel feature selection technique that establishes a general platform for studying existing feature selection algorithms and developing new algorithms for emerging problems in real-world applications. This technique represents a unified framework for supervised, unsupervised, and semisupervised feature selection. The book explores the latest research achievements, sheds light on new research directions, and stimulates readers to make the next creative breakthroughs. It presents the intrinsic ideas behind spectral feature selection, its theoretical foundations, its connections to other algorithms, and its use in handling both large-scale data sets and small sample problems. The authors also cover feature selection and feature extraction, including basic concepts, popular existing algorithms, and applications. A timely introduction to spectral feature selection, this book illustrates the potential of this powerful dimensionality reduction technique in high-dimensional data processing. Readers learn how to use spectral feature selection to solve challenging problems in real-life applications and discover how general feature selection and extraction are connected to spectral feature selection.
The latest inventions in internet technology influence most of business and daily activities. Internet security, internet data management, web search, data grids, cloud computing, and web-based applications play vital roles, especially in business and industry, as more transactions go online and mobile. Issues related to ubiquitous computing are becoming critical. Internet technology and data engineering should reinforce efficiency and effectiveness of business processes. These technologies should help people make better and more accurate decisions by presenting necessary information and possible consequences for the decisions. Intelligent information systems should help us better understand and manage information with ubiquitous data repository and cloud computing. This book is a compilation of some recent research findings in Internet Technology and Data Engineering. This book provides state-of-the-art accounts in computational algorithms/tools, database management and database technologies, intelligent information systems, data engineering applications, internet security, internet data management, web search, data grids, cloud computing, web-based application, and other related topics.
This book will focus on utilizing statistical modelling of the software source code, in order to resolve issues associated with the software development processes. Writing and maintaining software source code is a costly business; software developers need to constantly rely on large existing code bases. Statistical modelling identifies the patterns in software artifacts and utilize them for predicting the possible issues.
This open access book describes the results of natural language processing and machine learning methods applied to clinical text from electronic patient records. It is divided into twelve chapters. Chapters 1-4 discuss the history and background of the original paper-based patient records, their purpose, and how they are written and structured. These initial chapters do not require any technical or medical background knowledge. The remaining eight chapters are more technical in nature and describe various medical classifications and terminologies such as ICD diagnosis codes, SNOMED CT, MeSH, UMLS, and ATC. Chapters 5-10 cover basic tools for natural language processing and information retrieval, and how to apply them to clinical text. The difference between rule-based and machine learning-based methods, as well as between supervised and unsupervised machine learning methods, are also explained. Next, ethical concerns regarding the use of sensitive patient records for research purposes are discussed, including methods for de-identifying electronic patient records and safely storing patient records. The book's closing chapters present a number of applications in clinical text mining and summarise the lessons learned from the previous chapters. The book provides a comprehensive overview of technical issues arising in clinical text mining, and offers a valuable guide for advanced students in health informatics, computational linguistics, and information retrieval, and for researchers entering these fields.
Introduction to Bio-Ontologies explores the computational background of ontologies. Emphasizing computational and algorithmic issues surrounding bio-ontologies, this self-contained text helps readers understand ontological algorithms and their applications. The first part of the book defines ontology and bio-ontologies. It also explains the importance of mathematical logic for understanding concepts of inference in bio-ontologies, discusses the probability and statistics topics necessary for understanding ontology algorithms, and describes ontology languages, including OBO (the preeminent language for bio-ontologies), RDF, RDFS, and OWL. The second part covers significant bio-ontologies and their applications. The book presents the Gene Ontology; upper-level ontologies, such as the Basic Formal Ontology and the Relation Ontology; and current bio-ontologies, including several anatomy ontologies, Chemical Entities of Biological Interest, Sequence Ontology, Mammalian Phenotype Ontology, and Human Phenotype Ontology. The third part of the text introduces the major graph-based algorithms for bio-ontologies. The authors discuss how these algorithms are used in overrepresentation analysis, model-based procedures, semantic similarity analysis, and Bayesian networks for molecular biology and biomedical applications. With a focus on computational reasoning topics, the final part describes the ontology languages of the Semantic Web and their applications for inference. It covers the formal semantics of RDF and RDFS, OWL inference rules, a key inference algorithm, the SPARQL query language, and the state of the art for querying OWL ontologies. Web Resource This book provides readers with the foundation to use ontologies as a starting point for new bioinformatics research projects or to support current molecular genetics research projects. By supplying a self-contained introduction to OBO ontologies and the Semantic Web, it bridges the gap between both fields and helps readers see what each can contribute to the analysis and understanding of biomedical data.
The book focuses on how machine learning and the Internet of Things (IoT) has empowered the advancement of information driven arrangements including key concepts and advancements. Ontologies that are used in heterogeneous IoT environments have been discussed including interpretation, context awareness, analyzing various data sources, machine learning algorithms and intelligent services and applications. Further, it includes unsupervised and semi-supervised machine learning techniques with study of semantic analysis and thorough analysis of reviews. Divided into sections such as machine learning, security, IoT and data mining, the concepts are explained with practical implementation including results. Key Features Follows an algorithmic approach for data analysis in machine learning Introduces machine learning methods in applications Address the emerging issues in computing such as deep learning, machine learning, Internet of Things and data analytics Focuses on machine learning techniques namely unsupervised and semi-supervised for unseen and seen data sets Case studies are covered relating to human health, transportation and Internet applications
Data mining is one of the most rapidly growing research areas in computer science and statistics. In Volume 2 of this three volume series, we have brought together contributions from some of the most prestigious researchers in theoretical data mining. Each of the chapters is self contained. Statisticians and applied scientists/ engineers will find this volume valuable. Additionally, it provides a sourcebook for graduate students interested in the current direction of research in data mining.
Providing a complete review of existing work in music emotion developed in psychology and engineering, Music Emotion Recognition explains how to account for the subjective nature of emotion perception in the development of automatic music emotion recognition (MER) systems. Among the first publications dedicated to automatic MER, it begins with a comprehensive introduction to the essential aspects of MER-including background, key techniques, and applications. This ground-breaking reference examines emotion from a dimensional perspective. It defines emotions in music as points in a 2D plane in terms of two of the most fundamental emotion dimensions according to psychologists-valence and arousal. The authors present a computational framework that generalizes emotion recognition from the categorical domain to real-valued 2D space. They also: Introduce novel emotion-based music retrieval and organization methods Describe a ranking-base emotion annotation and model training method Present methods that integrate information extracted from lyrics, chord sequence, and genre metadata for improved accuracy Consider an emotion-based music retrieval system that is particularly useful for mobile devices The book details techniques for addressing the issues related to: the ambiguity and granularity of emotion description, heavy cognitive load of emotion annotation, subjectivity of emotion perception, and the semantic gap between low-level audio signal and high-level emotion perception. Complete with more than 360 useful references, 12 example MATLAB (R) codes, and a listing of key abbreviations and acronyms, this cutting-edge guide supplies the technical understanding and tools needed to develop your own automatic MER system based on the automatic recognition model.
Biomarker discovery is an important area of biomedical research that may lead to significant breakthroughs in disease analysis and targeted therapy. Biomarkers are biological entities whose alterations are measurable and are characteristic of a particular biological condition. Discovering, managing, and interpreting knowledge of new biomarkers are challenging and attractive problems in the emerging field of biomedical informatics. This volume is a collection of state-of-the-art research into the application of data mining to the discovery and analysis of new biomarkers. Presenting new results, models and algorithms, the included contributions focus on biomarker data integration, information retrieval methods, and statistical machine learning techniques. This volume is intended for students, and researchers in bioinformatics, proteomics, and genomics, as well engineers and applied scientists interested in the interdisciplinary application of data mining techniques.
This volume comprises the 6th IFIP International Conference on Intelligent Infor- tion Processing. As the world proceeds quickly into the Information Age, it encounters both successes and challenges, and it is well recognized nowadays that intelligent information processing provides the key to the Information Age and to mastering many of these challenges. Intelligent information processing supports the most - vanced productive tools that are said to be able to change human life and the world itself. However, the path is never a straight one and every new technology brings with it a spate of new research problems to be tackled by researchers; as a result we are not running out of topics; rather the demand is ever increasing. This conference provides a forum for engineers and scientists in academia and industry to present their latest research findings in all aspects of intelligent information processing. This is the 6th IFIP International Conference on Intelligent Information Processing. We received more than 50 papers, of which 35 papers are included in this program as regular papers and 4 as short papers. We are grateful for the dedicated work of both the authors and the referees, and we hope these proceedings will continue to bear fruit over the years to come. All papers submitted were reviewed by two referees. A conference such as this cannot succeed without help from many individuals who contributed their valuable time and expertise.
The book provides an overview of the state-of-the-art of map construction algorithms, which use tracking data in the form of trajectories to generate vector maps. The most common trajectory type is GPS-based trajectories. It introduces three emerging algorithmic categories, outlines their general algorithmic ideas, and discusses three representative algorithms in greater detail. To quantify map construction algorithms, the authors include specific datasets and evaluation measures. The datasets, source code of map construction algorithms and evaluation measures are publicly available on http://www.mapconstruction.org. The web site serves as a repository for map construction data and algorithms and researchers can contribute by uploading their own code and benchmark data. Map Construction Algorithms is an excellent resource for professionals working in computational geometry, spatial databases, and GIS. Advanced-level students studying computer science, geography and mathematics will also find this book a useful tool.
Data mining consists of attempting to discover novel and useful knowledge from data, trying to find patterns among datasets that can help in intelligent decision making. However, reports of real-world case studies are not generally detailed in the literature, due to the fact that they are usually based on proprietary datasets, making it impossible to publish the results. This kind of situation makes hard to evaluate, in a precise way, the degree of effectiveness of data mining techniques in real-world applications. On the other hand, researchers of this field of expertise usually exploit public-domain datasets. This volume offers a wide spectrum of research work developed for data mining for real-world application. In the following, we give a brief introduction of the chapters that are included in this book.
Like a data-guzzling turbo engine, advanced data mining has been powering post-genome biological studies for two decades. Reflecting this growth, Biological Data Mining presents comprehensive data mining concepts, theories, and applications in current biological and medical research. Each chapter is written by a distinguished team of interdisciplinary data mining researchers who cover state-of-the-art biological topics. The first section of the book discusses challenges and opportunities in analyzing and mining biological sequences and structures to gain insight into molecular functions. The second section addresses emerging computational challenges in interpreting high-throughput Omics data. The book then describes the relationships between data mining and related areas of computing, including knowledge representation, information retrieval, and data integration for structured and unstructured biological data. The last part explores emerging data mining opportunities for biomedical applications. This volume examines the concepts, problems, progress, and trends in developing and applying new data mining techniques to the rapidly growing field of genome biology. By studying the concepts and case studies presented, readers will gain significant insight and develop practical solutions for similar biological data mining projects in the future.
This book covers deep-learning-based approaches for sentiment analysis, a relatively new, but fast-growing research area, which has significantly changed in the past few years. The book presents a collection of state-of-the-art approaches, focusing on the best-performing, cutting-edge solutions for the most common and difficult challenges faced in sentiment analysis research. Providing detailed explanations of the methodologies, the book is a valuable resource for researchers as well as newcomers to the field.
Most life science researchers will agree that biology is not a truly theoretical branch of science. The hype around computational biology and bioinformatics beginning in the nineties of the 20th century was to be short lived (1, 2). When almost no value of practical importance such as the optimal dose of a drug or the three-dimensional structure of an orphan protein can be computed from fundamental principles, it is still more straightforward to determine them experimentally. Thus, experiments and observationsdogeneratetheoverwhelmingpartofinsightsintobiologyandmedicine. The extrapolation depth and the prediction power of the theoretical argument in life sciences still have a long way to go. Yet, two trends have qualitatively changed the way how biological research is done today. The number of researchers has dramatically grown and they, armed with the same protocols, have produced lots of similarly structured data. Finally, high-throu- put technologies such as DNA sequencing or array-based expression profiling have been around for just a decade. Nevertheless, with their high level of uniform data generation, they reach the threshold of totally describing a living organism at the biomolecular level for the first time in human history. Whereas getting exact data about living systems and the sophistication of experimental procedures have primarily absorbed the minds of researchers previously, the weight increasingly shifts to the problem of interpreting accumulated data in terms of biological function and bio- lecular mechanisms.
Describes the State-of-the-Art in Spatial Data Mining, Focuses on Data Quality Substantial progress has been made toward developing effective techniques for spatial information processing in recent years. This science deals with models of reality in a GIS, however, and not with reality itself. Therefore, spatial information processes are often imprecise, allowing for much interpretation of abstract figures and data. Quality Aspects in Spatial Data Mining introduces practical and theoretical solutions for making sense of the often chaotic and overwhelming amount of concrete data available to researchers. In this cohesive collection of peer-reviewed chapters, field authorities present the latest field advancements and cover such essential areas as data acquisition, geoinformation theory, spatial statistics, and dissemination. Each chapter debuts with an editorial preview of each topic from a conceptual, applied, and methodological point of view, making it easier for researchers to judge which information is most beneficial to their work. Chapters Evolve From Error Propagation and Spatial Statistics to Address Relevant Applications The book advises the use of granular computing as a means of circumventing spatial complexities. This counter-application to traditional computing allows for the calculation of imprecise probabilities - the kind of information that the spatial information systems community wrestles with much of the time. Under the editorial guidance of internationally respected geoinformatics experts, this indispensable volume addresses quality aspects in the entire spatial data mining process, from data acquisition to end user. It also alleviates what is oftenfield researchers' most daunting task by organizing the wealth of concrete spatial data available into one convenient source, thereby advancing the frontiers of spatial information systems.
This textbook offers a comprehensive introduction to Machine Learning techniques and algorithms. This Third Edition covers newer approaches that have become highly topical, including deep learning, and auto-encoding, introductory information about temporal learning and hidden Markov models, and a much more detailed treatment of reinforcement learning. The book is written in an easy-to-understand manner with many examples and pictures, and with a lot of practical advice and discussions of simple applications. The main topics include Bayesian classifiers, nearest-neighbor classifiers, linear and polynomial classifiers, decision trees, rule-induction programs, artificial neural networks, support vector machines, boosting algorithms, unsupervised learning (including Kohonen networks and auto-encoding), deep learning, reinforcement learning, temporal learning (including long short-term memory), hidden Markov models, and the genetic algorithm. Special attention is devoted to performance evaluation, statistical assessment, and to many practical issues ranging from feature selection and feature construction to bias, context, multi-label domains, and the problem of imbalanced classes.
This book develops two key machine learning principles: the semi-supervised paradigm and learning with interdependent data. It reveals new applications, primarily web related, that transgress the classical machine learning framework through learning with interdependent data. The book traces how the semi-supervised paradigm and the learning to rank paradigm emerged from new web applications, leading to a massive production of heterogeneous textual data. It explains how semi-supervised learning techniques are widely used, but only allow a limited analysis of the information content and thus do not meet the demands of many web-related tasks. Later chapters deal with the development of learning methods for ranking entities in a large collection with respect to precise information needed. In some cases, learning a ranking function can be reduced to learning a classification function over the pairs of examples. The book proves that this task can be efficiently tackled in a new framework: learning with interdependent data. Researchers and professionals in machine learning will find these new perspectives and solutions valuable. Learning with Partially Labeled and Interdependent Data is also useful for advanced-level students of computer science, particularly those focused on statistics and learning.
Introduction to Algorithms for Data Mining and Machine Learning introduces the essential ideas behind all key algorithms and techniques for data mining and machine learning, along with optimization techniques. Its strong formal mathematical approach, well selected examples, and practical software recommendations help readers develop confidence in their data modeling skills so they can process and interpret data for classification, clustering, curve-fitting and predictions. Masterfully balancing theory and practice, it is especially useful for those who need relevant, well explained, but not rigorous (proofs based) background theory and clear guidelines for working with big data.
This book highlights the applications of data mining technologies in structural dynamic analysis, including structural design, optimization, parameter identification, model updating, damage identification, in civil, mechanical, and aerospace engineering. These engineering applications require precise structural design, fabrication, inspection, and further monitoring to obtain a full life-cycle analysis, and by focusing on data processing, data mining technologies offer another aspect in structural dynamic analysis. Discussing techniques in time/frequency domain, such as Hilbert transforms, wavelet theory, and machine learning for structural dynamic analysis to help in structural monitoring and diagnosis, the book is an essential reference resource for beginners, graduates and industrial professionals in various fields.
The Elements of Knowledge Organization is a unique and original work introducing the fundamental concepts related to the field of Knowledge Organization (KO). There is no other book like it currently available. The author begins the book with a comprehensive discussion of "knowledge" and its associated theories. He then presents a thorough discussion of the philosophical underpinnings of knowledge organization. The author walks the reader through the Knowledge Organization domain expanding the core topics of ontologies, taxonomies, classification, metadata, thesauri and domain analysis. The author also presents the compelling challenges associated with the organization of knowledge. This is the first book focused on the concepts and theories associated with KO domain. Prior to this book, individuals wishing to study Knowledge Organization in its broadest sense would generally collocate their own resources, navigating the various methods and models and perhaps inadvertently excluding relevant materials. This text cohesively links key and related KO material and provides a deeper understanding of the domain in its broadest sense and with enough detail to truly investigate its many facets. This book will be useful to both graduate and undergraduate students in the computer science and information science domains both as a text and as a reference book. It will also be valuable to researchers and practitioners in the industry who are working on website development, database administration, data mining, data warehousing and data for search engines. The book is also beneficial to anyone interested in the concepts and theories associated with the organization of knowledge. Dr. Richard P. Smiraglia is a world-renowned author who is well published in the Knowledge Organization domain. Dr. Smiraglia is editor-in-chief of the journal Knowledge Organization, published by Ergon-Verlag of Wurzburg. He is a professor and member of the Information Organization Research Group at the School of Information Studies at University of Wisconsin Milwaukee.
This collection of contributed chapters demonstrates a wide range of applications within two overlapping research domains: social media analysis and social network analysis. Various methodologies were utilized in the twelve individual chapters including static, dynamic and real-time approaches to graph, textual and multimedia data analysis. The topics apply to reputation computation, emotion detection, topic evolution, rumor propagation, evaluation of textual opinions, friend ranking, analysis of public transportation networks, diffusion in dynamic networks, analysis of contributors to communities of open source software developers, biometric template generation as well as analysis of user behavior within heterogeneous environments of cultural educational centers. Addressing these challenging applications is what makes this edited volume of interest to researchers and students focused on social media and social network analysis.
"Bayesian Networks and Influence Diagrams: A Guide to Construction and Analysis, Second Edition, "provides a comprehensive guide for practitioners who wish to understand, construct, and analyze intelligent systems for decision support based on probabilistic networks. This new edition contains six new sections, in addition to fully-updated examples, tables, figures, and a revised appendix. Intended primarily for practitioners, this book does not require sophisticated mathematical skills or deep understanding of the underlying theory and methods nor does it discuss alternative technologies for reasoning under uncertainty. The theory and methods presented are illustrated through more than 140 examples, and exercises are included for the reader to check his or her level of understanding. The techniques and methods presented for knowledge elicitation, model construction and verification, modeling techniques and tricks, learning models from data, and analyses of models have all been developed and refined on the basis of numerous courses that the authors have held for practitioners worldwide. " |
You may like...
Handbook of Research on Automated…
Mrutyunjaya Panda, Harekrishna Misra
Hardcover
R7,766
Discovery Miles 77 660
Enhancing Academic Research With…
Dhananjay Subhashchandra Deshpande, Narayan Bhosale, …
Hardcover
R4,999
Discovery Miles 49 990
|