![]() |
![]() |
Your cart is empty |
||
Books > Computing & IT > Applications of computing > Databases > Data mining
This book covers key issues related to Geospatial Semantic Web, including geospatial web services for spatial data interoperability; geospatial ontology for semantic interoperability; ontology creation, sharing, and integration; querying knowledge and information from heterogeneous data source; interfaces for Geospatial Semantic Web, VGI (Volunteered Geographic Information) and Geospatial Semantic Web; challenges of Geospatial Semantic Web; and development of Geospatial Semantic Web applications. This book also describes state-of-the-art technologies that attempt to solve these problems such as WFS, WMS, RDF, OWL and GeoSPARQL and demonstrates how to use the Geospatial Semantic Web technologies to solve practical real-world problems such as spatial data interoperability.
This book explains the Linked Data domain by adopting a bottom-up approach: it introduces the fundamental Semantic Web technologies and building blocks, which are then combined into methodologies and end-to-end examples for publishing datasets as Linked Data, and use cases that harness scholarly information and sensor data. It presents how Linked Data is used for web-scale data integration, information management and search. Special emphasis is given to the publication of Linked Data from relational databases as well as from real-time sensor data streams. The authors also trace the transformation from the document-based World Wide Web into a Web of Data. Materializing the Web of Linked Data is addressed to researchers and professionals studying software technologies, tools and approaches that drive the Linked Data ecosystem, and the Web in general.
1) Discusses technical details of the Machine Learning tools and techniques in the different types of cancers 2) Machine learning and data mining in healthcare is a very important topic and hence there would be a demand for such a book 3) As compared to other titles, the proposed book focuses on different types of cancer disease and their prediction strategy using machine leaning and data mining.
This textbook brings together both new and traditional research methods in Human Computer Interaction (HCI). Research methods include interviews and observations, ethnography, grounded theory and analysis of digital traces of behavior. Readers will gain an understanding of the type of knowledge each method provides, its disciplinary roots and how each contributes to understanding users, user behavior and the context of use. The background context, clear explanations and sample exercises make this an ideal textbook for graduate students, as well as a valuable reference for researchers and practitioners. 'It is an impressive collection in terms of the level of detail and variety.' (M. Sasikumar, ACM Computing Reviews #CR144066)
This open access book offers a comprehensive and thorough introduction to almost all aspects of metalearning and automated machine learning (AutoML), covering the basic concepts and architecture, evaluation, datasets, hyperparameter optimization, ensembles and workflows, and also how this knowledge can be used to select, combine, compose, adapt and configure both algorithms and models to yield faster and better solutions to data mining and data science problems. It can thus help developers to develop systems that can improve themselves through experience. As one of the fastest-growing areas of research in machine learning, metalearning studies principled methods to obtain efficient models and solutions by adapting machine learning and data mining processes. This adaptation usually exploits information from past experience on other tasks and the adaptive processes can involve machine learning approaches. As a related area to metalearning and a hot topic currently, AutoML is concerned with automating the machine learning processes. Metalearning and AutoML can help AI learn to control the application of different learning methods and acquire new solutions faster without unnecessary interventions from the user. This book is a substantial update of the first edition published in 2009. It includes 18 chapters, more than twice as much as the previous version. This enabled the authors to cover the most relevant topics in more depth and incorporate the overview of recent research in the respective area. The book will be of interest to researchers and graduate students in the areas of machine learning, data mining, data science and artificial intelligence.
In recent years, extensive research has been conducted by eminent mathematicians and engineers whose results and proposed problems are presented in this new volume. It is addressed to graduate students, research mathematicians, physicists, and engineers. Individual contributions are devoted to topics of approximation theory, functional equations and inequalities, fixed point theory, numerical analysis, theory of wavelets, convex analysis, topology, operator theory, differential operators, fractional integral operators, integro-differential equations, ternary algebras, super and hyper relators, variational analysis, discrete mathematics, cryptography, and a variety of applications in interdisciplinary topics. Several of these domains have a strong connection with both theories and problems of linear and nonlinear optimization. The combination of results from various domains provides the reader with a solid, state-of-the-art interdisciplinary reference to theory and problems. Some of the works provide guidelines for further research and proposals for new directions and open problems with relevant discussions.
This book introduces the latest thinking on the use of Big Data in the context of urban systems, including research and insights on human behavior, urban dynamics, resource use, sustainability and spatial disparities, where it promises improved planning, management and governance in the urban sectors (e.g., transportation, energy, smart cities, crime, housing, urban and regional economies, public health, public engagement, urban governance and political systems), as well as Big Data's utility in decision-making, and development of indicators to monitor economic and social activity, and for urban sustainability, transparency, livability, social inclusion, place-making, accessibility and resilience.
This book provides comprehensive coverage of neural networks, their evolution, their structure, the problems they can solve, and their applications. The first half of the book looks at theoretical investigations on artificial neural networks and addresses the key architectures that are capable of implementation in various application scenarios. The second half is designed specifically for the production of solutions using artificial neural networks to solve practical problems arising from different areas of knowledge. It also describes the various implementation details that were taken into account to achieve the reported results. These aspects contribute to the maturation and improvement of experimental techniques to specify the neural network architecture that is most appropriate for a particular application scope. The book is appropriate for students in graduate and upper undergraduate courses in addition to researchers and professionals.
The book provides an overview of the state-of-the-art of map construction algorithms, which use tracking data in the form of trajectories to generate vector maps. The most common trajectory type is GPS-based trajectories. It introduces three emerging algorithmic categories, outlines their general algorithmic ideas, and discusses three representative algorithms in greater detail. To quantify map construction algorithms, the authors include specific datasets and evaluation measures. The datasets, source code of map construction algorithms and evaluation measures are publicly available on http://www.mapconstruction.org. The web site serves as a repository for map construction data and algorithms and researchers can contribute by uploading their own code and benchmark data. Map Construction Algorithms is an excellent resource for professionals working in computational geometry, spatial databases, and GIS. Advanced-level students studying computer science, geography and mathematics will also find this book a useful tool.
This book covers deep-learning-based approaches for sentiment analysis, a relatively new, but fast-growing research area, which has significantly changed in the past few years. The book presents a collection of state-of-the-art approaches, focusing on the best-performing, cutting-edge solutions for the most common and difficult challenges faced in sentiment analysis research. Providing detailed explanations of the methodologies, the book is a valuable resource for researchers as well as newcomers to the field.
This book presents a comprehensive and systematic introduction to transforming process-oriented data into information about the underlying business process, which is essential for all kinds of decision-making. To that end, the authors develop step-by-step models and analytical tools for obtaining high-quality data structured in such a way that complex analytical tools can be applied. The main emphasis is on process mining and data mining techniques and the combination of these methods for process-oriented data. After a general introduction to the business intelligence (BI) process and its constituent tasks in chapter 1, chapter 2 discusses different approaches to modeling in BI applications. Chapter 3 is an overview and provides details of data provisioning, including a section on big data. Chapter 4 tackles data description, visualization, and reporting. Chapter 5 introduces data mining techniques for cross-sectional data. Different techniques for the analysis of temporal data are then detailed in Chapter 6. Subsequently, chapter 7 explains techniques for the analysis of process data, followed by the introduction of analysis techniques for multiple BI perspectives in chapter 8. The book closes with a summary and discussion in chapter 9. Throughout the book, (mostly open source) tools are recommended, described and applied; a more detailed survey on tools can be found in the appendix, and a detailed code for the solutions together with instructions on how to install the software used can be found on the accompanying website. Also, all concepts presented are illustrated and selected examples and exercises are provided. The book is suitable for graduate students in computer science, and the dedicated website with examples and solutions makes the book ideal as a textbook for a first course in business intelligence in computer science or business information systems. Additionally, practitioners and industrial developers who are interested in the concepts behind business intelligence will benefit from the clear explanations and many examples.
This handbook provides an overarching view of cyber security and digital forensic challenges related to big data and IoT environment, prior to reviewing existing data mining solutions and their potential application in big data context, and existing authentication and access control for IoT devices. An IoT access control scheme and an IoT forensic framework is also presented in this book, and it explains how the IoT forensic framework can be used to guide investigation of a popular cloud storage service. A distributed file system forensic approach is also presented, which is used to guide the investigation of Ceph. Minecraft, a Massively Multiplayer Online Game, and the Hadoop distributed file system environment are also forensically studied and their findings reported in this book. A forensic IoT source camera identification algorithm is introduced, which uses the camera's sensor pattern noise from the captured image. In addition to the IoT access control and forensic frameworks, this handbook covers a cyber defense triage process for nine advanced persistent threat (APT) groups targeting IoT infrastructure, namely: APT1, Molerats, Silent Chollima, Shell Crew, NetTraveler, ProjectSauron, CopyKittens, Volatile Cedar and Transparent Tribe. The characteristics of remote-controlled real-world Trojans using the Cyber Kill Chain are also examined. It introduces a method to leverage different crashes discovered from two fuzzing approaches, which can be used to enhance the effectiveness of fuzzers. Cloud computing is also often associated with IoT and big data (e.g., cloud-enabled IoT systems), and hence a survey of the cloud security literature and a survey of botnet detection approaches are presented in the book. Finally, game security solutions are studied and explained how one may circumvent such solutions. This handbook targets the security, privacy and forensics research community, and big data research community, including policy makers and government agencies, public and private organizations policy makers. Undergraduate and postgraduate students enrolled in cyber security and forensic programs will also find this handbook useful as a reference.
Data Stewardship for Open Science: Implementing FAIR Principles has been written with the intention of making scientists, funders, and innovators in all disciplines and stages of their professional activities broadly aware of the need, complexity, and challenges associated with open science, modern science communication, and data stewardship. The FAIR principles are used as a guide throughout the text, and this book should leave experimentalists consciously incompetent about data stewardship and motivated to respect data stewards as representatives of a new profession, while possibly motivating others to consider a career in the field. The ebook, avalable for no additional cost when you buy the paperback, will be updated every 6 months on average (providing that significant updates are needed or avaialble). Readers will have the opportunity to contribute material towards these updates, and to develop their own data management plans, via the free Data Stewardship Wizard.
This book explores a broad cross section of research and actual case studies to draw out new insights that may be used to build a benchmark for IT security professionals. This research takes a deeper dive beneath the surface of the analysis to uncover novel ways to mitigate data security vulnerabilities, connect the dots and identify patterns in the data on breaches. This analysis will assist security professionals not only in benchmarking their risk management programs but also in identifying forward looking security measures to narrow the path of future vulnerabilities.
This book addresses the usefulness of knowledge discovery through data mining. With this aim, contributors from different fields propose concrete problems and applications showing how data mining and discovering embedded knowledge from raw data can be beneficial to social organizations, domestic spheres, and ICT markets. Data mining or knowledge discovery in databases (KDD) has received increasing interest due to its focus on transforming large amounts of data into novel, valid, useful, and structured knowledge by detecting concealed patterns and relationships. The concept of knowledge is broad and speculative and has promoted epistemological debates in western philosophies. The intensified interest in knowledge management and data mining stems from the difficulty in identifying computational models able to approximate human behaviors and abilities in resolving organizational, social, and physical problems. Current ICT interfaces are not yet adequately advanced to support and simulate the abilities of physicians, teachers, assistants or housekeepers in domestic spheres. And unlike in industrial contexts where abilities are routinely applied, the domestic world is continuously changing and unpredictable. There are challenging questions in this field: Can knowledge locked in conventions, rules of conduct, common sense, ethics, emotions, laws, cultures, and experiences be mined from data? Is it acceptable for automatic systems displaying emotional behaviors to govern complex interactions based solely on the mining of large volumes of data? Discussing multidisciplinary themes, the book proposes computational models able to approximate, to a certain degree, human behaviors and abilities in resolving organizational, social, and physical problems. The innovations presented are of primary importance for: a. The academic research community b. The ICT market c. Ph.D. students and early stage researchers d. Schools, hospitals, rehabilitation and assisted-living centers e. Representatives from multimedia industries and standardization bodies
This book provides a perspective on the application of machine learning-based methods in knowledge discovery from natural languages texts. By analysing various data sets, conclusions which are not normally evident, emerge and can be used for various purposes and applications. The book provides explanations of principles of time-proven machine learning algorithms applied in text mining together with step-by-step demonstrations of how to reveal the semantic contents in real-world datasets using the popular R-language with its implemented machine learning algorithms. The book is not only aimed at IT specialists, but is meant for a wider audience that needs to process big sets of text documents and has basic knowledge of the subject, e.g. e-mail service providers, online shoppers, librarians, etc. The book starts with an introduction to text-based natural language data processing and its goals and problems. It focuses on machine learning, presenting various algorithms with their use and possibilities, and reviews the positives and negatives. Beginning with the initial data pre-processing, a reader can follow the steps provided in the R-language including the subsuming of various available plug-ins into the resulting software tool. A big advantage is that R also contains many libraries implementing machine learning algorithms, so a reader can concentrate on the principal target without the need to implement the details of the algorithms her- or himself. To make sense of the results, the book also provides explanations of the algorithms, which supports the final evaluation and interpretation of the results. The examples are demonstrated using realworld data from commonly accessible Internet sources.
The abundance of data and the rise of new quantitative and statistical techniques have created a promising area: data analytics. This combination of a culture of data-driven decision making and techniques to include domain knowledge allows organizations to exploit big data analytics in their evaluation and decision processes. Also, in education and learning, big data analytics is being used to enhance the learning process, to evaluate efficiency, to improve feedback, and to enrich the learning experience. As every step a student takes in the online world can be traced, analyzed, and used, there are plenty of opportunities to improve the learning process of students. First, data analytics techniques can be used to enhance the student' s learning process by providing real-time feedback, or by enriching the learning experience. Second, data analytics can be used to support the instructor or teacher. Using data analytics, the instructor can better trace, and take targeted actions to improve, the learning process of the student. Third, there are possibilities in using data analytics to measure the performance of instructors. Finally, for policy makers, it is often unclear how schools use their available resources to "produce" outcomes. By combining structured and unstructured data from various sources, data analytics might provide a solution for governments that aim to monitor the performance of schools more closely. Data analytics in education should not be the domain of a single discipline. Economists should discuss the possibilities, issues, and normative questions with a multidisciplinary team of pedagogists, philosophers, computer scientists, and sociologists. By bringing together various disciplines, a more comprehensive answer can be formulated to the challenges ahead. This book starts this discussion by highlighting some economic perspectives on the use of data analytics in education. The book begins a rich, multidisciplinary discussion that may make data analytics in education seem as natural as a teacher in front of a classroom.
This book is about the process of using analytics and the capabilities of analytics in today's organizations. Cutting through the buzz surrounding the term analytics and the overloaded expectations about using analytics, the book demystifies analytics with an in-depth examination of concepts grounded in operations research and management science. Analytics as a set of tools and processes is only as effective as: The data with which it is working The human judgment applying the processes and understanding the output of these processes. For this reason, the book focuses on the analytics process. What is intrinsic to analytics' real organizational impact are the careful application of tools and the thoughtful application of their outcomes. This work emphasizes analytics as part of a process that supports decision-making within organizations. It wants to debunk overblown expectations that somehow analytics outputs or analytics as applied to other concepts, such as Big Data, are the be-all and end-all of the analytics process. They are, instead, only a step within a holistic and critical approach to management thinking that can create real value for an organization. To develop this holistic approach, the book is divided into two sections that examine concepts and applications. The first section makes the case for executive management taking a holistic approach to analytics. It draws on rich research in operations and management science that form the context in which analytics tools are to be applied. There is a strong emphasis on knowledge management concepts and techniques, as well as risk management concepts and techniques. The second section focuses on both the use of the analytics process and organizational issues that are required to make the analytics process relevant and impactful.
If you are a manager who receives the results of any data analyst's work to help with your decision-making, this book is for you. Anyone playing a role in the field of analytics can benefit from this book as well. In the two decades the editors of this book spent teaching and consulting in the field of analytics, they noticed a critical shortcoming in the communication abilities of many analytics professionals. Specifically, analysts have difficulty in articulating in business terms what their analyses showed and what actionable recommendations were made. When analysts made presentations, they tended to lapse into the technicalities of mathematical procedures, rather than focusing on the strategic and tactical impact and meaning of their work. As analytics has become more mainstream and widespread in organizations, this problem has grown more acute. Data Analytics: Effective Methods for Presenting Results tackles this issue. The editors have used their experience as presenters and audience members who have become lost during presentation. Over the years, they experimented with different ways of presenting analytics work to make a more compelling case to top managers. They have discovered tried and true methods for improving presentations, which they share. The book also presents insights from other analysts and managers who share their own experiences. It is truly a collection of experiences and insight from academics and professionals involved with analytics. The book is not a primer on how to draw the most beautiful charts and graphs or about how to perform any specific kind of analysis. Rather, it shares the experiences of professionals in various industries about how they present their analytics results effectively. They tell their stories on how to win over audiences. The book spans multiple functional areas within a business, and in some cases, it discusses how to adapt presentations to the needs of audiences at different levels of management.
From the Foreword: "Big Data Management and Processing is [a] state-of-the-art book that deals with a wide range of topical themes in the field of Big Data. The book, which probes many issues related to this exciting and rapidly growing field, covers processing, management, analytics, and applications... [It] is a very valuable addition to the literature. It will serve as a source of up-to-date research in this continuously developing area. The book also provides an opportunity for researchers to explore the use of advanced computing technologies and their impact on enhancing our capabilities to conduct more sophisticated studies." ---Sartaj Sahni, University of Florida, USA "Big Data Management and Processing covers the latest Big Data research results in processing, analytics, management and applications. Both fundamental insights and representative applications are provided. This book is a timely and valuable resource for students, researchers and seasoned practitioners in Big Data fields. --Hai Jin, Huazhong University of Science and Technology, China Big Data Management and Processing explores a range of big data related issues and their impact on the design of new computing systems. The twenty-one chapters were carefully selected and feature contributions from several outstanding researchers. The book endeavors to strike a balance between theoretical and practical coverage of innovative problem solving techniques for a range of platforms. It serves as a repository of paradigms, technologies, and applications that target different facets of big data computing systems. The first part of the book explores energy and resource management issues, as well as legal compliance and quality management for Big Data. It covers In-Memory computing and In-Memory data grids, as well as co-scheduling for high performance computing applications. The second part of the book includes comprehensive coverage of Hadoop and Spark, along with security, privacy, and trust challenges and solutions. The latter part of the book covers mining and clustering in Big Data, and includes applications in genomics, hospital big data processing, and vehicular cloud computing. The book also analyzes funding for Big Data projects.
This book presents a collection of essays written by leading researchers to honor Roman Slowinski's major scholarly interests and contributions. He is well-known for conducting extensive research on methodologies and techniques for intelligent decision support, where he combines operational research and artificial intelligence. The book reconstructs his main contributions, presents cutting-edge research and provides an outlook on the most promising and advanced domains of computer science and multiple criteria decision aiding. The respective chapters cover a wide range of related research areas, including decision sciences, ordinal data mining, preference learning and multiple criteria decision aiding, modeling of uncertainty and imprecision in decision problems, rough set theory, fuzzy set theory, multi-objective optimization, project scheduling and decision support applications. As such, the book will appeal to researchers and scholars in related fields.
This book presents the state of the art in distributed machine learning algorithms that are based on gradient optimization methods. In the big data era, large-scale datasets pose enormous challenges for the existing machine learning systems. As such, implementing machine learning algorithms in a distributed environment has become a key technology, and recent research has shown gradient-based iterative optimization to be an effective solution. Focusing on methods that can speed up large-scale gradient optimization through both algorithm optimizations and careful system implementations, the book introduces three essential techniques in designing a gradient optimization algorithm to train a distributed machine learning model: parallel strategy, data compression and synchronization protocol. Written in a tutorial style, it covers a range of topics, from fundamental knowledge to a number of carefully designed algorithms and systems of distributed machine learning. It will appeal to a broad audience in the field of machine learning, artificial intelligence, big data and database management.
This open access book brings together a set of original studies that use cutting-edge computational methods to investigate conflict at various geographic scales and degrees of intensity and violence. Methodologically, this book covers a variety of computational approaches from text mining and machine learning to agent-based modelling and social network analysis. Empirical cases range from migration policy framing in North America and street protests in Iran to violence against civilians in Congo and food riots world-wide. Supplementary materials in the book include a comprehensive list of the datasets on conflict and dissent, as well as resources to online repositories where the annotated code and data of individual chapters can be found and where (agent-based) models can be re-produced and altered. These materials are a valuable resource for those wishing to retrace and learn from the analyses described in this volume and adapt and apply them to their own research interests. By bringing together novel research through an international team of scholars from a range of disciplines, Computational Conflict Research pioneers and maps this emerging field. The book will appeal to students, scholars, and anyone interested in the prospects of using computational social sciences to advance our understanding of conflict dynamics.
This book introduces readers to a workload-aware methodology for large-scale graph algorithm optimization in graph-computing systems, and proposes several optimization techniques that can enable these systems to handle advanced graph algorithms efficiently. More concretely, it proposes a workload-aware cost model to guide the development of high-performance algorithms. On the basis of the cost model, the book subsequently presents a system-level optimization resulting in a partition-aware graph-computing engine, PAGE. In addition, it presents three efficient and scalable advanced graph algorithms - the subgraph enumeration, cohesive subgraph detection, and graph extraction algorithms. This book offers a valuable reference guide for junior researchers, covering the latest advances in large-scale graph analysis; and for senior researchers, sharing state-of-the-art solutions based on advanced graph algorithms. In addition, all readers will find a workload-aware methodology for designing efficient large-scale graph algorithms.
The Elements of Knowledge Organization is a unique and original work introducing the fundamental concepts related to the field of Knowledge Organization (KO). There is no other book like it currently available. The author begins the book with a comprehensive discussion of "knowledge" and its associated theories. He then presents a thorough discussion of the philosophical underpinnings of knowledge organization. The author walks the reader through the Knowledge Organization domain expanding the core topics of ontologies, taxonomies, classification, metadata, thesauri and domain analysis. The author also presents the compelling challenges associated with the organization of knowledge. This is the first book focused on the concepts and theories associated with KO domain. Prior to this book, individuals wishing to study Knowledge Organization in its broadest sense would generally collocate their own resources, navigating the various methods and models and perhaps inadvertently excluding relevant materials. This text cohesively links key and related KO material and provides a deeper understanding of the domain in its broadest sense and with enough detail to truly investigate its many facets. This book will be useful to both graduate and undergraduate students in the computer science and information science domains both as a text and as a reference book. It will also be valuable to researchers and practitioners in the industry who are working on website development, database administration, data mining, data warehousing and data for search engines. The book is also beneficial to anyone interested in the concepts and theories associated with the organization of knowledge. Dr. Richard P. Smiraglia is a world-renowned author who is well published in the Knowledge Organization domain. Dr. Smiraglia is editor-in-chief of the journal Knowledge Organization, published by Ergon-Verlag of Wurzburg. He is a professor and member of the Information Organization Research Group at the School of Information Studies at University of Wisconsin Milwaukee. |
![]() ![]() You may like...
Evolutionary Controversies in Economics…
"Japan Association For Evolutionary Economics", Y. Aruka
Hardcover
R3,067
Discovery Miles 30 670
Stable Domination and Independence in…
Deirdre Haskell, Ehud Hrushovski, …
Hardcover
R3,457
Discovery Miles 34 570
Pro T-SQL 2012 Programmer's Guide
Michael Coles, Scott Shaw, …
Paperback
R1,642
Discovery Miles 16 420
Algebraic Perspectives on Substructural…
Davide Fazio, Antonio Ledda, …
Hardcover
R3,284
Discovery Miles 32 840
Elementary Lessons in Logic - Deductive…
William Stanley Jevons
Paperback
R610
Discovery Miles 6 100
|