Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
|||
Books > Computing & IT > Applications of computing > Databases > Data mining
New technologies have enabled us to collect massive amounts of data in many fields. However, our pace of discovering useful information and knowledge from these data falls far behind our pace of collecting the data. Data Mining: Theories, Algorithms, and Examples introduces and explains a comprehensive set of data mining algorithms from various data mining fields. The book reviews theoretical rationales and procedural details of data mining algorithms, including those commonly found in the literature and those presenting considerable difficulty, using small data examples to explain and walk through the algorithms. The book covers a wide range of data mining algorithms, including those commonly found in data mining literature and those not fully covered in most of existing literature due to their considerable difficulty. The book presents a list of software packages that support the data mining algorithms, applications of the data mining algorithms with references, and exercises, along with the solutions manual and PowerPoint slides of lectures. The author takes a practical approach to data mining algorithms so that the data patterns produced can be fully interpreted. This approach enables students to understand theoretical and operational aspects of data mining algorithms and to manually execute the algorithms for a thorough understanding of the data patterns produced by them.
This book provides a comprehensive introduction and practical look at the concepts and techniques readers need to get the most out of their data in real-world, large-scale data mining projects. It also guides readers through the data-analytic thinking necessary for extracting useful knowledge and business value from the data. The book is based on the Soft Computing and Data Mining (SCDM-16) conference, which was held in Bandung, Indonesia on August 18th-20th 2016 to discuss the state of the art in soft computing techniques, and offer participants sufficient knowledge to tackle a wide range of complex systems. The scope of the conference is reflected in the book, which presents a balance of soft computing techniques and data mining approaches. The two constituents are introduced to the reader systematically and brought together using different combinations of applications and practices. It offers engineers, data analysts, practitioners, scientists and managers the insights into the concepts, tools and techniques employed, and as such enables them to better understand the design choice and options of soft computing techniques and data mining approaches that are necessary to thrive in this data-driven ecosystem.
The two-volume set LNCS 10132 and 10133 constitutes the thoroughly refereed proceedings of the 23rd International Conference on Multimedia Modeling, MMM 2017, held in Reykjavik, Iceland, in January 2017. Of the 149 full papers submitted, 36 were selected for oral presentation and 33 for poster presentation; of the 34 special session papers submitted, 24 were selected for oral presentation and 2 for poster presentation; in addition, 5 demonstrations were accepted from 8 submissions, and all 7 submissions to VBS 2017. All papers presented were carefully reviewed and selected from 198 submissions. MMM is a leading international conference for researchers and industry practitioners for sharing new ideas, original research results and practical development experiences from all MMM related areas, broadly falling into three categories: multimedia content analysis; multimedia signal processing and communications; and multimedia applications and services.
Temporal Data Mining via Unsupervised Ensemble Learning provides the principle knowledge of temporal data mining in association with unsupervised ensemble learning and the fundamental problems of temporal data clustering from different perspectives. By providing three proposed ensemble approaches of temporal data clustering, this book presents a practical focus of fundamental knowledge and techniques, along with a rich blend of theory and practice. Furthermore, the book includes illustrations of the proposed approaches based on data and simulation experiments to demonstrate all methodologies, and is a guide to the proper usage of these methods. As there is nothing universal that can solve all problems, it is important to understand the characteristics of both clustering algorithms and the target temporal data so the correct approach can be selected for a given clustering problem. Scientists, researchers, and data analysts working with machine learning and data mining will benefit from this innovative book, as will undergraduate and graduate students following courses in computer science, engineering, and statistics.
Big Data and Smart Service Systems presents the theories and applications regarding Big Data and smart service systems, data acquisition, smart cities, business decision-making support, and smart service design. The rapid development of computer and Internet technologies has led the world to the era of Big Data. Big Data technologies are widely used, which has brought unprecedented impacts on traditional industries and lifestyle. More and more governments, business sectors, and institutions begin to realize data is becoming the most valuable asset and its analysis is becoming the core competitiveness.
This book contains the proceedings as well as invited papers for the first annual conference of the UNESCO Unitwin Complex System Digital Campus (CSDC), which is an international initiative gathering 120 Universities on four continents, and structured in ten E-Departments. First Complex Systems Digital Campus World E-Conference 2015 features chapters from the latest research results on theoretical questions of complex systems and their experimental domains. The content contained bridges the gap between the individual and the collective within complex systems science and new integrative sciences on topics such as: genes to organisms to ecosystems, atoms to materials to products, and digital media to the Internet. The conference breaks new ground through a dedicated video-conferencing system - a concept at the heart of the international UNESCO UniTwin, embracing scientists from low-income and distant countries. This book promotes an integrated system of research, education, and training. It also aims at contributing to global development by taking into account its social, economic, and cultural dimensions. First Complex Systems Digital Campus World E-Conference 2015 will appeal to students and researchers working in the fields of complex systems, statistical physics, computational intelligence, and biological physics.
The LNCS journal Transactions on Large-Scale Data- and Knowledge-Centered Systems focuses on data management, knowledge discovery, and knowledge processing, which are core and hot topics in computer science. Since the 1990s, the Internet has become the main driving force behind application development in all domains. An increase in the demand for resource sharing across different sites connected through networks has led to an evolution of data- and knowledge-management systems from centralized systems to decentralized systems enabling large-scale distributed applications providing high scalability. Current decentralized systems still focus on data and knowledge as their main resource. Feasibility of these systems relies basically on P2P (peer-to-peer) techniques and the support of agent systems with scaling and decentralized control. Synergy between grids, P2P systems, and agent technologies is the key to data- and knowledge-centered systems in large-scale environments. This, the 29th issue of Transactions on Large-Scale Data- and Knowledge-Centered Systems, contains four revised selected regular papers. Topics covered include optimization and cluster validation processes for entity matching, business intelligence systems, and data profiling in the Semantic Web.
Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance. What You Will Learn* Use all the features of Apache Pig* Integrate Apache Pig with other tools* Extend Apache Pig* Optimize Pig Latin code* Solve different use cases for Pig LatinWho This Book Is ForAll levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators
The work presents new approaches to Machine Learning for Cyber Physical Systems, experiences and visions. It contains some selected papers from the international Conference ML4CPS - Machine Learning for Cyber Physical Systems, which was held in Karlsruhe, September 29th, 2016. Cyber Physical Systems are characterized by their ability to adapt and to learn: They analyze their environment and, based on observations, they learn patterns, correlations and predictive models. Typical applications are condition monitoring, predictive maintenance, image processing and diagnosis. Machine Learning is the key technology for these developments.
This book constitutes the refereed post-conference proceedings of the First International Workshop on Spectral and Shape Analysis in Medical Imaging, SeSAMI 2016, held in conjunction with MICCAI 2016, in Athens, Greece, in October 2016. The 10 submitted full papers presented in this volume were carefully reviewed. The papers reflect the following topics: spectral methods; longitudinal methods; and shape methods.
This book reports on the latest advances on the theories, practices, standards and strategies that are related to the modern technology paradigms, the Mobile Cloud computing (MCC) and Big Data, as the pillars and their association with the emerging 5G mobile networks. The book includes 15 rigorously refereed chapters written by leading international researchers, providing the readers with technical and scientific information about various aspects of Big Data and Mobile Cloud Computing, from basic concepts to advanced findings, reporting the state-of-the-art on Big Data management. It demonstrates and discusses methods and practices to improve multi-source Big Data manipulation techniques, as well as the integration of resources availability through the 3As (Anywhere, Anything, Anytime) paradigm, using the 5G access technologies.
This book constitutes the refereed proceedings of the 51st Annual Convention of the Computer Society of India, CSI 2016, held in Coimbatore, India, in December 2016. The 23 revised papers presented were carefully reviewed and selected from 74 submissions. The theme of CSI 2016, Digital Connectivity - Social Impact, has been selected to highlight the importance of technology in solving social problems and thereby creating a long term impact on society. The papers are organized in topical sections on information science; computational intelligence; network computing; IT for society.
This book presents a collection of representative and novel work in the field of data mining, knowledge discovery, clustering and classification, based on expanded and reworked versions of a selection of the best papers originally presented in French at the EGC 2014 and EGC 2015 conferences held in Rennes (France) in January 2014 and Luxembourg in January 2015. The book is in three parts: The first four chapters discuss optimization considerations in data mining. The second part explores specific quality measures, dissimilarities and ultrametrics. The final chapters focus on semantics, ontologies and social networks. Written for PhD and MSc students, as well as researchers working in the field, it addresses both theoretical and practical aspects of knowledge discovery and management.
This book includes the original, peer-reviewed research from the 2nd International Conference on Emerging Trends in Electrical, Communication and Information Technologies (ICECIT 2015), held in December, 2015 at Srinivasa Ramanujan Institute of Technology, Ananthapuramu, Andhra Pradesh, India. It covers the latest research trends or developments in areas of Electrical Engineering, Electronic and Communication Engineering, and Computer Science and Information.
This book discusses the effective use of modern ICT solutions for business needs, including the efficient use of IT resources, decision support systems, business intelligence, data mining and advanced data processing algorithms, as well as the processing of large datasets (inter alia social networking such as Twitter and Facebook, etc.). The ability to generate, record and process qualitative and quantitative data, including in the area of big data, the Internet of Things (IoT) and cloud computing offers a real prospect of significant improvements for business, as well as the operation of a company within Industry 4.0. The book presents new ideas, approaches, solutions and algorithms in the area of knowledge representation, management and processing, quantitative and qualitative data processing (including sentiment analysis), problems of simulation performance, and the use of advanced signal processing to increase the speed of computation. The solutions presented are also aimed at the effective use of business process modeling and notation (BPMN), business process semantization and investment project portfolio selection. It is a valuable resource for researchers, data analysts, entrepreneurs and IT professionals alike, and the research findings presented make it possible to reduce costs, increase the accuracy of investment, optimize resources and streamline operations and marketing.
Learn how to develop powerful data analytics applications quickly for SQL Server database administrators and developers. Organizations will be able to sift data and derive the business intelligence needed to drive business decisions and profit. The addition of R to SQL Server 2016 places a powerful analytical processor into an environment most developers are already comfortable with - Visual Studio. This book walks even the newest of users through the creation process of a powerful R-language tool set for use in analyzing and reporting on your data. As a SQL Server database administrator or developer, it is sometimes difficult to stay on the bleeding edge of technology. Microsoft's addition of R to SQL Server 2016 is sure to be a game-changer, and the language will certainly become an integral part of future releases. R is in fact widely used today in statistical and related applications, and its use is only growing. Beginning SQL Server R Services helps you jump on board this important trend by providing good examples with detailed explanations of the WHY and not just the HOW. Walks you through setup and installation of SQL Server R Services. Explains the basics of working with R Tools for Visual Studio. Provides a road map to successfully creating custom R code. What You Will Learn Discover R's role in the SQL Server 2016 hierarchy. Manage the components needed to run SQL Server R Services code. Run R-language analytics and queries inside the database. Create analytic solutions that run across multiple datasets. Gain in-depth knowledge of the R language itself. Implement custom SQL Server R Services solutions. Who This Book Is For Any level of database administrator or developer, but specifically it's for those developers with the need to develop powerful data analytics applications quickly. Seasoned R developers will appreciate the book for its robust learning pattern, using visual aids in combination with properties explanations and scenarios. Beginning SQL Server R Services is the perfect "new hire" gift for new database developers in any organization.
This book provides a systematic review of many advanced techniques to support the analysis of large collections of documents, ranging from the elementary to the profound, covering all the aspects of the visualization of text documents. Particularly, we start by introducing the fundamental concept of information visualization and visual analysis, followed by a brief survey of the field of text visualization and commonly used data models for converting document into a structured form for visualization. Then we introduce the key visualization techniques including visualizing document similarity, content, sentiments, as well as text corpus exploration system in details with concrete examples in the rest of the book.
This book constitutes the refereed proceedings of the 10th Metadata and Semantics Research Conference, MTSR 2016, held in Goettingen, Germany, in November 2016. The 26 full papers and 6 short papers presented were carefully reviewed and selected from 67 submissions. The papers are organized in several sessions and tracks: Digital Libraries, Information Retrieval, Linked and Social Data, Metadata and Semantics for Open Repositories, Research Information Systems and Data Infrastructures, Metadata and Semantics for Agriculture, Food and Environment, Metadata and Semantics for Cultural Collections and Applications, European and National Projects.
The papers in this volume are the refereed papers presented at AI-2016, the Thirty-sixth SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, held in Cambridge in December 2016 in both the technical and the application streams. They present new and innovative developments and applications, divided into technical stream sections on Knowledge Discovery and Data Mining, Sentiment Analysis and Recommendation, Machine Learning, AI Techniques, and Natural Language Processing, followed by application stream sections on AI for Medicine and Disability, Legal Liability and Finance, Telecoms and eLearning, and Genetic Algorithms in Action. The volume also includes the text of short papers presented as posters at the conference. This is the thirty-third volume in the Research and Development in Intelligent Systems series, which also incorporates the twenty-fourth volume in the Applications and Innovations in Intelligent Systems series. These series are essential reading for those who wish to keep up to date with developments in this important field.
This book contributes a basic framework for and specific insights into interdisciplinary connections between production, logistics, and traffic subsystems. The book is divided into two parts, the first of which presents an overview of interdisciplinarity in value-added networks and freight traffic. This includes an introduction to the topic and a description of an integrated framework of production, logistics, and traffic. Furthermore, it describes the barriers and challenges of interdisciplinary decision-making and project management. In turn, the second part presents domain-specific perspectives on interdisciplinary decision support, exploring domain-specific challenges of interdisciplinary interfaces and requirements for management methods and instruments from the standpoint of production management, logistics management, traffic management, and information technologies.
The two-volume set CCIS 662 and CCIS 663 constitutes the refereed proceedings of the 7th Chinese Conference on Pattern Recognition, CCPR 2016, held in Chengdu, China, in November 2016.The 121 revised papers presented in two volumes were carefully reviewed and selected from 199 submissions. The papers are organized in topical sections on robotics; computer vision; basic theory of pattern recognition; image and video processing; speech and language; emotion recognition.
This book constitutes the refereed proceedings of the Second International Conference on Technologies and Innovation, CITI 2016, held in Guayaquil, Ecuador, in November 2016. The 21 revised full papers presented were carefully reviewed and selected from 65 submissions. The papers are organized in topical sections on knowledge representation and natural language processing; Cloud and mobile computing; software engineering; expert systems and soft computing.
This book has a collection of articles written by Big Data experts to describe some of the cutting-edge methods and applications from their respective areas of interest, and provides the reader with a detailed overview of the field of Big Data Analytics as it is practiced today. The chapters cover technical aspects of key areas that generate and use Big Data such as management and finance; medicine and healthcare; genome, cytome and microbiome; graphs and networks; Internet of Things; Big Data standards; bench-marking of systems; and others. In addition to different applications, key algorithmic approaches such as graph partitioning, clustering and finite mixture modelling of high-dimensional data are also covered. The varied collection of themes in this volume introduces the reader to the richness of the emerging field of Big Data Analytics.
This volume provides a comprehensive introduction to the Translation Process Research Database (TPR-DB), which was compiled by the Centre for Research and Innovation in Translation and Technologies (CRITT). The TPR-DB is a unique resource featuring more than 500 hours of recorded translation process data, augmented with over 200 different rich annotations. Twelve chapters describe the diverse research directions this data can support, including the computational, statistical and psycholinguistic modeling of human translation processes.In the first chapters of this book, the reader is introduced to the CRITT TPR-DB. This is followed by two main parts, the first of which focuses on usability issues and details of implementing interactive machine translation. It also discusses the use of external resources and translator-information interaction. The second part addresses the cognitive and statistical modeling of human translation processes, including co-activation at the lexical, syntactic and discourse levels, translation literality, and various annotation schemata for the data.
This book constitutes the thoroughly refereed proceedings of the 5th National Conference of Social Media Processing, SMP 2016, held in Nanchang, China, in October 2016. The 24 revised full papers presented were carefully reviewed and selected from 109 submissions. The papers address issues such as: mining social media and applications; natural language processing; data mining; information retrieval; emergent social media processing problems. |
You may like...
Surfacing - On Being Black And Feminist…
Desiree Lewis, Gabeba Baderoon
Paperback
|