|
|
Books > Computing & IT > Applications of computing > Databases > Data mining
Big Data and Smart Service Systems presents the theories and
applications regarding Big Data and smart service systems, data
acquisition, smart cities, business decision-making support, and
smart service design. The rapid development of computer and
Internet technologies has led the world to the era of Big Data. Big
Data technologies are widely used, which has brought unprecedented
impacts on traditional industries and lifestyle. More and more
governments, business sectors, and institutions begin to realize
data is becoming the most valuable asset and its analysis is
becoming the core competitiveness.
Temporal Data Mining via Unsupervised Ensemble Learning provides
the principle knowledge of temporal data mining in association with
unsupervised ensemble learning and the fundamental problems of
temporal data clustering from different perspectives. By providing
three proposed ensemble approaches of temporal data clustering,
this book presents a practical focus of fundamental knowledge and
techniques, along with a rich blend of theory and practice.
Furthermore, the book includes illustrations of the proposed
approaches based on data and simulation experiments to demonstrate
all methodologies, and is a guide to the proper usage of these
methods. As there is nothing universal that can solve all problems,
it is important to understand the characteristics of both
clustering algorithms and the target temporal data so the correct
approach can be selected for a given clustering problem.
Scientists, researchers, and data analysts working with machine
learning and data mining will benefit from this innovative book, as
will undergraduate and graduate students following courses in
computer science, engineering, and statistics.
Handbook of Mobility Data Mining: Volume Three: Mobility
Data-Driven Applications introduces the fundamental technologies of
mobile big data mining (MDM), advanced AI methods, and upper-level
applications, helping readers comprehensively understand MDM with a
bottom-up approach. The book explains how to preprocess mobile big
data, visualize urban mobility, simulate and predict human travel
behavior, and assess urban mobility characteristics and their
matching performance as conditions and constraints in transport,
emergency management, and sustainability development systems. The
book contains crucial information for researchers, engineers,
operators, administrators, and policymakers seeking greater
understanding of current technologies' infra-knowledge structure
and limitations. The book introduces how to design MDM platforms
that adapt to the evolving mobility environment-and new types of
transportation and users-based on an integrated solution that
utilizes sensing and communication capabilities to tackle
significant challenges faced by the MDM field. This third volume
looks at various cases studies to illustrate and explore the
methods introduced in the first two volumes, covering topics such
as Intelligent Transportation Management, Smart Emergency
Management-detailing cases such as the Fukushima earthquake,
Hurricane Katrina, and COVID-19-and Urban Sustainability
Development, covering bicycle and railway travel behavior, mobility
inequality, and road and light pollution inequality.
The emergence of new technologies within the industrial revolution
has transformed businesses to a new socio-digital era. In this new
era, businesses are concerned with collecting data on customer
needs, behaviors, and preferences for driving effective customer
engagement and product development, as well as for crucial decision
making. However, the ever-shifting behaviors of consumers provide
many challenges for businesses to pinpoint the wants and needs of
their audience. Consumer Behavior Change and Data Analytics in the
Socio-Digital Era focuses on the concepts, theories, and analytical
techniques to track consumer behavior change. It provides
multidisciplinary research and practice focusing on social and
behavioral analytics to track consumer behavior shifts and improve
decision making among businesses. Covering topics such as consumer
sentiment analysis, emotional intelligence, and online purchase
decision making, this premier reference source is a timely resource
for business executives, entrepreneurs, data analysts, marketers,
advertisers, government officials, social media professionals,
libraries, students and educators of higher education, researchers,
and academicians.
Opinion Mining and Text Analytics on Literary Works and Social
Media introduces the use of artificial intelligence and big data
analytics techniques which can apply opinion mining and text
analytics on literary works and social media. This book focuses on
theories, method and approaches in which data analytic techniques
can be used to analyze data from social media, literary books,
novels, news, texts, and beyond to provide a meaningful pattern.
The subject area of this book is multidisciplinary; related to data
science, artificial intelligence, social science and humanities,
and literature. This is an essential resource for scholars,
Students and lecturers from various fields of data science,
artificial intelligence, social science and humanities, and
literature, university libraries, new agencies, and many more.
In today's digital world, the huge amount of data being generated
is unstructured, messy, and chaotic in nature. Dealing with such
data, and attempting to unfold the meaningful information, can be a
challenging task. Feature engineering is a process to transform
such data into a suitable form that better assists with
interpretation and visualization. Through this method, the
transformed data is more transparent to the machine learning
models, which in turn causes better prediction and analysis of
results. Data science is crucial for the data scientist to assess
the trade-offs of their decisions regarding the effectiveness of
the machine learning model implemented. Investigating the demand in
this area today and in the future is a necessity. The Handbook of
Research on Automated Feature Engineering and Advanced Applications
in Data Science provides an in-depth analysis on both the
theoretical and the latest empirical research findings on how
features can be extracted and transformed from raw data. The
chapters will introduce feature engineering and the recent
concepts, methods, and applications with the use of various data
types, as well as examine the latest machine learning applications
on the data. While highlighting topics such as detection, tracking,
selection techniques, and prediction models using data science,
this book is ideally intended for research scholars, big data
scientists, project developers, data analysts, and computer
scientists along with practitioners, researchers, academicians, and
students interested in feature engineering and its impact on data.
BIG DATA ANALYTICS FOR INTERNET OF THINGS Discover the latest
developments in IoT Big Data with a new resource from established
and emerging leaders in the field Big Data Analytics for Internet
of Things delivers a comprehensive overview of all aspects of big
data analytics in Internet of Things (IoT) systems. The book
includes discussions of the enabling technologies of IoT data
analytics, types of IoT data analytics, challenges in IoT data
analytics, demand for IoT data analytics, computing platforms,
analytical tools, privacy, and security. The distinguished editors
have included resources that address key techniques in the analysis
of IoT data. The book demonstrates how to select the appropriate
techniques to unearth valuable insights from IoT data and offers
novel designs for IoT systems. With an abiding focus on practical
strategies with concrete applications for data analysts and IoT
professionals, Big Data Analytics for Internet of Things also
offers readers: A thorough introduction to the Internet of Things,
including IoT architectures, enabling technologies, and
applications An exploration of the intersection between the
Internet of Things and Big Data, including IoT as a source of Big
Data, the unique characteristics of IoT data, etc. A discussion of
the IoT data analytics, including the data analytical requirements
of IoT data and the types of IoT analytics, including predictive,
descriptive, and prescriptive analytics A treatment of machine
learning techniques for IoT data analytics Perfect for
professionals, industry practitioners, and researchers engaged in
big data analytics related to IoT systems, Big Data Analytics for
Internet of Things will also earn a place in the libraries of IoT
designers and manufacturers interested in facilitating the
efficient implementation of data analytics strategies.
The success of many companies through the assistance of bitcoin
proves that technology continually dominates and transforms how
economics operate. However, a deeper, more conceptual understanding
of how these technologies work to identify innovation opportunities
and how to successfully thrive in an increasingly competitive
environment is needed for the entrepreneurs of tomorrow.
Transforming Businesses With Bitcoin Mining and Blockchain
Applications provides innovative insights into IT infrastructure
and emerging trends in the realm of digital business technologies.
This publication analyzes and extracts information from Bitcoin
networks and provides the necessary steps to designing open
blockchain. Highlighting topics that include financial markets,
risk management, and smart technologies, the research contained
within the title is ideal for entrepreneurs, business
professionals, managers, executives, academicians, researchers, and
business students.
Multimedia represents information in novel and varied formats. One
of the most prevalent examples of continuous media is video.
Extracting underlying data from these videos can be an arduous
task. From video indexing, surveillance, and mining, complex
computational applications are required to process this data.
Intelligent Analysis of Multimedia Information is a pivotal
reference source for the latest scholarly research on the
implementation of innovative techniques to a broad spectrum of
multimedia applications by presenting emerging methods in
continuous media processing and manipulation. This book offers a
fresh perspective for students and researchers of information
technology, media professionals, and programmers.
Multinational organizations have begun to realize that sentiment
mining plays an important role for decision making and market
strategy. The revolutionary growth of digital marketing not only
changes the market game, but also brings forth new opportunities
for skilled professionals and expertise. Currently, the
technologies are rapidly changing, and artificial intelligence (AI)
and machine learning are contributing as game-changing
technologies. These are not only trending but are also increasingly
popular among data scientists and data analysts. New Opportunities
for Sentiment Analysis and Information Processing provides
interdisciplinary research in information retrieval and sentiment
analysis including studies on extracting sentiments from textual
data, sentiment visualization-based dimensionality reduction for
multiple features, and deep learning-based multi-domain sentiment
extraction. The book also optimizes techniques used for sentiment
identification and examines applications of sentiment analysis and
emotion detection. Covering such topics as communication networks,
natural language processing, and semantic analysis, this book is
essential for data scientists, data analysts, IT specialists,
scientists, researchers, academicians, and students.
The Data and Analytics Playbook: Proven Methods for Governed Data
and Analytic Quality explores the way in which data continues to
dominate budgets, along with the varying efforts made across a
variety of business enablement projects, including applications,
web and mobile computing, big data analytics, and traditional data
integration. The book teaches readers how to use proven methods and
accelerators to break through data obstacles to provide faster,
higher quality delivery of mission critical programs. Drawing upon
years of practical experience, and using numerous examples and an
easy to understand playbook, Lowell Fryman, Gregory Lampshire, and
Dan Meers discuss a simple, proven approach to the execution of
multiple data oriented activities. In addition, they present a
clear set of methods to provide reliable governance, controls,
risk, and exposure management for enterprise data and the programs
that rely upon it. In addition, they discuss a cost-effective
approach to providing sustainable governance and quality outcomes
that enhance project delivery, while also ensuring ongoing
controls. Example activities, templates, outputs, resources, and
roles are explored, along with different organizational models in
common use today and the ways they can be mapped to leverage
playbook data governance throughout the organization.
Cluster or co-cluster analyses are important tools in a variety of
scientific areas. The introduction of this book presents a state of
the art of already well-established, as well as more recent methods
of co-clustering. The authors mainly deal with the two-mode
partitioning under different approaches, but pay particular
attention to a probabilistic approach. Chapter 1 concerns
clustering in general and the model-based clustering in particular.
The authors briefly review the classical clustering methods and
focus on the mixture model. They present and discuss the use of
different mixtures adapted to different types of data. The
algorithms used are described and related works with different
classical methods are presented and commented upon. This chapter is
useful in tackling the problem of co-clustering under the mixture
approach. Chapter 2 is devoted to the latent block model proposed
in the mixture approach context. The authors discuss this model in
detail and present its interest regarding co-clustering. Various
algorithms are presented in a general context. Chapter 3 focuses on
binary and categorical data. It presents, in detail, the
appropriated latent block mixture models. Variants of these models
and algorithms are presented and illustrated using examples.
Chapter 4 focuses on contingency data. Mutual information,
phi-squared and model-based co-clustering are studied. Models,
algorithms and connections among different approaches are described
and illustrated. Chapter 5 presents the case of continuous data. In
the same way, the different approaches used in the previous
chapters are extended to this situation. Contents 1. Cluster
Analysis. 2. Model-Based Co-Clustering. 3. Co-Clustering of Binary
and Categorical Data. 4. Co-Clustering of Contingency Tables. 5.
Co-Clustering of Continuous Data. About the Authors Gerard Govaert
is Professor at the University of Technology of Compiegne, France.
He is also a member of the CNRS Laboratory Heudiasyc (Heuristic and
diagnostic of complex systems). His research interests include
latent structure modeling, model selection, model-based cluster
analysis, block clustering and statistical pattern recognition. He
is one of the authors of the MIXMOD (MIXtureMODelling) software.
Mohamed Nadif is Professor at the University of Paris-Descartes,
France, where he is a member of LIPADE (Paris Descartes computer
science laboratory) in the Mathematics and Computer Science
department. His research interests include machine learning, data
mining, model-based cluster analysis, co-clustering, factorization
and data analysis. Cluster Analysis is an important tool in a
variety of scientific areas. Chapter 1 briefly presents a state of
the art of already well-established as well more recent methods.
The hierarchical, partitioning and fuzzy approaches will be
discussed amongst others. The authors review the difficulty of
these classical methods in tackling the high dimensionality,
sparsity and scalability. Chapter 2 discusses the interests of
coclustering, presenting different approaches and defining a
co-cluster. The authors focus on co-clustering as a simultaneous
clustering and discuss the cases of binary, continuous and
co-occurrence data. The criteria and algorithms are described and
illustrated on simulated and real data. Chapter 3 considers
co-clustering as a model-based co-clustering. A latent block model
is defined for different kinds of data. The estimation of
parameters and co-clustering is tackled under two approaches:
maximum likelihood and classification maximum likelihood. Hard and
soft algorithms are described and applied on simulated and real
data. Chapter 4 considers co-clustering as a matrix approximation.
The trifactorization approach is considered and algorithms based on
update rules are described. Links with numerical and probabilistic
approaches are established. A combination of algorithms are
proposed and evaluated on simulated and real data. Chapter 5
considers a co-clustering or bi-clustering as the search for
coherent co-clusters in biological terms or the extraction of
co-clusters under conditions. Classical algorithms will be
described and evaluated on simulated and real data. Different
indices to evaluate the quality of coclusters are noted and used in
numerical experiments.
Based on current literature and cutting-edge advances in the
machine learning field, there are four algorithms whose usage in
new application domains must be explored: neural networks, rule
induction algorithms, tree-based algorithms, and density-based
algorithms. A number of machine learning related algorithms have
been derived from these four algorithms. Consequently, they
represent excellent underlying methods for extracting hidden
knowledge from unstructured data, as essential data mining tasks.
Implementation of Machine Learning Algorithms Using Control-Flow
and Dataflow Paradigms presents widely used data-mining algorithms
and explains their advantages and disadvantages, their mathematical
treatment, applications, energy efficient implementations, and
more. It presents research of energy efficient accelerators for
machine learning algorithms. Covering topics such as control-flow
implementation, approximate computing, and decision tree
algorithms, this book is an essential resource for computer
scientists, engineers, students and educators of higher education,
researchers, and academicians.
Data Simplification: Taming Information With Open Source Tools
addresses the simple fact that modern data is too big and complex
to analyze in its native form. Data simplification is the process
whereby large and complex data is rendered usable. Complex data
must be simplified before it can be analyzed, but the process of
data simplification is anything but simple, requiring a specialized
set of skills and tools. This book provides data scientists from
every scientific discipline with the methods and tools to simplify
their data for immediate analysis or long-term storage in a form
that can be readily repurposed or integrated with other data.
Drawing upon years of practical experience, and using numerous
examples and use cases, Jules Berman discusses the principles,
methods, and tools that must be studied and mastered to achieve
data simplification, open source tools, free utilities and snippets
of code that can be reused and repurposed to simplify data, natural
language processing and machine translation as a tool to simplify
data, and data summarization and visualization and the role they
play in making data useful for the end user.
Data is powerful. It separates leaders from laggards and it drives
business disruption, transformation, and reinvention. Today's most
progressive companies are using the power of data to propel their
industries into new areas of innovation, specialization, and
optimization. The horsepower of new tools and technologies have
provided more opportunities than ever to harness, integrate, and
interact with massive amounts of disparate data for business
insights and value - something that will only continue in the era
of the Internet of Things. And, as a new breed of tech-savvy and
digitally native knowledge workers rise to the ranks of data
scientist and visual analyst, the needs and demands of the people
working with data are changing, too. The world of data is changing
fast. And, it's becoming more visual. Visual insights are becoming
increasingly dominant in information management, and with the
reinvigorated role of data visualization, this imperative is a
driving force to creating a visual culture of data discovery. The
traditional standards of data visualizations are making way for
richer, more robust and more advanced visualizations and new ways
of seeing and interacting with data. However, while data
visualization is a critical tool to exploring and understanding
bigger and more diverse and dynamic data, by understanding and
embracing our human hardwiring for visual communication and
storytelling and properly incorporating key design principles and
evolving best practices, we take the next step forward to transform
data visualizations from tools into unique visual information
assets.
The effective application of knowledge management principles has
proven to be beneficial for modern organizations. When utilized in
the academic community, these frameworks can enhance the value and
quality of research initiatives. Enhancing Academic Research With
Knowledge Management Principles is a pivotal reference source for
the latest research on implementing theoretical frameworks of
information management in the context of academia and universities.
Featuring extensive coverage on relevant areas such as data mining,
organizational and academic culture, this publication is an ideal
resource for researchers, academics, practitioners, professionals,
and students.
|
|