![]() |
![]() |
Your cart is empty |
||
Books > Computing & IT > Applications of computing > Artificial intelligence > Computer vision
COMPUTER VISION is a field of research that encompasses many objectives. A primary goal has been to construct visual sensors that can provide general-purpose robots with the same information about their surroundings as we receive from our own visual senses. This book takes an important step towards this goal by describing a working computer vision system named SCERPO. This system can recognize known three-dimensional objects in ordinary black-and-white images taken from unknown viewpoints, even when parts of the object are undetectable or hidden from view. A second major goal of computer vision re search is to provide a computational understanding of human vision. The research presented in this book has many implica tions for our understanding of human vision, particularly in the areas of perceptual organization and knowledge-based recogni tion. An attempt has been made to relate each computational result to the relevant areas in the psychology of vision. Since the material is meant to be accessible to a wide range of inter disciplinary readers, the book is written in plain language and attempts to explain most concepts from the starting position of the non-specialist. vii viii PREFACE One of the most important conclusions ansmg from this research is that visual recognition can commonly be achieved directly from the two-dimensional image without any prelim inary reconstruction of depth information or surface orienta tion from the visual input."
This is an examination of the history and the state of the art of the quest for visualizing scientific knowledge and the dynamics of its development. Through an interdisciplinary perspective this book presents profound visions, pivotal advances, and insightful contributions made by generations of researchers and professionals, which portrays a holistic view of the underlying principles and mechanisms of the development of science. This updated and extended second edition: highlights the latest advances in mapping scientific frontiers examines the foundations of strategies, principles, and design patterns provides an integrated and holistic account of major developments across disciplinary boundaries "Anyone who tries to follow the exponential growth of the literature on citation analysis and scientometrics knows how difficult it is to keep pace. Chaomei Chen has identified the significant methods and applications in visual graphics and made them clear to the uninitiated. Derek Price would have loved this book which not only pays homage to him but also to the key players in information science and a wide variety of others in the sociology and history of science." - Eugene Garfield "This is a wide ranging book on information visualization, with a specific focus on science mapping. Science mapping is still in its infancy and many intellectual challenges remain to be investigated and many of which are outlined in the final chapter. In this new edition Chaomei Chen has provided an essential text, useful both as a primer for new entrants and as a comprehensive overview of recent developments for the seasoned practitioner." - Henry Small Chaomei Chen is a Professor in the College of Information Science and Technology at Drexel University, Philadelphia, USA, and a ChangJiang Scholar at Dalian University of Technology, Dalian, China. He is the Editor-in-Chief of Information Visualization and the author of Turning Points: The Nature of Creativity (Springer, 2012) and Information Visualization: Beyond the Horizon (Springer, 2004, 2006).
This volume provides universal methodologies accompanied by Matlab software to manipulate numerous signal and image processing applications. It is done with discrete and polynomial periodic splines. Various contributions of splines to signal and image processing from a unified perspective are presented. This presentation is based on Zak transform and on Spline Harmonic Analysis (SHA) methodology. SHA combines approximation capabilities of splines with the computational efficiency of the Fast Fourier transform. SHA reduces the design of different spline types such as splines, spline wavelets (SW), wavelet frames (SWF) and wavelet packets (SWP) and their manipulations by simple operations. Digital filters, produced by wavelets design process, give birth to subdivision schemes. Subdivision schemes enable to perform fast explicit computation of splines' values at dyadic and triadic rational points. This is used for signals and images up sampling. In addition to the design of a diverse library of splines, SW, SWP and SWF, this book describes their applications to practical problems. The applications include up sampling, image denoising, recovery from blurred images, hydro-acoustic target detection, to name a few. The SWF are utilized for image restoration that was degraded by noise, blurring and loss of significant number of pixels. The book is accompanied by Matlab based software that demonstrates and implements all the presented algorithms. The book combines extensive theoretical exposure with detailed description of algorithms, applications and software. The Matlab software can be downloaded from http://extras.springer.com
This highly anticipated new edition provides a comprehensive account of face recognition research and technology, spanning the full range of topics needed for designing operational face recognition systems. After a thorough introductory chapter, each of the following chapters focus on a specific topic, reviewing background information, up-to-date techniques, and recent results, as well as offering challenges and future directions. Features: fully updated, revised and expanded, covering the entire spectrum of concepts, methods, and algorithms for automated face detection and recognition systems; provides comprehensive coverage of face detection, tracking, alignment, feature extraction, and recognition technologies, and issues in evaluation, systems, security, and applications; contains numerous step-by-step algorithms; describes a broad range of applications; presents contributions from an international selection of experts; integrates numerous supporting graphs, tables, charts, and performance data.
Face Image Analysis by Unsupervised Learning explores adaptive approaches to image analysis. It draws upon principles of unsupervised learning and information theory to adapt processing to the immediate task environment. In contrast to more traditional approaches to image analysis in which relevant structure is determined in advance and extracted using hand-engineered techniques, Face Image Analysis by Unsupervised Learning explores methods that have roots in biological vision and/or learn about the image structure directly from the image ensemble. Particular attention is paid to unsupervised learning techniques for encoding the statistical dependencies in the image ensemble. The first part of this volume reviews unsupervised learning, information theory, independent component analysis, and their relation to biological vision. Next, a face image representation using independent component analysis (ICA) is developed, which is an unsupervised learning technique based on optimal information transfer between neurons. The ICA representation is compared to a number of other face representations including eigenfaces and Gabor wavelets on tasks of identity recognition and expression analysis. Finally, methods for learning features that are robust to changes in viewpoint and lighting are presented. These studies provide evidence that encoding input dependencies through unsupervised learning is an effective strategy for face recognition. Face Image Analysis by Unsupervised Learning is suitable as a secondary text for a graduate-level course, and as a reference for researchers and practitioners in industry.
In many computer vision applications, objects have to be learned and recognized in images or image sequences. This book presents new probabilistic hierarchical models that allow an efficient representation of multiple objects of different categories, scales, rotations, and views. The idea is to exploit similarities between objects and object parts in order to share calculations and avoid redundant information. Furthermore inference approaches for fast and robust detection are presented. These new approaches combine the idea of compositional and similarity hierarchies and overcome limitations of previous methods. Besides classical object recognition the book shows the use for detection of human poses in a project for gait analysis. The use of activity detection is presented for the design of environments for ageing, to identify activities and behavior patterns in smart homes. In a presented project for parking spot detection using an intelligent vehicle, the proposed approaches are used to hierarchically model the environment of the vehicle for an efficient and robust interpretation of the scene in real-time.
This book on autonomous road-following vehicles brings together twenty years of innovation in the field. The book uniquely details an approach to real-time machine vision for the understanding of dynamic scenes, viewed from a moving platform that begins with spatio-temporal representations of motion for hypothesized objects whose parameters are adjusted by well-known prediction error feedback and recursive estimation techniques.
This book introduces a new theory in Computer Vision yielding elementary techniques to analyze digital images. These techniques are a mathematical formalization of the Gestalt theory. From the mathematical viewpoint the closest field to it is stochastic geometry, involving basic probability and statistics, in the context of image analysis. The book is mathematically self-contained, needing only basic understanding of probability and calculus. The text includes more than 130 illustrations, and numerous examples based on specific images on which the theory is tested. Detailed exercises at the end of each chapter help the reader develop a firm understanding of the concepts imparted.
The sixteen chapters included in this book were written by invited experts of international recognition and address important issues in Medical Image Processing and Computational Vision, including: Object Recognition, Object Detection, Object Tracking, Pose Estimation, Facial Expression Recognition, Image Retrieval, Data Mining, Automatic Video Understanding and Management, Edges Detection, Image Segmentation, Modelling and Simulation, Medical thermography, Database Systems, Synthetic Aperture Radar and Satellite Imagery. Different applications are addressed and described throughout the book, comprising: Object Recognition and Tracking, Facial Expression Recognition, Image Database, Plant Disease Classification, Video Understanding and Management, Image Processing, Image Segmentation, Bio-structure Modelling and Simulation, Medical Imaging, Image Classification, Medical Diagnosis, Urban Areas Classification, Land Map Generation. The book brings together the current state-of-the-art in the various multi-disciplinary solutions for Medical Image Processing and Computational Vision, including research, techniques, applications and new trends contributing to the development of the related areas.
Spatial trajectories have been bringing the unprecedented wealth to a variety of research communities. A spatial trajectory records the paths of a variety of moving objects, such as people who log their travel routes with GPS trajectories. The field of moving objects related research has become extremely active within the last few years, especially with all major database and data mining conferences and journals. "Computing with Spatial Trajectories" introduces the algorithms, technologies, and systems used to process, manage and understand existing spatial trajectories for different applications. This book also presents an overview on both fundamentals and the state-of-the-art research inspired by spatial trajectory data, as well as a special focus on trajectory pattern mining, spatio-temporal data mining and location-based social networks. Each chapter provides readers with a tutorial-style introduction to one important aspect of location trajectory computing, case studies and many valuable references to other relevant research work. "Computing with Spatial Trajectories" is designed as a reference or secondary text book for advanced-level students and researchers mainly focused on computer science and geography. Professionals working on spatial trajectory computing will also find this book very useful.
This is the first book to treat the analysis of 3D dynamic scenes using a stereovision system. Several approaches are described, for example two different methods for dealing with long and short sequences of images of an unknown environment including an arbitrary number of rigid mobile objects. Results obtained from stereovision systems are found to be superior to those from monocular image systems, which are often very sensitive to noise and therefore of little use in practice. It is shown thatmotion estimation can be further improved by the explicit modeling of uncertainty in geometric objects. The techniques developed in this book have been successfully demonstrated with a large number of real images in the context of visual navigation of a mobile robot.
Wireless communication and sensor networks would form the backbone to create pervasive and ubiquitous environments that would have profound influence on the society and thus are important to the society. The wireless communication technologies and wireless sensor networks would encompass a wide range of domains such as HW devices such as motes, sensors and associated instrumentation, actuators, transmitters, receivers, antennas, etc., sensor network aspects such as topologies, routing algorithms, integration of heterogeneous network elements and topologies, designing RF devices and systems for energy efficiency and reliability etc. These sensor networks would provide opportunity to continuously and in a distributed manner monitor the environment and generate the necessary warnings and actions. However most of the developments have been demonstrated only in controlled and laboratory environments. So we are yet to see those powerful, ubiquitous applications for the benefit of the society. The conference and consequentially the proceedings would provide opportunity to the researchers to interact with other researchers and share their researches covering all the above areas. The proceedings of the conference thus covers the research work of different authors in the area of wireless sensor networks, wireless communications, devices, tools and techniques for WSN, and applications of wireless sensor networks. This book is beneficial for those researchers who are working in the area of wireless sensor networks, wireless communication, and developing applications of Wireless sensor networks.
The research book is a continuation of the authors' previous works, which are focused on recent advances in computer vision methodologies and technical solutions using conventional and intelligent paradigms. The book gathers selected contributions addressing aerial and satellite image processing and related fields. Topics covered include novel tensor and wave models, a new comparative morphology scheme, warping compensation in video stabilization, image deblurring based on physical processes of blur impacts, and a rapid and robust core structural verification algorithm for feature extraction in images and videos, among others. All chapters focus on practical implementations. Given the tremendous interest among researchers in the development and applications of computer vision paradigms in the field of business, engineering, medicine, security and aviation, this book offers a timely guide.
Machine learning is a novel discipline concerned with the analysis of large and multiple variables data. It involves computationally intensive methods, like factor analysis, cluster analysis, and discriminant analysis. It is currently mainly the domain of computer scientists, and is already commonly used in social sciences, marketing research, operational research and applied sciences. It is virtually unused in clinical research. This is probably due to the traditional belief of clinicians in clinical trials where multiple variables are equally balanced by the randomization process and are not further taken into account. In contrast, modern computer data files often involve hundreds of variables like genes and other laboratory values, and computationally intensive methods are required. This book was written as a hand-hold presentation accessible to clinicians, and as a must-read publication for those new to the methods.
Mobile robots operating in real-world, outdoor scenarios depend on dynamic scene understanding for detecting and avoiding obstacles, recognizing landmarks, acquiring models, and for detecting and tracking moving objects. Motion understanding has been an active research effort for more than a decade, searching for solutions to some of these problems; however, it still remains one of the more difficult and challenging areas of computer vision research. Qualitative Motion Understanding describes a qualitative approach to dynamic scene and motion analysis, called DRIVE (Dynamic Reasoning from Integrated Visual Evidence). The DRIVE system addresses the problems of (a) estimating the robot's egomotion, (b) reconstructing the observed 3-D scene structure; and (c) evaluating the motion of individual objects from a sequence of monocular images. The approach is based on the FOE (focus of expansion) concept, but it takes a somewhat unconventional route. The DRIVE system uses a qualitative scene model and a fuzzy focus of expansion to estimate robot motion from visual cues, to detect and track moving objects, and to construct and maintain a global dynamic reference model.
In recent yearswe haveseen considerableadvances in the development of - manoid robots, that is robots with an anthropomorphic design. Such robots should be capable of autonomously performing tasks for their human users in changing environments by adapting to these and to the circumstances at hand. To do so, they as well as any kind of autonomous robot need to have some way of understanding the world around them. We humans do so by our senses, both our far senses vision and hearing (smelling too) and our near senses touch and taste. Vision plays a special role in the way it simulta- ously tells us "where" and "what" in a direct way. It is therefore an accepted factthatto developautonomousrobots,humanoidornot,itisessentialto- clude competent systems for visual perception. Such systems should embody techniques from the ?eld of computer vision, in which sophisticated com- tational methods for extracting information from visual imagery have been developed over a number of decades. However, complete systems incorpor- ing such advanced techniques, while meeting the requirements of real-time processing and adaptivity to the complexity that even our everyday envir- ment displays, are scarce. The present volume takes an important step for ?lling this gap by presenting methods and a system for visual perception for a humanoid robot with speci?c applications to manipulation tasks and to how the robot can learn by imitating the human.
Image Modeling and Retrieval; E. Vicario. Efficient and Effective Nearest Neighbor Search in a Medical Image Database of Tumor Shapes; F. Korn, et al. Shape-Similarity-Based Retrieval in Image Databases; R. Mehrotra, J.E. Gary. Color Angular Indexing and Image Retrieval; G.D. Finlayson, et al. Indexing Color-Texture Image Patterns; A.D. Ventura, et al. Iconic Indexing for Visual Databases; Q-L. Zhang, S-K. Chang. Using Weighted Spatial Relationships in Retrieval by Visual Contents; A. Del Bimbo, et al.. Index.
The computer interpretation of line drawings is a classic problem in arti?cial intelligence (AI) which has inspired the development of some fundamental AI tools, including constraint propagation, probabilistic relaxation, the characte- zation of tractable constraint classes and, most recently, the propagationof soft constraintsin?nite-domainoptimizationproblems. Line drawinginterpretation has many distinct applications on the borderline of computer vision and c- puter graphics, including sketch interpretation, the input of 3D object models 1 and the creation of 2 D illustrations in electronic documents. 2 I hope I have made this fascinating topic accessible not only to computer scientistsbutalsotomathematicians,psychologistsandcognitivescientistsand, indeed, to anyone who is intrigued by optical illusions and impossible or - biguous ?gures. This book could not have been written without the support of the CNRS, theFrenchCentreNational deRecherche Scienti?que,who?nancedmyone-year break from teaching at the University of Toulouse III. The UK Engineering and Physical Sciences Research Council also ?nanced several extended visits to the Oxford University Computing Laboratory. Section 9.1 is just a brief summary of the results on tractable constraints that have come out of this very productive joint research programme with David Cohen, Peter Jeavons and Andrei Krokhin. The various soft arc consistency techniques described in Chapter 8 were developed in collaboration with Thomas Schiex and Simon de Givry at INRA, Toulouse. I am also grateful to Ralph Martin and Peter Varley for their comments on the line-labelling constraints presented in Chapter 3.
Traditionally, scientific fields have defined boundaries, and scientists work on research problems within those boundaries. However, from time to time those boundaries get shifted or blurred to evolve new fields. For instance, the original goal of computer vision was to understand a single image of a scene, by identifying objects, their structure, and spatial arrangements. This has been referred to as image understanding. Recently, computer vision has gradually been making the transition away from understanding single images to analyzing image sequences, or video understanding. Video understanding deals with understanding of video sequences, e. g., recognition of gestures, activities, facial expressions, etc. The main shift in the classic paradigm has been from the recognition of static objects in the scene to motion-based recognition of actions and events. Video understanding has overlapping research problems with other fields, therefore blurring the fixed boundaries. Computer graphics, image processing, and video databases have obvious overlap with computer vision. The main goal of computer graphics is to gener ate and animate realistic looking images, and videos. Researchers in computer graphics are increasingly employing techniques from computer vision to gen erate the synthetic imagery. A good example of this is image-based rendering and modeling techniques, in which geometry, appearance, and lighting is de rived from real images using computer vision techniques. Here the shift is from synthesis to analysis followed by synthesis."
Human and animal vision systems have been driven by the pressures of evolution to become capable of perceiving and reacting to their environments as close to instantaneously as possible. Casting such a goal of reactive vision into the framework of existing technology necessitates an artificial system capable of operating continuously, selecting and integrating information from an environment within stringent time delays. The YAP (Vision As Process) project embarked upon the study and development of techniques with this aim in mind. Since its conception in 1989, the project has successfully moved into its second phase, YAP II, using the integrated system developed in its predecessor as a basis. During the first phase of the work the "vision as a process paradigm" was realised through the construction of flexible stereo heads and controllable stereo mounts integrated in a skeleton system (SA V A) demonstrating continuous real-time operation. It is the work of this fundamental period in the V AP story that this book aptly documents. Through its achievements, the consortium has contributed to building a strong scientific base for the future development of continuously operating machine vision systems, and has always underlined the importance of not just solving problems of purely theoretical interest but of tackling real-world scenarios. Indeed the project members should now be well poised to contribute (and take advantage of) industrial applications such as navigation and process control, and already the commercialisation of controllable heads is underway.
This volume contains revised papers based on contributions to the NATO Advanced Research Workshop on Multisensor Fusion for Computer Vision, held in Grenoble, France, in June 1989. The 24 papers presented here cover a broad range of topics, including the principles and issues in multisensor fusion, information fusion for navigation, multisensor fusion for object recognition, network approaches to multisensor fusion, computer architectures for multi sensor fusion, and applications of multisensor fusion. The participants met in the beautiful surroundings of Mont Belledonne in Grenoble to discuss their current work in a setting conducive to interaction and the exchange of ideas. Each participant is a recognized leader in his or her area in the academic, governmental, or industrial research community. The workshop focused on techniques for the fusion or integration of sensor information to achieve the optimum interpretation of a scene. Several participants presented novel points of view on the integration of information. The 24 papers presented in this volume are based on those collected by the editor after the workshop, and reflect various aspects of our discussions. The papers are organized into five parts, as follows.
MPEG-7 is the first international standard which contains a number of key techniques from Computer Vision and Image Processing. The Curvature Scale Space technique was selected as a contour shape descriptor for MPEG-7 after substantial and comprehensive testing, which demonstrated the superior performance of the CSS-based descriptor. Curvature Scale Space Representation: Theory, Applications, and MPEG-7 Standardization is based on key publications on the CSS technique, as well as its multiple applications and generalizations. The goal was to ensure that the reader will have access to the most fundamental results concerning the CSS method in one volume. These results have been categorized into a number of chapters to reflect their focus as well as content. The book also includes a chapter on the development of the CSS technique within MPEG standardization, including details of the MPEG-7 testing and evaluation processes which led to the selection of the CSS shape descriptor for the standard. The book can be used as a supplementary textbook by any university or institution offering courses in computer and information science.
Overview Recent years have seen an increasing interest in the development of multi-sensory robot systems. The reason for this interest stems from a realization that there are fundamental limitations on the reconstruction of environment descriptions using only a single source of sensor information. If robot systems are ever to achieve a degree of intelligence and autonomy, they must be capable of using many different sources of sensory information in an active and dynamic manner. The observations made by the different sensors of a multi-sensor system are always uncertain, usually partial, occasionally spuri9us or incorrect and often geographically or geometrically imcomparable with other sensor views. The sensors of these systems are characterized by the diversity of information that they can provide and by the complexity of their operation. It is the goal of a multi sensor system to combine information from all these different sources into a robust and consistent description of the environment." |
![]() ![]() You may like...
Theory, Application, and Implementation…
Pooneh Saidi Bidokhti
Hardcover
R3,338
Discovery Miles 33 380
Architectural Intelligence - Selected…
Philip F. Yuan, Mike Xie, …
Hardcover
R4,392
Discovery Miles 43 920
Applying AI-Based IoT Systems to…
Bhatia Madhulika, Bhatia Surabhi, …
Hardcover
R7,243
Discovery Miles 72 430
Chemical Modelling - Volume 17
Hilke Bahmann, Jean Christophe Tremblay
Hardcover
R12,177
Discovery Miles 121 770
Thermo-Hydro-Mechanical-Chemical…
Olaf Kolditz, Hua Shao, …
Hardcover
|