Matrix transforms are ubiquitous within the world of computer graphics, where they have become an invaluable tool in a programmer's toolkit for solving everything from 2D image scaling to 3D rotation about an arbitrary axis. Virtually every software system and hardware graphics processor uses matrices to undertake operations such as scaling, translation, reflection and rotation. Nevertheless, for some newcomers to the world of computer games and animation, matrix notation can appear obscure and challenging. Matrices and determinants were originally used to solve groups of simultaneous linear equations, and were subsequently embraced by the computer graphics community to describe the geometric operations for manipulating two- and three-dimensional structures. Consequently, to place matrix notation within an historical context, the author provides readers with some useful background to their development, alongside determinants. Although it is assumed that the reader is familiar with everyday algebra and the solution of simultaneous linear equations, "Matrix Transforms for Computer Games and Animation" does not expect any prior knowledge of matrix notation. It includes chapters on matrix notation, determinants, matrices, 2D transforms, 3D transforms and quaternions, and includes many worked examples to illustrate their practical use.
This book constitutes the proceedings of the 4th International Workshop on Motion in Games, held in Edinburgh, UK, in November 2011. The 30 revised full papers presented together with 8 revised poster papers in this volume were carefully reviewed and selected from numerous submissions. The papers are organized in topical sections on character animation, motion synthesis, physically-based character motion, behavior animation, animation systems, crowd simulation, as well as path planning and navigation.
The following chapters of this book present key issues concerning the neurophysiological aspects of executing cognitive thought processes, the basics of cognitive informatics, and new proposals of UBIAS systems dedicated to the meaning-based analysis of selected types of medical images. In particular, to structure the considerations of pattern classification methods, Chapter 2 discusses traditional image recognition techniques and algorithms, from the simplest methods based on metric spaces up to methods that use the paradigms of computer image understanding. Chapter 3 deals with the cognitive aspects of brain function. Information from this chapter allows the authors, in a later part of this book, to show functional analogies between the operation of biological systems and computer implementations. Chapter 4 provides a short compendium of knowledge about the new branch of informatics which formally describes thought processes, namely cognitive informatics. The introduction to subjects of cognitive processes analysed by cognitive informatics will then allow us to introduce new classes of computer systems executing cognitive resonance processes. Chapter 5 defines a new class of information systems using cognitive resonance processes. This chapter reviews several proposals of various classes of cognitive categorisation systems put forward by the authors. Chapter 6 contains a broader discussion of the UBIAS system class which the authors proposed for the meaning-based analysis of medical images. Then, Chapter 7 discusses in detail two examples of UBIAS systems built for the semantic classification of foot bone X-rays and images of long bone injuries in extremities. Chapter 8, the last, compiles and summarises information on creating cognitive vision systems designed for the semantic classification of patterns.
The authors present this book to Readers in the hope that it will stir their fascination with the scientific aspects of creating new generation computer systems which imitate thought processes and can determine the meaning of complex image patterns.
This volume contains the proceedings of a workshop on Analog Integrated Neural Systems held May 8, 1989, in connection with the International Symposium on Circuits and Systems. The presentations were chosen to encompass the entire range of topics currently under study in this exciting new discipline. Stringent acceptance requirements were placed on contributions: (1) each description was required to include detailed characterization of a working chip, and (2) each design was not to have been published previously. In several cases, the status of the project was not known until a few weeks before the meeting date. As a result, some of the most recent innovative work in the field was presented. Because this discipline is evolving rapidly, each project is very much a work in progress. Authors were asked to devote considerable attention to the shortcomings of their designs, as well as to the notable successes they achieved. In this way, other workers can now avoid stumbling into the same traps, and evolution can proceed more rapidly (and less painfully). The chapters in this volume are presented in the same order as the corresponding presentations at the workshop. The first two chapters are concerned with finding solutions to complex optimization problems under a predefined set of constraints. The first chapter reports what is, to the best of our knowledge, the first neural-chip design. In each case, the physics of the underlying electronic medium is used to represent a cost function in a natural way, using only nearest-neighbor connectivity.
In recent years there has been an increasing interest in Second Generation Image and Video Coding Techniques. These techniques introduce new concepts from image analysis that greatly improve the performance of the coding schemes for very high compression. This interest has been further emphasized by the future MPEG 4 standard. Second generation image and video coding techniques are the ensemble of approaches proposing new and more efficient image representations than the conventional canonical form. As a consequence, the human visual system becomes a fundamental part of the encoding/decoding chain. More insight to distinguish between first and second generation can be gained if it is noticed that image and video coding is basically carried out in two steps. First, image data are converted into a sequence of messages and, second, code words are assigned to the messages. Methods of the first generation put the emphasis on the second step, whereas methods of the second generation put it on the first step and use available results for the second step. As a result of including the human visual system, second generation can also be seen as an approach of seeing the image composed by different entities called objects. This implies that the image or sequence of images has first to be analyzed and/or segmented in order to find the entities. It is in this context that we have selected in this book three main approaches as second generation video coding techniques: segmentation-based schemes, model-based schemes, and fractal-based schemes. Video Coding: The Second Generation Approach is an important introduction to the new coding techniques for video. As such, all researchers, students and practitioners working in image processing will find this book of interest.
The computer interpretation of line drawings is a classic problem in artificial intelligence (AI) which has inspired the development of some fundamental AI tools, including constraint propagation, probabilistic relaxation, the characterization of tractable constraint classes and, most recently, the propagation of soft constraints in finite-domain optimization problems. Line drawing interpretation has many distinct applications on the borderline of computer vision and computer graphics, including sketch interpretation, the input of 3D object models and the creation of 2½D illustrations in electronic documents. I hope I have made this fascinating topic accessible not only to computer scientists but also to mathematicians, psychologists and cognitive scientists and, indeed, to anyone who is intrigued by optical illusions and impossible or ambiguous figures. This book could not have been written without the support of the CNRS, the French Centre National de Recherche Scientifique, who financed my one-year break from teaching at the University of Toulouse III. The UK Engineering and Physical Sciences Research Council also financed several extended visits to the Oxford University Computing Laboratory. Section 9.1 is just a brief summary of the results on tractable constraints that have come out of this very productive joint research programme with David Cohen, Peter Jeavons and Andrei Krokhin. The various soft arc consistency techniques described in Chapter 8 were developed in collaboration with Thomas Schiex and Simon de Givry at INRA, Toulouse. I am also grateful to Ralph Martin and Peter Varley for their comments on the line-labelling constraints presented in Chapter 3.
Human Identification Based on Gait is the first book to address gait as a biometric. Biometrics is now in a unique position where it affects most people's lives. This is especially true of "gait," which is one of the most recent biometrics. Recognizing people by the way they walk and run implies analyzing movement which, in turn, implies analyzing sequences of images, thus requiring memory and computational performance that became available only recently. Human Identification Based on Gait introduces developments from distinguished researchers within this relatively new area of biometrics. This book clearly establishes how human gait is biometric. Human Identification Based on Gait is structured to meet the needs of professionals in industry, as well as advanced-level students in computer science.
This book contains the proceedings of the 4th International Conference on Data Analysis and Processing held in Cefalù (Palermo, Italy) on September 23-25, 1987. The aim of this Conference, now at its fourth edition, was to give a general view of current research in the area of methods and systems for achieving artificial vision, as well as to provide up-to-date information on current activity in Europe. A number of invited speakers presented overviews of statistical classification problems and methods, non-conventional architectures, mathematical morphology, robotic vision, analysis of range images in vision systems, pattern matching algorithms and astronomical data processing. Finally a survey of the discussion on the contribution of AI to Image Analysis is given. The papers presented at the Conference have been subdivided into four sections: knowledge-based approaches, basic pattern recognition tools, multi-feature system-based solutions, and image analysis applications. We must thank IBM-Italia and the Digital Equipment Corporation for sponsoring this Conference. We feel that the days spent at Cefalù were an important step toward the mutual exchange of scientific information within the image processing community. V. Cantoni (Pavia University), V. Di Gesù (Palermo University), S. Levialdi (Rome University).
Visual languages have long been a pursuit of effective communication between human and machine. Today, they are successfully employed for end-user programming, modeling, rapid prototyping, and design activities by people of many disciplines including architects, artists, children, engineers, and scientists. Furthermore, with rapid advances of the Internet and Web technology, human-human communication through the Web or electronic mobile devices is becoming more and more prevalent. This manuscript provides a comprehensive introduction to diagrammatical visual programming languages and the technology of automatic generation of such languages. It covers a broad range of contents from the underlying theory of graph grammars to the applications in various domains. The contents were extracted from the papers that my Ph.D. students and I have published in the last 10 years, and are updated and organized in a coherent fashion. The manuscript gives an in-depth treatment of all the topic areas. Pointers to related work and further readings are also provided at the end of every chapter except Chapter 9. Rather than describing how to program visually, the manuscript discusses what visual programming languages are, and how such languages and their underlying foundations can be usefully applied to other fields in computer science that need graphs as the primary means of representation. Assuming basic knowledge of computer programming and compiler construction, the manuscript can be used as a textbook for senior or graduate computer science classes on visual languages, or as a reference book for programming language classes, practitioners, and researchers in the related field. The manuscript could not have been completed without the help of many people.
A guide on the use of SVMs in pattern classification, including a rigorous performance comparison of classifiers and regressors. The book presents architectures for multiclass classification and function approximation problems, as well as evaluation criteria for classifiers and regressors. Features: Clarifies the characteristics of two-class SVMs; Discusses kernel methods for improving the generalization ability of neural networks and fuzzy systems; Contains ample illustrations and examples; Includes performance evaluation using publicly available data sets; Examines Mahalanobis kernels, empirical feature space, and the effect of model selection by cross-validation; Covers sparse SVMs, learning using privileged information, semi-supervised learning, multiple classifier systems, and multiple kernel learning; Explores incremental training based batch training and active-set training methods, and decomposition techniques for linear programming SVMs; Discusses variable selection for support vector regressors.
This book constitutes the proceedings of the 8th International Conference on Swarm Intelligence, held in Brussels, Belgium, in September 2012. This volume contains 15 full papers, 20 short papers, and 7 extended abstracts carefully selected out of 81 submissions. The papers cover various topics of swarm intelligence.
This book constitutes the refereed proceedings of the International Workshop on Mesh Processing in Medical Image Analysis, MeshMed 2012, held in Nice, France, in October 2012 in conjunction with MICCAI 2012, the 15th International Conference on Medical Image Computing and Computer Assisted Intervention. The book includes 16 submissions: 8 were selected for presentation, along with the 3 plenary talks representative of the meshing, and 8 were selected for poster presentations. The papers cover a broad range of topics, including statistical shape analysis and atlas construction, novel meshing approaches, soft tissue simulation, quad dominant meshing and mesh based shape descriptors. The described techniques were applied to a variety of medical data including cortical bones, ear canals, cerebral aneurysms and vascular structures.
Although there has been much progress in developing theories, models and systems in the areas of Natural Language Processing (NLP) and Vision Processing (VP) there has up to now been little progress on integrating these two subareas of Artificial Intelligence (AI). This book contains a set of edited papers on recent advances in the theories, computational models and systems of the integration of NLP and VP. The volume includes original work of notable researchers: Alex Waibel outlines multimodal interfaces including studies in speech, gesture and points; eye-gaze, lip motion and facial expression; hand writing, face recognition, face tracking and sound localization in a connectionist framework. Antony Cohen and John Gooday use spatial relations to describe visual languages. Naoguki Okada considers intentions of agents in visual environments. In addition to these studies, the volume includes many recent advances from North America, Europe and Asia demonstrating the fact that integration of Natural Language Processing and Vision is truly an international challenge.
Computer vision is a field of research that encompasses many objectives. A primary goal has been to construct visual sensors that can provide general-purpose robots with the same information about their surroundings as we receive from our own visual senses. This book takes an important step towards this goal by describing a working computer vision system named SCERPO. This system can recognize known three-dimensional objects in ordinary black-and-white images taken from unknown viewpoints, even when parts of the object are undetectable or hidden from view. A second major goal of computer vision research is to provide a computational understanding of human vision. The research presented in this book has many implications for our understanding of human vision, particularly in the areas of perceptual organization and knowledge-based recognition. An attempt has been made to relate each computational result to the relevant areas in the psychology of vision. Since the material is meant to be accessible to a wide range of interdisciplinary readers, the book is written in plain language and attempts to explain most concepts from the starting position of the non-specialist. One of the most important conclusions arising from this research is that visual recognition can commonly be achieved directly from the two-dimensional image without any preliminary reconstruction of depth information or surface orientation from the visual input.
A robot must perceive the three-dimensional world if it is to be effective there. Yet recovering 3-D information from projected images is difficult, and still remains the subject of basic research. Alternatively, one can use sensors that can provide three-dimensional range information directly. The technique of projecting light-stripes started to be used in industrial object recognition systems as early as the 1970s, and time-of-flight laser-scanning range finders became available for outdoor mobile robot navigation in the mid-eighties. Once range data are obtained, a vision system must still describe the scene in terms of 3-D primitives such as edges, surfaces, and volumes, and recognize objects of interest. Today, the art of sensing, extracting features, and recognizing objects by means of three-dimensional range data is one of the most exciting research areas in computer vision. Three-Dimensional Machine Vision is a collection of papers dealing with three-dimensional range data. The authors are pioneering researchers: some are founders and others are bringing new excitements in the field. I have tried to select milestone papers, and my goal has been to make this book a reference work for researchers in three-dimensional vision. The book is organized into four parts: 3-D Sensors, 3-D Feature Extractions, Object Recognition Algorithms, and Systems and Applications. Part I includes four papers which describe the development of unique, capable 3-D range sensors, as well as discussions of optical, geometrical, electronic, and computational issues. Mundy and Porter describe a sensor system based on structured illumination for inspecting metallic castings. In order to achieve high-speed data acquisition, it uses multiple light stripes with wavelength multiplexing. Case, Jalkio, and Kim also present a multi-stripe system and discuss various design issues in range sensing by triangulation.
The numerical stereo camera developed by Altschuler, Bae, Altschuler, Dijak, Tamburino, and Woolford projects space-coded grid patterns which are generated by an electro-optical programmable spatial light modulator. Kanade and Fuhrman present a proximity sensor using multiple LEDs which are conically arranged. It can measure both distance and orientation of an object's surface.
Recent years have seen an increasing interest in the development of multi-sensory robot systems. The reason for this interest stems from a realization that there are fundamental limitations on the reconstruction of environment descriptions using only a single source of sensor information. If robot systems are ever to achieve a degree of intelligence and autonomy, they must be capable of using many different sources of sensory information in an active and dynamic manner. The observations made by the different sensors of a multi-sensor system are always uncertain, usually partial, occasionally spurious or incorrect and often geographically or geometrically incomparable with other sensor views. The sensors of these systems are characterized by the diversity of information that they can provide and by the complexity of their operation. It is the goal of a multi-sensor system to combine information from all these different sources into a robust and consistent description of the environment.
Augmented Reality (AR) and Virtual Reality (VR) technologies are increasingly being used in manufacturing processes. These use real and simulated objects to create a simulated environment that can be used to enhance the design and manufacturing processes. Virtual Reality and Augmented Reality Applications in Manufacturing is written by experts from the world's leading institutions working in virtual manufacturing and gives the state of the art of the field. Features: Chapters covering the state of the art in VR and AR technology and how these technologies can be applied to manufacturing; The latest findings in key areas of AR and VR application to manufacturing; The results of recent cross-disciplinary research projects in the US and Europe showing application solutions of AR and VR technology in real industrial settings. Virtual Reality and Augmented Reality Applications in Manufacturing will be of interest to all engineers wishing to keep up-to-date with technologies that have the potential to revolutionize manufacturing processes over the next few years.
This book constitutes the joint refereed proceedings of the 15th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2012, and the 16th International Workshop on Randomization and Computation, RANDOM 2012, held in Cambridge, Massachusetts, USA, in August 2012. The volume contains 28 contributed papers, selected by the APPROX Program Committee out of 70 submissions, and 28 contributed papers, selected by the RANDOM Program Committee out of 67 submissions. APPROX focuses on algorithmic and complexity issues surrounding the development of efficient approximate solutions to computationally difficult problems. RANDOM is concerned with applications of randomness to computational and combinatorial problems.
The two-volume set LNCS 6468-6469 contains the carefully selected and reviewed papers presented at the eight workshops that were held in conjunction with the 10th Asian Conference on Computer Vision, in Queenstown, New Zealand, in November 2010. From a total of 167 submissions to all workshops, 89 papers were selected for publication. The contributions are grouped together according to the main workshop topics, which were: computational photography and aesthetics; computer vision in vehicle technology: from Earth to Mars; electronic cultural heritage; subspace based methods; video event categorization, tagging and retrieval; visual surveillance; application of computer vision for mixed and augmented reality.
This book constitutes the refereed proceedings of the Second International Workshop on Machine Learning in Medical Imaging, MLMI 2011, held in conjunction with MICCAI 2011, in Toronto, Canada, in September 2011. The 44 revised full papers presented were carefully reviewed and selected from 74 submissions. The papers focus on major trends in machine learning in medical imaging aiming to identify new cutting-edge techniques and their use in medical imaging.
This book constitutes the refereed proceedings of the 15th and 16th International Conferences on Formal Grammar, 2010 and 2011, collocated with the European Summer School in Logic, Language and Information in July 2010/2011. The 19 revised full papers were carefully reviewed and selected from a total of 50 submissions. The papers deal with the following topics: formal and computational phonology, morphology, syntax, semantics and pragmatics; model-theoretic and proof-theoretic methods in linguistics; logical aspects of linguistic structure; constraint-based and resource-sensitive approaches to grammar; learnability of formal grammar; integration of stochastic and symbolic models of grammar; foundational, methodological and architectural issues in grammar; mathematical foundations of statistical approaches to linguistic analysis.
In recent years, the workshop "Bildverarbeitung für die Medizin" (Image Processing for Medicine) has established itself through a series of successful events. In 2012 the aim is once again to present current research results and to deepen the dialogue between scientists, industry and users. The contributions in this volume, some of them in English, cover all areas of medical image processing, in particular algorithms, hardware and software systems and their clinical application, including: imaging and image acquisition, visible light, endoscopy, microscopy, visualization and animation, patient-specific simulation and planning, computer-aided diagnosis, biomechanical modelling, computer-assisted surgical planning, image processing in telemedicine, image-guided robots and surgical simulators.
The development of technologies for the identification of individuals has driven the interest and curiosity of many people. Spearheaded and inspired by the Bertillon coding system for the classification of humans based on physical measurements, scientists and engineers have been trying to invent new devices and classification systems to capture the human identity from its body measurements. One of the main limitations of the precursors of today's biometrics, which is still present in the vast majority of the existing biometric systems, has been the need to keep the device in close contact with the subject to capture the biometric measurements. This clearly limits the applicability and convenience of biometric systems. This book presents an important step in addressing this limitation by describing a number of methodologies to capture meaningful biometric information from a distance. Most materials covered in this book have been presented at the International Summer School on Biometrics which is held every year in Alghero, Italy and which has become a flagship activity of the IAPR Technical Committee on Biometrics (IAPR TC4). The last four chapters of the book are derived from some of the best presentations by the participating students of the school. The educational value of this book is also highlighted by the number of proposed exercises and questions which will help the reader to better understand the proposed topics.
This work presents a full generic approach to the detection and recognition of traffic signs. The approach is based on the latest computer vision methods for object detection, and on powerful methods for multiclass classification. The challenge was to robustly detect a set of different sign classes in real time, and to classify each detected sign into a large, extensible set of classes. To address this challenge, several state-of-the-art methods were developed that can be used for different recognition problems. Following an introduction to the problems of traffic sign detection and categorization, the text focuses on the problem of detection, and presents recent developments in this field. The text then surveys a specific methodology for the problem of traffic sign categorization - Error-Correcting Output Codes - and presents several algorithms, performing experimental validation on a mobile mapping application. The work ends with a discussion on future research and continuing challenges.
As cameras become more pervasive in our daily life, vast amounts of video data are generated. The popularity of YouTube and similar websites such as Tudou and Youku provides strong evidence for the increasing role of video in society. One of the main challenges confronting us in the era of information technology is to effectively rely on the huge and rapidly growing video data accumulating in large multimedia archives. Innovative video processing and analysis techniques will play an increasingly important role in resolving the difficult task of video search and retrieval. A wide range of video-based applications have benefited from advances in video search and mining including multimedia information management, human-computer interaction, security and surveillance, copyright protection, and personal entertainment, to name a few. This book provides an overview of emerging new approaches to video search and mining based on promising methods being developed in the computer vision and image analysis community. Video search and mining is a rapidly evolving discipline whose aim is to capture interesting patterns in video data. It has become one of the core areas in the data mining research community. In comparison to other types of data mining (e.g. text), video mining is still in its infancy. Many challenging research problems are facing video mining researchers.