![]() |
Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
||
|
Books > Computing & IT > Applications of computing > Artificial intelligence > Computer vision
Perceptual Organization for Artificial Vision Systems is an edited collection of invited contributions based on papers presented at The Workshop on Perceptual Organization in Computer Vision, held in Corfu, Greece, in September 1999. The theme of the workshop was Assessing the State of the Community and Charting New Research Directions.' Perceptual organization can be defined as the ability to impose structural regularity on sensory data, so as to group sensory primitives arising from a common underlying cause. This book explores new models, theories, and algorithms for perceptual organization. Perceptual Organization for Artificial Vision Systems includes contributions by the world's leading researchers in the field. It explores new models, theories, and algorithms for perceptual organization, as well as demonstrates the means for bringing research results and theoretical principles to fruition in the construction of computer vision systems. The focus of this collection is on the design of artificial vision systems. The chapters comprise contributions from researchers in both computer vision and human vision.
This is the first book to describe how Autonomous Virtual Humans and Social Robots can interact with real people, be aware of the environment around them, and react to various situations. Researchers from around the world present the main techniques for tracking and analysing humans and their behaviour and contemplate the potential for these virtual humans and robots to replace or stand in for their human counterparts, tackling areas such as awareness and reactions to real world stimuli and using the same modalities as humans do: verbal and body gestures, facial expressions and gaze to aid seamless human-computer interaction (HCI). The research presented in this volume is split into three sections: *User Understanding through Multisensory Perception: deals with the analysis and recognition of a given situation or stimuli, addressing issues of facial recognition, body gestures and sound localization. *Facial and Body Modelling Animation: presents the methods used in modelling and animating faces and bodies to generate realistic motion. *Modelling Human Behaviours: presents the behavioural aspects of virtual humans and social robots when interacting and reacting to real humans and each other. Context Aware Human-Robot and Human-Agent Interaction would be of great use to students, academics and industry specialists in areas like Robotics, HCI, and Computer Graphics.
For the sixth consecutive year, the AGILE conference promoted the
publication a book collecting high-level scientific contributions
from unpublished fundamental scientific research.
The topic of level sets is currently very timely and useful for creating realistic 3-D images and animations. They are powerful numerical techniques for analyzing and computing interface motion in a host of application settings. In computer vision, it has been applied to stereo and segmentation, whereas in graphics it has been applied to the postproduction process of in-painting and 3-D model construction. Osher is co-inventor of the Level Set Methods, a pioneering framework introduced jointly with James Sethian from the University of Berkeley in 1998. This methodology has been used up to now to provide solutions to a wide application range not limited to image processing, computer vision, robotics, fluid mechanics, crystallography, lithography, and computer graphics. The topic is of great interest to advanced students, professors, and R&D professionals working in the areas of graphics (post-production), video-based surveillance, visual inspection, augmented reality, document image processing, and medical image processing. These techniques are already employed to provide solutions and products in the industry (Cognitech, Siemens, Philips, Focus Imaging). An essential compilation of survey chapters from the leading researchers in the field, emphasizing the applications of the methods. This book can be suitable for a short professional course related with the processing of visual information.
Cross disciplinary biometric systems help boost the performance of the conventional systems. Not only is the recognition accuracy significantly improved, but also the robustness of the systems is greatly enhanced in the challenging environments, such as varying illumination conditions. By leveraging the cross disciplinary technologies, face recognition systems, fingerprint recognition systems, iris recognition systems, as well as image search systems all benefit in terms of recognition performance. Take face recognition for an example, which is not only the most natural way human beings recognize the identity of each other, but also the least privacy-intrusive means because people show their face publicly every day. Face recognition systems display superb performance when they capitalize on the innovative ideas across color science, mathematics, and computer science (e.g., pattern recognition, machine learning, and image processing). The novel ideas lead to the development of new color models and effective color features in color science; innovative features from wavelets and statistics, and new kernel methods and novel kernel models in mathematics; new discriminant analysis frameworks, novel similarity measures, and new image analysis methods, such as fusing multiple image features from frequency domain, spatial domain, and color domain in computer science; as well as system design, new strategies for system integration, and different fusion strategies, such as the feature level fusion, decision level fusion, and new fusion strategies with novel similarity measures.
PREVIOUS EDITIONThis textbook introduces the "Fundamentals of Multimedia", addressing real issues commonly faced in the workplace. The essential concepts are explained in a practical way to enable students to apply their existing skills to address problems in multimedia. Fully revised and updated, this new edition now includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies. Features: presents an overview of the key concepts in multimedia, including color science; reviews lossless and lossy compression methods for image, video and audio data; examines the demands placed by multimedia communications on wired and wireless networks; discusses the impact of social media and cloud computing on information sharing and on multimedia content search and retrieval; includes study exercises at the end of each chapter; provides supplementary resources for both students and instructors at an associated website.
Based on the highly successful 3-volume reference "Handbook of
Computer Vision and Applications," this concise edition covers in a
single volume the entire spectrum of computer vision ranging form
the imaging process to high-end algorithms and applications. This
book consists of three parts, including an application gallery, and
is accompanied by a companion website
This book highlights the methods and applications for roadside video data analysis, with a particular focus on the use of deep learning to solve roadside video data segmentation and classification problems. It describes system architectures and methodologies that are specifically built upon learning concepts for roadside video data processing, and offers a detailed analysis of the segmentation, feature extraction and classification processes. Lastly, it demonstrates the applications of roadside video data analysis including scene labelling, roadside vegetation classification and vegetation biomass estimation in fire risk assessment.
Appendix 164 3. A 3. A. 1 Approximate Estimation of Fundamental Matrix from General Matrix 164 3. A. 2 Estimation of Affine Transformation 165 4 RECOVERY OF EPIPOLAR GEOMETRY FROM LINE SEGMENTS OR LINES 167 Line Segments or Straight Lines 168 4. 1 4. 2 Solving Motion Using Line Segments Between Two Views 173 4. 2. 1 Overlap of Two Corresponding Line Segments 173 Estimating Motion by Maximizing Overlap 175 4. 2. 2 Implementation Details 4. 2. 3 176 Reconstructing 3D Line Segments 4. 2. 4 179 4. 2. 5 Experimental Results 180 4. 2. 6 Discussions 192 4. 3 Determining Epipolar Geometry of Three Views 194 4. 3. 1 Trifocal Constraints for Point Matches 194 4. 3. 2 Trifocal Constraints for Line Correspondences 199 4. 3. 3 Linear Estimation of K, L, and M Using Points and Lines 200 4. 3. 4 Determining Camera Projection Matrices 201 4. 3. 5 Image Transfer 203 4. 4 Summary 204 5 REDEFINING STEREO, MOTION AND OBJECT RECOGNITION VIA EPIPOLAR GEOMETRY 205 5. 1 Conventional Approaches to Stereo, Motion and Object Recognition 205 5. 1. 1 Stereo 205 5. 1. 2 Motion 206 5. 1. 3 Object Recognition 207 5. 2 Correspondence in Stereo, Motion and Object Recognition as 1D Search 209 5. 2. 1 Stereo Matching 209 xi Contents 5. 2. 2 Motion Correspondence and Segmentation 209 5. 2. 3 3D Object Recognition and Localization 210 Disparity and Spatial Disparity Space 210 5.
Face analysis is essential for a large number of applications such as human-computer interaction or multimedia (e.g. content indexing and retrieval). Although many approaches are under investigation, performance under uncontrolled conditions is still not satisfactory. The variations that impact facial appearance (e.g. pose, expression, illumination, occlusion, motion blur) make it a difficult problem to solve. This book describes the progress towards this goal, from a core building block - landmark detection - to the higher level of micro and macro expression recognition. Specifically, the book addresses the modeling of temporal information to coincide with the dynamic nature of the face. It also includes a benchmark of recent solutions along with details about the acquisition of a dataset for such tasks.
There are many good AI books. Usually they consecrate at most one or two chapters to the imprecision knowledge processing. To our knowledge this is among the few books to be entirely dedicated to the treatment of knowledge imperfection when bui- ing intelligent systems. We consider that an entire book should be focused on this important aspect of knowledge processing. The expected audience for this book - cludes undergraduate students in computer science, IT&C, mathematics, business, medicine, etc. , graduates, specialists and researchers in these fields. The subjects treated in the book include expert systems, knowledge representation, reasoning under knowledge Imperfection (Probability Theory, Possibility Theory, Belief Theory, and Approximate Reasoning). Most of the examples discussed in details throughout the book are from the medical domain. Each chapter ends with a set of carefully pe- gogically chosen exercises, which complete solution provided. Their understanding will trigger the comprehension of the theoretical notions, concepts and results. Chapter 1 is dedicated to the review of expert systems. Hence are briefly discussed production rules, structure of ES, reasoning in an ES, and conflict resolution. Chapter 2 treats knowledge representation. That includes the study of the differences between data, information and knowledge, logical systems with focus on predicate calculus, inference rules in classical logic, semantic nets and frames.
Techniques of vision-based motion analysis aim to detect, track, identify, and generally understand the behavior of objects in image sequences. With the growth of video data in a wide range of applications from visual surveillance to human-machine interfaces, the ability to automatically analyze and understand object motions from video footage is of increasing importance. Among the latest developments in this field is the application of statistical machine learning algorithms for object tracking, activity modeling, and recognition. Developed from expert contributions to the first and second International Workshop on Machine Learning for Vision-Based Motion Analysis, this important text/reference highlights the latest algorithms and systems for robust and effective vision-based motion understanding from a machine learning perspective. Highlighting the benefits of collaboration between the communities of object motion understanding and machine learning, the book discusses the most active forefronts of research, including current challenges and potential future directions. Topics and features: provides a comprehensive review of the latest developments in vision-based motion analysis, presenting numerous case studies on state-of-the-art learning algorithms; examines algorithms for clustering and segmentation, and manifold learning for dynamical models; describes the theory behind mixed-state statistical models, with a focus on mixed-state Markov models that take into account spatial and temporal interaction; discusses object tracking in surveillance image streams, discriminative multiple target tracking, and guidewire tracking in fluoroscopy; explores issues of modeling for saliency detection, human gait modeling, modeling of extremely crowded scenes, and behavior modeling from video surveillance data; investigates methods for automatic recognition of gestures in Sign Language, and human action recognition from small training sets. Researchers, professional engineers, and graduate students in computer vision, pattern recognition and machine learning, will all find this text an accessible survey of machine learning techniques for vision-based motion analysis. The book will also be of interest to all who work with specific vision applications, such as surveillance, sport event analysis, healthcare, video conferencing, and motion video indexing and retrieval.
The book aims to investigate methods and techniques for spatial statistical analysis suitable to model spatial information in support of decision systems. Over the last few years there has been a considerable interest in these tools and in the role they can play in spatial planning and environmental modelling. One of the earliest and most famous definition of spatial planning was a geographical expression to the economic, social, cultural and ecological policies of society: borrowing from this point of view, this text shows how an interdisciplinary approach is an effective way to an harmonious integration of national policies with regional and local analysis. A wide range of spatial models and techniques is, also, covered: spatial data mining, point processes analysis, nearest neighbor statistics and cluster detection, Fuzzy Regression model and local indicators of spatial association; all of these tools provide the policy-maker with a valuable support to policy development. "
Advancements in digital sensor technology, digital image analysis techniques, as well as computer software and hardware have brought together the fields of computer vision and photogrammetry, which are now converging towards sharing, to a great extent, objectives and algorithms. The potential for mutual benefits by the close collaboration and interaction of these two disciplines is great, as photogrammetric know-how can be aided by the most recent image analysis developments in computer vision, while modern quantitative photogrammetric approaches can support computer vision activities. Devising methodologies for automating the extraction of man-made objects (e.g. buildings, roads) from digital aerial or satellite imagery is an application where this cooperation and mutual support is already reaping benefits. The valuable spatial information collected using these interdisciplinary techniques is of improved qualitative and quantitative accuracy. This book offers a comprehensive selection of high-quality and in-depth contributions from world-wide leading research institutions, treating theoretical as well as implementational issues, and representing the state-of-the-art on this subject among the photogrammetric and computer vision communities.
Enterprise Interoperability is the ability of an enterprise or organisation to work with other enterprises or organisations without special effort. It is now recognised that interoperability of systems and thus sharing of information is not sufficient to ensure common understanding between enterprises. Knowledge of information meaning and understanding of how is to be used must also be shared if decision makers distributed between those enterprises in the network want to act consistently and efficiently. Industry's need for Enterprise Interoperability has been one of the significant drivers for research into the Internet of the Future. EI research will embrace and extend contributions from the Internet of Things and the Internet of Services, and will go on to drive the future needs for Internets of People, Processes, and Knowledge.
This indispensable text introduces the foundations of three-dimensional computer vision and describes recent contributions to the field. Fully revised and updated, this much-anticipated new edition reviews a range of triangulation-based methods, including linear and bundle adjustment based approaches to scene reconstruction and camera calibration, stereo vision, point cloud segmentation, and pose estimation of rigid, articulated, and flexible objects. Also covered are intensity-based techniques that evaluate the pixel grey values in the image to infer three-dimensional scene structure, and point spread function based approaches that exploit the effect of the optical system. The text shows how methods which integrate these concepts are able to increase reconstruction accuracy and robustness, describing applications in industrial quality inspection and metrology, human-robot interaction, and remote sensing.
This book, divided in two volumes, originates from Techno-Societal 2020: the 3rd International Conference on Advanced Technologies for Societal Applications, Maharashtra, India, that brings together faculty members of various engineering colleges to solve Indian regional relevant problems under the guidance of eminent researchers from various reputed organizations. The focus of this volume is on technologies that help develop and improve society, in particular on issues such as sensor and ICT based technologies for the betterment of people, Technologies for agriculture and healthcare, micro and nano technological applications. This conference aims to help innovators to share their best practices or products developed to solve specific local problems which in turn may help the other researchers to take inspiration to solve problems in their region. On the other hand, technologies proposed by expert researchers may find applications in different regions. This offers a multidisciplinary platform for researchers from a broad range of disciplines of Science, Engineering and Technology for reporting innovations at different levels.
Measurement of Image Velocity presents a computational framework for computing motion information from sequences of images. Its specific goal is the measurement of image velocity (or optical flow), the projection of 3-D object motion onto the 2-D image plane. The formulation of the problem emphasizes the geometric and photometric properties of image formation, and the occurrence of multiple image velocities caused, for example, by specular reflections, shadows, or transparency. The method proposed for measuring image velocity is based on the phase behavior in the output of velocity-tuned filters. Extensive experimental work is used to show that phase can be a reliable source of pure image translation, small geometric deformation, smooth contrast variations, and multiple local velocities. Extensive theorectical analysis is used to explain the robustness of phase with respect to deviations from image translation, and to detect situations in which phase becomes unstable. The results indicate that optical flow may be extracted reliably for computing egomotion and structure from motion. The monograph also contains a review of other techniques and frequency analysis applied to image sequences, and it discusses the closely related topics of zero-crossing tracking, gradient-based methods, and the measurement of binocular disparity. The work is relevant to those studying machine vision and visual perception.
Image segmentation is generally the first task in any automated image understanding application, such as autonomous vehicle navigation, object recognition, photointerpretation, etc. All subsequent tasks, such as feature extraction, object detection, and object recognition, rely heavily on the quality of segmentation. One of the fundamental weaknesses of current image segmentation algorithms is their inability to adapt the segmentation process as real-world changes are reflected in the image. Only after numerous modifications to an algorithm's control parameters can any current image segmentation technique be used to handle the diversity of images encountered in real-world applications. Genetic Learning for Adaptive Image Segmentation presents the first closed-loop image segmentation system that incorporates genetic and other algorithms to adapt the segmentation process to changes in image characteristics caused by variable environmental conditions, such as time of day, time of year, weather, etc. Image segmentation performance is evaluated using multiple measures of segmentation quality. These quality measures include global characteristics of the entire image as well as local features of individual object regions in the image. This adaptive image segmentation system provides continuous adaptation to normal environmental variations, exhibits learning capabilities, and provides robust performance when interacting with a dynamic environment. This research is directed towards adapting the performance of a well known existing segmentation algorithm (Phoenix) across a wide variety of environmental conditions which cause changes in the image characteristics. The book presents a large number of experimental results and compares performance with standard techniques used in computer vision for both consistency and quality of segmentation results. These results demonstrate, (a) the ability to adapt the segmentation performance in both indoor and outdoor color imagery, and (b) that learning from experience can be used to improve the segmentation performance over time.
Exploration of Visual Data presents latest research efforts in the area of content-based exploration of image and video data. The main objective is to bridge the semantic gap between high-level concepts in the human mind and low-level features extractable by the machines. The two key issues emphasized are "content-awareness" and "user-in-the-loop." The authors provide a comprehensive review on algorithms for visual feature extraction based on color, texture, shape, and structure, and techniques for incorporating such information to aid browsing, exploration, search, and streaming of image and video data. They also discuss issues related to the mixed use of textual and low-level visual features to facilitate more effective access of multimedia data. To bridge the semantic gap, significant recent research efforts have also been put on learning during user interactions, which is also known as "relevance feedback." The difficulty and challenge also come from the personalized information need of each user and a small amount of feedbacks the machine could obtain through real-time user interaction. The authors present and discuss several recently proposed classification and learning techniques that are specifically designed for this problem, with kernel- and boosting-based approaches for nonlinear extensions. Exploration of Visual Data provides state-of-the-art materials on the topics of content-based description of visual data, content-based low-bitrate video streaming, and latest asymmetric and nonlinear relevance feedback algorithms, which to date are unpublished. Exploration of Visual Data will be of interest to researchers, practitioners, and graduate-level students in theareas of multimedia information systems, multimedia databases, computer vision, machine learning.
The computational modelling of deformations has been actively studied for the last thirty years. This is mainly due to its large range of applications that include computer animation, medical imaging, shape estimation, face deformation as well as other parts of the human body, and object tracking. In addition, these advances have been supported by the evolution of computer processing capabilities, enabling realism in a more sophisticated way. This book encompasses relevant works of expert researchers in the field of deformation models and their applications. The book is divided into two main parts. The first part presents recent object deformation techniques from the point of view of computer graphics and computer animation. The second part of this book presents six works that study deformations from a computer vision point of view with a common characteristic: deformations are applied in real world applications. The primary audience for this work are researchers from different multidisciplinary fields, such as those related with Computer Graphics, Computer Vision, Computer Imaging, Biomedicine, Bioengineering, Mathematics, Physics, Medical Imaging and Medicine.
Robotics and autonomous systems can aid disabled individuals in daily living or make a workplace more productive, but these tools are only as effective as the technology behind them. Robotic systems must be able to accurately identify and act upon elements in their environment to be effective in performing their duties. Innovative Research in Attention Modeling and Computer Vision Applications explores the latest research in image processing and pattern recognition for use in robotic real-time cryptography and surveillance applications. This book provides researchers, students, academicians, software designers, and application developers with next-generation insight into the use of computer vision technologies in a variety of industries and endeavors. This premier reference work includes chapters on topics ranging from biometric and facial recognition technologies, to digital image and video watermarking, among many others.
The goal of the Volume I Geometric Algebra for Computer Vision, Graphics and Neural Computing is to present a unified mathematical treatment of diverse problems in the general domain of artificial intelligence and associated fields using Clifford, or geometric, algebra.Geometric algebra provides a rich and general mathematical framework for Geometric Cybernetics in order to develop solutions, concepts and computer algorithms without losing geometric insight of the problem in question. Current mathematical subjects can be treated in an unified manner without abandoning the mathematical system of geometric algebra for instance: multilinear algebra, projective and affine geometry, calculus on manifolds, Riemann geometry, the representation of Lie algebras and Lie groups using bivector algebras and conformal geometry. By treating a wide spectrum of problems in a common language, this Volume I offers both new insights and new solutions that should be useful to scientists, and engineers working in different areas related with the development and building of intelligent machines. Each chapter is written in accessible terms accompanied by numerous examples, figures and a complementary appendix on Clifford algebras, all to clarify the theory and the crucial aspects of the application of geometric algebra to problems in graphics engineering, image processing, pattern recognition, computer vision, machine learning, neural computing and cognitive systems.
Deep Network Design for Medical Image Computing: Principles and Applications covers a range of MIC tasks and discusses design principles of these tasks for deep learning approaches in medicine. These include skin disease classification, vertebrae identification and localization, cardiac ultrasound image segmentation, 2D/3D medical image registration for intervention, metal artifact reduction, sparse-view artifact reduction, etc. For each topic, the book provides a deep learning-based solution that takes into account the medical or biological aspect of the problem and how the solution addresses a variety of important questions surrounding architecture, the design of deep learning techniques, when to introduce adversarial learning, and more. This book will help graduate students and researchers develop a better understanding of the deep learning design principles for MIC and to apply them to their medical problems. |
You may like...
BEM-based Finite Element Approaches on…
Steffen Weisser
Hardcover
Mathematical and Computational Methods…
Miloslav Feistauer, Jiri Felcman, …
Hardcover
R4,683
Discovery Miles 46 830
Approximation and Computation - In Honor…
Walter Gautschi, Giuseppe Mastroianni, …
Hardcover
R2,735
Discovery Miles 27 350
Frontiers in Molecular Design and…
Rachelle J. Bienstock, Veerabahu Shanmugasundaram, …
Hardcover
R4,846
Discovery Miles 48 460
Processing, Analyzing and Learning of…
Ron Kimmel, Xue-Cheng Tai
Hardcover
R4,342
Discovery Miles 43 420
Time Series Analysis: Methods and…
Tata Subba Rao, Suhasini Subba Rao, …
Hardcover
R4,435
Discovery Miles 44 350
Sample Surveys: Design, Methods and…
Danny Pfeffermann, C.R. Rao
Hardcover
R5,708
Discovery Miles 57 080
|