|
Showing 1 - 3 of
3 matches in All Departments
In Synthetic Vision: Using Volume Learning and Visual DNA, a
holistic model of the human visual system is developed into a
working model in C++, informed by the latest neuroscience, DNN, and
computer vision research. The author's synthetic visual pathway
model includes the eye, LGN, visual cortex, and the high level PFC
learning centers. The corresponding visual genome model (VGM),
begun in 2014, is introduced herein as the basis for a visual
genome project analogous to the Human Genome Project funded by the
US government. The VGM introduces volume learning principles and
Visual DNA (VDNA) taking a multivariate approach beyond deep neural
networks. Volume learning is modeled as programmable learning and
reasoning agents, providing rich methods for structured agent
classification networks. Volume learning incorporates a massive
volume of multivariate features in various data space projections,
collected into strands of Visual DNA, analogous to human DNA genes.
VGM lays a foundation for a visual genome project to sequence VDNA
as visual genomes in a public database, using collaborative
research to move synthetic vision science forward and enable new
applications. Bibliographical references are provided to key
neuroscience, computer vision, and deep learning research, which
form the basis for the biologically plausible VGM model and the
synthetic visual pathway. The book also includes graphical
illustrations and C++ API reference materials to enable VGM
application programming. Open source code licenses are available
for engineers and scientists. Scott Krig founded Krig Research to
provide some of the world's first vision and imaging systems
worldwide for military, industry, government, and academic use.
Krig has worked for major corporations and startups in the areas of
machine learning, computer vision, imaging, graphics, robotics and
automation, computer security and cryptography. He has authored
international patents in the areas of computer architecture,
communications, computer security, digital imaging, and computer
vision, and studied at Stanford. Scott Krig is the author of the
English/Chinese Springer book Computer Vision Metrics, Survey,
Taxonomy and Analysis of Computer Vision, Visual Neuroscience, and
Deep Learning, Textbook Edition, as well as other books, articles,
and papers.
Based on the successful 2014 book published by Apress, this
textbook edition is expanded to provide a comprehensive history and
state-of-the-art survey for fundamental computer vision methods and
deep learning. With over 800 essential references, as well as
chapter-by-chapter learning assignments, both students and
researchers can dig deeper into core computer vision topics and
deep learning architectures. The survey covers everything from
feature descriptors, regional and global feature metrics, feature
learning architectures, deep learning, neuroscience of vision,
neural networks, and detailed example architectures to illustrate
computer vision hardware and software optimization methods. To
complement the survey, the textbook includes useful analyses which
provide insight into the goals of various methods, why they work,
and how they may be optimized. The text delivers an essential
survey and a valuable taxonomy, thus providing a key learning tool
for students, researchers and engineers, to supplement the many
effective hands-on resources and open source projects, such as
OpenCV and other imaging and deep learning tools.
Computer Vision Metrics provides an extensive survey and analysis
of over 100 current and historical feature description and machine
vision methods, with a detailed taxonomy for local, regional and
global features. This book provides necessary background to develop
intuition about why interest point detectors and feature
descriptors actually work, how they are designed, with observations
about tuning the methods for achieving robustness and invariance
targets for specific applications. The survey is broader than it is
deep, with over 540 references provided to dig deeper. The taxonomy
includes search methods, spectra components, descriptor
representation, shape, distance functions, accuracy, efficiency,
robustness and invariance attributes, and more. Rather than
providing 'how-to' source code examples and shortcuts, this book
provides a counterpoint discussion to the many fine opencv
community source code resources available for hands-on
practitioners.
|
You may like...
Not available
Loot
Nadine Gordimer
Paperback
(2)
R398
R369
Discovery Miles 3 690
|