The mathematical theory of counterpoint was originally aimed at simulating the composition rules described in Johann Joseph Fux's Gradus ad Parnassum. It soon became apparent that the algebraic apparatus used in this model could also serve to define entirely new systems of rules for composition, generated by new choices of consonances and dissonances, which in turn lead to new restrictions governing the succession of intervals. This is the first book bringing together recent developments and perspectives on mathematical counterpoint theory in detail. The authors include recent theoretical results on counterpoint worlds, the extension of counterpoint to microtonal pitch systems, the singular homology of counterpoint models, and the software implementation of contrapuntal models. The book is suitable for graduates and researchers. A good command of algebra is a prerequisite for understanding the construction of the model.
Metal Music Manual shows you the creative and technical processes involved in producing contemporary heavy music for maximum sonic impact. From pre-production to final mastered product, and fundamental concepts to advanced production techniques, this book contains a world of invaluable practical information. Assisted by clear discussion of critical audio principles and theory, and a comprehensive array of illustrations, photos, and screen grabs, Metal Music Manual is the essential guide to achieving professional production standards. The extensive companion website features multi-track recordings, final mixes, processing examples, audio stems, etc., so you can download the relevant content and experiment with the techniques you read about. The website also features video interviews the author conducted with the following acclaimed producers, who share their expertise, experience, and insight into the processes involved: Fredrik Nordstroem (Dimmu Borgir, At The Gates, In Flames); Matt Hyde (Slayer, Parkway Drive, Children of Bodom); Ross Robinson (Slipknot, Sepultura, Machine Head); Logan Mader (Gojira, DevilDriver, Fear Factory); Andy Sneap (Megadeth, Killswitch Engage, Testament); Jens Bogren (Opeth, Kreator, Arch Enemy); Daniel Bergstrand (Meshuggah, Soilwork, Behemoth); and Nick Raskulinecz (Mastodon, Death Angel, Trivium). Quotes from these interviews are featured throughout Metal Music Manual, with additional contributions from Ross "Drum Doctor" Garfield (one of the world's top drum sound specialists, with Metallica and Slipknot amongst his credits); Andrew Scheps (Black Sabbath, Linkin Park, Metallica); and Maor Appelbaum (Sepultura, Faith No More, Halford).
Stochastically-Based Semantic Analysis investigates the problem of automatic natural language understanding in a spoken language dialog system. The focus is on the design of a stochastic parser and its evaluation with respect to a conventional rule-based method. Stochastically-Based Semantic Analysis will be of most interest to researchers in artificial intelligence, especially those in natural language processing, computational linguistics, and speech recognition. It will also appeal to practicing engineers who work in the area of interactive speech systems.
Mathematical Music offers a concise and easily accessible history of how mathematics was used to create music. The story presented in this short, engaging volume ranges from ratios in antiquity to random combinations in the 17th century, 20th-century statistics, and contemporary artificial intelligence. This book provides a fascinating panorama of the gradual mechanization of thought processes involved in the creation of music. How did Baroque authors envision a composition system based on combinatorics? What was it like to create musical algorithms at the beginning of the 20th century, before the computer became a reality? And how does this all explain today's use of artificial intelligence and machine learning in music? In addition to discussing the history and the present state of mathematical music, Braguinski also takes a look at what possibilities the near future of music AI might hold for listeners, musicians, and society. Grounded in research findings from musicology and the history of technology, and written for the non-specialist general audience, this book helps both student and professional readers to make sense of today's music AI by situating it in a continuous historical context.
Introduction to Digital Music with Python Programming provides a foundation in music and code for the beginner. It shows how coding empowers new forms of creative expression while simplifying and automating many of the tedious aspects of production and composition. With the help of online, interactive examples, this book covers the fundamentals of rhythm, chord structure, and melodic composition alongside the basics of digital production. Each new concept is anchored in a real-world musical example that will have you making beats in a matter of minutes. Music is also a great way to learn core programming concepts such as loops, variables, lists, and functions. Introduction to Digital Music with Python Programming is designed for beginners of all backgrounds, including high school students, undergraduates, and aspiring professionals, and requires no previous experience with music or code.
Designing Human Interface in Speech Technology bridges a gap between the needs of the technical engineer and cognitive researchers working in the multidisciplinary area of speech technology applications. The approach is systematic and the focus is on the utility of developing and designing speech-related products. Included is coverage of topics such as neuroscience on the multimodal cortex, cognitive theories on multi-task performance, stress and workload, as well as human information process theory and ecological interface design theory for evaluating speech-related human-system interfaces. Of special emphasis are topics such as spoken dialogue system design, in-vehicle communication system design and speech technology in military applications. Also included are tools for analyzing the design, different design theories and processes, and methods for understanding users. The material systematically describes the user-centered design process and usability evaluation methods. Designing Human Interface in Speech Technology is appropriate for designers, engineers, and decision makers working in the area of speech technology research. It is also a good textbook for senior university students and postgraduate students in the respective interaction design areas.
This thesis discusses the privacy issues in speech-based applications such as biometric authentication, surveillance, and external speech processing services. Author Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification and speech recognition. The author also introduces some of the tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions. Experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets are also included in the text. Using the framework proposed may now make it possible for a surveillance agency to listen for a known terrorist without being able to hear conversation from non-targeted, innocent civilians.
The availability of increased computational power and the proliferation of the Internet have facilitated the production and distribution of unauthorized copies of multimedia information. As a result, the problem of copyright protection has attracted the interest of worldwide scientific and business communities. Signal Processing, Perceptual Coding and Watermarking of Digital Audio: Advanced Technologies and Models focuses on watermarking, in which data is marked with hidden ownership information, as a promising solution to copyright protection issues. Compared to embedding watermarks into still images, hiding data in audio is much more challenging due to the extreme sensitivity of the human auditory system to changes in the audio signal. This book focuses on understanding human perception processes and including them in effective psychoacoustic models, as well as synchronization, which is an important component of a successful watermarking system.
Both modern mathematical music theory and computer science are strongly influenced by the theory of categories and functors. One outcome of this research is the data format of denotators, which is based on set-valued presheaves over the category of modules and diaffine homomorphisms. The functorial approach of denotators deals with generalized points in the form of arrows and allows the construction of a universal concept architecture. This architecture is ideal for handling all aspects of music, especially for the analysis and composition of highly abstract musical works. This book presents an introduction to the theory of module categories and the theory of denotators, as well as the design of a software system, called Rubato Composer, which is an implementation of the category-theoretic concept framework. The application is written in portable Java and relies on plug-in components, so-called rubettes, which may be combined in data flow networks for the generation and manipulation of denotators. The Rubato Composer system is open to arbitrary extension and is freely available under the GPL license. It allows the developer to build specialized rubettes for tasks that are of interest to composers, who in turn combine them to create music. It equally serves music theorists, who use them to extract information from and manipulate musical structures. They may even develop new theories by experimenting with the many parameters that are at their disposal thanks to the increased flexibility of the functorial concept architecture. Two contributed chapters by Guerino Mazzola and Florian Thalmann illustrate the application of the theory as well as the software in the development of compositional tools and the creation of a musical work with the help of the Rubato framework.
Dialect Accent Features for Establishing Speaker Identity: A Case Study discusses the subject of forensic voice identification and speaker profiling. Specifically focusing on speaker profiling and using dialects of the Hindi language, widely used in India, the authors have contributed to the body of research on speaker identification by using accent features as the discriminating factor. This case study contributes to the understanding of the speaker identification process in a situation where unknown speech samples are in a different language/dialect than the recording of a suspect. The authors' data establishes that the vowel quality, quantity, intonation and tone of a speaker, as compared to Khariboli (standard Hindi), could be the potential features for identification of dialect accent.
Introduction to Digital Audio Coding and Standards provides a detailed introduction to the methods, implementations, and official standards of state-of-the-art audio coding technology. In the book, the theory and implementation of each of the basic coder building blocks is addressed. The building blocks are then fit together into a full coder and the reader is shown how to judge the performance of such a coder. Finally, the authors discuss the features, choices, and performance of the main state-of-the-art coders defined in the ISO/IEC MPEG and HDTV standards and in commercial use today.
While the use of technology to compensate for individual shortcomings is nothing new, there has been tremendous progress in the application of technology toward assisting individuals with disabilities, particularly with the use of computer synthesized speech (CSS) to help speech impaired people communicate using voice. Computer Synthesized Speech Technologies: Tools for Aiding Impairment provides information to current and future practitioners that will allow them to better assist speech disabled individuals who wish to utilize CSS technology. Just as important as the practitioner's knowledge of the latest advances in speech technology, so, too, is the practitioner's understanding of how specific client needs affect the use of CSS, how cognitive factors related to comprehension of CSS affect its use, and how social factors related to perceptions of the CSS user affect their interaction with others. This cutting edge book addresses those topics pertinent to understanding the myriad of concerns involved with the implementation of CSS so that CSS technologies may continue to evolve and improve for speech impaired individuals.
This revised and updated book describes how to reduce costs, and covers the basic techniques, products and applications of the technology. It also gives information and access to over 400 organizations that provide services in the voice processing area.
This book provides various speech enhancement algorithms for digital hearing aids. It covers information on noise signals extracted from silences of speech signal. The description of the algorithm used for this purpose is also provided. Different types of adaptive filters such as Least Mean Squares (LMS), Normalized LMS (NLMS) and Recursive Least Squares (RLS) are described for noise reduction in the speech signals. Different types of noises are taken to generate noisy speech signals, and therefore information on various noise signals is provided. The comparative performance of various adaptive filters for noise reduction in speech signals is also described. In addition, the book provides a speech enhancement technique using adaptive filtering and necessary frequency strength enhancement using wavelet transform as per the requirement of audiogram for digital hearing aids. Presents speech enhancement techniques for improving performance of digital hearing aids; Covers various types of adaptive filters and their advantages and limitations; Provides a hybrid speech enhancement technique using wavelet transform and adaptive filters.
This book is one outcome of the NATO Advanced Studies Institute (ASI) Workshop, "Speechreading by Man and Machine," held at the Chateau de Bonas, Castera-Verduzan (near Auch, France) from August 28 to September 8, 1995 - the first interdisciplinary meeting devoted to the subject of speechreading ("lipreading"). The forty-five attendees from twelve countries covered the gamut of speechreading research, from brain scans of humans processing bi-modal stimuli, to psychophysical experiments and illusions, to statistics of comprehension by the normal and deaf communities, to models of human perception, to computer vision and learning algorithms and hardware for automated speechreading machines. The first week focussed on speechreading by humans, the second week by machines, a general organization that is preserved in this volume. After the inevitable difficulties in clarifying language and terminology across disciplines as diverse as human neurophysiology, audiology, psychology, electrical engineering, mathematics, and computer science, the participants engaged in lively discussion and debate. We think it is fair to say that there was an atmosphere of excitement and optimism for a field that is both fascinating and potentially lucrative. Of the many general results that can be taken from the workshop, two of the key ones are these: * The ways in which humans employ visual image for speech recognition are manifold and complex, and depend upon the talker-perceiver pair, severity and age of onset of any hearing loss, whether the topic of conversation is known or unknown, the level of noise, and so forth.
Corpus-based methods will be found at the heart of many language and speech processing systems. This book provides an in-depth introduction to these technologies through chapters describing basic statistical modeling techniques for language and speech, the use of Hidden Markov Models in continuous speech recognition, the development of dialogue systems, part-of-speech tagging and partial parsing, data-oriented parsing and n-gram language modeling. The book attempts to give both a clear overview of the main technologies used in language and speech processing, along with sufficient mathematics to understand the underlying principles. There is also an extensive bibliography to enable topics of interest to be pursued further. Overall, we believe that the book will give newcomers a solid introduction to the field and it will give existing practitioners a concise review of the principal technologies used in state-of-the-art language and speech processing systems. Corpus-Based Methods in Language and Speech Processing is an initiative of ELSNET, the European Network in Language and Speech. In its activities, ELSNET attaches great importance to the integration of language and speech, both in research and in education. The need for and the potential of this integration are well demonstrated by this publication.
The advances in computing and networking have sparked an enormous interest in deploying automatic speech recognition on mobile devices and over communication networks. This book brings together academic researchers and industrial practitioners to address the issues in this emerging realm and presents the reader with a comprehensive introduction to the subject of speech recognition in devices and networks. It covers network, distributed and embedded speech recognition systems.
This work addresses the evaluation of the human and the automatic speaker recognition performances under different channel distortions caused by bandwidth limitation, codecs, and electro-acoustic user interfaces, among other impairments. Its main contribution is the demonstration of the benefits of communication channels of extended bandwidth, together with an insight into how speaker-specific characteristics of speech are preserved through different transmissions. It provides sufficient motivation for considering speaker recognition as a criterion for the migration from narrowband to enhanced bandwidths, such as wideband and super-wideband.
This new Springer volume provides a comprehensive and detailed look at current approaches to automated question answering. The level of presentation is suitable for newcomers to the field as well as for professionals wishing to study this area and/or to build practical QA systems. The book can serve as a "how-to" handbook for IT practitioners and system developers. It can also be used to teach graduate courses in Computer Science, Information Science and related disciplines.
Game Audio Fundamentals takes the reader on a journey through game audio design: from analog and digital audio basics, to the art and execution of sound effects, soundtracks, and voice production, as well as learning how to make sense of a truly effective soundscape. Presuming no pre-existing knowledge, this accessible guide is accompanied by online resources - including practical examples and incremental DAW exercises - and presents the theory and practice of game audio in detail, and in a format anyone can understand. This is essential reading for any aspiring game audio designer, as well as students and professionals from a range of backgrounds, including music, audio engineering, and game design.
Rhythm and Transforms is a book that explores rhythm in music, its structure and how we perceive it. The book will appeal to engineers interested in acoustic signal processing as well as musicians, composers and computer scientists. Anyone interested in the scientific basis of music, from psychologists to the designers of electronic musical instruments, will be interested in this book.