0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (1)
  • R250 - R500 (2)
  • R500+ (90)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Audio processing > Speech recognition & synthesis

Handbook of Research on Recent Developments in Intelligent Communication Application (Hardcover): Siddhartha Bhattacharyya,... Handbook of Research on Recent Developments in Intelligent Communication Application (Hardcover)
Siddhartha Bhattacharyya, Nibaran Das, Debotosh Bhattacharjee, Anirban Mukherjee
R9,426 Discovery Miles 94 260 Ships in 12 - 17 working days

The communication field is evolving rapidly in order to keep up with society's demands. As such, it becomes imperative to research and report recent advancements in computational intelligence as it applies to communication networks. The Handbook of Research on Recent Developments in Intelligent Communication Application is a pivotal reference source for the latest developments on emerging data communication applications. Featuring extensive coverage across a range of relevant perspectives and topics, such as satellite communication, cognitive radio networks, and wireless sensor networks, this book is ideally designed for engineers, professionals, practitioners, upper-level students, and academics seeking current information on emerging communication networking trends.

Visual Speech Recognition - Lip Segmentation and Mapping (Hardcover): Alan Wee-Chung Liew, Shilin Wang Visual Speech Recognition - Lip Segmentation and Mapping (Hardcover)
Alan Wee-Chung Liew, Shilin Wang
R5,946 Discovery Miles 59 460 Ships in 12 - 17 working days

The unique research area of audio-visual speech recognition has attracted much interest in recent years as visual information about lip dynamics has been shown to improve the performance of automatic speech recognition systems, especially in noisy environments.""Visual Speech Recognition: Lip Segmentation and Mapping"" presents an up-to-date account of research done in the areas of lip segmentation, visual speech recognition, and speaker identification and verification. A useful reference for researchers working in this field, this book contains the latest research results from renowned experts with in-depth discussion on topics such as visual speaker authentication, lip modeling, and systematic evaluation of lip features.

New Era for Robust Speech Recognition - Exploiting Deep Learning (Hardcover, 1st ed. 2017): Shinji Watanabe, Marc Delcroix,... New Era for Robust Speech Recognition - Exploiting Deep Learning (Hardcover, 1st ed. 2017)
Shinji Watanabe, Marc Delcroix, Florian Metze, John R. Hershey
R5,475 Discovery Miles 54 750 Ships in 12 - 17 working days

This book covers the state-of-the-art in deep neural-network-based methods for noise robustness in distant speech recognition applications. It provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include descriptions of real-world applications, benchmark tools and datasets widely used in the field. This book is intended for researchers and practitioners working in the field of speech processing and recognition who are interested in the latest deep learning techniques for noise robustness. It will also be of interest to graduate students in electrical engineering or computer science, who will find it a useful guide to this field of research.

Automatic Speech Recognition - A Deep Learning Approach (Hardcover, 2015 ed.): Dong Yu, Li Deng Automatic Speech Recognition - A Deep Learning Approach (Hardcover, 2015 ed.)
Dong Yu, Li Deng
R3,953 Discovery Miles 39 530 Ships in 12 - 17 working days

This book provides a comprehensive overview of the recent advancement in the field of automatic speech recognition with a focus on deep learning models including deep neural networks and many of their variants. This is the first automatic speech recognition book dedicated to the deep learning approach. In addition to the rigorous mathematical treatment of the subject, the book also presents insights and theoretical foundation of a series of highly successful deep learning models.

Self-Learning Speaker Identification - A System for Enhanced Speech Recognition (Hardcover, 2011 ed.): Tobias Herbig, Franz... Self-Learning Speaker Identification - A System for Enhanced Speech Recognition (Hardcover, 2011 ed.)
Tobias Herbig, Franz Gerl, Wolfgang Minker
R2,890 Discovery Miles 28 900 Ships in 10 - 15 working days

Current speech recognition systems are based on speaker independent speech models and suffer from inter-speaker variations in speech signal characteristics. This work develops an integrated approach for speech and speaker recognition in order to gain space for self-learning opportunities of the system. This work introduces a reliable speaker identification which enables the speech recognizer to create robust speaker dependent models In addition, this book gives a new approach to solve the reverse problem, how to improve speech recognition if speakers can be recognized. The speaker identification enables the speaker adaptation to adapt to different speakers which results in an optimal long-term adaptation.

Estimating Spoken Dialog System Quality with User Models (Hardcover, 2013 ed.): Klaus-Peter Engelbrecht Estimating Spoken Dialog System Quality with User Models (Hardcover, 2013 ed.)
Klaus-Peter Engelbrecht
R2,776 Discovery Miles 27 760 Ships in 10 - 15 working days

Spoken dialog systems have the potential to offer highly intuitive user interfaces, as they allow systems to be controlled using natural language. However, the complexity inherent in natural language dialogs means that careful testing of the system must be carried out from the very beginning of the design process. This book examines how user models can be used to support such early evaluations in two ways: by running simulations of dialogs, and by estimating the quality judgments of users. First, a design environment supporting the creation of dialog flows, the simulation of dialogs, and the analysis of the simulated data is proposed. How the quality of user simulations may be quantified with respect to their suitability for both formative and summative evaluation is then discussed. The remainder of the book is dedicated to the problem of predicting quality judgments of users based on interaction data. New modeling approaches are presented, which process the dialogs as sequences, and which allow knowledge about the judgment behavior of users to be incorporated into predictions. All proposed methods are validated with example evaluation studies.

Audio For Authors - Audiobooks, Podcasting, And Voice Technologies (Hardcover, Hardback ed.): Joanna Penn Audio For Authors - Audiobooks, Podcasting, And Voice Technologies (Hardcover, Hardback ed.)
Joanna Penn
R567 Discovery Miles 5 670 Ships in 10 - 15 working days
Digital Speech Processing Using Matlab (Hardcover, 2014 ed.): E.S. Gopi Digital Speech Processing Using Matlab (Hardcover, 2014 ed.)
E.S. Gopi
R3,266 Discovery Miles 32 660 Ships in 12 - 17 working days

Digital Speech Processing Using Matlab deals with digital speech pattern recognition, speech production model, speech feature extraction, and speech compression. The book is written in a manner that is suitable for beginners pursuing basic research in digital speech processing. Matlab illustrations are provided for most topics to enable better understanding of concepts. This book also deals with the basic pattern recognition techniques (illustrated with speech signals using Matlab) such as PCA, LDA, ICA, SVM, HMM, GMM, BPN, and KSOM.

Statistical Pronunciation Modeling for Non-Native Speech Processing (Hardcover, 2011 ed.): Rainer E. Gruhn, Wolfgang Minker,... Statistical Pronunciation Modeling for Non-Native Speech Processing (Hardcover, 2011 ed.)
Rainer E. Gruhn, Wolfgang Minker, Satoshi Nakamura
R2,789 Discovery Miles 27 890 Ships in 10 - 15 working days

In this work, the authors present a fully statistical approach to model non--native speakers' pronunciation. Second-language speakers pronounce words in multiple different ways compared to the native speakers. Those deviations, may it be phoneme substitutions, deletions or insertions, can be modelled automatically with the new method presented here.

The methods is based on a discrete hidden Markov model as a word pronunciation model, initialized on a standard pronunciation dictionary. The implementation and functionality of the methodology has been proven and verified with a test set of non-native English in the regarding accent.

The book is written for researchers with a professional interest in phonetics and automatic speech and speaker recognition.

Proactive Spoken Dialogue Interaction in Multi-Party Environments (Hardcover, 2010 ed.): Petra-Maria Strauss, Wolfgang Minker Proactive Spoken Dialogue Interaction in Multi-Party Environments (Hardcover, 2010 ed.)
Petra-Maria Strauss, Wolfgang Minker
R2,894 Discovery Miles 28 940 Ships in 10 - 15 working days

Proactive Spoken Dialogue Interaction in Multi-Party Environments describes spoken dialogue systems that act as independent dialogue partners in the conversation with and between users. The resulting novel characteristics such as proactiveness and multi-party capabilities pose new challenges on the dialogue management component of such a system and require the use and administration of an extensive dialogue history. In order to assist the proactive spoken dialogue systems development, a comprehensive data collection seems mandatory and may be performed in a Wizard-of-Oz environment. Such an environment builds also the appropriate basis for an extensive usability and acceptance evaluation.

Proactive Spoken Dialogue Interaction in Multi-Party Environments is a useful reference for students and researchers in speech processing.

Speech Spectrum Analysis (Hardcover, 2011 ed.): Sean A. Fulop Speech Spectrum Analysis (Hardcover, 2011 ed.)
Sean A. Fulop
R2,797 Discovery Miles 27 970 Ships in 10 - 15 working days

The accurate determination of the speech spectrum, particularly for short frames, is commonly pursued in diverse areas including speech processing, recognition, and acoustic phonetics. With this book the author makes the subject of spectrum analysis understandable to a wide audience, including those with a solid background in general signal processing and those without such background. In keeping with these goals, this is not a book that replaces or attempts to cover the material found in a general signal processing textbook. Some essential signal processing concepts are presented in the first chapter, but even there the concepts are presented in a generally understandable fashion as far as is possible. Throughout the book, the focus is on applications to speech analysis; mathematical theory is provided for completeness, but these developments are set off in boxes for the benefit of those readers with sufficient background. Other readers may proceed through the main text, where the key results and applications will be presented in general heuristic terms, and illustrated with software routines and practical "show-and-tell" discussions of the results. At some points, the book refers to and uses the implementations in the Praat speech analysis software package, which has the advantages that it is used by many scientists around the world, and it is free and open source software. At other points, special software routines have been developed and made available to complement the book, and these are provided in the Matlab programming language. If the reader has the basic Matlab package, he/she will be able to immediately implement the programs in that platform---no extra "toolboxes" are required.

Speech Processing in Embedded Systems (Hardcover, 2010 ed.): Priyabrata Sinha Speech Processing in Embedded Systems (Hardcover, 2010 ed.)
Priyabrata Sinha
R2,890 Discovery Miles 28 900 Ships in 10 - 15 working days

Speech Processing has rapidly emerged as one of the most widespread and well-understood application areas in the broader discipline of Digital Signal Processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several non-traditional embedded processor applications are enhancing their functionality and user interfaces by utilizing various aspects of speech processing.

"Speech Processing in Embedded Systems" describes several areas of speech processing, and the various algorithms and industry standards that address each of these areas. The topics covered include different types of Speech Compression, Echo Cancellation, Noise Suppression, Speech Recognition and Speech Synthesis. In addition this book explores various issues and considerations related to efficient implementation of these algorithms on real-time embedded systems, including the role played by processor CPU and peripheral functionality.

Speech and Audio Processing for Coding, Enhancement and Recognition (Hardcover, 2015 ed.): Tokunbo Ogunfunmi, Roberto Togneri,... Speech and Audio Processing for Coding, Enhancement and Recognition (Hardcover, 2015 ed.)
Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha
R4,211 R3,490 Discovery Miles 34 900 Save R721 (17%) Ships in 12 - 17 working days

This book describes the basic principles underlying the generation, coding and transmission of speech and audio signals and reveals the latest advances in this area. Waveform coding and parametric coding of speech are described and the fundamental principles behind these methods are delineated. Examples of speech coding standards in use today and their practical implementation are discussed. The principles underlying speech enhancement and speech recognition are also presented, along with the latest recent advances in these areas.

Adaptive Digital Filters (Hardcover, 2013 ed.): Branko Kovacevic, Zoran Banjac, Milan Milosavljevic Adaptive Digital Filters (Hardcover, 2013 ed.)
Branko Kovacevic, Zoran Banjac, Milan Milosavljevic
R4,582 R3,297 Discovery Miles 32 970 Save R1,285 (28%) Ships in 12 - 17 working days

"Adaptive Digital Filters" presents an important discipline applied to the domain of speech processing. The book first makes the reader acquainted with the basic terms of filtering and adaptive filtering, before introducing the field of advanced modern algorithms, some of which are contributed by the authors themselves. Working in the field of adaptive signal processing requires the use of complex mathematical tools. The book offers a detailed presentation of the mathematical models that is clear and consistent, an approach that allows everyone with a college level of mathematics knowledge to successfully follow the mathematical derivations and descriptions of algorithms.
The algorithms are presented in flow charts, which facilitates their practical implementation. The book presents many experimental results and treats the aspects of practical application of adaptive filtering in real systems, making it a valuable resource for both undergraduate and graduate students, and for all others interested in mastering this important field.

RFID Security and Privacy - Concepts, Protocols, and Architectures (Hardcover, Limited): Dirk Henrici RFID Security and Privacy - Concepts, Protocols, and Architectures (Hardcover, Limited)
Dirk Henrici
R4,605 R4,319 Discovery Miles 43 190 Save R286 (6%) Ships in 12 - 17 working days

The vision of a world in which privacy persists and security is ensured but the full potential of the technology is nevertheless tapped guides this work. It is argued that security and privacy can be ensured using technical safeguards if the whole RFID system is designed properly. The challenge is immense since many constraints exist for providing security and privacy in RFID systems: technically and economically but also ethically and socially. Not only security and privacy needs to be provided but the solutions also need to be inexpensive, practical, reliable, scalable, flexible, inter-organizational, and lasting.

After analyzing the problem area in detail, this work introduces a number of new concepts and protocols that provide security and ensure privacy in RFID systems by technical means. The classic RFID model is extended and considerations in new directions are taken. This leads to innovative solutions with advantageous characteristics. Finally, a comprehensive framework including required protocols for operation is proposed. It can be used within a global scope, supports inter-organizational cooperation and data sharing, and adheres to all the architectural guidelines derived in this work. Security and privacy is provided by technical means in an economic manner. Altogether, the goal of building scalable and efficient RFID systems on a global, inter-organizational scale without neglecting security and privacy has been achieved well.

Spoken Dialogue Systems Technology and Design (Hardcover, 2011 ed.): Wolfgang Minker, Gary Geunbae Lee, Satoshi Nakamura,... Spoken Dialogue Systems Technology and Design (Hardcover, 2011 ed.)
Wolfgang Minker, Gary Geunbae Lee, Satoshi Nakamura, Joseph Mariani
R5,600 Discovery Miles 56 000 Ships in 10 - 15 working days

Spoken Dialogue Systems Technology and Design covers key topics in the field of spoken language dialogue interaction from a variety of leading researchers. It brings together several perspectives in the areas of corpus annotation and analysis, dialogue system construction, as well as theoretical perspectives on communicative intention, context-based generation, and modelling of discourse structure. These topics are all part of the general research and development within the area of discourse and dialogue with an emphasis on dialogue systems; corpora and corpus tools and semantic and pragmatic modelling of discourse and dialogue.

Situated Dialog in Speech-Based Human-Computer Interaction (Hardcover, 1st ed. 2016): Alexander Rudnicky, Antoine Raux, Ian... Situated Dialog in Speech-Based Human-Computer Interaction (Hardcover, 1st ed. 2016)
Alexander Rudnicky, Antoine Raux, Ian Lane, Teruhisa Misu
R3,746 R3,306 Discovery Miles 33 060 Save R440 (12%) Ships in 12 - 17 working days

This book provides a survey of the state-of-the-art in the practical implementation of Spoken Dialog Systems for applications in everyday settings. It includes contributions on key topics in situated dialog interaction from a number of leading researchers and offers a broad spectrum of perspectives on research and development in the area. In particular, it presents applications in robotics, knowledge access and communication and covers the following topics: dialog for interacting with robots; language understanding and generation; dialog architectures and modeling; core technologies; and the analysis of human discourse and interaction. The contributions are adapted and expanded contributions from the 2014 International Workshop on Spoken Dialog Systems (IWSDS 2014), where researchers and developers from industry and academia alike met to discuss and compare their implementation experiences, analyses and empirical findings.

Designing Human Interface in Speech Technology (Hardcover, 2006 ed.): Fang Chen Designing Human Interface in Speech Technology (Hardcover, 2006 ed.)
Fang Chen
R3,026 Discovery Miles 30 260 Ships in 10 - 15 working days

Designing Human Interface in Speech Technology bridges a gap between the needs of the technical engineer and cognitive researchers working in the multidisciplinary area of speech technology applications. The approach is systematic and the focus is on the utility of developing and designing speech related products.

Included is coverage of topics such as neuroscience on the multimodal cortex, cognitive theories on multi-task performance, stress and workload, as well as human information process theory and ecological interface design theory for evaluating speech-related human-system interfaces.

Of special emphasis are topics such as spoken dialogue system design, in-vehicle communication system design and speech technology in military applications. Also included are tools on how to analyze the design, different design theories and process, methods about how to understand users. The material systematically describes the user-center design process and usability evaluation methods.

Designing Human Interface in Speech Technology is appropriate for designers, engineers, and decision makers working in the area of speech technology research. It is also a good text book for senior university students and postgraduate students in the respective interaction design areas.

Privacy-Preserving Machine Learning for Speech Processing (Hardcover, 2013 ed.): Manas A. Pathak Privacy-Preserving Machine Learning for Speech Processing (Hardcover, 2013 ed.)
Manas A. Pathak
R3,221 Discovery Miles 32 210 Ships in 12 - 17 working days

This thesis discusses the privacy issues in speech-based applications such as biometric authentication, surveillance, and external speech processing services. Author Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification and speech recognition. The author also introduces some of the tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions. Experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets are also included in the text. Using the framework proposed may now make it possible for a surveillance agency to listen for a known terrorist without being able to hear conversation from non-targeted, innocent civilians."

Speech Enhancement Techniques for Digital Hearing Aids (Hardcover, 1st ed. 2019): Komal R. Borisagar, Rohit M. Thanki, Bhavin... Speech Enhancement Techniques for Digital Hearing Aids (Hardcover, 1st ed. 2019)
Komal R. Borisagar, Rohit M. Thanki, Bhavin S. Sedani
R2,783 Discovery Miles 27 830 Ships in 10 - 15 working days

This book provides various speech enhancement algorithms for digital hearing aids. It covers information on noise signals extracted from silences of speech signal. The description of the algorithm used for this purpose is also provided. Different types of adaptive filters such as Least Mean Squares (LMS), Normalized LMS (NLMS) and Recursive Lease Squares (RLS) are described for noise reduction in the speech signals. Different types of noises are taken to generate noisy speech signals, and therefore information on various noises signals is provided. The comparative performance of various adaptive filters for noise reduction in speech signals is also described. In addition, the book provides a speech enhancement technique using adaptive filtering and necessary frequency strength enhancement using wavelet transform as per the requirement of audiogram for digital hearing aids. Presents speech enhancement techniques for improving performance of digital hearing aids; Covers various types of adaptive filters and their advantages and limitations; Provides a hybrid speech enhancement technique using wavelet transform and adaptive filters.

Dialect Accent Features for Establishing Speaker Identity - A Case Study (Hardcover, 2012): Manisha Kulshreshtha, Ramkumar... Dialect Accent Features for Establishing Speaker Identity - A Case Study (Hardcover, 2012)
Manisha Kulshreshtha, Ramkumar Mathur
R1,469 Discovery Miles 14 690 Ships in 10 - 15 working days

Dialect Accent Features for Establishing Speaker Identity: A Case Study discusses the subject of forensic voice identification and speaker profiling. Specifically focusing on speaker profiling and using dialects of the Hindi language, widely used in India, the authors have contributed to the body of research on speaker identification by using accent feature as the discriminating factor. This case study contributes to the understanding of the speaker identification process in a situation where unknown speech samples are in different language/dialect than the recording of a suspect. The authors' data establishes that vowel quality, quantity, intonation and tone of a speaker as compared to Khariboli (standard Hindi) could be the potential features for identification of dialect accent.

Computer Synthesized Speech Technologies - Tools for Aiding Impairment (Hardcover, New): Computer Synthesized Speech Technologies - Tools for Aiding Impairment (Hardcover, New)
R6,418 Discovery Miles 64 180 Ships in 12 - 17 working days

While the use of technology to compensate for individual shortcomings is nothing new, there has been tremendous progress in the application of technology toward assisting individuals with disabilities, particularly with the use of computer synthesized speech (CSS) to help speech impaired people communicate using voice. Computer Synthesized Speech Technologies: Tools for Aiding Impairment provides information to current and future practitioners that will allow them to better assist speech disabled individuals who wish to utilize CSS technology. Just as important as the practitioner's knowledge of the latest advances in speech technology, so, too, is the practitioner's understanding of how specific client needs affect the use of CSS, how cognitive factors related to comprehension of CSS affect its use, and how social factors related to perceptions of the CSS user affect their interaction with others. This cutting edge book addresses those topics pertinent to understanding the myriad of concerns involved with the implementation of CSS so that CSS technologies may continue to evolve and improve for speech impaired individuals.

Advanced Speech Recognition: Concepts and Case Studies (Hardcover): Marcus Hintz Advanced Speech Recognition: Concepts and Case Studies (Hardcover)
Marcus Hintz
R2,769 Discovery Miles 27 690 Ships in 12 - 17 working days
Human and Automatic Speaker Recognition over Telecommunication Channels (Hardcover, 1st ed. 2016): Laura Fernandez Gallardo Human and Automatic Speaker Recognition over Telecommunication Channels (Hardcover, 1st ed. 2016)
Laura Fernandez Gallardo
R3,248 Discovery Miles 32 480 Ships in 12 - 17 working days

This work addresses the evaluation of the human and the automatic speaker recognition performances under different channel distortions caused by bandwidth limitation, codecs, and electro-acoustic user interfaces, among other impairments. Its main contribution is the demonstration of the benefits of communication channels of extended bandwidth, together with an insight into how speaker-specific characteristics of speech are preserved through different transmissions. It provides sufficient motivation for considering speaker recognition as a criterion for the migration from narrowband to enhanced bandwidths, such as wideband and super-wideband.

Epoch Synchronous Overlap Add (ESOLA) - A Concatenative Synthesis Procedure for Speech (Hardcover, 1st ed. 2018): Asoke Kumar... Epoch Synchronous Overlap Add (ESOLA) - A Concatenative Synthesis Procedure for Speech (Hardcover, 1st ed. 2018)
Asoke Kumar Datta
R1,490 Discovery Miles 14 900 Ships in 10 - 15 working days

This book presents details of a text-to-speech synthesis procedure using epoch synchronous overlap add (ESOLA), and provides a solution for development of a text-to-speech system using minimum data resources compared to existing solutions. It also examines most natural speech signals including random perturbation in synthesis. The book is intended for students, researchers and industrial practitioners in the field of text-to-speech synthesis.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Proactive Spoken Dialogue Interaction in…
Petra-Maria Strauss, Wolfgang Minker Paperback R2,789 Discovery Miles 27 890
Handling Emotions in Human-Computer…
Johannes Pittermann, Angela Pittermann, … Hardcover R2,951 Discovery Miles 29 510
Recent Advances in Nonlinear Speech…
Anna Esposito, Marcos Faundez-Zanuy, … Hardcover R3,827 R3,387 Discovery Miles 33 870
Time Domain Representation of Speech…
Asoke Kumar Datta Hardcover R3,134 R2,976 Discovery Miles 29 760
Novel Techniques for Dialectal Arabic…
Mohamed Elmahdy, Rainer Gruhn, … Hardcover R2,772 Discovery Miles 27 720
Data-Driven Methods for Adaptive Spoken…
Oliver Lemon, Olivier Pietquin Hardcover R2,788 Discovery Miles 27 880
Audio and Speech Processing with MATLAB
Paul Hill Paperback R1,624 Discovery Miles 16 240
Computational Linguistics, Speech And…
Neamat El Gayar, Ching Yee Suen Hardcover R2,702 Discovery Miles 27 020
Fundamentals of Speaker Recognition
Homayoon Beigi Hardcover R3,697 Discovery Miles 36 970
Practical Speech User Interface Design
James R Lewis Hardcover R3,203 Discovery Miles 32 030

 

Partners