
Intelligent Speech Signal Processing (Paperback)
Nilanjan Dey
R2,517 Discovery Miles 25 170 Ships in 10 - 15 working days

Intelligent Speech Signal Processing investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of speech data analysis and management tools across different recording systems. The book emphasizes the multidisciplinary nature of the field, presenting different applications and challenges with extensive studies on the design, development and management of intelligent systems, neural networks and related machine learning techniques for speech signal processing.

New Era for Robust Speech Recognition - Exploiting Deep Learning (Hardcover, 1st ed. 2017)
Shinji Watanabe, Marc Delcroix, Florian Metze, John R. Hershey
R5,505 Discovery Miles 55 050 Ships in 10 - 15 working days

This book covers the state of the art in deep neural-network-based methods for noise robustness in distant speech recognition applications. It provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include descriptions of real-world applications, benchmark tools and datasets widely used in the field. This book is intended for researchers and practitioners working in the field of speech processing and recognition who are interested in the latest deep learning techniques for noise robustness. It will also be of interest to graduate students in electrical engineering or computer science, who will find it a useful guide to this field of research.

Data-Driven Methods for Adaptive Spoken Dialogue Systems - Computational Learning for Conversational Interfaces (Hardcover, 2012 ed.)
Oliver Lemon, Olivier Pietquin
R2,653 Discovery Miles 26 530 Ships in 18 - 22 working days

Data-driven methods have long been used in Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) synthesis and have more recently been introduced for dialogue management, spoken language understanding, and Natural Language Generation. Machine learning is now present "end-to-end" in Spoken Dialogue Systems (SDS). However, these techniques require data collection and annotation campaigns, which can be time-consuming and expensive, as well as dataset expansion by simulation. In this book, we provide an overview of the current state of the field and of recent advances, with a specific focus on adaptivity.

Automatic Speech Recognition - A Deep Learning Approach (Hardcover, 2015 ed.)
Dong Yu, Li Deng
R4,000 Discovery Miles 40 000 Ships in 10 - 15 working days

This book provides a comprehensive overview of recent advances in automatic speech recognition, with a focus on deep learning models, including deep neural networks and many of their variants. It is the first automatic speech recognition book dedicated to the deep learning approach. In addition to a rigorous mathematical treatment of the subject, the book presents insights into, and the theoretical foundations of, a series of highly successful deep learning models.

Digital Speech Processing Using Matlab (Hardcover, 2014 ed.)
E.S. Gopi
R3,294 Discovery Miles 32 940 Ships in 10 - 15 working days

Digital Speech Processing Using Matlab deals with digital speech pattern recognition, speech production model, speech feature extraction, and speech compression. The book is written in a manner that is suitable for beginners pursuing basic research in digital speech processing. Matlab illustrations are provided for most topics to enable better understanding of concepts. This book also deals with the basic pattern recognition techniques (illustrated with speech signals using Matlab) such as PCA, LDA, ICA, SVM, HMM, GMM, BPN, and KSOM.

Adaptive Digital Filters (Hardcover, 2013 ed.)
Branko Kovacevic, Zoran Banjac, Milan Milosavljevic
R4,395 R3,324 Discovery Miles 33 240 Save R1,071 (24%) Ships in 10 - 15 working days

"Adaptive Digital Filters" presents an important discipline applied to the domain of speech processing. The book first makes the reader acquainted with the basic terms of filtering and adaptive filtering, before introducing the field of advanced modern algorithms, some of which are contributed by the authors themselves. Working in the field of adaptive signal processing requires the use of complex mathematical tools. The book offers a detailed presentation of the mathematical models that is clear and consistent, an approach that allows everyone with a college level of mathematics knowledge to successfully follow the mathematical derivations and descriptions of algorithms.
The algorithms are presented in flow charts, which facilitates their practical implementation. The book presents many experimental results and treats the aspects of practical application of adaptive filtering in real systems, making it a valuable resource for both undergraduate and graduate students, and for all others interested in mastering this important field.

RFID Security and Privacy - Concepts, Protocols, and Architectures (Hardcover, Limited)
Dirk Henrici
R4,159 Discovery Miles 41 590 Ships in 18 - 22 working days

The vision of a world in which privacy persists and security is ensured but the full potential of the technology is nevertheless tapped guides this work. It is argued that security and privacy can be ensured using technical safeguards if the whole RFID system is designed properly. The challenge is immense since many constraints exist for providing security and privacy in RFID systems: technical and economic, but also ethical and social. Not only do security and privacy need to be provided, but the solutions also need to be inexpensive, practical, reliable, scalable, flexible, inter-organizational, and lasting.

After analyzing the problem area in detail, this work introduces a number of new concepts and protocols that provide security and ensure privacy in RFID systems by technical means. The classic RFID model is extended and new directions are considered. This leads to innovative solutions with advantageous characteristics. Finally, a comprehensive framework including required protocols for operation is proposed. It can be used within a global scope, supports inter-organizational cooperation and data sharing, and adheres to all the architectural guidelines derived in this work. Security and privacy are provided by technical means in an economic manner. Altogether, the goal of building scalable and efficient RFID systems on a global, inter-organizational scale without neglecting security and privacy has been achieved.

Audio and Speech Processing with MATLAB (Paperback)
Paul Hill
R1,737 Discovery Miles 17 370 Ships in 10 - 15 working days

Speech and audio processing has undergone a revolution in recent decades that has accelerated in the last few years, generating game-changing technologies such as truly successful speech recognition systems, a goal that had remained out of reach until very recently. This book gives the reader a comprehensive overview of such contemporary speech and audio processing techniques, with an emphasis on practical implementations and illustrations using MATLAB code. Core concepts are covered first, giving an introduction to the physics of audio and vibration together with their representations using complex numbers, Z transforms and frequency analysis transforms such as the FFT. Later chapters describe the human auditory system and the fundamentals of psychoacoustics. Insights, results, and analyses given in these chapters are subsequently used as the basis for understanding the middle section of the book, which covers wideband audio compression (MP3 audio etc.), speech recognition and speech coding. The final chapter covers musical synthesis and applications, describing methods such as AM, FM and ring modulation techniques and giving MATLAB examples of them. This chapter gives a final example of the use of time-frequency modification to implement a so-called phase vocoder for time stretching (in MATLAB).

Features:
  • A comprehensive overview of contemporary speech and audio processing techniques, from perceptual and physical acoustic models to a thorough background in relevant digital signal processing techniques, together with an exploration of speech and audio applications.
  • A carefully paced progression of complexity of the described methods, building, in many cases, from first principles.
  • Speech and wideband audio coding together with a description of associated standardised codecs (e.g. MP3, AAC and GSM).
  • Speech recognition: feature extraction (e.g. MFCC features), Hidden Markov Models (HMMs) and deep learning techniques such as Long Short-Term Memory (LSTM) methods.
  • Book and computer-based problems at the end of each chapter.
  • Numerous real-world examples backed up by many MATLAB functions and code.

Computational Linguistics, Speech And Image Processing For Arabic Language (Hardcover)
Neamat El Gayar, Ching Yee Suen
R2,571 Discovery Miles 25 710 Ships in 18 - 22 working days

This book encompasses a collection of topics covering recent advances that are important to the Arabic language in the areas of natural language processing, speech and image analysis. It presents state-of-the-art reviews and fundamentals as well as applications and recent innovations. The book chapters by top researchers present basic concepts and challenges for the Arabic language in linguistic processing, handwriting recognition, document analysis, text classification and speech processing. In addition, the book reports on selected applications in sentiment analysis, annotation, text summarization, speech and font analysis, word recognition and spotting, and question answering. Moreover, it highlights and introduces some novel applications in vital areas for the Arabic language. The book is therefore a useful resource for young researchers who are interested in the Arabic language and are still developing their fundamentals and skills in this area. It is also of interest to scientists who wish to keep track of the most recent research directions and advances in this area.

Fundamentals of Speaker Recognition (Hardcover, 2011)
Homayoon Beigi
R3,738 Discovery Miles 37 380 Ships in 10 - 15 working days

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation.

"Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System.

Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists.


Practical Speech User Interface Design (Hardcover, New)
James R Lewis
R3,385 Discovery Miles 33 850 Ships in 10 - 15 working days

Although speech is the most natural form of communication between humans, most people find using speech to communicate with machines anything but natural. Drawing from psychology, human-computer interaction, linguistics, and communication theory, Practical Speech User Interface Design provides a comprehensive yet concise survey of practical speech user interface (SUI) design. It offers practice-based and research-based guidance on how to design effective, efficient, and pleasant speech applications that people can really use. Focusing on the design of speech user interfaces for IVR applications, the book covers speech technologies including speech recognition and production, ten key concepts in human language and communication, and a survey of self-service technologies. The author, a leading human factors engineer with extensive experience in research, innovation and design of products with speech interfaces that are used worldwide, covers both high- and low-level decisions and includes VoiceXML code examples. To help articulate the rationale behind various SUI design guidelines, he includes a number of detailed discussions of the applicable research. The techniques for designing usable SUIs are not obvious and, to be effective, must be informed by a combination of critically interpreted scientific research and leading design practices. The blend of scholarship and practical experience found in this book establishes research-based leading practices for the design of usable speech user interfaces for interactive voice response applications.

Towards Adaptive Spoken Dialog Systems (Hardcover, 2013 ed.)
Alexander Schmitt, Wolfgang Minker
R4,168 R3,367 Discovery Miles 33 670 Save R801 (19%) Ships in 10 - 15 working days

In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the foundations of machine learning and pattern recognition, this monograph examines how frequently users show negative emotions in spoken dialog systems and develops novel approaches to speech-based emotion recognition using a hybrid approach to model emotions. The authors make use of statistical methods based on acoustic, linguistic and contextual features to examine the relationship between the interaction flow and the occurrence of emotions, using non-acted recordings of several thousand real users from commercial and non-commercial SDS. Additionally, the authors present novel statistical methods that spot problems within a dialog based on interaction patterns. The approaches enable future SDS to offer more natural and robust interactions. This work provides insights, lessons and inspiration for future research and development, not only for spoken dialog systems, but for data-driven approaches to human-machine interaction in general.

Studies on Speech Production - 11th International Seminar, ISSP 2017, Tianjin, China, October 16-19, 2017, Revised Selected Papers (Paperback, 1st ed. 2018)
Qiang Fang, Jianwu Dang, Pascal Perrier, Jianguo Wei, Longbiao Wang, …
R1,408 Discovery Miles 14 080 Ships in 18 - 22 working days

This book constitutes the refereed post-conference proceedings of the 11th International Seminar on Speech Production, ISSP 2017, held in Tianjin, China, in October 2017. The 20 revised full papers included in this volume were carefully reviewed and selected from 68 submissions. They cover a wide range of speech science fields including phonology, phonetics, prosody, mechanics, acoustics, physiology, motor control, neuroscience, computer science and human interaction. The papers are organized in the following topical sections: emotional speech analysis and recognition; articulatory speech synthesis; speech acquisition; phonetics; speech planning and comprehension; and speech disorder.

Automatic Speech Recognition of Arabic Phonemes with Neural Networks - A Contrastive Study of Arabic and English (Paperback, 1st ed. 2019)
Mohammed Dib
R1,372 Discovery Miles 13 720 Ships in 18 - 22 working days

This book presents a contrastive linguistics study of Arabic and English for the dual purposes of improved language teaching and speech processing of Arabic via spectral analysis and neural networks. Contrastive linguistics is a field of linguistics which aims to compare the linguistic systems of two or more languages in order to ease the tasks of teaching, learning, and translation. The main focus of the present study is to treat the Arabic minimal syllable automatically to facilitate automatic speech processing in Arabic. It represents important reading for language learners and for linguists with an interest in Arabic and computational approaches.

Statistical Language and Speech Processing - 6th International Conference, SLSP 2018, Mons, Belgium, October 15-16, 2018, Proceedings (Paperback, 1st ed. 2018)
Thierry Dutoit, Carlos Martin-Vide, Gueorgui Pironkov
R1,408 Discovery Miles 14 080 Ships in 18 - 22 working days

This book constitutes the proceedings of the 6th International Conference on Statistical Language and Speech Processing, SLSP 2018, held in Mons, Belgium, in October 2018. The 15 full papers presented in this volume were carefully reviewed and selected from 40 submissions. They were organized in topical sections named: speech synthesis and spoken language generation; speech recognition and post-processing; natural language processing and understanding; and text processing and analysis.

Natural Language Processing and Computational Linguistics 2: Semantics, Discourse and Applications (Hardcover, Volume 2)
MZ Kurdi
R3,773 Discovery Miles 37 730 Ships in 18 - 22 working days

Natural Language Processing (NLP) is a scientific discipline found at the intersection of fields such as Artificial Intelligence, Linguistics, and Cognitive Psychology. This book presents in four chapters the state of the art and fundamental concepts of key NLP areas. The first chapter presents the fundamental concepts in lexical semantics, lexical databases, knowledge representation paradigms, and ontologies. The second chapter is about combinatorial and formal semantics. Discourse and text representation, automatic discourse segmentation and interpretation, and anaphora resolution are the subject of the third chapter. Finally, in the fourth chapter, I will cover some aspects of large-scale applications of NLP, such as software architecture and its relation to cognitive models of NLP, as well as the evaluation paradigms of NLP software. Furthermore, I will present in this chapter the main NLP applications such as Machine Translation (MT), Information Retrieval (IR), as well as Big Data and Information Extraction such as event extraction, sentiment analysis and opinion mining.

Fundamentals of Speech Enhancement (Paperback, 1st ed. 2018)
Jacob Benesty
R1,408 Discovery Miles 14 080 Ships in 18 - 22 working days

This book presents and develops several important concepts of speech enhancement in a simple but rigorous way. Many of the ideas are new; not only do they shed light on this old problem but they also offer valuable tips on how to improve on some well-known conventional approaches. The book unifies all aspects of speech enhancement, from single channel, multichannel, beamforming, time domain, frequency domain and time-frequency domain, to binaural in a clear and flexible framework. It starts with an exhaustive discussion on the fundamental best (linear and nonlinear) estimators, showing how they are connected to various important measures such as the coefficient of determination, the correlation coefficient, the conditional correlation coefficient, and the signal-to-noise ratio (SNR). It then goes on to show how to exploit these measures in order to derive all kinds of noise reduction algorithms that can offer an accurate and versatile compromise between noise reduction and speech distortion.

Designing Voice User Interfaces (Paperback)
Cathy Pearl
R1,019 R695 Discovery Miles 6 950 Save R324 (32%) Ships in 10 - 15 working days

Voice user interfaces (VUIs) are becoming all the rage today. But how do you build one that people can actually converse with? Whether you're designing a mobile app, a toy, or a device such as a home assistant, this practical book guides you through basic VUI design principles, helps you choose the right speech recognition engine, and shows you how to measure your VUI's performance and improve upon it. Author Cathy Pearl also takes product managers, UX designers, and VUI designers into advanced design topics that will help make your VUI not just functional, but great.
  • Understand key VUI design concepts, including command-and-control and conversational systems
  • Decide if you should use an avatar or other visual representation with your VUI
  • Explore speech recognition technology and its impact on your design
  • Take your VUI above and beyond the basic exchange of information
  • Learn practical ways to test your VUI application with users
  • Monitor your app and learn how to quickly improve performance
  • Get real-world examples of VUIs for home assistants, smartwatches, and car systems

Speech and Language Processing for Human-Machine Communications - Proceedings of CSI 2015 (Paperback, 1st ed. 2018)
S.S. Agrawal, Amita Devi, Ritika Wason, Poonam Bansal
R3,219 Discovery Miles 32 190 Ships in 18 - 22 working days

This volume comprises the select proceedings of the annual convention of the Computer Society of India. Divided into 10 topical volumes, the proceedings present papers on state-of-the-art research, surveys, and succinct reviews. The volumes cover diverse topics ranging from communications networks to big data analytics, and from system architecture to cyber security. This volume focuses on Speech and Language Processing for Human-Machine Communications. The contents of this book will be useful to researchers and students alike.

Proactive Spoken Dialogue Interaction in Multi-Party Environments (Paperback, 2010 ed.)
Petra-Maria Strauss, Wolfgang Minker
R2,653 Discovery Miles 26 530 Ships in 18 - 22 working days

Proactive Spoken Dialogue Interaction in Multi-Party Environments describes spoken dialogue systems that act as independent dialogue partners in the conversation with and between users. The resulting novel characteristics, such as proactiveness and multi-party capabilities, pose new challenges for the dialogue management component of such a system and require the use and administration of an extensive dialogue history. To assist the development of proactive spoken dialogue systems, comprehensive data collection seems mandatory and may be performed in a Wizard-of-Oz environment. Such an environment also provides an appropriate basis for an extensive usability and acceptance evaluation. Proactive Spoken Dialogue Interaction in Multi-Party Environments is a useful reference for students and researchers in speech processing.

Estimating Spoken Dialog System Quality with User Models (Paperback, 2013 ed.)
Klaus-Peter Engelbrecht
R3,439 Discovery Miles 34 390 Ships in 18 - 22 working days

Spoken dialog systems have the potential to offer highly intuitive user interfaces, as they allow systems to be controlled using natural language. However, the complexity inherent in natural language dialogs means that careful testing of the system must be carried out from the very beginning of the design process. This book examines how user models can be used to support such early evaluations in two ways: by running simulations of dialogs, and by estimating the quality judgments of users. First, a design environment supporting the creation of dialog flows, the simulation of dialogs, and the analysis of the simulated data is proposed. How the quality of user simulations may be quantified with respect to their suitability for both formative and summative evaluation is then discussed. The remainder of the book is dedicated to the problem of predicting quality judgments of users based on interaction data. New modeling approaches are presented, which process the dialogs as sequences, and which allow knowledge about the judgment behavior of users to be incorporated into predictions. All proposed methods are validated with example evaluation studies.

Towards Adaptive Spoken Dialog Systems (Paperback, 2013 ed.)
Alexander Schmitt, Wolfgang Minker
R3,314 Discovery Miles 33 140 Ships in 18 - 22 working days

In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the foundations of machine learning and pattern recognition, this monograph examines how frequently users show negative emotions in spoken dialog systems and develops novel approaches to speech-based emotion recognition using a hybrid approach to model emotions. The authors make use of statistical methods based on acoustic, linguistic and contextual features to examine the relationship between the interaction flow and the occurrence of emotions, using non-acted recordings of several thousand real users from commercial and non-commercial SDS. Additionally, the authors present novel statistical methods that spot problems within a dialog based on interaction patterns. The approaches enable future SDS to offer more natural and robust interactions. This work provides insights, lessons and inspiration for future research and development, not only for spoken dialog systems, but for data-driven approaches to human-machine interaction in general.

Speech Spectrum Analysis (Paperback, 2011 ed.)
Sean A. Fulop
R2,653 Discovery Miles 26 530 Ships in 18 - 22 working days

The accurate determination of the speech spectrum, particularly for short frames, is commonly pursued in diverse areas including speech processing, recognition, and acoustic phonetics. With this book the author makes the subject of spectrum analysis understandable to a wide audience, including those with a solid background in general signal processing and those without such background. In keeping with these goals, this is not a book that replaces or attempts to cover the material found in a general signal processing textbook. Some essential signal processing concepts are presented in the first chapter, but even there the concepts are presented in a generally understandable fashion as far as is possible. Throughout the book, the focus is on applications to speech analysis; mathematical theory is provided for completeness, but these developments are set off in boxes for the benefit of those readers with sufficient background. Other readers may proceed through the main text, where the key results and applications will be presented in general heuristic terms, and illustrated with software routines and practical "show-and-tell" discussions of the results. At some points, the book refers to and uses the implementations in the Praat speech analysis software package, which has the advantages that it is used by many scientists around the world, and it is free and open source software. At other points, special software routines have been developed and made available to complement the book, and these are provided in the Matlab programming language. If the reader has the basic Matlab package, he/she will be able to immediately implement the programs on that platform; no extra "toolboxes" are required.

Statistical Pronunciation Modeling for Non-Native Speech Processing (Paperback, 2011 ed.)
Rainer E. Gruhn, Wolfgang Minker, Satoshi Nakamura
R2,653 Discovery Miles 26 530 Ships in 18 - 22 working days

In this work, the authors present a fully statistical approach to modelling non-native speakers' pronunciation. Second-language speakers pronounce words in multiple different ways compared to native speakers. Those deviations, be they phoneme substitutions, deletions or insertions, can be modelled automatically with the new method presented here.

The method is based on a discrete hidden Markov model as a word pronunciation model, initialized on a standard pronunciation dictionary. The implementation and functionality of the methodology have been verified with a test set of non-native English in the accent in question.

The book is written for researchers with a professional interest in phonetics and automatic speech and speaker recognition.

Emotion Recognition using Speech Features (Paperback, 2013 ed.)
K. Sreenivasa Rao, Shashidhar G. Koolagudi
R1,719 Discovery Miles 17 190 Ships in 18 - 22 working days

"Emotion Recognition Using Speech Features" provides coverage of emotion-specific features present in speech. The author also discusses suitable models for capturing emotion-specific information for distinguishing different emotions. The content of this book is important for designing and developing natural and sophisticated speech systems. In this Brief, Drs. Rao and Koolagudi lead a discussion of how emotion-specific information is embedded in speech and how to acquire emotion-specific knowledge using appropriate statistical models. Additionally, the authors provide information about exploiting multiple evidences derived from various features and models. The acquired emotion-specific knowledge is useful for synthesizing emotions. Features includes discussion of: * Global and local prosodic features at syllable, word and phrase levels, helpful for capturing emotion-discriminative information; * Exploiting complementary evidences obtained from excitation sources, vocal tract systems and prosodic features in order to enhance the emotion recognition performance; * Proposed multi-stage and hybrid models for improving the emotion recognition performance. This brief is for researchers working in areas related to speech-based products such as mobile phone manufacturing companies, automobile companies, and entertainment products as well as researchers involved in basic and applied speech processing research.
