0
Your cart

Your cart is empty

Browse All Departments
Price
  • R250 - R500 (2)
  • R500+ (91)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Audio processing > Speech recognition & synthesis

Handling Emotions in Human-Computer Dialogues (Hardcover, 2010 ed.): Johannes Pittermann, Angela Pittermann, Wolfgang Minker Handling Emotions in Human-Computer Dialogues (Hardcover, 2010 ed.)
Johannes Pittermann, Angela Pittermann, Wolfgang Minker
R3,036 Discovery Miles 30 360 Ships in 10 - 15 working days

In this book, a novel approach that combines speech-based emotion recognition with adaptive human-computer dialogue modeling is described. With the robust recognition of emotions from speech signals as their goal, the authors analyze the effectiveness of using a plain emotion recognizer, a speech-emotion recognizer combining speech and emotion recognition, and multiple speech-emotion recognizers at the same time. The semi-stochastic dialogue model employed relates user emotion management to the corresponding dialogue interaction history and allows the device to adapt itself to the context, including altering the stylistic realization of its speech. This comprehensive volume begins by introducing spoken language dialogue systems and providing an overview of human emotions, theories, categorization and emotional speech. It moves on to cover the adaptive semi-stochastic dialogue model and the basic concepts of speech-emotion recognition. Finally, the authors show how speech-emotion recognizers can be optimized, and how an adaptive dialogue manager can be implemented. The book, with its novel methods to perform robust speech-based emotion recognition at low complexity, will be of interest to a variety of readers involved in human-computer interaction.

Recent Advances in Nonlinear Speech Processing (Hardcover, 1st ed. 2016): Anna Esposito, Marcos Faundez-Zanuy, Antonietta M.... Recent Advances in Nonlinear Speech Processing (Hardcover, 1st ed. 2016)
Anna Esposito, Marcos Faundez-Zanuy, Antonietta M. Esposito, Gennaro Cordasco, Thomas Drugman, …
R3,905 R3,623 Discovery Miles 36 230 Save R282 (7%) Ships in 12 - 19 working days

This book presents recent advances in nonlinear speech processing beyond nonlinear techniques. It shows that it exploits heuristic and psychological models of human interaction in order to succeed in the implementations of socially believable VUIs and applications for human health and psychological support. The book takes into account the multifunctional role of speech and what is "outside of the box" (see Bjoern Schuller's foreword). To this aim, the book is organized in 6 sections, each collecting a small number of short chapters reporting advances "inside" and "outside" themes related to nonlinear speech research. The themes emphasize theoretical and practical issues for modelling socially believable speech interfaces, ranging from efforts to capture the nature of sound changes in linguistic contexts and the timing nature of speech; labors to identify and detect speech features that help in the diagnosis of psychological and neuronal disease, attempts to improve the effectiveness and performance of Voice User Interfaces, new front-end algorithms for the coding/decoding of effective and computationally efficient acoustic and linguistic speech representations, as well as investigations capturing the social nature of speech in signaling personality traits, emotions and improving human machine interactions.

Time Domain Representation of Speech Sounds - A Case Study in Bangla (Hardcover, 1st ed. 2018): Asoke Kumar Datta Time Domain Representation of Speech Sounds - A Case Study in Bangla (Hardcover, 1st ed. 2018)
Asoke Kumar Datta
R2,873 Discovery Miles 28 730 Ships in 10 - 15 working days

The book presents the history of time-domain representation and the extent of its development along with that of spectral domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments related to both automatic speech recognition (ASR) and text to speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis, as an alternative to the well-known spectral representation. The book also includes a new cohort study on the use of lexical knowledge in ASR. India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal domain parameters.

Novel Techniques for Dialectal Arabic Speech Recognition (Hardcover, 2012): Mohamed Elmahdy, Rainer Gruhn, Wolfgang Minker Novel Techniques for Dialectal Arabic Speech Recognition (Hardcover, 2012)
Mohamed Elmahdy, Rainer Gruhn, Wolfgang Minker
R2,857 Discovery Miles 28 570 Ships in 10 - 15 working days

Novel Techniques for Dialectal Arabic Speech describes approaches to improve automatic speech recognition for dialectal Arabic. Since speech resources for dialectal Arabic speech recognition are very sparse, the authors describe how existing Modern Standard Arabic (MSA) speech data can be applied to dialectal Arabic speech recognition, while assuming that MSA is always a second language for all Arabic speakers. In this book, Egyptian Colloquial Arabic (ECA) has been chosen as a typical Arabic dialect. ECA is the first ranked Arabic dialect in terms of number of speakers, and a high quality ECA speech corpus with accurate phonetic transcription has been collected. MSA acoustic models were trained using news broadcast speech. In order to cross-lingually use MSA in dialectal Arabic speech recognition, the authors have normalized the phoneme sets for MSA and ECA. After this normalization, they have applied state-of-the-art acoustic model adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A-Posteriori (MAP) to adapt existing phonemic MSA acoustic models with a small amount of dialectal ECA speech data. Speech recognition results indicate a significant increase in recognition accuracy compared to a baseline model trained with only ECA data.

Intelligent Speech Signal Processing (Paperback): Nilanjan Dey Intelligent Speech Signal Processing (Paperback)
Nilanjan Dey
R2,672 Discovery Miles 26 720 Ships in 12 - 19 working days

Intelligent Speech Signal Processing investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of speech data analysis and management tools across different recording systems. The book emphasizes the multidisciplinary nature of the field, presenting different applications and challenges with extensive studies on the design, development and management of intelligent systems, neural networks and related machine learning techniques for speech signal processing.

Data-Driven Methods for Adaptive Spoken Dialogue Systems - Computational Learning for Conversational Interfaces (Hardcover,... Data-Driven Methods for Adaptive Spoken Dialogue Systems - Computational Learning for Conversational Interfaces (Hardcover, 2012 ed.)
Oliver Lemon, Olivier Pietquin
R2,873 Discovery Miles 28 730 Ships in 10 - 15 working days

Data driven methods have long been used in Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) synthesis and have more recently been introduced for dialogue management, spoken language understanding, and Natural Language Generation. Machine learning is now present "end-to-end" in Spoken Dialogue Systems (SDS). However, these techniques require data collection and annotation campaigns, which can be time-consuming and expensive, as well as dataset expansion by simulation. In this book, we provide an overview of the current state of the field and of recent advances, with a specific focus on adaptivity.

Introduction to EEG- and Speech-Based Emotion Recognition (Paperback): Priyanka A. Abhang, Bharti Gawali, Suresh C. Mehrotra Introduction to EEG- and Speech-Based Emotion Recognition (Paperback)
Priyanka A. Abhang, Bharti Gawali, Suresh C. Mehrotra
R2,048 Discovery Miles 20 480 Ships in 12 - 19 working days

Introduction to EEG- and Speech-Based Emotion Recognition Methods examines the background, methods, and utility of using electroencephalograms (EEGs) to detect and recognize different emotions. By incorporating these methods in brain-computer interface (BCI), we can achieve more natural, efficient communication between humans and computers. This book discusses how emotional states can be recognized in EEG images, and how this is useful for BCI applications. EEG and speech processing methods are explored, as are the technological basics of how to operate and record EEGs. Finally, the authors include information on EEG-based emotion recognition, classification, and a proposed EEG/speech fusion method for how to most accurately detect emotional states in EEG recordings.

Audio and Speech Processing with MATLAB (Paperback): Paul Hill Audio and Speech Processing with MATLAB (Paperback)
Paul Hill
R1,843 Discovery Miles 18 430 Ships in 12 - 19 working days

Speech and audio processing has undergone a revolution in preceding decades that has accelerated in the last few years generating game-changing technologies such as truly successful speech recognition systems; a goal that had remained out of reach until very recently. This book gives the reader a comprehensive overview of such contemporary speech and audio processing techniques with an emphasis on practical implementations and illustrations using MATLAB code. Core concepts are firstly covered giving an introduction to the physics of audio and vibration together with their representations using complex numbers, Z transforms and frequency analysis transforms such as the FFT. Later chapters give a description of the human auditory system and the fundamentals of psychoacoustics. Insights, results, and analyses given in these chapters are subsequently used as the basis of understanding of the middle section of the book covering: wideband audio compression (MP3 audio etc.), speech recognition and speech coding. The final chapter covers musical synthesis and applications describing methods such as (and giving MATLAB examples of) AM, FM and ring modulation techniques. This chapter gives a final example of the use of time-frequency modification to implement a so-called phase vocoder for time stretching (in MATLAB). Features A comprehensive overview of contemporary speech and audio processing techniques from perceptual and physical acoustic models to a thorough background in relevant digital signal processing techniques together with an exploration of speech and audio applications. A carefully paced progression of complexity of the described methods; building, in many cases, from first principles. Speech and wideband audio coding together with a description of associated standardised codecs (e.g. MP3, AAC and GSM). Speech recognition: Feature extraction (e.g. MFCC features), Hidden Markov Models (HMMs) and deep learning techniques such as Long Short-Time Memory (LSTM) methods. Book and computer-based problems at the end of each chapter. Contains numerous real-world examples backed up by many MATLAB functions and code.

Computational Linguistics, Speech And Image Processing For Arabic Language (Hardcover): Neamat El Gayar, Ching Yee Suen Computational Linguistics, Speech And Image Processing For Arabic Language (Hardcover)
Neamat El Gayar, Ching Yee Suen
R2,784 Discovery Miles 27 840 Ships in 10 - 15 working days

This book encompasses a collection of topics covering recent advances that are important to the Arabic language in areas of natural language processing, speech and image analysis. This book presents state-of-the-art reviews and fundamentals as well as applications and recent innovations.The book chapters by top researchers present basic concepts and challenges for the Arabic language in linguistic processing, handwritten recognition, document analysis, text classification and speech processing. In addition, it reports on selected applications in sentiment analysis, annotation, text summarization, speech and font analysis, word recognition and spotting and question answering.Moreover, it highlights and introduces some novel applications in vital areas for the Arabic language. The book is therefore a useful resource for young researchers who are interested in the Arabic language and are still developing their fundamentals and skills in this area. It is also interesting for scientists who wish to keep track of the most recent research directions and advances in this area.

Fundamentals of Speaker Recognition (Hardcover, 2011): Homayoon Beigi Fundamentals of Speaker Recognition (Hardcover, 2011)
Homayoon Beigi
R3,972 Discovery Miles 39 720 Ships in 12 - 19 working days

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation.

"Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System.

Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists.

Please click on the link under "Additional Information" to view supplemental information including the Table of Contents and Index.

Practical Speech User Interface Design (Hardcover, New): James R Lewis Practical Speech User Interface Design (Hardcover, New)
James R Lewis
R3,595 Discovery Miles 35 950 Ships in 12 - 19 working days

Although speech is the most natural form of communication between humans, most people find using speech to communicate with machines anything but natural. Drawing from psychology, human-computer interaction, linguistics, and communication theory, Practical Speech User Interface Design provides a comprehensive yet concise survey of practical speech user interface (SUI) design. It offers practice-based and research-based guidance on how to design effective, efficient, and pleasant speech applications that people can really use. Focusing on the design of speech user interfaces for IVR applications, the book covers speech technologies including speech recognition and production, ten key concepts in human language and communication, and a survey of self-service technologies. The author, a leading human factors engineer with extensive experience in research, innovation and design of products with speech interfaces that are used worldwide, covers both high- and low-level decisions and includes Voice XML code examples. To help articulate the rationale behind various SUI design guidelines, he includes a number of detailed discussions of the applicable research. The techniques for designing usable SUIs are not obvious, and to be effective, must be informed by a combination of critically interpreted scientific research and leading design practices. The blend of scholarship and practical experience found in this book establishes research-based leading practices for the design of usable speech user interfaces for interactive voice response applications.

Towards Adaptive Spoken Dialog Systems (Hardcover, 2013 ed.): Alexander Schmitt, Wolfgang Minker Towards Adaptive Spoken Dialog Systems (Hardcover, 2013 ed.)
Alexander Schmitt, Wolfgang Minker
R4,434 R3,577 Discovery Miles 35 770 Save R857 (19%) Ships in 12 - 19 working days

In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the foundations of machine learning and pattern recognition, this monograph examines how frequently users show negative emotions in spoken dialog systems and develop novel approaches to speech-based emotion recognition using hybrid approach to model emotions. The authors make use of statistical methods based on acoustic, linguistic and contextual features to examine the relationship between the interaction flow and the occurrence of emotions using non-acted recordings several thousand real users from commercial and non-commercial SDS. Additionally, the authors present novel statistical methods that spot problems within a dialog based on interaction patterns. The approaches enable future SDS to offer more natural and robust interactions. This work provides insights, lessons and inspiration for future research and development, not only for spoken dialog systems, but for data-driven approaches to human-machine interaction in general.

Mastering Voice Interfaces - Creating Great Voice Apps for Real Users (Paperback, 1st ed.): Ann Thyme-Gobbel, Charles Jankowski Mastering Voice Interfaces - Creating Great Voice Apps for Real Users (Paperback, 1st ed.)
Ann Thyme-Gobbel, Charles Jankowski
R1,795 R1,492 Discovery Miles 14 920 Save R303 (17%) Ships in 10 - 15 working days

Build great voice apps of any complexity for any domain by learning both the how's and why's of voice development. In this book you'll see how we live in a golden age of voice technology and how advances in automatic speech recognition (ASR), natural language processing (NLP), and related technologies allow people to talk to machines and get reasonable responses. Today, anyone with computer access can build a working voice app. That democratization of the technology is great. But, while it's fairly easy to build a voice app that runs, it's still remarkably difficult to build a great one, one that users trust, that understands their natural ways of speaking and fulfills their needs, and that makes them want to return for more. We start with an overview of how humans and machines produce and process conversational speech, explaining how they differ from each other and from other modalities. This is the background you need to understand the consequences of each design and implementation choice as we dive into the core principles of voice interface design. We walk you through many design and development techniques, including ones that some view as advanced, but that you can implement today. We use the Google development platform and Python, but our goal is to explain the reasons behind each technique such that you can take what you learn and implement it on any platform. Readers of Mastering Voice Interfaces will come away with a solid understanding of what makes voice interfaces special, learn the core voice design principles for building great voice apps, and how to actually implement those principles to create robust apps. We've learned during many years in the voice industry that the most successful solutions are created by those who understand both the human and the technology sides of speech, and that both sides affect design and development. Because we focus on developing task-oriented voice apps for real users in the real world, you'll learn how to take your voice apps from idea through scoping, design, development, rollout, and post-deployment performance improvements, all illustrated with examples from our own voice industry experiences. What You Will Learn Create truly great voice apps that users will love and trust See how voice differs from other input and output modalities, and why that matters Discover best practices for designing conversational voice-first applications, and the consequences of design and implementation choices Implement advanced voice designs, with real-world examples you can use immediately. Verify that your app is performing well, and what to change if it doesn't Who This Book Is For Anyone curious about the real how's and why's of voice interface design and development. In particular, it's aimed at teams of developers, designers, and product owners who need a shared understanding of how to create successful voice interfaces using today's technology. We expect readers to have had some exposure to voice apps, at least as users.

Studies on Speech Production - 11th International Seminar, ISSP 2017, Tianjin, China, October 16-19, 2017, Revised Selected... Studies on Speech Production - 11th International Seminar, ISSP 2017, Tianjin, China, October 16-19, 2017, Revised Selected Papers (Paperback, 1st ed. 2018)
Qiang Fang, Jianwu Dang, Pascal Perrier, Jianguo Wei, Longbiao Wang, …
R1,521 Discovery Miles 15 210 Ships in 10 - 15 working days

This book constitutes the refereed post-conference proceedings of the 11th International Seminar on Speech Production, ISSP 2017, held in Tianjin, China, In October 2017. The 20 revised full papers included in this volume were carefully reviewed and selected from 68 submissions. They cover a wide range of speech science fields including phonology, phonetics, prosody, mechanics, acoustics, physiology, motor control, neuroscience, computer science and human interaction. The papers are organized in the following topical sections: emotional speech analysis and recognition; articulatory speech synthesis; speech acquisition; phonetics; speech planning and comprehension, and speech disorder.

Statistical Language and Speech Processing - 6th International Conference, SLSP 2018, Mons, Belgium, October 15-16, 2018,... Statistical Language and Speech Processing - 6th International Conference, SLSP 2018, Mons, Belgium, October 15-16, 2018, Proceedings (Paperback, 1st ed. 2018)
Thierry Dutoit, Carlos Martin-Vide, Gueorgui Pironkov
R1,521 Discovery Miles 15 210 Ships in 10 - 15 working days

This book constitutes the proceedings of the 6th International Conference on Statistical Language and Speech Processing, SLSP 2018, held in Mons, Belgium, in October 2018. The 15 full papers presented in this volume were carefully reviewed and selected from 40 submissions. They were organized in topical sections named: speech synthesis and spoken language generation; speech recognition and post-processing; natural language processing and understanding; and text processing and analysis.

Automatic Speech Recognition of Arabic Phonemes with Neural Networks - A Contrastive Study of Arabic and English (Paperback,... Automatic Speech Recognition of Arabic Phonemes with Neural Networks - A Contrastive Study of Arabic and English (Paperback, 1st ed. 2019)
Mohammed Dib
R1,483 Discovery Miles 14 830 Ships in 10 - 15 working days

This book presents a contrastive linguistics study of Arabic and English for the dual purposes of improved language teaching and speech processing of Arabic via spectral analysis and neural networks. Contrastive linguistics is a field of linguistics which aims to compare the linguistic systems of two or more languages in order to ease the tasks of teaching, learning, and translation. The main focus of the present study is to treat the Arabic minimal syllable automatically to facilitate automatic speech processing in Arabic. It represents important reading for language learners and for linguists with an interest in Arabic and computational approaches.

Fundamentals of Speech Enhancement (Paperback, 1st ed. 2018): Jacob Benesty Fundamentals of Speech Enhancement (Paperback, 1st ed. 2018)
Jacob Benesty
R1,521 Discovery Miles 15 210 Ships in 10 - 15 working days

This book presents and develops several important concepts of speech enhancement in a simple but rigorous way. Many of the ideas are new; not only do they shed light on this old problem but they also offer valuable tips on how to improve on some well-known conventional approaches. The book unifies all aspects of speech enhancement, from single channel, multichannel, beamforming, time domain, frequency domain and time-frequency domain, to binaural in a clear and flexible framework. It starts with an exhaustive discussion on the fundamental best (linear and nonlinear) estimators, showing how they are connected to various important measures such as the coefficient of determination, the correlation coefficient, the conditional correlation coefficient, and the signal-to-noise ratio (SNR). It then goes on to show how to exploit these measures in order to derive all kinds of noise reduction algorithms that can offer an accurate and versatile compromise between noise reduction and speech distortion.

Speech and Language Processing for Human-Machine Communications - Proceedings of CSI 2015 (Paperback, 1st ed. 2018): S.S.... Speech and Language Processing for Human-Machine Communications - Proceedings of CSI 2015 (Paperback, 1st ed. 2018)
S.S. Agrawal, Amita Devi, Ritika Wason, Poonam Bansal
R3,488 Discovery Miles 34 880 Ships in 10 - 15 working days

This volume comprises the select proceedings of the annual convention of the Computer Society of India. Divided into 10 topical volumes, the proceedings present papers on state-of-the-art research, surveys, and succinct reviews. The volumes cover diverse topics ranging from communications networks to big data analytics, and from system architecture to cyber security. This volume focuses on Speech and Language Processing for Human-Machine Communications. The contents of this book will be useful to researchers and students alike.

Statistical Language and Speech Processing - 5th International Conference, SLSP 2017, Le Mans, France, October 23-25, 2017,... Statistical Language and Speech Processing - 5th International Conference, SLSP 2017, Le Mans, France, October 23-25, 2017, Proceedings (Paperback, 1st ed. 2017)
Nathalie Camelin, Yannick Esteve, Carlos Martin-Vide
R2,296 Discovery Miles 22 960 Ships in 10 - 15 working days

This book constitutes the refereed proceedings of the 5th International Conference on Statistical Language and Speech Processing, SLSP 2017, held in Le Mans, France, in October 2017. The 21 full papers presented were carefully reviewed and selected from 39 submissions. The papers cover topics such as anaphora and conference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semanticweb; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling. They are organized in the following sections: language and information extraction; post-processing and applications of automatic transcriptions; speech paralinguistics and synthesis; speech recognition: modeling and resources.

Natural Language Processing and Computational Ling uistics 2: Semantics, Discourse and Applications (Hardcover, Volume 2): MZ... Natural Language Processing and Computational Ling uistics 2: Semantics, Discourse and Applications (Hardcover, Volume 2)
MZ Kurdi
R4,090 Discovery Miles 40 900 Ships in 10 - 15 working days

Natural Language Processing (NLP) is a scientific discipline which is found at the intersection of fields such as Artificial Intelligence, Linguistics, and Cognitive Psychology. This book presents in four chapters the state of the art and fundamental concepts of key NLP areas. Are presented in the first chapter the fundamental concepts in lexical semantics, lexical databases, knowledge representation paradigms, and ontologies. The second chapter is about combinatorial and formal semantics. Discourse and text representation as well as automatic discourse segmentation and interpretation, and anaphora resolution are the subject of the third chapter. Finally, in the fourth chapter, I will cover some aspects of large scale applications of NLP such as software architecture and their relations to cognitive models of NLP as well as the evaluation paradigms of NLP software. Furthermore, I will present in this chapter the main NLP applications such as Machine Translation (MT), Information Retrieval (IR), as well as Big Data and Information Extraction such as event extraction, sentiment analysis and opinion mining.

Proactive Spoken Dialogue Interaction in Multi-Party Environments (Paperback, 2010 ed.): Petra-Maria Strauss, Wolfgang Minker Proactive Spoken Dialogue Interaction in Multi-Party Environments (Paperback, 2010 ed.)
Petra-Maria Strauss, Wolfgang Minker
R2,873 Discovery Miles 28 730 Ships in 10 - 15 working days

Proactive Spoken Dialogue Interaction in Multi-Party Environments describes spoken dialogue systems that act as independent dialogue partners in the conversation with and between users. The resulting novel characteristics such as proactiveness and multi-party capabilities pose new challenges on the dialogue management component of such a system and require the use and administration of an extensive dialogue history. In order to assist the proactive spoken dialogue systems development, a comprehensive data collection seems mandatory and may be performed in a Wizard-of-Oz environment. Such an environment builds also the appropriate basis for an extensive usability and acceptance evaluation. Proactive Spoken Dialogue Interaction in Multi-Party Environments is a useful reference for students and researchers in speech processing.

Estimating Spoken Dialog System Quality with User Models (Paperback, 2013 ed.): Klaus-Peter Engelbrecht Estimating Spoken Dialog System Quality with User Models (Paperback, 2013 ed.)
Klaus-Peter Engelbrecht
R3,726 Discovery Miles 37 260 Ships in 10 - 15 working days

Spoken dialog systems have the potential to offer highly intuitive user interfaces, as they allow systems to be controlled using natural language. However, the complexity inherent in natural language dialogs means that careful testing of the system must be carried out from the very beginning of the design process. This book examines how user models can be used to support such early evaluations in two ways: by running simulations of dialogs, and by estimating the quality judgments of users. First, a design environment supporting the creation of dialog flows, the simulation of dialogs, and the analysis of the simulated data is proposed. How the quality of user simulations may be quantified with respect to their suitability for both formative and summative evaluation is then discussed. The remainder of the book is dedicated to the problem of predicting quality judgments of users based on interaction data. New modeling approaches are presented, which process the dialogs as sequences, and which allow knowledge about the judgment behavior of users to be incorporated into predictions. All proposed methods are validated with example evaluation studies.

Towards Adaptive Spoken Dialog Systems (Paperback, 2013 ed.): Alexander Schmitt, Wolfgang Minker Towards Adaptive Spoken Dialog Systems (Paperback, 2013 ed.)
Alexander Schmitt, Wolfgang Minker
R3,591 Discovery Miles 35 910 Ships in 10 - 15 working days

In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the foundations of machine learning and pattern recognition, this monograph examines how frequently users show negative emotions in spoken dialog systems and develop novel approaches to speech-based emotion recognition using hybrid approach to model emotions. The authors make use of statistical methods based on acoustic, linguistic and contextual features to examine the relationship between the interaction flow and the occurrence of emotions using non-acted recordings several thousand real users from commercial and non-commercial SDS. Additionally, the authors present novel statistical methods that spot problems within a dialog based on interaction patterns. The approaches enable future SDS to offer more natural and robust interactions. This work provides insights, lessons and inspiration for future research and development, not only for spoken dialog systems, but for data-driven approaches to human-machine interaction in general.

Speech Spectrum Analysis (Paperback, 2011 ed.): Sean A. Fulop Speech Spectrum Analysis (Paperback, 2011 ed.)
Sean A. Fulop
R2,873 Discovery Miles 28 730 Ships in 10 - 15 working days

The accurate determination of the speech spectrum, particularly for short frames, is commonly pursued in diverse areas including speech processing, recognition, and acoustic phonetics. With this book the author makes the subject of spectrum analysis understandable to a wide audience, including those with a solid background in general signal processing and those without such background. In keeping with these goals, this is not a book that replaces or attempts to cover the material found in a general signal processing textbook. Some essential signal processing concepts are presented in the first chapter, but even there the concepts are presented in a generally understandable fashion as far as is possible. Throughout the book, the focus is on applications to speech analysis; mathematical theory is provided for completeness, but these developments are set off in boxes for the benefit of those readers with sufficient background. Other readers may proceed through the main text, where the key results and applications will be presented in general heuristic terms, and illustrated with software routines and practical "show-and-tell" discussions of the results. At some points, the book refers to and uses the implementations in the Praat speech analysis software package, which has the advantages that it is used by many scientists around the world, and it is free and open source software. At other points, special software routines have been developed and made available to complement the book, and these are provided in the Matlab programming language. If the reader has the basic Matlab package, he/she will be able to immediately implement the programs in that platform---no extra "toolboxes" are required.

Statistical Pronunciation Modeling for Non-Native Speech Processing (Paperback, 2011 ed.): Rainer E. Gruhn, Wolfgang Minker,... Statistical Pronunciation Modeling for Non-Native Speech Processing (Paperback, 2011 ed.)
Rainer E. Gruhn, Wolfgang Minker, Satoshi Nakamura
R2,873 Discovery Miles 28 730 Ships in 10 - 15 working days

In this work, the authors present a fully statistical approach to model non--native speakers' pronunciation. Second-language speakers pronounce words in multiple different ways compared to the native speakers. Those deviations, may it be phoneme substitutions, deletions or insertions, can be modelled automatically with the new method presented here.

The methods is based on a discrete hidden Markov model as a word pronunciation model, initialized on a standard pronunciation dictionary. The implementation and functionality of the methodology has been proven and verified with a test set of non-native English in the regarding accent.

The book is written for researchers with a professional interest in phonetics and automatic speech and speaker recognition.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Introduction To 80X86 Assembly Language…
Richard C Detmer Paperback R5,659 Discovery Miles 56 590
Oracle Database 10g Data Warehouseing
Lilian Hobbs, Susan Hillson, … Paperback R1,939 Discovery Miles 19 390
Multisensor Decision And Estimation…
Yunmin Zhu Hardcover R4,497 Discovery Miles 44 970
Accelerating MATLAB with GPU Computing…
Jung Suh, Youngmin Kim Paperback R1,547 Discovery Miles 15 470
The Definitive Guide to SQLite
Mike Owens Hardcover R2,369 Discovery Miles 23 690
C Programming For Beginners - The Simple…
Tim Warren Hardcover R597 R541 Discovery Miles 5 410
A Comprehensive Study of SQL - Practice…
Jagdish Chandra Patni Hardcover R2,318 Discovery Miles 23 180
Python Programming For Beginners In 2020…
James Tudor Hardcover R740 Discovery Miles 7 400
Introduction to Assembly Language…
Sivarama P Dandamudi Hardcover R3,094 Discovery Miles 30 940
Fundamentals of Cryptology - A…
Henk C.A. van Tilborg Mixed media product R1,665 Discovery Miles 16 650

 

Partners