0
Your cart

Your cart is empty

Browse All Departments
Price
  • R250 - R500 (2)
  • R500+ (89)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Audio processing > Speech recognition & synthesis

Biometrics: A Very Short Introduction (Paperback): Michael Fairhurst Biometrics: A Very Short Introduction (Paperback)
Michael Fairhurst
R279 R251 Discovery Miles 2 510 Save R28 (10%) Ships in 9 - 17 working days

We live in a society which is increasingly interconnected, in which communication between individuals is mostly mediated via some electronic platform, and transactions are often carried out remotely. In such a world, traditional notions of trust and confidence in the identity of those with whom we are interacting, taken for granted in the past, can be much less reliable. Biometrics - the scientific discipline of identifying individuals by means of the measurement of unique personal attributes - provides a reliable means of establishing or confirming an individual's identity. These attributes include facial appearance, fingerprints, iris patterning, the voice, the way we write, or even the way we walk. The new technologies of biometrics have a wide range of practical applications, from securing mobile phones and laptops to establishing identity in bank transactions, travel documents, and national identity cards. This Very Short Introduction considers the capabilities of biometrics-based identity checking, from first principles to the practicalities of using different types of identification data. Michael Fairhurst looks at the basic techniques in use today, ongoing developments in system design, and emerging technologies, all aimed at improving precision in identification, and providing solutions to an increasingly wide range of practical problems. Considering how they may continue to develop in the future, Fairhurst explores the benefits and limitations of these pervasive and powerful technologies, and how they can effectively support our increasingly interconnected society. ABOUT THE SERIES: The Very Short Introductions series from Oxford University Press contains hundreds of titles in almost every subject area. These pocket-sized books are the perfect way to get ahead in a new subject quickly. Our expert authors combine facts, analysis, perspective, new ideas, and enthusiasm to make interesting and challenging topics highly readable.

Text-to-Speech Synthesis (Hardcover): Paul Taylor Text-to-Speech Synthesis (Hardcover)
Paul Taylor
R2,721 Discovery Miles 27 210 Ships in 10 - 15 working days

Text-to-Speech Synthesis provides a complete, end-to-end account of the process of generating speech by computer. Giving an in-depth explanation of all aspects of current speech synthesis technology, it assumes no specialized prior knowledge. Introductory chapters on linguistics, phonetics, signal processing and speech signals lay the foundation, with subsequent material explaining how this knowledge is put to use in building practical systems that generate speech. Including coverage of the very latest techniques such as unit selection, hidden Markov model synthesis, and statistical text analysis, explanations of the more traditional techniques such as format synthesis and synthesis by rule are also provided. Weaving together the various strands of this multidisciplinary field, the book is designed for graduate students in electrical engineering, computer science, and linguistics. It is also an ideal reference for practitioners in the fields of human communication interaction and telephony.

Speech Recognition Technology and Applications (Hardcover): Vasile-Florian Pais Speech Recognition Technology and Applications (Hardcover)
Vasile-Florian Pais
R3,255 Discovery Miles 32 550 Ships in 10 - 15 working days
Voice Biometrics - Technology, trust and security (Hardcover): Carmen Garcia Mateo, Gerard Chollet Voice Biometrics - Technology, trust and security (Hardcover)
Carmen Garcia Mateo, Gerard Chollet
R3,099 R2,801 Discovery Miles 28 010 Save R298 (10%) Ships in 18 - 22 working days

Voice biometrics are being implemented globally in large scale applications such as remote banking, government e-services, transportation and building security access, autonomous vehicles, and healthcare. They have been integrated in numerous apps, often coupled with face biometrics and artificial intelligence methods. Voice biometrics products and solutions must meet three key requirements for the success in their deployment: they must be highly trustable regarding privacy protection; easy to use and always be available. This edited book presents the state of the art in voice biometrics research and technologies including implementation and deployment challenges in terms of interoperability, scalability and performance, and security. The team of editors and chapter authors combine a wealth of expertise from academia and the industry. Topics covered include the fundamentals of voice biometrics; design of countermeasures for replay attack; attacker's perspective for voice biometrics; voice biometrics; speaker de-identification; performance evaluation of voice biometrics solutions; standardization of voice biometrics technology; industry perspectives; joining forces of voice and facial biometrics; and future trends and challenges in voice biometrics. Providing comprehensive coverage of the field of voice biometrics, this authoritative volume will be of great interest to researchers, scientists, engineers, practitioners and advanced students involved in the fields of security, biometrics, forensic sciences, human computer interaction, speech processing, acoustics, multimedia, pattern recognition, and privacy-preserving, digital signal processing and speech technologies. It will also be of interest to researchers and professionals working in law and criminology.

Audio For Authors Large Print - Audiobooks, Podcasting, And Voice Technologies (Large print, Paperback, Large type / large... Audio For Authors Large Print - Audiobooks, Podcasting, And Voice Technologies (Large print, Paperback, Large type / large print edition)
Joanna Penn
R506 Discovery Miles 5 060 Ships in 18 - 22 working days
Audio For Authors - Audiobooks, Podcasting, And Voice Technologies (Paperback): Joanna Penn Audio For Authors - Audiobooks, Podcasting, And Voice Technologies (Paperback)
Joanna Penn
R384 Discovery Miles 3 840 Ships in 18 - 22 working days
Voice User Interface Projects - Build voice-enabled applications using Dialogflow for Google Home and Alexa Skills Kit for... Voice User Interface Projects - Build voice-enabled applications using Dialogflow for Google Home and Alexa Skills Kit for Amazon Echo (Paperback)
Henry Lee
R1,100 Discovery Miles 11 000 Ships in 18 - 22 working days

Develop intelligent voice-empowered applications and Chatbots that not only understand voice commands but also respond to it Key Features Target multiple platforms by creating voice interactions for your applications Explore real-world examples of how to produce smart and practical virtual assistants Build a virtual assistant for cars using Android Auto in Xamarin Book DescriptionFrom touchscreen and mouse-click, we are moving to voice- and conversation-based user interfaces. By adopting Voice User Interfaces (VUIs), you can create a more compelling and engaging experience for your users. Voice User Interface Projects teaches you how to develop voice-enabled applications for desktop, mobile, and Internet of Things (IoT) devices. This book explains in detail VUI and its importance, basic design principles of VUI, fundamentals of conversation, and the different voice-enabled applications available in the market. You will learn how to build your first voice-enabled application by utilizing DialogFlow and Alexa's natural language processing (NLP) platform. Once you are comfortable with building voice-enabled applications, you will understand how to dynamically process and respond to the questions by using NodeJS server deployed to the cloud. You will then move on to securing NodeJS RESTful API for DialogFlow and Alexa webhooks, creating unit tests and building voice-enabled podcasts for cars. Last but not the least you will discover advanced topics such as handling sessions, creating custom intents, and extending built-in intents in order to build conversational VUIs that will help engage the users. By the end of the book, you will have grasped a thorough knowledge of how to design and develop interactive VUIs. What you will learn Understand NLP platforms with machine learning Exploit best practices and user experiences in creating VUI Build voice-enabled chatbots Host, secure, and test in a cloud platform Create voice-enabled applications for personal digital assistant devices Develop a virtual assistant for cars Who this book is forVoice User Interface Projects is for you if you are a software engineer who wants to develop voice-enabled applications for your personal digital assistant devices such as Amazon Echo and Google Home, along with your car's virtual assistant systems. Some experience with JavaScript is required.

Alexa Skills Projects - Build exciting projects with Amazon Alexa and integrate it with Internet of Things (Paperback): Madhur... Alexa Skills Projects - Build exciting projects with Amazon Alexa and integrate it with Internet of Things (Paperback)
Madhur Bhargava
R1,002 Discovery Miles 10 020 Ships in 18 - 22 working days

Get up and running with the fundamentals of Amazon Alexa and build exciting IoT projects Key Features Gain hands-on experience of working with Amazon Echo and Alexa Build exciting IoT projects using Amazon Echo Learn about voice-enabled smart devices Book DescriptionAmazon Echo is a smart speaker developed by Amazon, which connects to Amazon's Alexa Voice Service and is entirely controlled by voice commands. Amazon Echo is currently being used for a variety of purposes such as home automation, asking generic queries, and even ordering a cab or pizza. Alexa Skills Projects starts with a basic introduction to Amazon Alexa and Echo. You will then deep dive into Alexa Programming concepts such as Intents, Slots, Lambdas and maintaining your skill's state using DynamoDB. You will get a clear understanding of how some of the most popular Alexa Skills work, and gain experience of working with real-world Amazon Echo applications. In the concluding chapters, you will explore the future of voice-enabled applications and their coverage with respect to the Internet of Things. By the end of the book, you will have learned to design Alexa Skills for specific purposes and interact with Amazon Echo to execute these skills. What you will learn Understand how Amazon Echo is already being used in various domains Discover how an Alexa Skill is architected Get a clear understanding of how some of the most popular Alexa Skills work Design Alexa Skills for specific purposes and interact with Amazon Echo to execute them Gain experience of programming for Amazon Echo Explore future applications of Amazon Echo and other voice-activated devices Who this book is forAlexa Skills Projects is for individuals who want to have a deep understanding of the underlying technology that drives Amazon Echo and Alexa, and how it can be integrated with the Internet of Things to develop hands-on projects.

Spoken Dialogue Systems Technology and Design (Paperback, 2011 ed.): Wolfgang Minker, Gary Geunbae Lee, Satoshi Nakamura,... Spoken Dialogue Systems Technology and Design (Paperback, 2011 ed.)
Wolfgang Minker, Gary Geunbae Lee, Satoshi Nakamura, Joseph Mariani
R5,146 Discovery Miles 51 460 Ships in 18 - 22 working days

Spoken Dialogue Systems Technology and Design covers key topics in the field of spoken language dialogue interaction from a variety of leading researchers. It brings together several perspectives in the areas of corpus annotation and analysis, dialogue system construction, as well as theoretical perspectives on communicative intention, context-based generation, and modelling of discourse structure. These topics are all part of the general research and development within the area of discourse and dialogue with an emphasis on dialogue systems; corpora and corpus tools and semantic and pragmatic modelling of discourse and dialogue.

Dragon Naturally Speaking For Dummies (Paperback, 4th Revised edition): Stephanie Diamond Dragon Naturally Speaking For Dummies (Paperback, 4th Revised edition)
Stephanie Diamond
R717 R646 Discovery Miles 6 460 Save R71 (10%) Ships in 18 - 22 working days

Dragon NaturallySpeaking For Dummies, 4E will introduce readers to everything they need to know to get started with this advanced voice recognition software. Readers will get the most up-to-date information on the latest version of the software. PART I: Hatching and Launching Your Dragon Software Chapter 1: Preparing for Dragons Chapter 2: Basic Training Chapter 3: Launching and Controlling Your Dragon PART II: Fire-Breathing 101 Chapter 4: Basic Dictating Chapter 5: Selecting, Editing, and Correcting in the NaturallySpeaking Window Chapter 6: Fonts, Alignment, and All That: Formatting Your Document Chapter 7: Proofreading and Listening to Your Text Chapter 8: Using Recorded Speech Chapter 9: Mobile Edition and NaturallyMobile Recorder PART III: Giving Your Applications Wings Chapter 10: Dictating into Other Applications Chapter 11: Controlling Your Desktop and Windows by Voice Chapter 12: Using NaturalWord for Word and WordPerfect Chapter 13: A Dragon Online Chapter 14: Dragon Your Data Around Chapter 15: Staying Organized on the Move PART IV: Precision Flying Chapter 16: Feeding Your Dragon: RAM, Disk Space, and Speed Chapter 17: Speaking More Clearly to Your Dragon Chapter 18: Additional Training and Vocabulary Building Chapter 19: Improving Audio Input Chapter 20: Dealing with Change Chapter 21: Having Multiple Users or Vocabularies Chapter 22: Creating Your Own Commands Chapter 23: Taking Draconian Measures: Workarounds for Problems PART V: The Part of Tens Chapter 24: Ten Common Problems Chapter 25: Ten Time-and-Sanity-Saving Tips Chapter 26: Ten Mistakes to Avoid Chapter 27: Ten Stupid Dragon Tricks

Advances in Non-Linear Modeling for Speech Processing (Paperback, 2012): Raghunath S. Holambe, Mangesh S. Deshpande Advances in Non-Linear Modeling for Speech Processing (Paperback, 2012)
Raghunath S. Holambe, Mangesh S. Deshpande
R1,408 Discovery Miles 14 080 Ships in 18 - 22 working days

"Advances in Non-Linear Modeling for Speech Processing" includes advanced topics in non-linear estimation and modeling techniques along with their applications to speaker recognition.
Non-linear aeroacoustic modeling approach is used to estimate the important fine-structure speech events, which are not revealed by the short time Fourier transform (STFT). This aeroacostic modeling approach provides the impetus for the high resolution Teager energy operator (TEO). This operator is characterized by a time resolution that can track rapid signal energy changes within a glottal cycle.
The cepstral features like linear prediction cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are computed from the magnitude spectrum of the speech frame and the phase spectra is neglected. To overcome the problem of neglecting the phase spectra, the speech production system can be represented as an amplitude modulation-frequency modulation (AM-FM) model. To demodulate the speech signal, to estimation the amplitude envelope and instantaneous frequency components, the energy separation algorithm (ESA) and the Hilbert transform demodulation (HTD) algorithm are discussed.
Different features derived using above non-linear modeling techniques are used to develop a speaker identification system. Finally, it is shown that, the fusion of speech production and speech perception mechanisms can lead to a robust feature set.

Emulating Human Speech Recognition - A Scene Analysis Approach to Improving Robustness in Automatic Speech Recognition... Emulating Human Speech Recognition - A Scene Analysis Approach to Improving Robustness in Automatic Speech Recognition (Paperback)
Andre Coy
R1,801 Discovery Miles 18 010 Ships in 10 - 15 working days

This book presents a systematic approach to the automatic recognition of simultaneous speech signals using computational auditory scene analysis. Inspired by human auditory perception, this book investigates a range of algorithms and techniques for decomposing multiple speech signals by integrating a spectro-temporal fragment decoder within a statistical search process. The outcome is a comprehensive insight into the mechanisms required if automatic speech recognition is to approach human levels of performance.

Natural Language Processing and Computational uistics 1 (Hardcover): Z Kurdi Natural Language Processing and Computational uistics 1 (Hardcover)
Z Kurdi
R3,766 Discovery Miles 37 660 Ships in 18 - 22 working days

Natural language processing (NLP) is a scientific discipline which is found at the interface of computer science, artificial intelligence and cognitive psychology. Providing an overview of international work in this interdisciplinary field, this book gives the reader a panoramic view of both early and current research in NLP. Carefully chosen multilingual examples present the state of the art of a mature field which is in a constant state of evolution. In four chapters, this book presents the fundamental concepts of phonetics and phonology and the two most important applications in the field of speech processing: recognition and synthesis. Also presented are the fundamental concepts of corpus linguistics and the basic concepts of morphology and its NLP applications such as stemming and part of speech tagging. The fundamental notions and the most important syntactic theories are presented, as well as the different approaches to syntactic parsing with reference to cognitive models, algorithms and computer applications.

Telecommunications Relay Services to Assist Persons with Hearing or Speech Disabilities - Assessments (Paperback): Jason Graham Telecommunications Relay Services to Assist Persons with Hearing or Speech Disabilities - Assessments (Paperback)
Jason Graham
R1,975 Discovery Miles 19 750 Ships in 10 - 15 working days

Since 2002, the overall minutes of use and costs for the Telecommunications Relay Service (TRS) program have grown significantly due to the advent of Internet-based forms of TRS and increased usage by the deaf and hard-of-hearing communities. TRS allows persons with hearing or speech disabilities to place and receive telephone calls, often with the help of a communications assistant who acts as a translator or facilitator between the two parties having the conversation. FCC is the steward of the TRS program and the federal TRS Fund, which reimburses TRS providers. This book examines, among other things, changes in TRS services and costs since 2002; FCC's TRS performance goals and measures and how they compare with key characteristics of successful performance goals and measures; and the extent to which the design of the program's internal control system identifies and considers program risks.

Speech Recognition in Adverse Conditions - Explorations in Behaviour and Neuroscience (Hardcover): Sven Mattys, Ann Bradlow,... Speech Recognition in Adverse Conditions - Explorations in Behaviour and Neuroscience (Hardcover)
Sven Mattys, Ann Bradlow, Matthew Davis, Sophie Scott
R3,168 R2,680 Discovery Miles 26 800 Save R488 (15%) Ships in 10 - 15 working days

Speech recognition in 'adverse conditions' has been a familiar area of research in computer science, engineering, and hearing sciences for several decades. In contrast, most psycholinguistic theories of speech recognition are built upon evidence gathered from tasks performed by healthy listeners on carefully recorded speech, in a quiet environment, and under conditions of undivided attention. Building upon the momentum initiated by the Psycholinguistic Approaches to Speech Recognition in Adverse Conditions workshop held in Bristol, UK, in 2010, the aim of this volume is to promote a multi-disciplinary, yet unified approach to the perceptual, cognitive, and neuro-physiological mechanisms underpinning the recognition of degraded speech, variable speech, speech experienced under cognitive load, and speech experienced by theoretically relevant populations. This collection opens with a review of the literature and a formal classification of adverse conditions. The research articles then highlight those adverse conditions with the greatest potential for constraining theory, showing that some speech phenomena often believed to be immutable can be affected by noise, surface variations, or attentional set in ways that will force researchers to rethink their theory. This volume is essential for those interested in speech recognition outside laboratory constraints.

Advances in Speaker Recognition (Paperback, 2012): Hemant Arjun Patil Advances in Speaker Recognition (Paperback, 2012)
Hemant Arjun Patil
R1,086 Discovery Miles 10 860 Out of stock

"Advances in Speaker Recognition" presents a comprehensive analysis of the progress of speaker recognition. The material addresses the technical aspects of voice technology within the framework of societal needs, such as the use of speech recognition software to produce up-to-date electronic health records, not withstanding patients making changes to health plans and physicians. Due to global security concerns, there is a greater need to identify a person's identity from his or her voice. Included will be discussion of speaker biometrics literature, data collection, corpus design, the detection-to-error trade off curve, mono- and multi-lingual speaker detection, as well as research in mimic resistance.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Audio For Authors - Audiobooks…
Joanna Penn Hardcover R565 Discovery Miles 5 650
Estimating Spoken Dialog System Quality…
Klaus-Peter Engelbrecht Hardcover R2,641 Discovery Miles 26 410
Proactive Spoken Dialogue Interaction in…
Petra-Maria Strauss, Wolfgang Minker Hardcover R2,750 Discovery Miles 27 500
Self-Learning Speaker Identification - A…
Tobias Herbig, Franz Gerl, … Hardcover R2,746 Discovery Miles 27 460
Designing Human Interface in Speech…
Fang Chen Hardcover R2,871 Discovery Miles 28 710
Speech Spectrum Analysis
Sean A. Fulop Hardcover R2,661 Discovery Miles 26 610
Visual Speech Recognition - Lip…
Alan Wee-Chung Liew, Shilin Wang Hardcover R5,748 Discovery Miles 57 480
Automatic Speech Recognition of Arabic…
Mohammed Dib Paperback R1,372 Discovery Miles 13 720
Statistical Pronunciation Modeling for…
Rainer E. Gruhn, Wolfgang Minker, … Hardcover R2,653 Discovery Miles 26 530
Speech Processing in Embedded Systems
Priyabrata Sinha Hardcover R2,746 Discovery Miles 27 460

 

Partners