0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (1)
  • R250 - R500 (2)
  • R500+ (90)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Audio processing > Speech recognition & synthesis

Emotion Recognition using Speech Features (Paperback, 2013 ed.): K. Sreenivasa Rao, Shashidhar G. Koolagudi Emotion Recognition using Speech Features (Paperback, 2013 ed.)
K. Sreenivasa Rao, Shashidhar G. Koolagudi
R1,807 Discovery Miles 18 070 Ships in 10 - 15 working days

"Emotion Recognition Using Speech Features" provides coverage of emotion-specific features present in speech. The author also discusses suitable models for capturing emotion-specific information for distinguishing different emotions. The content of this book is important for designing and developing natural and sophisticated speech systems. In this Brief, Drs. Rao and Koolagudi lead a discussion of how emotion-specific information is embedded in speech and how to acquire emotion-specific knowledge using appropriate statistical models. Additionally, the authors provide information about exploiting multiple evidences derived from various features and models. The acquired emotion-specific knowledge is useful for synthesizing emotions. Features includes discussion of: * Global and local prosodic features at syllable, word and phrase levels, helpful for capturing emotion-discriminative information; * Exploiting complementary evidences obtained from excitation sources, vocal tract systems and prosodic features in order to enhance the emotion recognition performance; * Proposed multi-stage and hybrid models for improving the emotion recognition performance. This brief is for researchers working in areas related to speech-based products such as mobile phone manufacturing companies, automobile companies, and entertainment products as well as researchers involved in basic and applied speech processing research.

Advances in Speech and Language Technologies for Iberian Languages - IberSPEECH 2012 Conference, Madrid, Spain, November 21-23,... Advances in Speech and Language Technologies for Iberian Languages - IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012. Proceedings (Paperback, 2012 ed.)
Doroteo T. Toledano, Alfonso Ortega, Antonio Teixeira, Joaquin Gonzalez-Rodriguez, Luis Hernandez-Gomez, …
R1,472 Discovery Miles 14 720 Ships in 10 - 15 working days

This volume constitutes the refereed proceedings of the Spanish Conference, IberSPEECH 2012: Joint VII "Jornadas en Tecnologia del Habla" and III Iberian SLTech Workshop, held in Madrid, Spain, in November 21-23, 2012. The 29 revised papers were carefully reviewed and selected from 80 submissions. The papers are organized in topical sections on speaker characterization and recognition; audio and speech segmentation; pathology detection and speech characterization; dialogue and multimodal systems; robustness in automatic speech recognition; applications of speech and language technologies.

Automatic Speech Signal Analysis for Clinical Diagnosis and Assessment of Speech Disorders (Paperback, 2013 ed.): Ladan... Automatic Speech Signal Analysis for Clinical Diagnosis and Assessment of Speech Disorders (Paperback, 2013 ed.)
Ladan Baghai-Ravary, Steve W. Beet
R1,469 Discovery Miles 14 690 Ships in 10 - 15 working days

"Automatic Speech Signal Analysis for Clinical Diagnosis and Assessment of Speech Disorders "provides a survey of methods designed to aid clinicians in the diagnosis and monitoring of speech disorders such as dysarthria and dyspraxia, with an emphasis on the signal processing techniques, statistical validity of the results presented in the literature, and the appropriateness of methodsthat do not requirespecialized equipment, rigorously controlled recording procedures or highly skilled personnel to interpret results.
Such techniques offer the promise of a simple and cost-effective, yet objective, assessment of a range of medical conditions, which would be of great value to clinicians. The ideal scenario would begin with the collection of examples of the clients' speech, either over the phone or using portable recording devices operated by non-specialist nursing staff.
The recordings could then be analyzed initially to aid diagnosis of conditions, and subsequently to monitor the clients' progress and response to treatment. The automation of this process would allow more frequent and regular assessments to be performed, as well as providing greater objectivity."

Speech Processing in Modern Communication - Challenges and Perspectives (Paperback, 2010 ed.): Israel Cohen, Jacob Benesty,... Speech Processing in Modern Communication - Challenges and Perspectives (Paperback, 2010 ed.)
Israel Cohen, Jacob Benesty, Sharon Gannot
R4,247 Discovery Miles 42 470 Ships in 10 - 15 working days

Modern communication devices, such as mobile phones, teleconferencing systems, VoIP, etc., are often used in noisy and reverberant environments. Therefore, signals picked up by the microphones from telecommunication devices contain not only the desired near-end speech signal, but also interferences such as the background noise, far-end echoes produced by the loudspeaker, and reverberations of the desired source. These interferences degrade the fidelity and intelligibility of the near-end speech in human-to-human telecommunications and decrease the performance of human-to-machine interfaces (i.e., automatic speech recognition systems).

The proposed book deals with the fundamental challenges of speech processing in modern communication, including speech enhancement, interference suppression, acoustic echo cancellation, relative transfer function identification, source localization, dereverberation, and beamforming in reverberant environments.

Enhancement of speech signals is necessary whenever the source signal is corrupted by noise. In highly non-stationary noise environments, noise transients, and interferences may be extremely annoying. Acoustic echo cancellation is used to eliminate the acoustic coupling between the loudspeaker and the microphone of a communication device. Identification of the relative transfer function between sensors in response to a desired speech signal enables to derive a reference noise signal for suppressing directional or coherent noise sources. Source localization, dereverberation, and beamforming in reverberant environments further enable to increase the intelligibility of the near-end speech signal.

Perception and Machine Intelligence - First Indo-Japan Conference, PerMIn 2012, Kolkata, India, January 12-13, 2011,... Perception and Machine Intelligence - First Indo-Japan Conference, PerMIn 2012, Kolkata, India, January 12-13, 2011, Proceedings (Paperback, 2012)
Malay K. Kundu, Sushmita Mitra, Debasis Mazumdar, Sankar K. Pal
R1,500 Discovery Miles 15 000 Ships in 10 - 15 working days

This book constitutes the proceedings of the First Indo-Japanese conference on Perception and Machine Intelligence, PerMIn 2012, held in Kolkata, India, in January 2012. The 41 papers, presented together with 1 keynote paper and 3 plenary papers, were carefully reviewed and selected for inclusion in the book. The papers are organized in topical sections named perception; human-computer interaction; e-nose and e-tongue; machine intelligence and application; image and video processing; and speech and signal processing.

Automated Speaking Assessment - Using Language Technologies to Score Spontaneous Speech (Hardcover): Klaus Zechner, Keelan... Automated Speaking Assessment - Using Language Technologies to Score Spontaneous Speech (Hardcover)
Klaus Zechner, Keelan Evanini
R3,877 Discovery Miles 38 770 Ships in 12 - 17 working days

Automated Speaking Assessment: Using Language Technologies to Score Spontaneous Speech provides a thorough overview of state-of-the-art automated speech scoring technology as it is currently used at Educational Testing Service (ETS). Its main focus is related to the automated scoring of spontaneous speech elicited by TOEFL iBT Speaking section items, but other applications of speech scoring, such as for more predictable spoken responses or responses provided in a dialogic setting, are also discussed. The book begins with an in-depth overview of the nascent field of automated speech scoring-its history, applications, and challenges-followed by a discussion of psychometric considerations for automated speech scoring. The second and third parts discuss the integral main components of an automated speech scoring system as well as the different types of automatically generated measures extracted by the system features related to evaluate the speaking construct of communicative competence as measured defined by the TOEFL iBT Speaking assessment. Finally, the last part of the book touches on more recent developments, such as providing more detailed feedback on test takers' spoken responses using speech features and scoring of dialogic speech. It concludes with a discussion, summary, and outlook on future developments in this area. Written with minimal technical details for the benefit of non-experts, this book is an ideal resource for graduate students in courses on Language Testing and Assessment as well as teachers and researchers in applied linguistics.

Information Security for Automatic Speaker Identification (Paperback, Edition.): Fathi E. Abd El-Samie Information Security for Automatic Speaker Identification (Paperback, Edition.)
Fathi E. Abd El-Samie
R1,423 Discovery Miles 14 230 Ships in 10 - 15 working days

The author covers the fundamentals of both information and communication security including current developments in some of the most critical areas of automatic speech recognition. Included are topics on speech watermarking, speech encryption, steganography, multilevel security systems comprising speaker identification, real transmission of watermarked or encrypted speech signals, and more. The book is especially useful for information security specialist, government security analysts, speech development professionals, and for individuals involved in the study and research of speech recognition at advanced levels.

Automatic Speech Recognition on Mobile Devices and over Communication Networks (Paperback, Softcover reprint of hardcover 1st... Automatic Speech Recognition on Mobile Devices and over Communication Networks (Paperback, Softcover reprint of hardcover 1st ed. 2008)
Zheng-Hua Tan, Boerge Lindberg
R4,266 Discovery Miles 42 660 Ships in 10 - 15 working days

The advances in computing and networking have sparked an enormous interest in deploying automatic speech recognition on mobile devices and over communication networks. This book brings together academic researchers and industrial practitioners to address the issues in this emerging realm and presents the reader with a comprehensive introduction to the subject of speech recognition in devices and networks. It covers network, distributed and embedded speech recognition systems.

Advances in Nonlinear Speech Processing - International Conference on Nonlinear Speech Processing, NOLISP 2009, Vic, Spain,... Advances in Nonlinear Speech Processing - International Conference on Nonlinear Speech Processing, NOLISP 2009, Vic, Spain, June 25-27, 2009, Revised Selected Papers (Paperback, Edition.)
Jordi Sole-Casals, Vladimir Zai?a?t?s?
R1,469 Discovery Miles 14 690 Ships in 10 - 15 working days

This volume contains the proceedings of NOLISP 2009, an ISCA Tutorial and Workshop on Non-Linear Speech Processing held at the University of Vic (- talonia, Spain) during June 25-27, 2009. NOLISP2009wasprecededbythreeeditionsofthisbiannualeventheld2003 in Le Croisic (France), 2005 in Barcelona, and 2007 in Paris. The main idea of NOLISP workshops is to present and discuss new ideas, techniques and results related to alternative approaches in speech processing that may depart from the mainstream. In order to work at the front-end of the subject area, the following domains of interest have been de?ned for NOLISP 2009: 1. Non-linear approximation and estimation 2. Non-linear oscillators and predictors 3. Higher-order statistics 4. Independent component analysis 5. Nearest neighbors 6. Neural networks 7. Decision trees 8. Non-parametric models 9. Dynamics for non-linear systems 10. Fractal methods 11. Chaos modeling 12. Non-linear di?erential equations The initiative to organize NOLISP 2009 at the University of Vic (UVic) came from the UVic Research Group on Signal Processing and was supported by the Hardware-Software Research Group. We would like to acknowledge the ?nancial support obtained from the M- istry of Science and Innovation of Spain (MICINN), University of Vic, ISCA, and EURASIP. All contributions to this volume are original. They were subject to a doub- blind refereeing procedure before their acceptance for the workshop and were revised after being presented at NOLISP 2009.

Designing Voice User Interfaces (Paperback): Cathy Pearl Designing Voice User Interfaces (Paperback)
Cathy Pearl
R1,063 R672 Discovery Miles 6 720 Save R391 (37%) Ships in 12 - 17 working days

Voice user interfaces (VUIs) are becoming all the rage today. But how do you build one that people can actually converse with? Whether you're designing a mobile app, a toy, or a device such as a home assistant, this practical book guides you through basic VUI design principles, helps you choose the right speech recognition engine, and shows you how to measure your VUI's performance and improve upon it. Author Cathy Pearl also takes product managers, UX designers, and VUI designers into advanced design topics that will help make your VUI not just functional, but great. Understand key VUI design concepts, including command-and-control and conversational systems Decide if you should use an avatar or other visual representation with your VUI Explore speech recognition technology and its impact on your design Take your VUI above and beyond the basic exchange of information Learn practical ways to test your VUI application with users Monitor your app and learn how to quickly improve performance Get real-world examples of VUIs for home assistants, smartwatches, and car systems

Voice User Interface Design - Moving from GUI to Mixed Modal Interaction (Paperback, 1st ed.): Ritwik Dasgupta Voice User Interface Design - Moving from GUI to Mixed Modal Interaction (Paperback, 1st ed.)
Ritwik Dasgupta
R964 R789 Discovery Miles 7 890 Save R175 (18%) Ships in 10 - 15 working days

Design and implement voice user interfaces. This guide to VUI helps you make decisions as you deal with the challenges of moving from a GUI world to mixed-modal interactions with GUI and VUI. The way we interact with devices is changing rapidly and this book gives you a close view across major companies via real-world applications and case studies. Voice User Interface Design provides an explanation of the principles of VUI design. The book covers the design phase, with clear explanations and demonstrations of each design principle through examples of multi-modal interactions (GUI plus VUI) and how they differ from pure VUI. The book also differentiates principles of VUI related to chat-based bot interaction models. By the end of the book you will have a vision of the future, imagining new user-oriented scenarios and new avenues, which until now were untouched. What You'll Learn Implement and adhere to each design principle Understand how VUI differs from other interaction models Work in the current VUI landscape Who This Book Is For Interaction designers, entrepreneurs, tech enthusiasts, thought leaders, and AI enthusiasts interested in the future of user experience/interaction, designing high-quality VUI, and product decision making

Voice Applications for Alexa and Google Assistant (Paperback): Dustin Coates Voice Applications for Alexa and Google Assistant (Paperback)
Dustin Coates
R2,018 Discovery Miles 20 180 Ships in 12 - 17 working days

To create their own voice "skills," users need to learn some new device toolkits, the basics of Voice UI design, and some emerging best practices for building and deploying on these diverse platforms. Voice Applications for Alexa and Google Assistant guides readers in the exciting world of designing, building, and implementing voice-based applications for Amazon Alexa or Google Assistant! They learn how to build their own "skills"-the voice app term for actions the device can perform-from scratch. Key Features * Designing a voice interaction model * Fulfilling skills via a serverless platform like AWS Lambda * Connecting a skill to a database Audience Written for JavaScript developers interested in building voice-enabled applications. No prior experience required! Author Bio Dustin A. Coates is a web developer and web development instructor. He has taught hundreds of students online and offline at General Assembly. Dustin also developed popular courses for OneMonth.com and the European non-profit Konexio, which teaches refugees how to code.

Automated Speaking Assessment - Using Language Technologies to Score Spontaneous Speech (Paperback): Klaus Zechner, Keelan... Automated Speaking Assessment - Using Language Technologies to Score Spontaneous Speech (Paperback)
Klaus Zechner, Keelan Evanini
R1,429 Discovery Miles 14 290 Ships in 12 - 17 working days

Automated Speaking Assessment: Using Language Technologies to Score Spontaneous Speech provides a thorough overview of state-of-the-art automated speech scoring technology as it is currently used at Educational Testing Service (ETS). Its main focus is related to the automated scoring of spontaneous speech elicited by TOEFL iBT Speaking section items, but other applications of speech scoring, such as for more predictable spoken responses or responses provided in a dialogic setting, are also discussed. The book begins with an in-depth overview of the nascent field of automated speech scoring-its history, applications, and challenges-followed by a discussion of psychometric considerations for automated speech scoring. The second and third parts discuss the integral main components of an automated speech scoring system as well as the different types of automatically generated measures extracted by the system features related to evaluate the speaking construct of communicative competence as measured defined by the TOEFL iBT Speaking assessment. Finally, the last part of the book touches on more recent developments, such as providing more detailed feedback on test takers' spoken responses using speech features and scoring of dialogic speech. It concludes with a discussion, summary, and outlook on future developments in this area. Written with minimal technical details for the benefit of non-experts, this book is an ideal resource for graduate students in courses on Language Testing and Assessment as well as teachers and researchers in applied linguistics.

Telecommunications Relay Services to Assist Persons with Hearing or Speech Disabilities - Assessments (Paperback): Jason Graham Telecommunications Relay Services to Assist Persons with Hearing or Speech Disabilities - Assessments (Paperback)
Jason Graham
R1,941 Discovery Miles 19 410 Ships in 12 - 17 working days

Since 2002, the overall minutes of use and costs for the Telecommunications Relay Service (TRS) program have grown significantly due to the advent of Internet-based forms of TRS and increased usage by the deaf and hard-of-hearing communities. TRS allows persons with hearing or speech disabilities to place and receive telephone calls, often with the help of a communications assistant who acts as a translator or facilitator between the two parties having the conversation. FCC is the steward of the TRS program and the federal TRS Fund, which reimburses TRS providers. This book examines, among other things, changes in TRS services and costs since 2002; FCC's TRS performance goals and measures and how they compare with key characteristics of successful performance goals and measures; and the extent to which the design of the program's internal control system identifies and considers program risks.

Artificial Intelligence and Speech Technology - Third International Conference, AIST 2021, Delhi, India, November 12-13, 2021,... Artificial Intelligence and Speech Technology - Third International Conference, AIST 2021, Delhi, India, November 12-13, 2021, Revised Selected Papers (Paperback, 1st ed. 2022)
Amita Dev, S.S. Agrawal, Arun Sharma
R3,146 Discovery Miles 31 460 Ships in 10 - 15 working days

This volume constitutes selected papers presented at the Third International Conference on Artificial Intelligence and Speech Technology, AIST 2021, held in Delhi, India, in November 2021. The 36 full papers and 18 short papers presented were thoroughly reviewed and selected from the 178 submissions. They provide a discussion on application of Artificial Intelligence tools in speech analysis, representation and models, spoken language recognition and understanding, affective speech recognition, interpretation and synthesis, speech interface design and human factors engineering, speech emotion recognition technologies, audio-visual speech processing and several others.

Text-to-Speech Synthesis (Hardcover): Paul Taylor Text-to-Speech Synthesis (Hardcover)
Paul Taylor
R2,579 Discovery Miles 25 790 Ships in 12 - 17 working days

Text-to-Speech Synthesis provides a complete, end-to-end account of the process of generating speech by computer. Giving an in-depth explanation of all aspects of current speech synthesis technology, it assumes no specialized prior knowledge. Introductory chapters on linguistics, phonetics, signal processing and speech signals lay the foundation, with subsequent material explaining how this knowledge is put to use in building practical systems that generate speech. Including coverage of the very latest techniques such as unit selection, hidden Markov model synthesis, and statistical text analysis, explanations of the more traditional techniques such as format synthesis and synthesis by rule are also provided. Weaving together the various strands of this multidisciplinary field, the book is designed for graduate students in electrical engineering, computer science, and linguistics. It is also an ideal reference for practitioners in the fields of human communication interaction and telephony.

Intelligent Speech Signal Processing (Paperback): Nilanjan Dey Intelligent Speech Signal Processing (Paperback)
Nilanjan Dey
R2,731 R2,480 Discovery Miles 24 800 Save R251 (9%) Ships in 12 - 17 working days

Intelligent Speech Signal Processing investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of speech data analysis and management tools across different recording systems. The book emphasizes the multidisciplinary nature of the field, presenting different applications and challenges with extensive studies on the design, development and management of intelligent systems, neural networks and related machine learning techniques for speech signal processing.

Natural Language Processing and Computational Ling uistics 2: Semantics, Discourse and Applications (Hardcover, Volume 2): MZ... Natural Language Processing and Computational Ling uistics 2: Semantics, Discourse and Applications (Hardcover, Volume 2)
MZ Kurdi
R3,722 Discovery Miles 37 220 Ships in 12 - 17 working days

Natural Language Processing (NLP) is a scientific discipline which is found at the intersection of fields such as Artificial Intelligence, Linguistics, and Cognitive Psychology. This book presents in four chapters the state of the art and fundamental concepts of key NLP areas. Are presented in the first chapter the fundamental concepts in lexical semantics, lexical databases, knowledge representation paradigms, and ontologies. The second chapter is about combinatorial and formal semantics. Discourse and text representation as well as automatic discourse segmentation and interpretation, and anaphora resolution are the subject of the third chapter. Finally, in the fourth chapter, I will cover some aspects of large scale applications of NLP such as software architecture and their relations to cognitive models of NLP as well as the evaluation paradigms of NLP software. Furthermore, I will present in this chapter the main NLP applications such as Machine Translation (MT), Information Retrieval (IR), as well as Big Data and Information Extraction such as event extraction, sentiment analysis and opinion mining.

Statistical Language and Speech Processing - 4th International Conference, SLSP 2016, Pilsen, Czech Republic, October 11-12,... Statistical Language and Speech Processing - 4th International Conference, SLSP 2016, Pilsen, Czech Republic, October 11-12, 2016, Proceedings (Paperback, 1st ed. 2016)
Pavel Kral, Carlos Martin-Vide
R1,637 Discovery Miles 16 370 Ships in 10 - 15 working days

This book constitutes the refereed proceedings of the 4th International Conference on Statistical Language and Speech Processing, SLSP 2016, held in Pilsen, Czech Republic, in October 2016. The 11 full papers presented together with two invited talks were carefully reviewed and selected from 38 submissions. The papers cover topics such as anaphora and coreference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semantic web; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling.

Introduction to EEG- and Speech-Based Emotion Recognition (Paperback): Priyanka A. Abhang, Bharti Gawali, Suresh C. Mehrotra Introduction to EEG- and Speech-Based Emotion Recognition (Paperback)
Priyanka A. Abhang, Bharti Gawali, Suresh C. Mehrotra
R1,898 Discovery Miles 18 980 Ships in 12 - 17 working days

Introduction to EEG- and Speech-Based Emotion Recognition Methods examines the background, methods, and utility of using electroencephalograms (EEGs) to detect and recognize different emotions. By incorporating these methods in brain-computer interface (BCI), we can achieve more natural, efficient communication between humans and computers. This book discusses how emotional states can be recognized in EEG images, and how this is useful for BCI applications. EEG and speech processing methods are explored, as are the technological basics of how to operate and record EEGs. Finally, the authors include information on EEG-based emotion recognition, classification, and a proposed EEG/speech fusion method for how to most accurately detect emotional states in EEG recordings.

Speech and Computer - 17th International Conference, SPECOM 2015, Athens, Greece, September 20-24, 2015, Proceedings... Speech and Computer - 17th International Conference, SPECOM 2015, Athens, Greece, September 20-24, 2015, Proceedings (Paperback, 1st ed. 2015)
Andrey Ronzhin, Rodmonga Potapova, Nikos Fakotakis
R1,536 Discovery Miles 15 360 Ships in 10 - 15 working days

This book constitutes the refereed proceedings of the 17th International Conference on Speech and Computer, SPECOM 2015, held in Athens, Greece, in September 2015. The 59 revised full papers presented together with 2 invited talks were carefully reviewed and selected from 104 initial submissions. The papers cover a wide range of topics in the area of computer speech processing such as recognition, synthesis, and understanding and related domains including signal processing, language and text processing, multi-modal speech processing or human-computer interaction.

Text, Speech, and Dialogue - 18th International Conference, TSD 2015, Pilsen,Czech Republic, September 14-17, 2015, Proceedings... Text, Speech, and Dialogue - 18th International Conference, TSD 2015, Pilsen,Czech Republic, September 14-17, 2015, Proceedings (Paperback, 1st ed. 2015)
Pavel Kral, Vaclav Matousek
R3,231 Discovery Miles 32 310 Ships in 10 - 15 working days

This book constitutes the refereed proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015, held in Pilsen, Czech Republic, in September 2015. The 67 papers presented together with 3 invited papers were carefully reviewed and selected from 138 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.

Data-Driven Alexa Skills - Voice Access to Rich Data Sources for Enterprise Applications (Paperback, 1st ed.): Simon A. Kingaby Data-Driven Alexa Skills - Voice Access to Rich Data Sources for Enterprise Applications (Paperback, 1st ed.)
Simon A. Kingaby
R1,334 Discovery Miles 13 340 Ships in 12 - 17 working days

Design and build innovative, custom, data-driven Alexa skills for home or business. Working through several projects, this book teaches you how to build Alexa skills and integrate them with online APIs. If you have basic Python skills, this book will show you how to build data-driven Alexa skills. You will learn to use data to give your Alexa skills dynamic intelligence, in-depth knowledge, and the ability to remember. Data-Driven Alexa Skills takes a step-by-step approach to skill development. You will begin by configuring simple skills in the Alexa Skill Builder Console. Then you will develop advanced custom skills that use several Alexa Skill Development Kit features to integrate with lambda functions, Amazon Web Services (AWS), and Internet data feeds. These advanced skills enable you to link user accounts, query and store data using a NoSQL database, and access real estate listings and stock prices via web APIs. What You Will Learn Set up and configure your development environment properly the first time Build Alexa skills quickly and efficiently using Agile tools and techniques Create a variety of data-driven Alexa skills for home and business Access data from web applications and Internet data sources via their APIs Test with unit-testing frameworks throughout the development life cycle Manage and query your data using the DynamoDb NoSQL database engines Who This Book Is For Developers who wish to go beyond Hello World and build complex, data-driven applications on Amazon's Alexa platform; developers who want to learn how to use Lambda functions, the Alexa Skills SDK, Alexa Presentation Language, and Alexa Conversations; developers interested in integrating with public APIs such as real estate listings and stock market prices. Readers will need to have basic Python skills.

Speech and Computer - 15th International Conference, SPECOM 2013, September 1-5, 2013, Pilsen, Czech Republic, Proceedings... Speech and Computer - 15th International Conference, SPECOM 2013, September 1-5, 2013, Pilsen, Czech Republic, Proceedings (Paperback, 2013 ed.)
Milos Zelezny, Iwan Habernal, Andrey Ronzhin
R1,495 Discovery Miles 14 950 Ships in 10 - 15 working days

This book constitutes the refereed proceedings of the 15th International Conference on Speech and Computer, SPECOM 2013, held in Pilsen, Czech Republic. The 48 revised full papers presented were carefully reviewed and selected from 90 initial submissions. The papers are organized in topical sections on speech recognition and understanding, spoken language processing, spoken dialogue systems, speaker identification and diarization, speech forensics and security, language identification, text-to-speech systems, speech perception and speech disorders, multimodal analysis and synthesis, understanding of speech and text, and audio-visual speech processing.

Cross-Word Modeling for Arabic Speech Recognition (Paperback, 2012): Dia AbuZeina, Moustafa Elshafei Cross-Word Modeling for Arabic Speech Recognition (Paperback, 2012)
Dia AbuZeina, Moustafa Elshafei
R1,469 Discovery Miles 14 690 Ships in 10 - 15 working days

Cross-Word Modeling for Arabic Speech Recognition utilizes phonological rules in order to model the cross-word problem, a merging of adjacent words in speech caused by continuous speech, to enhance the performance of continuous speech recognition systems. The author aims to provide an understanding of the cross-word problem and how it can be avoided, specifically focusing on Arabic phonology using an HHM-based classifier.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Estimating Spoken Dialog System Quality…
Klaus-Peter Engelbrecht Hardcover R2,776 Discovery Miles 27 760
Digital Speech Processing Using Matlab
E.S. Gopi Hardcover R3,266 Discovery Miles 32 660
Self-Learning Speaker Identification - A…
Tobias Herbig, Franz Gerl, … Hardcover R2,890 Discovery Miles 28 900
Dialect Accent Features for Establishing…
Manisha Kulshreshtha, Ramkumar Mathur Hardcover R1,469 Discovery Miles 14 690
Speech and Audio Processing for Coding…
Tokunbo Ogunfunmi, Roberto Togneri, … Hardcover R4,211 R3,490 Discovery Miles 34 900
Speech Enhancement Techniques for…
Komal R. Borisagar, Rohit M. Thanki, … Hardcover R2,783 Discovery Miles 27 830
Speech Processing in Embedded Systems
Priyabrata Sinha Hardcover R2,890 Discovery Miles 28 900
New Era for Robust Speech Recognition…
Shinji Watanabe, Marc Delcroix, … Hardcover R5,475 Discovery Miles 54 750
Handbook of Research on Recent…
Siddhartha Bhattacharyya, Nibaran Das, … Hardcover R9,426 Discovery Miles 94 260
Proactive Spoken Dialogue Interaction in…
Petra-Maria Strauss, Wolfgang Minker Hardcover R2,894 Discovery Miles 28 940

 

Partners