|
Books > Computing & IT > Applications of computing > Audio processing
 |
Advances in Multimedia Information Processing -- PCM 2015
- 16th Pacific-Rim Conference on Multimedia, Gwangju, South Korea, September 16-18, 2015, Proceedings, Part II
(Paperback, 1st ed. 2015)
Yo-Sung Ho, Jitao Sang, Yong Man Ro, Junmo Kim, Fei Wu
|
R1,642
Discovery Miles 16 420
|
Ships in 10 - 15 working days
|
|
The two-volume proceedings LNCS 9314 and 9315, constitute the
proceedings of the 16th Pacific-Rim Conference on Multimedia, PCM
2015, held in Gwangju, South Korea, in September 2015. The total of
138 full and 32 short papers presented in these proceedings was
carefully reviewed and selected from 224 submissions. The papers
were organized in topical sections named: image and audio
processing; multimedia content analysis; multimedia applications
and services; video coding and processing; multimedia
representation learning; visual understanding and recognition on
big data; coding and reconstruction of multimedia data with
spatial-temporal information; 3D image/video processing and
applications; video/image quality assessment and processing; social
media computing; human action recognition in social robotics and
video surveillance; recent advances in image/video processing; new
media representation and transmission technologies for emerging UHD
services.
 |
Speech and Computer
- 17th International Conference, SPECOM 2015, Athens, Greece, September 20-24, 2015, Proceedings
(Paperback, 1st ed. 2015)
Andrey Ronzhin, Rodmonga Potapova, Nikos Fakotakis
|
R1,589
Discovery Miles 15 890
|
Ships in 10 - 15 working days
|
|
This book constitutes the refereed proceedings of the 17th
International Conference on Speech and Computer, SPECOM 2015, held
in Athens, Greece, in September 2015. The 59 revised full papers
presented together with 2 invited talks were carefully reviewed and
selected from 104 initial submissions. The papers cover a wide
range of topics in the area of computer speech processing such as
recognition, synthesis, and understanding and related domains
including signal processing, language and text processing,
multi-modal speech processing or human-computer interaction.
 |
Text, Speech, and Dialogue
- 18th International Conference, TSD 2015, Pilsen,Czech Republic, September 14-17, 2015, Proceedings
(Paperback, 1st ed. 2015)
Pavel Kral, Vaclav Matousek
|
R3,284
Discovery Miles 32 840
|
Ships in 10 - 15 working days
|
|
This book constitutes the refereed proceedings of the 18th
International Conference on Text, Speech and Dialogue, TSD 2015,
held in Pilsen, Czech Republic, in September 2015. The 67 papers
presented together with 3 invited papers were carefully reviewed
and selected from 138 submissions. They focus on topics such as
corpora and language resources; speech recognition; tagging,
classification and parsing of text and speech; speech and spoken
language generation; semantic processing of text and speech;
integrating applications of text and speech processing; automatic
dialogue systems; as well as multimodal techniques and modelling.
 |
Speech and Computer
- 15th International Conference, SPECOM 2013, September 1-5, 2013, Pilsen, Czech Republic, Proceedings
(Paperback, 2013 ed.)
Milos Zelezny, Iwan Habernal, Andrey Ronzhin
|
R1,547
Discovery Miles 15 470
|
Ships in 10 - 15 working days
|
|
This book constitutes the refereed proceedings of the 15th
International Conference on Speech and Computer, SPECOM 2013, held
in Pilsen, Czech Republic. The 48 revised full papers presented
were carefully reviewed and selected from 90 initial submissions.
The papers are organized in topical sections on speech recognition
and understanding, spoken language processing, spoken dialogue
systems, speaker identification and diarization, speech forensics
and security, language identification, text-to-speech systems,
speech perception and speech disorders, multimodal analysis and
synthesis, understanding of speech and text, and audio-visual
speech processing.
With Computational Thinking in Sound, veteran educators Gena R.
Greher and Jesse M. Heines provide the first book ever written for
music fundamentals educators which is devoted specifically to
music, sound, and technology. The authors demonstrate how the range
of mental tools in computer science - for example, analytical
thought, system design, and problem design and solution - can be
fruitfully applied to music education, including examples of
successful student work. While technology instruction in music
education has traditionally focused on teaching how computers and
software work to produce music, Greher and Heines offer context: a
clear understanding of how music technology can be structured
around a set of learning challenges and tasks of the type common in
computer science classrooms. Using a learner-centered approach that
emphasizes project-based experiences, the book provides music
educators with multiple strategies to explore, create, and solve
problems with music and technology in equal parts. It also provides
examples of hands-on activities which encourage students, alone and
in interdisciplinary groups, to explore the basic principles that
underlie today's music technology and which expose them to current
multimedia development tools.
Design and build innovative, custom, data-driven Alexa skills for
home or business. Working through several projects, this book
teaches you how to build Alexa skills and integrate them with
online APIs. If you have basic Python skills, this book will show
you how to build data-driven Alexa skills. You will learn to use
data to give your Alexa skills dynamic intelligence, in-depth
knowledge, and the ability to remember. Data-Driven Alexa Skills
takes a step-by-step approach to skill development. You will begin
by configuring simple skills in the Alexa Skill Builder Console.
Then you will develop advanced custom skills that use several Alexa
Skill Development Kit features to integrate with lambda functions,
Amazon Web Services (AWS), and Internet data feeds. These advanced
skills enable you to link user accounts, query and store data using
a NoSQL database, and access real estate listings and stock prices
via web APIs. What You Will Learn Set up and configure your
development environment properly the first time Build Alexa skills
quickly and efficiently using Agile tools and techniques Create a
variety of data-driven Alexa skills for home and business Access
data from web applications and Internet data sources via their APIs
Test with unit-testing frameworks throughout the development life
cycle Manage and query your data using the DynamoDb NoSQL database
engines Who This Book Is For Developers who wish to go beyond Hello
World and build complex, data-driven applications on Amazon's Alexa
platform; developers who want to learn how to use Lambda functions,
the Alexa Skills SDK, Alexa Presentation Language, and Alexa
Conversations; developers interested in integrating with public
APIs such as real estate listings and stock market prices. Readers
will need to have basic Python skills.
Why don't Guitar Hero players just pick up real guitars? What
happens when millions of people play the role of a young black gang
member in Grand Theft Auto: San Andreas? How are YouTube-based
music lessons changing the nature of amateur musicianship? This
book is about play, performance, and participatory culture in the
digital age. Miller shows how video games and social media are
bridging virtual and visceral experience, creating dispersed
communities who forge meaningful connections by "playing along"
with popular culture. Playing Along reveals how digital media are
brought to bear in the transmission of embodied knowledge: how a
Grand Theft Auto player uses a virtual radio to hear with her
avatar's ears; how a Guitar Hero player channels the experience of
a live rock performer; and how a beginning guitar student
translates a two-dimensional, pre-recorded online music lesson into
three-dimensional physical practice and an intimate relationship
with a distant teacher. Through a series of engaging ethnographic
case studies, Miller demonstrates that our everyday experiences
with interactive digital media are gradually transforming our
understanding of musicality, creativity, play, and participation.
Unleash your iPod touch and take it to the limit using secret tips
and techniques. Fast and fun to read, Taking Your iPod touch 5 to
the Max will help you get the most out of iOS 5 on your iPod touch.
You'll find all the best undocumented tricks, as well as the most
efficient and enjoyable introduction to the iPod touch available.
Starting with the basics, you'll quickly move on to discover the
iPod touch's hidden potential, like how to connect to a TV and get
contract-free VoIP. From e-mail and surfing the Web, to using
iTunes, iBooks, games, photos, ripping DVDs and getting free VoIP
with Skype or FaceTime--whether you have a new iPod touch, or an
older iPod touch with iOS 5, you'll find it all in this book.
You'll even learn tips on where to get the best and cheapest iPod
touch accessories. Get ready to take iPod touch to the max What
you'll learn * How to get your music, videos, and data onto your
iPod touch * How to manage your media * Tips for shopping in the
App Store and iTunes Store * Getting the most out of iBooks * Using
Mail on your iPod touch * Keeping in touch with FaceTime Who this
book is for Anyone who wants to get the most out of their iPod
touch 5.Table of Contents * Bringing Home the iPod touch * Putting
Your Data and Media on the iPod touch * Interacting with Your iPod
touch * Browsing with Wi-fi and Safari * Touching Photos and Videos
* Touching Your Music * Shopping at the iTunes Store * Shopping at
the App Store * Reading and Buying Books with iBooks * Setting Up
and Using Mail * Staying on Time and Getting There * Using your
Desk Set * Photographing and Recording the World Around You * Video
Calling with FaceTime * Customizing Your iPod touch
Cross-Word Modeling for Arabic Speech Recognition utilizes
phonological rules in order to model the cross-word problem, a
merging of adjacent words in speech caused by continuous speech, to
enhance the performance of continuous speech recognition systems.
The author aims to provide an understanding of the cross-word
problem and how it can be avoided, specifically focusing on Arabic
phonology using an HHM-based classifier.
Automatic speech recognition (ASR) systems are finding
increasing use in everyday life. Many of the commonplace
environments where the systems are used are noisy, for example
users calling up a voice search system from a busy cafeteria or a
street. This can result in degraded speech recordings and adversely
affect the performance of speech recognition systems. As the use of
ASR systems increases, knowledge of the state-of-the-art in
techniques to deal with such problems becomes critical to system
and application engineers and researchers who work with or on ASR
technologies. This book presents a comprehensive survey of the
state-of-the-art in techniques used to improve the robustness of
speech recognition systems to these degrading external
influences.
Key features: Reviews all the main noise robust ASR approaches,
including signal separation, voice activity detection, robust
feature extraction, model compensation and adaptation, missing data
techniques and recognition of reverberant speech.Acts as a timely
exposition of the topic in light of more widespread use in the
future of ASR technology in challenging environments.Addresses
robustness issues and signal degradation which are both key
requirements for practitioners of ASR.Includes contributions from
top ASR researchers from leading research units in the field
|
|