![]() |
Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
||
|
Books > Computing & IT > Applications of computing > Audio processing
This book constitutes the refereed proceedings of the 4th International Conference on Statistical Language and Speech Processing, SLSP 2016, held in Pilsen, Czech Republic, in October 2016. The 11 full papers presented together with two invited talks were carefully reviewed and selected from 38 submissions. The papers cover topics such as anaphora and coreference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semantic web; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling.
The two-volume proceedings LNCS 9314 and 9315, constitute the proceedings of the 16th Pacific-Rim Conference on Multimedia, PCM 2015, held in Gwangju, South Korea, in September 2015. The total of 138 full and 32 short papers presented in these proceedings was carefully reviewed and selected from 224 submissions. The papers were organized in topical sections named: image and audio processing; multimedia content analysis; multimedia applications and services; video coding and processing; multimedia representation learning; visual understanding and recognition on big data; coding and reconstruction of multimedia data with spatial-temporal information; 3D image/video processing and applications; video/image quality assessment and processing; social media computing; human action recognition in social robotics and video surveillance; recent advances in image/video processing; new media representation and transmission technologies for emerging UHD services.
This book constitutes the refereed proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015, held in Pilsen, Czech Republic, in September 2015. The 67 papers presented together with 3 invited papers were carefully reviewed and selected from 138 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.
This book constitutes the refereed proceedings of the 17th International Conference on Speech and Computer, SPECOM 2015, held in Athens, Greece, in September 2015. The 59 revised full papers presented together with 2 invited talks were carefully reviewed and selected from 104 initial submissions. The papers cover a wide range of topics in the area of computer speech processing such as recognition, synthesis, and understanding and related domains including signal processing, language and text processing, multi-modal speech processing or human-computer interaction.
This book gives an overview of the research and application of speech technologies in different areas. One of the special characteristics of the book is that the authors take a broad view of the multiple research areas and take the multidisciplinary approach to the topics. One of the goals in this book is to emphasize the application. User experience, human factors and usability issues are the focus in this book.
This book focuses on speech processing in the presence of low-bit rate coding and varying background environments. The methods presented in the book exploit the speech events which are robust in noisy environments. Accurate estimation of these crucial events will be useful for carrying out various speech tasks such as speech recognition, speaker recognition and speech rate modification in mobile environments. The authors provide insights into designing and developing robust methods to process the speech in mobile environments. Covering temporal and spectral enhancement methods to minimize the effect of noise and examining methods and models on speech and speaker recognition applications in mobile environments.
This straightforward introduction to audio techniques guides the
beginner through principles such as sound waves and basic acoustics
and offers practical advice for using recording and reproduction
equipment. Previously known as Audio Explained, this latest edition
includes new material on: reverberation and its use in recording;
principles of digital mixing; digital recording; including MiniDisc
and MP3; digital artificial reverberation.
This book constitutes the refereed proceedings of the 15th International Conference on Speech and Computer, SPECOM 2013, held in Pilsen, Czech Republic. The 48 revised full papers presented were carefully reviewed and selected from 90 initial submissions. The papers are organized in topical sections on speech recognition and understanding, spoken language processing, spoken dialogue systems, speaker identification and diarization, speech forensics and security, language identification, text-to-speech systems, speech perception and speech disorders, multimodal analysis and synthesis, understanding of speech and text, and audio-visual speech processing.
This anthology examines the various facets of video game music. Contributors from the fields of science and practice document its historical development, discuss the music's composition techniques, interactivity and function as well as attending to its performative aspects.
Why don't Guitar Hero players just pick up real guitars? What happens when millions of people play the role of a young black gang member in Grand Theft Auto: San Andreas? How are YouTube-based music lessons changing the nature of amateur musicianship? This book is about play, performance, and participatory culture in the digital age. Miller shows how video games and social media are bridging virtual and visceral experience, creating dispersed communities who forge meaningful connections by "playing along" with popular culture. Playing Along reveals how digital media are brought to bear in the transmission of embodied knowledge: how a Grand Theft Auto player uses a virtual radio to hear with her avatar's ears; how a Guitar Hero player channels the experience of a live rock performer; and how a beginning guitar student translates a two-dimensional, pre-recorded online music lesson into three-dimensional physical practice and an intimate relationship with a distant teacher. Through a series of engaging ethnographic case studies, Miller demonstrates that our everyday experiences with interactive digital media are gradually transforming our understanding of musicality, creativity, play, and participation.
Design and build innovative, custom, data-driven Alexa skills for home or business. Working through several projects, this book teaches you how to build Alexa skills and integrate them with online APIs. If you have basic Python skills, this book will show you how to build data-driven Alexa skills. You will learn to use data to give your Alexa skills dynamic intelligence, in-depth knowledge, and the ability to remember. Data-Driven Alexa Skills takes a step-by-step approach to skill development. You will begin by configuring simple skills in the Alexa Skill Builder Console. Then you will develop advanced custom skills that use several Alexa Skill Development Kit features to integrate with lambda functions, Amazon Web Services (AWS), and Internet data feeds. These advanced skills enable you to link user accounts, query and store data using a NoSQL database, and access real estate listings and stock prices via web APIs. What You Will Learn Set up and configure your development environment properly the first time Build Alexa skills quickly and efficiently using Agile tools and techniques Create a variety of data-driven Alexa skills for home and business Access data from web applications and Internet data sources via their APIs Test with unit-testing frameworks throughout the development life cycle Manage and query your data using the DynamoDb NoSQL database engines Who This Book Is For Developers who wish to go beyond Hello World and build complex, data-driven applications on Amazon's Alexa platform; developers who want to learn how to use Lambda functions, the Alexa Skills SDK, Alexa Presentation Language, and Alexa Conversations; developers interested in integrating with public APIs such as real estate listings and stock market prices. Readers will need to have basic Python skills.
The new iOS 5-driveniPod touch devicesare much more than just music. These have all the features of a PDAincluding email, calendar, Google Maps, the App Store, and even phone capabilitiesas well as the ability to watch movies and play your favorite games, all packed into Apple's sleek design. With iPod touch Made Simple, iOS 5 Edition, you'll learn how to take advantage of all these features and more, now available using the new iOS 5. Packed with over 1,000 visuals and screenshots, this book will help you master all the functions of the iPod touch devices that run iOS 5and teach you time-saving techniques and tips along the way. What you'll learn Support for both Windows and Mac users Sync and manage all your music on the iPod touch or your computer Find the best App Store applications and games Save time with copy/paste & Spotlight search Play music, videos, TV shows, and podcasts Sync playlists, videos, contacts, calendar, and notes Fast email, phone, calendar, and browser tips Use Google Maps to find just about anything Bluetooth and Wi-Fi network setup & security All the best tips and tricks for the touch screen Who this book is for This book is for those new to theiPod touch or the iPod touch devices running the new and latest iOS 5and even for seasoned users who want to learn new tips and techniques. Table of Contents Getting Started with iPod touch Typing Tips, Copy/Paste, and Search Sync Your iPod touch with iTunes Other Sync Methods Wi-Fi Connectivity Organize Your iPod touch Icons and Folders Personalize and Secure Your iPod touch Multitasking and Voice Control Playing Music Viewing Videos, TV Shows, and More iBooks and E-Books Surfing the Web with Safari FaceTime Video Messaging and Skype Email on Your iPod touch Working with Contacts Your Calendar iPod touch Photography Recording and Editing Videos iTunes on Your iPod touch The Amazing App Store Games and Fun Social Networking Eliminate Your Paper Notes Bluetooth on the iPod touch Utilities: Clock, Calculator, and Weather New Media: Reading Newspapers, Magazines, and More Find Your Way with Maps Troubleshooting Your iPod touch Your iTunes User Guide
Apple's iPods continue to set the bar for media players, with bold new features like the Touch's supersized screen and Siri voice control. But iPods still lack a guide to all their features. That's where this full-color book comes in. It shows you how to play music, movies, and slideshows; shoot photos and videos; and navigate Apple's redesigned iTunes media-management program. The important stuff you need to know: Fill it up. Load your iPod with music, photos, movies, TV shows, games, ebooks, and podcasts. Manage your stuff. Download media and apps from the iTunes and App Stores, then organize your collection. Tackle the Touch. Send email and instant messages, make FaceTime calls, and shoot photos and HD video with the Touch's 5-megapixel camera. Go wireless. Use the Touch's new iOS 6 software to sync content wirelessly. Relish the Nano. Enjoy video and photos on the Nano's new big screen, and chart your workouts with the Nike+ pedometer. Master the Shuffle and Classic. Get mucho music on the little Shuffle, and use the Classic's giant hard drive to tote around your audio and video collections. Pump it up. Blast iPod tunes through your home and car stereo.
Cross-Word Modeling for Arabic Speech Recognition utilizes phonological rules in order to model the cross-word problem, a merging of adjacent words in speech caused by continuous speech, to enhance the performance of continuous speech recognition systems. The author aims to provide an understanding of the cross-word problem and how it can be avoided, specifically focusing on Arabic phonology using an HHM-based classifier.
The rapid development in various fields of Digital Audio Effects, or DAFX, has led to new algorithms and this second edition of the popular book, "DAFX: Digital Audio Effects" has been updated throughout to reflect progress in the field. It maintains a unique approach to DAFX with a lecture-style introduction into the basics of effect processing. Each effect description begins with the presentation of the physical and acoustical phenomena, an explanation of the signal processing techniques to achieve the effect, followed by a discussion of musical applications and the control of effect parameters. Topics covered include: filters and delays, modulators and demodulators, nonlinear processing, spatial effects, time-segment processing, time-frequency processing, source-filter processing, spectral processing, time and frequency warping musical signals. Updates to the second edition include: Three completely new chapters devoted to the major research areas of: Virtual Analog Effects, Automatic Mixing and Sound Source Separation, authored by leading researchers in the field .Improved presentation of the basic concepts and explanation of the related technology.Extended coverage of the MATLABTM scripts which demonstrate the implementation of the basic concepts into software programs. Companion website (www.dafx.de) which serves as the download source for MATLABTM scripts, will be updated to reflect the new material in the book. Discussing DAFX from both an introductory and advanced level, the book systematically introduces the reader to digital signal processing concepts, how they can be applied to sound and their use in musical effects. This makes the book suitable for a range of professionals including those working in audio engineering, as well as researchers and engineers involved in the area of digital signal processing along with students on multimedia related courses.
Written by two of the best and brightest podcasting pioneers, Podcast Solutions: The Complete Guide to Audio and Video Podcasting, Second Edition is a comprehensive and perceptive guide to all things podcasting. From downloading podcasts to producing your own for fun or profit, Podcast Solutions covers the entire world of podcasting with insight, humor, and the unmatched wisdom of experience. Big-name companies and podcasters throughout the United States and thousands of faithful listeners around the world will tell you that Michael W. Geoghegan ("Reel Reviews-Films Worth Watching" and GigaVox Media) and Dan Klass ("The Bitterest Pill" and JacketMedia.com) know how to put together compelling and engaging shows that people come backfor week after week. These two pros will guide you through everything, from developing your raw podcast ideas to selecting equipment, creating your podcast (including incorporating music, professional production techniques, and audio- and video-editing secrets), and mobilizing and growing an audience. Plenty has changed since the best-selling first edition of this book, and Michael and Dan bring you all the latest and greatest information on production, distribution, and marketing from the world of audio and video podcasting. Nearly 50 pages of new material and hundreds of updates make this the most complete and up-to-date book on podcasting imaginable. Between Michael's uncanny business and marketing sense and Dan's nearly two decades in the entertainment industry, these authors have the experience to back up their advice on what it takes to elevate your podcast to a professional level. Podcast Solutions gives you not only what youll need to know about podcasting, but also the insider's view on the business of new media production and marketing. Whether you want to use podcasting to inform, educate, entertain, or inspire, whether you are a complete novice or an experienced professional, Podcast Solutions is the guide you need.
Discover the exciting world of software-defined radio (SDR) through this hands-on, beginner-friendly introduction. Software-defined radio (SDR) is transforming wireless communications through flexible, inexpensive devices that can be programmed to receive AM and FM broadcasts, transmit signals over Wi-Fi, monitor GPS location data, communicate with the International Space Station, and more. This book provides a beginner-friendly introduction to this revolutionary technology. Its learn-by-doing approach will take you from total beginner to confident SDR practitioner, without confusing math or technical jargon. Working with intuitive, graphical software, you’ll explore how SDRs work, discover how to demodulate, filter, tune, and transmit analog radio signals—and get hooked on an exciting new hobby!
Go beyond HTML5's Audio tag and boost the audio capabilities of your web application with the Web Audio API. Packed with lots of code examples, crisp descriptions, and useful illustrations, this concise guide shows you how to use this JavaScript API to make the sounds and music of your games and interactive applications come alive. You need little or no digital audio expertise to get started. Author Boris Smus introduces you to digital audio concepts, then shows you how the Web Audio API solves specific application audio problems. You'll not only learn how to synthesize and process digital audio, you'll also explore audio analysis and visualization with this API. Learn Web Audio API, including audio graphs and the audio nodes Provide quick feedback to user actions by scheduling sounds with the API's precise timing model Control gain, volume, and loudness, and dive into clipping and crossfading Understand pitch and frequency: use tools to manipulate soundforms directly with JavaScript Generate synthetic sound effects and learn how to spatialize sound in 3D space Use Web Audio API with the Audio tag, getUserMedia, and the Page Visibility API
This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. In this case, the noise reduction filter in each frequency band is basically a real gain. Since a gain does not improve the signal-to-noise ratio (SNR) for any given subband and frame, the noise reduction is basically achieved by liftering the subbands and frames that are less noisy while weighing down on those that are more noisy. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain. The advantage of using the interframe correlation is that we can improve not only the long-time fullband SNR, but the frame-wise subband SNR as well. The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, we consider the interband correlation in the design of the noise reduction filters. We illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, we propose different optimization cost functions from which we derive the optimal filters and we also define the performance measures that help analyzing them.
Pure Data (Pd) is a graphical programming environment for audio and more; libpd is a wrapper that turns Pd into a portable, embeddable audio library. Brian Eno's soundtrack of the game Spore is generated by Pure Data. Inception The App is based on libpd and has been downloaded more than three million times. The popular RJDJ also uses the technology. The purpose of this book is to present tools and techniques for using Pure Data and libpd as an audio engine in mobile apps (for Android and iOS). The tools described are perfect for the sound engine for a game or for transforming a phone or tablet into an experimental instrument. After reading the book, audio developers will know how to prepare Pd patches for use with libpd, and app developers will know how to use all features of the libpd API. Readers with some experience in both computer music and mobile development will be able to create complete musical apps. The book includes a crash course in Pd, just enough to allow readers to make sounds and control them, as well as a discussion of existing solutions for rapidly deploying Pd patches to mobile devices. An introduction to Android or iOS development is beyond the scope of this book; readers will be expected to have a basic grasp of their platform of choice, including a working development setup. The book will, however, explain how to integrate libpd into an existing setup. A number of sample apps, ranging from minimal to full featured, for both Android and iOS, will illustrate all major points.
"Speech Processing and Soft Computing" includes coverage of synergy between speech technology and bio-inspired soft computing methods. Through practical cases, the author explores, dissects and examines how soft computing may complement conventional techniques in speech enhancement and speech recognition in order to provide robust systems. The material is especially useful to graduate students and experienced researchers who are interested in expanding their horizons and investigating new research directions through review of the theoretical and practical settings of soft computing methods in very recent speech applications.
Inside Computer Music is an investigation of how new technological developments have influenced the creative possibilities of composers of computer music in the last 50 years. This book combines detailed research into the development of computer music techniques with nine case studies that analyze key works in the musical and technical development of computer music. The book's companion website offers demonstration videos of the techniques used and downloadable software. There, readers can view interviews and test emulations of the software used by the composers for themselves. The software also presents musical analyses of each of the nine case studies to enable readers to engage with the musical structure aurally and interactively. |
You may like...
Short-Form Creative Writing - A Writer's…
H K Hummel, Stephanie Lenox
Hardcover
R3,034
Discovery Miles 30 340
Computers and the Teaching of Writing in…
Gail E Hawisher, Paul Le Blanc, …
Hardcover
Topics, Questions, Key Words - A…
Petra Hachenburger, Paul Jackson
Paperback
R1,297
Discovery Miles 12 970
Writing as a Learning Tool - Integrating…
Paivi Tynjala, L. Mason, …
Hardcover
R2,772
Discovery Miles 27 720
The Psychology of Written Composition
Carl Bereiter, Marlene Scardamalia
Hardcover
R4,505
Discovery Miles 45 050
|