![]() |
Welcome to Loot.co.za!
Sign in / Register |Wishlists & Gift Vouchers |Help | Advanced search
|
Your cart is empty |
||
|
Books > Computing & IT > Applications of computing > Audio processing
An examination of more than sixty years of successes and failures in developing technologies that allow computers to understand human spoken language. Stanley Kubrick's 1968 film 2001: A Space Odyssey famously featured HAL, a computer with the ability to hold lengthy conversations with his fellow space travelers. More than forty years later, we have advanced computer technology that Kubrick never imagined, but we do not have computers that talk and understand speech as HAL did. Is it a failure of our technology that we have not gotten much further than an automated voice that tells us to "say or press 1"? Or is there something fundamental in human language and speech that we do not yet understand deeply enough to be able to replicate in a computer? In The Voice in the Machine, Roberto Pieraccini examines six decades of work in science and technology to develop computers that can interact with humans using speech and the industry that has arisen around the quest for these technologies. He shows that although the computers today that understand speech may not have HAL's capacity for conversation, they have capabilities that make them usable in many applications today and are on a fast track of improvement and innovation. Pieraccini describes the evolution of speech recognition and speech understanding processes from waveform methods to artificial intelligence approaches to statistical learning and modeling of human speech based on a rigorous mathematical model-specifically, Hidden Markov Models (HMM). He details the development of dialog systems, the ability to produce speech, and the process of bringing talking machines to the market. Finally, he asks a question that only the future can answer: will we end up with HAL-like computers or something completely unexpected?
The Book of Audacity is the definitive guide to Audacity, the powerful, free, cross-platform audio editor. Audacity allows anyone to transform their Windows, Mac, or Linux computer into a powerful recording studio. The Book of Audacity is the perfect book for bands on a budget, solo artists, audiophiles, and anyone who wants to learn more about digital audio. Musician and podcaster Carla Schroder will guide you through a range of fun and useful Audacity projects that will demystify that geeky audio jargon and show you how to get the most from Audacity. You ll learn how to: Record podcasts, interviews, and live performances Be your own backing band or chorus Edit, splice, mix, and master multitrack recordings Create super high-fidelity and surround-sound recordings Digitize your vinyl or tape collection and clean up noise, hisses, and clicks Create custom ringtones and sweet special effects In addition, you ll learn how to choose and use digital audio hardware like mics and pream
Below the level of the musical note lies the realm of microsound, of sound particles lasting less than one-tenth of a second. Recent technological advances allow us to probe and manipulate these pinpoints of sound, dissolving the traditional building blocks of music -- notes and their intervals -- into a more fluid and supple medium. The sensations of point, pulse (series of points), line (tone), and surface (texture) emerge as particle density increases. Sounds coalesce, evaporate, and mutate into other sounds.Composers have used theories of microsound in computer music since the 1950s. Distinguished practitioners include Karlheinz Stockhausen and Iannis Xenakis. Today, with the increased interest in computer and electronic music, many young composers and software synthesis developers are exploring its advantages. Covering all aspects of composition with sound particles, Microsound offers composition theory, historical accounts, technical overviews, acoustical experiments, descriptions of musical works, and aesthetic reflections. The book is accompanied by an audio CD of examples.
The first work to propose a comprehensive musicological framework to study sound-based music, a rapidly developing body of work that includes electroacoustic art music, turntable composition, and acoustic and digital sound installations. The art of sound organization, also known as electroacoustic music, uses sounds not available to traditional music making, including prerecorded, synthesized, and processed sounds. The body of work of such sound-based music (which includes electroacoustic art music, turntable composition, computer games, and acoustic and digital sound installations) has developed more rapidly than its musicology. Understanding the Art of Sound Organization proposes the first general foundational framework for the study of the art of sound organization, defining terms, discussing relevant forms of music, categorizing works, and setting sound-based music in interdisciplinary contexts. Leigh Landy's goal in this book is not only to create a theoretical framework but also to make the work more accessible-to suggest a way to understand sound-based music, to give a listener what he terms "something to hold on to," for example, by connecting elements in a work to everyday experience. Landy considers the difficulties of categorizing works and discusses such types of works as sonic art and electroacoustic music, pointing out where they overlap and how they are distinctive. He proposes a "sound-based music paradigm" that transcends such traditional categories as art and pop music. Landy defines patterns that suggest a general framework and places the studies of sound-based music into interdisciplinary contexts, from acoustics to semiotics, proposing a holistic research approach that considers the interconnectedness of a given work's history, theory, technological aspects, and social impact. The author's ElectroAcoustic Resource Site (EARS, www.ears.dmu.ac.uk), the architecture of which parallels this book's structure, offers updated bibliographic resource abstracts and related information.
How interactive voice-based technology can tap into the automatic and powerful responses all speech-whether from human or machine-evokes. Interfaces that talk and listen are populating computers, cars, call centers, and even home appliances and toys, but voice interfaces invariably frustrate rather than help. In Wired for Speech, Clifford Nass and Scott Brave reveal how interactive voice technologies can readily and effectively tap into the automatic responses all speech-whether from human or machine-evokes. Wired for Speech demonstrates that people are "voice-activated": we respond to voice technologies as we respond to actual people and behave as we would in any social situation. By leveraging this powerful finding, voice interfaces can truly emerge as the next frontier for efficient, user-friendly technology. Wired for Speech presents new theories and experiments and applies them to critical issues concerning how people interact with technology-based voices. It considers how people respond to a female voice in e-commerce (does stereotyping matter?), how a car's voice can promote safer driving (are "happy" cars better cars?), whether synthetic voices have personality and emotion (is sounding like a person always good?), whether an automated call center should apologize when it cannot understand a spoken request ("To Err is Interface; To Blame, Complex"), and much more. Nass and Brave's deep understanding of both social science and design, drawn from ten years of research at Nass's Stanford laboratory, produces results that often challenge conventional wisdom and common design practices. These insights will help designers and marketers build better interfaces, scientists construct better theories, and everyone gain better understandings of the future of the machines that speak with us.
This book is intended to be an easy-to-use, practical guide to all that Pro Tools has to offer. If you are a beginner, you should have absolutely no problem setting up your system and getting started. If you already use Pro Tools regularly, you should find the answers to questions you have always wanted to know. Either way, the ultimate goal is to give all Pro Tools users a reliable reference guide to any issues they may encounter.
Publisher's Note: Products purchased from Third Party sellers are not guaranteed by the publisher for quality, authenticity, or access to any online entitlements included with the product. The first edition of this book was written six years ago. Since then, there have been some significant developments in the area of car audio (and video). In addition, many of the products featured in the first edition are now obsolete. While the first edition of the book continues to sell, we have seen a bit of a slow-down at major accounts. This edition promises to be even more successful than the last.Car Stereo Cookbook, 2e is a completely revamped edition of a hugely successful title that continues to sell. This revised book will include new information on mobile video, satellite radio, mp3, wma, digital broadcast radio, and will eliminate the out-of-date products that are no longer pertinent.
"Virtual Music "is about artificial creativity. Focusing on the author's Experiments in Musical Intelligence computer music composing program, the author and a distinguished group of experts discuss many of the issues surrounding the program, including artificial intelligence, music cognition, and aesthetics. The book is divided into four parts. The first part provides a historical background to Experiments in Musical Intelligence, including examples of historical antecedents, followed by an overview of the program by Douglas Hofstadter. The second part follows the composition of an Experiments in Musical Intelligence work, from the creation of a database to the completion of a new work in the style of Mozart. It includes, in sophisticated lay terms, relatively detailed explanations of how each step in the process contributes to the final composition. The third part consists of perspectives and analyses by Jonathan Berger, Daniel Dennett, Bernard Greenberg, Douglas R. Hofstadter, Steve Larson, and Eleanor Selfridge-Field. The fourth part presents the author's responses to these commentaries, as well as his thoughts on the implications of artificial creativity. The book (and corresponding Web site) includes an appendix providing extended musical examples referred to and discussed in the book, including composers such as Scarlatti, Bach, Mozart, Beethoven, Schubert, Chopin, Puccini, Rachmaninoff, Prokofiev, Debussy, Bartok, and others. It is also accompanied by a CD containing performances of the music in the text.
Imagine being able to carry your entire music library with you everywhere you go. You can do all of this and more with the Dell DJ, and this resource shows you how to harness its power. Get set up and purchase music from Musicmatch and other online music stores, then learn to build a digital audio library from your CDs, cassettes, and LPs. Next, discover advanced topics such as choosing accessories that meet your needs, troubleshooting problems, and even how to use your Dell DJ to back up and store your important documents. From the basics to more advanced topics, you'll get expert tips and invaluable tricks to enhance your Dell DJ experience.Install Dell DJ software and connect your Dell DJ to your PC Find your way around Musicmatch Jukebox and load tunes into its library Create a playlist with Musicmatch or directly with your Dell DJ Rip songs from CDs, LPs, and cassettes Load songs onto your Dell DJ from your Musicmatch library Control the quality of sound files Optimize sound with the DJ's equalizer and create custom equalizer settings Learn about digital rights management Troubleshoot your Dell DJ and find help online About the authors: Rick Broida is a freelance writer and regular contributor to CNET, Computer Shopper, and MobilePC Magazine. He is the founder and former editor of Handheld Computing Magazine and co-author of the best-selling books "How to Do Everything with Your Palm Handheld" and "How to Do Everything with Your Sony CLIE."Dave Johnson is the former editor of Mobility magazine and a popular technology journalist. He is author of the best-selling book "How to Do Everything with Your Digital Camera" and co-author of "How to Do Everything with Your PalmHandheld."
Winner of the 2003 Emerging Scholar Award, presented by the Society for Music Theory In this book, David Temperley addresses a fundamental question about music cognition: how do we extract basic kinds of musical information, such as meter, phrase structure, counterpoint, pitch spelling, harmony, and key from music as we hear it? Taking a computational approach, Temperley develops models for generating these aspects of musical structure. The models he proposes are based on "preference rules," which are criteria for evaluating a possible structural analysis of a piece of music. A preference rule system evaluates many possible interpretations and chooses the one that best satisfies the rules. After an introductory chapter, Temperley presents preference rule systems for generating six basic kinds of musical structure: meter, phrase structure, contrapuntal structure, harmony, and key, as well as pitch spelling (the labeling of pitch events with spellings such as A flat or G sharp). He suggests that preference rule systems not only show how musical structures are inferred, but also shed light on other aspects of music. He substantiates this claim with discussions of musical ambiguity, retrospective revision, expectation, and music outside the Western canon (rock and traditional African music). He proposes a framework for the description of musical styles based on preference rule systems and explores the relevance of preference rule systems to higher-level aspects of music, such as musical schemata, narrative and drama, and musical tension.
Get downloading and get down! This step-by-step resource teaches you techniques for making the most of iTunes. Manage a library of music, automate playback of your favorite music, burn audio CDs, and load and unload songs from Apple's iPod or another external MP3 player. Discover the multitude of ways you can search and browse the iTunes Music Store. Record, edit, and share audio over a local network. Author Todd Stauffer provides plenty of information and instruction in a smartly written style that communicates clearly in non-technical language. Make musical magic with help from "How to Do Everything with iTunes for Macintosh and Windows."Organize, manipulate, and manage your music Instantly buy and download music from the iTunes Music Store Encode, translate, and record your own audio Share music within the same household Add and view album artwork Search by title, style, artist, genre, album, and more Create archives, libraries, and playlists Use iTunes with other iLife '04 applications Customize the Visualizer and hook iTunes to your stereo Run iTunes like a DJ and keep your party thumping Import and work with AAC, MP3, AIFF, and WAV audio files About the author: Todd Stauffer is the best-selling author of all three editions of "How to Do Everything with Your iMac," and the originator of the "How to Do Everything" series.
Regardless of whether your taste is heavy metal, hip-hop, or classical, this book will show you how to set up, operate, and maintain a working home recording studio. Learn to choose the right equipment for your music, produce audio for the Internet or a CD, and even get construction tips for building an optimal recording environment.
Learn the perfect way to rip, copy, record, and burn your favorite songs, using free or open-source software. From mash-ups that combine existing hits, to remixes of your favorite songs, you’ll be ready to tackle it all—even if you start as a novice who doesn’t know BPM from MP3.
Get full coverage of Apples hot new suite of fun, creative tools for making movies, photos, dvds and custom music in one volume! Now you can see how easy it is to make your own customized digital creations using iMovie, iPhoto, iDVD, and iTunes with help from this detailed, results-oriented guide.
The digital turn has created new opportunities for scholars across disciplines to use sound in their scholarship. This volume's contributors provide a blueprint for making sound central to research, teaching, and dissemination. They show how digital sound studies has the potential to transform silent, text-centric cultures of communication in the humanities into rich, multisensory experiences that are more inclusive of diverse knowledges and abilities. Drawing on multiple disciplines-including rhetoric and composition, performance studies, anthropology, history, and information science-the contributors to Digital Sound Studies bring digital humanities and sound studies into productive conversation while probing the assumptions behind the use of digital tools and technologies in academic life. In so doing, they explore how sonic experience might transform our scholarly networks, writing processes, research methodologies, pedagogies, and knowledges of the archive. As they demonstrate, incorporating sound into scholarship is thus not only feasible but urgently necessary. Contributors. Myron M. Beasley, Regina N. Bradley, Steph Ceraso, Tanya Clement, Rebecca Dowd Geoffroy-Schwinden, W. F. Umi Hsu, Michael J. Kramer, Mary Caton Lingold, Darren Mueller, Richard Cullen Rath, Liana M. Silva, Jonathan Sterne, Jennifer Stoever, Jonathan W. Stone, Joanna Swafford, Aaron Trammell, Whitney Trettien
Your Definitive Professional Resource Develop real-world voice-based applications using this authoritative one-of-a-kind guide. Featuring in-depth coverage of both core and emerging topics within voice-enabled technology, this book explains everything from setting up a simple voice mail system to developing advanced multi-model voice applications using the newest Web telephony engine. You'll learn how to integrate VoiceXML with other key technologies such as ASP, JSP, ColdFusion, CCXML, and SALT. All examples are based on today's most current hardware. Containing project specifications, guidelines, deployment procedures--as well as actual case studies with all source code--this practical resource will change the way you develop next-generation voice-based applications.Design dialog flow and navigation architecture and learn guidelines for voice applications Manage content and identify target audience Learn VoiceXML document structure and execute multi-document-based applications Develop voice mail and voice banking systems using ASP and VoiceXML Identify the scope and role of grammars in VoiceXML 2.0 Use JSP to interact with databases and write code for front-end dialogs Understand the benefits and components of the Microsoft Web telephony engine Write CCXML programs and integrate CCXML with VoiceXML applications Produce speech output and speech input in SALT
A hot new release in the extremely popular How to Do Everything series, this friendly, solutions-oriented book is filled with step-by-step details on how to create, download, upload, play--and even remaster--MP3 and digital music files. Youll also get a completely up-to-date survey of peer-to-peer file sharing services like Napster, Aimster, Gnutella, and others, and details on all the latest and best MP3 players, both desktop and portable.
Selling Digital Music, Formatting Culture documents the transition of recorded music on CDs to music as digital files on computers. More than two decades after the first digital music files began circulating in online archives and playing through new software media players, we have yet to fully internalize the cultural and aesthetic consequences of these shifts. Tracing the emergence of what Jeremy Wade Morris calls the "digital music commodity," Selling Digital Music, Formatting Culture considers how a conflicted assemblage of technologies, users, and industries helped reformat popular music's meanings and uses. Through case studies of five key technologies - Winamp, metadata, Napster, iTunes, and cloud computing - this book explores how music listeners gradually came to understand computers and digital files as suitable replacements for their stereos and CD. Morris connects industrial production, popular culture, technology, and commerce in a narrative involving the aesthetics of music and computers, and the labor of producers and everyday users, as well as the value that listeners make and take from digital objects and cultural goods. Above all, Selling Digital Music, Formatting Culture is a sounding out of music's encounters with the interfaces, metadata, and algorithms of digital culture and of why the shifting form of the music commodity matters for the music and other media we love.
In Max/MSP/Jitter for Music, expert author and music technologist V. J. Manzo provides a user-friendly introduction to a powerful programming language that can be used to write custom software for musical interaction. Through clear, step-by-step instructions illustrated with numerous examples of working systems, the book equips readers with everything they need to know in order to design and complete meaningful music projects. The book also discusses ways to interact with software beyond the mouse and keyboard through use of camera tracking, pitch tracking, video game controllers, sensors, mobile devices, and more. The book does not require any prerequisite programming skills, but rather walks readers through a series of small projects through which they will immediately begin to develop software applications for practical musical projects. As the book progresses, and as the individual's knowledge of the language grows, the projects become more sophisticated. This new and expanded second edition brings the book fully up-to-date including additional applications in integrating Max with Ableton Live. It also includes a variety of additional projects as part of the final three project chapters. The book is of special value both to software programmers working in Max/MSP/Jitter and to music educators looking to supplement their lessons with interactive instructional tools, develop adaptive instruments to aid in student composition and performance activities, and create measurement tools with which to conduct music education research.
This book presents a systematic approach to the automatic recognition of simultaneous speech signals using computational auditory scene analysis. Inspired by human auditory perception, this book investigates a range of algorithms and techniques for decomposing multiple speech signals by integrating a spectro-temporal fragment decoder within a statistical search process. The outcome is a comprehensive insight into the mechanisms required if automatic speech recognition is to approach human levels of performance. |
You may like...
Multimodal Behavior Analysis in the Wild…
Xavier Alameda-Pineda, Elisa Ricci, …
Paperback
Multisensory Experiences - Where the…
Carlos Velasco, Marianna Obrist
Hardcover
R1,169
Discovery Miles 11 690
Multilingual Speech Processing
Tanja Schultz, Katrin Kirchhoff
Hardcover
R1,823
Discovery Miles 18 230
Stochastically-Based Semantic Analysis
Wolfgang Minker, Alex Waibel, …
Hardcover
R2,779
Discovery Miles 27 790
Digital Tools for Computer Music…
Dionysios Politis, Miltiadis Tsaligopoulos, …
Hardcover
R4,520
Discovery Miles 45 200
|