Offers a non-technical overview of all the major areas in the computer processing of human speech: speech recognition; speech synthesis; speaker recognition; language identification; lip synchronisation; and co-channel separation. The text's intuitive approach uses illustrations, analogies, and both historical and state-of-the-art descriptions to explain relatively complex concepts. Specifically, it helps the reader learn the professional jargon used in different areas of speech processing, evaluate speech processing systems for specific applications, understand how the various technologies of speech processing actually work, identify practical applications for speech technology in the commercial world, and relate speech technology to actual spoken language.
Innovation in Music: Future Opportunities brings together cutting-edge research on new innovations in the field of music production, technology, performance and business. Including contributions from a host of well-respected researchers and practitioners, this volume provides crucial coverage on a range of topics from cybersecurity, to accessible music technology, performance techniques and the role of talent shows within music business. Innovation in Music: Future Opportunities is the perfect companion for professionals and researchers alike with an interest in the music industry.
Women in Audio features almost 100 profiles and stories of audio engineers who are women and have achieved success throughout the history of the trade. Beginning with a historical view, the book covers the achievements of women in various audio professions and then focuses on organizations that support and train women and girls in the industry. What follows are eight chapters divided by discipline, highlighting accomplished women in various audio fields: radio; sound for film and television; music recording and electronic music; hardware and software design; acoustics; live sound and sound for theater; education; audio for games, virtual reality, augmented reality, and mixed reality, as well as immersive sound. Women in Audio is a valuable resource for professionals, educators, and students looking to gain insight into the careers of trailblazing women in audio-related fields and represents required reading for those looking to add diversity to their music technology programs.
The Sound System Design Primer is an introduction to the many topics, technologies, and sub-disciplines that make up contemporary sound systems design. Written in clear, conversational language for those who do not have an engineering background, or who think more in language than in numbers, The Sound System Design Primer provides a solid foundation in this expanding discipline for students, early/mid-career system designers, creative and content designers seeking a better grasp on the technical side of things, and non-sound professionals who want or need to be able to speak intelligently with sound system designers.
This work provides an instructive introduction to applications and problems from the broad field of pattern recognition. It describes basic topics and the required mathematical background of image and speech processing. Algorithms and data structures for filtering, feature extraction, segmentation and classification are discussed, introducing and demonstrating different C++ concepts. The practice of object-oriented programming is illustrated by a step-wise development of a complete class library for image processing.
Sound and Image: Aesthetics and Practices brings together international artist scholars to explore diverse sound and image practices, applying critical perspectives to interrogate and evaluate both the aesthetics and practices that underpin the audiovisual. Contributions draw upon established discourses in electroacoustic music, media art history, film studies, critical theory and dance; framing and critiquing these arguments within the context of diverse audiovisual practices. The volume's interdisciplinary perspective contributes to the rich and evolving dialogue surrounding the audiovisual, demonstrating the value and significance of practice-informed theory, and theory derived from practice. The ideas and approaches explored within this book will find application in a wide range of contexts across the whole scope of audiovisuality, from visual music and experimental film, to narrative film and documentary, to live performance, sound design and into sonic art and electroacoustic music. This book is ideal for artists, composers and researchers investigating theoretical positions and compositional practices which bring together sound and image.
Text-to-Speech Synthesis provides a complete, end-to-end account of the process of generating speech by computer. Giving an in-depth explanation of all aspects of current speech synthesis technology, it assumes no specialized prior knowledge. Introductory chapters on linguistics, phonetics, signal processing and speech signals lay the foundation, with subsequent material explaining how this knowledge is put to use in building practical systems that generate speech. The book covers the very latest techniques, such as unit selection, hidden Markov model synthesis, and statistical text analysis, and also explains more traditional techniques such as formant synthesis and synthesis by rule. Weaving together the various strands of this multidisciplinary field, the book is designed for graduate students in electrical engineering, computer science, and linguistics. It is also an ideal reference for practitioners in the fields of human communication interaction and telephony.
This book presents a detailed description of Spoken Language Translator (SLT), one of the first major projects in the area of automatic speech translation. The SLT system can translate between English, French, and Swedish in the domain of air travel planning, using a vocabulary of about 1500 words, and with an accuracy of about 75 per cent. The greater part of the book describes the language processing components, which are largely built on top of the SRI Core Language Engine, using a combination of general grammars and techniques that allow them to be rapidly customized to specific domains. Speech recognition is based on Hidden Markov Model technology, and uses versions of the SRI DECIPHER system. This account of the Spoken Language Translator should be an essential resource both for those who wish to know what is achievable in spoken-language translation today, and for those who wish to understand how to achieve it.
This book explains the principles of biosignal processing and its practical applications using MATLAB. Topics include the emergence of biosignals, electrophysiology, analog and digital biosignal processing, signal discretization, electrodes, time and frequency analysis, analog and digital filters, Fourier-transformation, z-transformation, pattern recognition, statistical data analysis, physiological modelling and applications of EEG, ECG, EMG, PCG and PPG signals. Additional scientific contributions cover motion analysis, by guest authors Prof. Dr. J. Subke and B. Schneider, and classification of PPG signals, by Dr. U. Hackstein.
Whether you're comping a vocal track, restoring an old recording, working with dialogue or sound effects for film, or imposing your own vision with mash-ups or remixes, audio editing is a key skill for successful sound production. Digital Audio Editing gives you the techniques, from the simplest corrective editing like cutting, copying, and pasting to more complex creative editing, such as beat mapping and time-stretching. You'll be able to avoid unnatural-sounding pitch correction and understand the potential pitfalls you face when restoring classic tracks. Author Simon Langford invites you to see editing with his wide-angle view, putting this skill into a broad context that will inform your choices even as you more skillfully manipulate sound. Focusing on techniques applicable to any digital audio workstation, it includes break-outs giving specific keystrokes and instruction in Avid's Pro Tools, Apple's Logic Pro, Steinberg's Cubase, and PreSonus's Studio One. The companion website includes tutorials in all four software packages to help you immediately apply the broad skills from the book.
This volume constitutes selected papers presented at the Third International Conference on Artificial Intelligence and Speech Technology, AIST 2021, held in Delhi, India, in November 2021. The 36 full papers and 18 short papers presented were thoroughly reviewed and selected from the 178 submissions. They provide a discussion on application of Artificial Intelligence tools in speech analysis, representation and models, spoken language recognition and understanding, affective speech recognition, interpretation and synthesis, speech interface design and human factors engineering, speech emotion recognition technologies, audio-visual speech processing and several others.
Metal Music Manual shows you the creative and technical processes involved in producing contemporary heavy music for maximum sonic impact. From pre-production to final mastered product, and fundamental concepts to advanced production techniques, this book contains a world of invaluable practical information. Assisted by clear discussion of critical audio principles and theory, and a comprehensive array of illustrations, photos, and screen grabs, Metal Music Manual is the essential guide to achieving professional production standards. The extensive companion website features multi-track recordings, final mixes, processing examples, audio stems, etc., so you can download the relevant content and experiment with the techniques you read about. The website also features video interviews the author conducted with the following acclaimed producers, who share their expertise, experience, and insight into the processes involved: Fredrik Nordstroem (Dimmu Borgir, At The Gates, In Flames); Matt Hyde (Slayer, Parkway Drive, Children of Bodom); Ross Robinson (Slipknot, Sepultura, Machine Head); Logan Mader (Gojira, DevilDriver, Fear Factory); Andy Sneap (Megadeth, Killswitch Engage, Testament); Jens Bogren (Opeth, Kreator, Arch Enemy); Daniel Bergstrand (Meshuggah, Soilwork, Behemoth); and Nick Raskulinecz (Mastodon, Death Angel, Trivium). Quotes from these interviews are featured throughout Metal Music Manual, with additional contributions from: Ross "Drum Doctor" Garfield (one of the world's top drum sound specialists, with Metallica and Slipknot amongst his credits); Andrew Scheps (Black Sabbath, Linkin Park, Metallica); and Maor Appelbaum (Sepultura, Faith No More, Halford).
Despite its significant growth over the past five years, the mobile and social videogame industry is still maturing at a rapid rate. Due to various storage and visual and sound asset restrictions, mobile and social gaming must have innovative storytelling techniques. Narrative Tactics grants readers practical advice for improving narrative design and game writing for mobile and social games, and helps them rise to the challenge of mobile game storytelling. The first half of the book covers general storytelling techniques, including worldbuilding, character design, dialogue, and quests. In the second half, leading experts in the field explore various genres and types of mobile and social games, including educational games, licensed IP, games for specific demographics, branding games, and free to play (F2P). Key Features: The only book dedicated to narrative design and game writing in social and mobile games, an explosive market overtaking the console gaming market. Provides tips for narrative design and writing tailored specifically for mobile and social game markets. Guides readers along with conclusions that include questions to help the reader in narrative design and/or writing. Explores real games to illustrate theory and best practices with analyses of game case studies per chapter, covering indie, social/mobile, and AAA games. Includes checklists to help readers critique their own narrative design/writing.
An Introduction to Music Technology, Second Edition provides a clear overview of the essential elements of music technology for today's musician. This book focuses on the topics that underlie the hardware and software in use today: Sound, Audio, MIDI, Computer Notation, and Computer-Assisted Instruction. Appendices cover necessary computer hardware and software concepts. Written for both music technology majors and non-majors, this textbook introduces fundamental principles and practices so students can learn to work with a wide range of software programs, adapt to new music technologies, and apply music technology in their performance, composition, teaching, and analysis. Features: thorough explanations of key topics in music technology; content applicable to all software and hardware, not linked to just one piece of software or gear; in-depth discussion of digital audio topics, such as sampling rates, resolutions, and file formats; explanations of standard audio plug-ins including dynamics processors, EQs, and delay-based effects; coverage of synthesis and sampling in software instruments; pedagogical features, including Further Reading sections that allow the student to delve deeper into topics of interest, Suggested Activities that can be carried out with a variety of different programs, Key Terms at the end of each chapter, and "What Do I Need?" chapters covering the types of hardware and software needed in order to put together Audio and MIDI systems; and a companion website with links to audio examples that demonstrate various concepts, step-by-step tutorials, relevant hardware, software, and additional audio and video resources. The new edition has been fully updated to cover new technologies that have emerged since the first edition, including iOS and mobile platforms, online notation software, alternate controllers, and Open Sound Control (OSC).
Audio production is an incredibly rewarding craft. To take the raw, basic tracks of a fledgling idea and shape them into one glorious stereophonic sound wave is an amazing feat. The transformation from analogue to digital dominance has brought many advances in sound quality and new techniques, but producing digital music with only a standard computer and DAW can be problematic, time-consuming and sometimes disappointing without the right approach and skills. In Template Mixing and Mastering, renowned mix engineer Billy Decker tackles the challenges of in-the-box production through his innovative template approach. He shares his passion and knowledge from over twenty years of industry experience, including an introduction to templates and a step-by-step guide to their set-up and a discussion of drum replacement technology. Channel and setting information for each of the drum, instrument and vocal sections of his template is discussed along with the master channel and his methodology of mixing and mastering. Finally, he gives professional advice and best practice. This book features the full template used on sixteen No 1 records!
Build great voice apps of any complexity for any domain by learning both the how's and why's of voice development. In this book you'll see how we live in a golden age of voice technology and how advances in automatic speech recognition (ASR), natural language processing (NLP), and related technologies allow people to talk to machines and get reasonable responses. Today, anyone with computer access can build a working voice app. That democratization of the technology is great. But, while it's fairly easy to build a voice app that runs, it's still remarkably difficult to build a great one, one that users trust, that understands their natural ways of speaking and fulfills their needs, and that makes them want to return for more. We start with an overview of how humans and machines produce and process conversational speech, explaining how they differ from each other and from other modalities. This is the background you need to understand the consequences of each design and implementation choice as we dive into the core principles of voice interface design. We walk you through many design and development techniques, including ones that some view as advanced, but that you can implement today. We use the Google development platform and Python, but our goal is to explain the reasons behind each technique such that you can take what you learn and implement it on any platform. Readers of Mastering Voice Interfaces will come away with a solid understanding of what makes voice interfaces special, learn the core voice design principles for building great voice apps, and how to actually implement those principles to create robust apps. We've learned during many years in the voice industry that the most successful solutions are created by those who understand both the human and the technology sides of speech, and that both sides affect design and development. 
Because we focus on developing task-oriented voice apps for real users in the real world, you'll learn how to take your voice apps from idea through scoping, design, development, rollout, and post-deployment performance improvements, all illustrated with examples from our own voice industry experiences. What You Will Learn: Create truly great voice apps that users will love and trust. See how voice differs from other input and output modalities, and why that matters. Discover best practices for designing conversational voice-first applications, and the consequences of design and implementation choices. Implement advanced voice designs, with real-world examples you can use immediately. Verify that your app is performing well, and what to change if it doesn't. Who This Book Is For: Anyone curious about the real how's and why's of voice interface design and development. In particular, it's aimed at teams of developers, designers, and product owners who need a shared understanding of how to create successful voice interfaces using today's technology. We expect readers to have had some exposure to voice apps, at least as users.
Unleash your iPod touch and take it to the limit using secret tips and techniques from gadget hacker Erica Sadun. Fast and fun to read, Taking Your iPod touch 4 to the Max is fully updated to show you how to get the most out of Apple's OS 4. You'll find all the best undocumented tricks as well as the most efficient and enjoyable introduction to the iPod touch available. Starting with an introduction to iPod touch 4 basics, you'll quickly move on to discover the iPod touch's hidden potential, like how to connect to a TV, get contract-free VOIP, and hack OS 4 so it will run apps on your iPod touch. From e-mail and surfing the Web, to using iTunes, iBooks, games, photos, ripping DVDs and getting free VOIP with Skype or Jajah, you'll find it all in this book. You'll even learn tips on where to get the best and cheapest iPod touch accessories. Get ready to take your iPod touch to the max!
Hone your Pro Tools music production skills and create better tracks with Pro Tools 11: Music Production, Recording, Editing, and Mixing. With Pro Tools 11, you'll get more than descriptions of Pro Tools features and menus: this book grounds its Pro Tools instruction thoroughly in real-world music production. Learn to leverage this powerful DAW and bend it to your will, whether you're recording and mixing a band or producing a dance track. Get tips that will save you time, even if you're an old hand at Pro Tools. Extensive full-color screenshots visually guide you through the book, and an informal writing style keeps you engaged. Includes coverage of additional features incorporated into version 10.3.6, which can be co-installed alongside Pro Tools 11 to allow use of TDM and RTAS plug-in formats. Author Mike Collins, an independent music producer and music technology consultant who has worked with Pro Tools since 1991, gives you a frank view of the software without the hype. This book is carefully designed for users with basic music production experience or knowledge, but can serve as a quick learning guide for ambitious beginners or as a reference for the advanced or professional user. Pro Tools 11 includes coverage of the application's new features, including: Avid Audio Engine; Dynamic Host-based Plug-in Processing; Low-latency Input Buffer; Offline Bounce; Unified Workspace Browser; Advanced Metering for Pro Tools HD 11; and Co-Install with Pro Tools 10.3.6. Level: Intermediate
Acoustic and MIDI Orchestration for the Contemporary Composer, Second Edition provides effective explanations and illustrations to teach you how to integrate traditional approaches to orchestration with the use of the modern sequencing techniques and tools available to today’s composer. By covering both approaches, Pejrolo and DeRosa offer a comprehensive and multifaceted learning experience that will develop your orchestration and sequencing skills and enhance your final productions.
Guiding you through the history and emergence of modern mastering techniques, then providing practical hints and tips on how to use them in your setup, Practical Mastering is the book for anyone wanting to master this elusive art form. It provides solid mastering theory underpinned by years of professional experience, along with hands-on advice on getting the most out of your setup. Drawing on that experience, Mark and Russ offer a discussion of how to effectively listen to and interpret post-mix tracks, showing you how to pick out areas of the mix that could be optimized or need development. They back this up with professional tips and tricks on how to develop and fine-tune your hearing skills, honing your ears to listen to your mixes efficiently and effectively and create perfectly polished master tracks.
The acoustics of a space can have a real impact on the sounds you create and capture. Acoustics and Psychoacoustics, Fifth Edition provides supportive tools and exercises to help you understand how music sounds and behaves in different spaces, whether during a performance or a recording, when planning a control room or listening space, and how it is perceived by performers, listeners, and recording engineers.
Simon Grimm examines new multi-microphone signal processing strategies that aim to achieve noise reduction and dereverberation. To this end, narrow-band signal enhancement approaches are combined with broad-band processing in terms of directivity-based beamforming. Previously introduced formulations of the multichannel Wiener filter rely on the second-order statistics of the speech and noise signals. The author analyses how additional knowledge about the location of a speaker as well as the microphone arrangement can be used to achieve further noise reduction and dereverberation.
This open access book describes the results of natural language processing and machine learning methods applied to clinical text from electronic patient records. It is divided into twelve chapters. Chapters 1-4 discuss the history and background of the original paper-based patient records, their purpose, and how they are written and structured. These initial chapters do not require any technical or medical background knowledge. The remaining eight chapters are more technical in nature and describe various medical classifications and terminologies such as ICD diagnosis codes, SNOMED CT, MeSH, UMLS, and ATC. Chapters 5-10 cover basic tools for natural language processing and information retrieval, and how to apply them to clinical text. The differences between rule-based and machine learning-based methods, as well as between supervised and unsupervised machine learning methods, are also explained. Next, ethical concerns regarding the use of sensitive patient records for research purposes are discussed, including methods for de-identifying electronic patient records and safely storing patient records. The book's closing chapters present a number of applications in clinical text mining and summarise the lessons learned from the previous chapters. The book provides a comprehensive overview of technical issues arising in clinical text mining, and offers a valuable guide for advanced students in health informatics, computational linguistics, and information retrieval, and for researchers entering these fields.
This volume provides a comprehensive introduction to foundational topics in sound design for linear media, such as listening and recording; audio postproduction; key musical concepts and forms such as harmony, conceptual sound design, electronica, soundscape, and electroacoustic composition; the audio commons; and sound's ontology and phenomenology. The reader will gain a broad understanding of the key concepts and practices that define sound design for its use with moving images as well as important forms of composed sound. The chapters are written by international authors from diverse backgrounds who provide multidisciplinary perspectives on sound in its linear forms. The volume is designed as a textbook for students and teachers, as a handbook for researchers in sound, media and experience, and as a survey of key trends and ideas for practitioners interested in exploring the boundaries of their profession.