Building a voice model means to capture the characteristics of a
speaker's voice in a data structure. This data structure is then
used by a computer for further processing, such as comparison with
other voices. Voice modeling is a vital step in the process of
automatic speaker recognition that itself is the foundation of
several applied technologies: (a) biometric authentication, (b)
speech recognition and (c) multimedia indexing. Current automatic
speaker recognition works well under relatively constrained
circumstances, such as studio recordings, or when prior knowledge
on the number and identity of occurring speakers is available.
Under more adverse conditions, such as in feature films or amateur
material on the web, the achieved speaker recognition scores drop
below a rate that is acceptable for an end user or for further
processing. In this book, algorithmic and methodic improvements to
the state of the art in automatic speaker recognition are
presented. They are accompanied by a capacious software toolkit
called "sclib." Additionally, the method of "Eidetic Design"
facilitates intuitive algorithm design, development and teaching.
General
Imprint: |
Sudwestdeutscher Verlag Fur Hochschulschriften AG
|
Country of origin: |
United States |
Release date: |
July 2010 |
First published: |
July 2010 |
Authors: |
Thilo Stadelmann
|
Dimensions: |
229 x 152 x 14mm (L x W x T) |
Format: |
Paperback - Trade
|
Pages: |
240 |
ISBN-13: |
978-3-8381-1632-7 |
Categories: |
Books >
Computing & IT >
General theory of computing >
General
|
LSN: |
3-8381-1632-1 |
Barcode: |
9783838116327 |
Is the information for this product incomplete, wrong or inappropriate?
Let us know about it.
Does this product have an incorrect or missing image?
Send us a new image.
Is this product missing categories?
Add more categories.
Review This Product
No reviews yet - be the first to create one!