|
Showing 1 - 4 of
4 matches in All Departments
This book addresses the problems that are encountered, and
solutions that have been proposed, when we aim to identify people
and to reconstruct populations under conditions where information
is scarce, ambiguous, fuzzy and sometimes erroneous. The process
from handwritten registers to a reconstructed digitized population
consists of three major phases, reflected in the three main
sections of this book. The first phase involves transcribing and
digitizing the data while structuring the information in a
meaningful and efficient way. In the second phase, records that
refer to the same person or group of persons are identified by a
process of linkage. In the third and final phase, the information
on an individual is combined into a reconstruction of their life
course. The studies and examples in this book originate from a
range of countries, each with its own cultural and administrative
characteristics, and from medieval charters through historical
censuses and vital registration, to the modern issue of privacy
preservation. Despite the diverse places and times addressed, they
all share the study of fundamental issues when it comes to model
reasoning for population reconstruction and the possibilities and
limitations of information technology to support this process. It
is thus not a single discipline that is involved in such an
endeavor. Historians, social scientists, and linguists represent
the humanities through their knowledge of the complexity of the
past, the limitations of sources, and the possible interpretations
of information. The availability of big data from digitized
archives and the need for complex analyses to identify individuals
calls for the involvement of computer scientists. With
contributions from all these fields, often in direct cooperation,
this book is at the heart of the digital humanities, and will
hopefully offer a source of inspiration for future investigations.
Corpus-based methods will be found at the heart of many language
and speech processing systems. This book provides an in-depth
introduction to these technologies through chapters describing
basic statistical modeling techniques for language and speech, the
use of Hidden Markov Models in continuous speech recognition, the
development of dialogue systems, part-of-speech tagging and partial
parsing, data-oriented parsing and n-gram language modeling. The
book attempts to give both a clear overview of the main
technologies used in language and speech processing, along with
sufficient mathematics to understand the underlying principles.
There is also an extensive bibliography to enable topics of
interest to be pursued further. Overall, we believe that the book
will give newcomers a solid introduction to the field and it will
give existing practitioners a concise review of the principal
technologies used in state-of-the-art language and speech
processing systems. Corpus-Based Methods in Language and Speech
Processing is an initiative of ELSNET, the European Network in
Language and Speech. In its activities, ELSNET attaches great
importance to the integration of language and speech, both in
research and in education. The need for and the potential of this
integration are well demonstrated by this publication.
Corpus-based methods will be found at the heart of many language
and speech processing systems. This book provides an in-depth
introduction to these technologies through chapters describing
basic statistical modeling techniques for language and speech, the
use of Hidden Markov Models in continuous speech recognition, the
development of dialogue systems, part-of-speech tagging and partial
parsing, data-oriented parsing and n-gram language modeling. The
book attempts to give both a clear overview of the main
technologies used in language and speech processing, along with
sufficient mathematics to understand the underlying principles.
There is also an extensive bibliography to enable topics of
interest to be pursued further. Overall, we believe that the book
will give newcomers a solid introduction to the field and it will
give existing practitioners a concise review of the principal
technologies used in state-of-the-art language and speech
processing systems. Corpus-Based Methods in Language and Speech
Processing is an initiative of ELSNET, the European Network in
Language and Speech. In its activities, ELSNET attaches great
importance to the integration of language and speech, both in
research and in education. The need for and the potential of this
integration are well demonstrated by this publication.
This book addresses the problems that are encountered, and
solutions that have been proposed, when we aim to identify people
and to reconstruct populations under conditions where information
is scarce, ambiguous, fuzzy and sometimes erroneous. The process
from handwritten registers to a reconstructed digitized population
consists of three major phases, reflected in the three main
sections of this book. The first phase involves transcribing and
digitizing the data while structuring the information in a
meaningful and efficient way. In the second phase, records that
refer to the same person or group of persons are identified by a
process of linkage. In the third and final phase, the information
on an individual is combined into a reconstruction of their life
course. The studies and examples in this book originate from a
range of countries, each with its own cultural and administrative
characteristics, and from medieval charters through historical
censuses and vital registration, to the modern issue of privacy
preservation. Despite the diverse places and times addressed, they
all share the study of fundamental issues when it comes to model
reasoning for population reconstruction and the possibilities and
limitations of information technology to support this process. It
is thus not a single discipline that is involved in such an
endeavor. Historians, social scientists, and linguists represent
the humanities through their knowledge of the complexity of the
past, the limitations of sources, and the possible interpretations
of information. The availability of big data from digitized
archives and the need for complex analyses to identify individuals
calls for the involvement of computer scientists. With
contributions from all these fields, often in direct cooperation,
this book is at the heart of the digital humanities, and will
hopefully offer a source of inspiration for future investigations.
|
|