|
|
Showing 1 - 3 of
3 matches in All Departments
Deep learning is revolutionizing how machine translation systems
are built today. This book introduces the challenge of machine
translation and evaluation - including historical, linguistic, and
applied context -- then develops the core deep learning methods
used for natural language applications. Code examples in Python
give readers a hands-on blueprint for understanding and
implementing their own machine translation systems. The book also
provides extensive coverage of machine learning tricks, issues
involved in handling various forms of data, model enhancements, and
current challenges and methods for analysis and visualization.
Summaries of the current research in the field make this a
state-of-the-art textbook for undergraduate and graduate classes,
as well as an essential reference for researchers and developers
interested in other applications of neural methods in the broader
field of human language processing.
The dream of automatic language translation is now closer thanks to
recent advances in the techniques that underpin statistical machine
translation. This class-tested textbook from an active researcher
in the field, provides a clear and careful introduction to the
latest methods and explains how to build machine translation
systems for any two languages. It introduces the subject's building
blocks from linguistics and probability, then covers the major
models for machine translation: word-based, phrase-based, and
tree-based, as well as machine translation evaluation, language
modeling, discriminative training and advanced methods to integrate
linguistic annotation. The book also reports the latest research,
presents the major outstanding challenges, and enables novices as
well as experienced researchers to make novel contributions to this
exciting area. Ideal for students at undergraduate and graduate
level, or for anyone interested in the latest developments in
machine translation.
This unique book provides a comprehensive introduction to the most
popular syntax-based statistical machine translation models,
filling a gap in the current literature for researchers and
developers in human language technologies. While phrase-based
models have previously dominated the field, syntax-based approaches
have proved a popular alternative, as they elegantly solve many of
the shortcomings of phrase-based models. The heart of this book is
a detailed introduction to decoding for syntax-based models. The
book begins with an overview of synchronous-context free grammar
(SCFG) and synchronous tree-substitution grammar (STSG) along with
their associated statistical models. It also describes how three
popular instantiations (Hiero, SAMT, and GHKM) are learned from
parallel corpora. It introduces and details hypergraphs and
associated general algorithms, as well as algorithms for decoding
with both tree and string input. Special attention is given to
efficiency, including search approximations such as beam search and
cube pruning, data structures, and parsing algorithms. The book
consistently highlights the strengths (and limitations) of
syntax-based approaches, including their ability to generalize
phrase-based translation units, their modeling of specific
linguistic phenomena, and their function of structuring the search
space.
|
|