![]() |
![]() |
Your cart is empty |
||
Books > Language & Literature > Language & linguistics > Computational linguistics
Multi-Dimensional Analysis: Research Methods and Current Issues provides a comprehensive guide both to the statistical methods in Multi-Dimensional Analysis (MDA) and its key elements, such as corpus building, tagging, and tools. The major goal is to explain the steps involved in the method so that readers may better understand this complex research framework and conduct MD research on their own. Multi-Dimensional Analysis is a method that allows the researcher to describe different registers (textual varieties defined by their social use) such as academic settings, regional discourse, social media, movies, and pop songs. Through multivariate statistical techniques, MDA identifies complementary correlation groupings of dozens of variables, including variables which belong both to the grammatical and semantic domains. Such groupings are then associated with situational variables of texts like information density, orality, and narrativity to determine linguistic constructs known as dimensions of variation, which provide a scale for the comparison of a large number of texts and registers. This book is a comprehensive research guide to MDA.
Linguistic Issues in Language Technology focuses on the relationships between linguistic insights and language technology. In conjunction with machine learning and statistical techniques, more sophisticated models of language and speech are needed to make significant progress in both existing and newly emerging areas of computational language analysis. The vast quantity of electronically accessible natural language data provides unprecedented opportunities for data-intensive analysis of linguistic phenomena, which can in turn enrich computational methods. Linguistic Issues in Language Technology provides a forum for this work. In this volume, contributors offer new perspectives on semantic representations for textual inference.
This handbook presents an overview of the phenomenon of reference - the ability to refer to and pick out entities - which is an essential part of human language and cognition. In the volume's 21 chapters, international experts in the field offer a critical account of all aspects of reference from a range of theoretical perspectives. Chapters in the first part of the book are concerned with basic questions related to different types of referring expression and their interpretation. They address questions about the role of the speaker - including speaker intentions - and of the addressee, as well as the role played by the semantics of the linguistic forms themselves in establishing reference. This part also explores the nature of such concepts as definite and indefinite reference and specificity, and the conditions under which reference may fail. The second part of the volume looks at implications and applications, with chapters covering such topics as the acquisition of reference by children, the processing of reference both in the human brain and by machines. The volume will be of interest to linguists in a wide range of subfields, including semantics, pragmatics, computational linguistics, and psycho- and neurolinguistics, as well as scholars in related fields such as philosophy and computer science.
A comprehensive corpus analysis of adolescent health communication is long overdue - and this book provides it. We know comparatively little about the language adolescents use to articulate their health concerns, and discourse analysis of their choices can shed light on their attitudes towards and beliefs about health and illness. This book interrogates a two million word corpus of messages posted by adolescents to an online health forum. It adopts a mixed method corpus approach to health communication, combining both quantitative and qualitative techniques. Analysis in this way gives voice to an age group whose subjective experiences of illness have often been marginalized or simply overlooked in favour of the concerns of older populations.
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field.Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading, together with an integral companion website that contains lists and guidance on contemporary annotated corpora and query tools.
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field.Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading, together with an integral companion website that contains lists and guidance on contemporary annotated corpora and query tools.
Language, apart from its cultural and social dimension, has a scientific side that is connected not only to the study of 'grammar' in a more or less traditional sense, but also to disciplines like mathematics, physics, chemistry and biology. This book explores developments in linguistic theory, looking in particular at the theory of generative grammar from the perspective of the natural sciences. It highlights the complex and dynamic nature of language, suggesting that a comprehensive and full understanding of such a species-specific property will only be achieved through interdisciplinary work.
This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus. This is a large monitor corpus of contemporary Norwegian language, compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification and extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system which is flexible enough to handle very large corpora in an efficient way. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.
This book demonstrates how corpus-based research can advance the understanding of linguistic phenomena in a given language. By presenting a detailed analysis of collocations and idioms in a digital corpus of English and German, the contributors to this volume show how the use of collocations and idioms has changed over time, and suggests possible triggers for this change. The book not only examines what these collocations and idioms are, but also what their purpose is within languages. Idioms and Collocations is divided into three sections. The first section discusses the construction, composition and annotation of the corpus. Chapters in the second section describe the methods for querying the corpus, the generation and maintenance of the example subcorpora, and the linguistic-lexicographic analyses of the target idioms.Finally, the third section presents the results of specific investigations into the syntactic, semantic, and historical properties of collocations. This book presents original work in corpus linguistics, computational linguistics, theoretical linguistics and lexicography. It will be useful for researchers in academic and industrial settings, and lexicographers.The editorial board include: Paul Baker (Lancaster), Frantisek Cermak (Prague), Susan Conrad (Portland), Geoffrey Leech (Lancaster), Dominique Maingueneau (Paris XII), Christian Mair (Freiburg), Alan Partington (Bologna), Elena Tognini-Bonelli (Siena and TWC), Ruth Wodak (Lancaster), and, Feng Zhiwei (Beijing). "Corpus Linguistics" provides the methodology to extract meaning from texts. Taking as its starting point the fact that language is not a mirror of reality but lets us share what we know, believe and think about reality, it focuses on language as a social phenomenon, and makes visible the attitudes and beliefs expressed by the members of a discourse community.Consisting of both spoken and written language, discourse always has historical, social, functional, and regional dimensions. Discourse can be monolingual or multilingual, interconnected by translations. Discourse is where language and social studies meet."The Corpus and Discourse" series consists of two strands. The first, "Research in Corpus and Discourse", features innovative contributions to various aspects of corpus linguistics and a wide range of applications, from language technology via the teaching of a second language to a history of mentalities. The second strand, "Studies in Corpus and Discourse", is comprised of key texts bridging the gap between social studies and linguistics. Although equally academically rigorous, this strand will be aimed at a wider audience of academics and postgraduate students working in both disciplines.
Corpus linguistics is often regarded as a methodology in its own right, but little attention has been given to the theoretical perspectives from which the subject can be approached. The present book contributes to filling this gap. Bringing together original contributions by internationally renowned authors, the chapters include coverage of the lexical priming theory, parole-linguistics, a four-part model of language system and language use, and the concept of local textual functions. The theoretical arguments are illustrated and complemented by case studies using data from large corpora such as the BNC, smaller purpose-built corpora, and Google searches. By presenting theoretical positions in corpus linguistics, "Text, Discourse, and Corpora" provides an essential overview for advanced undergraduate, postgraduate and academic readers. "Corpus and Discourse Series" editors are: Wolfgang Teubert, University of Birmingham, and Michaela Mahlberg, Liverpool Hope University College. Editorial Board: Frantisek Cermak (Prague), Susan Conrad (Portland), Geoffrey Leech (Lancaster), Elena Tognini-Bonelli (Lecce and TWC), Ruth Wodak (Lancaster and Vienna), and Feng Zhiwei (Beijing). Corpus linguistics provides the methodology to extract meaning from texts. Taking as its starting point the fact that language is not a mirror of reality but lets us share what we know, believe and think about reality, it focuses on language as a social phenomenon, and makes visible the attitudes and beliefs expressed by the members of a discourse community. Consisting of both spoken and written language, discourse always has historical, social, functional, and regional dimensions. Discourse can be monolingual or multilingual, interconnected by translations. Discourse is where language and social studies meet. "The Corpus and Discourse" series consists of two strands. The first, "Research in Corpus and Discourse", features innovative contributions to various aspects of corpus linguistics and a wide range of applications, from language technology via the teaching of a second language to a history of mentalities. The second strand, "Studies in Corpus and Discourse", is comprised of key texts bridging the gap between social studies and linguistics. Although equally academically rigorous, this strand will be aimed at a wider audience of academics and postgraduate students working in both disciplines.
The book will appeal to scholars and advanced students of
morphology, syntax, computational linguistics and natural language
processing (NLP). It provides a critical and practical guide to
computational techniques for handling morphological and syntactic
phenomena, showing how these techniques have been used and modified
in practice.
This book presents a novel analysis of Particle Movement from the point of view of psycholinguistics. As well as examining the methodology of Particle Movement, the study addresses more theoretical questions. It is argued that some theories of how language is produced by the brain cannot explain the results found in practical studies, and Gries therefore looks at the relative merits of more interactive models of language production. This book will be useful to postgraduates and academics researching cognitive linguistics and psycholinguistics.
This book is an introduction to the rudiments of Perl programming. It provides the general reader with an interest in language with the most usable and relevant aspects of Perl for writing programs that deal with language.Through a series of simple examples and exercises, the reader is gradually introduced to the essentials of good programming. The examples are carefully constructed to make the introduction of new concepts as simple as possible, while at the same time using sample programs that make sense to someone who works with language as data. Many of these programs can be used immediately with minimal or no modification. The text is accompanied by exercises at the end of each chapter and all the code is available from the companion website: http: //www .u.arizona.edu/~hammond.
To apply the same approaches to analysing spoken and written formulaic language is problematic; to do so masks the fact that the contextual meaning of spoken formulaic language is encoded, to a large extent, in its prosody. In The Prosody of Formulaic Sequences, Phoebe Lin offers a new perspective on formulaic language, arguing that while past research often treats formulaic language as a lexical phenomenon, the phonological aspect of it is a more fundamental facet. This book draws its conclusions from three original, empirical studies of spoken formulaic language, assessing intonation unit boundaries as well as features such as tempo and stress placement. Across all studies, Lin considers questions of methodology and conceptual framework. The corpus-based descriptions of prosody outlined in this book not only deepen our understanding of the nature of formulaic language but have important implications for English Language Teaching and automatic speech synthesis.
This volume reflects the developments in the rapidly-changing field of typography for computer interface design. Presented as a series of integrated case studies and interviews, the book covers: the skills needed for quality website design; the impact of computers upon publishing and coroprate design; the use of computers within the educational field; the progress of child-orientated typefaces; and issues in screen layout when designing educational and training software.
This volume showcases original, agenda-setting studies in the field of learner corpus research of both spoken and written production. The studies have important applications for classroom pedagogy. The volume brings readers up-to-date with new written and spoken learner corpora, often looking at previously under-examined variables in learner corpus investigations. It also demonstrates innovative applications of learner corpus findings, addressing issues such as the effect of task, the effect of learner variables and the nature of learner language. The volume is of significant interest to researchers working in corpus linguistics, learner corpus research, second language acquisition and English for Academic and Specific Purposes, as well to practitioners interested in the application of the findings in language teaching and assessment.
This book is the first dedicated to linguistic parsing - the processing of natural language according to the rules of a formal grammar - in the Minimalist Program. While Minimalism has been at the forefront of generative grammar for several decades, it often remains inaccessible to computer scientists and others in adjacent fields. This volume makes connections with standard computational architectures, provides efficient implementations of some fundamental minimalist accounts of syntax, explores implementations of recent theoretical proposals, and explores correlations between posited structures and measures of neural activity during human language comprehension. These studies will appeal to graduate students and researchers in formal syntax, computational linguistics, psycholinguistics, and computer science.
Academic vocabulary is in fashion, as witnessed by the increasing
number of books published on the topic. In the first part of this
book," "Magali Paquot scrutinizes the concept of 'academic
vocabulary' and proposes a corpus-driven procedure based on the
criteria of keyness, range and evenness of distribution to select
academic words that could be part of a common-core academic
vocabulary syllabus.
Finding a particular scientific document amidst a sea of thousands
of other documents can often seem like an insurmountable task. "The
Structure of Scientific Articles" shows how linguistic theory can
provide a solution by analyzing rhetorical structures to make
information retrieval easier and faster.
Linguistic Databases explores the increasing use of databases in linguistics. The enormous potential in linguistic data - billions of utterances and messages daily - has been difficult to exploit. Many linguists have had to concentrate on introspective data with its inevitable blinders toward frequency, variation, and naturalness. Applications of linguistics have been handicapped. This volume explores the potential advantages of database applications to linguistics. Included in this volume are reports on database activities in phonetics, phonology, lexicography and syntax, comparative grammar, second-language acquisition, linguistic fieldwork, and language pathology. The book presents the specialized problems of multi-media (especially audio) and multi-lingual texts, including those in exotic writing systems. Implemented solutions are also discussed. The opportunities to use existing, minimally structured text repositories are presented.
From an abundance of intensifiers to frequent repetition and parallelisms, Donald Trump’s idiolect is highly distinctive from that of other politicians and previous Presidents of the United States. Combining quantitative and qualitative analyses, this book identifies the characteristic features of Trump’s language and argues that his speech style, often sensationalized by the media, differs from the usual political rhetoric on more levels than is immediately apparent. Chapters examine Trump’s tweets, inaugural address, political speeches, interviews, and presidential debates, revealing populist language traits that establish his idiolect as a direct reflection of changing social and political norms. The authors scrutinize Trump’s conspicuous use of nicknames, the definite article, and conceptual metaphors as strategies of othering and antagonising his opponents. They further shed light on Trump’s fake news agenda and his mutation of the conventional political apology which are strategically implemented for a political purpose. Drawing on methods from corpus linguistics, conversation analysis, and critical discourse analysis, this book provides a multifaceted investigation of Trump’s language use and addresses essential questions about Trump as a political phenomenon.
The COVID-19 pandemic has led to a host of critical reflections about discourse practises dealing with public health issues. Situating crisis communication at the centre of societal and political debates about responses to the pandemic, this volume analyses the discursive strategies used in a variety of settings. Exploring how crisis discourse has become a part of managing the public health crisis itself, this book focuses on the communicative tasks and challenges for both speakers and their public audiences in seven areas: - establishment of discursive and political authority - official governmental and expert communication to the public - public understanding of government communication - legitimation of public health management as a 'war' - judging and blaming a collective other - cross-national comparison and rivalry - empathy and encouragement Covering global discourses from Asia, Europe, the Middle East, North and South America, and New Zealand, chapters use corpus-based data to cast light on these issues from a variety of languages. With crisis discourse already the object of fierce national and international debates about the appropriateness of specific communicative styles, information management and 'verbal hygiene', Pandemic and Crisis Discourse offers an authoritative intervention from language experts.
|
![]() ![]() You may like...
Intelligent Natural Language Processing…
Khaled Shaalan, Aboul Ella Hassanien, …
Hardcover
R7,972
Discovery Miles 79 720
The Temporal Structure of Multimodal…
Laszlo Hunyadi, Istvan Szekrenyes
Hardcover
R2,927
Discovery Miles 29 270
Foundation Models for Natural Language…
Gerhard Paaß, Sven Giesselbach
Hardcover
Linguistic Inquiries into Donald…
Ulrike Schneider, Matthias Eitelmann
Hardcover
R4,138
Discovery Miles 41 380
|