Download or read book Corpus Processing for Lexical Acquisition written by Bran Boguraev and published by Bradford Book. This book was released on 1996 with total page 280 pages. Available in PDF, EPUB and Kindle. Book excerpt: The lexicon has emerged from the study of computational linguistics as a fundamental resource that enables a variety of linguistic processes to operate in the course of tasks ranging from language analysis and text processing to machine translation. Lexicon acquisition, therefore, plays an essential part in getting any natural language processing system to function in the real world. Computers that process natural language require a variety of lexical information in addition to what can be found in standard dictionaries. Moreover, machine-readable dictionaries of the conventional sort have been found to be inadequate for fully supporting realistic natural language processing tasks. This volume describes corpus processing techniques that can be used to extract the additional lexical information required. Bringing together a balanced blend of the theoretical and practical, the contributions provide the most recent look at lexical acquisition techniques and practices. These include coping with unknown lexicalizations, task-driven lexical induction, categorization of lexical units, lexical semantics from corpus analysis, and measuring lexical acquisition. The problems addressed reflect a host of topics including recognition of open compounds, incremental acquisition of meanings from sentence usages, recognition of new senses of existing words, sense disambiguation, recognition of specific classes of works, and recognition and annotation of patterns of word use, each of them important to the overall language analysis process, and each employing text analysis techniques in a useful and theoretically motivated way. Language, Speech, and Communication series
Download or read book The Cambridge Handbook of Learner Corpus Research written by Sylviane Granger and published by Cambridge University Press. This book was released on 2015-10-01 with total page 1199 pages. Available in PDF, EPUB and Kindle. Book excerpt: The origins of learner corpus research go back to the late 1980s when large electronic collections of written or spoken data started to be collected from foreign/second language learners, with a view to advancing our understanding of the mechanisms of second language acquisition and developing tailor-made pedagogical tools. Engaging with the interdisciplinary nature of this fast-growing field, The Cambridge Handbook of Learner Corpus Research explores the diverse and extensive applications of learner corpora, with 27 chapters written by internationally renowned experts. This comprehensive work is a vital resource for students, teachers and researchers, offering fresh perspectives and a unique overview of the field. With representative studies in each chapter which provide an essential guide on how to conduct learner corpus research in a wide range of areas, this work is a cutting-edge account of learner corpus collection, annotation, methodology, theory, analysis and applications.
Download or read book Language Processing in Advanced Learners of English written by Marco Schilk and published by John Benjamins Publishing Company. This book was released on 2020-05-15 with total page 313 pages. Available in PDF, EPUB and Kindle. Book excerpt: The production and processing of collocations and formulaic language is a field of growing interest in corpus linguistics and experimental psycholinguistics. In the past this fascinating field at the interface of grammar and the lexicon has been mainly studied based on English native speakers, while research focusing on second language speakers and language learners has been comparatively rare. This book proposes an integration of corpus-based and experimental methods by analysing language processing of collocation by advanced learners of English. In using corpus-derived collocational stimuli of native-like and learner-typical language use in an experimental setting, it shows how advanced German L1 learners of English process native-like collocations, L1-based interferences and non-collocating lexical combinations. This book is of interest to anyone interested in the psycholinguistic validity of collocation from a bilingual point of view, as it explores methods of tracking collocational processing of speakers working with different sets of ‘collocational preferences’.
Download or read book Computer Learner Corpora Second Language Acquisition and Foreign Language Teaching written by Sylviane Granger and published by John Benjamins Publishing. This book was released on 2002-12-11 with total page 257 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book takes stock of current research into computer learner corpora conducted both by ELT and SLA specialists. It should be of particular interest to researchers looking to assess its relevance to SLA theory and ELT practice. Throughout the volume, emphasis is also placed on practical, methodological aspects of computer learner corpus research, in particular the contribution of technology to the research process. The advantages and disadvantages of automated and semi-automated approaches are analyzed, the capabilities of linguistic software tools investigated, the corpora (and compilation processes) described in detail. In this way, an important function of the volume is to give practical insight to researchers who may be considering compiling a corpus of learner data or embarking on learner corpus research.The volume is divided into three main sections: • Section 1 gives a general overview of learner corpus research; • Section 2 illustrates a range of corpus-based approaches to interlanguage analysis; • Section 3 demonstrates the direct pedagogical relevance of learner corpus work.
Download or read book Corpus based Perspectives in Linguistics written by Yuji Kawaguchi and published by John Benjamins Publishing. This book was released on 2007 with total page 464 pages. Available in PDF, EPUB and Kindle. Book excerpt: UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics Research Domain, which was held on September 14, 2006. The first part consisting of eleven presentations to this workshop shows a wide range of subjects within the area of corpus-based research, such as dictionary, linguistic atlas, dialect, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions to both written and spoken corpora by the members and research assistants of UBLI.
Download or read book Usage Based Approaches to Language Acquisition and Processing written by Nick C. Ellis and published by Wiley-Blackwell. This book was released on 2016-06-13 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Nick C. Ellis, Ute Römer, and Matthew Brook O'Donnell present a view of language as a complex adaptive system that is learned through usage. In a series of research studies, they analyze Verb-Argument Constructions (VACs) in first and second language learning, processing, and use. Drawing on diverse epistemological and methodological perspectives, they show how language emerges out of multiple experiences of meaning-making. In the development of both mother tongue and additional languages, each usage experience affects construction knowledge following general principles of learning relating to frequency, contingency, and semantic prototypicality. The implications of this work will be of value to students and scholars from a wide range of disciplinary interests in language and learning. "This is an impressive volume that will inspire researchers for generations to come. Focusing on the construction and acquisition of language, it combines a comprehensive synthesis of theory with a detailed account of extensive empirical work." —Susan Hunston, University of Birmingham "This book is a phenomenal synthesis of a formidable research program. In a feast of corpus, psycholinguistic, acquisitional, and simulation evidence, the authors’ bold theoretical insights advance knowledge about human language to unprecedented levels." —Lourdes Ortega, Georgetown University "The authors present a superb synthesis of approaches to verb-argument constructions and convincingly demonstrate the close links between lexical patterning and constructional meaning. An absolute must-read for anyone interested in usage-based approaches to language learning." —Ewa Dabrowska, University of Northumbria at Newcastle "This book represents an outstanding achievement. The authors illustrate why the most exciting work in the language sciences today is conducted across disciplinary boundaries. Working at the intersection of experimental, computational, and corpus-based approaches, their research inspires us to look beyond our own disciplines to observe language data from all angles." —Patrick Rebuschat, Lancaster University
Download or read book The Routledge Handbook of Corpus Linguistics written by Anne O'Keeffe and published by Routledge. This book was released on 2010-04-05 with total page 1263 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Routledge Handbook of Corpus Linguistics provides a timely overview of a dynamic and rapidly growing area with a widely applied methodology. Through the electronic analysis of large bodies of text, corpus linguistics demonstrates and supports linguistic statements and assumptions. In recent years it has seen an ever-widening application in a variety of fields: computational linguistics, discourse analysis, forensic linguistics, pragmatics and translation studies. Bringing together experts in the key areas of development and change, the handbook is structured around six themes which take the reader through building and designing a corpus to using a corpus to study literature and translation. A comprehensive introduction covers the historical development of the field and its growing influence and application in other areas. Structured around five headings for ease of reference, each contribution includes further reading sections with three to five key texts highlighted and annotated to facilitate further exploration of the topics. The Routledge Handbook of Corpus Linguistics is the ideal resource for advanced undergraduates and postgraduates.
Download or read book Corpus Linguistics written by Douglas Biber and published by Cambridge University Press. This book was released on 1998-04-23 with total page 324 pages. Available in PDF, EPUB and Kindle. Book excerpt: An investigation into the way people use language in speech and writing, this volume introduces the corpus-based approach, which is based on analysis of large databases of real language examples stored on computer.
Download or read book Language Processing and Acquisition in Languages of Semitic Root Based Morphology written by Joseph Shimron and published by John Benjamins Publishing. This book was released on 2003-04-28 with total page 400 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book puts together contributions of linguists and psycholinguists whose main interest here is the representation of Semitic words in the mental lexicon of Semitic language speakers. The central topic of the book confronts two views about the morphology of Semitic words. The point of the argument is: Should we see Semitic words’ morphology as “root-based” or “word-based?” The proponents of the root-based approach, present empirical evidence demonstrating that Semitic language speakers are sensitive to the root and the template as the two basic elements (bound morphemes) of Semitic words. Those supporting the word-based approach, present arguments to the effect that Semitic word formation is not based on the merging of roots and templates, but that Semitic words are comprised of word stems and affixes like we find in Indo-European languages. The variety of evidence and arguments for each claim should force the interested readers to reconsider their views on Semitic morphology.
Download or read book The Cambridge Handbook of English Corpus Linguistics written by Douglas Biber and published by Cambridge University Press. This book was released on 2015-06-25 with total page 757 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Cambridge Handbook of English Corpus Linguistics (CHECL) surveys the breadth of corpus-based linguistic research on English, including chapters on collocations, phraseology, grammatical variation, historical change, and the description of registers and dialects. The most innovative aspects of the CHECL are its emphasis on critical discussion, its explicit evaluation of the state of the art in each sub-discipline, and the inclusion of empirical case studies. While each chapter includes a broad survey of previous research, the primary focus is on a detailed description of the most important corpus-based studies in this area, with discussion of what those studies found, and why they are important. Each chapter also includes a critical discussion of the corpus-based methods employed for research in this area, as well as an explicit summary of new findings and discoveries.
Download or read book Lexical Pragmatics and Theory of Mind written by Sandrine Zufferey and published by John Benjamins Publishing. This book was released on 2010 with total page 209 pages. Available in PDF, EPUB and Kindle. Book excerpt: The concept of theory of mind (ToM), a hot topic in cognitive psychology for the past twenty-five years, has gained increasing importance in the fields of linguistics and pragmatics. However, even though the relationship between ToM and verbal communication is now recognized, the extent, causality and full implications of this connection remain mostly to be explored. This book presents a comprehensive discussion of the interface between language, communication, and theory of mind, and puts forward an innovative proposal regarding the role of discourse connectives for this interface. The proposed analysis of connectives is tested from the perspective of their acquisition, using empirical methods such as corpus analysis and controlled experiments, thus placing the study of connectives within the emerging framework of experimental pragmatics.
Download or read book Foundations of Statistical Natural Language Processing written by Christopher Manning and published by MIT Press. This book was released on 1999-05-28 with total page 722 pages. Available in PDF, EPUB and Kindle. Book excerpt: Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.
Download or read book Developing Linguistic Corpora written by Martin Wynne and published by Oxbow Books Limited. This book was released on 2005 with total page 100 pages. Available in PDF, EPUB and Kindle. Book excerpt: A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
Download or read book Specialisation and Variation in Language Corpora written by Ana Díaz Negrillo and published by Peter Lang Gmbh, Internationaler Verlag Der Wissenschaften. This book was released on 2014 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume intends to give evidence of the extraordinary expansion corpus linguistics and language corpora have experienced over the past years. It focuses on emerging types of corpora and corpus techniques and presents corpus-based studies in areas which have benefited from recent develpments in corpus linguistics methods and techniques.
Download or read book Quantitative Corpus Linguistics with R written by Stefan Th. Gries and published by Routledge. This book was released on 2009-03-04 with total page 257 pages. Available in PDF, EPUB and Kindle. Book excerpt: The first textbook of its kind, Quantitative Corpus Linguistics with R demonstrates how to use the open source programming language R for corpus linguistic analyses. Computational and corpus linguists doing corpus work will find that R provides an enormous range of functions that currently require several programs to achieve – searching and processing corpora, arranging and outputting the results of corpus searches, statistical evaluation, and graphing.
Download or read book Corpus Presenter written by Raymond Hickey and published by John Benjamins Publishing Company. This book was released on 2003 with total page 312 pages. Available in PDF, EPUB and Kindle. Book excerpt: "The manual contains many sample analyses and discussions so that users can acquaint themselves with the suite quickly and easily."--BOOK JACKET.
Download or read book Corpus Linguistics Volume 2 written by Anke Lüdeling and published by Walter de Gruyter. This book was released on 2009-03-26 with total page 606 pages. Available in PDF, EPUB and Kindle. Book excerpt: In vielen Bereichen der Linguistik werden Textkorpora, Sprachkorpora oder multimodale Korpora heute als empirische Basis verwendet. Aufbauend auf Methoden des 19. Jahrhunderts haben sich dabei mit dem Aufkommen von elektronischen Korpora seit den 1940ern neue Standards für linguistische Annotation und Vorverarbeitung sowie für qualitative und quantitative Untersuchungen entwickelt. Das Handbuch bietet einen umfassenden Überblick über Geschichte, Methoden und Anwendungen der Korpuslinguistik. Die einzelnen Überblicks- und Spezialartikel sind von Experten und Expertinnen der jeweiligen Gebiete geschrieben. Dabei wird auf klare und umfassende Darstellung, eine gute Vernetzung zwischen den Artikel und weiterführende Hinweise Wert gelegt.