EBookClubs

Read Books & Download eBooks Full Online

EBookClubs

Read Books & Download eBooks Full Online

Book Web As Corpus

Download or read book Web As Corpus written by Maristella Gatto and published by A&C Black. This book was released on 2014-02-13 with total page 255 pages. Available in PDF, EPUB and Kindle. Book excerpt: Is the internet a suitable linguistic corpus? How can we use it in corpus techniques? What are the special properties that we need to be aware of? This book answers those questions. The Web is an exponentially increasing source of language and corpus linguistics data. From gigantic static information resources to user-generated Web 2.0 content, the breadth and depth of information available is breathtaking – and bewildering. This book explores the theory and practice of the “web as corpus”. It looks at the most common tools and methods used and features a plethora of examples based on the author's own teaching experience. This book also bridges the gap between studies in computational linguistics, which emphasize technical aspects, and studies in corpus linguistics, which focus on the implications for language theory and use.

Book Corpus Linguistics and the Web

Download or read book Corpus Linguistics and the Web written by and published by BRILL. This book was released on 2015-07-14 with total page 311 pages. Available in PDF, EPUB and Kindle. Book excerpt: Using the Web as Corpus is one of the recent challenges for corpus linguistics. This volume presents a current state-of-the-arts discussion of the topic. The articles address practical problems such as suitable linguistic search tools for accessing the www, the question of register variation, or they probe into methods for culling data from the web. The book also offers a wide range of case studies, covering morphology, syntax, lexis, as well as synchronic and diachronic variation in English. These case studies make use of the two approaches to the www in corpus linguistics – web-as-corpus and web-for-corpus-building. The case studies demonstrate that web data can provide useful additional evidence for a broad range of research questions.

Book Web Corpus Construction

    Book Details:
  • Author : Roland Schäfer
  • Publisher : Morgan & Claypool Publishers
  • Release : 2013-07-01
  • ISBN : 1627053123
  • Pages : 197 pages

Download or read book Web Corpus Construction written by Roland Schäfer and published by Morgan & Claypool Publishers. This book was released on 2013-07-01 with total page 197 pages. Available in PDF, EPUB and Kindle. Book excerpt: The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitudes the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora).

Book The Web as Corpus

Download or read book The Web as Corpus written by Maristella Gatto and published by . This book was released on 2014 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book WaCky

    Book Details:
  • Author : Marco Baroni
  • Publisher : Gedit
  • Release : 2006
  • ISBN :
  • Pages : 238 pages

Download or read book WaCky written by Marco Baroni and published by Gedit. This book was released on 2006 with total page 238 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book Web As Corpus

Download or read book Web As Corpus written by Maristella Gatto and published by A&C Black. This book was released on 2014-02-13 with total page 250 pages. Available in PDF, EPUB and Kindle. Book excerpt: Is the internet a suitable linguistic corpus? How can we use it in corpus techniques? What are the special properties that we need to be aware of? This book answers those questions. The Web is an exponentially increasing source of language and corpus linguistics data. From gigantic static information resources to user-generated Web 2.0 content, the breadth and depth of information available is breathtaking – and bewildering. This book explores the theory and practice of the “web as corpus”. It looks at the most common tools and methods used and features a plethora of examples based on the author's own teaching experience. This book also bridges the gap between studies in computational linguistics, which emphasize technical aspects, and studies in corpus linguistics, which focus on the implications for language theory and use.

Book Corpus based Language Studies

Download or read book Corpus based Language Studies written by Tony McEnery and published by Taylor & Francis. This book was released on 2006 with total page 412 pages. Available in PDF, EPUB and Kindle. Book excerpt: Covering the major approaches to the use of corpus data, this work gathers together influential readings from leading names in the discipline, including Biber, Widdowson, Sinclair, Carter and McCarthy.

Book Corpus Linguistics for Online Communication

Download or read book Corpus Linguistics for Online Communication written by Luke Curtis Collins and published by Routledge. This book was released on 2019-02-25 with total page 206 pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus Linguistics for Online Communication provides an instructive and practical guide to conducting research using methods in corpus linguistics in studies of various forms of online communication. Offering practical exercises and drawing on original data taken from online interactions, this book: introduces the basics of corpus linguistics, including what is involved in designing and building a corpus; reviews cutting-edge studies of online communication using corpus linguistics, foregrounding different analytical components to facilitate studies in professional discourse, online learning, public understanding of health issues and dating apps; showcases both freely-available corpora and the innovative tools that students and researchers can access to carry out their own research. Corpus Linguistics for Online Communication supports researchers and students in generating high quality, applied research and is essential reading for those studying and researching in this area.

Book Quantitative Corpus Linguistics with R

Download or read book Quantitative Corpus Linguistics with R written by Stefan Th. Gries and published by Routledge. This book was released on 2009-03-04 with total page 257 pages. Available in PDF, EPUB and Kindle. Book excerpt: The first textbook of its kind, Quantitative Corpus Linguistics with R demonstrates how to use the open source programming language R for corpus linguistic analyses. Computational and corpus linguists doing corpus work will find that R provides an enormous range of functions that currently require several programs to achieve – searching and processing corpora, arranging and outputting the results of corpus searches, statistical evaluation, and graphing.

Book Developing Linguistic Corpora

Download or read book Developing Linguistic Corpora written by Martin Wynne and published by Oxbow Books Limited. This book was released on 2005 with total page 100 pages. Available in PDF, EPUB and Kindle. Book excerpt: A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Book The Oxford Handbook of the History of English

Download or read book The Oxford Handbook of the History of English written by Terttu Nevalainen (linguiste) and published by Oxford University Press. This book was released on 2016 with total page 983 pages. Available in PDF, EPUB and Kindle. Book excerpt: This ambitious handbook takes advantage of recent advances in the study of the history of English to rethink the understanding of the field.

Book A Practical Handbook of Corpus Linguistics

Download or read book A Practical Handbook of Corpus Linguistics written by Magali Paquot and published by Springer Nature. This book was released on 2021-05-04 with total page 686 pages. Available in PDF, EPUB and Kindle. Book excerpt: This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.

Book Web Corpus Construction

Download or read book Web Corpus Construction written by Roland Schäfer and published by Springer Nature. This book was released on 2022-05-31 with total page 129 pages. Available in PDF, EPUB and Kindle. Book excerpt: The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitudes the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora). For additional material please visit the companion website: sites.morganclaypool.com/wcc Table of Contents: Preface / Acknowledgments / Web Corpora / Data Collection / Post-Processing / Linguistic Processing / Corpus Evaluation and Comparison / Bibliography / Authors' Biographies

Book Corpus Linguistics  Volume 1

Download or read book Corpus Linguistics Volume 1 written by Anke Lüdeling and published by Walter de Gruyter. This book was released on 2008-12-10 with total page 797 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade. Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics. The handbook sketches the history of corpus linguistics, shows its potential, discusses its problems, and describes various methods of collecting, annotating, and searching corpora as well as processing corpus data. It also reports case studies that illustrate the wide range of linguistic research questions addressed in corpus linguistics. The over 60 articles included in the handbook are divided into five sections: (1) the origins and history of corpus linguistics and surveys of its relationship to central fields of linguistics (2) corpus compilation (3) corpus types (4) preprocessing of corpora (5) the use and exploitation of corpora. The final section gives an overview of the results of corpus studies obtained in phonetics, phonology, morphology, syntax, semantics, sociolinguistics, historical linguistics, stylometry, dialectology, and discourse analysis. It also reports on recent advances made in human and machine translation, contrastive studies, computer-assisted language learning, and automatic summarization. The contributors to the volume are internationally known experts in their respective fields. The handbook is intended for a wide audience ranging from teachers, university students, and scholars to anyone interested in the use of computers in linguistic analyses and applications.

Book Corpus Linguistics

Download or read book Corpus Linguistics written by Tony McEnery and published by Cambridge University Press. This book was released on 2011-10-06 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.

Book Corpus Linguistics and the Description ofEnglish

Download or read book Corpus Linguistics and the Description ofEnglish written by Hans Lindquist and published by Edinburgh University Press. This book was released on 2009-12-07 with total page 241 pages. Available in PDF, EPUB and Kindle. Book excerpt: A lively hands-on introduction to the use ofelectronic corpora in the description and analysis of English, this bookprovides an ideal introduction for university students of English at theintermediate level. Students planning papers, dissertations or theses willfind the book a particularly valuable guide.After introducing corpora andthe rationale and basic methodology of corpus linguistics, the authorpresents a number of case studies providing new insights into vocabulary,collocations, phraseology, metaphor and metonymy, syntactic structures, maleand female language, and language change. In a final chapter it is shown howthe web can be used as a source for linguistic investigations. Each chapterhas study questions, exercises and suggestions for further reading.Studentswill benefit from the book's*Clear language and structure *Well-definedterminology *Step-by-step instructions *Generous, up-to-date exemplificationfrom different varieties of English around the world *Accompanying web-pagewith exercises and updated information about freely accessiblecorpora.

Book Contemporary Corpus Linguistics

Download or read book Contemporary Corpus Linguistics written by Paul Baker and published by A&C Black. This book was released on 2012-03-15 with total page 370 pages. Available in PDF, EPUB and Kindle. Book excerpt: Acts as a one-volume resource, providing an introduction to every aspect of corpus linguistics as it is being used at the moment.