EBookClubs

Read Books & Download eBooks Full Online

Book Cluster Analysis for Corpus Linguistics

Download or read book Cluster Analysis for Corpus Linguistics written by Hermann Moisl and published by Walter de Gruyter GmbH & Co KG. This book was released on 2015-02-24 with total page 396 pages. Available in PDF, EPUB and Kindle. Book excerpt: The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

Book Cluster Analysis for Corpus Linguistics

Download or read book Cluster Analysis for Corpus Linguistics written by Hermann Moisl and published by Walter de Gruyter. This book was released on 2015-01-16 with total page 381 pages. Available in PDF, EPUB and Kindle. Book excerpt: The rapidly growing volume of digital natural language text and the complexity of data abstracted from it have increasingly rendered traditional corpus linguistic analytical methodology obsolete. This book describes a cluster analytic methodology for generating linguistic hypotheses on the basis of data abstracted from language corpora.

Book Cluster Analysis for Corpus Linguistics

Download or read book Cluster Analysis for Corpus Linguistics written by Hermann Moisl and published by Walter de Gruyter GmbH & Co KG. This book was released on 2015-02-24 with total page 398 pages. Available in PDF, EPUB and Kindle. Book excerpt: The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

Book Corpus Linguistics and Statistics with R

Download or read book Corpus Linguistics and Statistics with R written by Guillaume Desagulier and published by Springer. This book was released on 2017-11-17 with total page 353 pages. Available in PDF, EPUB and Kindle. Book excerpt: This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequency data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.

Book Statistics in Corpus Linguistics

Download or read book Statistics in Corpus Linguistics written by Vaclav Brezina and published by Cambridge University Press. This book was released on 2018-09-20 with total page 317 pages. Available in PDF, EPUB and Kindle. Book excerpt: A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.

Book Aggregating Dialectology, Typology, and Register Analysis

Download or read book Aggregating Dialectology, Typology, and Register Analysis written by Benedikt Szmrecsanyi and published by Walter de Gruyter GmbH & Co KG. This book was released on 2014-08-22 with total page 421 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume aims to overcome sub-disciplinary boundaries in the study of linguistic variation - be it language-internal or cross-linguistic. Even though dialectologists, register analysts, typologists, and quantitative linguists all deal with linguistic variation, there is astonishingly little interaction across these fields. But the fourteen contributions in this volume show that these subdisciplines actually share many interests and methodological concerns. The chapters specifically converge in the following ways: First, they all seek to explore linguistic variation, within or across languages. Second, they are based on usage data, that is, on corpora of (more or less) authentic text or speech of different languages or language varieties. Third, all chapters are concerned with the joint analysis (also sometimes known as “aggregation” or “data synthesis”) of multiple phenomena, features, or measurements of some sort. And lastly, the contributors all marshal quantitative analysis techniques to analyse the data. In short, the volume explores the text-feature-aggregation pipeline in variation studies, demonstrating that there is much mutual inspiration to be had by thinking outside the disciplinary box.

Book Mastering Corpus Linguistics Methods

Download or read book Mastering Corpus Linguistics Methods written by Dirk Speelman and published by John Wiley & Sons. This book was released on 2021-10-25 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book provides a hands-on introduction to qualitative and especially quantitative corpus-linguistics methods, dealing with both the conceptual and the practical side of conducting corpus-linguistic case studies. The main focus of this book is to illustrate how a wide range of research questions can be tackled with corpus linguistic methods that involve only a modest number of technical hurdles as well as to gently guide the researcher through the technicalities of some more complex methods. Methods of Corpus Linguistics is aimed at a broad audience of linguists, presenting both basic and modern methods of corpus linguistics.

Book Corpus Linguistics and the Web

Download or read book Corpus Linguistics and the Web written by Marianne Hundt and published by Rodopi. This book was released on 2007 with total page 313 pages. Available in PDF, EPUB and Kindle. Book excerpt: Using the Web as Corpus is one of the recent challenges for corpus linguistics. This volume presents a current state-of-the-art discussion of the topic. The articles address practical problems such as suitable linguistic search tools for accessing the www and the question of register variation, or they probe into methods for culling data from the web. The book also offers a wide range of case studies, covering morphology, syntax, lexis, as well as synchronic and diachronic variation in English. These case studies make use of the two approaches to the www in corpus linguistics - web-as-corpus and web-for-corpus-building. The case studies demonstrate that web data can provide useful additional evidence for a broad range of research questions.

Book A Practical Handbook of Corpus Linguistics

Download or read book A Practical Handbook of Corpus Linguistics written by Magali Paquot and published by Springer Nature. This book was released on 2021-05-04 with total page 686 pages. Available in PDF, EPUB and Kindle. Book excerpt: This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.

Book Arabic Corpus Linguistics

Download or read book Arabic Corpus Linguistics written by Tony McEnery and published by Edinburgh University Press. This book was released on 2018-05-31 with total page 272 pages. Available in PDF, EPUB and Kindle.

Book Natural Language Processing for Corpus Linguistics

Download or read book Natural Language Processing for Corpus Linguistics written by Jonathan Dunn and published by Cambridge University Press. This book was released on 2022-03-31 with total page 149 pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus analysis can be expanded and scaled up by incorporating computational methods from natural language processing. This Element shows how text classification and text similarity models can extend our ability to undertake corpus linguistics across very large corpora. These computational methods are becoming increasingly important as corpora grow too large for more traditional types of linguistic analysis. We draw on five case studies to show how and why to use computational methods, ranging from usage-based grammar to authorship analysis to using social media for corpus-based sociolinguistics. Each section is accompanied by an interactive code notebook that shows how to implement the analysis in Python. A stand-alone Python package is also available to help readers use these methods with their own data. Because large-scale analysis introduces new ethical problems, this Element pairs each new methodology with a discussion of potential ethical implications.

Book Data and Methods in Corpus Linguistics

Download or read book Data and Methods in Corpus Linguistics written by Ole Schützler and published by Cambridge University Press. This book was released on 2022-05-26 with total page 375 pages. Available in PDF, EPUB and Kindle. Book excerpt: By contrasting different approaches and datasets, this book highlights critical developments in the latest corpus-linguistic research.

Book Corpus Linguistics

Download or read book Corpus Linguistics written by Tony McEnery and published by Edinburgh University Press. This book was released on 2019-08-06 with total page 256 pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus Linguistics has quickly established itself as the leading undergraduate course book in the subject. This second edition takes full account of the latest developments in the rapidly changing field, making this the most up-to-date and comprehensive textbook available. It gives a step-by-step introduction to what a corpus is, how corpora are constructed, and what can be done with them. Each chapter ends with a section of study questions that contain practical corpus-based exercises.
* Designed for student use, with all technical terms explained in the text and referenced further in a Glossary
* Examples are taken from existing corpora; detailed case study chapter included
* Contains end-of-chapter summaries, study questions and suggestions for further reading
* Updated reviews of new studies, areas that have recently come to prominence and new directions in corpus encoding and annotation standards
* Detailed coverage of multilingual corpus construction and use
* An in-depth historical review of computer-based corpora from the 1940s to the present day
* Helpful appendices include answers to the study questions, up-to-date information on where corpora can be found, and the latest software for corpus research.
"[An] important addition to the fast growing literature in corpus linguistics... should be read by anyone interested in utilization of large-scale corpora in linguistic research." Studies in the Linguistic Sciences, on the first edition

Book Statistics for Corpus Linguistics

Download or read book Statistics for Corpus Linguistics written by Michael Oakes and published by Edinburgh University Press. This book was released on 2019-08-06 with total page 304 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book in the Edinburgh Textbooks in Empirical Linguistics series is a comprehensive introduction to the statistics currently used in corpus linguistics. Statistical techniques and corpus applications - whether oriented towards linguistics or language engineering - often go hand in glove, and corpus linguists have used an increasingly wide variety of statistics, drawing on techniques developed in a great many fields. This is the first one-volume introduction to the subject.

Book Statistics in Corpus Linguistics

Download or read book Statistics in Corpus Linguistics written by Vaclav Brezina and published by Cambridge University Press. This book was released on 2018-09-20 with total page 317 pages. Available in PDF, EPUB and Kindle. Book excerpt: Do you use language corpora in your research or study, but find that you struggle with statistics? This practical introduction will equip you to understand the key principles of statistical thinking and apply these concepts to your own research, without the need for prior statistical knowledge. The book gives step-by-step guidance through the process of statistical analysis and provides multiple examples of how statistical techniques can be used to analyse and visualise linguistic data. It also includes a useful selection of discussion questions and exercises which you can use to check your understanding. The book comes with a Companion website, which provides additional materials (answers to exercises, datasets, advanced materials, teaching slides etc.) and Lancaster Stats Tools online (http://corpora.lancs.ac.uk/stats), a free click-and-analyse statistical tool for easy calculation of the statistical measures discussed in the book.

Book Corpus Methods for Semantics

Download or read book Corpus Methods for Semantics written by Dylan Glynn and published by John Benjamins Publishing Company. This book was released on 2014-11-06 with total page 555 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume seeks to advance and popularise the use of corpus-driven quantitative methods in the study of semantics. The first part presents state-of-the-art research in polysemy and synonymy from a Cognitive Linguistic perspective. The second part presents and explains in a didactic manner each of the statistical techniques used in the first part of the volume. A handbook both for linguists working with statistics in corpus research and for linguists in the fields of polysemy and synonymy.

Book The Routledge Handbook of Corpus Linguistics

Download or read book The Routledge Handbook of Corpus Linguistics written by Anne O'Keeffe and published by Taylor & Francis. This book was released on 2022-02-08 with total page 755 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Routledge Handbook of Corpus Linguistics 2e provides an updated overview of a dynamic and rapidly growing area with a widely applied methodology. Over a decade on from the first edition of the Handbook, this collection of 47 chapters from experts in key areas offers a comprehensive introduction to both the development and use of corpora as well as their ever-evolving applications to other areas, such as digital humanities, sociolinguistics, stylistics, translation studies, materials design, language teaching and teacher development, media discourse, discourse analysis, forensic linguistics, second language acquisition and testing. The new edition updates all core chapters and includes new chapters on corpus linguistics and statistics, digital humanities, translation, phonetics and phonology, second language acquisition, social media and theoretical perspectives. Chapters provide annotated further reading lists and step-by-step guides as well as detailed overviews across a wide range of themes. The Handbook also includes a wealth of case studies that draw on some of the many new corpora and corpus tools that have emerged in the last decade. Organised across four themes, moving from the basic start-up topics such as corpus building and design to analysis, application and reflection, this second edition remains a crucial point of reference for advanced undergraduates, postgraduates and scholars in applied linguistics.