[EBOOK] Comparable Corpora And Computer Assisted Translation PDF Download

Computers

Comparable Corpora and Computer assisted Translation

Book Details:

Author : Estelle Maryline Delpech
Publisher : John Wiley & Sons
Release : 2014-07-22
ISBN : 1119002702
Pages : 221 pages

Download or read book Comparable Corpora and Computer assisted Translation written by Estelle Maryline Delpech and published by John Wiley & Sons. This book was released on 2014-07-22 with total page 221 pages. Available in PDF, EPUB and Kindle. Book excerpt: Computer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into the leveraging of comparable corpora, i.e. a set of texts, in two or more languages, which deal with the same topic but are not translations of one another. This work had two primary objectives. The first is to assess the input of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second objective is to identify bilingual-lexicon-extraction methods which best match the translators' needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations. The experiments are carried out on two language pairs (English–French and English–German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability – methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation. The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA).

Computers

Building and Using Comparable Corpora

Book Details:

Author : Serge Sharoff
Publisher : Springer Science & Business Media
Release : 2013-12-13
ISBN : 3642201288
Pages : 333 pages

Download or read book Building and Using Comparable Corpora written by Serge Sharoff and published by Springer Science & Business Media. This book was released on 2013-12-13 with total page 333 pages. Available in PDF, EPUB and Kindle. Book excerpt: The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.

Computers

Using Comparable Corpora for Under Resourced Areas of Machine Translation

Book Details:

Author : Inguna Skadiņa
Publisher : Springer
Release : 2019-02-06
ISBN : 3319990047
Pages : 323 pages

Download or read book Using Comparable Corpora for Under Resourced Areas of Machine Translation written by Inguna Skadiņa and published by Springer. This book was released on 2019-02-06 with total page 323 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.

Computers

Building and Using Comparable Corpora for Multilingual Natural Language Processing

Book Details:

Author : Serge Sharoff
Publisher : Springer Nature
Release : 2023-08-23
ISBN : 3031313844
Pages : 138 pages

Download or read book Building and Using Comparable Corpora for Multilingual Natural Language Processing written by Serge Sharoff and published by Springer Nature. This book was released on 2023-08-23 with total page 138 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.

Language Arts & Disciplines

Corpus Use and Translating

Book Details:

Author : Allison Beeby
Publisher : John Benjamins Publishing
Release : 2009-03-11
ISBN : 9027291063
Pages : 166 pages

Download or read book Corpus Use and Translating written by Allison Beeby and published by John Benjamins Publishing. This book was released on 2009-03-11 with total page 166 pages. Available in PDF, EPUB and Kindle. Book excerpt: Professional translators are increasingly dependent on electronic resources, and trainee translators need to develop skills that allow them to make the best use of these resources. The aim of this book is to show how CULT (Corpus Use for Learning to Translate) methodologies can be used to prepare learning materials, and how novice translators can become autonomous users of corpora. Readers interested in translation studies, translator training and corpus linguistics will find the book particularly useful. Not only does it include practical, technical advice for using and learning to use corpora, but it also addresses important issues such as the balance between training and education and how CULT methodologies reinforce student autonomy and responsibility. Not only is this a good introduction to CULT, but it also incorporates the latest developments in this field, showing the advantages of using these methodologies in competence-based learning.

Language Arts & Disciplines

Corpora in Translation and Contrastive Research in the Digital Age

Book Details:

Author : Julia Lavid-López
Publisher : John Benjamins Publishing Company
Release : 2021-12-15
ISBN : 9027259682
Pages : 353 pages

Download or read book Corpora in Translation and Contrastive Research in the Digital Age written by Julia Lavid-López and published by John Benjamins Publishing Company. This book was released on 2021-12-15 with total page 353 pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus-based contrastive and translation research are areas that keep evolving in the digital age, as the range of new corpus resources and tools expands, opening up to different approaches and application contexts. The current book contains a selection of papers which focus on corpora and translation research in the digital age, outlining some recent advances and explorations. After an introductory chapter which outlines language technologies applied to translation and interpreting with a view to identifying challenges and research opportunities, the first part of the book is devoted to current advances in the creation of new parallel corpora for under-researched areas, the development of tools to manage parallel corpora or as an alternative to parallel corpora, and new methodologies to improve existing translation memory systems. The contributions in the second part of the book address a number of cutting-edge linguistic issues in the area of contrastive discourse studies and translation analysis on the basis of comparable and parallel corpora in several languages such as English, German, Swedish, French, Italian, Spanish, Portuguese and Turkish, thus showcasing the richness of the linguistic diversity carried out in these recent investigations. Given the multiplicity of topics, methodologies and languages studied in the different chapters, the book will be of interest to a wide audience working in the fields of translation studies, contrastive linguistics and the automatic processing of language.

Language Arts & Disciplines

Parallel Corpora for Contrastive and Translation Studies

Book Details:

Author : Irene Doval
Publisher : John Benjamins Publishing Company
Release : 2019-03-20
ISBN : 9027262845
Pages : 313 pages

Download or read book Parallel Corpora for Contrastive and Translation Studies written by Irene Doval and published by John Benjamins Publishing Company. This book was released on 2019-03-20 with total page 313 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances in both recent developments of parallel corpora – with some particular references to comparable corpora as well– and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.

Language Arts & Disciplines

Topics in Language Resources for Translation and Localisation

Book Details:

Author : Elia Yuste Rodrigo
Publisher : John Benjamins Publishing
Release : 2008-11-12
ISBN : 9027291098
Pages : 237 pages

Download or read book Topics in Language Resources for Translation and Localisation written by Elia Yuste Rodrigo and published by John Benjamins Publishing. This book was released on 2008-11-12 with total page 237 pages. Available in PDF, EPUB and Kindle. Book excerpt: Language Resources (LRs) are sets of language data and descriptions in machine readable form, such as written and spoken language corpora, terminological databases, computational lexica and dictionaries, and linguistic software tools. Over the past few decades, mainly within research environments, LRs have been specifically used to create, optimise or evaluate natural language processing (NLP) and human language technologies (HLT) applications, including translation-related technologies. Gradually the infrastructures and exploitation tools of LRs are being perceived as core resources in the language services industries and in localisation production settings. However, some efforts ought yet to be made to raise further awareness about LRs in general, and LRs for translation and localisation in particular to a wider audience in all corners of the world. Topics in Language Resources for Translation and Localisation sets out to establish the state of the art of this ever expanding field and underscores the usefulness that LRs can potentially have in the process of creating, adapting, managing, standardising and leveraging content for more than one language and culture from various perspectives.

Language Arts & Disciplines

Computer Assisted Literary Translation

Book Details:

Author : Andrew Rothwell
Publisher : Taylor & Francis
Release : 2023-11-30
ISBN : 1000969118
Pages : 303 pages

Download or read book Computer Assisted Literary Translation written by Andrew Rothwell and published by Taylor & Francis. This book was released on 2023-11-30 with total page 303 pages. Available in PDF, EPUB and Kindle. Book excerpt: This collection surveys the state of the art of computer-assisted literary translation (CALT), making the case for its potential to enhance literary translation research and practice. The volume brings together early career and established scholars from around the world in countering prevailing notions around the challenges of effectively implementing contemporary CALT applications in literary translation practice which has traditionally followed the model of a single translator focused on a single work. The book begins by addressing key questions on the definition of literary translation, examining its sociological dimensions and individual translator perspective. Chapters explore the affordances of technological advancements and availability of new tools in such areas as post-edited machine translation (PEMT) in expanding the boundaries of what we think of when we think of literary translation, looking to examples from developments in co-translation, collaborative translation, crowd-sourced translation and fan translation. As the first book of its kind dedicated to the contribution CALT in its various forms can add to existing and future scholarship, this volume will be of interest to students and scholars in Translation Studies, especially those working in literary translation, machine translation and translation technologies.

Language Arts & Disciplines

New directions in corpus based translation studies

Book Details:

Author : Claudio Fantinuoli
Publisher : Language Science Press
Release : 2015
ISBN : 3944675835
Pages : 175 pages

Download or read book New directions in corpus based translation studies written by Claudio Fantinuoli and published by Language Science Press. This book was released on 2015 with total page 175 pages. Available in PDF, EPUB and Kindle. Book excerpt: Corpus-based translation studies has become a major paradigm and research methodology and has investigated a wide variety of topics in the last two decades. The contributions to this volume add to the range of corpus-based studies by providing examples of some less explored applications of corpus analysis methods to translation research. They show that the area keeps evolving as it constantly opens up to different frameworks and approaches, from appraisal theory to process-oriented analysis, and encompasses multiple translation settings, including (indirect) literary translation, machine (assisted)-translation and the practical work of professional legal translators. The studies included in the volume also expand the range of application of corpus applications in terms of the tools used to accomplish the research tasks outlined.

Language Arts & Disciplines

Multiword Units in Machine Translation and Translation Technology

Book Details:

Author : Ruslan Mitkov
Publisher : John Benjamins Publishing Company
Release : 2018-07-15
ISBN : 9027264201
Pages : 271 pages

Download or read book Multiword Units in Machine Translation and Translation Technology written by Ruslan Mitkov and published by John Benjamins Publishing Company. This book was released on 2018-07-15 with total page 271 pages. Available in PDF, EPUB and Kindle. Book excerpt: The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French, Romanian, German, Dutch and Croatian, among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.

Computers

Machine Learning in Translation Corpora Processing

Book Details:

Author : Krzysztof Wolk
Publisher : CRC Press
Release : 2019-02-25
ISBN : 0429588836
Pages : 209 pages

Download or read book Machine Learning in Translation Corpora Processing written by Krzysztof Wolk and published by CRC Press. This book was released on 2019-02-25 with total page 209 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based, machine translation techniques. Most popular methodologies and tools are not well-suited for the Polish language and therefore require adaptation, and language resources are lacking in parallel and monolingual data. The main objective of this volume to develop an automatic and robust Polish-to-English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora.

Language Arts & Disciplines

Corpus based Perspectives in Linguistics

Book Details:

Author : Yuji Kawaguchi
Publisher : John Benjamins Publishing
Release : 2007
ISBN : 9789027233189
Pages : 464 pages

Download or read book Corpus based Perspectives in Linguistics written by Yuji Kawaguchi and published by John Benjamins Publishing. This book was released on 2007 with total page 464 pages. Available in PDF, EPUB and Kindle. Book excerpt: UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics Research Domain, which was held on September 14, 2006. The first part consisting of eleven presentations to this workshop shows a wide range of subjects within the area of corpus-based research, such as dictionary, linguistic atlas, dialect, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions to both written and spoken corpora by the members and research assistants of UBLI.

Language Arts & Disciplines

Translation Driven Corpora

Book Details:

Author : Federico Zanettin
Publisher : Routledge
Release : 2014-04-08
ISBN : 1317639847
Pages : 205 pages

Download or read book Translation Driven Corpora written by Federico Zanettin and published by Routledge. This book was released on 2014-04-08 with total page 205 pages. Available in PDF, EPUB and Kindle. Book excerpt: Electronic texts and text analysis tools have opened up a wealth of opportunities to higher education and language service providers, but learning to use these resources continues to pose challenges to scholars and professionals alike. Translation-Driven Corpora aims to introduce readers to corpus tools and methods which may be used in translation research and practice. Each chapter focuses on specific aspects of corpus creation and use. An introduction to corpora and overview of applications of corpus linguistics methodologies to translation studies is followed by a discussion of corpus design and acquisition. Different stages and tools involved in corpus compilation and use are outlined, from corpus encoding and annotation to indexing and data retrieval, and the various methods and techniques that allow end users to make sense of corpus data are described. The volume also offers detailed guidelines for the construction and analysis of multilingual corpora. Corpus creation and use are illustrated through practical examples and case studies, with each chapter outlining a set of tasks aimed at guiding researchers, students and translators to practice some of the methods and use some of the resources discussed. These tasks are meant as hands-on activities to be carried out using the materials and links available in an accompanying DVD. Suggested further readings at the end of each chapter are complemented by an extensive bibliography at the end of the volume. Translation-Driven Corpora is designed for use by teachers and students in the classroom or by researchers and professionals for self-learning. It is an invaluable resource for anyone interested in this fast growing area of scholarly and professional activity.

Improving Statistical Machine Translation Using Comparable Corpora

Book Details:

Author : Matthew Garvey Snover
Publisher :
Release : 2010
ISBN :
Pages : pages

Download or read book Improving Statistical Machine Translation Using Comparable Corpora written by Matthew Garvey Snover and published by . This book was released on 2010 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt:

Computers

A Dictionary of Translation Technology

Book Details:

Author : Sin-wai Chan
Publisher : Chinese University Press
Release : 2004
ISBN : 9789629961480
Pages : 660 pages

Download or read book A Dictionary of Translation Technology written by Sin-wai Chan and published by Chinese University Press. This book was released on 2004 with total page 660 pages. Available in PDF, EPUB and Kindle. Book excerpt: This dictionary is intended for anyone who is interested in translation and translation technology. Especially, translation as an academic discipline, a language activity, a specialized profession, or a business undertaking. The book covers theory and practice of translation and interpretation in a number of areas. Addressing and explaining important concepts in computer translation, computer-aided translation, and translation tools. Most popular and commercially available translation software are included along with their website addresses for handy reference. This dictionary has 1,377 entries. The entries are alphabetized and defined in a simple and concise manner.

Applying Comparable Corpora to Machine Translation

Book Details:

Author : Krzysztof Wolk
Publisher : LAP Lambert Academic Publishing
Release : 2015-07-24
ISBN : 9783659762864
Pages : 212 pages

Download or read book Applying Comparable Corpora to Machine Translation written by Krzysztof Wolk and published by LAP Lambert Academic Publishing. This book was released on 2015-07-24 with total page 212 pages. Available in PDF, EPUB and Kindle. Book excerpt: The problem investigated here was how to improve statistical machine language translation between Polish and English speech. While excellent translation systems exist for many popular languages, it is fair to say that the development of such systems for Polish and English has been neglected. The most popular methodologies are not well suited for the Polish language and require adaptation. Polish language resources are lacking in parallel and monolingual data. Therefore, the main objective of the present study was to develop an automatic and robust Polish to English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora. Experiments were conducted mostly on casual human speech, consisting of lectures, movie subtitles, European Parliament proceedings, and European Medicines Agency. The aims were to rigorously analyze the various problems and to improve the quality of baseline systems, i.e., adaptation of techniques and training parameters to increase the Bilingual Evaluation Understudy (BLEU) score for maximum performance.