EBookClubs

Read Books & Download eBooks Full Online

Book Discourse Cohesion in Chinese-English Statistical Machine Translation

Download or read book Discourse Cohesion in Chinese-English Statistical Machine Translation written by David Steele. This book was released on 2019. Available in PDF, EPUB and Kindle. Book excerpt:

Book Discourse-aware Neural Machine Translation

Download or read book Discourse-aware Neural Machine Translation written by Longyue Wang. This book was released on 2019. Available in PDF, EPUB and Kindle. Book excerpt: Machine translation (MT) models usually translate a text by considering isolated sentences based on a strict assumption that the sentences in a text are independent of one another. However, it is a truism that texts have properties of connectedness that go beyond those of their individual sentences. Disregarding dependencies across sentences will harm translation quality especially in terms of coherence, cohesion, and consistency. Previously, some discourse-aware approaches have been investigated for conventional statistical machine translation (SMT). However, this is a serious obstacle for the state-of-the-art neural machine translation (NMT), which recently has surpassed the performance of SMT. In this thesis, we try to incorporate useful discourse information for enhancing NMT models. More specifically, we conduct research on two main parts: 1) exploiting novel document-level NMT architecture; and 2) dealing with a specific discourse phenomenon for translation models. Firstly, we investigate the influence of historical contextual information on the performance of NMT models. A cross-sentence context-aware NMT model is proposed to consider the influence of previous sentences in the same document. Specifically, this history is summarized using an additional hierarchical encoder. The historical representations are then integrated into the standard NMT model using different strategies. Experimental results on a Chinese-English document-level translation task show that the approach significantly improves upon a strong attention-based NMT system by up to +2.1 BLEU points. In addition, analysis and comparison also give insightful discussions and conclusions for this research direction.
Secondly, we explore the impact of discourse phenomena on the performance of MT. In this thesis, we focus on the phenomenon of pronoun-dropping (pro-drop), where, in pro-drop languages, pronouns can be omitted when it is possible to infer the referent from the context. As the data for training a dropped pronoun (DP) generator is scarce, we propose to automatically annotate DPs using alignment information from a large parallel corpus. We then introduce a hybrid approach: building a neural-based DP generator and integrating it into the SMT model. Experimental results on both Chinese-English and Japanese-English translation tasks demonstrate that our approach achieves a significant improvement of up to +1.58 BLEU points with 66% F-score for DP generation accuracy. Motivated by this promising result, we further exploit the DP translation approach for advanced NMT models. A novel reconstruction-based model is proposed to reconstruct the DP-annotated source sentence from the hidden states of either encoder or decoder, or both components. Experimental results on the same translation tasks show that the proposed approach significantly and consistently improves translation performance over a strong NMT baseline, which is trained on DP-annotated parallel data. To avoid the errors propagated from an external DP prediction model, we finally investigate an end-to-end DP translation model. Specifically, we improve the reconstruction-based model from three perspectives. We first employ a shared reconstructor to better exploit encoder and decoder representations. Secondly, we propose to jointly learn to translate and predict DPs. In order to capture discourse information for DP prediction, we finally combine the hierarchical encoder with the DP translation model. Experimental results on the same translation tasks show that our approach significantly improves both translation performance and DP prediction accuracy.
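The excerpt above mentions annotating dropped pronouns (DPs) automatically from alignment information in a parallel corpus. As a rough illustration only, the Python sketch below projects unaligned target-side pronouns back onto the source sentence. It is not the thesis's actual procedure: the function name `annotate_dropped_pronouns`, the `<DP:...>` marker format, the small pronoun list, and the placement heuristic (insert before the source word aligned to the next aligned target word) are all assumptions made for this example.

```python
# Hypothetical sketch: project unaligned target pronouns onto the source side.
# The pronoun list, marker format, and placement rule are illustrative assumptions.
PRONOUNS = {"i", "you", "he", "she", "it", "we", "they"}


def annotate_dropped_pronouns(src, tgt, alignment):
    """Return a copy of `src` with <DP:pronoun> markers inserted.

    src, tgt  -- lists of tokens (source and target sentence)
    alignment -- list of (source_index, target_index) word-alignment links
    """
    # Map each aligned target position to the source positions it links to.
    tgt_to_src = {}
    for s, t in alignment:
        tgt_to_src.setdefault(t, []).append(s)

    inserts = []  # (source position, marker) pairs
    for j, word in enumerate(tgt):
        # A target pronoun with no alignment link is treated as "dropped" in the source.
        if word.lower() in PRONOUNS and j not in tgt_to_src:
            # Place the marker before the source word aligned to the next
            # aligned target word; fall back to the end of the sentence.
            pos = len(src)
            for k in range(j + 1, len(tgt)):
                if k in tgt_to_src:
                    pos = min(tgt_to_src[k])
                    break
            inserts.append((pos, f"<DP:{word.lower()}>"))

    out = list(src)
    # Insert from right to left so earlier positions stay valid.
    for pos, marker in sorted(inserts, key=lambda x: x[0], reverse=True):
        out.insert(pos, marker)
    return out


src = ["喜欢", "这", "本", "书", "吗"]
tgt = ["Do", "you", "like", "this", "book", "?"]
links = [(0, 2), (1, 3), (3, 4)]
print(annotate_dropped_pronouns(src, tgt, links))
```

In this toy sentence pair, "you" has no alignment link, so a marker is placed before the source word aligned to "like". A real pipeline would need an actual word aligner and a more careful placement strategy.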

Book Natural Language Processing and Chinese Computing

Download or read book Natural Language Processing and Chinese Computing written by Juanzi Li and published by Springer. This book was released on 2015-10-07 with total page 612 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the 4th CCF Conference, NLPCC 2015, held in Nanchang, China, in October 2015. The 35 revised full papers presented together with 22 short papers were carefully reviewed and selected from 238 submissions. The papers are organized in topical sections on fundamentals on language computing; applications on language computing; NLP for search technology and ads; web mining; knowledge acquisition and information extraction.

Book New perspectives on cohesion and coherence

Download or read book New perspectives on cohesion and coherence written by Katrin Menzel and published by Language Science Press. This book was released on 2017-06-23 with total page 168 pages. Available in PDF, EPUB and Kindle. Book excerpt: The contributions to this volume investigate relations of cohesion and coherence as well as instantiations of discourse phenomena and their interaction with information structure in multilingual contexts. Some contributions concentrate on procedures to analyze cohesion and coherence from a corpus-linguistic perspective. Others have a particular focus on textual cohesion in parallel corpora that include both originals and translated texts. Additionally, the papers in the volume discuss the nature of cohesion and coherence with implications for human and machine translation. The contributors are experts on discourse phenomena and textuality who address these issues from an empirical perspective. The chapters in this volume are grounded in the latest research making this book useful to both experts of discourse studies and computational linguistics, as well as advanced students with an interest in these disciplines. We hope that this volume will serve as a catalyst to other researchers and will facilitate further advances in the development of cost-effective annotation procedures, the application of statistical techniques for the analysis of linguistic phenomena and the elaboration of new methods for data interpretation in multilingual corpus linguistics and machine translation.

Book Empirical Studies of Translation and Interpreting

Download or read book Empirical Studies of Translation and Interpreting written by Caiwen Wang and published by Routledge. This book was released on 2021-05-30 with total page 206 pages. Available in PDF, EPUB and Kindle. Book excerpt: This edited book is a collection of the latest empirical studies of translation and interpreting (T&I) from the post-structuralist perspective. The contributors are professors, readers, senior lecturers, lecturers, and research students from an international context. The contributions are characterised by five themes: intervention in T&I; process of T&I; product of T&I; T&I and technology; and T&I education. These up-to-date topics are reflective of the shift in attitudes that is being witnessed as a new generation of translation scholars rejects the subjective assertions of previous generations, in favour of an altogether more rigorous approach. The book will notably contribute to the development of T&I and enhance our knowledge of the areas. It will be a useful reference for academics, postgraduate research students, and professional translators and interpreters. The book will also play a role in proposing practical and empirically based ways of training for universities and the industry, so as to overcome traditional barriers to translation and interpreting learning. The book will additionally provide reference material for relevant professional bodies.

Book Chinese-English Statistical Machine Translation by Parsing

Download or read book Chinese-English Statistical Machine Translation by Parsing written by Yue Zhang (M.Sc.). This book was released on 2006 with total page 86 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book Linguistically Motivated Statistical Machine Translation

Download or read book Linguistically Motivated Statistical Machine Translation written by Deyi Xiong and published by Springer. This book was released on 2015-02-11 with total page 159 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.

Book Text, Speech and Dialogue

Download or read book Text, Speech and Dialogue written by Petr Sojka and published by Springer. This book was released on 2014-09-01 with total page 623 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the 17th International Conference on Text, Speech and Dialogue, TSD 2014, held in Brno, Czech Republic, in September 2014. The 70 papers presented together with 3 invited papers were carefully reviewed and selected from 143 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.

Book Discourse in Statistical Machine Translation

Download or read book Discourse in Statistical Machine Translation written by Christian Hardmeier. This book was released on 2014-09-08. Available in PDF, EPUB and Kindle. Book excerpt:

Book Combining Linguistics and Statistics for High-quality Limited Domain English-Chinese Machine Translation

Download or read book Combining Linguistics and Statistics for High-quality Limited Domain English-Chinese Machine Translation written by Yushi Xu (Ph. D.). This book was released on 2008 with total page 93 pages. Available in PDF, EPUB and Kindle. Book excerpt: Second language learning is a compelling activity in today's global markets. This thesis focuses on critical technology necessary to produce a computer spoken translation game for learning Mandarin Chinese in a relatively broad travel domain. Three main aspects are addressed: efficient Chinese parsing, high-quality English-Chinese machine translation, and how these technologies can be integrated into a translation game system. In the language understanding component, the TINA parser is enhanced with bottom-up and long distance constraint features. The results showed that with these features, the Chinese grammar ran ten times faster and covered 15% more of the test set. In the machine translation component, a method combining linguistic and statistical systems is introduced. The English-Chinese translation is done via an intermediate language "Zhonglish", where the English-Zhonglish translation is accomplished by a parse-and-paraphrase paradigm using hand-coded rules, mainly for structural reconstruction. Zhonglish-Chinese translation is accomplished by a standard phrase-based statistical machine translation system, mostly accomplishing word sense disambiguation and lexicon mapping. We evaluated on an independent test set from the IWSLT travel-domain spoken language corpus. Substantial improvements were achieved for GIZA alignment crossover: we obtained a 45% decrease in crossovers compared to a traditional phrase-based statistical MT system. Furthermore, the BLEU score improved by 2 points.
Finally, a framework of the translation game system is described, and the feasibility of integrating the components to produce reference translations and to automatically assess students' translations is verified.
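The excerpt above reports a reduction in alignment "crossovers" without defining the measure. One plausible reading, assumed here purely for illustration, is the number of pairs of alignment links that cross each other, that is, links whose source order and target order disagree. The helper name `count_crossovers` and the link representation as `(source_index, target_index)` tuples are hypothetical and not taken from the thesis or from GIZA.

```python
from itertools import combinations


def count_crossovers(alignment):
    """Count pairs of alignment links that cross each other.

    alignment -- list of (source_index, target_index) links.
    Two links cross when one comes earlier on the source side
    but later on the target side.
    """
    return sum(
        1
        for (s1, t1), (s2, t2) in combinations(alignment, 2)
        if (s1 - s2) * (t1 - t2) < 0
    )


monotone = [(0, 0), (1, 1), (2, 2)]   # same word order on both sides
reversed_ = [(0, 2), (1, 1), (2, 0)]  # fully inverted word order
print(count_crossovers(monotone), count_crossovers(reversed_))
```

Under this reading, a source-side reordering step that brings the word order closer to the target language would lower the count, which is consistent with the role the excerpt gives to the intermediate "Zhonglish" stage.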

Book Statistical Language and Speech Processing

Download or read book Statistical Language and Speech Processing written by Adrian-Horia Dediu and published by Springer. This book was released on 2013-07-24 with total page 319 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the First International Conference on Statistical Language and Speech Processing, SLSP 2013, held in Tarragona, Spain, in July 2013. The 24 full papers presented together with two invited talks were carefully reviewed and selected from 61 submissions. The papers cover a wide range of topics in the fields of computational language and speech processing and the statistical methods that are currently in use.

Book Machine Translation Through Clausal Syntax

Download or read book Machine Translation Through Clausal Syntax written by Dan Lowe Wheeler. This book was released on 2008 with total page 293 pages. Available in PDF, EPUB and Kindle. Book excerpt: Language pairs such as Chinese and English with largely differing word order have proved to be one of the greatest challenges in statistical machine translation. One reason is that such techniques usually work with sentences as flat strings of words, rather than explicitly attempting to parse any sort of hierarchical structural representation. Because even simple syntactic differences between languages can quickly lead to a universe of idiosyncratic surface level word reordering rules, many believe the near future of machine translation will lie heavily in syntactic modeling. The time to start may be now: advances in statistical parsing over the last decade have already started opening the door. Following the work of Cowan et al., I present a statistical tree-to-tree translation system for Chinese to English that formulates the translation step as a prediction of English clause structure from Chinese clause structure. Chinese sentences are segmented and parsed, split into clauses, and independently translated into English clauses using a discriminative feature based model. Clausal arguments, such as subject and object, are translated separately using an off-the-shelf phrase-based translator. By explicitly modeling syntax at a clausal level, but using a phrase-based (flat-sentence) method on local, reduced expressions, such as clausal arguments, I aim to address the current weakness in long-distance word reordering while still leveraging the excellent local translations that today's state of the art has to offer.

Book Translation Metaphorical Technical Terms from English Into Mandarin Chinese

Download or read book Translation Metaphorical Technical Terms from English Into Mandarin Chinese written by Riccardo Superbo. This book was released on 2015. Available in PDF, EPUB and Kindle. Book excerpt:

Book Handbook of Natural Language Processing

Download or read book Handbook of Natural Language Processing written by Nitin Indurkhya and published by CRC Press. This book was released on 2010-02-22 with total page 704 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Handbook of Natural Language Processing, Second Edition presents practical tools and techniques for implementing natural language processing in computer systems. Along with removing outdated material, this edition updates every chapter and expands the content to include emerging areas, such as sentiment analysis. New to the Second Edition: Greater

Book A Systemic Functional Grammar of Chinese Nominal Groups

Download or read book A Systemic Functional Grammar of Chinese Nominal Groups written by Jing Fang and published by Springer Nature. This book was released on 2022-09-13 with total page 259 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book describes the grammar of Chinese nominal groups for the purpose of text analysis, drawing upon Halliday’s systemic functional linguistics (SFL) model. Exploring the metafunctional grammatical resources in nominal groups, the book provides a new perspective on conducting text analysis by focusing on the metafunctions performed by various elements in the nominal group. The observations on nominal groups presented here are based on both a working corpus of 180 texts of various types and a large referential corpus of over 16 billion tokens. With clear descriptions of the terminology used, the book presents a case study at the end of each major chapter, which demonstrates how the grammatical resources discussed can be applied to the delicate analysis of authentic texts. This monograph is more than a grammar book, for it offers a new way to engage with a text microscopically and enables readers to approach and analyse a text by focusing on grammatical units below the clause level. The book provides an accessible and valuable resource for readers who are interested in SFL-based typological description, text analysis, translation studies between English and Chinese, English–Chinese comparative linguistic studies, and Chinese language teaching and learning.