Download or read book LMF Lexical Markup Framework written by Gil Francopoulo and published by John Wiley & Sons. This book was released on 2013-05-06 with total page 268 pages. Available in PDF, EPUB and Kindle. Book excerpt: The community responsible for developing lexicons for Natural Language Processing (NLP) and Machine Readable Dictionaries (MRDs) started their ISO standardization activities in 2003. These activities resulted in the ISO standard – Lexical Markup Framework (LMF). After selecting and defining a common terminology, the LMF team had to identify the common notions shared by all lexicons in order to specify a common skeleton (called the core model) and understand the various requirements coming from different groups of users. The goals of LMF are to provide a common model for the creation and use of lexical resources, to manage the exchange of data between and among these resources, and to enable the merging of a large number of individual electronic resources to form extensive global electronic resources. The various types of individual instantiations of LMF can include monolingual, bilingual or multilingual lexical resources. The same specifications can be used for small and large lexicons, both simple and complex, as well as for both written and spoken lexical representations. The descriptions range from morphology, syntax and computational semantics to computer-assisted translation. The languages covered are not restricted to European languages, but apply to all natural languages. The LMF specification is now a success and numerous lexicon managers currently use LMF in different languages and contexts. This book starts with the historical context of LMF, before providing an overview of the LMF model and the Data Category Registry, which provides a flexible means for applying constants like /grammatical gender/ in a variety of different settings. It then presents concrete applications and experiments on real data, which are important for developers who want to learn about the use of LMF. Contents 1. LMF – Historical Context and Perspectives, Nicoletta Calzolari, Monica Monachini and Claudia Soria. 2. Model Description, Gil Francopoulo and Monte George. 3. LMF and the Data Category Registry: Principles and Application, Menzo Windhouwer and Sue Ellen Wright. 4. Wordnet-LMF: A Standard Representation for Multilingual Wordnets, Piek Vossen, Claudia Soria and Monica Monachini. 5. Prolmf: A Multilingual Dictionary of Proper Names and their Relations, Denis Maurel, Béatrice Bouchou-Markhoff. 6. LMF for Arabic, Aida Khemakhem, Bilel Gargouri, Kais Haddar and Abdelmajid Ben Hamadou. 7. LMF for a Selection of African Languages, Chantal Enguehard and Mathieu Mangeot. 8. LMF and its Implementation in Some Asian Languages, Takenobu Tokunaga, Sophia Y.M. Lee, Virach Sornlertlamvanich, Kiyoaki Shirai, Shu-Kai Hsieh and Chu-Ren Huang. 9. DUELME: Dutch Electronic Lexicon of Multiword Expressions, Jan Odijk. 10. UBY-LMF – Exploring the Boundaries of Language-Independent Lexicon Models, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek and Christian M. Meyer. 11. Conversion of Lexicon-Grammar Tables to LMF: Application to French, Éric Laporte, Elsa Tolone and Matthieu Constant. 12. Collaborative Tools: From Wiktionary to LMF, for Synchronic and Diachronic Language Data, Thierry Declerck, Pirsoka Lendvai and Karlheinz Mörth. 13. LMF Experiments on Format Conversions for Resource Merging: Converters and Problems, Marta Villegas, Muntsa Padró and Núria Bel. 14. LMF as a Foundation for Servicized Lexical Resources, Yoshihiko Hayashi, Monica Monachini, Bora Savas, Claudia Soria and Nicoletta Calzolari. 15. Creating a Serialization of LMF: The Experience of the RELISH Project, Menzo Windhouwer, Justin Petro, Irina Nevskaya, Sebastian Drude, Helen Aristar-Dry and Jost Gippert. 16. Global Atlas: Proper Nouns, From Wikipedia to LMF, Gil Francopoulo, Frédéric Marcoul, David Causse and Grégory Piparo. 17. LMF in U.S. Government Language Resource Management, Monte George. About the Authors Gil Francopoulo works for Tagmatica (www.tagmatica.com), a company specializing in software development in the field of linguistics and documentation in the semantic web, in Paris, France, as well as for Spotter (www.spotter.com), a company specializing in media and social media analytics.
Download or read book Linked Lexical Knowledge Bases written by Iryna Gurevych and published by Springer Nature. This book was released on 2022-06-01 with total page 124 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book conveys the fundamentals of Linked Lexical Knowledge Bases (LLKB) and sheds light on their different aspects from various perspectives, focusing on their construction and use in natural language processing (NLP). It characterizes a wide range of both expert-based and collaboratively constructed lexical knowledge bases. Only basic familiarity with NLP is required and this book has been written for both students and researchers in NLP and related fields who are interested in knowledge-based approaches to language analysis and their applications. Lexical Knowledge Bases (LKBs) are indispensable in many areas of natural language processing, as they encode human knowledge of language in machine readable form, and as such, they are required as a reference when machines attempt to interpret natural language in accordance with human perception. In recent years, numerous research efforts have led to the insight that to make the best use of available knowledge, the orchestrated exploitation of different LKBs is necessary. This allows us to not only extend the range of covered words and senses, but also gives us the opportunity to obtain a richer knowledge representation when a particular meaning of a word is covered in more than one resource. Examples where such an orchestrated usage of LKBs proved beneficial include word sense disambiguation, semantic role labeling, semantic parsing, and text classification. This book presents different kinds of automatic, manual, and collaborative linkings between LKBs. A special chapter is devoted to the linking algorithms employing text-based, graph-based, and joint modeling methods. Following this, it presents a set of higher-level NLP tasks and algorithms, effectively utilizing the knowledge in LLKBs. Among them, you will find advanced methods, e.g., distant supervision, or continuous vector space models of knowledge bases (KB), that have become widely used at the time of this book's writing. Finally, multilingual applications of LLKB's, such as cross-lingual semantic relatedness and computer-aided translation are discussed, as well as tools and interfaces for exploring LLKBs, followed by conclusions and future research directions.
Download or read book Lexical Conflict written by Danko Šipka and published by Cambridge University Press. This book was released on 2015-09-18 with total page 265 pages. Available in PDF, EPUB and Kindle. Book excerpt: The first practical study of its kind, Lexical Conflict presents a taxonomy of cross-linguistic lexical differences, with thorough discussion of zero equivalence, multiple equivalence and partial equivalence across languages. Illustrated with numerous examples taken from over one hundred world languages, this work is an exhaustive exploration of cross-linguistic and cross-cultural differences, presenting guidelines and solutions for the lexicographic treatment of these differences. The text combines theoretical and applied linguistic perspectives to create an essential guide for students, researchers and practitioners in linguistics, anthropology, cross-cultural psychology, translation, interpretation and international marketing.
Download or read book The Routledge Handbook of Lexicography written by Pedro A. Fuertes-Olivera and published by Routledge. This book was released on 2017-10-02 with total page 987 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Routledge Handbook of Lexicography provides a comprehensive overview of the major approaches to lexicography and their applications within the field. This Handbook features key case studies and cutting-edge contributions from an international range of practitioners, teachers, and researchers. Analysing the theory and practice of compiling dictionaries within the digital era, the 47 chapters address the core issues of: The foundations of lexicography, and its interactions with other disciplines including Corpus Linguistics and Information Science; Types of dictionaries, for purposes such as translation and teaching; Innovative specialised dictionaries such as the Oenolex wine dictionary and the Online Dictionary of New Zealand Sign Language; Lexicography and world languages, including Arabic, Hindi, Russian, Chinese, and Indonesian; The future of lexicography, including the use of the Internet, user participation, and dictionary portals. The Routledge Handbook of Lexicography is essential reading for researchers and students working in this area.
Download or read book Language Culture Computation Computational Linguistics and Linguistics written by Nachum Dershowitz and published by Springer. This book was released on 2014-12-05 with total page 882 pages. Available in PDF, EPUB and Kindle. Book excerpt: This Festschrift volume is published in Honor of Yaacov Choueka on the occasion of this 75th birthday. The present three-volumes liber amicorum, several years in gestation, honours this outstanding Israeli computer scientist and is dedicated to him and to his scientific endeavours. Yaacov's research has had a major impact not only within the walls of academia, but also in the daily life of lay users of such technology that originated from his research. An especially amazing aspect of the temporal span of his scholarly work is that half a century after his influential research from the early 1960s, a project in which he is currently involved is proving to be a sensation, as will become apparent from what follows. Yaacov Choueka began his research career in the theory of computer science, dealing with basic questions regarding the relation between mathematical logic and automata theory. From formal languages, Yaacov moved to natural languages. He was a founder of natural-language processing in Israel, developing numerous tools for Hebrew. He is best known for his primary role, together with Aviezri Fraenkel, in the development of the Responsa Project, one of the earliest fulltext retrieval systems in the world. More recently, he has headed the Friedberg Genizah Project, which is bringing the treasures of the Cairo Genizah into the Digital Age. This third part of the three-volume set covers a range of topics related to language, ranging from linguistics to applications of computation to language, using linguistic tools. The papers are grouped in topical sections on: natural language processing; representing the lexicon; and neologisation.
Download or read book Features written by Greville G. Corbett and published by Cambridge University Press. This book was released on 2012-10-11 with total page 341 pages. Available in PDF, EPUB and Kindle. Book excerpt: A unique examination of the features of language: how features vary between languages and also how they work.
Download or read book Towards the Multilingual Semantic Web written by Paul Buitelaar and published by Springer. This book was released on 2014-11-13 with total page 339 pages. Available in PDF, EPUB and Kindle. Book excerpt: To date, the relation between multilingualism and the Semantic Web has not yet received enough attention in the research community. One major challenge for the Semantic Web community is to develop architectures, frameworks and systems that can help in overcoming national and language barriers, facilitating equal access to information produced in different cultures and languages. As such, this volume aims at documenting the state-of-the-art with regard to the vision of a Multilingual Semantic Web, in which semantic information will be accessible in and across multiple languages. The Multilingual Semantic Web as envisioned in this volume will support the following functionalities: (1) responding to information needs in any language with regard to semantically structured data available on the Semantic Web and Linked Open Data (LOD) cloud, (2) verbalizing and accessing semantically structured data, ontologies or other conceptualizations in multiple languages, (3) harmonizing, integrating, aggregating, comparing and repurposing semantically structured data across languages and (4) aligning and reconciling ontologies or other conceptualizations across languages. The volume is divided into three main sections: Principles, Methods and Applications. The section on “Principles” discusses models, architectures and methodologies that enrich the current Semantic Web architecture with features necessary to handle multiple languages. The section on “Methods” describes algorithms and approaches for solving key issues related to the construction of the Multilingual Semantic Web. The section on “Applications” describes the use of Multilingual Semantic Web based approaches in the context of several application domains. This volume is essential reading for all academic and industrial researchers who want to embark on this new research field at the intersection of various research topics, including the Semantic Web, Linked Data, natural language processing, computational linguistics, terminology and information retrieval. It will also be of great interest to practitioners who are interested in re-examining their existing infrastructure and methodologies for handling multiple languages in Web applications or information retrieval systems.
Download or read book Multiword expressions in lexical resources written by Voula Giouli and published by Language Science Press. This book was released on 2024-06-17 with total page 372 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume contains chapters that paint the current landscape of the multiword expressions (MWE) representation in lexical resources, in view of their robust identification and computational processing. Both large-size general lexica and smaller MWE-centred ones are included, with special focus on the representation decisions and mechanisms that facilitate their usage in Natural Language Processing tasks. The presentations go beyond the morpho-syntactic description of MWEs, into their semantics. One challenge in representing MWEs in lexical resources is ensuring that the variability along with extra features required by the different types of MWEs can be captured efficiently. In this respect, recommendations for representing MWEs in mono- and multilingual computational lexicons have been proposed; these focus mainly on the syntactic and semantic properties of support verbs and noun compounds and their proper encoding thereof.
Download or read book The Swedish FrameNet written by Dana Dannélls and published by John Benjamins Publishing Company. This book was released on 2021-11-26 with total page 349 pages. Available in PDF, EPUB and Kindle. Book excerpt: Large computational lexicons are central NLP resources. Swedish FrameNet++ aims to be a versatile full-scale lexical resource for NLP containing many kinds of linguistic information. Although focused on Swedish, this ongoing effort, which includes building a new Swedish framenet and recycling existing lexicons, has offered valuable insights into general aspects of lexical-resource building for NLP, which are discussed in this book: computational and linguistic problems of lexical semantics and lexical typology, the nature of lexical items (words and multiword expressions), achieving interoperability among heterogeneous lexical content, NLP methods for extending and interlinking existing lexicons, and deploying the new resource in practical NLP applications. This book is targeted at everyone with an interest in lexicography, computational lexicography, lexical typology, lexical semantics, linguistics, computational linguistics and related fields. We believe it should be of particular interest to those who are or have been involved in language resource creation, development and evaluation.
Download or read book The Language Grid written by Toru Ishida and published by Springer Science & Business Media. This book was released on 2011-07-29 with total page 303 pages. Available in PDF, EPUB and Kindle. Book excerpt: There is increasing interaction among communities with multiple languages, thus we need services that can effectively support multilingual communication. The Language Grid is an initiative to build an infrastructure that allows end users to create composite language services for intercultural collaboration. The aim is to support communities to create customized multilingual environments by using language services to overcome local language barriers. The stakeholders of the Language Grid are the language resource providers, the language service users, and the language grid operators who coordinate the former. This book includes 18 chapters in six parts that summarize various research results and associated development activities on the Language Grid. The chapters in Part I describe the framework of the Language Grid, i.e., service-oriented collective intelligence, used to bridge providers, users and operators. Two kinds of software are introduced, the service grid server software and the Language Grid Toolbox, and code for both is available via open source licenses. Part II describes technologies for service workflows that compose atomic language services. Part III reports on research work and activities relating to sharing and using language services. Part IV describes various applications of language services as applicable to intercultural collaboration. Part V contains reports on applying the Language Grid for translation activities, including localization of industrial documents and Wikipedia articles. Finally, Part VI illustrates how the Language Grid can be connected to other service grids, such as DFKI's Heart of Gold and smart classroom services in Tsinghua University in Beijing. The book will be valuable for researchers in artificial intelligence, natural language processing, services computing and human--computer interaction, particularly those who are interested in bridging technologies and user communities.
Download or read book Linguistic Linked Data written by Philipp Cimiano and published by Springer Nature. This book was released on 2020-01-13 with total page 289 pages. Available in PDF, EPUB and Kindle. Book excerpt: This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, and so making more of the single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources. Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.
Download or read book Language technologies for a multilingual Europe written by Georg Rehm and published by Language Science Press. This book was released on 2018-06-19 with total page 220 pages. Available in PDF, EPUB and Kindle. Book excerpt: This volume of the series “Translation and Multilingual Natural Language Processing” includes most of the papers presented at the Workshop “Language Technology for a Multilingual Europe”, held at the University of Hamburg on September 27, 2011 in the framework of the conference GSCL 2011 with the topic “Multilingual Resources and Multilingual Applications”, along with several additional contributions. In addition to an overview article on Machine Translation and two contributions on the European initiatives META-NET and Multilingual Web, the volume includes six full research articles. Our intention with this workshop was to bring together various groups concerned with the umbrella topics of multilingualism and language technology, especially multilingual technologies. This encompassed, on the one hand, representatives from research and development in the field of language technologies, and, on the other hand, users from diverse areas such as, among others, industry, administration and funding agencies. The Workshop “Language Technology for a Multilingual Europe” was co-organised by the two GSCL working groups “Text Technology” and “Machine Translation” (http://gscl.info) as well as by META-NET (http://www.meta-net.eu).
Download or read book Human Language Technology Challenges of the Information Society written by Zygmunt Vetulani and published by Springer Science & Business Media. This book was released on 2009-09-07 with total page 486 pages. Available in PDF, EPUB and Kindle. Book excerpt: Half a centuryago not manypeople had realizedthat a new epoch in the history of homo sapiens had just started. The term “Information Society Age” seems an appropriate name for this epoch. Communication was without a doubt a lever of the conquest of the human race over the rest of the animate world. There is little doubt that the human racebegan when our predecessorsstarted to communicate with each other using language.This highly abstractmeans of communicationwas probably one of the major factors contributing to the evolutionary success of the human race within the animal world. Physically weak and imperfect, humans started to dominate the rest of the world through the creation of communication-based societies where individuals communicated initially to satisfy immediate needs, and then to create, accumulate and process knowledge for future use. The crucial step in the history of humanity was the invention of writing. It is worth noting that writing is a human invention, not a phenomenon resulting from natural evolution. Humans invented writing as a technique for recording speech as well as for storing and facilitating the dissemination of knowledge across the world. Humans continue to be born illiterate, and therefore teaching and conscious supervised learning is necessary to maintain this basic social skill.
Download or read book Linguistic Modeling of Information and Markup Languages written by Andreas Witt and published by Springer Science & Business Media. This book was released on 2010-01-09 with total page 272 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book covers recent developments in the field, from multi-layered mark-up and standards to theoretical formalisms to applications. It presents results from international research in text technology, computational linguistics, hypertext modeling and more.
Download or read book Essential Speech and Language Technology for Dutch written by Peter Spyns and published by Springer Science & Business Media. This book was released on 2013-02-26 with total page 414 pages. Available in PDF, EPUB and Kindle. Book excerpt: The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, a description of the resources (data and tools) for Dutch that have been created are now available for both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text to speech to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions of almost all the HLT research groups in Flanders and the Netherlands, hence offers a view of their recent research activities. Targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. It concerns researchers active in larger networks such as the CLARIN, META-NET, FLaReNet and participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICling, LREC, CLIN and DIR ( both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are interesting for human language technology policy makers and even for science policy makers in general.
Download or read book Knowledge Science Engineering and Management written by Robert Buchmann and published by Springer. This book was released on 2014-10-10 with total page 407 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the 7th International Conference on Knowledge Science, Engineering and Management, KSEM 2014, held in Sibiu, Romania, in October 2014. The 30 revised full papers presented together with 5 short papers and 3 keynotes were carefully selected and reviewed from 77 submissions. The papers are organized in topical sections on formal semantics; content and document analysis; concept and lexical analysis; clustering and classification; metamodeling and conceptual modeling; enterprise knowledge; knowledge discovery and retrieval; formal knowledge processing; ontology engineering and management; knowledge management; and hybrid knowledge systems.
Download or read book Semi Automatic Ontology Development Processes and Resources written by Pazienza, Maria Teresa and published by IGI Global. This book was released on 2012-02-29 with total page 341 pages. Available in PDF, EPUB and Kindle. Book excerpt: "This book includes state-of-the-art research results aimed at the automation of ontology development processes and the reuse of external resources becoming a reality, thus being of interest for a wide and diversified community of users"--