Download or read book Exploiting Semantic Web Knowledge Graphs in Data Mining written by P. Ristoski and published by IOS Press. This book was released on 2019-06-28 with total page 246 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data Mining and Knowledge Discovery in Databases (KDD) is a research field concerned with deriving higher-level insights from data. The tasks performed in this field are knowledge intensive and can benefit from additional knowledge from various sources, so many approaches have been proposed that combine Semantic Web data with the data mining and knowledge discovery process. This book, Exploiting Semantic Web Knowledge Graphs in Data Mining, aims to show that Semantic Web knowledge graphs are useful for generating valuable data mining features that can be used in various data mining tasks. In Part I, Mining Semantic Web Knowledge Graphs, the author evaluates unsupervised feature generation strategies from types and relations in knowledge graphs used in different data mining tasks such as classification, regression, and outlier detection. Part II, Semantic Web Knowledge Graphs Embeddings, proposes an approach that circumvents the shortcomings introduced with the approaches in Part I, developing an approach that is able to embed complete Semantic Web knowledge graphs in a low dimensional feature space where each entity and relation in the knowledge graph is represented as a numerical vector. Finally, Part III, Applications of Semantic Web Knowledge Graphs, describes a list of applications that exploit Semantic Web knowledge graphs like classification and regression, showing that the approaches developed in Part I and Part II can be used in applications in various domains. The book will be of interest to all those working in the field of data mining and KDD.
Download or read book Exploiting Linked Data and Knowledge Graphs in Large Organisations written by Jeff Z. Pan and published by Springer. This book was released on 2017-01-24 with total page 281 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book addresses the topic of exploiting enterprise-linked data with a particular focus on knowledge construction and accessibility within enterprises. It identifies the gaps between the requirements of enterprise knowledge consumption and “standard” data consuming technologies by analysing real-world use cases, and proposes the enterprise knowledge graph to fill such gaps. It provides concrete guidelines for effectively deploying linked-data graphs within and across business organizations. It is divided into three parts, focusing on the key technologies for constructing, understanding and employing knowledge graphs. Part 1 introduces basic background information and technologies, and presents a simple architecture to elucidate the main phases and tasks required during the lifecycle of knowledge graphs. Part 2 focuses on technical aspects; it starts with state-of-the art knowledge-graph construction approaches, and then discusses exploration and exploitation techniques as well as advanced question-answering topics concerning knowledge graphs. Lastly, Part 3 demonstrates examples of successful knowledge graph applications in the media industry, healthcare and cultural heritage, and offers conclusions and future visions.
Download or read book Knowledge Graphs written by Aidan Hogan and published by Morgan & Claypool Publishers. This book was released on 2021-11-08 with total page 257 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book provides a comprehensive and accessible introduction to knowledge graphs, which have recently garnered notable attention from both industry and academia. Knowledge graphs are founded on the principle of applying a graph-based abstraction to data, and are now broadly deployed in scenarios that require integrating and extracting value from multiple, diverse sources of data at large scale. The book defines knowledge graphs and provides a high-level overview of how they are used. It presents and contrasts popular graph models that are commonly used to represent data as graphs, and the languages by which they can be queried before describing how the resulting data graph can be enhanced with notions of schema, identity, and context. The book discusses how ontologies and rules can be used to encode knowledge as well as how inductive techniques—based on statistics, graph analytics, machine learning, etc.—can be used to encode and extract knowledge. It covers techniques for the creation, enrichment, assessment, and refinement of knowledge graphs and surveys recent open and enterprise knowledge graphs and the industries or applications within which they have been most widely adopted. The book closes by discussing the current limitations and future directions along which knowledge graphs are likely to evolve. This book is aimed at students, researchers, and practitioners who wish to learn more about knowledge graphs and how they facilitate extracting value from diverse data at large scale. To make the book accessible for newcomers, running examples and graphical notation are used throughout. Formal definitions and extensive references are also provided for those who opt to delve more deeply into specific topics.
Download or read book Geographic Knowledge Graph Summarization written by B. Yan and published by IOS Press. This book was released on 2019-08-08 with total page 170 pages. Available in PDF, EPUB and Kindle. Book excerpt: Geographic knowledge graphs can have an important role in delivering interoperability, accessibility and the demands of conceptualization in geographic information science (GIS). However, the massive amount of accompanying information and the enormous diversity of geographic knowledge graphs limits their applicability and hinders the widespread adoption of this useful structured knowledge. This book, Geographic Knowledge Graph Summarization, focuses on the ways in which geographic knowledge graphs can be digested and summarized. Such a summarization would relieve the burden of information overload for end users and reduce data storage, as well as speeding up queries and eliminating ‘noise’. The book introduces the general concept of geospatial inductive bias and explains the different ways in which this idea can be used in the summarization of geographic knowledge graphs. The book breaks up the task of summarization into separate but related components, and after an introduction and a brief overview of concepts and theories, Chapters 3, 4 and 5 explore hierarchical place type structure, multimedia leaf nodes, and general relation and entity components respectively. Chapter 6 presents a spatial knowledge map interface which illustrates the effectiveness of summarization. The book integrates top-down knowledge engineering and bottom-up knowledge learning methods, and will do much to promote awareness of this fascinating area and related issues.
Download or read book Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, Applications and Challenges written by I. Tiddi and published by IOS Press. This book was released on 2020-05-06 with a total of 314 pages. Available in PDF, EPUB and Kindle. Book excerpt: The latest advances in Artificial Intelligence, and in (deep) Machine Learning in particular, have revealed a major drawback of modern intelligent systems, namely the inability to explain their decisions in a way that humans can easily understand. While eXplainable AI rapidly became an active area of research in response to this need for improved understandability and trustworthiness, the field of Knowledge Representation and Reasoning (KRR) has, on the other hand, a long-standing tradition of managing information in a symbolic, human-understandable form. This book provides the first comprehensive collection of research contributions on the role of knowledge graphs for eXplainable AI (KG4XAI), and the papers included here present academic and industrial research focused on the theory, methods and implementations of AI systems that use structured knowledge to generate reliable explanations. Introductory material on knowledge graphs is included for those readers with only a minimal background in the field, as well as specific chapters devoted to advanced methods, applications and case studies that use knowledge graphs as a part of knowledge-based, explainable systems (KBX-systems). The final chapters explore current challenges and future research directions in the area of knowledge graphs for eXplainable AI. The book not only provides a scholarly, state-of-the-art overview of research in this subject area, but also fosters the hybrid combination of symbolic and subsymbolic AI methods, and will be of interest to all those working in the field.
Download or read book Semantic AI in Knowledge Graphs written by Sanju Tiwari and published by CRC Press. This book was released on 2023-08-21 with a total of 217 pages. Available in PDF, EPUB and Kindle. Book excerpt: Existing research papers do not cover Semantic AI in knowledge graphs completely or in depth. This book provides the basic information required to gain in-depth knowledge of the field. It covers neuro-symbolic AI, explainable AI and deep learning for knowledge discovery and mining, and knowledge representation and reasoning.
Download or read book Mining Authoritativeness in Art Historical Photo Archives written by M. Daquino and published by IOS Press. This book was released on 2019-09-04 with total page 230 pages. Available in PDF, EPUB and Kindle. Book excerpt: In the course of their research, art historians frequently need to refer to historical photo archives when attempting to authenticate works of art. This book, Mining Authoritativeness in Art Historical Photo Archives, provides an aid to retrieving relevant sources and assessing the textual authoritativeness – the internal grounds – of sources of attribution, and to evaluating the authoritativeness of cited scholars. The book aims to do three things: facilitate knowledge discovery in art historical photo archives, support users’ decision-making processes when evaluating contradictory attributions, and provide policies to improve the quality of information in art historical photo archives. The author’s approach is to leverage Semantic Web technologies in order to aggregate, assess, and recommend the most documented authorship attributions. At the same time, the retrieval process allows the providers of art historical data to define a low-cost data integration process with which to update and enrich their collection data. This conceptual framework for assessing questionable information will also be of value to those working in a number of other fields, such as archives, museums, and libraries, as well as to art historians.
Download or read book Machine Learning for Data Science Handbook written by Lior Rokach and published by Springer Nature. This book was released on 2023-08-17 with total page 975 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book organizes key concepts, theories, standards, methodologies, trends, challenges and applications of data mining and knowledge discovery in databases. It first surveys, then provides comprehensive yet concise algorithmic descriptions of methods, including classic methods plus the extensions and novel methods developed recently. It also gives in-depth descriptions of data mining applications in various interdisciplinary industries.
Download or read book Engineering Background Knowledge for Social Robots written by L. Asprino and published by IOS Press. This book was released on 2020-09-25 with total page 240 pages. Available in PDF, EPUB and Kindle. Book excerpt: Social robots are embodied agents that perform knowledge-intensive tasks involving several kinds of information from different heterogeneous sources. This book, Engineering Background Knowledge for Social Robots, introduces a component-based architecture for supporting the knowledge-intensive tasks performed by social robots. The design was based on the requirements of a real socially-assistive robotic application, and all the components contribute to and benefit from the knowledge base which is its cornerstone. The knowledge base is structured by a set of interconnected and modularized ontologies which model the information, and is initially populated with linguistic, ontological and factual knowledge retrieved from Linked Open Data. Access to the knowledge base is guaranteed by Lizard, a tool providing software components, with an API for accessing facts stored in the knowledge base in a programmatic and object-oriented way. The author introduces two methods for engineering the knowledge needed by robots, a novel method for automatically integrating knowledge from heterogeneous sources with a frame-driven approach, and a novel empirical method for assessing foundational distinctions over Linked Open Data entities from a common-sense perspective. These effectively enable the evolution of the robot’s knowledge by automatically integrating information derived from heterogeneous sources and the generation of common-sense knowledge using Linked Open Data as an empirical basis. The feasibility and benefits of the architecture have been assessed through a prototype deployed in a real socially-assistive scenario, and the book presents two applications and the results of a qualitative and quantitative evaluation.
Download or read book Strategies and Techniques for Federated Semantic Knowledge Integration and Retrieval written by D. Collarana and published by IOS Press. This book was released on 2020-01-24 with a total of 158 pages. Available in PDF, EPUB and Kindle. Book excerpt: The vast amount of data available on the web has led to the need for effective retrieval techniques to transform that data into usable machine knowledge. But the creation of integrated knowledge, especially knowledge about the same entity from different web data sources, is a challenging task requiring the solving of interoperability problems. This book addresses the problem of knowledge retrieval and integration from heterogeneous web sources, and proposes a holistic semantic knowledge retrieval and integration approach to creating knowledge graphs on-demand from diverse web sources. Semantic Web Technologies have evolved as a novel approach to tackle the problem of knowledge integration from heterogeneous data, but because of the Extraction-Transformation-Load approach that dominates the process, knowledge retrieval and integration from web data sources is either expensive, or full physical integration of the data is impeded by restricted access. Representing data from web sources as pieces of knowledge belonging to the same entity, which can then be synthesized as a knowledge graph, helps to solve interoperability conflicts and allows for a more cost-effective integration approach, providing a method that enables the creation of valuable insights from heterogeneous web data. Empirical evaluations of this holistic approach provide evidence that the methodology and techniques proposed in this book help to effectively integrate the disparate knowledge spread over heterogeneous web data sources, and the book also demonstrates how three domain applications, in law enforcement, job market analysis, and manufacturing, have been developed and managed using the approach.
Download or read book Neural Generation of Textual Summaries from Knowledge Base Triples written by P. Vougiouklis and published by IOS Press. This book was released on 2020-04-07 with a total of 174 pages. Available in PDF, EPUB and Kindle. Book excerpt: Most people need textual or visual interfaces to help them make sense of Semantic Web data. In this book, the author investigates the problems associated with generating natural language summaries for structured data encoded as triples using deep neural networks. An end-to-end trainable architecture is proposed, which encodes the information from a set of knowledge graph triples into a vector of fixed dimensionality, and generates a textual summary by conditioning the output on this encoded vector. Different methodologies for building the required data-to-text corpora are explored to train and evaluate the performance of the approach. Attention is first focused on generating biographies, and the author demonstrates that the technique is capable of scaling to domains with larger and more challenging vocabularies. The applicability of the technique for the generation of open-domain Wikipedia summaries in Arabic and Esperanto – two under-resourced languages – is then discussed, and a set of community studies, devised to measure the usability of the automatically generated content by Wikipedia readers and editors, is described. Finally, the book explains an extension of the original model with a pointer mechanism that enables it to learn to verbalise the content of the triples in a number of different ways while retaining the capacity to generate words from a fixed target vocabulary. The evaluation of performance using a dataset encompassing all of English Wikipedia is described, with results from both automatic and human evaluation, both of which highlight the superiority of the extended model over the original architecture.
Download or read book Study on Data Placement Strategies in Distributed RDF Stores written by D.D. Janke and published by IOS Press. This book was released on 2020-03-18 with total page 312 pages. Available in PDF, EPUB and Kindle. Book excerpt: The distributed setting of RDF stores in the cloud poses many challenges, including how to optimize data placement on the compute nodes to improve query performance. In this book, a novel benchmarking methodology is developed for data placement strategies; one that overcomes these limitations by using a data-placement-strategy-independent distributed RDF store to analyze the effect of the data placement strategies on query performance. Frequently used data placement strategies have been evaluated, and this evaluation challenges the commonly held belief that data placement strategies which emphasize local computation lead to faster query executions. Indeed, results indicate that queries with a high workload can be executed faster on hash-based data placement strategies than on, for example, minimal edge-cut covers. The analysis of additional measurements indicates that vertical parallelization (i.e., a well-distributed workload) may be more important than horizontal containment (i.e., minimal data transport) for efficient query processing. Two such data placement strategies are proposed: the first, found in the literature, is entitled overpartitioned minimal edge-cut cover, and the second is the newly developed molecule hash cover. Evaluation revealed a balanced query workload and a high horizontal containment, which lead to a high vertical parallelization. As a result, these strategies demonstrated better query performance than other frequently used data placement strategies. The book also tests the hypothesis that collocating small connected triple sets on the same compute node while balancing the amount of triples stored on the different compute nodes leads to a high vertical parallelization.
Download or read book Managing and Consuming Completeness Information for RDF Data Sources written by F. Darari and published by IOS Press. This book was released on 2019-11-12 with total page 194 pages. Available in PDF, EPUB and Kindle. Book excerpt: The increasing amount of structured data available on the Web is laying the foundations for a global-scale knowledge base. But the ever increasing amount of Semantic Web data gives rise to the question – how complete is that data? Though data on the Semantic Web is generally incomplete, some may indeed be complete. In this book, the author deals with how to manage and consume completeness information about Semantic Web data. In particular, the book explores how completeness information can guarantee the completeness of query answering. Optimization techniques for completeness reasoning and the conducting of experimental evaluations are provided to show the feasibility of the approaches, as well as a technique for checking the soundness of queries with negation via reduction to query completeness checking. Other topics covered include completeness information with timestamps, and two demonstrators – CORNER and COOL-WD – are provided to show how a completeness framework can be realized. Finally, the book investigates an automated method to generate completeness statements from text on the Web. The book will be of interest to anyone whose work involves dealing with Web-data completeness.
Download or read book Multi-modal Data Fusion based on Embeddings written by S. Thoma and published by IOS Press. This book was released on 2019-11-06 with a total of 174 pages. Available in PDF, EPUB and Kindle. Book excerpt: Many web pages include structured data in the form of semantic markup, which can be transferred to the Resource Description Framework (RDF) or provide an interface to retrieve RDF data directly. This RDF data enables machines to automatically process and use the data. When applications need data from more than one source the data has to be integrated, and the automation of this can be challenging. Usually, vocabularies are used to concisely describe the data, but because of the decentralized nature of the web, multiple data sources can provide similar information with different vocabularies, making integration more difficult. This book, Multi-modal Data Fusion based on Embeddings, describes how similar statements about entities can be identified across sources, independent of the vocabulary and data modeling choices. Previous approaches have relied on clean and extensively modeled ontologies for the alignment of statements, but the often noisy data in a web context does not necessarily adhere to these prerequisites. In this book, the use of RDF label information of entities is proposed to tackle this problem. In combination with embeddings, the use of label information allows for a better integration of noisy data, something that has been confirmed empirically. The book presents two main scientific contributions: a vocabulary- and modeling-agnostic fusion approach based on purely textual label information, and the combination of three different modalities into one multi-modal embedding space for a more human-like notion of similarity. The book will be of interest to all those faced with the problem of processing data from multiple web-based sources.
Download or read book Applications and Practices in Ontology Design, Extraction, and Reasoning written by G. Cota and published by IOS Press. This book was released on 2020-12-02 with a total of 244 pages. Available in PDF, EPUB and Kindle. Book excerpt: Semantic Web technologies enable people to create data stores on the Web, build vocabularies, and write rules for handling data. They have been in use for several years now, and knowledge extraction and knowledge discovery are two key aspects investigated in a number of research fields which can potentially benefit from the application of Semantic Web technologies, and specifically from the development and reuse of ontologies. This book, Applications and Practices in Ontology Design, Extraction, and Reasoning, has as its main goal the provision of an overview of application fields for Semantic Web technologies. In particular, it investigates how state-of-the-art formal languages, models, methods, and applications of Semantic Web technologies reframe research questions and approaches in a number of research fields. The book also aims to showcase practical tools and background knowledge for the building and querying of ontologies. The first part of the book presents the state of the art of ontology design, applications and practices in a number of communities, and in doing so it provides an overview of the latest approaches and techniques for building and reusing ontologies according to domain-dependent and domain-independent requirements. Once the data is represented according to ontologies, it is important to be able to query and reason about it, even in the presence of uncertainty, vagueness and probabilities. The second part of the book covers some of the latest advances in the fields of ontology, semantics and reasoning, without losing sight of the book’s practical goals.
Download or read book Identity of Long-tail Entities in Text written by F. Ilievski and published by IOS Press. This book was released on 2019-11-29 with a total of 229 pages. Available in PDF, EPUB and Kindle. Book excerpt: The digital era has generated a huge amount of data on the identities (profiles) of people, organizations and other entities in a digital format, largely consisting of textual documents such as news articles, encyclopedias, personal websites, books, and social media. Identity has thus been transformed from a philosophical to a societal issue, one requiring robust computational tools to determine entity identity in text. Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities – which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous – can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge (“profiling”) models for establishing the identity of NIL entities. Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
Download or read book Services for Connecting and Integrating Big Numbers of Linked Datasets written by M. Mountantonakis and published by IOS Press. This book was released on 2021-02-19 with a total of 314 pages. Available in PDF, EPUB and Kindle. Book excerpt: Linked Data is a method of publishing structured data to facilitate sharing, linking, searching and re-use. Many such datasets have already been published, but although their number and size continue to increase, the main objectives of linking and integration have not yet been fully realized, and even seemingly simple tasks, like finding all the available information for an entity, are still challenging. This book, Services for Connecting and Integrating Big Numbers of Linked Datasets, is the 50th volume in the series ‘Studies on the Semantic Web’. The book analyzes the research work done in the area of linked data integration, and focuses on methods that can be used at large scale. It then proposes indexes and algorithms for tackling some of the challenges, such as methods for performing cross-dataset identity reasoning, finding all the available information for an entity, methods for ordering content-based dataset discovery, and others. The author demonstrates how content-based dataset discovery can be reduced to solving optimization problems, and techniques are proposed for solving these efficiently while taking the contents of the datasets into consideration. To order them in real time, the proposed indexes and algorithms have been implemented in a suite of services called LODsyndesis, in turn enabling the implementation of other high-level services, such as techniques for knowledge graph embeddings, and services for data enrichment which can be exploited for machine-learning tasks, and which also improve the prediction of machine-learning problems.