Strategies and Techniques for Federated Semantic Knowledge Integration and Retrieval

Strategies and Techniques for Federated Semantic Knowledge Integration and Retrieval

Author: D. Collarana

Publisher: IOS Press

Published: 2020-01-24

Total Pages: 158

ISBN-13: 1643680471

DOWNLOAD EBOOK

The vast amount of data available on the web has led to the need for effective retrieval techniques to transform that data into usable machine knowledge. But the creation of integrated knowledge, especially knowledge about the same entity from different web data sources, is a challenging task requiring the solving of interoperability problems. This book addresses the problem of knowledge retrieval and integration from heterogeneous web sources, and proposes a holistic semantic knowledge retrieval and integration approach to creating knowledge graphs on-demand from diverse web sources. Semantic Web Technologies have evolved as a novel approach to tackle the problem of knowledge integration from heterogeneous data, but because of the Extraction-Transformation-Load approach that dominates the process, knowledge retrieval and integration from web data sources is either expensive, or full physical integration of the data is impeded by restricted access. Focusing on the representation of data from web sources as pieces of knowledge belonging to the same entity which can then be synthesized as a knowledge graph helps to solve interoperability conflicts and allow for a more cost-effective integration approach, providing a method that enables the creation of valuable insights from heterogeneous web data. Empirical evaluations to assess the effectiveness of this holistic approach provide evidence that the methodology and techniques proposed in this book help to effectively integrate the disparate knowledge spread over heterogeneous web data sources, and the book also demonstrates how three domain applications of law enforcement, job market analysis, and manufacturing, have been developed and managed using the approach.


Study on Data Placement Strategies in Distributed RDF Stores

Study on Data Placement Strategies in Distributed RDF Stores

Author: D.D. Janke

Publisher: IOS Press

Published: 2020-03-18

Total Pages: 312

ISBN-13: 1643680692

DOWNLOAD EBOOK

The distributed setting of RDF stores in the cloud poses many challenges, including how to optimize data placement on the compute nodes to improve query performance. In this book, a novel benchmarking methodology is developed for data placement strategies; one that overcomes these limitations by using a data-placement-strategy-independent distributed RDF store to analyze the effect of the data placement strategies on query performance. Frequently used data placement strategies have been evaluated, and this evaluation challenges the commonly held belief that data placement strategies which emphasize local computation lead to faster query executions. Indeed, results indicate that queries with a high workload can be executed faster on hash-based data placement strategies than on, for example, minimal edge-cut covers. The analysis of additional measurements indicates that vertical parallelization (i.e., a well-distributed workload) may be more important than horizontal containment (i.e., minimal data transport) for efficient query processing. Two such data placement strategies are proposed: the first, found in the literature, is entitled overpartitioned minimal edge-cut cover, and the second is the newly developed molecule hash cover. Evaluation revealed a balanced query workload and a high horizontal containment, which lead to a high vertical parallelization. As a result, these strategies demonstrated better query performance than other frequently used data placement strategies. The book also tests the hypothesis that collocating small connected triple sets on the same compute node while balancing the amount of triples stored on the different compute nodes leads to a high vertical parallelization.


Knowledge Graphs: Semantics, Machine Learning, and Languages

Knowledge Graphs: Semantics, Machine Learning, and Languages

Author: M. Acosta

Publisher: IOS Press

Published: 2023-10-03

Total Pages: 262

ISBN-13: 1643684256

DOWNLOAD EBOOK

Semantic computing is an integral part of modern technology, an essential component of fields as diverse as artificial intelligence, data science, knowledge discovery and management, big data analytics, e-commerce, enterprise search, technical documentation, document management, business intelligence, and enterprise vocabulary management. This book presents the proceedings of SEMANTICS 2023, the 19th International Conference on Semantic Systems, held in Leipzig, Germany, from 20 to 22 September 2023. The conference is a pivotal event for those professionals and researchers actively engaged in harnessing the power of semantic computing, an opportunity to increase their understanding of the subject’s transformative potential while confronting its practical limitations. Attendees include information managers, IT architects, software engineers, and researchers from a broad spectrum of organizations, including research facilities, non-profit entities, public administrations, and the world's largest corporations. For this year’s conference a total of 54 submissions were received in response to a call for papers. These were subjected to a rigorous, double-blind review process, with at least three independent reviews conducted for each submission. The 16 papers included here were ultimately accepted for presentation, with an acceptance rate of 29.6%. Areas covered include novel research challenges in areas such as data science, machine learning, logic programming, content engineering, social computing, and the Semantic Web. The book provides an up-to-date overview, which will be of interest to all those wishing to stay abreast of emerging trends and themes within the vast field of semantic computing.


Services for Connecting and Integrating Big Numbers of Linked Datasets

Services for Connecting and Integrating Big Numbers of Linked Datasets

Author: M. Mountantonakis

Publisher: IOS Press

Published: 2021-02-19

Total Pages: 314

ISBN-13: 1643681656

DOWNLOAD EBOOK

Linked Data is a method of publishing structured data to facilitate sharing, linking, searching and re-use. Many such datasets have already been published, but although their number and size continues to increase, the main objectives of linking and integration have not yet been fully realized, and even seemingly simple tasks, like finding all the available information for an entity, are still challenging. This book, Services for Connecting and Integrating Big Numbers of Linked Datasets, is the 50th volume in the series ‘Studies on the Semantic Web’. The book analyzes the research work done in the area of linked data integration, and focuses on methods that can be used at large scale. It then proposes indexes and algorithms for tackling some of the challenges, such as, methods for performing cross-dataset identity reasoning, finding all the available information for an entity, methods for ordering content-based dataset discovery, and others. The author demonstrates how content-based dataset discovery can be reduced to solving optimization problems, and techniques are proposed for solving these efficiently while taking the contents of the datasets into consideration. To order them in real time, the proposed indexes and algorithms have been implemented in a suite of services called LODsyndesis, in turn enabling the implementation of other high level services, such as techniques for knowledge graph embeddings, and services for data enrichment which can be exploited for machine-learning tasks, and which also improve the prediction of machine-learning problems.


Neural Generation of Textual Summaries from Knowledge Base Triples

Neural Generation of Textual Summaries from Knowledge Base Triples

Author: P. Vougiouklis

Publisher: IOS Press

Published: 2020-04-07

Total Pages: 174

ISBN-13: 1643680676

DOWNLOAD EBOOK

Most people need textual or visual interfaces to help them make sense of Semantic Web data. In this book, the author investigates the problems associated with generating natural language summaries for structured data encoded as triples using deep neural networks. An end-to-end trainable architecture is proposed, which encodes the information from a set of knowledge graph triples into a vector of fixed dimensionality, and generates a textual summary by conditioning the output on this encoded vector. Different methodologies for building the required data-to-text corpora are explored to train and evaluate the performance of the approach. Attention is first focused on generating biographies, and the author demonstrates that the technique is capable of scaling to domains with larger and more challenging vocabularies. The applicability of the technique for the generation of open-domain Wikipedia summaries in Arabic and Esperanto – two under-resourced languages – is then discussed, and a set of community studies, devised to measure the usability of the automatically generated content by Wikipedia readers and editors, is described. Finally, the book explains an extension of the original model with a pointer mechanism that enables it to learn to verbalise in a different number of ways the content from the triples while retaining the capacity to generate words from a fixed target vocabulary. The evaluation of performance using a dataset encompassing all of English Wikipedia is described, with results from both automatic and human evaluation both of which highlight the superiority of the latter approach as compared to the original architecture.


Towards a Knowledge-Aware AI

Towards a Knowledge-Aware AI

Author: A. Dimou

Publisher: IOS Press

Published: 2022-09-29

Total Pages: 236

ISBN-13: 1643683217

DOWNLOAD EBOOK

Semantic systems lie at the heart of modern computing, interlinking with areas as diverse as AI, data science, knowledge discovery and management, big data analytics, e-commerce, enterprise search, technical documentation, document management, business intelligence, enterprise vocabulary management, machine learning, logic programming, content engineering, social computing, and the Semantic Web. This book presents the proceedings of SEMANTiCS 2022, the 18th International Conference on Semantic Systems, held as a hybrid event – live in Vienna, Austria and online – from 12 to 15 September 2022. The SEMANTiCS conference is an annual meeting place for the professionals and researchers who make semantic computing work, who understand its benefits and encounter its limitations, and is attended by information managers, IT architects, software engineers, and researchers from organizations ranging from research facilities and NPOs, through public administrations to the largest companies in the world. The theme and subtitle of the 2022 conference was Towards A Knowledge-Aware AI, and the book contains 15 papers, selected on the basis of quality, impact and scientific merit following a rigorous review process which resulted in an acceptance rate of 29%. The book is divided into four chapters: semantics in data quality, standards and protection; representation learning and reasoning for downstream AI tasks; ontology development; and learning over complementary knowledge. Providing an overview of emerging trends and topics in the wide area of semantic computing, the book will be of interest to anyone involved in the development and deployment of computer technology and AI systems.


Engineering Background Knowledge for Social Robots

Engineering Background Knowledge for Social Robots

Author: L. Asprino

Publisher: IOS Press

Published: 2020-09-25

Total Pages: 240

ISBN-13: 1643681095

DOWNLOAD EBOOK

Social robots are embodied agents that perform knowledge-intensive tasks involving several kinds of information from different heterogeneous sources. This book, Engineering Background Knowledge for Social Robots, introduces a component-based architecture for supporting the knowledge-intensive tasks performed by social robots. The design was based on the requirements of a real socially-assistive robotic application, and all the components contribute to and benefit from the knowledge base which is its cornerstone. The knowledge base is structured by a set of interconnected and modularized ontologies which model the information, and is initially populated with linguistic, ontological and factual knowledge retrieved from Linked Open Data. Access to the knowledge base is guaranteed by Lizard, a tool providing software components, with an API for accessing facts stored in the knowledge base in a programmatic and object-oriented way. The author introduces two methods for engineering the knowledge needed by robots, a novel method for automatically integrating knowledge from heterogeneous sources with a frame-driven approach, and a novel empirical method for assessing foundational distinctions over Linked Open Data entities from a common-sense perspective. These effectively enable the evolution of the robot’s knowledge by automatically integrating information derived from heterogeneous sources and the generation of common-sense knowledge using Linked Open Data as an empirical basis. The feasibility and benefits of the architecture have been assessed through a prototype deployed in a real socially-assistive scenario, and the book presents two applications and the results of a qualitative and quantitative evaluation.


Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, Applications and Challenges

Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, Applications and Challenges

Author: I. Tiddi

Publisher: IOS Press

Published: 2020-05-06

Total Pages: 314

ISBN-13: 1643680811

DOWNLOAD EBOOK

The latest advances in Artificial Intelligence and (deep) Machine Learning in particular revealed a major drawback of modern intelligent systems, namely the inability to explain their decisions in a way that humans can easily understand. While eXplainable AI rapidly became an active area of research in response to this need for improved understandability and trustworthiness, the field of Knowledge Representation and Reasoning (KRR) has on the other hand a long-standing tradition in managing information in a symbolic, human-understandable form. This book provides the first comprehensive collection of research contributions on the role of knowledge graphs for eXplainable AI (KG4XAI), and the papers included here present academic and industrial research focused on the theory, methods and implementations of AI systems that use structured knowledge to generate reliable explanations. Introductory material on knowledge graphs is included for those readers with only a minimal background in the field, as well as specific chapters devoted to advanced methods, applications and case-studies that use knowledge graphs as a part of knowledge-based, explainable systems (KBX-systems). The final chapters explore current challenges and future research directions in the area of knowledge graphs for eXplainable AI. The book not only provides a scholarly, state-of-the-art overview of research in this subject area, but also fosters the hybrid combination of symbolic and subsymbolic AI methods, and will be of interest to all those working in the field.


Further with Knowledge Graphs

Further with Knowledge Graphs

Author: M. Alam

Publisher: IOS Press

Published: 2021-09-23

Total Pages: 284

ISBN-13: 1643682016

DOWNLOAD EBOOK

The field of semantic computing is highly diverse, linking areas such as artificial intelligence, data science, knowledge discovery and management, big data analytics, e-commerce, enterprise search, technical documentation, document management, business intelligence, and enterprise vocabulary management. As such it forms an essential part of the computing technology that underpins all our lives today. This volume presents the proceedings of SEMANTiCS 2021, the 17th International Conference on Semantic Systems. As a result of the continuing Coronavirus restrictions, SEMANTiCS 2021 was held in a hybrid form in Amsterdam, the Netherlands, from 6 to 9 September 2021. The annual SEMANTiCS conference provides an important platform for semantic computing professionals and researchers, and attracts information managers, IT­architects, software engineers, and researchers from a wide range of organizations, such as research facilities, NPOs, public administrations and the largest companies in the world. The subtitle of the 2021 conference’s was “In the Era of Knowledge Graphs”, and 66 submissions were received, from which the 19 papers included here were selected following a rigorous single-blind reviewing process; an acceptance rate of 29%. Topics covered include data science, machine learning, logic programming, content engineering, social computing, and the Semantic Web, as well as the additional sub-topics of digital humanities and cultural heritage, legal tech, and distributed and decentralized knowledge graphs. Providing an overview of current research and development, the book will be of interest to all those working in the field of semantic systems.


Type-Safe Programming for the Semantic Web

Type-Safe Programming for the Semantic Web

Author: M. Leinberger

Publisher: IOS Press

Published: 2021-10-14

Total Pages: 170

ISBN-13: 1643681974

DOWNLOAD EBOOK

Graph-based data formats are a flexible way of representing data – semantic data models in particular – where the schema is part of the data, and have become more popular and had some commercial success in recent years. Semantic data models are also the basis for the Semantic Web – a Web of data governed by open standards in which computer programs can freely access the data provided. This book is about checking the correctness of programs that can access semantic data. Although the flexibility of semantic data models is one of their greatest strengths, it can lead programmers to accidentally fail to account for unintuitive edge cases, leading to run-time errors or unintended side-effects during program execution. A program may even run for a long time before such an error occurs and the program crashes. Providing a type system is an established methodology for proving the absence of run-time errors in programs without requiring execution. The book defines type systems that can detect and avoid such run-time errors based on schema languages available for the Semantic Web. Using the Web Ontology Language (OWL) and its theoretic underpinnings i.e. description logics, and the Shapes Constraint Language (SHACL) in particular, the book defines systems that can provide type-safe data access to semantic data graphs. The book is divided into 3 parts: Part I contains an introduction and preliminaries; Part II covers type systems for the Semantic Web; and Part III includes related work and conclusions.