Multi-modal data fusion based on embeddings

Many web pages include structured data in the form of semantic markup, which can be transferred to the Resource Description Framework (RDF) or provide an interface to retrieve RDF data directly. This RDF data enables machines to automatically process and use the data. When applications need data fro...

Descripción completa

Detalles Bibliográficos
Autor principal:	Thoma, Steffen (-)
Formato:	Libro electrónico
Idioma:	Inglés
Publicado:	Amsterdam : IOS Press [2019]
Colección:	EBSCO Academic eBook Collection Complete. Studies on the Semantic Web ; 041.
Acceso en línea:	Conectar con la versión electrónica
Ver en Universidad de Navarra:	https://innopac.unav.es/record=b44373259*spi

Descripción
Sumario:	Many web pages include structured data in the form of semantic markup, which can be transferred to the Resource Description Framework (RDF) or provide an interface to retrieve RDF data directly. This RDF data enables machines to automatically process and use the data. When applications need data from more than one source the data has to be integrated, and the automation of this can be challenging. Usually, vocabularies are used to concisely describe the data, but because of the decentralized nature of the web, multiple data sources can provide similar information with different vocabularies, making integration more difficult. This book, Multi-modal Data Fusion based on Embeddings, describes how similar statements about entities can be identified across sources, independent of the vocabulary and data modeling choices. Previous approaches have relied on clean and extensively modeled ontologies for the alignment of statements, but the often noisy data in a web context does not necessarily adhere to these prerequisites. In this book, the use of RDF label information of entities is proposed to tackle this problem. In combination with embeddings, the use of label information allows for a better integration of noisy data, something that has been empirically confirmed by experiment. The book presents two main scientific contributions: the vocabulary and modeling agnostic fusion approach on the purely textual label information, and the combination of three different modalities into one multi-modal embedding space for a more human-like notion of similarity. The book will be of interest to all those faced with the problem of processing data from multiple web-based sources.
Descripción Física:	1 recurso electrónico
Formato:	Forma de acceso: World Wide Web.
Bibliografía:	Incluye referencias bibliográficas.
ISBN:	9781643680293

Multi-modal data fusion based on embeddings

Ejemplares similares