TAIPAN: Automatic Property Mapping for Tabular Data

The Web encompasses a significant amount of knowledge hidden in entity-attributes tables. Bridging the gap between these tables and the Web of Data thus has the potential to facilitate a large number of applications, including the augmentation of knowledge bases from tables, the search for related t...

Full description

Saved in:
Bibliographic Details
Published inKnowledge Engineering and Knowledge Management Vol. 10024; pp. 163 - 179
Main Authors Ermilov, Ivan, Ngomo, Axel-Cyrille Ngonga
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2016
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN9783319490038
3319490036
ISSN0302-9743
1611-3349
DOI10.1007/978-3-319-49004-5_11

Cover

More Information
Summary:The Web encompasses a significant amount of knowledge hidden in entity-attributes tables. Bridging the gap between these tables and the Web of Data thus has the potential to facilitate a large number of applications, including the augmentation of knowledge bases from tables, the search for related tables and the completion of tables using knowledge bases. Computing such bridges is impeded by the poor accuracy of automatic property mapping, the lack of approaches for the discovery of subject columns and the mere size of table corpora. We propose Taipan, a novel approach for recovering the semantics of tables. Our approach begins by identifying subject columns using a combination of structural and semantic features. It then maps binary relations inside a table to predicates from a given knowledge base. Therewith, our solution supports both the tasks of table expansion and knowledge base augmentation. We evaluate our approach on a table dataset generated from real RDF data and a manually curated version of the T2D gold standard. Our results suggest that we outperform the state of the art by up to 85 % F-measure.
ISBN:9783319490038
3319490036
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-319-49004-5_11