Querying and Merging Heterogeneous Data by Approximate Joins on Higher-Order Terms

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

2 Citations (Scopus)
181 Downloads (Pure)

Abstract

Integrating heterogeneous data from sources as diverse as web pages, digital libraries, knowledge bases, the Semantic Web and databases is an open problem. The ultimate aim of our work is to be able to query such heterogeneous data sources as if their data were conveniently held in a single relational database. Pursuant to this aim, we propose a generalisation of joins from the relational database model to enable joins on arbitrarily complex structured data in a higher-order representation. By incorporating kernels and distances for structured data, we further extend this model to support approximate joins of heterogeneous data. We demonstrate the flexibility of our approach in the publications domain by evaluating example approximate queries on the CORA data sets, joining on types ranging from sets of co-authors through to entire publications.
Original languageEnglish
Title of host publicationInductive Logic Programming
Subtitle of host publication18th International Conference, ILP 2008 Prague, Czech Republic, September 10-12, 2008 Proceedings
Pages226 - 243
Number of pages18
Volume5194
ISBN (Electronic)978-3-540-85928-4
DOIs
Publication statusPublished - Sep 2008
Event18th International Conference, ILP 2008 Prague, Czech Republic, September 10-12, 2008 Proceedings - Prague, United Kingdom
Duration: 10 Sep 200812 Sep 2008

Conference

Conference18th International Conference, ILP 2008 Prague, Czech Republic, September 10-12, 2008 Proceedings
CountryUnited Kingdom
CityPrague
Period10/09/0812/09/08

Bibliographical note

Editors: F. Zelezny and N. Lavrac
ISBN: 9783540859277
Publisher: Springer-Verlag Berlin Heidelberg
Name and Venue of Conference: Proceedings of 18th International Conference on Inductive Logic Programming, ILP 2008, Prague, Czech Republic
Conference Organiser: Czech Technical University
Other: http://ida.felk.cvut.cz/ilp2008/

Fingerprint Dive into the research topics of 'Querying and Merging Heterogeneous Data by Approximate Joins on Higher-Order Terms'. Together they form a unique fingerprint.

  • Cite this

    Price, S., & Flach, PA. (2008). Querying and Merging Heterogeneous Data by Approximate Joins on Higher-Order Terms. In Inductive Logic Programming: 18th International Conference, ILP 2008 Prague, Czech Republic, September 10-12, 2008 Proceedings (Vol. 5194, pp. 226 - 243) https://doi.org/10.1007/978-3-540-85928-4_19