Abstract
Representing data items as vectors in a space is a common practice in machine learning, where it often goes under the name of “data embedding”. This representation is typically learnt from known relations that exist in the original data, such as co-occurrence of words, or connections in graphs. A property of these embeddings is known as compositionality, whereby the vector representation of an item can be decomposed into different parts, which can be understood separately. This property, first observed in the case of word embeddings, could help with various challenges of modern AI: detection of unwanted bias in the representation, explainability of AI decisions based on these representations, and the possibility of performing analogical reasoning or counterfactual question answering. One important direction of research is to understand the origins, properties and limitations of compositional data embeddings, with the idea of going beyond word embeddings. In this paper, we propose two methods to test for this property, demonstrating their use in the case of sentence embedding and knowledge graph embedding.
Original language | English |
---|---|
Title of host publication | Advances in Intelligent Data Analysis XXI |
Subtitle of host publication | 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023, Proceedings |
Editors | Bruno Crémilleux, Sibylle Hess, Siegfried Nijssen |
Publisher | Springer |
Pages | 484-496 |
Number of pages | 13 |
Volume | 13876 |
ISBN (Electronic) | 9783031300479 |
ISBN (Print) | 9783031300462 |
DOIs | |
Publication status | Published - 1 Apr 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13876 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Fingerprint
Dive into the research topics of 'On Compositionality in Data Embedding'. Together they form a unique fingerprint.Student theses
-
Processing questions with multi-task sentence embedding
Xu, Z. (Author), Cristianini, N. (Supervisor), Lewis, M. (Supervisor) & Damen, D. (Supervisor), 9 May 2023Student thesis: Doctoral Thesis › Doctor of Philosophy (PhD)
File