Using Time Dependent Link Reduction to Improve the Efficiency of Topic Prediction in Co-Authorship Graphs

Susanne Hoche, David Hardcastle, Peter Flach

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

Abstract

One prominent task in graphs is property prediction, where a property of some of the graph's nodes is known and used to make predictions for those individuals for which this property is unknown. In this paper, we look at topic prediction for papers organized in a so-called co-authorship graph (CAG) where the individuals are scientific papers with links between them if they share some author. A CAG tends to have a large number of cliques, each formed by all the papers published by the same author. Thus, topic prediction in a CAG tends to be computationally expensive. We investigate in how far we can reduce this complexity without sacrificing the prediction quality by reducing the number of links in the CAG based on the papers' publication dates. We apply an inexpensive iterative neighbourhood's majority vote based algorithm to predict unknown topics based on the papers with known topics and the CAG's link structure. For three data sets, we evaluate our algorithm in terms of classification accuracy and computational time on both the full graph G and subgraphs of it. On substantially smaller subgraphs of G, our algorithm obtains classification accuracies that are similar to the results obtained on G, while achieving a reduction in execution time of up to one order of magnitude.
Translated title of the contributionUsing Time Dependent Link Reduction to Improve the Efficiency of Topic Prediction in Co-Authorship Graphs
Original languageEnglish
Title of host publicationComplex Networks: Results of the 1st International Workshop on Complex Networks (CompleNet 2009)
PublisherSpringer
Publication statusPublished - 2009

Bibliographical note

Other page information: -
Conference Proceedings/Title of Journal: Complex Networks: Results of the 1st International Workshop on Complex Networks (CompleNet 2009)
Other identifier: 2000991

Fingerprint Dive into the research topics of 'Using Time Dependent Link Reduction to Improve the Efficiency of Topic Prediction in Co-Authorship Graphs'. Together they form a unique fingerprint.

Cite this