Assessing semantic similarity between concepts using Wikipedia based on nonlinear fitting

Guangjian Huang, Yuncheng Jiang, Wenjun Ma, Weiru Liu

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

1 Citation (Scopus)
122 Downloads (Pure)


Feature-based methods of semantic similarity with Wikipedia achieve fruitful performances on measuring the "likeness" between objects in many research fields. However, since Wikipedia is created and edited by volunteers around the world, the preciseness of these methods more or less are influenced by the incompleteness, invalidity and inconsistency of the knowledge in Wikipedia. Unfortunately, this problem has not got enough attention in the existing work. To address this issue, this paper proposes a novel feature-based method for semantic similarity, which has three parts: low frequency features removal, the similarities of generalized synonyms computing, and weighted feature-based methods based on nonlinear fitting. Moreover, we show that our new method can always get a better Pearson correlation coefficient on one or more benchmarks through a set of experimental evaluations.
Original languageEnglish
Title of host publicationThe 12th International Conference on Knowledge Science, Engineering and Management (KSEM 2019)
EditorsRandy Goebel, Yuzuru Tanaka, Wolfgang Wahlster
Number of pages13
ISBN (Electronic) 978-3-030-29563-9
ISBN (Print)978-3-030-29562-2
Publication statusPublished - Aug 2019

Publication series

NameLecture Notes in Artificial Intelligence
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


  • Semantic similarity
  • Wikipedia
  • Nonlinear fitting


Dive into the research topics of 'Assessing semantic similarity between concepts using Wikipedia based on nonlinear fitting'. Together they form a unique fingerprint.

Cite this