SubSift web services and workflows for profiling and comparing scientists and their published works

S Price, PA Flach, SR Spiegler, CP Bailey, N Rogers

Research output: Chapter in Book/Report/Conference proceeding > Conference Contribution (Conference Proceeding)

Abstract

Scientific researchers, laboratories and organisations can be profiled and compared by analysing their published works, including documents ranging from academic papers to web sites, blog posts and Twitter feeds. This paper describes how the vector space model from information retrieval, more normally associated with full text search, has been employed in the open source SubSift software to support workflows to profile and compare such collections of documents. SubSift was originally designed to match submitted conference or journal papers to potential peer reviewers based on the similarity between the paper's abstract and the reviewer's publications as found in online bibliographic databases. The software is implemented as a family of RESTful web services that, composed into a re-usable workflow, have already been used to support several major data mining conferences. Alternative workflows and service compositions are now enabling other interesting applications.
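The matching step described in the abstract can be illustrated with a minimal sketch, assuming a plain tf-idf weighting and cosine similarity over a bag-of-words representation. This is not SubSift's own API or its exact weighting scheme; the function names (tokenize, tfidf_vectors, cosine, rank_reviewers) and the sample texts are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Map each document key to a sparse {term: tf-idf weight} vector.

    Assumption: tf is the relative term frequency and idf is log(N / df).
    Other weightings are equally valid in the vector space model.
    """
    counts = {key: Counter(tokenize(text)) for key, text in docs.items()}
    n_docs = len(docs)
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())
    vectors = {}
    for key, term_counts in counts.items():
        total = sum(term_counts.values())
        vectors[key] = {
            term: (freq / total) * math.log(n_docs / doc_freq[term])
            for term, freq in term_counts.items()
        }
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors (0.0 if either is zero)."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_reviewers(papers, reviewers):
    """For each paper abstract, list reviewers by descending similarity.

    papers: {paper_id: abstract text}
    reviewers: {reviewer_name: concatenated publication text}
    """
    # Tag keys so paper ids and reviewer names cannot collide.
    docs = {("paper", pid): text for pid, text in papers.items()}
    docs.update({("reviewer", name): text for name, text in reviewers.items()})
    vectors = tfidf_vectors(docs)
    ranking = {}
    for pid in papers:
        scores = [
            (name, cosine(vectors[("paper", pid)], vectors[("reviewer", name)]))
            for name in reviewers
        ]
        ranking[pid] = sorted(scores, key=lambda pair: pair[1], reverse=True)
    return ranking


if __name__ == "__main__":
    # Hypothetical sample data, not taken from any real conference.
    papers = {"p1": "kernel methods for graph classification"}
    reviewers = {
        "alice": "graph kernels and kernel methods for structured data classification",
        "bob": "bayesian inference for topic models of text",
    }
    for paper_id, ranked in rank_reviewers(papers, reviewers).items():
        print(paper_id, ranked)
```

Computing idf over the combined collection of abstracts and reviewer profiles means that terms shared by every document contribute nothing to the score, so the ranking is driven by the vocabulary that distinguishes one reviewer from another.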
Original language: English
Title of host publication: Sixth IEEE International Conference on e-Science (e-Science 2010), Brisbane, Australia
Publisher: IEEE Computer Society
Pages: 182 - 189
ISBN (Print): 9780769542904
Publication status: Published - Dec 2010
Event: IEEE e–Science Conference 2010 - Brisbane, Australia
Duration: 7 Dec 2010 - 10 Dec 2010

Conference

Conference: IEEE e–Science Conference 2010
Country: Australia
City: Brisbane
Period: 7/12/10 - 10/12/10

Bibliographical note

Other: DOI 10.1109/eScience.2010.29

Structured keywords

  • Jean Golding
