An Approach for Document Fragment Retrieval and Its Formatting Issue in Engineering Information Management

S. Liu, C. McMahon, M. Darlington, S. Culley, P. Wild

Research output: Chapter in Book/Report/Conference proceeding › Conference Contribution (Conference Proceeding)

Abstract

This paper discusses engineering document fragment mark-up supported by the use of the eXtensible Stylesheet Language – Formatting Objects (XSL-FO). XSL-FO can be used to convert the native format representation of documents such as Word, Excel and PDF into XML. Once in XML, document fragments can be retrieved at will in response to a search query. In the paper, the process of document fragment retrieval, based on the authors' decomposition scheme approach, is modelled and the issue of converting documents into XML is addressed. Additionally, the use of document templates is discussed as a means of ensuring that the transformed XML documents comply with the decomposition schemes. Automating the reformatting of documents into XML and using templates help make a document-fragment approach to retrieval more resource-efficient to implement, and so more practicable to adopt in industry.
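The retrieval step described in the abstract can be illustrated with a minimal sketch. It assumes a document has already been converted to XML and split into fragments; the `fragment` element, its `type` attribute, the sample text and the `find_fragments` helper are all hypothetical and are not taken from the authors' decomposition schemes.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML form of an engineering report. The element and
# attribute names are illustrative only, not the authors' scheme.
SAMPLE = """
<document title="Gearbox test report">
  <fragment type="summary">Gearbox passed the endurance test.</fragment>
  <fragment type="method">Load was applied in 10 kN steps.</fragment>
  <fragment type="result">Bearing wear stayed within tolerance.</fragment>
</document>
"""


def find_fragments(xml_text, query, fragment_type=None):
    """Return (type, text) pairs for fragments whose text contains the query."""
    root = ET.fromstring(xml_text)
    hits = []
    for frag in root.iter("fragment"):
        text = (frag.text or "").strip()
        # Optionally restrict the search to one kind of fragment.
        if fragment_type and frag.get("type") != fragment_type:
            continue
        # Case-insensitive substring match stands in for a real search engine.
        if query.lower() in text.lower():
            hits.append((frag.get("type"), text))
    return hits


print(find_fragments(SAMPLE, "wear"))
```

Because each fragment is a separate element, a query returns only the matching part of the document rather than the whole file, which is the behaviour the abstract attributes to fragment-level mark-up.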
Original language: English
Title of host publication: Computational Science and Its Applications - ICCSA 2006
Subtitle of host publication: International Conference, Glasgow, UK, May 8-11, 2006. Proceedings, Part II
Publisher: Springer
Pages: 279-287
Number of pages: 9
ISBN (Electronic): 9783540340744
ISBN (Print): 9783540340720
Publication status: Published - 2006

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 3981
ISSN (Print): 0302-9743
