Column-Oriented Table Access Using STIL: Fast Analysis of Very Large Tables

M. B. Taylor, C. G. Page

Research output: Contribution to journal, Article (Academic Journal)

Abstract

By use of column-oriented storage and file mapping, great improvements in efficiency over more conventional methods can be made for some important kinds of access to large and very large tabular datasets. These techniques have been implemented in the STIL library, enabling their use in the established table analysis applications TOPCAT and STILTS. Benchmarks are presented which show certain common analysis tasks running 10 to 40 times faster than their MySQL equivalents. Applied to datasets ranging from hundreds of Mbyte to hundreds of Gbyte, this speedup can be put to good use both on the desktop and at the datacenter to bring new regimes of data exploration within practical reach.
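As a rough illustration of the two ideas named in the abstract, the sketch below stores one column as a contiguous run of 8-byte values in its own file, then memory-maps that file to compute a statistic without touching any other column. The file layout, the function names `write_column` and `column_mean`, and the file name `mag.col` are assumptions made for this example only; they are not STIL's actual on-disk format or API.

```python
import mmap
import os
import struct
import tempfile


def write_column(path, values):
    """Store one column as contiguous little-endian float64 values (hypothetical layout)."""
    with open(path, "wb") as f:
        f.write(struct.pack(f"<{len(values)}d", *values))


def column_mean(path):
    """Memory-map a single column file and average its values.

    Only the pages of this one column are pulled in by the operating
    system; other columns of the table, stored in other files, are never read.
    """
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
        count = len(mapped) // 8  # 8 bytes per float64 cell
        total = 0.0
        for i in range(count):
            (value,) = struct.unpack_from("<d", mapped, i * 8)
            total += value
        return total / count


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as directory:
        path = os.path.join(directory, "mag.col")  # hypothetical column file
        write_column(path, [1.0, 2.0, 3.0, 4.0])
        print(column_mean(path))
```

In a row-oriented layout the same calculation would have to step over every other field of every row, which is one plausible reason a column scan of this kind can be much cheaper for wide tables.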
Original language: English
Pages (from-to): 422
Number of pages: 4
Journal: Astronomical Society of the Pacific Conference Series
Volume: 394
Publication status: Published - 1 Aug 2008
