ACM SIGMOD Anthology VLDB dblp.uni-trier.de

Fast Incremental Indexing for Full-Text Information Retrieval.

Eric W. Brown, James P. Callan, W. Bruce Croft: Fast Incremental Indexing for Full-Text Information Retrieval. VLDB 1994: 192-202
@inproceedings{DBLP:conf/vldb/BrownCC94,
  author    = {Eric W. Brown and
               James P. Callan and
               W. Bruce Croft},
  editor    = {Jorge B. Bocca and
               Matthias Jarke and
               Carlo Zaniolo},
  title     = {Fast Incremental Indexing for Full-Text Information Retrieval},
  booktitle = {VLDB'94, Proceedings of 20th International Conference on Very
               Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile},
  publisher = {Morgan Kaufmann},
  year      = {1994},
  isbn      = {1-55860-153-8},
  pages     = {192-202},
  ee        = {db/conf/vldb/vldb94-192.html},
  crossref  = {DBLP:conf/vldb/94},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Full-text information retrieval systems have traditionally been designed for archival environments. They often provide little or no support for adding new documents to an existing document collection, requiring instead that the entire collection be re-indexed. Modern applications, such as information filtering, operate in dynamic environments that require frequent additions to document collections. We provide this ability using a traditional inverted file index built on top of a persistent object store. The data management facilities of the persistent object store are used to produce efficient incremental update of the inverted lists. We describe our system and present experimental results showing superior incremental indexing and competitive query processing performance.

Copyright © 1994 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.


Online Paper

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 1 Issue 5, VLDB '89-'97" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

Jorge B. Bocca, Matthias Jarke, Carlo Zaniolo (Eds.): VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile. Morgan Kaufmann 1994, ISBN 1-55860-153-8
Contents CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

References

[BCCM94]
Eric W. Brown, James P. Callan, W. Bruce Croft, J. Eliot B. Moss: Supporting Full-Text Information Retrieval with a Persistent Object Store. EDBT 1994: 365-378 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[CCH92]
James P. Callan, W. Bruce Croft, Stephen M. Harding: The INQUERY Retrieval System. DEXA 1992: 78-83 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[CP90]
Douglas R. Cutting, Jan O. Pedersen: Optimizations for Dynamic Inverted Index Maintenance. SIGIR 1990: 405-411 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Fal85]
Christos Faloutsos: Access Methods for Text. ACM Comput. Surv. 17(1): 49-74(1985) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[FJ92a]
Christos Faloutsos, H. V. Jagadish: Hybrid Index Organizations for Text Databases. EDBT 1992: 310-327 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[FJ92b]
Christos Faloutsos, H. V. Jagadish: On B-Tree Indices for Skewed Distributions. VLDB 1992: 363-374 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[FL91]
...
[Har94]
...
[Hea78]
...
[HFBYL92]
...
[Mos90]
J. Eliot B. Moss: Design of the Mneme Persistent Object Store. ACM Trans. Inf. Syst. 8(2): 103-139(1990) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[MZ94a]
...
[MZ94b]
...
[SM83]
Gerard Salton, Michael McGill: Introduction to Modern Information Retrieval. McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[STGM94]
Kurt A. Shoens, Anthony Tomasic, Hector Garcia-Molina: Synthetic Workload Performance Analysis of Incremental Updates. SIGIR 1994: 329-338 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[TC91]
Howard R. Turtle, W. Bruce Croft: Evaluation of an Inference Network-Based Retrieval Model. ACM Trans. Inf. Syst. 9(3): 187-222(1991) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[TC92]
Howard R. Turtle, W. Bruce Croft: A Comparison of Text Retrieval Models. Comput. J. 35(3): 279-290(1992) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[TGMS94]
Anthony Tomasic, Hector Garcia-Molina, Kurt A. Shoens: Incremental Updates of Inverted Lists for Text Document Retrieval. SIGMOD Conference 1994: 289-300 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Wil84]
...
[WL93]
Wai Yee Peter Wong, Dik Lun Lee: Implementations of Partial Document Ranking Using Inverted Files. Inf. Process. Manage. 29(5): 647-669(1993) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[WMB94]
Ian H. Witten, Alistair Moffat, Timothy C. Bell: Managing Gigabytes: Compressing and Indexing Documents and Images. Van Nostrand Reinhold 1994, ISBN 0-442-01863-0
CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Wol92a]
Dietmar Wolfram: Applying Informetric Characteristics of Databases to IR System File Design, Part I: Informetric Models. Inf. Process. Manage. 28(1): 121-134(1992) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Wol92b]
Dietmar Wolfram: Applying Informetric Characteristics of Databases to IR System File Design, Part II: Simulation Comparisons. Inf. Process. Manage. 28(1): 135-(1992) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Zip49]
George Kingsley Zipf: Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology. Addison-Wesley 1949
CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[ZMSD92]
Justin Zobel, Alistair Moffat, Ron Sacks-Davis: An Efficient Indexing Technique for Full Text Databases. VLDB 1992: 352-362 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Referenced by

  1. Michael J. Carey, Donald Kossmann: Reducing the Braking Distance of an SQL Query Engine. VLDB 1998: 158-169
  2. Sibel Adali, Piero A. Bonatti, Maria Luisa Sapino, V. S. Subrahmanian: A Multi-Similarity Algebra. SIGMOD Conference 1998: 402-413
  3. Narayanan Shivakumar, Hector Garcia-Molina: Wave-Indices: Indexing Evolving Databases. SIGMOD Conference 1997: 381-392
  4. Charles L. Viles, James C. French: On the Update of Term Weights in Dynamic Information Retrieval Systems. CIKM 1995: 167-174
  5. Helmut Kaufmann, Moira C. Norrie: Relaxation of Correctness in Database Systems. ADBIS 1995: 90-105

Copyright © Mon Nov 2 21:18:50 2009 by Michael Ley (ley@uni-trier.de)