Fast Incremental Indexing for Full-Text Information Retrieval.
Eric W. Brown, James P. Callan, W. Bruce Croft:
Fast Incremental Indexing for Full-Text Information Retrieval.
VLDB 1994: 192-202
@inproceedings{DBLP:conf/vldb/BrownCC94,
author = {Eric W. Brown and
James P. Callan and
W. Bruce Croft},
editor = {Jorge B. Bocca and
Matthias Jarke and
Carlo Zaniolo},
title = {Fast Incremental Indexing for Full-Text Information Retrieval},
booktitle = {VLDB'94, Proceedings of 20th International Conference on Very
Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile},
publisher = {Morgan Kaufmann},
year = {1994},
isbn = {1-55860-153-8},
pages = {192-202},
ee = {db/conf/vldb/vldb94-192.html},
crossref = {DBLP:conf/vldb/94},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
Full-text information retrieval systems have traditionally been
designed for archival environments. They often provide little or no
support for adding new documents to an existing document collection,
requiring instead that the entire collection be re-indexed. Modern
applications, such as information filtering, operate in dynamic
environments that require frequent additions to document collections.
We provide this ability using a traditional inverted file index built
on top of a persistent object store. The data management facilities of
the persistent object store are used to produce efficient incremental
update of the inverted lists. We describe our system and present
experimental results showing superior incremental indexing and
competitive query processing performance.
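As a rough illustration of the idea in the abstract, the following minimal sketch shows an inverted index that accepts new documents by appending to the affected inverted lists instead of re-indexing the whole collection. It is not the system described in the paper: the class name, method names, whitespace tokenizer, and the in-memory dictionary standing in for a persistent object store are all assumptions made for this example.

```python
from collections import defaultdict


class IncrementalInvertedIndex:
    """Toy inverted index that accepts new documents without a rebuild.

    Hypothetical sketch: a plain dict stands in for the persistent
    object store, and tokenization is a simple whitespace split.
    """

    def __init__(self):
        # term -> list of (doc_id, term_frequency), kept in doc_id order
        self.postings = defaultdict(list)
        self.next_doc_id = 0

    def add_document(self, text):
        """Index one document by appending to the affected lists only."""
        doc_id = self.next_doc_id
        self.next_doc_id += 1

        # Count term occurrences within this document.
        counts = defaultdict(int)
        for token in text.lower().split():
            counts[token] += 1

        # Doc ids only grow, so appending keeps each list sorted.
        for term, tf in counts.items():
            self.postings[term].append((doc_id, tf))
        return doc_id

    def search(self, term):
        """Return the ids of documents containing the term."""
        return [doc_id for doc_id, _ in self.postings.get(term.lower(), [])]


index = IncrementalInvertedIndex()
index.add_document("fast incremental indexing")
index.add_document("incremental update of inverted lists")
print(index.search("incremental"))
```

Because document identifiers are assigned in increasing order, each addition touches only the lists of the terms it contains, which is the property that makes frequent additions cheap in this simplified setting.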
Copyright © 1994 by the VLDB Endowment.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or
distributed for direct commercial advantage, the VLDB
copyright notice and the title of the publication and
its date appear, and notice is given that copying
is by the permission of the Very Large Data Base
Endowment. To copy otherwise, or to republish, requires
a fee and/or special permission from the Endowment.
Printed Edition
Jorge B. Bocca, Matthias Jarke, Carlo Zaniolo (Eds.):
VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile.
Morgan Kaufmann 1994, ISBN 1-55860-153-8
References
- [BCCM94]
- Eric W. Brown, James P. Callan, W. Bruce Croft, J. Eliot B. Moss:
Supporting Full-Text Information Retrieval with a Persistent Object Store.
EDBT 1994: 365-378
- [CCH92]
- James P. Callan, W. Bruce Croft, Stephen M. Harding:
The INQUERY Retrieval System.
DEXA 1992: 78-83
- [CP90]
- Douglas R. Cutting, Jan O. Pedersen:
Optimizations for Dynamic Inverted Index Maintenance.
SIGIR 1990: 405-411
- [Fal85]
- Christos Faloutsos:
Access Methods for Text.
ACM Comput. Surv. 17(1): 49-74(1985)
- [FJ92a]
- Christos Faloutsos, H. V. Jagadish:
Hybrid Index Organizations for Text Databases.
EDBT 1992: 310-327
- [FJ92b]
- Christos Faloutsos, H. V. Jagadish:
On B-Tree Indices for Skewed Distributions.
VLDB 1992: 363-374
- [FL91]
- ...
- [Har94]
- ...
- [Hea78]
- ...
- [HFBYL92]
- ...
- [Mos90]
- J. Eliot B. Moss:
Design of the Mneme Persistent Object Store.
ACM Trans. Inf. Syst. 8(2): 103-139(1990)
- [MZ94a]
- ...
- [MZ94b]
- ...
- [SM83]
- Gerard Salton, Michael McGill:
Introduction to Modern Information Retrieval.
McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
- [STGM94]
- Kurt A. Shoens, Anthony Tomasic, Hector Garcia-Molina:
Synthetic Workload Performance Analysis of Incremental Updates.
SIGIR 1994: 329-338
- [TC91]
- Howard R. Turtle, W. Bruce Croft:
Evaluation of an Inference Network-Based Retrieval Model.
ACM Trans. Inf. Syst. 9(3): 187-222(1991)
- [TC92]
- Howard R. Turtle, W. Bruce Croft:
A Comparison of Text Retrieval Models.
Comput. J. 35(3): 279-290(1992)
- [TGMS94]
- Anthony Tomasic, Hector Garcia-Molina, Kurt A. Shoens:
Incremental Updates of Inverted Lists for Text Document Retrieval.
SIGMOD Conference 1994: 289-300
- [Wil84]
- ...
- [WL93]
- Wai Yee Peter Wong, Dik Lun Lee:
Implementations of Partial Document Ranking Using Inverted Files.
Inf. Process. Manage. 29(5): 647-669(1993)
- [WMB94]
- Ian H. Witten, Alistair Moffat, Timothy C. Bell:
Managing Gigabytes: Compressing and Indexing Documents and Images.
Van Nostrand Reinhold 1994, ISBN 0-442-01863-0
- [Wol92a]
- Dietmar Wolfram:
Applying Informetric Characteristics of Databases to IR System File Design, Part I: Informetric Models.
Inf. Process. Manage. 28(1): 121-134(1992)
- [Wol92b]
- Dietmar Wolfram:
Applying Informetric Characteristics of Databases to IR System File Design, Part II: Simulation Comparisons.
Inf. Process. Manage. 28(1): 135-(1992)
- [Zip49]
- George Kingsley Zipf:
Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology.
Addison-Wesley 1949
- [ZMSD92]
- Justin Zobel, Alistair Moffat, Ron Sacks-Davis:
An Efficient Indexing Technique for Full Text Databases.
VLDB 1992: 352-362