Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies.
Luis Gravano, Hector Garcia-Molina:
Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies.
VLDB 1995: 78-89@inproceedings{DBLP:conf/vldb/GravanoG95,
author = {Luis Gravano and
Hector Garcia-Molina},
editor = {Umeshwar Dayal and
Peter M. D. Gray and
Shojiro Nishio},
title = {Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies},
booktitle = {VLDB'95, Proceedings of 21th International Conference on Very
Large Data Bases, September 11-15, 1995, Zurich, Switzerland},
publisher = {Morgan Kaufmann},
year = {1995},
isbn = {1-55860-379-4},
pages = {78-89},
ee = {db/conf/vldb/GravanoG95.html},
crossref = {DBLP:conf/vldb/95},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
As large numbers of text databases have become available on the Internet, it is harder to locate the right sources for given queries.
In this paper we present gGlOSS, a generalized Glossary-Of-Servers Server, that keeps statistics on the available databases to estimate which databases are the potentially most useful for a given query.
gGlOSS extends our previous work [l], which focused on databases using the boolean model of document retrieval, to cover databases using the more sophisticated vector-space retrieval model.
We evaluate our new techniques using real-user queries and 53 databases.
Finally, we further generalize our approach by showing how to build a hierarchy of gGlOSS brokers.
The top level of the hierarchy is so small it could be widely replicated, even at end-user workstations.
Copyright © 1995 by the VLDB Endowment.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or
distributed for direct commercial advantage, the VLDB
copyright notice and the title of the publication and
its date appear, and notice is given that copying
is by the permission of the Very Large Data Base
Endowment. To copy otherwise, or to republish, requires
a fee and/or special permission from the Endowment.
Online Paper
CDROM Version: Load the CDROM "Volume 1 Issue 5, VLDB '89-'97" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Printed Edition
Umeshwar Dayal, Peter M. D. Gray, Shojiro Nishio (Eds.):
VLDB'95, Proceedings of 21th International Conference on Very Large Data Bases, September 11-15, 1995, Zurich, Switzerland.
Morgan Kaufmann 1995, ISBN 1-55860-379-4
Contents
References
- [1]
- Luis Gravano, Hector Garcia-Molina, Anthony Tomasic:
The Effectiveness of GlOSS for the Text Database Discovery Problem.
SIGMOD Conference 1994: 126-137
- [2]
- Michael F. Schwartz, Alan Emtage, Brewster Kahle, B. Clifford Neuman:
A Comparison of Internet Resource Discovery Approaches.
Computing Systems 5(4): 461-493(1992)
- [3]
- Katia Obraczka, Peter B. Danzig, Shih-Hao Li:
Internet Resource Discovery Services.
IEEE Computer 26(9): 8-22(1993)
- [4]
- Luis Gravano, Hector Garcia-Molina, Anthony Tomasic:
Precision and Recall of GlOSS Estimators for Database Discovery.
PDIS 1994: 103-106
- [5]
- Gerard Salton, Michael McGill:
Introduction to Modern Information Retrieval.
McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
- [6]
- Gerard Salton:
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer.
Addison-Wesley 1989, ISBN 0-201-12227-8
- [7]
- B. Clifford Neuman:
The Prospero File System: A Global File System Based on the Virtual System Model.
Computing Systems 5(4): 407-432(1992)
- [8]
- Tim Berners-Lee, Robert Cailliau, Jean-François Groff, Bernd Pollermann:
World-Wide Web: The Information Universe.
Electronic Networking: Research, Applications and Policy 1(2): 74-82(1992)
- [9]
- ...
- [10]
- James P. Callan, Zhihong Lu, W. Bruce Croft:
Searching Distributed Collections with Inference Networks.
SIGIR 1995: 21-28
- [11]
- Mark A. Sheldon, Andrzej Duda, Ron Weiss, James O'Toole, David K. Gifford:
Content Routing for Distributed Information Servers.
EDBT 1994: 109-122
- [12]
- Andrzej Duda, Mark A. Sheldon:
Content Routing in a Network of WAIS Servers.
ICDCS 1994: 124-132
- [13]
- ...
- [14]
- Anthony Tomasic, Luis Gravano, Calvin Lue, Peter M. Schwarz, Laura M. Haas:
Data Structures for Efficient Broker Implementation.
ACM Trans. Inf. Syst. 15(3): 223-253(1997)
- [15]
- Tak W. Yan, Hector Garcia-Molina:
SIFT - a Tool for Wide-Area Information Dissemination.
USENIX Winter 1995: 177-186
- [16]
- ...
- [17]
- Luis Gravano, Hector Garcia-Molina, Anthony Tomasic:
Precision and Recall of GlOSS Estimators for Database Discovery.
PDIS 1994: 103-106
- [18]
- Ellen M. Voorhees, Narendra Kumar Gupta, Ben Johnson-Laird:
The Collection Fusion Problem.
TREC 1994: 0-
- [19]
- Alistair Moffat, Justin Zobel:
Information Retrieval Systems for Large Document Collections.
TREC 1994: 0-
Copyright © Mon Mar 15 03:55:55 2010
by Michael Ley (ley@uni-trier.de)