Join Queries with External Text Sources: Execution and Optimization Techniques.
Surajit Chaudhuri, Umeshwar Dayal, Tak W. Yan:
Join Queries with External Text Sources: Execution and Optimization Techniques.
SIGMOD Conference 1995: 410-422@inproceedings{DBLP:conf/sigmod/ChaudhuriDY95,
author = {Surajit Chaudhuri and
Umeshwar Dayal and
Tak W. Yan},
editor = {Michael J. Carey and
Donovan A. Schneider},
title = {Join Queries with External Text Sources: Execution and Optimization
Techniques},
booktitle = {Proceedings of the 1995 ACM SIGMOD International Conference on
Management of Data, San Jose, California, May 22-25, 1995},
publisher = {ACM Press},
year = {1995},
pages = {410-422},
ee = {http://doi.acm.org/10.1145/223784.223856, db/conf/sigmod/sigmod95-33.html},
crossref = {DBLP:conf/sigmod/95},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
Text is a pervasive information type, and many applications require
querying over text sources in addition to structured data. This can be
facilitated by a loose integration of an extensible database system and a
text retrieval system. This paper studies the problem of query processing
in such a system. The focus is on a class of conjunctive queries that
include joins between the structured data and text data, in addition to
selections over these two types of data. We investigate the relevance of
previous work on distributed query processing and foreign function
optimization. We show that, while several of these techniques can be
adapted, the characteristics of text retrieval systems lend themselves to
some new techniques. We describe a novel class of join method based on {\em
probing} that is especially useful for joins with text systems, and we
present a cost model for the various alternative query processing methods.
We describe experimental results that confirm the utility of these methods.
We show that the space of query plans is extended due to the additional
techniques, and we describe an optimization algorithm for searching this
extended space. Finally, we argue that the techniques we describe in this
paper may be more generally applicable to other types of external data
managers loosely integrated with a database system.
Copyright © 1995 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
Online Version (ACM WWW Account required): Full Text in PDF Format
CDROM Version: Load the CDROM "Volume 1 Issue 1, SIGMOD '93-'97" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Printed Edition
Michael J. Carey, Donovan A. Schneider (Eds.):
Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, San Jose, California, May 22-25, 1995.
ACM Press 1995
,
SIGMOD Record 24(2),
June 1995
Contents
[Index Terms]
[Full Text in PDF Format, 1456 KB]
References
- [ACM93]
- Serge Abiteboul, Sophie Cluet, Tova Milo:
Querying and Updating the File.
VLDB 1993: 73-84
![bibliographical record in XML](../../xml.gif)
- [AS91]
- Walid G. Aref, Hanan Samet:
Optimization for Spatial Query Processing.
VLDB 1991: 81-90
![bibliographical record in XML](../../xml.gif)
- [BG92]
- Ludger Becker, Ralf Hartmut Güting:
Rule-Based Optimization and Query Processing in an Extensible Geometric Database System.
ACM Trans. Database Syst. 17(2): 247-303(1992)
![bibliographical record in XML](../../xml.gif)
- [BGWR81]
- Philip A. Bernstein, Nathan Goodman, Eugene Wong, Christopher L. Reeve, James B. Rothnie Jr.:
Query Processing in a System for Distributed Databases (SDD-1).
ACM Trans. Database Syst. 6(4): 602-625(1981)
![bibliographical record in XML](../../xml.gif)
- [BRG88]
- Elisa Bertino, Fausto Rabitti, Simon J. Gibbs:
Query Processing in a Multimedia Document System.
ACM Trans. Inf. Syst. 6(1): 1-41(1988)
![bibliographical record in XML](../../xml.gif)
- [CACS94]
- Vassilis Christophides, Serge Abiteboul, Sophie Cluet, Michel Scholl:
From Structured Documents to Novel Query Facilities.
SIGMOD Conference 1994: 313-324
![bibliographical record in XML](../../xml.gif)
- [CDY]
- ...
- [CGK89]
- Danette Chimenti, Ruben Gamboa, Ravi Krishnamurthy:
Towards on Open Architecture for LDL.
VLDB 1989: 195-203
![bibliographical record in XML](../../xml.gif)
- [CHK+91]
- Tim Connors, Waqar Hasan, Curtis P. Kolovson, Marie-Anne Neimat, Donovan A. Schneider, W. Kevin Wilkinson:
The Papyrus Integrated Data Server.
PDIS 1991: 139
![bibliographical record in XML](../../xml.gif)
- [CM94]
- Mariano P. Consens, Tova Milo:
Optimizing Queries on Files.
SIGMOD Conference 1994: 301-312
![bibliographical record in XML](../../xml.gif)
- [CMU94]
- ...
- [Cor94]
- ...
- [CS93]
- Surajit Chaudhuri, Kyuseok Shim:
Query Optimization in the Presence of Foreign Functions.
VLDB 1993: 529-542
![bibliographical record in XML](../../xml.gif)
- [DH91]
- ...
- [Fal85]
- Christos Faloutsos:
Access Methods for Text.
ACM Comput. Surv. 17(1): 49-74(1985)
![bibliographical record in XML](../../xml.gif)
- [Fal92]
- ...
- [GD87]
- Goetz Graefe, David J. DeWitt:
The EXODUS Optimizer Generator.
SIGMOD Conference 1987: 160-172
![bibliographical record in XML](../../xml.gif)
- [GHK92]
- Sumit Ganguly, Waqar Hasan, Ravi Krishnamurthy:
Query Optimization for Parallel Execution.
SIGMOD Conference 1992: 9-18
![bibliographical record in XML](../../xml.gif)
- [Hew92]
- ...
- [HFLP89]
- Laura M. Haas, Johann Christoph Freytag, Guy M. Lohman, Hamid Pirahesh:
Extensible Query Processing in Starburst.
SIGMOD Conference 1989: 377-388
![bibliographical record in XML](../../xml.gif)
- [HHK+93]
- Waqar Hasan, Michael L. Heytens, Curtis P. Kolovson, Marie-Anne Neimat, Spyros Potamianos, Donovan A. Schneider:
Papyrus GIS Demonstration.
SIGMOD Conference 1993: 554-555
![bibliographical record in XML](../../xml.gif)
- [HS93]
- Joseph M. Hellerstein, Michael Stonebraker:
Predicate Migration: Optimizing Queries with Expensive Predicates.
SIGMOD Conference 1993: 267-276
![bibliographical record in XML](../../xml.gif)
- [KMP93]
- Alfons Kemper, Guido Moerkotte, Klaus Peithner:
A Blackboard Architecture for Query Optimization in Object Bases.
VLDB 1993: 543-554
![bibliographical record in XML](../../xml.gif)
- [Lib94]
- ...
- [LMH+85]
- ...
- [LS88]
- Clifford A. Lynch, Michael Stonebraker:
Extended User-Defined Indexing with Application to Textual Databases.
VLDB 1988: 306-317
![bibliographical record in XML](../../xml.gif)
- [LW90]
- Wan-Lik Lee, Darrell Woelk:
Integration of Text Search with ORION.
IEEE Data Eng. Bull. 13(1): 56-62(1990)
![bibliographical record in XML](../../xml.gif)
- [MDZ93]
- Gail Mitchell, Umeshwar Dayal, Stanley B. Zdonik:
Control of an Extensible Query Optimizer: A Planning-Based Approach.
VLDB 1993: 517-528
![bibliographical record in XML](../../xml.gif)
- [SAC+79]
- Patricia G. Selinger, Morton M. Astrahan, Donald D. Chamberlin, Raymond A. Lorie, Thomas G. Price:
Access Path Selection in a Relational Database Management System.
SIGMOD Conference 1979: 23-34
![bibliographical record in XML](../../xml.gif)
- [Sal89]
- Gerard Salton:
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer.
Addison-Wesley 1989, ISBN 0-201-12227-8
![bibliographical record in XML](../../xml.gif)
- [SJGP90]
- Michael Stonebraker, Anant Jhingran, Jeffrey Goh, Spyros Potamianos:
On Rules, Procedures, Caching and Views in Data Base Systems.
SIGMOD Conference 1990: 281-290
![bibliographical record in XML](../../xml.gif)
- [YA94]
- Tak W. Yan, Jurgen Annevelink:
Integrating a Structured-Text Retrieval System with an Object-Oriented Database System.
VLDB 1994: 740-749
![bibliographical record in XML](../../xml.gif)
- [YC85]
- ...
Copyright © Fri Mar 12 17:21:32 2010
by Michael Ley (ley@uni-trier.de)