Introduction
Welcome
PC Chairs message
Keynotes
Programme
Proceedings
Presentations
Accepted papers
Program committees
Conference officers
Important Dates
Guidelines for Authors
Submission
Workshops
Calls
Sponsorship
Co-located conferences
Registration
Venue
Auckland Airport Transport
Auckland Accommodation
Tourism
Conference Details

Conference video

Contact

Conference video

Monday, 25th August 2008

8:30 - 9:00

Opening Ceremony

Monday

9:00 - 10:15

Keynote

  • Is Transactional Memory an Oxymoron? video
    Mark D. Hill (University of Wisconsin-Madison).

10:45 - 12:30

Research Session 1 Systems A

Session Chair: Ken Ross

  • Constrained Physical Design Tuning video
    Nicolas Bruno (Microsoft Research, USA), Surajit Chaudhuri (Microsoft Research, USA).
  • Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases video
    Dieter Van de Craen (Hasselt University), Frank Neven (Hasselt University), Anastasios Kementsietsidis (IBM T.J. Watson Research Center), Stijn Vansummeren (Hasselt University).
  • Clustera: An Integrated Computation and Data Management System video
    David DeWitt (UW - Madison), Eric Robinson (UW - Madison), Srinath Shankar (UW - Madison), Erik Paulson (UW - Madison), Jeffrey Naughton (UW - Madison), Andrew Krioukov (UW - Madison), Joshua Royalty (UW - Madison).
  • Performance Profiling with EndoScope, an Acquisitional Software Monitoring Framework video
    Alvin Cheung (MIT CSAIL, USA), Samuel Madden (MIT CSAIL, USA).

Research Session 2 Mining A

Session Chair: Phil Gibbon

  • Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries (New Date and Time!) video
    Dominik Slezak (Infobright), Jakub Wroblewski (Infobright), Victoria Eastwood (Infobright), Piotr Synak (Infobright).
  • Plan-based Complex Event Detection across Distributed Sources video
    Mert Akdere (Brown University), Ugur Cetintemel (Brown University), Nesime Tatbul (ETH Zurich).
  • Finding Relevant Patterns in Bursty Sequences video
    Alexander Lachmann (Cornell University), Mirek Riedewald (Cornell University).
  • Constrained Locally Weighted Clustering video
    Hao Cheng (University of Central Florida), Kien Hua (University of Central Florida), Khanh Vu (University of Central Florida).

Research Session 3 Privacy & Authentication

Session Chair: N.N.

  • Resisting Structural Re-identification in Anonymized Social Networks video
    Michael Hay (University of Massachusetts Amherst), Gerome Miklau (University of Massachusetts Amherst), David Jensen (University of Massachusetts Amherst), Don Towsley (University of Massachusetts Amherst), Philipp Weis (University of Massachusetts Amherst).
  • Privacy-preserving Anonymization of Set-valued Data video
    Manolis Terrovitis (Univeristy of Hong Kong), Nikos Mamoulis (Univeristy of Hong Kong), Panos Kalnis (National University of Singapore).
  • Authenticating Query Results for Text Search Engines video
    HweeHwa Pang (Singapore Management University), Kyriakos Mouratidis (Singapore Management University).
  • Structural Signatures for Tree Data Structures video
    Ashish Kundu (Purdue University, USA), Elisa Bertino (Purdue University, USA).

Tutorial Session 1 Business Process

Session Chair: Dinesh Das

  • Querying and Monitoring Distributed Business Processes video
    Tova Milo (Tel Aviv University, Israel), Daniel Deutch (Tel Aviv University, Israel).

Demo Group 1 XML

  • eXtract: A Snippet Generation System for XML Search
    Yu Huang, Ziyang Liu, Yi Chen.
  • Language-Integrated Querying of XML Data in SQL Server
    James Terwilliger, Sergey Melnik, Philip Bernstein.
  • XTCcmp: XQuery Compilation on XTC
    Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen.
  • Periscope/GQ: A Graph Querying Toolkit
    Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler.
  • SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data
    Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena.
  • Process Spaceship: Process Views Discovery and Exploration
    Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul.

14:00 - 15:15

Research Session 4 Web

Session Chair: Jens Dittrich

  • Maintaining Dynamic Channel Profiles on the Web video
    Haggai Roitman (IBM), David Carmel (IBM-Haifa Research Lab), Elad Yom-Tov (IBM-Haifa Research Lab).
  • WYSIWYG Development of Data Driven Web Applications video
    Fan Yang (Yahoo), Chavdar Botev (Cornell University), Nitin Gupta (Cornell University), Elizabeth Churchill (Yahoo! Research), Levchenko George (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research).
  • Web Page Language Identification Based on URLs video
    Eda Baykan (EPF Lausanne), Monika Henzinger (EPF Lausanne), Ingmar Weber (EPF Lausanne).

Research Session 5 Query Optimization

Session Chair: Shivnath Babu

  • Parallelizing Query Optimization video
    Wook-Shin Han (Kyungpook National University), Wooseong Kwak (Kyungpook National University), Jinsoo Lee (Kyungpook National University), Guy Lohman (IBM Research Almaden, USA), Volker Markl (IBM Research Almaden, USA).
  • Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries video
    Marios Hadjieleftheriou (AT&T Labs Inc. ), Xiaohui Yu (York University), Nick Koudas (U of Toronto), Divesh Srivastava (AT&T, USA).
  • Tighter Estimation using Bottom-k Sketches video
    Edith Cohen (AT&T, USA), Haim Kaplan (Tel Aviv University).

Research Session 6 Schema A

Session Chair: Peter Buneman

  • STBenchmark: Towards a Benchmark for Mapping Systems video
    Bogdan Alexe (UC Santa Cruz), Wang-Chiew Tan (UC Santa Cruz), Yannis Velegrakis (University of Trento).
  • Interactive Source Registration in Community-oriented Information Integration video
    Yannis Katsis (UC San Diego), Alin Deutsch (UC San Diego), Yannis Papakonstantinou (UC San Diego).
  • Data Exchange with Data-Metadata Translations video
    Mauricio Hernandez (IBM Almaden Research Center), Paolo Papotti (Universita Roma Tre), Wang-Chiew Tan (UC Santa Cruz).

Tutorial Session 2 Dataspaces

Session Chair: Xiaofang Zhou

  • Dataspaces video
    Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA).

Demo Group 2 P2P

  • P3N: Profiling the Potential of a Peer-based Data Management System
    Mihai Lupu, Y. C. Tay.
  • P2P Logging and Timestamping for Reconciliation
    Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia.
  • AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network
    Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer.
  • WebContent: Efficient P2P Warehousing of Web Data
    Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, Mohamed Ouazara, Aditya Somani, Nicolas Travers,, Gabriel Vasile,, Spyros Zoupanos.
  • DObjects: Enabling Distributed Data Services for Metacomputing Platforms
    Pawel Jurczyk, Li Xiong.
  • EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution
    Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis.

15:45 - 17:00

Industry Session 7 Web

Session Chair: Meichun Hsu

  • SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting video
    Ahmed Metwally (UCSB), Fatih Emekci (UCSB), Divyakant Agrawal (UCSB), Amr El Abbadi (UCSB).
  • Energy Cost, The Key Challenge of Today's Data Centers: A Power Consumption Analysis of TPC-C Results Cost,The Key_Challenge_of_Today's_Data_Centers-A_Power_Consumption_Analysis_of_TPC-C_Results.avi video
    Meikel Poess (Oracle USA), Raghunath Othayoth Nambiar (Hewlett-Packard).
  • Google's Deep-Web Crawl video
    Jayant Madhavan (Google), David Ko (Google), Lucja Kot (Cornell University), Vignesh Ganapathy (Google), Alex Rasmussen (University of California - San Diego), Alon Halevy (Google).

Research Session 8 Stream Processing

Session Chair: Zack Ives

  • Out-of-Order Processing: A New Architecture for High-Performance Stream Systems video
    Jin Li (Portland State University), Kristin Tufte (Portland State University), Vladislav Shkapenyuk (AT&T labs - Research), Vassilis Papadimos (Portland State University), Theodore Johnson (AT&T labs - Research), David Maier (Portland State University).
  • StreamTX: Extracting Tuples from Streaming XML Data video
    Wook-Shin Han (Kyungpook National University), Haifeng Jiang (Google), Howard Ho (IBM Almaden Research Center), Quanzhong Li (IBM).
  • Sliding-Window Top-k Queries on Uncertain Streams video
    Cheqing Jin (ECUST), Ke Yi (Hong Kong University of Science and Technology), Lei Chen (Hong Kong University of Science and Technology), Jeffrey Xu Yu (Chin. U. HK), Xuemin Lin (UNSW).

Research Session 9 Query Processing in Uncertain Databases

Session Chair: Reynold Cheng

  • Conditioning Probabilistic Databases video
    Christoph Koch (Cornell University), Dan Olteanu (Oxford University).
  • Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases video
    George Beskales (University of Waterloo), Mohamed Soliman (University of Waterloo), Ihab Francis Ilyas (University of Waterloo).
  • BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models video
    Daisy Zhe Wang (UC Berkeley), Eirinaios Michelakis (UC Berkeley), Minos Garofalakis (Yahoo Research, USA), Joseph Hellerstein (UC Berkeley).

Tutorial Session 3 

Session Chair: Yannis Velegrakis

  • Ontologies and Databases: Myths and Challenges video
    Enrico Franconi (Free University of Bozen-Bolzano, Italy).

Demo Group 3 Web, Textual data

  • AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications
    Cristian Duda, Gianni Frey, Donald Kossman, Chong Zhou.
  • ManyAspects: A System for Highlighting Diverse Concepts in Documents
    Kun Liu, Evimaria Terzi, Tyrone Grandison.
  • Large-Scale Collaborative Analysis and Extraction of Web Data
    Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke.
  • An Effective and Versatile Keyword Search Engine on Heterogenous Data
    Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou.
  • DBPubs: Multidimensional Exploration of Database Publications
    Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham.
  • Semandaq: A Data Quality System Based on Conditional Functional Dependencies
    Wenfei Fan, Floris Geerts, Xibei Jia.

Tuesday, 26th August 2008

9:00 - 10:15

Keynote

  • Databases and the Silification of Health video
    Justin Zobel (NICTA, University of Melbourne).

10:45 - 12:30

Session 10 Experiments & Analyses

Session Chair: Volker Markl

  • Finding Frequent Items in Data Streams video
    Graham Cormode (AT&T Labs, USA), Marios Hadjieleftheriou (AT&T Labs, USA).
  • Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures video
    Hui Ding (Northwestern University), Goce Trajcevski (Northwestern University), Hui Ding (Northwestern University), Peter Scheuermann (Northwestern University), Xiaoyue Wang (University of California, Riverside), Eamonn Keogh (University of California, Riverside).
  • Column-Store Support for RDF Data Management: Not All Swans are White video
    Lefteris Sidirourgos (CWI, Amsterdam, The Netherlands), Romulo Goncalves (CWI, Amsterdam, The Netherlands), Martin Kersten (CWI, Amsterdam, The Netherlands), Niels Nes (CWI, Amsterdam, The Netherlands), Stefan Manegold (CWI, Amsterdam, The Netherlands).
  • Prefix based numbering schemes for XML : Techniques, Applications and Performances video
    Virginie Sans (ETIS - CNRS ENSEA Univ Cergy-Pontoise), Dominique Laurent (ETIS - CNRS ENSEA Univ Cergy-Pontoise).
  • A Benchmark for Evaluating Moving Objects Indexes video
    Su Chen (National University of Singapore), Dan Lin (Purdue University), Christian Jensen (Aalborg University, Denmark).
  • Dwarfs in the Rearview Mirror: How Big are they Really? video
    Jens Dittrich (ETH Zurich), Lukas Blunschi (ETH Zurich), Marcos Antonio Vaz Salles (ETH Zurich).

Research Session 11 Theory

Session Chair: Alin Deutsch

  • Type Inference and Type Checking for Queries on Execution Traces video
    Daniel Deutch (Tel Aviv University), Tova Milo (Tel Aviv University).
  • Taming Verification Hardness: An Efficient Algorithm For Testing Subgraph Isomorphism video
    Haichuan Shang (UNSW), Ying Zhang (UNSW), Xuemin Lin (UNSW), Jeffrey Xu Yu (Chin. U. HK).
  • On Generating Near-Optimal Tableaux for Conditional Functional Dependencies video
    Lukasz Golab (AT&T Labs - Research), Howard Karloff (AT&T Labs - Research), Flip Korn (AT&T Labs - Research), Divesh Srivastava (AT&T Labs - Research), Bei Yu (Singapore-MIT Alliance (SMA), Singapore).
  • Propagating Functional Dependencies with Conditions video
    Wenfei Fan (University of Edinburgh, UK), Shuai Ma (University of Edinburgh, UK), Yanli Hu (University of Edinburgh, UK), Jie Liu (Chinese Academy of Sciences, China), Yinghui Wu (University of Edinburgh, UK).

Research Session 12 Web Rank & PubSub

Session Chair: Alexandros Labrinidis

  • Simrank++: Query Rewriting through Link Analysis of the Click Graph video
    Ioannis Antonellis (Stanford University), Hector Garcia-Molina (Stanford University), Chi-Chao Chang (Yahoo!).
  • Accuracy Estimate and Optimization Techniques for SimRank Computation video
    Dmitry Lizorkin (ISP RAS), Pavel Velikhov (ISP RAS), Maxim Grinev (ISP RAS), Denis Turdakov (ISP RAS).
  • End-to-End Support for Joins in Large-Scale Publish/Subscribe Systems video
    Badrish Chandramouli (Duke University), Jun Yang (Duke University).
  • Scalable Ranked Publish/Subscribe video
    Ashwin Machanavajjhala (Cornell University), Erik Vee (Yahoo! Research), Minos Garofalakis (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research).

Tutorial Session 4 Probabilistic Data Management

Session Chair: Paolo Papotti

  • Systems Aspects of Probabilistic Data Management video
    Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA).

Demo Group 4 Data integration, collaboration

  • RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration
    Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao.
  • Comparing and Evaluating Mapping Systems with STMark
    Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis.
  • Ad-Hoc Data Processing in the Cloud
    Dionysios Logothetis, Kenneth Yocum.
  • XTreeNet: Democratic Community Search
    Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis.
  • Making SENSE: Socially Enhanced Search and Exploration
    Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum.
  • AuditGuard: A system for database auditing under retention restrictions
    Wentian Lu, Gerome Miklau.

14:00 - 15:15

Industry Session 13 Massive Data

Session Chair: Neoklis Polyzotis

  • Industry-Scale Duplicate Detection video
    Melanie Weis (Hasso-Plattner-Institut), Felix Naumann (Hasso-Plattner-Institute), Ulrich Jehle (Schufa), Holger Schuster (Schufa), Jens Lufter (Schufa).
  • SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets video
    Ronnie Chaiken (Microsoft), Bob Jenkins (Microsoft), Paul Larson (Microsoft Research, USA), Bill Ramsey (Microsoft), Darren Shakib (Microsoft), Simon Weaver (Microsoft), Jingren Zhou (Microsoft Research, USA).
  • PNUTS: Yahoo!'s Hosted Data Serving Platform video not available
    Brian Cooper (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Adam Silberstein (Yahoo! Research), Phil Bohannon (Yahoo!), Hans-Arno Jacobsen (Yahoo! Research and University of Toronto), Nick Puz (Yahoo! Research), Daniel Weaver (Yahoo! Research), Ramana Yerneni (Yahoo! Research).

Research Session 14 XML Databases

Session Chair: Yi Chen

  • Dependable Cardinality Forecasts for XQuery video
    Jens Teubner (IBM T.J. Watson Research Center), Torsten Grust (Technische Universitat Munchen), Sebastian Maneth (NICTA), Sherif Sakr (NICTA).
  • Hash-based Subgraph Query Processing Method for Graph-structured XML Documents video
    Hongzhi Wang (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology), Jizhou Luo (Harbin Institute of Technology), Hong Gao (Harbin Institute of Technology).
  • Generating XML Structure Using Examples and Constraints video
    Sara Cohen (Hebrew University of Jerusalem).

Research Session 15 DB Performance & Evaluation

Session Chair: Nick Koudas

  • Read-Optimized Databases, In Depth video
    Allison Holloway (University of Wisconsin), David DeWitt (University of Wisconsin).
  • Flashing Up The Storage Layer video
    Ioannis Koltsidas (University of Edinburgh), Stratis Viglas (University of Edinburgh).
  • Rose: Compressed, Log-Structured Replication video
    Russell Sears (UC Berkeley), Mark Callaghan (Google), Eric Brewer (UC Berkeley).

Tutorial Session 5 Dataspaces

Session Chair: Xiaofang Zhou

  • Dataspaces video
    Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA).

Demo Group 5 Tuning, systems, optimization, etc

  • QueryScope: Visualizing Queries for Repeatable Database Tuning
    Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang.
  • When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?
    Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler.
  • H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
    Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jone, Samuel Madden, Michael Stonebraker, Daniel Abadi.
  • Organizing and Indexing Non-Convex Regions
    Eric Perlman, Randal Burns, Michael Kazhdan.
  • Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View
    Eric Paquet, Herna Viktor.
  • C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases
    Fan Guo, Lei Li, Eric Xing, Christos Faloutsos.

15:45 - 17:00

Industry Session 16 Storage & Sorting

Session Chair: Brian Cooper

  • Relational Support for Flexible Schema Scenarios video not available
    Srini Acharya (Microsoft Corp.), Peter Carlin (Microsoft Corp.), Cesar Galindo-Legaria (Microsoft Corp.), Krzysztof Kozielczyk (Microsoft Corp.), Pawel Terlecki (Microsoft Corp.), Peter Zabback (Microsoft Corp.).
  • Oracle Securefiles System video not available
    Niloy Mukherjee (Oracle), Bharath Aleti (Oracle), Amit Ganesh (Oracle), Krishna Kunchithapadam (Oracle), Scott Lynn (Oracle), Sujatha Muthulingam (Oracle), Kam Shergill (Oracle), Shaoyu Wang (Oracle), Wei Zhang (Oracle).
  • Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture video not available
    Jatin Chhugani (Intel Corporation), Skip Macy (Intel Corporation), Akram Baransi (Intel Corporation), Anthony Nguyen (Intel Corporation), Mostafa Hagog (Intel Corporation), Sanjeev Kumar (Intel Corporation), Victor Lee (Intel Corporation), Yen-Kuang Chen (Intel Corporation), Pradeep Dubey (Intel Corporation).

Research Session 17 Web Queries

Session Chair: Nicolas Bruno

  • WebTables: Exploring the Power of Tables on the Web video not available
    Michael Cafarella (University of Washington), Alon Halevy (Google, Inc.), Daisy Zhe Wang (UC Berkeley), Eugene Wu (MIT), Yang Zhang (MIT).
  • Scalable Query Result Caching for Web Applications video not available
    Charles Garrod (Carnegie Mellon University), Amit Manjhi (Google), Bruce Maggs (Carnegie Mellon University), Todd Mowry (Carnegie Mellon University), Anthony Tomasic (Carnegie Mellon University), Christopher Olston (Yahoo! Research), Anastasia Ailamaki (Carnegie Mellon University).
  • Optimization of Multi-Domain Queries on the Web video not available
    Daniele Braga (Politecnico di Milano), Stefano Ceri (Politecnico di Milano), Florian Daniel (Politecnico di Milano), Davide Martinenghi (Politecnico di Milano).

Research Session 18 Distributed Systems Processing

Session Chair: Paul Larson

  • Fault-tolerant Stream Processing using a Distributed, Replicated File System video
    YongChul Kwon (University of Washington), Magdalena Balazinska (University of Washington), Albert Greenberg (Microsoft Research).
  • LEEWAVE: Level-Wise Distribution of Wavelet Coefficients for Processing kNN Queries over Distributed Streams video
    Mi-Yen Yeh (National Taiwan University), Kun-Lung Wu (IBM T. J. Watson Research Center), Philip Yu (University of Illinois at Chicago), Ming-Syan Chen (National Taiwan University).
  • A Practical Scalable Distributed B-Tree video not available
    Marcos Aguilera (HP Labs), Wojciech Golab (University of Toronto), Mehul Shah (HP Labs).

Tutorial Session 6 Data Cleaning

Session Chair: Anastasios Kementsietsidis

  • A Revival of Integrity Constraints for Data Cleaning video
    Wenfei Fan (University of Edinburgh, UK and Bell Labs, USA), Floris Geerts (University of Edinburgh, UK).

Demo Group 1 XML

  • Xnippet: Generating Query Biased Result Snippet for XML Search
    Yu Huang, Ziyang Liu, Ziyang Liu.
  • Language-Integrated Querying of XML Data in SQL Server
    James Terwilliger, Sergey Melnik, Philip Bernstein.
  • XTCcmp: XQuery Compilation on XTC
    Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen.
  • Periscope/GQ: A Graph Querying Toolkit
    Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler.
  • SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data
    Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena.
  • Process Spaceship: Process Views Discovery and Exploration
    Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul.

Wednesday, 27th August 2008

9:00 - 10:15

10 Year Best Paper Award Session

  • A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces video
    Roger Weber, Hans-Jorg Schek, Stephen Blott.

10:45 - 12:30

Research Session 19 System Centric Optimization

Session Chair: Timos Sellis

  • Main-Memory Scan Sharing For Multi-Core CPUs video
    Lin Qiao (IBM Almaden Research Lab), Vijayshankar Raman (IBM Almaden Research Lab), Frederick Reiss (IBM Almaden Research Lab), Peter Haas (IBM Almaden Research Lab), Guy Lohman (IBM Almaden Research Lab).
  • Row-wise Parallel Predicate Evaluation video
    Ryan Johnson (Carnegie Mellon University), Vijayshankar Raman (IBM Almaden Research Lab), Richard Sidle (IBM Almaden Research Lab), Garret Swart (Oracle).
  • Dynamic Partitioning of the Cache Hierarchy in Shared Data Centers video
    Gokul Soundararajan (University of Toronto), Jin Chen (University of Toronto), Mohamed Sharaf (University of Toronto), Cristiana Amza (University of Toronto).
  • RDF-3X: a RISC-style Engine for RDF video
    Thomas Neumann (Max-Planck-Institut Informatik), Gerhard Weikum (MPI).

Research Session 20 IR & Forms

Session Chair: Justin Zobel

  • Multidimensional Content eXploration video
    Alkis Simitsis (IBM Research Almaden, USA), Akanksha Baid (University of Wisconsin-Madison), Yannis Sismanis (IBM Research Almaden, USA), Berthold Reinwald (IBM Research Almaden, USA).
  • Relaxation in Text Search using Taxonomies 02-Relaxation_in_Text_Search_using_Taxonomies.avi video
    Marcus Fontoura (Yahoo! Research), Vanja Josifovski (Yahoo! Research), Ravi Kumar (Yahoo! Research), Christopher Olston (Yahoo! Research), Sergei Vassilvitskii (Yahoo! Research), Andrew Tomkins (Yahoo! Research).
  • Learning to Extract Form Labels video
    Hoa Nguyen (University of Utah), Thanh Nguyen (University of Utah), Juliana Freire (University of Utah).
  • Automated Creation of a Forms video
    Magesh Jayapandian (University of Michigan), H V Jagadish (University of Michigan).

Research Session 21 New Topics

Session Chair: Xiaofang Zhou

  • Efficient Network-Aware Search in Collaborative Tagging Sites video
    Michael Benedikt (Oxford University), Sihem Amer Yahia (Yahoo Research, USA), Laks Lakshmanan (University of British Columbia), Julia Stoyanovich (Columbia University).
  • Cleaning Uncertain Data with Quality Guarantees video
    Reynold Cheng (Hong Kong Polytechnic University, China), Jinchuan Chen (Hong Kong Polytechnic University, China), Xike Xie (Hong Kong Polytechnic University, China).
  • On the Provenance of Non-Answers to Queries over Extracted Data video
    Jiansheng Huang (Univ. of Wisconsin-Madison), Ting Chen (Univ. of Wisconsin-Madison), AnHai Doan (Univ. of Wisconsin-Madison), Jeffrey Naughton (Univ. of Wisconsin-Madison).
  • Dynamic Active Probing of Helpdesk Databases video
    Shenghuo Zhu (NEC Lab), Tao Li (Florida International University), Zhiyuan Chen (UMBC), Dingding Wang (Florida International University), Yihong Gong (NEC Lab).

Tutorial Session 7 XML Structural Summaries

Session Chair: Marios Hatzieleftheriou

  • XML Structural Summaries video
    Mirella M. Moro (Univ. Fed. Rio Grande do Sul, Brazil), Zografoula Vagena (Microsoft Research, UK), Vassilis J. Tsotras (University of California Riverside, USA).

Demo Group 2 P2P

  • P3N: Profiling the Potential of a Peer-based Data Management System
    Mihai Lupu, Y. C. Tay.
  • P2P Logging and Timestamping for Reconciliation
    Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia.
  • AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network
    Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer.
  • WebContent: Efficient P2P Warehousing of Web Data
    Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, Mohamed Ouazara, Aditya Somani, Nicolas Travers,, Gabriel Vasile,, Spyros Zoupanos.
  • DObjects: Enabling Distributed Data Services for Metacomputing Platforms
    Pawel Jurczyk, Li Xiong.
  • EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution
    Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis.

14:00 - 15:15

Industry Session 22 Query Optimization

Session Chair: N.N.

  • Efficiently Approximating Query Optimizer Plan Diagrams video
    Atreyee Dey (Indian Institute of Science), Sourjya Bhaumik (Indian Institute of Science), Harish D (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science).
  • Mining Search Engine Query Logs via Suggestion Sampling (New Date and Time!) video
    Maxim Gurevich (Technion), Ziv Bar-Yossef (Google and Technion).
  • Optimizer Plan Change Management: Improved Stability and Performance in Oracle 11g video
    Mohamed Ziauddin (Oracle), Dinesh Das (Oracle), Hong Su (Oracle), Yali Zhu (Oracle), Khaled Yagoub (Oracle).

Research Session 23 Schema B

Session Chair: Ralf Schenkel

  • Graceful Database Schema Evolution: the PRISM Workbench video
    Carlo Curino (Politecnico di Milano), Hyun Moon (UCLA), Carlo Zaniolo (UCLA).
  • Analyzing and Revising Data Integration Schemas to Improve Their Matchability video
    Xiaoyong Chai (University of Wisconsin-Madiso), Mayssam Sayyadian (University of Wisconsin-Madiso), AnHai Doan (University of Wisconsin-Madiso), Arnon Rosenthal (The MITRE Corporation), Len Seligman (The MITRE Corporation).
  • Learning to Create Data-Integrating Queries video
    Partha Talukdar (University of Pennsylvania ), Marie Jacob (University of Pennsylvania), Mohammad Mehmood (University of Pennsylvania ), Koby Crammer (University of Pennsylvania), Zachary Ives (University of Pennsylvania ), Fernando Pereira (University of Pennsylvania), Sudipto Guha (University of Pennsylvania).

Research Session 24 Uncertain DB B (Rel & AC)

Session Chair: Lei Chen

  • Approximate Lineage for Probabilistic Databases video
    Chris Re (University of Washington), Dan Suciu (University of Washington and Microsoft).
  • Exploiting Shared Correlations in Probabilistic Databases video
    Prithviraj Sen (University of Maryland), Amol Deshpande (University of Maryland), Lise Getoor (University of Maryland).
  • Access Control over Uncertain Data video
    Vibhor Rastogi (University of Washington), Dan Suciu (University of Washington and Microsoft), Evan Welbourne (University of Washington).

Tutorial Session 8 

Session Chair: N.N.

  • Ontologies and Databases: Myths and Challenges video
    Enrico Franconi (Free University of Bozen-Bolzano, Italy).

Demo Group 4 Data integration, collaboration

  • RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration
    Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao.
  • Comparing and Evaluating Mapping Systems with STMark
    Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis.
  • Ad-Hoc Data Processing in the Cloud
    Dionysios Logothetis, Kenneth Yocum.
  • XTreeNet: Democratic Community Search
    Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis.
  • Making SENSE: Socially Enhanced Search and Exploration
    Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum.
  • AuditGuard: A system for database auditing under retention restrictions
    Wentian Lu, Gerome Miklau.

15:45 - 17:00

Industry Session 25 Query Processing

Session Chair: Jayant Haritsa

  • Towards a Physical XML independent XQuery/SQL/XML Engine video
    Zhen Hua Liu (Oracle), Thomas Baby (Oracle), Sivasankaran Chandrasekar (Oracle), Hui Chang (Oracle).
  • Closing The Query Processing Loop in Oracle 11g video
    Mohamed Zait (Oracle), Allison Lee (Oracle).
  • Towards a Streaming SQL Standard video
    Stan Zdonik (Streambase,Inc.), Namit Jain (Oracle), Shailendra Mishra (Oracle), Anand Srinivasan (Oracle), Johannes Gehrke (Cornell University, USA), Jennifer Widom (Stanford University), Hari Balakrishnan (Streambase,Inc.), Mitch Cherniack (Streambase,Inc.), Ugur Cetintemel (Streambase,Inc.), Richard Tibbetts (Streambase,Inc.).

Research Session 26 Privacy Preservation

Session Chair: Elisa Bertino

  • Anonymizing Bipartite Graph Data using Safe Groupings video
    Graham Cormode (AT&T Labs, USA), Divesh Srivastava (AT&T Labs, USA), Ting Yu (North Carolina State University), Qing Zhang (North Carolina State University).
  • Privacy Preserving Serial Data Publishing By Role Composition video
    Yingyi Bu (The Chinese University of HK), Ada WaiChee Fu (The Chinese University of Hong Kong), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Lei Chen (The Hong Kong University of Science and Technology), Jiuyong Li (University of South Australia).
  • Output Perturbation with Query Relaxation video
    Xiaokui Xiao (The Chinese University of Hong Kong), Yufei Tao (The Chinese University of Hong Kong).

Research Session 27 Temporal Indexing & Searching

Session Chair: Cyrus Shahabi

  • Transaction Time Indexing with Version Compression video
    David Lomet (Microsoft Research, USA), Mingsheng Hong (Cornell University), Rimma Nehme (Purdue University), Rui Zhang (University of Melbourne).
  • Managing and Querying Transaction-time Databases under Schema Evolution video
    Hyun Moon (UCLA), Carlo Curino (Politecnico di Milano), Alin Deutsch (UCSD), Chien-Yi Hou (UCSD), Carlo Zaniolo (UCLA).
  • On Efficiently Searching Trajectories and Archival Data for Historical Similarities - On Efficiently Searching Trajectories and Archival Data for Historical Similarities.avi video
    Reza Sherkat (IBM Toronto Lab.), Davood Rafiei (University of Alberta ).

Tutorial Session 9 Probabilistic Data Management

Session Chair: Paolo Papotti

  • Systems Aspects of Probabilistic Data Management video
    Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA).

Thursday, 28th August 2008

9:00 - 10:45

Research Session 28 Text & Keyword Query Processing

Session Chair: Jayant Madhavan

  • Keyword Query Cleaning video
    Ken Pu (UOIT), Xiaohui Yu (York University).
  • Reasoning and Identifying Relevant Matches for XML Keyword Search video
    Ziyang Liu (Arizona State University,USA), Yi Chen (Arizona State University,USA).
  • Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints video
    Chuan Xiao (University of New South Wales), Wei Wang (University of New South Wales), Xuemin Lin (University of New South Wales).
  • Scalable Ad-hoc Entity Extraction from Text Collections Collections.avi video
    Sanjay Agrawal (Microsoft Research), Kaushik Chakrabarti (Microsoft Research), Surajit Chaudhuri (Microsoft Research), Venkatesh Ganti (Microsoft Research).

Research Session 29 Systems B

Session Chair: Jingren Zhou

  • Scheduling Shared Scans of Large Data Files video
    Parag Agrawal (Stanford University), Daniel Kifer (Yahoo! Research), Christopher Olston (Yahoo! Research).
  • Online Maintenance of Very Large Random Samples on Flash Storage video not available
    Suman Nath (Microsoft Research), Phillip Gibbons (Intel Research).
  • A Skip-list Approach for Efficiently Processing Forecasting Queries video not available
    Tingjian Ge (Brown University), Stan Zdonik (Brown University).
  • A Request-Routing Framework for SOA-Based Enterprise Computing video not available
    Thomas Phan (IBM Almaden), Wen-Syan Li (SAP Research Center - China).

Research Session 30 Indexing & Query Processing

Session Chair: Chen Li

  • Hexastore: Sextuple Indexing for Semantic Web Data Management video
    Cathrin Weiss (University of Zurich), Panagiotis Karras (National University of Singapore), Abraham Bernstein (University of Zurich).
  • Indexing Land Surface for Efficient kNN Query video
    Cyrus Shahabi (Univ. of Southern California), Lu-An Tang (Univ. of Southern California), Songhua Xing (Univ. of Southern California).
  • Efficient Skyline Querying with Variable User Preferences on Nominal Attributes video
    Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Ada WaiChee Fu ( The Chinese University of HK), Jian Pei (Simon Fraser University), Yip Sing Ho (The Chinese University of HK), Tai Wong (The Chinese University of HK), Yubao Liu (Sun Yat-Sen University, China).
  • Efficient Top-K Processing over Query-Dependent Functions video
    Lin Guo (Yahoo! Research), Sihem Amer Yahia (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Erik Vee (Yahoo! Research).

Tutorial Session 10 Continuous Queries

  • Scheduling Continuous Queries in Data Stream Management Systems video
    Mohamed A. Sharaf (University of Toronto, Canada), Alexandros Labrinidis (University of Pittsburgh, USA), Panos K. Chrysanthis (University of Pittsburgh, USA).

Demo Group 3 Web, Textual data

  • AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications
    Cristian Duda, Gianni Frey, Donald Kossman, Chong Zhou.
  • ManyAspects: A System for Highlighting Diverse Concepts in Documents
    Kun Liu, Evimaria Terzi, Tyrone Grandison.
  • Large-Scale Collaborative Analysis and Extraction of Web Data
    Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke.
  • An Effective and Versatile Keyword Search Engine on Heterogenous Data
    Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou.
  • DBPubs: Multidimensional Exploration of Database Publications
    Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham.
  • Semandaq: A Data Quality System Based on Conditional Functional Dependencies
    Wenfei Fan, Floris Geerts, Xibei Jia.

11:15 - 13:00

Research Session 31 Spatial and Motion Data

Session Chair: Xuemin Lin

  • FINCH: Evaluating Reverse k-Nearest-Neighbor Queries on Location Data video not available
    Wei Wu (National University of Singapore), Fei Yang (National University of Singapore), Chee-Yong Chan (National University of Singapore), Kian-Lee Tan (National University of Singapore).
  • Discovery of Convoys in Trajectory Databases video not available
    Hoyoung Jeung (The university of queenslad), Man Lung Yiu (Aalborg University), Xiaofang Zhou (The university of queenslad), Christian Jensen (Aalborg University), Heng Tao Shen (The university of queenslad).
  • TraClass: Trajectory Classification Using Hierarchical Region-Based and Trajectory-Based Clustering video not available
    Jae-Gil Lee (UIUC), Jiawei Han (UIUC), Xiaolei Li (UIUC), Hector Gonzalez (UIUC).
  • The V*-Diagram: a Query-Dependent Method for Moving kNN Queries video not available
    Sarana Nutanong (The University of Melbourne), Rui Zhang (The University of Melbourne), Egemen Tanin (The University of Melbourne), Lars Kulik (The University of Melbourne).

Research Session 32 Query Processing

Session Chair: Divesh Srivastava

  • Rewriting Procedures for Batched Bindings video
    Ravindra Guravannavar (IIT Bombay), S. Sudarshan (IIT Bombay).
  • Identifying Robust Plans through Plan Diagram Reduction video
    Harish D (Indian Institute of Science), Pooja Darera (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science).
  • A Pay-As-You-Go Framework for Query Execution Feedback video
    Surajit Chaudhuri (Microsoft Research), Vivek Narasayya (Microsoft Research), Ravishankar Ramamurthy (Microsoft Research).
  • Evita Raced: Metacompilation for Declarative Networks video
    Tyson Condie (UC Berkeley), Joseph Hellerstein (UC Berkeley), Petros Maniatis (UC Berkeley), David Chu (UC Berkeley).

Research Session 33 Mining B & External Memory

Session Chair: Shinichi Morishita

  • Discovering Data Quality Rules video
    Fei Chiang (University of Toronto), Renee Miller (University of Toronto).
  • Mining Non-Redundant High Order Correlations in Binary Data video
    Xiang Zhang (Univeristy of North Carolina), Feng Pan (Univeristy of North Carolina), Wei Wang (Univeristy of North Carolina), Andrew Nobel (Univeristy of North Carolina).
  • Keyword Search on External Memory Data Graphs video
    Bhavana Dalvi (IIT Bombay, India), Meghana Kshirsagar (IIT Bombay, India), S. Sudarshan (IIT Bombay, India).
  • Sorting Hierarchical Data in External Memory for Archiving video
    Ioannis Koltsidas (University of Edinburgh), Heiko Mueller (University of Edinburgh), Stratis Viglas (University of Edinburgh).

Tutorial Session 11 Clusters in High Dimensions

Session Chair: Stratis Viglas

  • Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering video
    Hans-Peter Kriegel (Ludwig-Maximilians-Universitat Munchen, Germany), Peer Kroger (Ludwig-Maximilians-Universitat Munchen, Germany), Arthur Zimek (Ludwig-Maximilians-Universitat Munchen, Germany).

Demo Group 5 Tuning, systems, optimization, etc

  • QueryScope: Visualizing Queries for Repeatable Database Tuning
    Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang.
  • When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?
    Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler.
  • H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
    Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jone, Samuel Madden, Michael Stonebraker, Daniel Abadi.
  • Organizing and Indexing Non-Convex Regions
    Eric Perlman, Randal Burns, Michael Kazhdan.
  • Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View
    Eric Paquet, Herna Viktor.
  • C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases
    Fan Guo, Lei Li, Eric Xing, Christos Faloutsos.