Programme
				
				Monday, 25th August 2008 
				8:30 - 9:00 
				Opening Ceremony 
				9:00 - 10:15 
				Keynote
				
				Is Transactional Memory an Oxymoron? 
					Mark D. Hill (University of Wisconsin-Madison)
					 
				 
				10:45 - 12:30 
				Research Session 1 Systems A 
				Session Chair: Ken Ross
				
				Constrained Physical Design Tuning
					Nicolas Bruno (Microsoft Research, USA), Surajit Chaudhuri (Microsoft Research, USA). 
					 
				Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases
					Dieter Van de Craen (Hasselt University), Frank Neven (Hasselt University), Anastasios Kementsietsidis (IBM T.J. Watson Research Center), Stijn Vansummeren (Hasselt University). 
					 
				Clustera: An Integrated Computation and Data Management System
					David DeWitt (UW - Madison), Eric Robinson (UW - Madison), Srinath Shankar (UW - Madison), Erik Paulson (UW - Madison), Jeffrey Naughton (UW - Madison), 
					Andrew Krioukov (UW - Madison), Joshua Royalty (UW - Madison). 
					 
				Performance Profiling with EndoScope, an Acquisitional Software Monitoring Framework
					Alvin Cheung (MIT CSAIL, USA), Samuel Madden (MIT CSAIL, USA). 
					 
				 
				Research Session 2 Mining A 
				Session Chair: Phil Gibbons
				
				Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries (New Date and Time!)
					Dominik Slezak (Infobright), Jakub Wroblewski (Infobright), Victoria Eastwood (Infobright), Piotr Synak (Infobright).
					 
				Plan-based Complex Event Detection across Distributed Sources
					Mert Akdere (Brown University), Ugur Cetintemel (Brown University), Nesime Tatbul (ETH Zurich).  
					 
				Finding Relevant Patterns in Bursty Sequences
					Alexander Lachmann (Cornell University), Mirek Riedewald (Cornell University). 
					 
				Constrained Locally Weighted Clustering
					Hao Cheng (University of Central Florida), Kien Hua (University of Central Florida), Khanh Vu (University of Central Florida). 
					 
				 
				Research Session 3 Privacy & Authentication
				Session Chair: N.N.
				
				Resisting Structural Re-identification in Anonymized Social Networks
					Michael Hay (University of Massachusetts Amherst), Gerome Miklau (University of Massachusetts Amherst), 
					David Jensen (University of Massachusetts Amherst), Don Towsley (University of Massachusetts Amherst), Philipp Weis (University of Massachusetts Amherst). 
					 
				Privacy-preserving Anonymization of Set-valued Data
					Manolis Terrovitis (University of Hong Kong), Nikos Mamoulis (University of Hong Kong), Panos Kalnis (National University of Singapore).
					 
				Authenticating Query Results for Text Search Engines
					HweeHwa Pang (Singapore Management University), Kyriakos Mouratidis (Singapore Management University). 
					 
				Structural Signatures for Tree Data Structures
					Ashish Kundu (Purdue University, USA), Elisa Bertino (Purdue University, USA). 
					 
				 
				Tutorial Session 1 Business Process 
				Session Chair: Dinesh Das
				
				Querying and Monitoring Distributed Business Processes
					Tova Milo (Tel Aviv University, Israel), Daniel Deutch (Tel Aviv University, Israel). 
					 
				 
				Demo Group 1 XML 
				
				eXtract: A Snippet Generation System for XML Search
					Yu Huang, Ziyang Liu, Yi Chen. 
					 
				Language-Integrated Querying of XML Data in SQL Server
					James Terwilliger, Sergey Melnik, Philip Bernstein. 
					 
				XTCcmp: XQuery Compilation on XTC
					Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen. 
					 
				Periscope/GQ: A Graph Querying Toolkit
					Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler. 
				 
				SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data
					Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena. 
					 
				Process Spaceship: Process Views Discovery and Exploration
					Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul. 
					 
				 
				14:00 - 15:15 
				Research Session 4 Web
				Session Chair: Jens Dittrich
				
				Maintaining Dynamic Channel Profiles on the Web
					Haggai Roitman (IBM), David Carmel (IBM-Haifa Research Lab), Elad Yom-Tov (IBM-Haifa Research Lab). 
					 
				WYSIWYG Development of Data Driven Web Applications
					Fan Yang (Yahoo), Chavdar Botev (Cornell University), Nitin Gupta (Cornell University), Elizabeth Churchill (Yahoo! Research), 
					Levchenko George (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research). 
					 
				Web Page Language Identification Based on URLs
					Eda Baykan (EPF Lausanne), Monika Henzinger (EPF Lausanne), Ingmar Weber (EPF Lausanne). 
					 
				 
				Research Session 5 Query Optimization
				Session Chair: Shivnath Babu
				
				Parallelizing Query Optimization
					Wook-Shin Han (Kyungpook National University), Wooseong Kwak (Kyungpook National University), Jinsoo Lee (Kyungpook National University), Guy Lohman (IBM Research Almaden, USA), 
					Volker Markl (IBM Research Almaden, USA). 
					 
				Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries
					Marios Hadjieleftheriou (AT&T Labs Inc.), Xiaohui Yu (York University), Nick Koudas (U of Toronto), Divesh Srivastava (AT&T, USA). 
					 
				Tighter Estimation using Bottom-k Sketches
					Edith Cohen (AT&T, USA), Haim Kaplan (Tel Aviv University). 
					 
				 
				Research Session 6 Schema A
				Session Chair: Peter Buneman
				
				STBenchmark: Towards a Benchmark for Mapping Systems
					Bogdan Alexe (UC Santa Cruz), Wang-Chiew Tan (UC Santa Cruz), Yannis Velegrakis (University of Trento). 
					 
				Interactive Source Registration in Community-oriented Information Integration
					Yannis Katsis (UC San Diego), Alin Deutsch (UC San Diego), Yannis Papakonstantinou (UC San Diego). 
					 
				Data Exchange with Data-Metadata Translations 
					Mauricio Hernandez (IBM Almaden Research Center), Paolo Papotti (Universita Roma Tre), Wang-Chiew Tan (UC Santa Cruz). 
					 
				 
				Tutorial Session 2 Dataspaces 
				Session Chair: Xiaofang Zhou
				
				Dataspaces
					Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA). 
					 
				 
				Demo Group 2 P2P
				
				P3N: Profiling the Potential of a Peer-based Data Management System
					Mihai Lupu, Y. C. Tay.
					 
				P2P Logging and Timestamping for Reconciliation
					Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia. 
					 
				AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network
					Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer. 
					 
				WebContent: Efficient P2P Warehousing of Web Data
					Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, 
					Mohamed Ouazara, Aditya Somani, Nicolas Travers, Gabriel Vasile, Spyros Zoupanos. 
					 
				DObjects: Enabling Distributed Data Services for Metacomputing Platforms
					Pawel Jurczyk, Li Xiong. 
					 
				EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution
					Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis. 
					 
				 
				15:45 - 17:00
				Industry Session 7 Web
				Session Chair: Meichun Hsu
				
				SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting 
					Ahmed Metwally (UCSB), Fatih Emekci (UCSB), Divyakant Agrawal (UCSB), Amr El Abbadi (UCSB)
					 
				Energy Cost, The Key Challenge of Today's Data Centers: A Power Consumption Analysis of TPC-C Results 
					Meikel Poess (Oracle USA), Raghunath Othayoth Nambiar (Hewlett-Packard). 
					 
				Google's Deep-Web Crawl 
					Jayant Madhavan (Google), David Ko (Google), Lucja Kot (Cornell University), Vignesh Ganapathy (Google), Alex Rasmussen (University of California - San Diego), Alon Halevy (Google). 
					 
				 
				Research Session 8 Stream Processing
				Session Chair: Zack Ives
				
				Out-of-Order Processing: A New Architecture for High-Performance Stream Systems 
					Jin Li (Portland State University), Kristin Tufte (Portland State University), Vladislav Shkapenyuk (AT&T Labs - Research), Vassilis Papadimos (Portland State University), 
					Theodore Johnson (AT&T Labs - Research), David Maier (Portland State University). 
					 
				StreamTX: Extracting Tuples from Streaming XML Data 
					Wook-Shin Han (Kyungpook National University), Haifeng Jiang (Google), Howard Ho (IBM Almaden Research Center), Quanzhong Li (IBM). 
					 
				Sliding-Window Top-k Queries on Uncertain Streams 
					Cheqing Jin (ECUST), Ke Yi (Hong Kong University of Science and Technology), Lei Chen (Hong Kong University of Science and Technology), Jeffrey Xu Yu (Chin. U. HK), 
					Xuemin Lin (UNSW). 
					 
				 
				Research Session 9 Query Processing in Uncertain Databases 
				Session Chair: Reynold Cheng
				
				Conditioning Probabilistic Databases 
					Christoph Koch (Cornell University), Dan Olteanu (Oxford University). 
					 
				Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases 
					George Beskales (University of Waterloo), Mohamed Soliman (University of Waterloo), Ihab Francis Ilyas (University of Waterloo). 
					 
				BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models 
					Daisy Zhe Wang (UC Berkeley), Eirinaios Michelakis (UC Berkeley), Minos Garofalakis (Yahoo Research, USA), Joseph Hellerstein (UC Berkeley)
					 
				 
				Tutorial Session 3
				Session Chair: Yannis Velegrakis
				
				Ontologies and Databases: Myths and Challenges
					Enrico Franconi (Free University of Bozen-Bolzano, Italy). 
					 
				 
				Demo Group 3 Web, Textual data 
				
				AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications
					Cristian Duda, Gianni Frey, Donald Kossmann, Chong Zhou. 
					 
				ManyAspects: A System for Highlighting Diverse Concepts in Documents
					Kun Liu, Evimaria Terzi, Tyrone Grandison. 
					 
				Large-Scale Collaborative Analysis and Extraction of Web Data
					Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke. 
					 
				An Effective and Versatile Keyword Search Engine on Heterogenous Data
					Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou. 
					 
				DBPubs: Multidimensional Exploration of Database Publications
					Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham.
					 
				Semandaq: A Data Quality System Based on Conditional Functional Dependencies
					Wenfei Fan, Floris Geerts, Xibei Jia. 
					 
				 
				Tuesday, 26th August 2008 
				9:00 - 10:15 
				Keynote
				
				Databases and the Silification of Health 
					Justin Zobel (NICTA, University of Melbourne). 
					 
				 
				10:45 - 12:30 
				Session 10 Experiments & Analyses 
				Session Chair: Volker Markl
				
				Finding Frequent Items in Data Streams 
					Graham Cormode (AT&T Labs, USA), Marios Hadjieleftheriou (AT&T Labs, USA). 
					 
				Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures 
					Hui Ding (Northwestern University), Goce Trajcevski (Northwestern University), 
					Peter Scheuermann (Northwestern University), Xiaoyue Wang (University of California, Riverside), Eamonn Keogh (University of California, Riverside).  
					 
				Column-Store Support for RDF Data Management: Not All Swans are White 
					Lefteris Sidirourgos (CWI, Amsterdam, The Netherlands), Romulo Goncalves (CWI, Amsterdam, The Netherlands), Martin Kersten (CWI, Amsterdam, The Netherlands), 
					Niels Nes (CWI, Amsterdam, The Netherlands), Stefan Manegold (CWI, Amsterdam, The Netherlands). 
					 
				Prefix based numbering schemes for XML: Techniques, Applications and Performances 
					Virginie Sans (ETIS - CNRS ENSEA Univ Cergy-Pontoise), Dominique Laurent (ETIS - CNRS ENSEA Univ Cergy-Pontoise).  
					 
				A Benchmark for Evaluating Moving Objects Indexes 
					Su Chen (National University of Singapore), Dan Lin (Purdue University), Christian Jensen (Aalborg University, Denmark). 
					 
				Dwarfs in the Rearview Mirror: How Big are they Really? 
					Jens Dittrich (ETH Zurich), Lukas Blunschi (ETH Zurich), Marcos Antonio Vaz Salles (ETH Zurich).  
					 
				 
				Research Session 11 Theory
				Session Chair: Alin Deutsch
				
				Type Inference and Type Checking for Queries on Execution Traces 
					Daniel Deutch (Tel Aviv University), Tova Milo (Tel Aviv University). 
					 
				Taming Verification Hardness: An Efficient Algorithm For Testing Subgraph Isomorphism 
					Haichuan Shang (UNSW), Ying Zhang (UNSW), Xuemin Lin (UNSW), Jeffrey Xu Yu (Chin. U. HK). 
					 
				On Generating Near-Optimal Tableaux for Conditional Functional Dependencies 
					Lukasz Golab (AT&T Labs - Research), Howard Karloff (AT&T Labs - Research), Flip Korn (AT&T Labs - Research), Divesh Srivastava (AT&T Labs - Research), 
					Bei Yu (Singapore-MIT Alliance (SMA), Singapore). 
					 
				Propagating Functional Dependencies with Conditions 
					Wenfei Fan (University of Edinburgh, UK), Shuai Ma (University of Edinburgh, UK), Yanli Hu (University of Edinburgh, UK), Jie Liu (Chinese Academy of Sciences, China), 
					Yinghui Wu (University of Edinburgh, UK). 
					 
				 
				Research Session 12 Web Rank & PubSub 
				Session Chair: Alexandros Labrinidis
				
				Simrank++: Query Rewriting through Link Analysis of the Click Graph 
					Ioannis Antonellis (Stanford University), Hector Garcia-Molina (Stanford University), Chi-Chao Chang (Yahoo!). 
					 
				Accuracy Estimate and Optimization Techniques for SimRank Computation 
					Dmitry Lizorkin (ISP RAS), Pavel Velikhov (ISP RAS), Maxim Grinev (ISP RAS), Denis Turdakov (ISP RAS). 
					 
				End-to-End Support for Joins in Large-Scale Publish/Subscribe Systems 
					Badrish Chandramouli (Duke University), Jun Yang (Duke University). 
					 
				Scalable Ranked Publish/Subscribe 
					Ashwin Machanavajjhala (Cornell University), Erik Vee (Yahoo! Research), Minos Garofalakis (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research). 
					 
				 
				Tutorial Session 4 Probabilistic Data Management 
				Session Chair: Paolo Papotti
				
				Systems Aspects of Probabilistic Data Management
					Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA). 
					 
				 
				Demo Group 4 Data integration, collaboration 
				
				RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration
					Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao. 
					 
				Comparing and Evaluating Mapping Systems with STMark
					Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis. 
					 
				Ad-Hoc Data Processing in the Cloud
					Dionysios Logothetis, Kenneth Yocum. 
					 
				XTreeNet: Democratic Community Search
					Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis. 
				 
				Making SENSE: Socially Enhanced Search and Exploration
					Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum. 
					 
				AuditGuard: A system for database auditing under retention restrictions
					Wentian Lu, Gerome Miklau.  
					 
				 
				14:00 - 15:15 
				Industry Session 13 Massive Data 
				Session Chair: Neoklis Polyzotis
				
				Industry-Scale Duplicate Detection 
					Melanie Weis (Hasso-Plattner-Institut), Felix Naumann (Hasso-Plattner-Institut), Ulrich Jehle (Schufa), Holger Schuster (Schufa), Jens Lufter (Schufa). 
					 
				SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 
					Ronnie Chaiken (Microsoft), Bob Jenkins (Microsoft), Paul Larson (Microsoft Research, USA), Bill Ramsey (Microsoft), Darren Shakib (Microsoft), 
					Simon Weaver (Microsoft), Jingren Zhou (Microsoft Research, USA). 
					 
				PNUTS: Yahoo!'s Hosted Data Serving Platform 
					Brian Cooper (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Adam Silberstein (Yahoo! Research), Phil Bohannon (Yahoo!), 
					Hans-Arno Jacobsen (Yahoo! Research and University of Toronto), Nick Puz (Yahoo! Research), Daniel Weaver (Yahoo! Research), Ramana Yerneni (Yahoo! Research). 
					 
				 
				Research Session 14 XML Databases 
				Session Chair: Yi Chen
				
				Dependable Cardinality Forecasts for XQuery 
					Jens Teubner (IBM T.J. Watson Research Center), Torsten Grust (Technische Universitat Munchen), Sebastian Maneth (NICTA), Sherif Sakr (NICTA). 
					 
				Hash-based Subgraph Query Processing Method for Graph-structured XML Documents 
					Hongzhi Wang (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology), Jizhou Luo (Harbin Institute of Technology), Hong Gao (Harbin Institute of Technology). 
					 
				Generating XML Structure Using Examples and Constraints 
					Sara Cohen (Hebrew University of Jerusalem). 
					 
				 
				Research Session 15 DB Performance & Evaluation 
				Session Chair: Nick Koudas
				
				Read-Optimized Databases, In Depth 
					Allison Holloway (University of Wisconsin), David DeWitt (University of Wisconsin). 
					 
				Flashing Up The Storage Layer 
					Ioannis Koltsidas (University of Edinburgh), Stratis Viglas (University of Edinburgh). 
					 
				Rose: Compressed, Log-Structured Replication 
					Russell Sears (UC Berkeley), Mark Callaghan (Google), Eric Brewer (UC Berkeley). 
					 
				 
				Tutorial Session 5 Dataspaces 
				Session Chair: Xiaofang Zhou
				
				Dataspaces
					Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA). 
					 
				 
				Demo Group 5 Tuning, systems, optimization, etc 
				
				QueryScope: Visualizing Queries for Repeatable Database Tuning
					Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang. 
					 
				When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?
					Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler. 
					 
				H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
					Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jones, Samuel Madden, Michael Stonebraker, Daniel Abadi. 
					 
				Organizing and Indexing Non-Convex Regions
					Eric Perlman, Randal Burns, Michael Kazhdan. 
					 
				Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View
					Eric Paquet, Herna Viktor. 
					 
				C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases
					Fan Guo, Lei Li, Eric Xing, Christos Faloutsos. 
					 
				 
				15:45 - 17:00
				Industry Session 16 Storage & Sorting 
				Session Chair: Brian Cooper
				
				Relational Support for Flexible Schema Scenarios 
					Srini Acharya (Microsoft Corp.), Peter Carlin (Microsoft Corp.), Cesar Galindo-Legaria (Microsoft Corp.), Krzysztof Kozielczyk (Microsoft Corp.), 
					Pawel Terlecki (Microsoft Corp.), Peter Zabback (Microsoft Corp.). 
					 
				Oracle Securefiles System 
					Niloy Mukherjee (Oracle), Bharath Aleti (Oracle), Amit Ganesh (Oracle), Krishna Kunchithapadam (Oracle), Scott Lynn (Oracle), Sujatha Muthulingam (Oracle), 
					Kam Shergill (Oracle), Shaoyu Wang (Oracle), Wei Zhang (Oracle). 
					 
				Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture 
					Jatin Chhugani (Intel Corporation), Skip Macy (Intel Corporation), Akram Baransi (Intel Corporation), Anthony Nguyen (Intel Corporation), Mostafa Hagog (Intel Corporation), 
					Sanjeev Kumar (Intel Corporation), Victor Lee (Intel Corporation), Yen-Kuang Chen (Intel Corporation), Pradeep Dubey (Intel Corporation). 
					 
				 
				Research Session 17 Web Queries 
				Session Chair: Nicolas Bruno
				
				WebTables: Exploring the Power of Tables on the Web 
					Michael Cafarella (University of Washington), Alon Halevy (Google, Inc.), Daisy Zhe Wang (UC Berkeley), Eugene Wu (MIT), Yang Zhang (MIT). 
					 
				Scalable Query Result Caching for Web Applications  
					Charles Garrod (Carnegie Mellon University), Amit Manjhi (Google), Bruce Maggs (Carnegie Mellon University), Todd Mowry (Carnegie Mellon University), 
					Anthony Tomasic (Carnegie Mellon University), Christopher Olston (Yahoo! Research), Anastasia Ailamaki (Carnegie Mellon University). 
					 
				Optimization of Multi-Domain Queries on the Web 
					Daniele Braga (Politecnico di Milano), Stefano Ceri (Politecnico di Milano), Florian Daniel (Politecnico di Milano), Davide Martinenghi (Politecnico di Milano). 
					 
				 
				Research Session 18 Distributed Systems Processing 
				Session Chair: Paul Larson
				
				Fault-tolerant Stream Processing using a Distributed, Replicated File System 
					YongChul Kwon (University of Washington), Magdalena Balazinska (University of Washington), Albert Greenberg (Microsoft Research). 
					 
				LEEWAVE: Level-Wise Distribution of Wavelet Coefficients for Processing kNN Queries over Distributed Streams 
					Mi-Yen Yeh (National Taiwan University), Kun-Lung Wu (IBM T. J. Watson Research Center), Philip Yu (University of Illinois at Chicago), Ming-Syan Chen (National Taiwan University). 
					 
				A Practical Scalable Distributed B-Tree 
					Marcos Aguilera (HP Labs), Wojciech Golab (University of Toronto), Mehul Shah (HP Labs). 
					 
				 
				Tutorial Session 6 Data Cleaning 
				Session Chair: Anastasios Kementsietsidis
				
				A Revival of Integrity Constraints for Data Cleaning
					Wenfei Fan (University of Edinburgh, UK and Bell Labs, USA), Floris Geerts (University of Edinburgh, UK). 
					 
				 
				Demo Group 1 XML 
				
				Xnippet: Generating Query Biased Result Snippet for XML Search
					Yu Huang, Ziyang Liu, Yi Chen. 
					 
				Language-Integrated Querying of XML Data in SQL Server
					James Terwilliger, Sergey Melnik, Philip Bernstein. 
					 
				XTCcmp: XQuery Compilation on XTC
					Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen. 
					 
				Periscope/GQ: A Graph Querying Toolkit
					Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler. 
					 
				SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data
					Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena. 
					 
				Process Spaceship: Process Views Discovery and Exploration
					Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul. 
					 
				 
				Wednesday, 27th August 2008 
				9:00 - 10:15 
				10 Year Best Paper Award Session
				
				A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces 
					Roger Weber, Hans-Jorg Schek, Stephen Blott.  
					 
				 
				10:45 - 12:30 
				Research Session 19 System Centric Optimization 
				Session Chair: Timos Sellis
				
				Main-Memory Scan Sharing For Multi-Core CPUs 
					Lin Qiao (IBM Almaden Research Lab), Vijayshankar Raman (IBM Almaden Research Lab), Frederick Reiss (IBM Almaden Research Lab), 
					Peter Haas (IBM Almaden Research Lab), Guy Lohman (IBM Almaden Research Lab). 
					 
				Row-wise Parallel Predicate Evaluation 
					Ryan Johnson (Carnegie Mellon University), Vijayshankar Raman (IBM Almaden Research Lab), Richard Sidle (IBM Almaden Research Lab), Garret Swart (Oracle). 
					 
				Dynamic Partitioning of the Cache Hierarchy in Shared Data Centers 
					Gokul Soundararajan (University of Toronto), Jin Chen (University of Toronto), Mohamed Sharaf (University of Toronto), Cristiana Amza (University of Toronto). 
					 
				RDF-3X: a RISC-style Engine for RDF 
					Thomas Neumann (Max-Planck-Institut Informatik), Gerhard Weikum (MPI). 
					 
				 
				Research Session 20 IR & Forms 
				Session Chair: Justin Zobel
				
				Multidimensional Content eXploration 
					Alkis Simitsis (IBM Research Almaden, USA), Akanksha Baid (University of Wisconsin-Madison), Yannis Sismanis (IBM Research Almaden, USA), 
					Berthold Reinwald (IBM Research Almaden, USA). 
					 
				Relaxation in Text Search using Taxonomies 
					Marcus Fontoura (Yahoo! Research), Vanja Josifovski (Yahoo! Research), Ravi Kumar (Yahoo! Research), Christopher Olston (Yahoo! Research), Sergei Vassilvitskii (Yahoo! Research), 
					Andrew Tomkins (Yahoo! Research). 
					 
				Learning to Extract Form Labels 
					Hoa Nguyen (University of Utah), Thanh Nguyen (University of Utah), Juliana Freire (University of Utah). 
					 
				Automated Creation of a Forms 
					Magesh Jayapandian (University of Michigan), H V Jagadish (University of Michigan). 
					 
				 
				Research Session 21 New Topics 
				Session Chair: Xiaofang Zhou
				
				Efficient Network-Aware Search in Collaborative Tagging Sites 
					Michael Benedikt (Oxford University), Sihem Amer Yahia (Yahoo Research, USA), Laks Lakshmanan (University of British Columbia), Julia Stoyanovich (Columbia University). 
					 
				Cleaning Uncertain Data with Quality Guarantees 
					Reynold Cheng (Hong Kong Polytechnic University, China), Jinchuan Chen (Hong Kong Polytechnic University, China), Xike Xie (Hong Kong Polytechnic University, China). 
					 
				On the Provenance of Non-Answers to Queries over Extracted Data 
					Jiansheng Huang (Univ. of Wisconsin-Madison), Ting Chen (Univ. of Wisconsin-Madison), AnHai Doan (Univ. of Wisconsin-Madison), Jeffrey Naughton (Univ. of Wisconsin-Madison). 
					 
				Dynamic Active Probing of Helpdesk Databases 
					Shenghuo Zhu (NEC Lab), Tao Li (Florida International University), Zhiyuan Chen (UMBC), Dingding Wang (Florida International University), Yihong Gong (NEC Lab). 
					 
				 
				Tutorial Session 7 XML Structural Summaries 
				Session Chair: Marios Hadjieleftheriou
				
				XML Structural Summaries
					Mirella M. Moro (Univ. Fed. Rio Grande do Sul, Brazil), Zografoula Vagena (Microsoft Research, UK), Vassilis J. Tsotras (University of California Riverside, USA). 
					 
				 
				Demo Group 2 P2P
				
				P3N: Profiling the Potential of a Peer-based Data Management System
					Mihai Lupu, Y. C. Tay. 
					 
				P2P Logging and Timestamping for Reconciliation
					Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia. 
					 
				AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network
					Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer. 
					 
				WebContent: Efficient P2P Warehousing of Web Data
					Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, Mohamed Ouazara, Aditya Somani,
					Nicolas Travers, Gabriel Vasile, Spyros Zoupanos. 
				 
				DObjects: Enabling Distributed Data Services for Metacomputing Platforms
					Pawel Jurczyk, Li Xiong. 
					 
				EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution
					Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis. 
					 
				 
				14:00 - 15:15 
				Industry Session 22 Query Optimization 
				Session Chair: N.N.
				
				Efficiently Approximating Query Optimizer Plan Diagrams 
					Atreyee Dey (Indian Institute of Science), Sourjya Bhaumik (Indian Institute of Science), Harish D (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science). 
					 
				Mining Search Engine Query Logs via Suggestion Sampling (New Date and Time!)
					Maxim Gurevich (Technion), Ziv Bar-Yossef (Google and Technion). 
					 
				Optimizer Plan Change Management: Improved Stability and Performance in Oracle 11g 
					Mohamed Ziauddin (Oracle), Dinesh Das (Oracle), Hong Su (Oracle), Yali Zhu (Oracle), Khaled Yagoub (Oracle). 
					 
				 
				Research Session 23 Schema B 
				Session Chair: Ralf Schenkel
				
				Graceful Database Schema Evolution: the PRISM Workbench 
					Carlo Curino (Politecnico di Milano), Hyun Moon (UCLA), Carlo Zaniolo (UCLA). 
					 
				Analyzing and Revising Data Integration Schemas to Improve Their Matchability 
					Xiaoyong Chai (University of Wisconsin-Madison), Mayssam Sayyadian (University of Wisconsin-Madison), AnHai Doan (University of Wisconsin-Madison), 
					Arnon Rosenthal (The MITRE Corporation), Len Seligman (The MITRE Corporation). 
					 
				Learning to Create Data-Integrating Queries  
					Partha Talukdar (University of Pennsylvania), Marie Jacob (University of Pennsylvania), Mohammad Mehmood (University of Pennsylvania), Koby Crammer (University of Pennsylvania), 
					Zachary Ives (University of Pennsylvania), Fernando Pereira (University of Pennsylvania), Sudipto Guha (University of Pennsylvania). 
					 
				 
				Research Session 24 Uncertain DB B (Rel & AC)
				Session Chair: Lei Chen
				
				Approximate Lineage for Probabilistic Databases 
					Chris Re (University of Washington), Dan Suciu (University of Washington and Microsoft). 
					 
				Exploiting Shared Correlations in Probabilistic Databases 
					Prithviraj Sen (University of Maryland), Amol Deshpande (University of Maryland), Lise Getoor (University of Maryland). 
					 
				Access Control over Uncertain Data 
					Vibhor Rastogi (University of Washington), Dan Suciu (University of Washington and Microsoft), Evan Welbourne (University of Washington). 
					 
				 
				Tutorial Session 8
				Session Chair: N.N.
				
				Ontologies and Databases: Myths and Challenges
					Enrico Franconi (Free University of Bozen-Bolzano, Italy). 
					 
				 
				Demo Group 4 Data integration, collaboration 
				
				RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration
					Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao. 
					 
				Comparing and Evaluating Mapping Systems with STMark
					Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis. 
					 
				Ad-Hoc Data Processing in the Cloud
					Dionysios Logothetis, Kenneth Yocum. 
					 
				XTreeNet: Democratic Community Search
					Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis. 
					 
				Making SENSE: Socially Enhanced Search and Exploration
					Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum. 
					 
				AuditGuard: A system for database auditing under retention restrictions
					Wentian Lu, Gerome Miklau. 
					 
				 
				15:45 - 17:00
				Industry Session 25 Query Processing 
				Session Chair: Jayant Haritsa
				
				Towards a Physical XML independent XQuery/SQL/XML Engine 
					Zhen Hua Liu (Oracle), Thomas Baby (Oracle), Sivasankaran Chandrasekar (Oracle), Hui Chang (Oracle). 
					 
				Closing The Query Processing Loop in Oracle 11g 
					Mohamed Zait (Oracle), Allison Lee (Oracle). 
					 
				Towards a Streaming SQL Standard 
					Stan Zdonik (Streambase, Inc.), Namit Jain (Oracle), Shailendra Mishra (Oracle), Anand Srinivasan (Oracle), Johannes Gehrke (Cornell University, USA), 
					Jennifer Widom (Stanford University), Hari Balakrishnan (Streambase, Inc.), Mitch Cherniack (Streambase, Inc.), Ugur Cetintemel (Streambase, Inc.), 
					Richard Tibbetts (Streambase, Inc.). 
					 
				 
				Research Session 26 Privacy Preservation 
				Session Chair: Elisa Bertino
				
				Anonymizing Bipartite Graph Data using Safe Groupings 
					Graham Cormode (AT&T Labs, USA), Divesh Srivastava (AT&T Labs, USA), Ting Yu (North Carolina State University), Qing Zhang (North Carolina State University). 
					 
				Privacy Preserving Serial Data Publishing By Role Composition 
					Yingyi Bu (The Chinese University of HK), Ada WaiChee Fu (The Chinese University of Hong Kong), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), 
					Lei Chen (The Hong Kong University of Science and Technology), Jiuyong Li (University of South Australia). 
					 
				Output Perturbation with Query Relaxation 
					Xiaokui Xiao (The Chinese University of Hong Kong), Yufei Tao (The Chinese University of Hong Kong). 
					 
				 
				Research Session 27 Temporal Indexing & Searching 
				Session Chair: Cyrus Shahabi
				
				Transaction Time Indexing with Version Compression 
					David Lomet (Microsoft Research, USA), Mingsheng Hong (Cornell University), Rimma Nehme (Purdue University), Rui Zhang (University of Melbourne). 
					 
				Managing and Querying Transaction-time Databases under Schema Evolution 
					Hyun Moon (UCLA), Carlo Curino (Politecnico di Milano), Alin Deutsch (UCSD), Chien-Yi Hou (UCSD), Carlo Zaniolo (UCLA). 
					 
				On Efficiently Searching Trajectories and Archival Data for Historical Similarities 
					Reza Sherkat (IBM Toronto Lab.), Davood Rafiei (University of Alberta). 
					 
				 
				Tutorial Session 9 Probabilistic Data Management 
				Session Chair: Paolo Papotti
				
				Systems Aspects of Probabilistic Data Management
					Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA). 
					 
				 
				Thursday, 28th August 2008 
				9:00 - 10:15 
				Research Session 28 Text & Keyword Query Processing 
				Session Chair: Jayant Madhavan
				
				Keyword Query Cleaning 
					Ken Pu (UOIT), Xiaohui Yu (York University). 
					 
				Reasoning and Identifying Relevant Matches for XML Keyword Search 
					Ziyang Liu (Arizona State University, USA), Yi Chen (Arizona State University, USA). 
					 
				Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints 
					Chuan Xiao (University of New South Wales), Wei Wang (University of New South Wales), Xuemin Lin (University of New South Wales). 
					 
				Scalable Ad-hoc Entity Extraction from Text Collections 
					Sanjay Agrawal (Microsoft Research), Kaushik Chakrabarti (Microsoft Research), Surajit Chaudhuri (Microsoft Research), Venkatesh Ganti (Microsoft Research). 
					 
				 
				Research Session 29 Systems B 
				Session Chair: Jingren Zhou
				
				Scheduling Shared Scans of Large Data Files 
					Parag Agrawal (Stanford University), Daniel Kifer (Yahoo! Research), Christopher Olston (Yahoo! Research). 
					 
				Online Maintenance of Very Large Random Samples on Flash Storage 
					Suman Nath (Microsoft Research), Phillip Gibbons (Intel Research). 
					 
				A Skip-list Approach for Efficiently Processing Forecasting Queries 
					Tingjian Ge (Brown University), Stan Zdonik (Brown University). 
					 
				A Request-Routing Framework for SOA-Based Enterprise Computing 
					Thomas Phan (IBM Almaden), Wen-Syan Li (SAP Research Center - China). 
					 
				 
				Research Session 30 Indexing & Query Processing 
				Session Chair: Chen Li
				
				Hexastore: Sextuple Indexing for Semantic Web Data Management 
					Cathrin Weiss (University of Zurich), Panagiotis Karras (National University of Singapore), Abraham Bernstein (University of Zurich). 
					 
				Indexing Land Surface for Efficient kNN Query 
					Cyrus Shahabi (Univ. of Southern California), Lu-An Tang (Univ. of Southern California), Songhua Xing (Univ. of Southern California). 
					 
				Efficient Skyline Querying with Variable User Preferences on Nominal Attributes 
					Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Ada WaiChee Fu (The Chinese University of HK), 
					Jian Pei (Simon Fraser University), Yip Sing Ho (The Chinese University of HK), Tai Wong (The Chinese University of HK), Yubao Liu (Sun Yat-Sen University, China). 
					 
				Efficient Top-K Processing over Query-Dependent Functions 
					Lin Guo (Yahoo! Research), Sihem Amer Yahia (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research), 
					Utkarsh Srivastava (Yahoo! Research), Erik Vee (Yahoo! Research). 
					 
				 
				Tutorial Session 10 Continuous Queries 
				
				Scheduling Continuous Queries in Data Stream Management Systems
					Mohamed A. Sharaf (University of Toronto, Canada), Alexandros Labrinidis (University of Pittsburgh, USA), Panos K. Chrysanthis (University of Pittsburgh, USA). 
					 
				 
				Demo Group 3 Web, Textual data 
				
				AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications
					Cristian Duda, Gianni Frey, Donald Kossmann, Chong Zhou. 
					 
				ManyAspects: A System for Highlighting Diverse Concepts in Documents
					Kun Liu, Evimaria Terzi, Tyrone Grandison. 
					 
				Large-Scale Collaborative Analysis and Extraction of Web Data
					Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke. 
					 
				An Effective and Versatile Keyword Search Engine on Heterogenous Data 
					Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou. 
				 
				DBPubs: Multidimensional Exploration of Database Publications
					Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham. 
					 
				Semandaq: A Data Quality System Based on Conditional Functional Dependencies
					Wenfei Fan, Floris Geerts, Xibei Jia. 
					 
				 
				11:00 - 13:00 
				Research Session 31 Spatial and Motion Data 
				Session Chair: Xuemin Lin
				
				FINCH: Evaluating Reverse k-Nearest-Neighbor Queries on Location Data 
					Wei Wu (National University of Singapore), Fei Yang (National University of Singapore), Chee-Yong Chan (National University of Singapore), 
					Kian-Lee Tan (National University of Singapore). 
					 
				Discovery of Convoys in Trajectory Databases 
					Hoyoung Jeung (The University of Queensland), Man Lung Yiu (Aalborg University), Xiaofang Zhou (The University of Queensland), Christian Jensen (Aalborg University), 
					Heng Tao Shen (The University of Queensland). 
					 
				TraClass: Trajectory Classification Using Hierarchical Region-Based and Trajectory-Based Clustering 
					Jae-Gil Lee (UIUC), Jiawei Han (UIUC), Xiaolei Li (UIUC), Hector Gonzalez (UIUC). 
					 
				The V*-Diagram: a Query-Dependent Method for Moving kNN Queries 
					Sarana Nutanong (The University of Melbourne), Rui Zhang (The University of Melbourne), Egemen Tanin (The University of Melbourne), Lars Kulik (The University of Melbourne). 
					 
				 
				Research Session 32 Query Processing 
				Session Chair: Divesh Srivastava
				
				Rewriting Procedures for Batched Bindings 
					Ravindra Guravannavar (IIT Bombay), S. Sudarshan (IIT Bombay). 
					 
				Identifying Robust Plans through Plan Diagram Reduction 
					Harish D (Indian Institute of Science), Pooja Darera (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science). 
					 
				A Pay-As-You-Go Framework for Query Execution Feedback 
					Surajit Chaudhuri (Microsoft Research), Vivek Narasayya (Microsoft Research), Ravishankar Ramamurthy (Microsoft Research). 
					 
				Evita Raced: Metacompilation for Declarative Networks 
					Tyson Condie (UC Berkeley), Joseph Hellerstein (UC Berkeley), Petros Maniatis (UC Berkeley), David Chu (UC Berkeley).  
					 
				 
				Research Session 33 Mining B & External Memory 
				Session Chair: Shinichi Morishita
				
				Discovering Data Quality Rules 
					Fei Chiang (University of Toronto), Renee Miller (University of Toronto). 
					 
				Mining Non-Redundant High Order Correlations in Binary Data 
					Xiang Zhang (University of North Carolina), Feng Pan (University of North Carolina), Wei Wang (University of North Carolina), Andrew Nobel (University of North Carolina). 
					 
				Keyword Search on External Memory Data Graphs 
					Bhavana Dalvi (IIT Bombay, India), Meghana Kshirsagar (IIT Bombay, India), S. Sudarshan (IIT Bombay, India). 
					 
				Sorting Hierarchical Data in External Memory for Archiving  
					Ioannis Koltsidas (University of Edinburgh), Heiko Mueller (University of Edinburgh), Stratis Viglas (University of Edinburgh). 
					 
				 
				Tutorial Session 11 Clusters in High Dimensions 
				Session Chair: Stratis Viglas
				
				Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering
				Hans-Peter Kriegel (Ludwig-Maximilians-Universitat Munchen, Germany), Peer Kroger (Ludwig-Maximilians-Universitat Munchen, Germany), 
				Arthur Zimek (Ludwig-Maximilians-Universitat Munchen, Germany).  
					 
				 
				Demo Group 5 Tuning, systems, optimization, etc  
				
				QueryScope: Visualizing Queries for Repeatable Database Tuning
					Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang. 
					 
				When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?
					Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler. 
					 
				H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
					Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jones, Samuel Madden, Michael Stonebraker, Daniel Abadi. 
					 
				Organizing and Indexing Non-Convex Regions
					Eric Perlman, Randal Burns, Michael Kazhdan. 
				 
				Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View
					Eric Paquet, Herna Viktor. 
					 
				C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases
					Fan Guo, Lei Li, Eric Xing, Christos Faloutsos. 
					 
				 