Volume 5, 2011-2012

Editors-in-Chief:
Z. Meral Ozsoyoglu
Founding Editor-in-Chief:
H. V. Jagadish
Advisory Committee:
Philip Bernstein, Michael Boehlen, Peter Buneman, Susan Davidson, S. Sudarshan, Gerhard Weikum
Information Director:
Gerald Weber
Associate Editors:
Gustavo Alonso, Ugur Cetintemel, Nilesh Dalvi, Juliana Freire, Hank Korth, Ahmet Sacan, Nesime Tatbul, Anthony Tung
Review Board:

Volume 5, No. 1

: Front Matter i - i

1 - 12

Explanation-Based Auditing

Daniel Fabbri and Kristen LeFevre

13 - 24

Human-powered Sorts and Joins

Adam Marcus, Eugene Wu, David Karger, Samuel Madden, and Robert Miller

25 - 36

Verifying Computations with Streaming Interactive Proofs

Graham Cormode, Justin Thaler, and Ke Yi

37 - 48

A Moving-Object Index for Efficient Query Processing with Peer-Wise Location Privacy

Dan Lin, Christian S. Jensen, Rui Zhang, Lu Xiao, and Jiaheng Lu

49 - 60

ERA: Efficient Serial and Parallel Suffix Tree Construction for Very Long Strings

Essam Mansour, Amin Allam, Spiros Skiadopoulos, and Panos Kalnis

61 - 72

Fast Updates on Read-Optimized Databases Using Multi-Core CPUs

Jens Krueger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, and Alexander Zeier

73 - 84

A Data-Based Approach to Social Influence Maximization

Amit Goyal, Francesco Bonchi, and Laks V. S. Lakshmanan

Volume 5, No. 2

: Front Matter i - i

85 - 96

On Predictive Modeling for Optimizing Transaction Execution in Parallel OLTP Systems

Andrew Pavlo, Evan P.C. Jones, Stanley Zdonik

97 - 108

View Selection in Semantic Web Databases

François Goasdoué, Konstantinos Karanasos, Julien Leblay, Ioana Manolescu

109 - 120

Building Wavelet Histograms on Large Data in MapReduce

Jeffrey Jestes, Ke Yi, Feifei Li

121 - 132

Summarization and Matching of Density-Based Clusters in Streaming Environments

Di Yang, Elke A. Rundensteiner, Matthew O. Ward

133 - 144

Multilingual Schema Matching for Wikipedia Infoboxes

Thanh Nguyen, Viviane Moreira, Huong Nguyen, Hoa Nguyen, Juliana Freire

145 - 156

Controlling False Positives in Association Rule Mining

Guimei Liu, Haojun Zhang, Limsoon Wong

Volume 5, No. 3

vii - vii

Front Matter and Letter from the Associate Editor

Uğur Çetintemel

157 - 168

PARIS: Probabilistic Alignment of Relations, Instances, and Schema

Fabian M. Suchanek, Serge Abiteboul, Pierre Senellart

169 - 180

Answering Top-k Queries Over a Mixture of Attractive and Repulsive Dimensions

Sayan Ranu, Ambuj K. Singh

181 - 192

PIQL: Success-Tolerant Query Processing in the Cloud

Michael Armbrust, Kristal Curtis, Tim Kraska, Armando Fox, Michael J. Franklin, David A. Patterson

193 - 204

gSketch: On Query Estimation in Graph Streams

Peixiang Zhao, Charu C. Aggarwal, Min Wang

205 - 216

Indexing the Earth Mover's Distance Using Normal Distributions

Brian E. Ruttenberg, Ambuj K. Singh

217 - 228

Generating Exact- and Ranked Partially-Matched Answers to Questions in Advertisements

Rani Qumsiyeh, Maria S. Pera, Yiu-Kai Ng

229 - 240

Size-l Object Summaries for Relational Keyword Search

Georgios J. Fakas, Zhi Cai, Nikos Mamoulis

241 - 252

REX: Explaining Relationships between Entity Pairs

Lujun Fang, Anish Das Sarma, Cong Yu, Philip Bohannon

253 - 264

PASS-JOIN: A Partition-based Method for Similarity Joins

Guoliang Li, Dong Deng, Jiannan Wang, Jianhua Feng

265 - 273

Relative Lempel-Ziv Factorization for Efficient Storage and Retrieval of Web Collections

Christopher Hoobin, Simon J. Puglisi, Justin Zobel

Volume 5, No. 4

vii - vii

Front Matter and Letter from the Associate Editor

Nilesh Dalvi

274 - 285

Towards Cost-Effective Storage Provisioning for DBMSs

Ning Zhang, Junichi Tatemura, Jignesh M. Patel, Hakan Hacıgümüş

286 - 297

B+-tree Index Optimization by Exploiting Internal Parallelism of Flash-based Solid State Drives

Hongchan Roh, Sanghyun Park, Sungho Kim, Mincheol Shin, Sang-Won Lee

298 - 309

High-Performance Concurrency Control Mechanisms for Main-Memory Databases

Per-√Öke Larson, Spyros Blanas, Cristian Diaconu, Craig Freedman, Jignesh M. Patel, Mike Zwilling

310 - 321

Capturing Topology in Graph Pattern Matching

Shuai Ma, Yang Cao, Wenfei Fan, Jinpeng Huai, Tianyu Wo

322 - 333

Probabilistic Management of OCR Data using an RDBMS

Arun Kumar, Christopher Ré

334 - 345

RTED: A Robust Algorithm for the Tree Edit Distance

Mateusz Pawlik, Nikolaus Augsten

346 - 357

Putting Lipstick on Pig: Enabling Database-style Workflow Provenance

Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, Val Tannen

358 - 369

Relational Approach for Shortest Path Discovery over Large Graphs

Jun Gao, Ruoming Jin, Jiashuai Zhou, Jeffrey Xu Yu, Xiao Jiang, Tengjiao Wang

370 - 381

Mining Flipping Correlations from Large Datasets with Taxonomies

Marina Barsky, Sangkyum Kim, Tim Weninger, Jiawei Han

382 - 393

A Statistical Approach Towards Robust Progress Estimation

Arnd Christian König, Bolin Ding, Surajit Chaudhuri, Vivek Narasayya

Volume 5, No. 5

vii - viii

Front Matter and Letter from the Associate Editor

Hank Korth

394 - 405

Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes

Yizhou Sun, Charu C. Aggarwal, Jiawei Han

406 - 417

Shortest Path and Distance Queries on Road Networks: An Experimental Evaluation

Lingkun Wu, Xiaokui Xiao, Dingxiong Deng, Gao Cong, Andy Diwen Zhu, Shuigeng Zhou

418 - 429

The Filter-Placement Problem and its Application to Minimizing Information Multiplicity

Dóra Erdös, Vatche Ishakian, Andrei Lapets, Evimaria Terzi, Azer Bestavros

430 - 441

Bayesian Locality Sensitive Hashing for Fast Similarity Search

Venu Satuluri, Srinivasan Parthasarathy

442 - 453

Fast and Exact Top-k Search for Random Walk with Restart

Yasuhiro Fujiwara, Makoto Nakatsuji, Makoto Onizuka, Masaru Kitsuregawa

454 - 465

Densest Subgraph in Streaming and MapReduce

Bahman Bahmani, Ravi Kumar, Sergei Vassilvitskii

466 - 477

Mining Attribute-structure Correlated Patterns in Large Attributed Graphs

Arlei Silva, Wagner Meira Jr., Mohammed J. Zaki

478 - 489

Semi-Automatic Index Tuning: Keeping DBAs in the Loop

Karl Schnaitter, Neoklis Polyzotis

490 - 501

Aggregation in Probabilistic Databases via Knowledge Compilation

Robert Fink, Larisa Han, Dan Olteanu

Volume 5, No. 6

i - vii

Front Matter and Letter from the Associate Editor

Anthony Tung

502 - 513

Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores

Felix Halim, Stratos Idreos, Panagiotis Karras, Roland H. C. Yap

514 - 525

An Adaptive Mechanism for Accurate Query Answering under Differential Privacy

Chao Li, Gerome Miklau

526 - 537

SharedDB: Killing One Thousand Queries With One Stone

Georgios Giannikis, Gustavo Alonso, Donald Kossmann

538 - 549

Pushing the Boundaries of Crowd-enabled Databases with Query-driven Schema Expansion

Joachim Selke, Christoph Lofi, Wolf-Tilo Balke

550 - 561

A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration

Bo Zhao, Benjamin I. P. Rubinstein, Jim Gemmell, Jiawei Han

562 - 573

How to Price Shared Optimizations in the Cloud

Prasang Upadhyaya, Magdalena Balazinska, Dan Suciu

574 - 585

Dense Subgraph Maintenance under Streaming Edge Weight Updates for Real-time Story Identification

Albert Angel, Nick Koudas, Nikos Sarkas, Divesh Srivastava

586 - 597

ReStore: Reusing Results of MapReduce Jobs

Iman Elghandour, Ashraf Aboulnaga

Volume 5, No. 7

i - vii

Front Matter and Letter from the Associate Editors

Gustavo Alonso, Juliana Freire

598 - 609

PerfXplain: Debugging MapReduce Job Performance

Nodira Khoussainova, Magdalena Balazinska, Dan Suciu

610 - 621

Uncertain Centroid based Partitional Clustering of Uncertain Data

Francesco Gullo, Andrea Tagarelli

622 - 633

Scalable K-Means++

Bahman Bahmani, Benjamin Moseley, Andrea Vattani, Ravi Kumar, Sergei Vassilvitskii

634 - 645

Querying Schemas With Access Restrictions

Michael Benedikt, Pierre Bourhis, Clemens Ley

646 - 655

Definition, Detection, and Recovery of Single-Page Failures, a Fourth Class of Database Failures

Goetz Graefe, Harumi Kuno

656 - 667

Concurrency Control for Adaptive Indexing

Goetz Graefe, Felix Halim, Stratos Idreos, Harumi Kuno, Stefan Manegold

668 - 679

Comments on "Stack-based Algorithms for Pattern Matching on DAGs"

Qiang Zeng, Zhuge Hai

680 - 691

An Analysis of Structured Data on the Web

Nilesh Dalvi, Ashwin Machanavajjhala, Bo Pang

Volume 5, No. 8

i - vii

Front Matter and Letter from the Associate Editors

Uğur Çetintemel, Nilesh Dalvi

692 - 703

Shortest Path Computation with No Information Leakage

Kyriakos Mouratidis, Man Lung Yiu

704 - 715

V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors

Ahmed Metwally, Christos Faloutsos

716 - 727

Distributed GraphLab: A Framework for Machine Learning in the Cloud

Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, Joseph M. Hellerstein

728 - 739

Adding Logical Operators to Tree Pattern Queries on Graph-Structured Data

Qiang Zeng, Xiaorui Jiang, Hai Zhuge

740 - 751

Learning Semantic String Transformations from Examples

Rishabh Singh, Sumit Gulwani

752 - 763

Cologne: A Declarative Distributed Constraint Optimization Platform

Changbin Liu, Lu Ren, Boon Thau Loo, Yun Mao, Prithwish Basu

764 - 775

Optimizing I/O for Big Array Analytics

Yi Zhang, Jun Yang

776 - 787

Probabilistically Bounded Staleness for Practical Partial Quorums

Peter Bailis, Shivaram Venkataraman, Michael J. Franklin, Joseph M. Hellerstein, Ion Stoica

Volume 5, No. 9

i - vii

Front Matter and Letter from the Associate Editors

Hank Korth, Anthony Tung

788 - 799

Efficient Subgraph Matching on Billion Node Graphs

Zhao Sun, Hongzhi Wang, Haixun Wang, Bin Shao, Jianzhong Li

800 - 811

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases

Ye Yuan, Guoren Wang, Lei Chen, Haixun Wang

812 - 823

Truss Decomposition in Massive Networks

Jia Wang, James Cheng

824 - 835

SEAL: Spatio-Textual Similarity Search

Ju Fan, Guoliang Li, Lizhu Zhou, Shanshan Chen, Jun Hu

836 - 847

On The Spatiotemporal Burstiness of Terms

Theodoros Lappas, Marcos R. Vieira, Dimitrios Gunopulos, Vassilis J. Tsotras

848 - 859

Efficient Reachability Query Evaluation in Large Spatiotemporal Contact Datasets

Houtan Shirani-Mehr, Farnoush Banaei Kashani, Cyrus Shahabi

860 - 871

Boosting Moving Object Indexing through Velocity Partitioning

Thi Nguyen, Zhen He, Rui Zhang, Phillip Ward

872 - 883

Type-Based Detection of XML Query-Update Independence

Nicole Bidoit-Tollu, Dario Colazzo, Federico Ulliana

884 - 895

Minuet: A Scalable Distributed Multiversion B-Tree

Benjamin Sowell, Wojciech Golab, Mehul A. Shah

896 - 907

Challenging the Long Tail Recommendation

Hongzhi Yin, Bin Cui, Jing Li, Junjie Yao, Chen Chen

Volume 5, No. 10

i - viii

Front Matter and Letter from the Associate Editors

Ahmet Sacan, Nesime Tatbul

908 - 919

Answering Table Queries on the Web using Column Keywords

Rakesh Pimplikar, Sunita Sarawagi

920 - 931

Efficient Verification of Web-Content Searching Through Authenticated Web Crawlers

Michael T. Goodrich, Duy Nguyen, Olga Ohrimenko, Charalampos Papamanthou, Roberto Tamassia, Nikos Triandopoulos, Cristina Videira Lopes

932 - 943

SODA: Generating SQL for Business Users

Lukas Blunschi, Claudio Jossen, Donald Kossmann, Magdalini Mori, Kurt Stockinger

944 - 955

Privacy Preservation by Disassociation

Manolis Terrovitis, John Liagouris, Nikos Mamoulis, Spiros Skiadopoulos

956 - 967

Supercharging Recommender Systems using Taxonomies for Learning User Purchase Behavior

Bhargav Kanagal, Amr Ahmed, Sandeep Pandey, Vanja Josifovski, Jeff Yuan, Lluis Garcia-Pueyo

968 - 979

DBToaster: Higher-order Delta Processing for Dynamic, Frequently Fresh Views

Yanif Ahmad, Oliver Kennedy, Christoph Koch, Milos Nikolic

980 - 991

Real Time Discovery of Dense Clusters in Highly Dynamic Graphs: Identifying Real World Events in Highly Dynamic Environments

Manoj K Agarwal, Krithi Ramamritham, Manish Bhide

992 - 1003

Sketch-based Querying of Distributed Sliding-Window Data Streams

Odysseas Papapetrou, Minos Garofalakis, Antonios Deligiannakis

1004 - 1015

LogBase: A Scalable Log-structured Database System in the Cloud

Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, Beng Chin Ooi

1016 - 1027

Efficient Processing of k Nearest Neighbor Joins using MapReduce

Wei Lu, Yanyan Shen, Su Chen, Beng Chin Ooi

1028 - 1039

Early Accurate Results for Advanced Analytics on MapReduce

Nikolay Laptev, Kai Zeng, Carlo Zaniolo

1040 - 1051

CDAS: A Crowdsourcing Data Analytics System

Xuan Liu, Meiyu Lu, Beng Chin Ooi, Yanyan Shen, Sai Wu, Meihui Zhang

1052 - 1063

Mining Statistically Significant Substrings using the Chi-Square Statistic

Mayank Sachan, Arnab Bhattacharya

1064 - 1075

Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems

Martina-Cezara Albutiu, Alfons Kemper, Thomas Neumann

1076 - 1087

hStorage-DB: Heterogeneity-aware Data Management to Exploit the Full Capability of Hybrid Storage Systems

Tian Luo, Rubao Lee, Michael Mesnier, Feng Chen, Xiaodong Zhang

Volume 5, No. 11

: Front Matter i - i

x - x

Letter from the Editor-in-Chief

Z. Meral Özsoyoğlu

1088 - 1099

A Scalable Algorithm for Maximizing Range Sum in Spatial Databases

Dong-Wan Choi, Chin-Wan Chung, Yufei Tao

1100 - 1111

Spatial Queries with Two kNN Predicates

Ahmed M. Aly, Walid G. Aref, Mourad Ouzzani

1112 - 1123

Optimal Algorithms for Crawling a Hidden Database in the Web

Cheng Sheng, Nan Zhang, Yufei Tao, Xin Jin

1124 - 1135

Diversifying Top-K Results

Lu Qin, Jeffrey Xu Yu, Lijun Chang

1136 - 1147

Keyword-aware Optimal Route Search

Xin Cao, Lisi Chen, Gao Cong, Xiaokui Xiao

1148 - 1159

Answering Queries using Views over Probabilistic XML: Complexity and Tractability

Bogdan Cautis, Evgeny Kharlamov

1160 - 1171

Probabilistic Databases with MarkoViews

Abhay Jha, Dan Suciu

1172 - 1183

The Complexity of Social Coordination

Konstantinos Mamouras, Sigal Oren, Lior Seeman, Lucja Kot, Johannes Gehrke

1184 - 1195

Efficient Multi-way Theta-Join Processing Using MapReduce

Xiaofei Zhang, Lei Chen, Min Wang

1196 - 1207

Stubby: A Transformation-based Optimizer for MapReduce Workflows

Harold Lim, Herodotos Herodotou, Shivnath Babu

1208 - 1219

Labeling Workflow Views with Fine-Grained Dependencies

Zhuowei Bao, Susan B. Davidson, Tova Milo

1220 - 1231

Fundamentals of Order Dependencies

Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz

1232 - 1243

FDB: A Query Engine for Factorised Relational Databases

Nurzhan Bakibayev, Dan Olteanu, Jakub Z√°vodn√Ω

1244 - 1255

Optimization of Analytic Window Functions

Yu Cao, Chee-Yong Chan, Jie Li, Kian-Lee Tan

1256 - 1267

Opening the Black Boxes in Data Flow Optimization

Fabian Hueske, Mathias Peters, Matthias Sax, Astrid Rheinländer, Rico Bergmann, Aljoscha Krettek, Kostas Tzoumas

1268 - 1279

Spinning Fast Iterative Data Flows

Stephan Ewen, Kostas Tzoumas, Moritz Kaufmann, Volker Markl

1280 - 1291

REX: Recursive, Delta-Based Data-Centric Computation

Svilen R. Mihaylov, Zachary G. Ives, Sudipto Guha

1292 - 1303

K-Reach: Who is in Your Small World

James Cheng, Zechao Shang, Hong Cheng, Haixun Wang, Jeffrey Xu Yu

1304 - 1315

Performance Guarantees for Distributed Reachability Queries

Wenfei Fan, Xin Wang, Yinghui Wu

1316 - 1327

Efficient Indexing and Querying over Syntactically Annotated Trees

Pirooz Chubak, Davood Rafiei

1328 - 1339

Queries with Guarded Negation

Vince Barany, Balder ten Cate, Martin Otto

1340 - 1351

PrivBasis: Frequent Itemset Mining with Differential Privacy

Ninghui Li, Wahbeh Qardaji, Dong Su, Jianneng Cao

1352 - 1363

Low-Rank Mechanism: Optimizing Batch Queries under Differential Privacy

Ganzhao Yuan, Zhenjie Zhang, Marianne Winslett, Xiaokui Xiao, Yin Yang, Zhifeng Hao

1364 - 1375

Functional Mechanism: Regression Analysis under Differential Privacy

Jun Zhang, Zhenjie Zhang, Xiaokui Xiao, Yin Yang, Marianne Winslett

1376 - 1387

Injecting Uncertainty in Graphs for Identity Obfuscation

Paolo Boldi, Francesco Bonchi, Aris Gionis, Tamir Tassa

1388 - 1399

Publishing Microdata with a Robust Privacy Guarantee

Jianneng Cao, Panagiotis Karras

1400 - 1411

Measuring Two-Event Structural Correlations on Graphs

Ziyu Guan, Xifeng Yan, Lance M. Kaplan

1412 - 1423

Ranking Large Temporal Data

Jeffrey Jestes, Jeff M. Phillips, Feifei Li, Mingwang Tang

1424 - 1435

Compacting Transactional Data in Hybrid OLTP & OLAP Databases

Florian Funke, Alfons Kemper, Thomas Neumann

1436 - 1446

Processing a Trillion Cells per Mouse Click

Alexander Hall, Olaf Bachmann, Robert Büssow, Silviu Gănceanu, Marc Nunkesser

1447 - 1458

OLTP on Hardware Islands

Danica Porobic, Ippokratis Pandis, Miguel Branco, Pınar Tözün, Anastasia Ailamaki

1459 - 1470

Serializability, not Serial: Concurrency Control and Availability in Multi-Datacenter Datastores

Stacy Patterson, Aaron J. Elmore, Faisal Nawab, Divyakant Agrawal, Amr El Abbadi

1471 - 1482

Automatic Partitioning of Database Applications

Alvin Cheung, Owen Arden, Samuel Madden, Andrew C. Myers

1483 - 1494

CrowdER: Crowdsourcing Entity Resolution

Jiannan Wang, Tim Kraska, Michael J. Franklin, Jianhua Feng

1495 - 1506

Whom to Ask? Jury Selection for Decision Making Tasks on Micro-blog Services

Caleb Chen CAO, Jieying She, Yongxin Tong, Lei Chen

1507 - 1518

ALAE: Accelerating Local Alignment with Affine Gap Exactly in Biosequence Databases

Xiaochun Yang, Honglei Liu, Bin Wang

1519 - 1530

sDTW: Computing DTW Distances using Locally Relevant Constraints based on Salient Feature Alignments

K. Selçuk Candan, Rosaria Rossini, Maria Luisa Sapino, Xiaolan Wang

1531 - 1542

SCOUT: Prefetching for Latent Feature Following Queries

Farhan Tauheed, Thomas Heinis, Felix Shürmann, Henry Markram, Anastasia Ailamaki

1543 - 1554

Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems

Kaibo Wang, Yin Huai, Rubao Lee, Fusheng Wang, Xiaodong Zhang, Joel H. Saltz

1555 - 1566

Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques

Jiexing Li, Arnd Christian König, Vivek Narasayya, Surajit Chaudhuri

1567 - 1578

Who Tags What? An Analysis Framework

Mahashweta Das, Saravanan Thirumuruganathan, Sihem Amer-Yahia, Gautam Das, Cong Yu

1579 - 1590

A Generic Framework for Efficient and Effective Subsequence Retrieval

Haohan Zhu, George Kollios, Vassilis Athitsos

1591 - 1602

Only Aggressive Elephants are Fast Elephants

Jens Dittrich, Jorge-Arnulfo Quiané-Ruiz, Stefan Richter, Stefan Schuh, Alekh Jindal, Jörg Schad

1603 - 1614

Multiple Location Profiling for Users and Relationships from Social Network and Content

Rui Li, Shengjie Wang, Kevin Chen-Chuan Chang

1615 - 1626

Flash-based Extended Cache for Higher Throughput and Faster Recovery

Woon-Hak Kang, Sang-Won Lee, Bongki Moon

1627 - 1637

Don't Thrash: How to Cache Your Hash on Flash

Michael A. Bender, Martin Farach-Colton, Rob Johnson, Russell Kraner, Bradley C. Kuszmaul, Dzejla Medjedovic, Pablo Montes, Pradeep Shetty, Richard P. Spillane, Erez Zadok

1638 - 1649

Learning Expressive Linkage Rules using Genetic Programming

Robert Isele, Christian Bizer

1650 - 1661

Mining Frequent Itemsets over Uncertain Databases

Yongxin Tong, Lei Chen, Yurong Cheng, Philip S. Yu

1662 - 1673

Uncertain Time-Series Similarity: Return to the Basics

Michele Dallachiesa, Besmira Nushi, Katsiaryna Mirylenka, Themis Palpanas

1674 - 1683

Statistical Distortion: Consequences of Data Cleaning

Tamraparni Dasu, Ji Meng Loh

1684 - 1695

Towards Energy-Efficient Database Cluster Design

Willis Lang, Stavros Harizopoulos, Jignesh M. Patel, Mehul A. Shah, Dimitris Tsirogiannis

Volume 5, No. 12

: Front Matter i - i

xvii - xviii

Welcome Message from the VLDB 2012 General Chairs

Adnan Yazıcı, Ling Liu

xix - xx

Message from the VLDB 2012 General Program Chair

Z. Meral Özsoyoğlu

1696 - 1696

Data Management on the Spatial Web

Christian S. Jensen

1697 - 1697

Data Analytics Opportunities in a Smarter Planet

Brenda Dietrich

1698 - 1698

Challenges in Economic Massive Content Storage and Management (MCSAM) in the Era of Self-Organizing, Self-Expanding and Self-Linking Data Clusters

Kenan ≈ûahin

1699 - 1699

Approximate Frequency Counts over Data Streams

Gurmeet Singh Manku, Rajeev Motwani

1700 - 1711

The MADlib Analytics Library or MAD Skills, the SQL

Joe Hellerstein, Christopher Ré, Florian Schoppmann, Daisy Zhe Wang, Eugene Fratkin, Aleksander Gorajek, Kee Siong Ng, Caleb Welton, Xixuan Feng, Kun Li, Arun Kumar

1712 - 1723

Can the Elephants Handle the NoSQL Onslaught?

Avrilia Floratou, Nikhil Teletia, David J. DeWitt, Jignesh M. Patel, Donghui Zhang

1724 - 1735

Solving Big Data Challenges for Enterprise Application Performance Management

Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen, Sergio Gómez-Villamor, Victor Muntés-Mulero, Serge Mankowskii

1736 - 1747

M3R: Increased performance for in-memory Hadoop jobs

Avraham Shinnar, David Cunningham, Benjamin Herta, Vijay Saraswat

1748 - 1758

A Storage Advisor for Hybrid-Store Databases

Philipp Rösch, Lars Dannecker, Gregor Hackenbroich, Franz Faerber

1759 - 1770

From Cooperative Scans to Predictive Buffer Management

Michał Świtakowski, Peter Boncz, Marcin Żukowski

1771 - 1780

The Unified Logging Infrastructure for Data Analytics at Twitter

George Lee, Jimmy Lin, Chuang Liu, Andrew Lorek, Dmitriy Ryaboy

1781 - 1789

Transaction Log Based Application Error Recovery and Point In-Time Query

Tomas Talius, Robin Dhamankar, Andrei Dumitrache, Hanuma Kodavalla

1790 - 1801

The Vertica Analytic Database: C-Store 7 Years Later

Andrew Lamb, Matt Fuller, Ramakrishna Varadarajan, Nga Tran, Ben Vandier, Lyric Doshi, Chuck Bear

1802 - 1813

Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads

Yanpei Chen, Sara Alspaugh, Randy Katz

1814 - 1825

Muppet: MapReduce-Style Processing of Fast Data

Wang Lam, Lu Liu, STS Prasad, Anand Rajaraman, Zoheb Vacheri, AnHai Doan

1826 - 1837

Building User-defined Runtime Adaptation Routines for Stream Processing Applications

Gabriela Jacques-Silva, Buğra Gedik, Rohit Wagle, Kun-Lung Wu, Vibhore Kumar

1838 - 1849

MOIST: A Scalable and Parallel Moving Object Indexer with School Tracking

Junchen Jiang, Hongji Bao, Edward Y. Chang, Yuqian Li

1850 - 1861

Serializable Snapshot Isolation in PostgreSQL

Dan R. K. Ports, Kevin Grittner

1862 - 1873

Exploiting Evidence from Unstructured Data to Enhance Master Data Management

Karin Murthy, Prasad M Deshpande, Atreyee Dey, Ramanujam Halasipuram, Mukesh Mohania, Deepak P, Jennifer Reed, Scott Schumacher

1874 - 1877

Avatara: OLAP for Web-scale Analytics Products

Lili Wu, Roshan Sumbaly, Chris Riccomini, Gordon Koo, Hyung Jin Kim, Jay Kreps, Sam Shah

1878 - 1881

Dedoop: Efficient Deduplication with Hadoop

Lars Kolb, Andreas Thor, Erhard Rahm

1882 - 1885

MapReduce-based Dimensional ETL Made Easy

Xiufeng Liu, Christian Thomsen, Torben Bach Pedersen

1886 - 1889

CloudVista: Interactive and Economical Visual Cluster Analysis for Big Data in the Cloud

Huiqi Xu, Zhen Li, Shumin Guo, Keke Chen

1890 - 1893

Myriad: Scalable and Expressive Data Generation

Alexander Alexandrov, Kostas Tzoumas, Volker Markl

1894 - 1897

A Demonstration of DBWipes: Clean as You Query

Eugene Wu, Samuel Madden, Michael Stonebraker

1898 - 1901

ASTERIX: An Open Source System for "Big Data" Management and Analysis

Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak Borkar, Yingyi Bu, Michael Carey, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Nicola Onose, Pouria Pirzadeh, Rares Vernica, Jian Wen

1902 - 1905

Blink and It's Done: Interactive Queries on Very Large Data

Sameer Agarwal, Aurojit Panda, Barzan Mozafari, Anand P. Iyer, Samuel Madden, Ion Stoica

1906 - 1909

Massive Genomic Data Processing and Deep Analysis

Abhishek Roy, Yanlei Diao, Evan Mauceli, Yiping Shen, Bai-Lin Wu

1910 - 1913

MonetDB/DataCell: Online Analytics in a Streaming Column-Store

Erietta Liarou, Stratos Idreos, Stefan Manegold, Martin Kersten

1914 - 1917

SWORS: A System for the Efficient Retrieval of Relevant Spatial Web Objects

Xin Cao, Gao Cong, Christian S. Jensen, Jun Jie Ng, Beng Chin Ooi, Nhan-Tue Phan, Dingming Wu

1918 - 1921

CyLog/Crowd4U: A Declarative Platform for Complex Data-centric Crowdsourcing

Atsuyuki Morishima, Norihide Shinagawa, Tomomi Mitsuishi, Hideto Aoki, Shun Fukusumi

1922 - 1925

Exploiting Database Similarity Joins for Metric Spaces

Yasin N. Silva, Spencer Pearson

1926 - 1929

Stethoscope: A platform for interactive visual analysis of query execution plans

Mrunal Gawade, Martin Kersten

1930 - 1933

Hum-a-song: A Subsequence Matching with Gaps-Range-Tolerances Query-By-Humming System

Alexios Kotsifakos, Panagiotis Papapetrou, Jaakko Hollmén, Dimitrios Gunopulos, Vassilis Athitsos, George Kollios

1934 - 1937

SkewTune in Action: Mitigating Skew in MapReduce Applications

YongChul Kwon, Magdalena Balazinska, Bill Howe, Jerome Rolia

1938 - 1941

Playful Query Specification with DataPlay

Azza Abouzied, Joseph M. Hellerstein, Avi Silberschatz

1942 - 1945

NoDB in Action: Adaptive Query Processing on Raw Data

Ioannis Alagiannis, Renata Borovica, Miguel Branco, Stratos Idreos, Anastasia Ailamaki

1946 - 1949

Complex Preference Queries Supporting Spatial Applications for User Groups

Florian Wenzel, Markus Endres, Stefan Mandl, Werner Kießling

1950 - 1953

Demonstration of the FDB Query Engine for Factorised Databases

Nurzhan Bakibayev, Dan Olteanu, Jakub Z√°vodn√Ω

1954 - 1957

PET: Reducing Database Energy Cost via Query Optimization

Zichen Xu, Yi-Cheng Tu, Xiaorui Wang

1958 - 1961

SPAM: A SPARQL Analysis and Manipulation Tool

Andrés Letelier, Jorge Pérez, Reinhard Pichler, Sebastian Skritek

1962 - 1965

QueryMarket Demonstration: Pricing for Online Data Markets

Paraschos Koutris, Prasang Upadhyaya, Magdalena Balazinska, Bill Howe, Dan Suciu

1966 - 1969

DISKs: A System for Distributed Spatial Group Keyword Search on Road Networks

Siqiang Luo, Yifeng Luo, Shuigeng Zhou, Gao Cong, Jihong Guan

1970 - 1973

WETSUIT: An Efficient Mashup Tool for Searching and Fusing Web Entities

Stefan Endrullis, Andreas Thor, Erhard Rahm

1974 - 1977

Model-based Integration of Past & Future in TimeTravel

Mohamed E. Khalefa, Ulrike Fischer, Torben Bach Pedersen, Wolfgang Lehner

1978 - 1981

DrillBeyond: Enabling Business Analysts to Explore the Web of Open Data

Julian Eberius, Maik Thiele, Katrin Braunschweig, Wolfgang Lehner

1982 - 1985

Discovering and Exploring Relations on the Web

Ndapandula Nakashole, Gerhard Weikum, Fabian Suchanek

1986 - 1989

MapRat: Meaningful Explanation, Interactive Exploration and Geo-Visualization of Collaborative Ratings

Saravanan Thirumuruganathan, Mahashweta Das, Shrikant Desai, Sihem Amer-Yahia, Gautam Das, Cong Yu

1990 - 1993

Deco: A System for Declarative Crowdsourcing

Hyunjung Park, Richard Pang, Aditya Parameswaran, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer Widom

1994 - 1997

Developing and Analyzing XSDs through BonXai

Wim Martens, Frank Neven, Matthias Niewerth, Thomas Schwentick

1998 - 2001

InfoPuzzle: Exploring Group Decision Making in Mobile Peer-to-Peer Databases

Aaron J. Elmore, Sudipto Das, Divyakant Agrawal, Amr El Abbadi

2002 - 2005

Manage and Query Generic Moving Objects in SECONDO

Jianqiu Xu, Ralf Hartmut Güting

2006 - 2009

Chronos: Facilitating History Discovery by Linking Temporal Records

Pei Li, Haidong Wang, Christina Tziviskou, Xin Luna Dong, Xiaoguang Liu, Andrea Maurino, Divesh Srivastava

2010 - 2013

TELEIOS: A Database-Powered Virtual Earth Observatory

Manolis Koubarakis, Kostis Kyzirakos, Manos Karpathiotakis, Charalampos Nikolaou, Stavros Vassos, George Garbis, Michael Sioutis, Konstantina Bereta, Dimitrios Michail, Charalampos Kontoes, Ioannis Papoutsis, Themos Herekakis, Stefan Manegold, Martin Kersten, Milena Ivanova, Holger Pirk, Ying Zhang, Mihai Datcu, Gottfried Schwarz, Corneliu Dumitru, Daniela Espinoza Molina, Katrin Molch, Ugo Di Giammatteo, Manuela Sagona, Sergio Perelli, Thorsten Reitz, Eva Klien, Robert Gregor

2014 - 2015

Efficient Big Data Processing in Hadoop MapReduce

Jens Dittrich, Jorge-Arnulfo Quiané-Ruiz

2016 - 2017

MapReduce Algorithms for Big Data Analysis

Kyuseok Shim

2018 - 2019

Entity Resolution: Theory, Practice & Open Challenges

Lise Getoor, Ashwin Machanavajjhala

2020 - 2021

I/O Characteristics of NoSQL Databases

Jiri Schindler

2022 - 2023

Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach

Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu

2024 - 2025

Understanding and Managing Cascades on Large Graphs

B. Aditya Prakash, Christos Faloutsos

2026 - 2027

Interoperability in eHealth Systems (Invited Tutorial)

Asuman Dogac

2028 - 2029

Secure and Privacy-Preserving Data Services in the Cloud: A Data Centric View

Divyakant Agrawal, Amr El Abbadi, Shiyuan Wang

2030 - 2031

Graph Synopses, Sketches, and Streams: A Survey

Sudipto Guha, Andrew McGregor

2032 - 2033

Challenges and Opportunities with Big Data

Alexandros Labrinidis, H. V. Jagadish

2034 - 2035

Social Networks and Mobility in the Cloud

Amr El Abbadi, Mohamed F. Mokbel

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy