Volume 6, 2012-2013

Editors-in-Chief:
Michael Böhlen, Christoph Koch
Founding Editor-in-Chief:
H. V. Jagadish
Advisory Committee:
Philip Bernstein, Michael Böehlen, Peter Buneman, Susan Davidson, Z. Meral Ozsoyoglu, S. Sudarshan, Gerhard Weikum
Information Director:
Gerald Weber
Associate Editors:
Ashraf Aboulnaga, Sihem Amer-Yahia, Chee Yong Chan, Yanlei Diao, Ada Waichee Fu, Johannes Gehrke, Alon Halevy, Jayant Haritsa, Nikos Mamoulis, Thomas Neumann, Dan Olteanu, Divesh Srivastava, Jens Teubner, Stefan Manegold
Review Board:

Volume 6, No. 1

Michael Böhlen and Christoph Koch: Front Matter i - x

1 - 12

Spatio-Textual Similarity Joins

Panagiotis Bouros, Shen Ge, and Nikos Mamoulis

13 - 24

DisC Diversity: Result Diversification based on Dissimilarity and Coverage

Marina Drosou and Evaggelia Pitoura

25 - 36

On Differentially Private Frequent Itemset Mining

Chen Zeng, Jeffrey F. Naughton, and Jin-Yi Cai

Volume 6, No. 2

Peer Kröger and Stratis D. Viglas: Front Matter i - x

37 - 48

Less is More: Selecting Sources Wisely for Integration

Xin Luna Dong, Barna Saha, and Divesh Srivastava

49 - 60

Distributed Time-aware Provenance

Wenchao Zhou, Suyog Mapara, Yiqing Ren, Yang Li, Andreas Haeberlen, Zachary Ives, Boon Thau Loo, and Micah Sherr

61 - 72

Query Processing under GLAV Mappings for Relational and Graph Databases

Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, and Moshe Y. Vardi

73 - 84

Computing Immutable Regions for Subspace Top-k Queries

Kyriakos Mouratidis and HweeHwa Pang

85 - 96

Large Scale Cohesive Subgraphs Discovery for Social Network Visual Analysis

Feng Zhao and Anthony K. H. Tung

97 - 108

Truth Finding on the Deep Web: Is the Problem Solved?

Xian Li, Xin Luna Dong, Kenneth Lyons, Weiyi Meng, and Divesh Srivastava

109 - 120

Counting with the Crowd

Adam Marcus, David Karger, Samuel Madden, Robert Miller, and Sewoong Oh

109 - 120

ClouDiA: A Deployment Advisor for Public Clouds

Tao Zou, Ronan Le Bras, Marcos Vaz Salles, Alan Demers, and Johannes Gehrke

133 - 144

An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases

Jinsoo Lee, Wook-Shin Han, Romans Kasperovics, and Jeong-Hoon Lee

145 - 156

Lightweight Locking for Main Memory Database Systems

Kun Ren, Alexander Thomson, and Daniel J. Abadi

Volume 6, No. 3

Ada Waichee Fu and Alon Halevy: Front Matter i - x

157 - 168

Lightweight Privacy-Preserving Peer-to-Peer Data Integration

Ye Zhang,Wai-Kit Wong, S. M. Yiu, Nikos Mamoulis, and David W. Cheung

169 - 180

Memory Efficient Minimum Substring Partitioning

Yang Li, Pegah Kamousi, Fangqiu Han, Shengqi Yang, Xifeng Yan, and Subhash Suri

181 - 192

NeMa: Fast Graph Search with Label Similarity

Arijit Khan, Yinghui Wu, Charu C. Aggarwal, and Xifeng Yan

193 - 204

PARAS: A Parameter Space Framework for Online Association Mining

Xika Lin, Abhishek Mukherji, Elke A. Rundensteiner, Carolina Ruiz, and Matthew O. Ward

205 - 216

Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration

Zhepeng Yan, Nan Zheng, Zachary G. Ives, Partha Pratim Talukdar, and Cong Yu

217 - 228

Spatial Keyword Query Processing: An Experimental Evaluation

Lisi Chen, Gao Cong, Christian S. Jensen, and Dingming Wu

Volume 6, No. 4

Ashraf Aboulnaga and Chee Yong Chan: Front Matter i - x

229 - 240

Partitioning and Ranking Tagged Data Sources

Milad Eftekhar and Nick Koudas

241 - 252

Efficient Implementation of Generalized Quantification in Relational Query Languages

Bin Cao and Antonio Badia

253 - 264

DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting

Rui Liu, Ashraf Aboulnaga, and Kenneth Salem

265 - 276

A Distributed Graph Engine for Web Scale RDF Data

Kai Zeng, Jiacheng Yang, Haixun Wang, Bin Shao, and Zhongyuan Wang

277 - 288

Upper and Lower Bounds on the Cost of a Map-Reduce Computation

Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, and Jeffrey D. Ullman

Volume 6, No. 5

Dan Olteanu and Divesh Srivastava: Front Matter i - x

289 - 300

Processing Analytical Queries over Encrypted Data

Stephen Tu, M. Frans Kaashoek, Samuel Madden, and Nickolai Zeldovich

301 - 312

Practical Differential Privacy via Grouping and Smoothing

Georgios Kellaris and Stavros Papadopoulos

313 - 324

On Scaling Up Sensitive Data Auditing

Yupeng Fu, Raghav Kaushik, and Ravishankar Ramamurthy

325 - 336

XORing Elephants: Novel Erasure Codes for Big Data

Maheswaran Sathiamoorthy, Megasthenis Asteris, Dimitris Papailiopoulos, Alexandros G. Dimakis, Ramkumar Vadali, Scott Chen, and Dhruba Borthakur

337 - 348

Scaling Factorization Machines to Relational Data

Steffen Rendle

Volume 6, No. 6

Jayant Haritsa and Jens Teubner: Front Matter i - x

349 - 360

Question Selection for Crowd Entity Resolution

Steven Euijong Whang, Peter Lofgren, Hector Garcia-Molina

361 - 372

A Comparison of Knives for Bread Slicing

Alekh Jindal, Endre Palatinus, Vladimir Pavlov, Jens Dittrich

373 - 384

Efficient Error-tolerant Query Autocompletion

Chuan Xiao, Jianbin Qin, Wei Wang, Yoshiharu Ishikawa, Koji Tsuda, Kunihiko Sadakane

385 - 396

Top-k Publish-Subscribe for Social Annotation of News

Alexander Shraer, Maxim Gurevich, Marcus Fontoura, Vanja Josifovski

397 - 408

Efficient Querying of Inconsistent Databases with Binary Integer Programming

Phokion G. Kolaitis, Enela Pema, Wang-Chiew Tan

409 - 420

Piggybacking on Social Networks

Aristides Gionis, Flavio Junqueira, Vincent Leroy, Marco Serafinin, Ingmar Weber

421 - 432

Schema Extraction for Tabular Data on the Web

Marco D. Adelfio, Hanan Samet

433 - 444

Streaming Algorithms for k-core Decomposition

Ahmet Erdem Sariyuce, Bugra Gedik, Gabriela Jacques-Silva, Kun-Lung Wu, Umit V. Catalyurek

444 - 456

Discovering Linkage Points over Web Data

Oktie Hassanzadeh, Ken Q. Pu, Soheil Hassas Yeganeh, Renee J. Miller, Lucian Popa, Muricio A. Hernandez, Howard Ho

457 - 468

IS-LABEL: an Independent-Set based Labeling Scheme for Point-to-Point Distance Querying

Ada Wai-Chee Fu, Huanhuan Wu, James Cheng, Raymond Chi-Wing Wong

469 - 480

Supporting User-Defined Functions on Uncertain Data

Thanh T. L. Tran, Yanlei Diao, Charles Sutton, Anna Liu

481 - 492

Incremental and Accuracy-Aware Personalized PageRank through Scheduled Approximation

Fanwei Zhu, Yuan Fang, Kevin Chen-Chuan Chang, Jing Ying

Volume 6, No. 7

Johannes Gehrke and Nikos Mamoulis: Front Matter i - x

493 - 504

Efficient SimRank-based Similarity Join Over Large Graphs

Weiguo Zheng, Lei Zou, Yansong Feng, Lei Chen, Dongyan Zhao

505 - 516

A Performance Study of Three Disk-based Structures for Indexing and Querying Frequent Itemsets

Guimei Liu, Andre Suchitra, Limsoon Wong

517 - 528

TripleBit: a Fast and Compact System for Large Scale RDF Data

Pingpeng Yuan, Pu Liu, Buwen Wu, Hai Jin, Wenya Zhang, Ling Liu

529 - 540

CorrectDB: SQL Engine with Practical Query Authentication

Sumeet Bajaj, Radu Sion

Volume 6, No. 8

Sihem Amer-Yahia and Stefan Manegold: Front Matter i - x

541 - 552

Hybrid Storage Management for Database Systems

Xin Liu, Kenneth Salem

553 - 564

Scorpion: Explaining Away Outliers in Aggregate Queries

Eugene Wu, Samuel Madden

565 - 576

Ratio Threshold Queries over Distributed Data Sources

Rajeev Gupta, Krithi Ramamritham, Mukesh Mohania

577 - 588

On the Complexity of Query Result Diversification

Ting Deng, Wenfei Fan

589 - 600

Streaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams

Sourav Dutta, Ankur Narang, Suman K. Bera

Volume 6, No. 9

Yanlei Diao and Thomas Neumann: Front Matter i - x

601 - 612

On Repairing Structural Problems In Semi-structured Data

Flip Korn, Barna Saha, Divesh Srivastava, Shanshan Ying

613 - 624

A Distributed Algorithm for Large-Scale Generalized Matching

Faraz Makari Manshadi, Baruch Awerbuch, Rainer Gemula, Rohit Khandekar, Julian Mestre, Mauro Sozio

625 - 636

The LLUNATIC Data-Cleaning Framework

Floris Geerts, Giansalvatore Mecca, Paolo Papotti, Donatello Santoro

637 - 648

Sharing Data and Work Across Concurrent Analytical Queries

Iraklis Psaroudakis, Manos Athanassoulis, Anastasia Ailamaki

649 - 660

Skyline Operator on Anti-correlated Distributions

Haichuan Shang, Masaru Kitsuregawa

661 - 672

Low-Latency Multi-Datacenter Databases using Replicated Commit

Hatem Mahmoud, Faisal Nawab, Alexander Pucher, Divyakant Agrawal, Amr El Abbadi

673 - 684

Distribution-Based Query Scheduling

Yun Chi, Hakan Hacigumus, Wang-Pin Hsiung, Jeffrey F. Naughton

685 - 696

Making Queries Tractable on Big Data with Preprocessing

Wenfei Fan, Floris Geerts, Frank Neven

697 - 708

Answering Planning Queries with the Crowd

Haim Kaplan, Ilia Lotosh, Tova Milo, Slava Novgorodov

709 - 720

Hardware-Oblivious Parallelism for In-Memory Column-Stores

Max Heimel, Michael Saecker, Holger Pirk, Stefan Manegold, Volker Markl

721 - 732

Permuting Data on Random-Access Block Storage

Risi Thonangi, Jun Yang

733 - 744

Improving Flash Write Performance by Using Update Frequency

Radu Stoica, Anastasia Ailamaki

745 - 756

Efficient Indexing for Diverse Query Results

Lu Li, Chee-Yong Chan

757 - 768

Reducing Uncertainty of Schema Matching via Crowdsourcing

Chen Jason Zhang, Lei Chen, H. V. Jagadish, Chen Caleb Cao

769 - 780

Travel Cost Inference from Sparse, Spatio-Temporally Correlated Time Series Using Markov Models

Bin Yang, Chenjuan Guo, Christian S. Jensen

Volume 6, No. 10

Themis Palpanas and Yannis Velegrakis: Front Matter i - x

781 - 792

Query Optimization over Crowdsourced Data

Hyunjung Park, Jennifer Widom

793 - 804

A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series

Yang Wang, Peng Wang, Jian Pei, Wei Wang, Sheng Huang

805 - 816

Extraction and Integration of Partially Overlapping Web Sources

Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti

817 - 828

The Yin and Yang of Processing Data Warehousing Queries on GPU Devices

Yuan Yuan, Rubao Lee, Xiaodong Zhang

829 - 840

Mining and Indexing Graphs for Supergraph Search

Dayu Yuan, Prasenjit Mitra, C. Lee Giles

841 - 852

Efficient Recovery of Missing Events

Jianmin Wang, Shaoxu Song, Xiaochen Zhu, Xuemin Lin

853 - 864

Hadoop's Adolescence

Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe

865 - 876

RACE: A Scalable and Elastic Parallel System for Discovering Repeats in Very Long Sequences

Essam Mansour, Ahmed El-Roby, Panos Kalnis, Aron Ahmadia, Ashraf Aboulnaga

877 - 888

LLAMA: A Cache/Storage Subsystem for Modern Hardware

Justin Levandoski, David Lomet, Sudipta Sengupta

889 - 900

Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture

Jiong He, Mian Lu, Bingsheng He

901 - 912

Top-K Nearest Keyword Search on Large Graphs

Miao Qiao, Lu Qin, Hong Cheng, Jeffrey Xu Yu, Wentao Tian

913 - 924

A General Framework for Geo-Social Query Processing

Nikos Armenatzoglou, Stavros Papadopoulos, Dimitris Papadias

925 - 936

Towards Predicting Query Execution Time for Concurrent and Dynamic Database Workloads

Wentao Wu, Yun Chi, Hakan Hacigumus, Jeffrey F. Naughton

937 - 948

Sketch-based Geometric Monitoring of Distributed Stream Queries

Minos Garofalakis, Daniel Keren, Vasilis Samoladas

949 - 960

Direction-Preserving Trajectory Simplification

Cheng Long, Raymond Chi-Wing Wong, Chenjuan Guo, H. V. Jagadish

Volume 6, No. 11

Min Wang and Cong Yu: Front Matter i - x

961 - 972

Continuous Cloud-Scale Query Optimization and Processing

Nicolas Bruno, Sapna Jain, Jingren Zhou

973 - 984

Optimization Strategies for A/B Testing on HADOOP

Andrii Cherniak, Huma Zaidi, Vladimir Zadorozhny

985 - 996

Piranha: Optimizing Short Jobs in Hadoop

Khaled Elmeleegy

997 - 1008

Making Updates Disk-I/O Friendly Using SSDs

Mohammad Sadoghi, Kenneth A. Ross, Mustafa Canim, Bishwaranjan Bhattacharjee

1009 - 1020

Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce

Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, Joel Saltz

1021 - 1032

Statistics Collection in Oracle Spatial and Graph: Fast Histogram Construction for Complex Geometry Objects

Bhuvan Bamba, Siva Ravada, Ying Hu, Richard Anderson

1033 - 1044

MillWheel: Fault-Tolerant Stream Processing at Internet Scale

Tyler Akidau, Alex Balikov, Kaya Bekiroglu, Slava Chernyak, Josh Haberman, Reuven Lax, Sam McVeety, Daniel Mills, Paul Nordstrom, Sam Whittle

1045 - 1056

Online, Asynchronous Schema Change in F1

Ian Rae, Eric Rollins, Jeff Shute, Sukhdeep Sodhi, Radek Vingralek

1057 - 1067

Scuba: Diving into Data at Facebook

Lior Abraham, John Allen, Oleksandr Barykin, Vinayak Borkar, Bhuwan Chopra, Ciprian Gerea, Daniel Merl, Josh Metzler, David Reiss, Subbu Subramanian, Janet L. Wiener, Okay Zed

1068 - 1079

F1: A Distributed SQL Database That Scales

Jeff Shute, Radek Vingralek, Bart Samwel, Ben Handy, Chad Whipkey, Eric Rollins, Mircea Oancea, Kyle Littlefield, David Menestrina, Stephan Ellner, John Cieslewicz, Ian Rae, Traian Stancescu, Himani Apte

1080 - 1091

DB2 with BLU Acceleration: So Much More than Just a Column Store

Vijayshankar Raman, Gopi Attaluri, Ronald Barber, Naresh Chainani, David Kalmuk, Vincent KulandaiSamy, Jens Leenstra, Sam Lightstone, Shaorong Liu, Guy M. Lohman, Tim Malkemus, Rene Mueller, Ippokratis Pandis, Berni Schiefer, David Sharpe, Richard Sidle, Adam Storm, Liping Zhang

1092 - 1101

A The Quantcast File System

Michael Ovsiannikov, Silvius Rus, Damian Reeves, Paul Sutter, Sriram Rao, Jim Kelly

1102 - 1113

Adaptive and Big Data Scale Parallel Execution in Oracle

Srikanth Bellamkonda, Hua-Gang Li, Unmesh Jagtap, Yali Zhu, Vince Liang, Thierry Cruanes

1114 - 1125

WOO: A Scalable and Multi-tenant Platform for Continuous Knowledge Base Synthesis

Kedar Bellare, Carlo Curino, Ashwin Machanavajihala, Peter Mika, Mandar Rahurkar, Aamod Sane

1126 - 1137

Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach

Abhishek Gattani, Digvijay S. Lamba, Nikesh Garera, Mitul Tiwari, Xiaoyong Chai, Sanjib Das, Sri Subramaniam, Anand Rajaraman, Venky Harinarayan, AnHai Doan

1138 - 1149

Overview of Turn Data Management Platform for Digital Advertising

Hazem Elmeleegy, Yinan Li, Yan Qi, Peter Wilmot, Mingxi Wu, Santanu Kolay, Ali Dasdan, Songting Chen

1150 - 1161

Unicorn: A System for Searching the Social Graph

Michael Curtiss, Iain Becker, Tudor Bosman, Sergey Doroshenko, Lucian Grijincu, Tom Jackson, Sandhya Kunnatur, Soren Lassen, Philip Pronin, Sriram Sankar, Guanghao Shen, Gintaras Woss, Chao Yang, Ning Zhang

1162 - 1163

A New Service for Customer Care Based on the TrentoRise BigData Platform

Sergio Ramazzina, Chiara L. Ballari, Daniela Somenzi

1164 - 1165

Exploiting the Diversity, Mass and Speed of Territorial Data by TELCO Operator for Better User Services

Fabrizio Antonelli, Antonino Casella, Cristiana Chitic, Roberto Larcher, Giovanni Torrisi

1166 - 1167

The Trento Big Data Platform for Public Administration and Large Companies: Use cases and Opportunities

Ivan Bedini, Benedikt Elser, Yannis Velegrakis

1168 - 1169

Designing Query Optimizers for Big Data Problems of The Future

Nga Tran, Sreenath Bodagala, Jaimin Dave

1170 - 1171

How to maximize the value of big data with the open source SpagoBI suite through a comprehensive approach

Monica Franceschini

1172 - 1173

Context-Aware Computing: Opportunities and Open Issues

Edward Y. Chang

1174 - 1175

Next Generation Data Analytics at IBM Research

Oktie Hassanzadeh, Anastasios Kementsietsidis, Benny Kimelfeld, Rajasekar Krishnamurthy, Fatma Ozcan, Ippokratis Pandis

1176 - 1177

Learning and Intelligent Optimization (LION): One Ring to Rule Them All

Mauro Brunato, Roberto Battiti

1178 - 1179

Microsoft SQL Server's Integrated Database Approach for Modern Applications and Hardware

David Lomet

1180 - 1181

Odyssey: A Multi-Store System for Evolutionary Analytics

Hakan Hacigumus, Jagan Sankaranarayanan, Junichi Tatemura, Jeff LeFevre, Neoklis Polyzotis

1182 - 1183

A global Entity Name System (ENS) for data ecosystems

Paolo Bouquet, Andrea Molinari

1184 - 1185

SAP HANA: The Evolution from a Modern Main-Memory Data Platform to an Enterprise Application Platform

Vishal Sikka, Franz Farber, Anil Goel, Wolfgang Lehner

1186 - 1187

Keeping the TPC Relevant!

Raghunath Nambiar, Meikel Poess

1188 - 1189

Big Data Integration

Xin Luna Dong, Divesh Srivastava

1190 - 1191

Just-in-time compilation for SQL query processing

Stratis D. Viglas

1192 - 1193

Toward Scalable Transaction Processing

Anastasia Ailamaki, Ryan Johnson, Ippokratis Pandis, Pinar Tozun

1194 - 1195

Towards Database Virtualization for Database as a Service

Aaron J. Elmore, Carlo Curino, Divyakant Agrawal, Amr El Abbadi

1196 - 1197

Mobility and Social Networking: A Data Management Perspective

Mohamed F. Mokbel, Mohamed Sarwat

Volume 6, No. 12

Dimitrios Gunopoulos, Letizia Tanca and Jun Yang: Front Matter i - x

1198 - 1201

DesTeller: A System for Destination Prediction Based on Trajectories with Privacy Protection

Andy Yuan Xue, Rui Zhang, Yu Zheng, Xing Xie, Jianhui Yu, Yong Tang, Sapna Jain, Jingren Zhou

1202 - 1205

Senbazuru: A Prototype Spreadsheet Database Management System

Zhe Chen, Mike Cafarella, Jun Chen, Daniel Prevo, Junfeng Zhuang

1206 - 1209

ReqFlex: Fuzzy Queries for Everyone

Gregory Smits, Olivier Pivert, Thomas Girault

1210 - 1213

Comprehensive and Interactive Temporal Query Processing with SAP HANA

Martin Kaufmann, Panagiotis Vagenas, Peter Fischer, Donald Kossmann, Franz Farber

1214 - 1217

Functions Are Data Too (Defunctionalization for PL/SQL)

Torsten Grust, Nils Schweinsberg, Alexander Ulrich

1218 - 1221

NADEEF: A Generalized Data Cleaning System

Amr Ebaid, Ahmed Elmagarmid, Ihab Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiane-Ruiz, Nan Tang, Si Yin

1222 - 1225

QUEST: A Keyword Search System for Relational Data based on Semantic and Machine Learning Techniques

Sonia Bergamaschi, Francesco Guerra, Matteo Interlandi, Raquel Trillo Lado, Yannis Velegrakis

1226 - 1229

GroupFinder: A New Approach to Top-K Point-of-Interest Group Retrieval

Kenneth B√∏gh, Anders Skovsgaard, Christian Jensen

1230 - 1233

A Demonstration of SpatialHadoop: An Efficient MapReduce Framework for Spatial Data

Ahmed Eldawy, Mohamed Mokbel

1234 - 1237

Aggregate Profile Clustering for Telco Analytics

Mehmet Ali Abbasoglu, Bugra Gedik, Hakan Ferhatosmanoglu

1238 - 1241

ROSeAnn: Reconciling Opinions of Semantic Annotators

Luying Chen, Stefano Ortona, Giorgio Orsi, Michael Benedikt

1242 - 1245

A RecDB in Action: Recommendation Made Easy in Relational Databases

Mohamed Sarwat, James Avery, Mohamed Mokbel

1246 - 1249

POIKILO: A Tool for Evaluating the Results of Diversification Models and Algorithms

Marina Drosou, Evaggelia Pitoura

1250 - 1253

CrowdMiner: Mining association rules from the crowd

Yael Amsterdamer, Yael Grossman, Tova Milo, Pierre Senellart

1254 - 1257

TeRec: A Temporal Recommender System Over Tweet Stream

Chen Chen, Hongzhi Yin, Junjie Yao, Bin Cui

1258 - 1261

Graph Queries in a Next-Generation Datalog System

Alexander Shkapsky, Kai Zeng, Carlo Zaniolo

1262 - 1265

iRoad: A Framework For Scalable Predictive Query Processing On Road Networks

Abdeltawab Hendawi, Jie Bao, Mohamed Mokbel

1266 - 1269

SkySuite: A Framework of Skyline-Join Operators for Static and Stream Environments

Mithila Nagendra, K. selcuk Candan

1270 - 1273

Parallel Graph Processing on Graphics Processors Made Easy

Jianlong Zhong, Bingsheng He

1274 - 1277

Mosquito: Another One Bites the Data Upload STream

Stefan Richter, Jens Dittrich, Stefan Schuh, Tobias Frey

1278 - 1281

NoFTL: Database Systems on FTL-less Flash Storage

Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro Buchmann

1282 - 1285

SmartMonitor: Using Smart Devices to Perform Structural Health Monitoring

Dimitrios Kotsakos, Panos Sakkos, Vana Kalogeraki, Dimitirios Gunopulos

1286 - 1289

Lazy ETL in Action: ETL Technology Dates Scientific Data

Yagiz Kar√¶z, Milena Ivanova, Ying Zhang, Stefan Manegold, Martin Kersten

1290 - 1293

EagleTree: Exploring the Design Space of SSD-Based Algorithms

Niv Dayan, Martin Kj√¶r Svendsen, Matias B√∏rling, Philippe Bonnet, Luc Bouganim

1294 - 1297

EnviroMeter: A Platform for Querying Community-Sensed Data

Saket Sathe, Arthur Oviedo, Dipanjan Chakraborty, Karl Aberer

1298 - 1301

Scolopax: Exploratory Analysis of Scientific Data

Alper Okcan, Mirek Riedewald, Biswanath Panda, Daniel Fink

1302 - 1305

PROPOLIS: Provisioned Analysis of Data-Centric Processes

Daniel Deutch, Yuval Moskovitch, Val Tannen

1306 - 1309

Feature Selection in Enterprise Analytics: A Demonstration using an R-based Data Analytics System

Pradap Konda, Arun Kumar, Christopher Re, Vaishnavi Sashikanth

1310 - 1313

Flexible Query Processor on FPGAs

Mohammadreza Najafi, Mohammad Sadoghi, Hans-Arno Jacobsen

1314 - 1317

MASTRO STUDIO: Managing Ontology-Based Data Access applications

Cristina Civili, Marco Console, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Lorenzo Lepore , Riccardo Mancini, Antonella Poggi, Riccardo Rosati, Marco Ruzzi, Valerio Santarelli, Domenico Fabio Savo

1318 - 1321

PLASMA-HD: Probing the LAttice Structure and MAkeup of High-dimensional Data

David Fuhry, Yang Zhang, Venu Satuluri, Arnab Nandi, Srinivasan Parthasarathy

1322 - 1325

A Demonstration of Iterative Parallel Array Processing in Support of Telescope Image Analysis

Matthew Moyers , Emad Soroush, Spencer Wallace, Simon Krughoff, Jake Vanderplas, Magdalena Balazinska, Andrew Connolly

1326 - 1329

EvenTweet: Online Localized Event Detection from Twitter

Hamed Abdelhaq, Christian Sengstock, Michael Gertz

1330 - 1333

IBminer: A Text Mining Tool for Constructing and Populating InfoBox Databases and Knowledge Bases

Hamid Mousavi, Shi Gao, Carlo Zaniolo

1334 - 1337

PAQO: A Preference-Aware Query Optimizer for PostgreSQL

Nicholas Farnan, Adam Lee, Panos Chyrsanthis, Ting Yu

1338 - 1341

eSkyline: Processing Skyline Queries over Encrypted Data

Suvarna Bothe, Panagiotis Karras, Akrivi Vlachou

1342 - 1345

GestureQuery: A Multitouch Database Query Interface

Lilong Jiang, Michael Mandel, Arnab Nandi

1346 - 1349

Mining and Linking Patterns across Live Data Streams and Stream Archives

Di Yang, Kaiyu Zhao, Maryam Hasan, Hanyuan Lu, Elke Rundensteiner, Matthew Ward

1350 - 1353

PhotoStand: A Map Query Interface for a Database of News Photos

Hanan Samet, Marco Adelfio, Brenden Fruin, Michael Lieberman, Jagan Sankaranarayanan

1354 - 1357

Hone: "Scaling Down" Hadoop on Shared-Memory Systems

K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin

1358 - 1361

Ringtail: A Generalized Nowcasting System

Dolan Antenucci, Erdong Li, Shaobo Liu, Bochun Zhang, Mike Cafarella, Christopher Re

1362 - 1365

IPS: An Interactive Package Configuration System for Trip Planning

Min Xie, Laks V. S. Lakshmanan, Peter Wood

1366 - 1369

R2-D2: a System to Support Probabilistic Path Prediction in Dynamic Environments via "Semi-Lazy" Learning

Jingbo Zhou, Anthony Tung , Wei Wu, Wee Siong Ng

1370 - 1373

REEF: Retainable Evaluator Execution Framework

Byung-Gon Chun, Tyson Condie, Carlo Curino, Raghu Ramakrishnan, Russell Sears, Markus Weimer

1374 - 1377

OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures

Shuhao Zhang, Jiong He, Bingsheng He, Mian Lu

1378 - 1381

Complete Approximations of Incomplete Queries

Ognjen Savkovic, Paramita Mirza, Alex Tomasi, Werner Nutt

1382 - 1385

User Analytics with UbeOne: Insights into Web Printing

Georgia Koutrika, Qian Lin, Jerry Liu

1386 - 1389

DiAl: Distributed Streaming Analytics Anywhere, Anytime

Ivo Santos, Marcel Tilly, Badrish Chandramouli, Jonathan Goldstein

1390 - 1391

Big and Useful: What's in the Data for Me?

Rada Chirkova, Jun Yang

1392 - 1397

Universal Indexing of Arbitrary Similarity Models

Tomas Bartos

1398 - 1403

Why it is time for a HyPE: A Hybrid Query Processing Engine for Efficient GPU Coprocessing in DBMS

Sebastian Bress

1404 - 1409

Database Support for Unstructured Meshes

Alireza Rezaei Mahdiraji

1410 - 1415

Domain Specific Multi-stage Query Language for Medical Document Repositories

Aastha Madaan

1416 - 1421

Realtime Analysis of Information Diffusion in Social Media

Io Taxidou

1422 - 1427

Mining Frequent Patterns with Differential Privacy

Luca Bonomi

1428 - 1433

Automatic ontology-based User Profile Learning from heterogeneous Web Resources in a Big Data Context

Anett Hoppe

1434 - 1439

Scalable Transactions across Heterogeneous NoSQL Key-Value Data Stores

Akon Dey

1440 - 1443

Getting Unique Solution in Data Exchange

Nhung Ngo

1444 - 1449

Storing and Processing Temporal Data in a Main Memory Column Store

Martin Kaufmann

1450 - 1455

Efficiency and Security in Similarity Cloud Services

Stepan Kozak

1456 - 1461

Fast Cartography for Data Explorers

Thibault Sellam

Volume 6, No. 13

Peer Kröger and Stratis D. Viglas: Front Matter i - x

1462 - 1473

When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms

Goncalo Simoes, Helena Galhardas, Luis Gravano

1474 - 1485

Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases

Craig Chasseur, Jignesh M. Patel

1486 - 1497

Aggregating Semantic Annotators

Luying Chen, Stefano Ortona, Giorgio Orsi, Michael Benedikt

1498 - 1509

Discovering Denial Constraints

Xu Chu, Ihab F. Ilyas, Paolo Papotti

1510 - 1521

Diversified Top-k Graph Pattern Matching

Wenfei Fan, Xin Wang, Yinghui Wu

1522 - 1533

Bitlist: New Full-text Index for Low Space Cost and Efficient Keyword Search

Weixiong Rao, Lei Chen, Pan Hui, Sasu Tarkoma

1534 - 1545

RCSI: Scalable similarity search in thousand(s) of genomes

Sebastian Wandelt, Johannes Starlinger, Marc Bux, Ulf Leser

1546 - 1557

Approximate MaxRS in Spatial Databases

Yufei Tao, Xiaocheng Hu, Dong-Wan Choi, Chin-Wan Chung

1558 - 1569

Multi-Tuple Deletion Propagation: Approximations and Complexity

Benny Kimelfeld, Jan Vondrak, David P. Woodruff

1570 - 1581

Supporting Distributed Feed-Following Apps over Edge Devices

Badrish Chandramouli, Suman Nath, Wenchao Zhou

1582 - 1593

Rank Discovery From Web Databases

Saravanan Thirumuruganathan, Nan Zhang, Gautam Das

1594 - 1605

SPARSI: Partitioning Sensitive Data amongst Multiple Adversaries

Theodoros Rekatsinas, Amol Deshpande, Ashwin Machanavajjhala

1606 - 1617

Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases

Dong Deng, Yu Jiang, Guoliang Li, Jian Li, Cong Yu

1618 - 1629

Top-K Structural Diversity Search in Large Networks

Xin Huang, Hong Cheng, Rong-Hua Li, Lu Qin, Jeffrey Xu Yu

1630 - 1641

Synthetising Changes in XML Documents as PULs

Federico Cavalieri, Alessandro Solimando, Giovanna Guerrini

Volume 6, No. 14

: Front Matter MichaelBöhlen,ChristophKoch - MichaelBöhlen,ChristophKoch

1642 - 1653

Probabilistic Query Rewriting for Efficient and Effective Keyword Search on Graph Data

Lei Zhang, Thanh Tran, Achim Rettinger

1654 - 1665

QuEval: Beyond high-dimensional indexing a la carte

Martin Schäler, Alexander Grebhahn, Reimar Schröter, Sandro Schulze, Veit Köppen, Gunter Saake

1666 - 1677

Discovering Longest-lasting Correlation in Sequence Databases

Yuhong Li, Leong Hou U, Man Lung Yiu, Zhiguo Gong

1678 - 1689

PREDIcT: Towards Predicting the Runtime of Large Scale Iterative Analytics

Adrian Daniel Popescu, Andrey Balmin, Vuk Ercegovac, Anastasia Ailamaki

1690 - 1701

On the Embeddability of Random Walk Distances

Xiaohan Zhao, Adelbert Chang, Atish Das Sarma, Haitao Zheng, Ben Y. Zhao

1702 - 1713

Instant Loading for Main Memory Databases

Tobias Mühlbauer, Wolf Rödiger, Robert Seilbeck, Angelika Reiser, Alfons Kemper, Thomas Neumann

1714 - 1725

Adaptive Range Filters for Cold Data: Avoiding Trips to Siberia

Karolina Alexiou, Donald Kossmann, Per-Ake Larson

1726 - 1737

Scalable Progressive Analytics on Big Data in the Cloud

Badrish Chandramouli, Jonathan Goldstein, Abdul Quamar

1738 - 1749

Scalable XML Query Processing using Parallel Pushdown Transducers

Peter Ogden, David Thomas, Peter Pietzuch

1750 - 1761

Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters

Yin Huai, Siyuan Ma, Rubao Lee, Owen O’Malley, Xiaodong Zhang

1762 - 1773

A Probabilistic Optimization Framework for the Empty-Answer Problem

Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis

1774 - 1785

Summarizing Answer Graphs Induced by Keyword Queries

Yinghui Wu, Shengqi Yang, Mudhakar Srivatsa, Arun Iyengar, Xifeng Yan

1786 - 1797

Supporting Keyword Search in Product Database: A Probabilistic Approach

Huizhong Duan, ChengXiang Zhai, Jinxing Cheng, Abhishek Gattani

1798 - 1809

A Sampling Algebra for Aggregate Estimation

Supriya Nirkhiwale, Alin Dobra, Christopher Jermaine

1810 - 1821

A Temporal-Probabilistic Database Model for Information Extraction

Maximilian Dylla, Iris Miliaraki, Martin Theobald

1822 - 1833

Counter Strike: Generic Top-Down Join Enumeration for Hypergraphs

Pit Fender, Guido Moerkotte

1834 - 1845

Efficient Bulk Updates on Multiversion B-trees

Daniar Achakeev, Bernhard Seeger

1846 - 1857

Query-Driven Approach to Entity Resolution

Hotham Altwaijry, Dmitri V. Kalashnikov, Sharad Mehrotra

1858 - 1869

Expressiveness and Complexity of Order Dependencies

Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz, Calisto Zuzarte

1870 - 1881

Counting and Sampling Triangles from a Graph Stream

A. Pavan, Kanat Tangwongsan, Srikanta Tirthapura, Kun-Lung Wu

1882 - 1893

An Experimental Analysis of Iterated Spatial Joins in Main Memory

Benjamin Sowell, Marcos Vaz Salles, Tuan Cao, Alan Demers, Johannes Gehrke

1894 - 1905

Scaling Queries over Big RDF Graphs with Semantic Hash Partitioning

Kisung Lee, Ling Liu

1906 - 1917

Distributed SociaLite: A Datalog-Based Language for Large-Scale Graph Analysis

Jiwon Seo, Jongsoo Park, Jaeho Shin, Monica S. Lam

1918 - 1929

Horton+: A Distributed System for Processing Declarative Reachability Queries over Partitioned Graphs

Mohamed Sarwat, Sameh Elnikety, Yuxiong He, Mohamed F. Mokbel

1930 - 1941

Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing

Narayanan Sundaram, Aizana Turmukhametova, Nadathur Satish, Todd Mostak, Piotr Indyk, Samuel Madden, Pradeep Dubey

1942 - 1953

Anti-Caching: A New Approach to Database Management System Architecture

Justin DeBrabant, Andrew Pavlo, Stephen Tu, Michael Stonebraker, Stan Zdonik

1954 - 1965

Understanding Hierarchical Methods for Differentially Private Histograms

Wahbeh Qardaji, Weining Yang, Ninghui Li

1966 - 1977

Towards Social Data Platform: Automatic Topic-focused Monitor for Twitter Stream

Rui Li, Shengjie Wang, Kevin Chen-Chuan Chang

1978 - 1989

Simple, Fast, and Scalable Reachability Oracle

Ruoming Jin, Guan Wang

1990 - 2001

Aggregation and Ordering in Factorised Databases

Nurzhan Bakibayev, Tomas Kocisky, Dan Olteanu, Jakub Zavodny

2002 - 2013

Parallel Computation of Skyline and Reverse Skyline Queries Using MapReduce

Yoonjae Park, Jun-Ki Min, Kyuseok Shim

2014 - 2025

Fast Iterative Graph Computation with Block Updates

Wenlei Xie, Guozhang Wang, David Bindel, Alan Demers, Johannes Gehrke

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy