Volume 7, 2013-2014

H. V. Jagadish and Aoying Zhou
Founding Editor-in-Chief:
H. V. Jagadish
Managing Editor:
Divesh Srivastava
Information Director:
Gerald Weber
Advisory Committee:
Philip Bernstein, Michael Böehlen, Peter Buneman, Susan Davidson, Z. Meral Ozsoyoglu, S. Sudarshan, Gerhard Weikum
Associate Editors:
Shivnath Babu, Lei Chen, Graham Cormode, Bin Cui, Wynne Hsu, Martin Kersten, Donald Kossman, Elke Rundensteiner, Kyuseok Shim, Wang-Chiew Tan, Letizia Tanca, Jeffrey Yu, Gao Cong, Jens Dittrich, Zachary Ives
Review Board:

Volume 7, No. 1

H. V. Jagadish and Aoying Zhou: Front Matter i - ix

1 - 12

Efficient and Effective KNN Sequence Search with Approximate n-grams

Xiaoli Wang, Xiaofeng Ding, Anthony Tung, Zhenjie Zhang

13 - 24

More is Simpler: Effectively and Efficiently Assessing Node-Pair Similarities Based on Hyperlinks

Weiren Yu, Xuemin Lin, Wenjie Zhang, Lijun Chang, Jian Pei

25 - 36

An Approach towards the Study of Symmetric Queries

Marc Gyssens, Jan Paredaens, Dirk Van Gucht, Jef Wijsen, Yuqing Wu

37 - 48

CPU Sharing Techniques for Performance Isolation in Multitenant Relational Database-as-a-Service

Sudipto Das, Vivek Narasayya, Feng Li, Manoj Syamala

49 - 60

Authenticating Top-k Queries in Location-based Services with Confidentiality

Qian Chen, Haibo Hu, Jianling Xu

61 - 72

Toward a Distance Oracle for Billion-Node Graphs

Zichao Qi, Yanghua Xiao, Bin Shao, Haixun Wang

73 - 84

Finding Shortest Paths on Terrains by Killing Two Birds with One Stone

Manohar Kaul, Raymond Chi-Wing Wong, Bin Yang, Christian Jensen

85 - 96

Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited

Cagri Balkesen, Gustavo Alonso, Jens Teubner, M. Tamer Özsu

Volume 7, No. 2

Gao Cong and Jens Dittrich: Front Matter i - ix

97 - 108

The Uncracked Pieces in Database Cracking

Felix Martin Schuhknecht, Alekh Jindal, Jens Dittrich

109 - 120

Diversity based Relevance Feedback for Time Series Search

Bahaeddin Eravci, Hakan Ferhatosmanoglu

121 - 132

Storage Management in the NVRAM Era

Steven Pelley, Thomas F. Wenisch, Brian T. Gold, Bill Bridge

Volume 7, No. 3

Zachary Ives: Front Matter i - ix

133 - 144

Online Ordering of Overlapping Data Sources

Mariam Salloum, Xin Luna Dong, Divesh Srivastava, Vassilis J. Tsotras

145 - 156

Multi-Query Optimization in MapReduce Framework

Guoping Wang, Chee-Yong Chan

157 - 168

Attraction and Avoidance Detection from Movements

Zhenhui Li, Bolin Ding, Fei Wu, Tobias Kin Hou Lei, Roland Kays, Margaret C. Crofoot

169 - 180

A Partition-Based Approach to Structure Similarity Search

Xiang Zhao, Chuan Xiao, Xuemin Lin, Qing Liu, Wenjie Zhang

181 - 192

Highly Available Transactions: Virtues and Limitations

Peter Bailis, Aaron Davidson, Alan Fekete, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

193 - 204

From "Think Like a Vertex" to "Think Like a Graph"

Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, Shirish Tatikonda, John McPherson

205 - 216

Probabilistic Nearest Neighbor Queries on Uncertain Moving Object Trajectories

Johannes Niedermayer, Andreas Zufle, Tobias Emrich, Matthias Renz, Nikos Mamoulis, Lei Chen, Hans-Peter Kriegel

Volume 7, No. 4

Martin Kersten: Front Matter i - ix

217 - 228

Delta: Scalable Data Dissemination under Capacity Constraints

Konstantinos Karanasos, Asterios Katsifodimos, Ioana Manolescu

229 - 240

GeoScope: Online Detection of Geo-Correlated Information Trends in Social Networks

Ceren Budak, Theodore Georgiou, Divyakant Agrawal, Amr El Abbadi

241 - 252

Optimization for iterative queries on MapReduce

Makoto Onizuka, Hiroyuki Kato, Soichiro Hidaka, Keisuke Nakano, Zhenjiang Hu

253 - 264

Willingness Optimization for Social Group Activity

Hong-Han Shuai, De-Nian Yang, Philip S. Yu, Ming-Syan Chen

265 - 276

High Performance Stream Query Processing With Correlation-Aware Partitioning

Lei Cao, Elke A. Rundensteiner

277 - 288

OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases

Djellel Eddine Difallah, Andrew Pavlo, Carlo Curino, Philippe Cudre-Mauroux

289 - 300

Gestural Query Specification

Arnab Nandi, Lilong Jiang, Michael Mandel

301 - 312

Scalable Discovery of Unique Column Combinations

Arvid Heise, Jorge-Arnulfo, Quiane-Ruiz, Ziawasch Abedjan, Anja Jentzsch, Felix Naumann

313 - 324

Earth Mover's Distance based Similarity Search at Scale

Yu Tang, Leong Hou U, Yilun Cai, Nikos Mamoulis, Reynold Cheng

325 - 328

SeeDB: Visualizing Database Queries Efficiently

Aditya Parameswaran, Neoklis Polyzotis, Hector Garcia-Molina

Volume 7, No. 5

Michael Carey: Front Matter i - ix

329 - 340

MaaT: Effective and scalable coordination of distributed transactions in the cloud

Hatem A. Mahmoud, Vaibhav Arora, Faisal Nawab, Divyakant Agrawal, Amr El Abbadi

341 - 352

A Data- and Workload-Aware Query Answering Algorithm for Range Queries Under Differential Privacy

Chao Li, Michael Hay, Gerome Miklau, Yue Wang

353 - 364

Certain Query Answering in Partially Consistent Databases

Sergio Greco, Fabian Pijcke, Jef Wijsen

365 - 376

Exemplar Queries: Give me an Example of What You Need

Davide Mottin, Matteo Lissandrini, Yannis Velegrakis, Themis Palpanas

377 - 388

An efficient reconciliation algorithm for social networks

Nitish Korula, Silvio Lattanzi

389 - 400

Computing k-Regret Minimizing Sets

Sean Chester, Alex Thomo, S. Venkatesh, Sue Whitesides

401 - 412

Reverse Top-k Search using Random Walk with Restart

Adams Wei Yu, Nikos Mamoulis, Hao Su

413 - 424

Write-limited sorts and joins for persistent memory

Stratis D. Viglas

425 - 428

Folk-IS: Opportunistic Data Services in Least Developed Countries

N. Anciaux, L. Bouganim, T. Delot, S. Ilarri, L. Kloul, N. Mitton, P. Pucheral

Volume 7, No. 6

Graham Cormode: Front Matter i - ix

429 - 440

Shared Workload Optimization

Georgios Giannikis, Darko Makreshanski, Gustavo Alonso, Donald Kossmann

441 - 452

Scalable and Adaptive Online Joins

Mohammed Elseidy, Abdallah Elguindy, Aleksandar Vitorovic, Christoph Koch

453 - 456

Support the Data Enthusiast: Challenges for Next-Generation Data-Analysis Systems

Kristi Morton, Magdalena Balazinska, Dan Grossman, Jock Mackinlay

457 - 468

A Provenance Framework for Data-Dependent Process Analysis

Daniel Deutch, Yuval Moskovitch, Val Tannen

469 - 480

Tracking Entities in the Dynamic World: A Fast Algorithm for Matching Temporal Records

Yueh-Hsuan Chiang, AnHai Doan, Jeffrey F. Naughton

481 - 492

Edelweiss: Automatic Storage Reclamation for Distributed Programming

Neil Conway, Peter Alvaro, Emily Andrews, Joseph M. Hellerstein

Volume 7, No. 7

Chen Li and Volker Markl: Front Matter i - ix

493 - 504

Rank Join Queries in NoSQL Databases

Nikos Ntarmos, Ioannis Patlakas, Peter Triantafillou

505 - 516

Biperpedia: An Ontology for Search Applications

Rahul Gupta, Alon Halevy, Xuezhi Wang, Steven Euijong Whang, Fei Wu

517 - 528

GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph

Mohammed Elseidy, Ehab Abdelhamid, Spiros Skiadopoulos, Panos Kalnis

529 - 540

Lightweight Indexing of Observational Data in Log-Structured Storage

Sheng Wang, David Maier, Beng Chin Ooi

541 - 552

epiC: an Extensible and Scalable System for Processing Big Data

Dawei Jiang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Sai Wu

553 - 564

Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML

Matthias Boehm, Shirish Tatikonda, Berthold Reinwald, Prithviraj Sen, Yuanyuan Tian, Douglas R. Burdick, Shivakumar Vaithyanathan

565 - 576

Schemaless and Structureless Graph Querying

Shengqi Yang, Yinghui Wu, Huan Sun, Xifeng Yan

577 - 588

Optimizing Graph Algorithms on Pregel-like Systems

Semih Salihoglu, Jennifer Widom

589 - 600

Toward Computational Fact-Checking

You Wu, Pankaj K. Agarwal, Chengkai Li, Jun Yang, Cong Yu

Volume 7, No. 8

Divesh Srivastava: Front Matter i - ix

601 - 612

A Principled Approach to Bridging the Gap between Graph Data and their Schemas

Marcelo Arenas, Gonzalo Diaz, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas

613 - 624

An Efficient Publish/Subscribe Index for ECommerce Databases

Dongxiang Zhang, Chee-Yong Chan, Kian-Lee Tan

625 - 636

String Similarity Joins: An Experimental Evaluation

Yu Jiang, Guoliang Li, Jianhua Feng, Wen-Syan Li

637 - 648

Calibrating Data to Sensitivity in Private Data Analysis

Davide Proserpio, Sharon Goldberg, Frank McSherry

649 - 660

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders

Wei Wang, Beng Chin Ooi, Xiaoyan Yang, Dongxiang Zhang, Yueting Zhuang

Volume 7, No. 9

H. V. Jagadish: Front Matter i - x

661 - 672

PRESS: A Novel Framework of Trajectory Compression in Road Networks

Renchu Song, Weiwei Sun, Baihua Zheng, Yu Zheng

673 - 684

Finding the Cost-Optimal Path with Time Constraint over Time-Dependent Graphs

Yajun Yang, Hong Gao, Jeffrey Xu Yu, Jianzhong Li

685 - 696

Optimal Crowd-Powered Rating and Filtering Algorithms

Aditya Parameswaran, Stephen Boyd, Hector Garcia-Molina, Ashish Gupta, Neoklis Polyzotis, Jennifer Widom

697 - 708

Incremental Record Linkage

Anja Gruenheid, Xin Luna Dong, Divesh Srivastava

709 - 720

Low-Latency Handshake Join

Pratanu Roy, Jens Teubner, Rainer Gemulla

721 - 732

Path Problems in Temporal Graphs

Huanhuan Wu, James Cheng, Silu Huang, Yiping Ke, Yi Lu, Yanyan Xu

733 - 744

Retrieving Regions of Interest for User Exploration

Xin Cao, Gao Cong, Christian S. Jensen, Man Lung Yiu

745 - 756

SK-LSH: An Efficient Index Structure for Approximate Nearest Neighbor Search

Yingfan Liu, Jiangtao Cui, Zi Huang, Hui Li, Heng Tao Shen

757 - 768

On Arbitrage-free Pricing for General Data Queries

Bing-Rong Lin, Daniel Kifer

769 - 780

Splitter: Mining Fine-Grained Sequential Patterns in Semantic Trajectories

Chao Zhang, Jiawei Han, Lidan Shou, Jiajun Lu, Thomas La Porta

781 - 784

Towards Building Wind Tunnels for Data Center Design

Avrilia Floratou, Frank Bertsch, Jignesh M. Patel, Georgios Laskaris

Volume 7, No. 10

Sharad Mehrotra: Front Matter i - xi

785 - 796

Reverse k-Ranks Query

Zhao Zhang, Cheqing Jin, Qiangqiang Kang

797 - 808

M4: A Visualization-Oriented Time Series Data Aggregation

Uwe Jugel, Zbigniew Jerzak, Gregor Hackenbroich, Volker Markl

809 - 820

Continuous Matrix Approximation on Distributed Data

Mina Ghashami, Jeff M. Phillips, Feifei Li

821 - 832

An Evaluation of the Advantages and Disadvantages of Deterministic Database Systems

Kun Ren, Alexander Thomson, Daniel J. Abadi

833 - 836

Efficient In-memory Data Management: An Analysis

Hao Zhang, Bogdan Marius Tudor, Gang Chen, Beng Chin Ooi

837 - 840

Workload Matters: Why RDF Databases Need a New Design

Gunes Aluc, M. Tamer Özsu, Khuzaima Daudjee

841 - 852

Storage Management in AsterixDB

Sattam Alsubaiee, Alexander Behm, Vinayak Borkar, Zachary Heilbron, Young-Seok Kim, Michael J. Carey, Markus Dreseler, Chen Li

853 - 864

Building Efficient Query Engines in a High-Level Language

Yannis Klonatos, Christoph Koch, Tiark Rompf, Hassan Chafi

865 - 876

Scalable Logging through Emerging Non-Volatile Memory

Tianzheng Wang, Ryan Johnson

877 - 880

When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities

Bingsheng He

881 - 892

From Data Fusion to Knowledge Fusion

Xin Luna Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Kevin Murphy, Shaohua Sun, Wei Zhang

893 - 902

On k-Path Covers and their Applications

Stefan Funke, Andre Nusser, Sabine Storandt

903 - 906

The Case for Data Visualization Management Systems

Eugene Wu, Leilani Battle, Samuel R. Madden

907 - 918

WideTable: An Accelerator for Analytical Data Processing

Yinan Li, Jignesh M. Patel

919 - 930

A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing

Hien To, Gabriel Ghinita, Cyrus Shahabi

Volume 7, No. 11

Lidan Shou: Front Matter i - ix

931 - 942

Trekking Through Siberia: Managing Cold Data in a Memory-Optimized Database

Ahmed Eldawy, Justin Levandoski, Per-√Öke Larson

943 - 946

The Case for Personal Data-Driven Decision Making

Jennie Duggan

947 - 958

ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases

Prima Chairunnanda, Khuzaima Daudjee, M. Tamer Özsu

959 - 962

Υ-DB: Managing scientific hypotheses as uncertain data

Bernardo Goncalves, Fabio Porto

963 - 974

Ibex - An Intelligent Storage Engine with Support for Advanced SQL Off-loading

Louis Woods, Zsolt Istvan, Gustavo Alonso

975 - 986

NOMAD: Nonlocking, stOchastic Multi-machine algorithm for Asynchronous and Decentralized matrix completion

Hyokun Yun, Hsiang-Fu Yu, Cho-Jui Hsieh, S V N Vishwanathan, Inderjit Dhillon

987 - 998

Repairing Vertex Labels under Neighborhood Constraints

Shaoxu Song, Hong Cheng, Jeffrey Xu Yu, Lei Chen

999 - 1010

Progressive Approach to Relational Entity Resolution

Yasser Altowim, Dmitri V. Kalashnikov, Sharad Mehrotra

1011 - 1022

Concurrent Analytical Query Processing with GPUs

Kaibo Wang, Kai Zhang, Yuan Yuan, Siyuan Ma, Rubao Lee, Xiaoning Ding, Xiaodong Zhang

Volume 7, No. 12

H. V. Jagadish: Front Matter i - x

1023 - 1034

Computing Personalized PageRank Quickly by Exploiting Graph Structures

Takanori Maehara, Takuya Akiba, Yoichi Iwata, Ken-ichi Kawarabayashi

1035 - 1046

Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions

Marco Serafini, Essam Mansour, Ashraf Aboulnaga, Kenneth Salem, Taha Rafiq, Umar Farooq Minhas

1047 - 1058

An Experimental Comparison of Pregel-like Graph Processing Systems

Minyang Han, Khuzaima Daudjee, Khaled Ammar, M. Tamer Özsu, Xingfang Wang, Tianqi Jin

1059 - 1070

ClusterJoin: A Similarity Joins Framework using Map-Reduce

Akash Das Sarma, Yeye He, Surajit Chaudhuri

1071 - 1082

Crowdsourcing Algorithms for Entity Resolution

Norases Vesdapunt, Kedar Bellare, Nilesh Dalvi

1083 - 1094

Distributed Graph Simulation: Impossibility and Possibility

Wenfei Fan, Xin Wang, Yinghui Wu, Dong Deng

1095 - 1106

Code Generation for Efficient Query Processing in Managed Runtimes

Fabian Nagel, Gavin Bierman, Stratis D. Viglas

1107 - 1118

Aggregate Estimation Over Dynamic Hidden Web Databases

Weimo Liu, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das

1119 - 1130

Adaptive Query Processing on RAW Data

Manos Karpathiotakis, Miguel Branco, Ioannis Alagiannis, Anastasia Ailamaki

1131 - 1142

Storing and Querying Tree-Structured Records in Dremel

Foto N. Afrati, Dan Delorey, Mosha Pasumansky, Jeffrey D. Ullman

1143 - 1154

Similarity Search for Scientific Workflows

Johannes Starlinger, Bryan Brancotte, Sarah Cohen-Boulakia, Ulf Leser

1155 - 1166

Differentially Private Event Sequences over Infinite Streams

Georgios Kellaris, Stavros Papadopoulos, Xiaokui Xiao, Dimitris Papadias

1167 - 1178

Matching Titles with Cross Title Web-Search Enrichment and Community Detection

Nikhil Londhe, Vishrawas Gopalakrishnan, Aidong Zhang, Hung Q. Ngo, Rohini Srihari

1179 - 1190

On Concise Set of Relative Candidate Keys

Shaoxu Song, Lei Chen, Hong Cheng

1191 - 1202

Reachability Querying: An Independent Permutation Labeling Approach

Hao Wei, Jeffrey Xu Yu, Can Lu, Ruoming Jin

1203 - 1214

Hop Doubling Label Indexing for Point-to-Point Distance Querying on Scale-Free Networks

Minhao Jiang, Ada Wai-Chee Fu, Raymond Chi-Wing Wong, Yanyan Xu

1215 - 1218

Semantic Culturomics (vision paper)

Fabian M. Suchanek, Nicoleta Preda

1219 - 1230

Benchmarking Scalability and Elasticity of Distributed Database Systems

Jörn Kuhlenkamp, Markus Klems, Oliver Röss

1231 - 1242

Bounded Conjunctive Queries

Yang Cao, Wenfei Fan, Tianyu Wo, Wenyuan Yu

1243 - 1254

Optimizing Join Enumeration in Transformation-based Query Optimizers

Anil Shanbhag, S. Sudarshan

1255 - 1258

A System for Management and Analysis of Preference Data

Marie Jacob, Benny Kimelfeld, Julia Stoyanovich

1259 - 1270

Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing

Ashish Gupta, Fan Yang, Jason Govig, Adam Kirsch, Kelvin Chan, Kevin Lai, Shuo Wu, Sandeep Govind Dhoot, Abhilash Rajesh Kumar, Ankur Agiwal, Sanjay Bhansali, Mingsheng Hong, Jamie Cameron, Masood Siddiqi, David Jones, Jeff Shute, Andrey Gubarev, Shivakumar Venkataraman, Divyakant Agrawal

1271 - 1282

An Effective Encoding Scheme for Spatial RDF Data

John Liagouris, Nikos Mamoulis, Panagiotis Bouros, Manolis Terrovitis

1283 - 1294

DimmWitted: A Study of Main-Memory Statistical Analytics

Ce Zhang, Christopher Re

1295 - 1306

SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures

Avrilia Floratou, Umar Farooq Minhas, Fatma Özcan

1307 - 1318

Optimal Security-Aware Query Processing

Marco Guarnieri, David Basin

Volume 7, No. 13

H. V. Jagadish: Front Matter i - xiv

1319 - 1330

MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs

Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, and Chen Wang

1331 - 1342

Reducing Database Locking Contention Through Multi-version Concurrency

Mohammad Sadoghi, Mustafa Canim, Bishwaranjan Bhattacharjee, Fabian Nagel, Kenneth A. Ross

1343 - 1354

Changing Engines in Midstream: A Java Stream Computational Model for Big Data Processing

Xueyuan Su, Garret Swart, Brian Goetz, Brian Oliver, Paul Sandoz

1355 - 1366

Joins on Encoded and Partitioned Data

Jae-Gil Lee, Gopi Attaluri, Ronald Barber, Naresh Chainani, Oliver Draese, Frederick Ho, Stratos Idreos, Min-Soo Kim, Sam Lightstone, Guy Lohman, Konstantinos Morfonios, Keshava Murthy, Ippokratis Pandis, Lin Qiao, Vijayshankar Raman, Vincent Kulandai Samy, Richard Sidle, Knut Stolz, Liping Zhang

1367 - 1378

TPC-DI: The First Industry Benchmark for Data Integration

Meikel Poess, Tilmann Rabl, Brian Caufield

1379 - 1380

Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs

Pankaj Gupta, Venu Satuluri, Ajeet Grewal, Siva Gurumurthy, Volodymyr Zhabiuk, Quannan Li, and Jimmy Lin

1381 - 1392

Interval Disaggregate: A New Operator for Business Planning

Sang K. Cha, Kunsoo Park, Changbin Song, Kihong Kim, Cheol Ryu, Sunho Lee

1393 - 1404

Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale

Zhuo Zhang, Chao Li, Yangyu Tao, Renyu Yangy, Hong Tang, Jie Xu

1405 - 1416

Large-Scale Graph Analytics in Aster 6: Bringing Context to Big Data Discovery

David Simmen, Karl Schnaitter, Jeff Davis, Yingjie He, Sangeet Lohariwala, Ajay Mysore, Vinayak Shenoi, Mingfeng Tan, Yu Xiao

1417 - 1428

Fast Foreign-Key Detection in Microsoft SQL Server PowerPivot for Excel

Zhimin Chen, Vivek Narasayya, Surajit Chaudhuri

1429 - 1440

Big Data Small Footprint: The Design of A Low-Power Classifier for Detecting Transportation Modes

Meng-Chieh Yu, Tong Yu, Shao-Chen Wang, Chih-Jen Lin, Edward Y. Chang

1441 - 1451

Summingbird: A Framework for Integrating Batch and Online MapReduce Computations

Oscar Boykin, Sam Ritchie, Ian O'Connell, Jimmy Lin

1452 - 1461

Of Snowstorms and Bushy Trees

Rafi Ahmed, Rajkumar Sen, Meikel Poess, Sunil Chakkappen

1462 - 1473

Execution Primitives for Scalable Joins and Aggregations in Map Reduce

Srinivas Vemuri, Maneesh Varshney, Krishna Puttaswamy, Rui Liu

1474 - 1483

CAP Limits in Telecom Subscriber Database Design

Javier Arauz

1484 - 1495

Advanced Join Strategies for Large-Scale Distributed Computation

Nicolas Bruno, YongChul Kwon, Ming-Chuan Wu

1496 - 1507

DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index

Yue Liu, Songlin Hu, Tilmann Rabl, Wantao Liu, Hans-Arno Jacobsen, Kaifeng Wu, Jian Chen, Jintao Li

1508 - 1519

Error-bounded Sampling for Analytics on Big Sparse Data

Ying Yan, Liang Jeff Chen, Zheng Zhang

1520 - 1528

Indexing HDFS Data in PDW: Splitting the data from the index

Vinitha Reddy Gankidi, Nikhil Teletia, Jignesh M. Patel, Alan Halverson, David J. DeWitt

1529 - 1540

Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing

Chong Sun, Narasimhan Rampalli, Frank Yang, AnHai Doan

1541 - 1544

Interactive Join Query Inference with JIM

Angela Bonifati, Radu Ciucanu, Slawek Staworko

1545 - 1548

MESA: A Map Service to Support Fuzzy Type-ahead Search over Geo-Textual Data

Yuxin Zheng, Zhifeng Bao, Lidan Shou, Anthony K. H. Tung

1549 - 1552

R3: A Real-Time Route Recommendation System

Henan Wang, Guoliang Li, Huiqi Hu, Shuo Chen, Bingwen Shen, Hao Wu, Wen-Syan Li, Kian-Lee Tan

1553 - 1556

PDQ: Proof-driven Query Answering over Web-based Data

Michael Benedikt, Julien Leblay, Efthymia Tsamoura

1557 - 1560

Data In, Fact Out: Automated Monitoring of Facts by FactWatcher

Naeemul Hassan, Afroza Sultana, You Wu, Gensheng Zhang, Chengkai Li, Jun Yang, Cong Yu

1561 - 1564

OceanST: A Distributed Analytic System for Large-Scale Spatiotemporal Mobile Broadband Data

Mingxuan Yuan, Ke Deng, Jia Zeng, Yanhua Li, Bing Ni, Xiuqiang He, Fei Wang, Wenyuan Dai, Qiang Yang

1565 - 1568

That's All Folks! LLUNATIC Goes Open Source

Floris Geerts, Giansalvatore Mecca, Paolo Papotti, Donatello Santoro

1569 - 1572

HDBTracker: Monitoring the Aggregates On Dynamic Hidden Web Databases

Weimo Liu, Saad Bin Suhaim, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das, Ali Jaoua

1573 - 1576

BSMA: A Benchmark for Analytical Queries over Social Media Data

Fan Xia, Ye Li, Chengcheng Yu, Haixin Ma, Weining Qian

1577 - 1580

Graph-based Data Integration and Business Intelligence with BIIIG

Andre Petermann, Martin Junghanns, Robert Muller, Erhard Rahm

1581 - 1584

SEEDB: Automatically Generating Query Visualizations

Manasi Vartak, Samuel Madden, Aditya Parameswaran, Neoklis Polyzotis

1585 - 1588

QUEST: An Exploratory Approach to Robust Query Processing

Anshuman Dutt, Sumit Neelam, Jayant R. Haritsa

1589 - 1592

Redoop Infrastructure for Recurring Big Data Queries

Chuan Lei, Zhongfang Zhuang, Elke A. Rundensteiner, Mohamed Y. Eltabakh

1593 - 1596

PackageBuilder: From Tuples to Packages

Matteo Brucato, Rahul Ramakrishna, Azza Abouzied, Alexandra Meliou

1597 - 1600

Ontology Assisted Crowd Mining

Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit Somech

1601 - 1604

SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe

Lisi Chen, Yan Cui, Gao Cong, Xin Cao

1605 - 1608

MLJ: Language-Independent Real-Time Search of Tweets Reported by Media Outlets and Journalists

Masumi Shirakawa, Takahiro Hara, Shojiro Nishio

1609 - 1612

Ocelot/HyPE: Optimized Data Processing on Heterogeneous Hardware

Sebastian Bress, Max Heimel, Michael Saecker, Bastian Kocher, Volker Markl, Gunter Saake

1613 - 1616

MoveMine 2.0: Mining Object Relationships from Movement Data

Fei Wu, Tobias Kin Hou Lei, Zhenhui Li, Jiawei Han

1617 - 1620

A Partitioning Framework for Aggressive Data Skipping

Liwen Sun, Sanjay Krishnan, Reynold S. Xin, Michael J. Franklin

1621 - 1624

Interactive Outlier Exploration in Big Data Streams

Lei Cao, Qingyang Wang, Elke A. Rundensteiner

1625 - 1628

SQL/AA: Executing SQL on an Asymmetric Architecture

Quoc-Cuong To, Benjamin Nguyen, Philippe Pucheral

1629 - 1632

gMission: A General Spatial Crowdsourcing Platform

Zhao Chen, Rui Fu, Ziyuan Zhao, Zheng Liu, Leihao Xia, Lei Chen, Peng Cheng, Caleb Chen Cao, Yongxin Tong, Chen Jason Zhang

1633 - 1636

S-Store: A Streaming NewSQL System for Big Velocity Applications

Ugur Cetintemel, Jiang Du, Tim Kraska, Samuel Madden, David Maier, John Meehan, Andrew Pavlo, Michael Stonebraker, Erik Sutherland, Nesime Tatbul, Kristin Tufte, Hao Wang, Stanley Zdonik

1637 - 1640

CLEar: A Real-time Online Observatory for Bursty and Viral Events

Runquan Xie, Feida Zhu, Hui Ma, Wei Xie, Chen Lin

1641 - 1644

AZDBLab: A Laboratory Information System for Large-Scale Empirical DBMS Studies

Young-Kyoon Suh, Richard T. Snodgrass, Rui Zhang

1645 - 1648

Terrain-Toolkit: A Multi-Functional Tool for Terrain Data

Qi Wang, Manohar Kaul, Cheng Long, Raymond Chi-Wing Wong

1649 - 1652

FORWARD: Data-Centric UIs using Declarative Templates that Efficiently Wrap Third-Party JavaScript Components

Yupeng Fu, Kian Win Ong, Yannis Papakonstantinou, Erick Zamora

1653 - 1656

SPIRE: Supporting Parameter-Driven Interactive Rule Mining and Exploration

Xika Lin, Abhishek Mukherji, Elke A. Rundensteiner, Matthew O. Ward

1657 - 1660

An Integrated Development Environment for Faster Feature Engineering

Michael R. Anderson, Michael Cafarella, Yixing Jiang, Guan Wang, Bochun Zhang

1661 - 1664

Pronto: A Software-Defined Networking based System for Performance Management of Analytical Queries on Distributed Data Stores

Pengcheng Xiong, Hakan Hacigumus

1665 - 1668

Getting Your Big Data Priorities Straight: A Demonstration of Priority-based QoS using Social-network-driven Stock Recommendation

Rui Zhang, Reshu Jain, Prasenjit Sarkar, Lukas Rupprecht

1669 - 1672

VERTEXICA: Your Relational Friend for Graph Analytics!

Alekh Jindal, Praynaa Rawlani, Eugene Wu, Samuel Madden, Amol Deshpande, Mike Stonebraker

1673 - 1676

NScale: Neighborhood-centric Analytics on Large Graphs

Abdul Quamar, Amol Deshpande, Jimmy Lin

1677 - 1680

DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing

Haoran Li, Li Xiong, Lifan Zhang, Xiaoqian Jiang

1681 - 1684

SPOT: Locating Social Media Users Based on Social Network Context

Longbo Kong, Zhi Liu, Yan Huang

1685 - 1688

RASP-QS: Efficient and Confidential Query Services in the Cloud

Zohreh Alavi, Lu Zhou, James Powers, Keke Chen

1689 - 1692

Thoth: Towards Managing a Multi-System Cluster

Mayuresh Kunjir, Prajakta Kalmegh, Shivnath Babu

1693 - 1696

X-LiSA: Cross-lingual Semantic Annotation

Lei Zhang, Achim Rettinger

1697 - 1700

Combining User Interaction, Speculative Query Execution and Sampling in the DICE System

Prasanth Jayachandran, Karthik Tunga, Niranjan Kamat, Arnab Nandi

1701 - 1704

STMaker - A System to Make Sense of Trajectory Data

Han Su, Kai Zheng, Kai Zeng, Jiamin Huang, Xiaofang Zhou

1705 - 1708

Faster Visual Analytics through Pixel-Perfect Aggregation

Uwe Jugel, Zbigniew Jerzak, Gregor Hackenbroich, Volker Markl

1709 - 1710

Systems for Big-Graphs

Arijit Khan, Sameh Elnikety

1711 - 1712

Tutorial: Uncertain Entity Resolution

Avigdor Gal

1713 - 1714

Knowledge Bases in the Age of Big Data Analytics

Fabian M. Suchanek, Gerhard Weikum

1715 - 1716

Causality and Explanations in Databases

Alexandra Meliou, Sudeepa Roy, Dan Suciu

1717 - 1718

Enterprise Search in the Big Data Era: Recent Developments and Open Challenges

Yunyao Li, Ziyang Liu, Huaiyu Zhu

1719 - 1719

VLDB 2014 Ph.D. Workshop - An Overview

Yunyao Li, Erich Neuhold

1720 - 1721

Datacenters as Computers: Google Engineering & Database Research Perspectives

Shivakumar Venkataraman, Divyakant Agrawal

1722 - 1729

The Impact of Columnar In-Memory Databases on Enterprise Systems

Hasso Plattner

1730 - 1733

Breaking the Chains: On Declarative Data Analysis and Data Independence in the Big Data Era

Volker Markl

1734 - 1741

Engineering High-Performance Database Engines

Thomas Neumann

1742 - 1747

Realization of the Low Cost and High Performance MySQL Cloud Database

Wei Cao, Feng Yu, Jiasen Xie

1748 - 1753

Fatman: Cost-saving and reliable archival storage based on volunteer resources

An Qin, Dianming Hu, Jun Liu, Wenjun Yang, Dai Tan

1754 - 1759

Design and Implementation of a Real-Time Interactive Analytics System for Large Spatio-Temporal Data

Shiming Zhang, Yin Yang, Wei Fan, Marianne Winslet

1760 - 1765

A Personalized Recommendation System for NetEase Dating Site

Chaoyue Dai, Feng Qian, Wei Jiang, Zhoutian Wang, Zenghong Wu

1766 - 1771

GEMINI: An Integrative Healthcare Analytics System

Zheng Jye Ling, Quoc Trung Tran, Ju Fan, Gerald C.H. Koh, Thi Nguyen, Chuen Seng Tan, James W. L. Yip, Meihui Zhang

1772 - 1777

Mariana: Tencent Deep Learning Platform and its Applications

Yongqiang Zou, Xing Jin, Yi Li, Zhimao Guo, Eryu Wang, Bin Xiao

1778 - 1783

yzBigData: Provisioning Customizable Solution for Big Data

Sai Wu, Gang Chen, Ke Chen, Lidan Shou, Hui Cao, He Bai

1784 - 1784

Errata for "Building Efficient Query Engines in a High-Level Language" (PVLDB 7(10): 853-864)

Yannis Klonatos, Christoph Koch, Tiark Rompf, Hassan Chafi

Volume 7, No. 14

Li Xiong and Cong Yu: Front Matter i - x

1785 - 1796

Show Me the Money: Dynamic Recommendations for Revenue Maximization

Wei Lu, Shanshan Chen, Keqian Li, Laks V.S. Lakshmanan

1797 - 1808

ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems [Innovative Systems Paper]

Peng Lu, Gang Chen, Beng Chin Ooi, Hoang Tam Vo, Sai Wu

1809 - 1820

Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers

Mohan Yang, Bolin Ding, Surajit Chaudhuri, Kaushik Chakrabarti

1821 - 1832

Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees

Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, Yingyi Bu

1833 - 1844

Auto-Approximation of Graph Computing

Zechao Shang, Jeffrey Xu Yu

1845 - 1856

DIADEM: Thousands of Websites to a Single Database

Tim Furche, Georg Gottlob, Giovanni Grasso, Xiaonan Guo, Giorgio Orsi, Christian Schallhart, Cheng Wang

1857 - 1868

Uncertainty Aware Query Execution Time Prediction

Wentao Wu, Xi Wu, Hakan Hacigumus, Jeffrey F. Naughton

1869 - 1880

Optimizing the Chase: Scalable Data Integration under Constraints

George Konstantinidis, Jose Luis Ambite

1881 - 1892

BF-Tree: Approximate Tree Indexing

Manos Athanassoulis, Anastasia Ailamaki

1893 - 1904

ADDICT: Advanced Instruction Chasing for Transactions

Pinar Tozun, Islam Atta, Anastasia Ailamaki, Andreas Moshovos

1905 - 1916

AsterixDB: A Scalable, Open Source BDMS

Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak Borkar, Yingyi Bu, Michael Carey, Inci Cetindil, Madhusudan Cheelangi, Khurram Faraaz, Eugenia Gabrielova, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Guangqiang Li, Ji Mahn Ok, Nicola Onose, Pouria Pirzadeh, Vassilis Tsotras, Rares Vernica, Jian Wen, Till Westmann

1917 - 1928

LogGP: A Log-based Dynamic Graph Partitioning Method

Ning Xu, Lei Chen, Bin Cui

1929 - 1940

Supervised Meta-blocking

George Papadakis, George Papastefanatos, Georgia Koutrika

1941 - 1952

Generating Top-k Packages via Preference Elicitation

Min Xie, Laks V.S. Lakshmanan, Peter T. Wood

1953 - 1964

Fast Range Query Processing with Strong Privacy Protection for Cloud Computing

Rui Li, Alex X. Liu, Ann L. Wang, Bezawada Bruhadeshwar

1965 - 1976

Finish Them!: Pricing Algorithms for Human Computation

Yihan Gao, Aditya Parameswaran

1977 - 1980

TransactiveDB: Tapping into Collective Human Memories

Michele Catasta, Alberto Tonon, Djellel Eddine Difallah, Gianluca Demartini, Karl Aberer, and Philippe Cudre-Mauroux

1981 - 1992

Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs

Da Yan, James Cheng, Yi Lu, Wilfred Ng

1993 - 2004

Efficient Identification of Implicit Facts in Incomplete OWL2-EL Knowledge Bases

John Liagouris, Manolis Terrovitis

2005 - 2016

Where To: Crowd-Aided Path Selection

Chen Jason Zhang, Yongxin Tong, Lei Chen

2017 - 2028

Large Scale Real-time Ridesharing with Service Guarantee on Road Networks

Yan Huang, Favyen Bastani, Ruoming Jin, Xiaoyang Sean Wang

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy