Volume 8, 2014-2015

Editor-in-Chief:
Chen Li and Volker Markl
Founding Editor-in-Chief:
H. V. Jagadish
Managing Editor:
Divesh Srivastava
Information Director:
Gerald Weber
Advisory Committee:
H. V. Jagadish, Renée J. Miller, M. Tamer Äzsu, Kian-Lee Tan, Michael Böehlen, Susan Davidson, S. Sudarshan, Gerhard Weikum
Associate Editors:
Kevin Chang, Shivnath Babu, Magdalena Balazinska, Felix Naumann, Stefan Manegold, Yi Chen, Fatma Ozcan, Jignesh Patel, Rainer Gemulla
Review Board:

Volume 8, No. 1

Chen Li and Volker Markl: Front Matter i - ix

1 - 12

SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index

Yifang Sun, Wei Wang, Jianbin Qin, Ying Zhang, Xuemin Lin

13 - 24

Top-k Nearest Neighbor Search In Uncertain Data Series

Michele Dallachiesa, Themis Palpanas, Ihab F. Ilyas

25 - 36

Resource Bricolage for Parallel Database Systems

Jiexing Li, Jeffrey Naughton, Rimma V. Nehme

37 - 48

In-Memory Performance for Big Data

Goetz Graefe, Haris Volos, Hideaki Kimura, Harumi Kuno, Joseph Tucek, Mark Lillibridge, Alistair Veitch

49 - 60

Trajectory Simplification: On Minimizing the Direction-based Error

Cheng Long, Raymond Chi-Wing Wong, H. V. Jagadish

61 - 72

Interpretable and Informative Explanations of Outcomes

Kareem El Gebaly, Parag Agrawal, Lukasz Golab, Flip Korn, Divesh Srivastava

73 - 84

Constructing an Interactive Natural Language Interface for Relational Databases

Fei Li, H. V. Jagadish

85 - 96

Leveraging Graph Dimensions in Online Graph Search

Yuanyuan Zhu, Jeffrey Xu Yu, Lu Qin

97 - 100

Spatial Joins in Main Memory: Implementation Matters!

Darius Šidlauskas, Christian S. Jensen

Volume 8, No. 2

Kevin C. Chang: Front Matter i - ix

101 - 112

Selectivity Estimation on Streaming Spatio-Textual Data Using Local Correlations

Xiaoyang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Wei Wang

113 - 124

Processing Moving kNN Queries Using Influential Neighbor Sets

Chuanwen Li, Yu Gu, Jianzhong Qi, Ge Yu, Rui Zhang, Wang Yi

125 - 136

Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning

Barzan Mozafari, Purna Sarkar, Michael Franklin, Michael Jordan, Samuel Madden

137 - 148

CANDS: Continuous Optimal Navigation via Distributed Stream Processing

Dingyu Yang, Dongxiang Zhang, Kian-Lee Tan, Jian Cao, Frédéric Le Mouël

149 - 160

Rare Time Series Motif Discovery from Unbounded Streams

Nurjahan Begum, Eamonn Keogh

161 - 172

Pregelix: Big(ger) Graph Analytics on a Dataflow Engine

Yingyi Bu, Vinayak Borkar, Jianfeng Jia, Michael J. Carey, Tyson Condie

173 - 184

Profiling R on a Contemporary Processor

Shriram Sridharan, Jignesh M. Patel

Volume 8, No. 3

Divesh Srivastava and S. Sudarshan: Front Matter i - ix

185 - 196

Coordination Avoidance in Database Systems

Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

197 - 208

QuickFOIL: Scalable Inductive Logic Programming

Qiang Zeng, Jignesh M. Patel, David Page

209 - 220

Staring into the Abyss: An Evaluation of Concurrency Control with One Thousand Cores

Xiangyao Yu, George Bezerra, Andrew Pavlo, Srinivas Devadas, Michael Stronebraker

221 - 232

Multi-Objective Parametric Query Optimization

Immanuel Trummer, Christoph Koch

233 - 244

Deployment of Query Plans on Multicores

Jana Giceva, Gustavo Alonso, Timothy Roscoe, Tim Harris

245 - 256

E-Store: Fine-Grained Elastic Partitioning for Distributed Transaction Processing

Rebecca Taft, Essam Mansour, Marco Serafini, Jennie Duggan, Aaron J. Elmore, Ashraf Aboulnaga, Andrew Pavlo, Michael Stonebraker

257 - 268

Beyond Itemsets: Mining Frequent Featuresets over Structured Items

Saravanan Thirumuruganathan, Habibur Rahman, Sofiane Abbar, Gautam Das

269 - 280

Inferring Continuous Dynamic Social Influence and Personal Preference for Temporal Behavior Prediction

Jun Zhang, Chaokun Wang, Jianmin Wang, Jeffrey Xu Yu

281 - 292

Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation

Yi Lu, James Cheng, Da Yan, Huanhuan Wu

293 - 304

Faster Set Intersection with SIMD instructions by Reducing Branch Mispredictions

Hiroshi Inoue, Moriyoshi Ohara, Kenjiro Taura

305 - 316

Scalable Topical Phrase Mining from Text Corpora

Ahmed El-Kishky, Yanglei Song, Chi Wang, Clare R. Voss, Jiawei Han

317 - 328

Efficient Top-K SimRank-based Similarity Join

Wenbo Tao, Minghe Yu, Guoliang Li

Volume 8, No. 4

Shivnath Babu: Front Matter i - ix

329 - 340

In-Cache Query Co-Processing on Coupled CPU-GPU Architectures

Jiong He, Shuhao Zhang, Bingsheng He

341 - 352

Scaling Manifold Ranking Based Image Retrieval

Yasuhiro Fujiwara, Go Irie, Shari Kuroyama, Makoto Onizuka

353 - 364

Memory-Efficient Hash Joins

R. Barber, G. Lohman, I. Pandis, V. Raman, R. Sidle, G. Attaluri, N. Chainani, S. Lightstone, D. Sharpe

365 - 376

Preference-aware Integration of Temporal Data

Bogdan Alexe, Mary Roth, Wang-Chiew Tan

377 - 388

MOCgraph: Scalable Distributed Graph Processing Using Message Online Computing

Chang Zhou, Jun Gao, Binbin Sun, Jeffrey Xu Yu

389 - 400

NVRAM-aware Logging in Transaction Systems

Jian Huang, Karsten Schwan, Moinuddin K. Qureshi

401 - 412

Trill: A High-Performance Incremental Query Processor for Diverse Analytics

Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, John C. Platt, James F. Terwilliger, John Wernsing

413 - 424

Event Pattern Matching over Graph Streams

Chunyao Song, Tingjian Ge, Cindy Chen, Jie Wang

425 - 436

A Confidence-Aware Approach for Truth Discovery on Long-Tail Data

Qi Li, Yaliang Li, Jing Gao, Lu Su, Bo Zhao, Murat Demirbas, Wei Fan, Jiawei Han

437 - 448

Fast Failure Recovery in Distributed Graph Processing Systems

Yanyan Shen, Gang Chen, H. V. Jagadish, Wei Lu, Beng Chin Ooi, Bogdan Marius Tudor

449 - 460

The More the Merrier: Efficient Multi-Source Graph Traversal

Manuel Then, Moritz Kaufmann, Fernando Chirigati, Tuan-Anh Hoang-Vu, Kien Pham, Alfons Kemper, Thomas Neumann, Huy T. Vo

Volume 8, No. 5

Magdalena Balazinska: Front Matter i - ix

461 - 472

MRCSI: Compressing and Searching String Collections with Multiple References

Sebastian Wandelt, Ulf Leser

473 - 484

YADING: Fast Clustering of Large-Scale Time Series Data

Rui Ding, Qiang Wang, Yingnong Dang, Qiang Fu, Haidong Zhang, Dongmei Zhang

485 - 496

Hear the Whole Story: Towards the Diversity of Opinion in Crowdsourcing Markets

Ting Wu, Lei Chen, Pan Hui, Chen Jason Zhang, Weikai Li

497 - 508

REWIND: Recovery Write-Ahead System for In-Memory Non-Volatile Data-Structures

Andreas Chatzistergiou, Marcelo Cintra, Stratis D. Viglas

509 - 520

Influential Community Search in Large Networks

Rong-Hua Li, Lu Qin, Jeffrey Xu Yu, Rui Mao

521 - 532

Rapid Sampling for Visualizations with Ordering Guarantees

Albert Kim, Eric Blais, Aditya Parameswaran, Piotr Indyk, Sam Madden, Ronitt Rubinfeld

533 - 544

Optimal Enumeration: Efficient Top-k Tree Matching

Lijun Chang, Xuemin Lin, Wenjie Zhang, Jeffrey Xu Yu, Ying Zhang, Lu Qin

545 - 556

Monitoring Distributed Streams using Convex Decompositions

Arnon Lazerson, Izchak Sharfman, Daniel Keren, Assaf Schuster, Minos Garofalakis, Vasilis Samoladas

557 - 568

UDA-GIST: An In-database Framework to Unify Data-Parallel and State-Parallel Analytics

Kun Li, Daisy Zhe Wang, Alin Dobra, Christopher Dudley

569 - 580

Efficient Partial-Pairs SimRank Search for Large Networks

Weiren Yu, Julie A. McCann

581 - 592

Linearized and Single-Pass Belief Propagation

Wolfgang Gatterbauer, Stephan Günnemann, Danai Koutra, Christos Faloutsos

593 - 604

Mining Revenue-Maximizing Bundling Configuration

Loc Do, Hady W. Lauw, Ke Wang

605 - 616

Reverse k Nearest Neighbors Query Processing: Experiments and Analysis

Shiyu Yang, Muhammad Aamir Cheema, Xuemin Lin, Wei Wang

617 - 628

Exploiting Vertex Relationships in Speeding up Subgraph Isomorphism over Large Graphs

Xuguang Ren, Junhu Wang

629 - 640

Approximate Lifted Inference with Probabilistic Databases

Wolfgang Gatterbauer, Dan Suciu

641 - 652

Errata for “Crowdsourcing Algorithms for Entity Resolution” (PVLDB 7(12):1071-1082)

Norases Vesdapunt, Kedar Bellare, Nilesh Dalvi

Volume 8, No. 6

Felix Naumann: Front Matter i - ix

642 - 653

Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach

Saurabh Jha, Bingsheng He, Mian Lu, Xuntao Cheng, Huynh Phung Huynh

654 - 665

DREAM: Distributed RDF Engine with Adaptive Query Planner and Minimal Communication

Mohammad Hammoud, Dania Abed Rabbou, Reza Nouri, Seyed-Mehdi-Reza Beheshti, Sherif Sakr

666 - 677

Online Topic-Aware Influence Maximization

Shuo Chen, Ju Fan, Guoliang Li, Jianhua Feng, Kian-Iee Tan, Jinhui Tang

678 - 689

Walk, Not Wait: Faster Sampling Over Online Social Networks

Azade Nazi, Zhuojie Zhou, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das

690 - 701

Querying with Access Patterns and Integrity Constraints

Michael Benedikt, Julien Leblay, Efthymia Tsamoura

Volume 8, No. 7

Stefan Manegold: Front Matter i - ix

702 - 713

General Incremental Sliding-Window Aggregation

Kanat Tangwongsan, Martin Hirzel, Scott Schneider, Kun-Lung Wu

714 - 725

Shared Execution of Recurring Workloads in MapReduce

Chuan Lei, Zhongfang Zhuang, Elke A. Rundensteiner, Mohamed Eltabakh

726 - 737

Sharing Buffer Pool Memory in Multi-Tenant Relational Database-as-a-Service

Vivek Narasayya, Ishai Menache, Mohit Singh, Feng Li, Manoj Syamala, Surajit Chaudhuri

738 - 749

Answering Why-not Questions on Reverse Top-k Queries

Yunjun Gao, Qing Liu, Gang Chen, Baihua Zheng, Linlin Zhou

750 - 761

Practical Authenticated Pattern Matching with Optimal Proof Size

Dimitrios Papadopoulos, Charalampos Papamanthou, Roberto Tamassia, Nikos Triandopoulos

762 - 773

A Performance Study of Big Data on Small Nodes

Dumitrel Loghin, Bogdan Marius Tudor, Hao Zhang, Beng Chin Ooi, Yong Meng Teo

774 - 785

Divide & Conquer-based Inclusion Dependency Discovery

Thorsten Papenbrock, Sebastian Kruse, Jorge-Arnulfo Quiané-Ruiz, Felix Naumann

786 - 797

Persistent B+-Trees in Non-Volatile Main Memory

Shimin Chen, Qin Jin

798 - 809

Robust Local Community Detection: On Free Rider Effect and Its Elimination

Yubao Wu, Ruoming Jin, Jing Li, Xiang Zhang

810 - 821

Understanding the Causes of Consistency Anomalies in Apache Cassandra

Hua Fan, Aditya Ramaraju, Marlon McKenzie, Wojciech Golab, Bernard Wong

822 - 833

Viral Marketing Meets Social Advertising: Ad Allocation with Minimum Regret

Cigdem Aslay, Wei Lu, Francesco Bonchi, Amit Goyal, Laks V.S. Lakshmanan

Volume 8, No. 8

Yi Chen: Front Matter i - ix

826 - 837

ALID: Scalable Dominant Cluster Detection

Lingyang Chu, Shuhui Wang, Siyuan Liu, Qingming Huang, Jian Pei

838 - 849

An Efficient Similarity Search Framework for SimRank over Large Dynamic Graphs

Yingxia Shao, Bin Cui, Lei Chen, Mingming Liu, Xing Xie

850 - 861

Compaction Management in Distributed Key-Value Datastores

Muhammad Yousuf Ahmad, Bettina Kemme

862 - 873

D2P: Distance-Based Differential Privacy in Recommenders

Rachid Guerraoui, Anne-Marie Kermarrec, Rhicheek Patra, Mahsa Taziki

874 - 885

FrogWild! – Fast PageRank Approximations on Graph Engines

Ioannis Mitliagkas, Michael Borokhovich, Alexandros G. Dimakis, Constantine Caramanis

886 - 897

Optimal Probabilistic Cache Stampede Prevention

Andrea Vattani, Flavio Chierichetti, Keegan Lowenstein

Volume 8, No. 9

Fatma ”¶zcan: Front Matter i - ix

898 - 909

DAQ: A New Paradigm for Approximate Query Processing

Navneet Potti, Jignesh M. Patel

910 - 921

A Scalable Search Engine for Mass Storage Smart Objects

Nicolas Anciaux, Saliha Lallali, Iulian Sandu Popa, Philippe Pucheral

922 - 933

Schema Management for Document Stores

Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi, Limei Jiao, Jia Zou, Chen Wang

934 - 937

On the Surprising Difficulty of Simple Things: the Case of Radix Partitioning

Felix Martin Schuhknecht, Pankaj Khanchandani, Jens Dittrich

938 - 949

Knowledge-Based Trust: Estimating the Trustworthiness of Web Sources

Xin Luna Dong, Evgeniy Gabrilovich, Kevin Murphy, Van Dang Wilko Horn, Camillo Lugaresi, Shaohua Sun, Wei Zhang

950 - 961

Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems

Minyang Han, Khuzaima Daudjee

962 - 973

Work-Efficient Parallel Skyline Computation for the GPU

Kenneth S. B√∏gh, Sean Chester, Ira Assent

Volume 8, No. 10

Jignesh M. Patel: Front Matter i - ix

974 - 985

Scalable Subgraph Enumeration in MapReduce

Longbin Lai, Lu Qin, Xuemin Lin, Lijun Chang

986 - 997

Indexing Highly Dynamic Hierarchical Data

Jan Finis, Robert Brunel, Alfons Kemper, Thomas Neumann, Norman May, Franz Faerber

998 - 1009

Community Detection in Social Networks: An In-depth Benchmarking Study with a Procedure-Oriented Framework

Meng Wang, Chaokun Wang, Jeffrey Xu Yu, Jun Zhang

1010 - 1021

Growing a Graph Matching from a Handful of Seeds

Ehsan Kazemi, S. Hamed Hassani, Matthias Grossglauser

1022 - 1033

Reliable Diversity-Based Spatial Crowdsourcing by Moving Workers

Peng Cheng, Xiang Lian, Zhao Chen, Rui Fu, Lei Chen, Jinsong Han, Jizhong Zhao

1034 - 1045

Leveraging History for Faster Sampling of Online Social Networks

Zhuojie Zhou, Nan Zhang, Gautam Das

1046 - 1057

TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems

Yufei Ding, Xipeng Shen, Madanlal Musuvathi, Todd Mytkowicz

1058 - 1069

Efficient Processing of Window Functions in Analytical SQL Queries

Viktor Leis, Kan Kundhikanjana, Alfons Kemper, Thomas Neumann

1070 - 1081

Real-time Targeted Influence Maximization for Online Advertisements

Yuchen Li, Dongxiang Zhang, Kian-Lee Tan

1082 - 1093

Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms

Thorsten Papenbrock, Jens Ehrlich, Jannik Marten, Tommy Neubert, Jan-Peer Rudolph, Martin Schönberg Jakob Zwiener, Felix Naumann

1094 - 1105

Searchlight: Enabling Integrated Search and Exploration over Large Multidimensional Data

Alexander Kalinin, Ugur Cetintemel, Stan Zdonik

1106 - 1117

Privacy Implications of Database Ranking

Md Farhadur Rahman, Weimo Liu, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das

Volume 8, No. 11

Rainer Gemulla: Front Matter i - ix

1118 - 1129

Possible and Certain SQL Key

Henning Köhler, Sebastian Link, Xiaofang Zhou

1130 - 1141

Scaling Similarity Joins over Tree-Structured Data

Yu Tang, Yilun Cai, Nikos Mamoulis

1142 - 1153

Worker Skill Estimation in Team-Based Tasks

Habibur Rahman, Saravanan Thirumuruganathan, Senjuti Basu Roy, Sihem Amer-Yahia, Gautam Das

1154 - 1165

DPT: Differentially Private Trajectory Synthesis Using Hierarchical Reference Systems

Xi He, Graham Cormode, Ashwin Machanavajjhala, Cecilia M. Procopiuc, Divesh Srivastava

1166 - 1177

Supporting Scalable Analytics with Latency Constraints

Boduo Li, Yanlei Diao, Prashant Shenoy

1178 - 1189

SCAN++: Efficient Algorithm for Finding Clusters, Hubs and Outliers on Large-scale Graphs

Hiroaki Shiokawa, Yasuhiro Fujiwara, Makoto Onizuka

1190 - 1201

Rethinking serializable multiversion concurrency control

Jose M. Faleiro, Daniel J. Abadi

1202 - 1213

Rank aggregation with ties: Experiments and Analysis

Bryan Brancotte, Bo Yang, Guillaume Blin, Sarah Cohen-Boulakia, Alain Denise, Sylvie Hamel

1214 - 1225

GraphMat: High performance graph analytics made productive

Narayanan Sundaram, Nadathur Satish, Md Mostofa Ali Patwary, Subramanya R Dulloor, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey

1226 - 1237

Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores

Kai Zhang, Kaibo Wang, Yuan Yuan, Lei Guo, Rubao Lee, Xiaodong Zhang

1238 - 1249

Taming Subgraph Isomorphism for RDF Query Processing

Jinha Kim, Hyungyu Shin, Wook-Shin Han, Sungpack Hong, Hassan Chafi

1250 - 1261

SnapToQuery: Providing Interactive Feedback during Exploratory Query Specification

Lilong Jiang, Arnab Nandi

1262 - 1273

GraphTwist: Fast Iterative Graph Computation with Two-tier Optimizations

Yang Zhou, Ling Liu, Kisung Lee, Qi Zhang

1274 - 1285

SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures

Hiroshi Inoue, Kenjiro Taura

1286 - 1297

Enriching Data Imputation with Extensive Similarity Neighbors

Shaoxu Song, Aoqian Zhang, Lei Chen, Jianmin Wang

1298 - 1309

To Lock, Swap, or Elide: On the Interplay of Hardware Transactional Memory and Lock-Free Indexing

Darko Makreshanski‚Ä®, Justin Levandoski, Ryan Stutsman

1310 - 1321

Incremental Knowledge Base Construction Using DeepDive

Jaeho Shin, Sen Wu, Feiran Wang, Christopher De Sa, Ce Zhang, Christopher Ré

1322 - 1333

Learning User Preferences By Adaptive Pairwise Comparison

Li Qian, Jinyang Gao, H. V. Jagadish

Volume 8, No. 12

Chen Li and Volker Markl: Front Matter i - xiv

1334 - 1345

Aggregate Estimations over Location Based Services

Weimo Liu, Md Farhadur Rahman, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das

1346 - 1357

Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff

Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, Aditya Parameswaran

1358 - 1369

SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora

Yeye He, Kris Ganjam, Xu Chu

1370 - 1381

Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views

Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Ken Goldberg, Tim Kraska

1382 - 1393

Compressed Spatial Hierarchical Bitmap (cSHB) Indexes for Efficiently Processing Spatial Range Query Workloads

Parth Nagarkar, K. Selçuk Candan, Aneesha Bhat

1394 - 1405

Selective Provenance for Datalog Programs Using Top-K Queries

Daniel Deutch, Amir Gilad, Yuval Moskovitch

1406 - 1417

Processing of Probabilistic Skyline Queries Using MapReduce

Yoonjae Park, Jun-Ki Min, Kyuseok Shim

1418 - 1429

Bonding Vertex Sets Over Distributed Graph: A Betweenness Aware Approach

Xiaofei Zhang, Hong Cheng, Lei Chen

1430 - 1441

A Natural Language Interface for Querying General and Individual Knowledge

Yael Amsterdamer, Anna Kukliansky, Tova Milo

1442 - 1453

Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement

Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki

1454 - 1465

SQLite Optimization with Phase Change Memory for Mobile Applications

Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon

1466 - 1477

An Architecture for Compiling UDF-centric Workflows

Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Cetintemel, Stan Zdonik

1478 - 1489

A Scalable Distributed Graph Partitioner

Daniel Margo, Margo Seltzer

1490 - 1501

Take me to your leader! Online Optimization of Distributed Storage Configurations

Artyom Sharov, Alexander Shraer, Arif Merchant, Murray Stokely

1502 - 1513

Association Rules with Graph Patterns

Wenfei Fan, Xin Wang, Yinghui Wu, Jingbo Xu

1514 - 1525

Fuzzy Joins in MapReduce: An Experimental Study

Ben Kimmett, Venkatesh Srinivasan, Alex Thomo

1518 - 1529

PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort

Minsik Cho, Daniel Brand, Rajesh Bordawekar, Ulrich Finkler, Vincent Kulandaisamy, Ruchir Puri

1530 - 1541

Join Size Estimation Subject to Filter Conditions

David Vengerov, Andre Cavalheiro Menck, Mohamed Zait, Sunil P. Chakkappen

1542 - 1553

Asynchronous and Fault-Tolerant Recursive Datalog Evaluation in Shared-Nothing Engines

Jingjing Wang, Magdalena Balazinska, Daniel Halperin

1554 - 1565

Maximum Rank Query

Kyriakos Mouratidis, Jilian Zhang, HweeHwa Pang

1566 - 1577

Performance and Scalability of Indexed Subgraph Query Processing Methods

Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou

1578 - 1589

Lenses: An On-Demand Approach to ETL

Ying Yang, Niccolo Meneghetti, Ronny Fehling, Zhen Hua Liu, Oliver Kennedy

1590 - 1601

Keys for Graphs

Wenfei Fan, Zhe Fan, Chao Tian, Xin Luna Dong

1602 - 1613

Spatial Partitioning Techniques in Spatial Hadoop

Ahmed Eldawy, Louai Alarabi, Mohamed F. Mokbel

1606 - 1617

Extracting Logical Hierarchical Structure of HTML Documents Based on Headings

Tomohiro Manabe, Keishi Tajima

1618 - 1629

Permutation Search Methods are Efficient, Yet Faster Search is Possible

Bilegsaikhan Naidan, Leonid Boytsov, Eric Nyberg

1630 - 1641

Distributed Architecture of Oracle Database In-memory

Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartik Kulkarni, Tirthankar Lahiri, Juan Loaiza, Neil Macnaughton, Vineet Marwah, Atrayee Mullick, Andy Witkowski, Jiaqi Yan, Mohamed Zait

1642 - 1653

Argonaut: Macrotask Crowdsourcing for Complex Data Processing

Daniel Haas, Jason Ansel, Lydia Gu, Adam Marcus

1654 - 1665

Building a Replicated Logging System with Apache Kafka

Guozhang Wang, Joel Koshy, Sriram Subramanian, Kartik Paramasivam, Mammad Zadeh, Neha Narkhede, Jun Rao, Jay Kreps, Joe Stein

1656 - 1667

Indexing and Selecting Hierarchical Business Logic

Alessandra Loro, Anja Gruenheid, Donald Kossmann, Damien Profeta, Philippe Beaudequin

1668 - 1679

Schema-Agnostic Indexing with Azure DocumentDB

Dharma Shukla, Shireesh Thota, Karthik Raman, Madhan Gajendran, Ankur Shah, Sergii Ziuzin, Krishnan Sundaram, Miguel Gonzalez Guajardo, Anna Wawrzyniak, Samer Boshra, Renato Ferreira, Mohamed Nassar, Michael Koltachev, Ji Huang, Sudipta Sengupta, Justin Levandoski, David Lomet

1680 - 1691

JetScope: Reliable and Interactive Analytics at Cloud Scale

Eric Boutin, Paul Brett, Xiaoyu Chen, Jaliya Ekanayake, Tao Guan, Anna Korsun, Zhicheng Yin, Nan Zhang, Jingren Zhou

1692 - 1703

Differential Privacy in Telco Big Data Platform

Xueyang Hu, Mingxuan Yuan, Jianguo Yao, Yu Deng, Lei Chen, Qiang Yang, Haibing Guan, Jia Zeng

1704 - 1715

Optimization of Common Table Expressions in MPP Database Systems

Amr El-Helw, Venkatesh Raghavan, Mohamed A. Soliman, George Caragea, Zhongxian Gu, Michalis Petropoulos

1716 - 1727

Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLxP Workloads

Anil K Goel, Jeffrey Pound, Nathan Auch, Peter Bumbulis, Scott MacLean, Franz Farber, Francis Gropengiesser, Christian Mathis, Thomas Bodner, Wolfgang Lehner

1728 - 1739

FIT to Monitor Feed Quality

Tamraparni Dasu, Vladislav Shkapenyuk, Divesh Srivastava, Deborah F. Swayne

1740 - 1751

Real-Time Analytical Processing with SQL Server

Per-√Öke Larson, Adrian Birka, Eric N. Hanson, Weiyun Huang, Michal Nowakiewicz, Vassilis Papadimos

1752 - 1763

Efficient Evaluation of Object-Centric Exploration Queries for Visualization

You Wu, Boulos Harb, Jun Yang, Gong Yu

1764 - 1769

Gobblin: Unifying Data Ingestion for Hadoop

Lin Qiao, Yinan Li, Sahil Takiar, Ziyang Liu, Narasimha Veeramreddy, Min Tu, Ying Dai, Issac Buenrostro, Kapil Surlaker, Shirshanka Das, Chavdar Botev

1770 - 1781

Query Optimization in Oracle 12c Database In-Memory

Dinesh Das, Jiaqi Yan, Mohamed Zait, Satyanarayana R Valluri, Nirav Vyas, Ramarajan Krishnamachari, Prashant Gaharwar, Jesse Kamp, Niloy Mukherjee

1782 - 1791

Live Programming in the LogicBlox System: A MetaLogiQL Approach

Todd J. Green, Dan Olteanu, Geoffrey Washburn

1792 - 1803

The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing

Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael J. Fernandez-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle

1804 - 1815

One Trillion Edges: Graph Processing at Facebook-Scale

Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, Sambavi Muthukrishnan

1816 - 1827

Gorilla: A Fast, Scalable, In-Memory Time Series Database

Tuomas Pelkonen, Scott Franklin, Paul Cavallaro, Qi Huang, Justin Meza, Justin Teller, Kaushik Veeraraghavan

1828 - 1839

ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection

Rahul Potharaju, Joseph Chan, Luhui Hu, Cristina Nita-Rotaru, Mingshi Wang, Liyuan Zhang, Navendu Jain

1840 - 1843

Scaling Spark in the Real World: Performance and Usability

Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia

1844 - 1847

StarDB: A Large-Scale DBMS for Strings

Majed Sahli, Essam Mansour, Panos Kalnis

1848 - 1851

Evaluating SPARQL Queries on Massive RDF Datasets

Razen Harbi, Ibrahim Abdelaziz, Panos Kalnis, Nikos Mamoulis

1852 - 1855

A Topic-based Reviewer Assignment System

Ngai Meng Kou, Leong Hou U, Nikos Mamoulis, Yuhong Li, Ye Li, Zhiguo Gong

1856 - 1859

FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data

Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez

1860 - 1863

Data Profiling with Metanome

Thorsten Papenbrock, Tanja Bergmann, Moritz Finke, Jakob Zwiener, Felix Naumann

1864 - 1867

Demonstration of Santoku: Optimizing Machine Learning over Normalized Data

Arun Kumar, Mona Jalal, Boqun Yan, Jeffrey Naughton, Jignesh M. Patel

1868 - 1871

PRISM: Concept-preserving Summarization of Top-K Social Image Search Results

Boon Siew Seah, Sourav S Bhowmick, Aixin Sun

1872 - 1875

Provenance for SQL through Abstract Interpretation: Value-less, but Worthwhile

Tobias Muller, Torsten Grust

1876 - 1879

SDB: A Secure Query Processing System with Data Interoperability

Zhian He, Wai Kit Wong, Ben Kao, David Wai Lok Cheung, Rongbin Li, Siu Ming Yiu, Eric Lo

1880 - 1883

SPARTex: A Vertex-Centric Framework for RDF Data Analytics

Ibrahim Abdelaziz, Razen Harbi, Semih Salihoglu, Panos Kalnis, Nikos Mamoulis

1884 - 1887

I2RS: A Distributed Geo-Textual Image Retrieval and Recommendation System

Lu Chen, Yunjun Gao, Zhihao Xing, Christian S. Jensen, Gang Chen

1888 - 1891

Reformulation-based query answering in RDF: alternatives and performance

Damian Bursztyn, Francois Goasdoue, Ioana Manolescu

1892 - 1895

SAASFEE: Scalable Scientific Workflow Execution Engine

Marc Bux, Jorgen Brandt, Carsten Lipka, Kamal Hakimzadeh, Jim Dowling, Ulf Leser

1896 - 1899

A Demonstration of HadoopViz: An Extensible MapReduce System for Visualizing Big Spatial Data

Ahmed Eldawy, Mohamed F. Mokbel, Christopher Jonathan

1900 - 1903

QOCO: A Query Oriented Data Cleaning System with Oracles

Moria Bergman, Tova Milo, Slava Novgorodov, Wang-Chiew Tan

1904 - 1907

TreeScope: Finding Structural Anomalies In Semi-Structured Data

Shanshan Ying, Flip Korn, Barna Saha, Divesh Srivastava

1908 - 1911

A Demonstration of the BigDAWG Polystore System

A. Elmore, J. Duggan, M. Stonebraker, M. Balazinska, U. Cetintemel, V. Gadepally, J. Heer, B. Howe, J. Kepner, T. Kraska, S. Madden, D. Maier, T. Mattson, S. Papadopoulos, J. Parkhurst, N. Tatbul, M. Vartak, S. Zdonik

1912 - 1915

RINSE: Interactive Data Series Exploration with ADS+

Kostas Zoumpatianos, Stratos Idreos, Themis Palpanas

1916 - 1919

Collaborative Data Analytics with DataHub

Anant Bhardwaj, Amol Deshpande, Aaron J. Elmore, David Karger, Sam Madden, Aditya Parameswaran, Harihar Subramanyam, Eugene Wu, Rebecca Zhang

1920 - 1923

Mindtagger: A Demonstration of Data Labeling in Knowledge Base Construction

Jaeho Shin, Christopher Re, Michael Cafarella

1924 - 1927

Perseus: An Interactive Large-Scale Graph Mining and Visualization Tool

Danai Koutra, Di Jin, Yuanshi Ning, Christos Faloutsos

1928 - 1931

Smart Drill-Down: A New Data Exploration Operator

Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran

1932 - 1935

Virtual eXist-db: Liberating Hierarchical Queries from the Shackles of Access Path Dependence

Curtis E. Dyreson, Sourav S Bhowmick, Ryan Grapp

1936 - 1939

Annotating Database Schemas to Help Enterprise Search

Eli Cortez, Philip A. Bernstein, Yeye He, Lev Novik

1940 - 1941

VIIQ: Auto-Suggestion Enabled Visual Interface for Interactive Graph Query Formulation

Nandish Jayaram, Sidharth Goyal, Chengkai Li

1944 - 1947

FLORIN – A System to Support (Near) Real-Time Applications on User Generated Content on Daily News

Qingyuan Liu, Eduard C. Dragut, Arjun Mukherjee, Weiyi Meng

1948 - 1951

VINERy: A Visual IDE for Information Extraction

Yunyao Li, Elmer Kim, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang

1952 - 1955

KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing

Xu Chu, Mourad Ouzzani, John Morcos, Ihab F. Ilyas, Paolo Papotti, Nan Tang, Yin Ye

1956 - 1959

GIS Navigation Boosted by Column Stores

Foteini Alvanaki, Romulo Goncalves, Milena Ivanovaa, Martin Kersten, Kostis Kyzirakos

1960 - 1963

Gain Control over your Integration Evaluations

Patricia C. Arocena, Radu Ciucanu, Boris Glavic, Renee J. Miller

1964 - 1967

AIDE: An Automatic User Navigation System for Interactive Data Exploration

Yanlei Diao, Kyriaki Dimitriadou, Zhan Li, Wenzhao Liu, Olga Papaemmanouil, Kemi Peng, Liping Peng

1968 - 1971

A Demonstration of AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data

Ahmed M. Aly, Ahmed S. Abdelhamid, Ahmed R. Mahmood, Walid G. Aref, Mohamed S. Hassan, Hazem Elmeleegy, Mourad Ouzzani

1972 - 1975

Janiform Intra-Document Analytics for Reproducible Research

Jens Dittrich, Patrick Bender

1976 - 1979

A Framework for Clustering Uncertain Data

Erich Schubert, Alexander Koos, Tobias Emrich, Andreas Zufle, Klaus Arthur Schmid, Arthur Zimek

1980 - 1983

EFQ: Why-Not Answer Polynomials in Action

Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki

1984 - 1987

Error Diagnosis and Data Profiling with Data X-Ray

Xiaolan Wang, Mary Feng, Yue Wang, Xin Luna Dong, Alexandra Meliou

1988 - 1991

Sharing and Reproducing Database Applications

Quan Pham, Severin Thaler, Tanu Malik, Ian Foster, Boris Glavic

1992 - 1995

A Demonstration of TripleProv: Tracking and Querying Provenance over Web Data

Marcin Wylot, Philippe Cudre-Mauroux, Paul Groth

1996 - 1999

WADaR: Joint Wrapper and Data Repair

Stefano Ortona, Giorgio Orsi, Marcello Buoncristiano, Tim Furche

2000 - 2003

DATASPREAD: Unifying Databases and Spreadsheets

Mangesh Bendre, Bofan Sun, Ding Zhang, Xinyan Zhou, Kevin Chen-Chuan Chang, Aditya Parameswaran

2004 - 2007

Wisteria: Nurturing Scalable Data Cleaning Infrastructure

Daniel Haas, Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Eugene Wu

2008 - 2011

CODD: A Dataless Approach to Big Data Testing

Ashoke S., Jayant R. Haritsa

2012 - 2015

Query-Oriented Summarization of RDF Graphs

Sejla Cebiric, Francois Goasdoue, Ioana Manolescu

2016 - 2019

Universal-DB: Towards Representation Independent Graph Analytics

Yodsawalai Chodpathumwan, Amirhossein, Aleyasen, Arash Termehchy, Yizhou Sun

2020 - 2023

Tornado: A Distributed Spatio-Textual Stream Processing System

Ahmed R. Mahmood, Ahmed M. Aly, Thamir Qadah, El Kindi Rezig, Anas Daghistani, Amgad Madkour, Ahmed S. Abdelhamid, Mohamed S. Hassan, Walid G. Aref, Seleh Basalamah

2024 - 2027

Vizdom: Interactive Analytics through Pen and Touch

Andrew Crotty, Alex Galakatos, Emanuel Zgraggen, Carsten Binnig, Tim Kraska

2028 - 2031

S+EPPs: Construct and Explore Bisimulation Summaries, plus Optimize Navigational Queries; all on Existing SPARQL Systems

Mariano P. Consens, Valeria Fionda, Shahan Khatchadourian, Giuseppe Pirro

2032 - 2035

GraphGen: Exploring Interesting Graphs in Relational Data

Konstantinos Xirogiannopoulos, Udayan Khurana, Amol Deshpande

2036 - 2039

DBSeer: Pain-free Database Administration through Workload Intelligence

Dong Young Yoon, Barzan Mozafari, Douglas P. Brown

2040 - 2041

Real Time Analytics: Algorithms and Systems

Arun Kejariwal, Sanjeev Kulkarni, Karthik Ramasamy

2042 - 2043

On Uncertain Graphs Modeling and Queries

Arijit Khan, Lei Chen

2044 - 2045

A Time Machine for Information: Looking Back to Look Forward

Xin Luna Dong, Wang-Chiew Tan

2046 - 2047

Structured Analytics in Social Media

Mahashweta Das, Gautam Das

2048 - 2049

Truth Discovery and Crowdsourcing Aggregation: A Unified Perspective

Jing Gao, Qi Li, Bo Zhao, Wei Fan, Jiawei Han

2050 - 2051

Tutorial: SQL-on-Hadoop Systems

Daniel Abadi, Shivnath Babu, Fatma Ozcan, Ippokratis Pandis

2052 - 2052

Engineering Database Hardware and Software Together

Juan Loaiza

2053 - 2056

Big Data Research: Will Industry Solve all the Problems?

Magdalena Balazinska

2057 - 2057

Big Plateaus of Big Data on the Big Island

Todd Walter

2058 - 2061

Databases and Hardware: The Beginning and Sequel of a Beautiful Friendship

Anastasia Ailamaki

Volume 8, No. 13

: Front Matter i - vii

2062 - 2073

AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data

Ahmed M. Aly, Ahmed R. Mahmood, Mohamed S. Hassan, Walid G. Aref, Mourad Ouzzani, Hazem Elmeleegy, Thamir Qadah

2074 - 2085

Lightning Fast and Space Efficient Inequality Joins

Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiane-Ruiz, Nan Tang, Panos Kalnis

2086 - 2097

Finding Pareto Optimal Groups: Group-based Skyline

Jinfei Liu, Li Xiong, Jian Pei, Jun Luo, Haoyu Zhang

2098 - 2109

k-Regret Queries with Nonlinear Utilities

Taylor Kessler Faulkner, Will Brackenbury, Ashwin Lall

2110 - 2121

Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics

Juwei Shi, Yunjie Qiu, Umar Farooq Minhas, Limei Jiao, Chen Wang, Berthold Reinwald, Fatma Ozcan

2122 - 2133

Towards Maximum Independent Sets on Massive Graphs

Yu Liu, Jiaheng Lu, Hua Yang, Xiaokui Xiao, Zhewei Wei

2134 - 2145

S-Store: Streaming Meets Transaction Processing

John Meehan, Nesime Tatbul, Stan Zdonik, Cansu Aslantas, Ugur Cetintemel, Jiang Du, Tim Kraska, Samuel Madden, David Maier, Andrew Pavlo, Michael Stonebraker, Kristin Tufte, Hao Wang

2146 - 2157

Multi-Version Range Concurrency Control in Deuteronomy

Justin Levandoski, David Lomet, Sudipta Sengupta, Ryan Stutsman, Rui Wang

2158 - 2169

Query From Examples: An Iterative, Data-Driven Approach to Query Construction

Hao Li, Chee-Yong Chan, David Maier

2170 - 2181

Tracking the Conductance of Rapidly Evolving Topic-Subgraphs

Sainyam Galhotra, Amitabha Bagchi, Srikanta Bedathur, Maya Ramanath, Vidit Jain

2182 - 2193

SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics

Manasi Vartak, Sajjadur Rahman, Samuel Madden, Aditya Parameswaran, Neoklis Polyzotis

2194 - 2205

DEXTER: Large-Scale Discovery and Extraction of Product Specifications on the Web

Disheng Qiu, Luciano Barbosa, Xin Luna Dong, Yanyan Shen, Divesh Srivastava

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy