Volume 10, 2016-2017

Peter Boncz and Ken Salem
Founding Editor-in-Chief:
H. V. Jagadish
Managing Editor:
Divesh Srivastava
Information Director:
Gerald Weber
Advisory Committee:
H. V. Jagadish, Kian-Lee Tan, Renée J. Miller, S. Sudarshan, Juliana Freire, M. Tamer Özsu, Chen Li, Wolfgang Lehner
Associate Editors:
Ashraf Aboulnaga, Shimin Chen, Gautam Das, Amol Deshpande, Zack Ives, Qiong Luo, Stefan Manegold, Ioana Manolescu, Sharad Mehrotra, Fatma Ozcan, Themis Palpanas, Rachel Pottinger, Ken Ross, Gerhard Weikum
Review Board:

Volume 10, No. 1

Peter Boncz and Ken Salem: Front Matter i - vi

1 - 12

Cohort Query Processing

Dawei Jiang, Qingchao Cai, Gang Chen, H.V. Jagadish, Beng Chin Ooi, Kian-Lee Tan, Anthony Tung

13 - 24

Remember Where You Came From: On The Second-Order Random Walk Based Proximity Measures

Yubao Wu, Yuchen Bian, Xiang Zhang

25 - 36

IL-Miner: Instance-Level Discovery of Complex Event Patterns

Lars George, Bruno Cadonna, Matthias Weidlich

Volume 10, No. 2

: Front Matter i - v

37 - 48

Adaptive NUMA-aware data placement and task scheduling for analytical workloads in main-memory column-stores

Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki

49 - 60

Mostly-Optimistic Concurrency Control for Highly Contended Dynamic Workloads on a Thousand Cores

Tianzheng Wang, Hideaki Kimura

61 - 72

Effective Indexing for Approximate Constrained Shortest Path Queries on Large Road Networks

Sibo Wang, Xiaokui Xiao, Yin Yang, Wenqing Lin

Volume 10, No. 3

: Front Matter i - vi

73 - 84

Toward High-Performance Distributed Stream Processing via Approximate Fault Tolerance

Qun Huang, Patrick P. C. Lee

85 - 96

Path Cost Distribution Estimation Using Trajectory Data

Jian Dai, Bin Yang, Chenjuan Guo, Christian Jensen, Jilin Hu

97 - 108

Fast Hierarchy Construction for Dense Subgraphs

Ahmet Erdem Sarıyüce, Ali Pinar

109 - 120

Sapprox: Enabling Efficient and Accurate Approximations on Sub-datasets with Distribution-aware Online Sampling

Xuhong Zhang, Jun Wang, Jiangling Yin, Shouling Ji

121 - 132

Multi-Query Optimization for Subgraph Isomorphism Search

Xuguang Ren, Junhu Wang

133 - 144

Efficient Computation of Feedback Arc Set at Web-Scale

Michael Simpson, Venkatesh Srinivasan, Alex Thomo

145 - 156

A Declarative Query Processing System for Nowcasting

Dolan Antenucci, Michael Anderson, Michael Cafarella

157 - 168

NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data

Alessandro Lulli, Matteo Dell'Amico, Pietro Michiardi, Laura Ricci

169 - 180

Interactive Time Series Exploration Powered by the Marriage of Similarity Distances

Rodica Neamtu, Ramoza Ahsan, Elke Rundensteiner, Gabor Sarkozy

181 - 192

Computing Longest Increasing Subsequences over Sequential Data Streams

Youhuan Li, Lei Zou, Huaming Zhang, Dongyan Zhao

193 - 204

Knowledge Exploration using Tables on the Web

Fernando Chirigati, Jialu Liu, Flip Korn, You Wu, Cong Yu, Hao Zhang

205 - 216

HubPPR: Effective Indexing for Approximate Personalized PageRank

Sibo Wang, Youze Tang, Xiaokui Xiao, Yin Yang, Zengxiang Li

217 - 228

Scalable Distributed Subgraph Enumeration

Longbin Lai, Lu Qin, Xuemin Lin, Ying Zhang, Lijun Chang

229 - 240

Fast Algorithm for the Lasso based L1-Graph Construction

Yasuhiro Fujiwara, Yasutoshi Ida, Junya Arai, Mai Nishimura, Sotetsu Iwamura

241 - 252

Resisting Tag Spam by Leveraging Implicit User Behaviors

Ennan Zhai, Zhenhua Li, Zhenyu Li, Fan Wu, Guihai Chen

253 - 264

A General Framework for Estimating Graphlet Statistics via Random Walk

Xiaowei Chen, Yongkun Li, Pinghui Wang, John C.S. Lui

265 - 276

Fast In-Memory SQL Analytics on Typed Graphs

Chunbin Lin, Benjamin Mandel, Yannis Papakonstantinou, Matthias Springer

277 - 288

Stochastic Data Acquisition for Answering Queries as Time Goes by

Zheng Li, Tingjian Ge

Volume 10, No. 4

: Front Matter i - vi

289 - 300

Finding Persistent Items in Data Streams

Haipeng Dai, Muhammad Shahzad, Alex X. Liu, Yuankun Zhong

301 - 312

BlueCache: A Scalable Distributed Flash-based Key-value Store

Shuotao Xu, Sungjin Lee, Sang-Woo Jun, Ming Liu, Jamey Hicks, Arvind

313 - 324

A General and Parallel Platform for Mining Co-Movement Patterns over Large-scale Trajectories

Qi Fan, Dongxiang Zhang, Huayu Wu, Kian-Lee Tan

325 - 336

VIP-Tree: An Effective Index for Indoor Spatial Queries

Zhou Shao, Muhammad Cheema, David Taniar, Hua Lu

337 - 348

Write-Behind Logging

Joy Arulraj, Matthew Perron, Andrew Pavlo

349 - 360

The TileDB Array Data Storage Manager

Stavros Papadopoulos, Kushal Datta, Samuel Madden, Timothy Mattson

361 - 372

DOCS: Domain-Aware Crowdsourcing System

Yudian Zheng, Guoliang Li, Reynold Cheng

373 - 384

Lifting the Haze off the Cloud: A Consumer-Centric Market for Database Computation in the Cloud

Yue Wang, Alexandra Meliou, Gerome Miklau

385 - 396

Two Birds, One Stone: A Fast, yet Lightweight, Indexing Scheme for Modern Database Systems

Jia Yu, Mohamed Sarwat

397 - 408

History is a mirror to the future: Best-effort approximate complex event matching with insufficient resources

Zheng Li, Tingjian Ge

409 - 420

PHyTM: Persistent Hybrid Transactional Memory

Hillel Avni, Trevor Brown

421 - 432

Skipping-oriented Partitioning for Columnar Layouts

Liwen Sun, Michael Franklin, Jiannan Wang, Eugene Wu

433 - 444

Estimating Quantiles from the Union of Historical and Streaming Data

Sneha Singh, Divesh Srivastava, Srikanta Tirthapura

445 - 456

Clay: Fine-Grained Adaptive Partitioning for General Database Schemas

Marco Serafini, Rebecca Taft, Aaron J. Elmore, Andrew Pavlo, Ashraf Aboulnaga, Michael Stonebraker

457 - 468

Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System

Tarique Ashraf Siddiqui, Albert Kim, John Lee, Karrie Karahalios, Aditya Parameswaran

Volume 10, No. 5

: Front Matter i - v

469 - 480

MapReduce and Streaming Algorithms for Diversity Maximization in Metric Spaces of Bounded Doubling Dimension

Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci, Eli Upfal

481 - 492

Plausible Deniability for Privacy-Preserving Data Synthesis

Vincent Bindschaedler, Reza Shokri, Carl Gunter

493 - 504

An Experimental Comparison of Partitioning Strategies in Distributed Graph Processing

Shiv Verma, Luke Leslie, Yosub Shin, Indranil Gupta

505 - 516

Shrink - Prescribing Resiliency Solutions for Streaming

Badrish Chandramouli, Jonathan Goldstein

517 - 528

Distributed Join Algorithms on Thousands of Cores

Claude Barthels, Gustavo Alonso, Torsten Hoefler, Timo Schneider, Ingo Müller

529 - 540

Clue-based Spatio-textual Query

Junling Liu, Ke Deng, Huanliang Sun, Yu Ge, Xiaofang Zhou, Christian Jensen

541 - 552

Truth Inference in Crowdsourcing: Is the Problem Solved?

Yudian Zheng, Guoliang Li, Yuanbing Li, Caihua Shan, Reynold Cheng

553 - 564

An Evaluation of Distributed Concurrency Control

Rachael Harding, Dana Van Aken, Andrew Pavlo, Michael Stonebraker

565 - 576

KBQA: Learning Question Answering over QA Corpora and Knowledge Bases

Wanyun Cui, Yanghua Xiao, Haixun Wang, Yangqiu Song, Seung-won Hwang, Wei Wang

577 - 588

Provenance for Natural Language Queries

Daniel Deutch, Nave Frost, Amir Gilad

589 - 600

AdaptDB: Adaptive Partitioning for Distributed Joins

Yi Lu, Anil Shanbhag, Alekh Jindal, Samuel Madden

601 - 612

An Experimental Evaluation of SimRank-based Similarity Search Algorithms

Zhipeng Zhang, Yingxia Shao, Bin Cui, Ce Zhang

613 - 624

High Performance Transactions via Early Write Visibility

Jose Faleiro, Daniel Abadi, Joseph Hellerstein

625 - 636

ZooBP: Belief Propagation for Heterogeneous Networks

Dhivya Eswaran, Stephan Guennemann, Christos Faloutsos, Disha Makhija, Mohit Kumar

Volume 10, No. 6

: Front Matter i - v

637 - 648

Understanding the Sparse Vector Technique for Differential Privacy

Min Lyu, Dong Su, Ninghui Li

649 - 660

OLAK: An Efficient Algorithm to Prevent Unraveling in Social Networks

Fan Zhang, Wenjie Zhang, Ying Zhang, Lu Qin, Xuemin Lin

661 - 672

Data Tweening: Incremental Visualization of Data Transforms

Meraj Ahmed Khan, Larry Xu, Arnab Nandi, Joseph Hellerstein

673 - 684

SMCQL: Secure Query Processing for Private Data Networks

Johes Bater, Greg Elliott, Craig Eggen, Satyender Goel, Abel Kho, Jennie Rogers

685 - 696

The End of a Myth: Distributed Transaction Can Scale

Erfan Zamanian, Carsten Binnig, Tim Kraska, Tim Harris

697 - 708

NED: An Inter-Graph Node Metric Based On Edit Distance

Haohan Zhu, Xianrui Meng, George Kollios

709 - 720

Effective Community Search over Large Spatial Graphs

Yixiang Fang, Reynold Cheng, Xiaodong Li, Siqiang Luo, Jiafeng Hu

Volume 10, No. 7

: Front Matter i - v

721 - 732

Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization

Jaroslaw Szlichta, Parke Godfrey, Lukasz Golab, Mehdi Kargar, Divesh Srivastava

733 - 744

Adaptive Work Placement for Query Processing on Heterogeneous Computing Resources

Tomas Karnagel, Dirk Habich, Wolfgang Lehner

745 - 756

LFTF: A Framework for Efficient Tensor Analytics at Scale

Fan Yang, Fanhua Shang, Yuzhen Huang, James Cheng, Jinfeng Li, Yunjian Zhao, Ruihao Zhao

757 - 768

Local Search Methods for k-Means with Outliers

Shalmoli Gupta, Ravi Kumar, Kefu Lu, Benjamin Moseley, Sergei Vassilvitskii

769 - 780

Dimensional Testing for Reverse k-Nearest Neighbor Search

Guillaume Casanova, Elias Englmeier, Michael Houle, Peer Kroeger, Michael Nett, Erich Schubert, Arthur Zimek

781 - 792

An Empirical Evaluation of In-Memory Multi-Version Concurrency Control

Yingjun Wu, Joy Arulraj, Jiexi Lin, Ran Xian, Andrew Pavlo

793 - 804

Finding Diverse, High-Value Representatives on a Surface of Answers

You Wu, Junyang Gao, Pankaj Agarwal, Jun Yang

805 - 816

Real-Time Influence Maximization on Dynamic Social Streams

Yanhao Wang, Qi Fan, Yuchen Li, Kian-Lee Tan

817 - 828

From Community Detection to Community Profiling

Hongyun Cai, Vincent Zheng, Fanwei Zhu, Kevin Chen-Chuan Chang, Zi Huang

829 - 840

Understanding Workers, Developing Effective Tasks, and Enhancing Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace

Ayush Jain, Akash Das Sarma, Aditya Parameswaran, Jennifer Widom

841 - 852

One-Pass Error Bounded Trajectory Simplification

Xuelian Lin, Shuai Ma, Han Zhang, Tianyu Wo, Jinpeng Huai

Volume 10, No. 8

: Front Matter i - v

853 - 864

MILC: Inverted List Compression in Memory

Jianguo Wang, Chunbin Lin, Ruining He, Moojin Chae, Yannis Papakonstantinou, Steven Swanson

865 - 876

Cümülön-D: Data Analytics in a Dynamic Spot Market

Botong Huang, Jun Yang

877 - 888

Automatic Algorithm Transformation for Efficient Multi-Snapshot Analytics on Temporal Graphs

Manuel Then, Timo Kersten, Stephan Guennemann, Alfons Kemper, Thomas Neumann

889 - 900

Looking Ahead Makes Query Plans Robust

Jianqiao Zhu, Navneet Potti, Saket Saurabh, Jignesh Patel

901 - 912

Bridging the Gap between HPC and Big Data frameworks

Michael Anderson, Shaden Smith, Narayanan Sundaram, Mihai Capotă, Zheguang Zhao, Subramanya Dulloor, Nadathur Satish, Theodore Willke

Volume 10, No. 9

: Front Matter i - v

913 - 924

Revisiting the Stop-and-Stare Algorithms for Influence Maximization

Keke Huang, Sibo Wang, Glenn Bevilacqua, Xiaokui Xiao, Laks Lakshmanan

925 - 936

Leveraging Set Relations in Exact Set Similarity Join

Xubo Wang, Lu Qin, Xuemin Lin, Ying Zhang, Lijun Chang

937 - 948

READS: A Random Walk Approach for Efficient and Accurate Dynamic SimRank

Minhao Jiang, Ada Wai Chee Fu, Raymond Chi-Wing Wong, Ke Wang

949 - 960

Attribute-Driven Community Search

Xin Huang, Laks Lakshmanan

961 - 972

Bias-Aware Sketches

Jiecao Chen, Qin Zhang

973 - 984

Data Driven Approximation with Bounded Resources

Yang Cao, Wenfei Fan

985 - 996

Errata for ``Lightning Fast and Space Efficient Inequality Joins'' (PVLDB 8(13): 2074-2085)

Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge Arnulfo Quiane Ruiz, Nan Tang, Panos Kalnis

Volume 10, No. 10

: Front Matter i - vi

986 - 997

Scalable Asynchronous Gradient Descent Optimization for Out-of-Core Models

Chengjie Qin, Martin Torres, Florin Rusu

998 - 1009

When Engagement Meets Similarity: Efficient (k,r)-Core Computation on Social Networks

Fan Zhang, Ying Zhang, Lu Qin, Wenjie Zhang, Xuemin Lin

1010 - 1021

An Experimental Evaluation of Point-of-interest Recommendation in Location-based Social Networks

Yiding Liu, Tuan-Anh Pham, Gao Cong, Quan Yuan

1022 - 1033

Don't Hold My Data Hostage - A Case For Client Protocol Redesign

Mark Raasveldt, Hannes Mühleisen

1034 - 1045

Auto-Join: Joining Tables by Leveraging Transformations

Erkang Zhu, Yeye He, Surajit Chaudhuri

1046 - 1057

Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing

Aoqian Zhang, Shaoxu Song, Jianmin Wang, Philip Yu

1058 - 1069

Pivot-based Metric Indexing

Lu Chen, Yunjun Gao, Baihua Zheng, Christian Jensen, Hanyu Yang, Keyu Yang

1070 - 1081

Heterogeneous Recommendations: What You Might Like To Read After Watching Interstellar

Rachid Guerraoui, Anne-Marie Kermarrec, Tao Lin, Rhicheek Patra

1082 - 1093

SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints

Dong Deng, Albert Kim, Samuel Madden, Michael Stonebraker

1094 - 1105

A Data Quality Metric (DQM): How to Estimate the Number of Undetected Errors in Data Sets

Yeounoh Chung, Sanjay Krishnan, Tim Kraska

1106 - 1117

Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing

Matthaios Olma, Manos Karpathiotakis, Ioannis Alagiannis, Manos Athanassoulis, Anastasia Ailamaki

1118 - 1129

Mison: A Fast JSON Parser for Data Analytics

Yinan Li, Nikos R. Katsipoulakis, Badrish Chandramouli, Jonathan Goldstein, Donald Kossmann

1130 - 1141

OrpheusDB: Bolt-on Versioning for Relational Databases

Silu Huang, Liqi Xu, Jialin Liu, Aaron J. Elmore, Aditya Parameswaran

1142 - 1153

Revisiting Reuse for Approximate Query Processing

Alex Galakatos, Andrew Crotty, Emanuel Zgraggen, Carsten Binning, Tim Kraska

1154 - 1165

Probabilistic Database Summarization for Interactive Data Exploration

Laurel Orr, Dan Suciu, Magdalena Balazinska

Volume 10, No. 11

: Front Matter i - vii

1166 - 1177

Memory Management Techniques for Large-Scale Persistent-Main-Memory Systems

Ismail Oukid, Daniel Booss, Adrien Lespinasse, Wolfgang Lehner, Thomas Willhalm, Grégoire Gomes

1178 - 1189

Trajectory Similarity Join in Spatial Networks

Shuo Shang, Lisi Chen, Zhewei Wei, Christian Jensen, Kai Zheng, Panos Kalnis

1190 - 1201

HoloClean: Holistic Data Repairs with Probabilistic Inference

Theodoros Rekatsinas, Xu Chu, Ihab Ilyas, Chris Re

1202 - 1213

Caribou: Intelligent Distributed Storage

Zsolt Istvan, David Sidler, Gustavo Alonso

1214 - 1225

Towards Linear Algebra over Normalized Data

Lingjiao Chen, Arun Kumar, Jeffrey Naughton, Jignesh Patel

1226 - 1237

Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads

Parmita Mehta, Sven Dorkenwald, Dongfang Zhao, Tomer Kaftan, Alvin Cheung, Magdalena Balazinska, Ariel Rokem, Andrew Connolly, Jacob Vanderplas, Yusra AlSayyad

1238 - 1249

Revenue Maximization in Incentivized Social Advertising

Cigdem Aslay, Francesco Bonchi, Laks Lakshmanan, Wei Lu

1250 - 1261

SquirrelJoin: Network-Aware Distributed Join Processing with Lazy Partitioning

Lukas Rupprecht, William Culhane, Peter Pietzuch

1262 - 1273

I’ve Seen “Enough”: Incrementally Improving Visualizations to Support Rapid Decision Making

Sajjadur Rahman, Maryam Aliakbarpour, Hidy Kong, Eric Blais, Karrie Karahalios, Aditya Parameswaran, Ronitt Rubinfeld

1274 - 1285

Minimal On-Road Time Route Scheduling on Time-Dependent Graphs

Lei Li, Wen Hua, Xingzhong Du, Xiaofang Zhou

1286 - 1297

A holistic view of stream partitioning costs

Nikos R. Katsipoulakis, Alexandros Labrinidis, Panos Chrysanthis

1298 - 1309

Truss-based Community Search: a Truss-equivalence Based Indexing Approach

Esra Akbas, Peixiang Zhao

1310 - 1321

Query Optimization for Dynamic Imputation

Jose Cambronero, John Feser, Micah Smith, Samuel Madden

1322 - 1333

In Search of an Entity Resolution OASIS: Optimal Asymptotic Sequential Importance Sampling

Neil Marchant, Benjamin Rubinstein

1334 - 1345

Flexible Online Task Assignment in Real-Time Spatial Data

Yongxin Tong, Libin Wang, Zimu Zhou, Bolin Ding, Lei Chen, Jieping Ye, Ke Xu

1346 - 1357

A Forward Scan based Plane Sweep Algorithm for Parallel Interval Joins

Panagiotis Bouros, Nikos Mamoulis

1358 - 1369

ASAP: Prioritizing Attention via Time Series Smoothing

Kexin Rong, Peter Bailis

1370 - 1381

Knowledge Verification for LongTail Verticals

Furong Li, Xin Luna Dong, Anno Langen, Yang Li

1382 - 1393

SkyGraph: Retrieving Regions of Interest using Skyline Subgraph Queries

Shiladitya Pande, Sayan Ranu, Arnab Bhattacharya

1394 - 1405

Reverse Engineering Aggregation Queries

Wei Chit Tan, Meihui Zhang, Hazem Elmeleegy, Divesh Srivastava

1406 - 1417

LDA*: A Robust and Large-scale Topic Modeling System

Lele Yu, Bin Cui, Ce Zhang, Yingxia Shao

1418 - 1429

Social Hash Partitioner: A Scalable Distributed Hypergraph Partitioner

Igor Kabiljo, Brian Karrer, Mayank Pundir, Sergey Pupyrev, Alon Shalita, Yaroslav Akhremtsev, Alessandro Presta

1430 - 1441

On Sampling from Massive Graph Streams

Nesreen Ahmed, Nick Duffield, Theodore Willke, Ryan Rossi

1442 - 1453

Pyramid Sketch: a Sketch Framework for Frequency Estimation of Data Streams

Tong Yang, Yang Zhou, Hao Jin, Shigang Chen, Xiaoming Li

1454 - 1465

Reconciling Skyline and Ranking Queries

Paolo Ciaccia, Davide Martinenghi

1466 - 1477

CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning

Stella Giannakopoulou, Manos Karpathiotakis, Benjamin Gaidioz, Anastasia Ailamaki

1478 - 1489

Distributed Trajectory Similarity Search

Dong Xie, Feifei Li, Jeff Phillips

1490 - 1501

Runtime Optimization of Join Location in Parallel Data Management Systems

Bikash Chandra, S. Sudarshan

1502 - 1513

Stitching Web Tables for Improving Matching Quality

Oliver Lehmberg, Christian Bizer

1514 - 1525

DigitHist: a Histogram-Based Data Summary with Tight Error Bounds

Michael Shekelyan, Anton Dignös, Johann Gamper

1526 - 1537

Fast Scans on Key-Value Stores

Markus Pilman, Kevin Bocksrocker, Lucas Braun, Renato Marroquín, Donald Kossmann

1538 - 1549

Finding the maximum clique in massive graphs

Can Lu, Jeffrey Yu, Hao Wei, Yikai Zhang

1550 - 1561

Privacy-preserving Network Provenance

Yuankai Zhang, Adam O'Neill, Micah Sherr, Wenchao Zhou

1562 - 1573

Truth Discovery for SpatioTemporal Events from Crowdsourced Data

Daniel Garcia Ulloa, Li Xiong, Vaidy Sunderam

1574 - 1585

Data Vocalization: Optimizing Voice Output of Relational Data

Immanuel Trummer, Jiancheng Zhu, Mark Bryan

1586 - 1597

NoScope: Optimizing Deep CNN-Based Queries over Video Streams at Scale

Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, Matei Zaharia

Volume 10, No. 12

: Front Matter i - ix

1598 - 1609

Parallel Replication across Formats in SAP HANA for Scaling Out Mixed OLTP/OLAP Workloads

Juchang Lee, SeungHyun Moon, Kyu Hwan Kim, Deok Hoe Kim, Sang Kyun Cha, Wook-Shin Han, Chang Gyoo Park, Hyoung Jun Na, Joo Yeon Lee

1610 - 1621

Developing a Low Dimensional Patient Class Profile in Accordance to Their Respiration-Induced Tumor Motion

Rittika Shamsuddin, Balakrishnan Prabhakaran, Amit Sawant

1622 - 1633

Dimensions Based Data Clustering and Zone Maps

Mohamed Ziauddin, Andrew Witkowski, You Jung Kim, Janaki Lahorani, Dmitry Potapov, Murali Krishna

1634 - 1645

Stateful Scalable Stream Processing at LinkedIn

Shadi A Noghabi, Kartik Paramasivam, Yi Pan, Navina Ramesh, Jon Bringhurst, Indranil Gupta, Roy Campbell

1646 - 1657

Query-able Kafka: An agile data analytics pipeline for mobile wireless networks

Eric Falk, Vijay Gurbani, Radu State

1658 - 1669

Statisticum: Data Statistics Management in SAP HANA

Anisoara Nica, Reza Sherkat, Mihnea Andrei, Xun Chen, Martin Heidel, Christian Bensberg, Heiko Gerwens

1670 - 1681

Quaestor: Query Web Caching for Database-as-a-Service Providers

Felix Gessert, Michael Schaarschmidt, Wolfram Wingerath, Erik Wiit, Eiko Yoneki, Norbert Ritter

1682 - 1693

Fiber-based architecture for NFV cloud databases

Vaidas Gasiunas, David Dominguez-Sal, Ralph Acker, Aharon Avitzur, Ilan Bronshtein, Rushan Chen, Eli Ginot, Norbert Martinez, Michael Müller, Alexander Nozdrin, Weijie Ou, Nir Pachter, Dima Sivov, Eliezer Levy

1694 - 1705

Probabilistic Demand Forecasting at Scale

Joos-Hendrik Boese, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, Dustin Lange, David Salinas, Sebastian Schelter, Matthias Seeger, Bernie Wang

1706 - 1717

ExtraV: Boosting Graph Processing Near Storage with a Coherent Accelerator

Jinho Lee, Heesu Kim, Sungjoo Yoo, Kiyoung Choi, Peter Hofstee, GiJoon Nam, Mark Nutter, Damir Jamsek

1718 - 1729

State Management in Apache Flink®: Consistent Stateful Distributed Stream Processing

Paris Carbone, Stephan Ewen, Gyula Fóra, Seif Haridi, Stefan Richter, Kostas Tzoumas

1730 - 1741

PaxosStore: High-availability Storage Made Practical in WeChat

Jianjun Zheng, Qian Lin, Jiatao Xu, Cheng Wei, Chuwei Zeng, Pingan Yang, Yunfan Zhang

1742 - 1753

Resumable Online Index Rebuild in SQL Server

Panagiotis Antonopoulos, Hanuma Kodavalla, Alex Tran, Nitish Upreti, Chaitali Shah, Mirek Sztajno

1754 - 1765

SAP HANA Adoption of Non-Volatile Memory

Mihnea Andrei, Christian Lemke, Günter Radestock, Robert Schulze, Carsten Thiel, Rolando Blanco, Akanksha Meghlan, Muhammad Sharique, Sebastian Seifert, Surendra Vishnoi, Daniel Booss, Thomas Peh, Ivan Schreter, Werner Thesing, Mehul Wagle, Thomas Willhalm

1766 - 1777

CarStream: An Industrial System of Big Data Processing for Internet-of-Vehicles

Mingming Zhang, Tianyu Wo, Xuelian Lin, Tao Xie, Yaxiao Liu

1778 - 1789

FAD.js: Fast JSON Data Access Using JIT-based Speculative Optimizations

Daniele Bonetta, Matthias Brantner

1790 - 1801

Colt: Concept Lineage Tool for Data Flow Metadata Capture and Analysis

Kareem Aggour, Jenny Weisenberg Williams, Justin McHugh, Vijay Kumar

1802 - 1812

Matrix Profile IV: Using Weakly Labeled Time Series to Predict Outcomes

Chin-Chia Michael Yeh, Nickolas Kavantzas, Eamonn Keogh

1813 - 1824

Adaptive Statistics in Oracle 12c

Mohamed Zait, Sunil Chakkappen, Suratna Budalakoti, Satyanarayana Valluri, Ramarajan Krishnamachari, Alan Wood

1825 - 1836

Dhalion:Self-Regulating Stream Processing in Heron

Avrilia Floratou, Ashvin Agrawal, Bill Graham, Sriram Rao, Karthik Ramasamy

1837 - 1840

Interactive Navigation of Open Data Linkages

Erkang Zhu, Ken Pu, Fatemeh Nargesian, Renee Miller

1841 - 1844

noWorkflow: a Tool for Collecting, Analyzing, and Managing Provenance from Python Scripts

Jo√£o Felipe Pimentel, Leonardo Murta, Vanessa Braganholo, Juliana Freire

1845 - 1848

ARShop: A Cloud-based Augmented Reality System for Shopping

Chao Wang, Yihao Feng, Qi Guo, Zhaoxian Li, Kexin Liu, Zijian Tang, Anthony Tung, Lifu Wu, Yuxin Zheng

1849 - 1852

Mind the Gap: Bridging Multi-Domain Query Workloads with EmptyHeaded

Christopher Aberger, Andrew Lamb, Kunle Olukotun, Christopher Ré

1853 - 1856

Crossing the finish line faster when paddling the Data Lake with Kayak

Antonio Maccioni, Riccardo Torlone

1857 - 1860

Debugging Transactions and Tracking their Provenance with Reenactment

Xing Niu, Bahareh Sadat Arab, Seokki Lee, Su Feng, Xun Zou, Dieter Gawlick, Vasudha Krishnaswamy, Zhen Hua Liu, Boris Glavic

1861 - 1864

PICASSO: Exploratory Search of Connected Subgraph Substructures in Graph Databases

Kai Huang, Sourav S Bhowmick, Shuigeng Zhou, Byron Choi

1865 - 1868

DITIR: Distributed Index for High Throughput Trajectory Insertion and Real-time Temporal Range Query

Ruichu Cai, Zijie Lu, Li Wang, Zhenjie Zhang, Tom Fu, Marianne Winslett

1869 - 1872

FlashView: An Interactive Visual Explorer for Raw Data

Zhifei Pang, Sai Wu, Gang Chen, Ke Chen, Lidan Shou

1873 - 1876

Upsortable: Programming TopK Queries Over Data Streams

Julien Subercaze, Christophe Gravier, Syed Gillani, Abderrahmen Kammoun, Frédérique Laforest

1877 - 1880

QUIS: InSitu Heterogeneous Data Source Querying

Javad Chamanara, Birgitta König-Ries, H. V. Jagadish

1881 - 1884

Automating Data Citation in CiteDB

Abdussalam Alawini, Susan Davidson, Wei Hu, Yinjun Wu

1885 - 1888

C-Explorer: Browsing Communities in Large Graphs

Yixiang Fang, Reynold Cheng, Siqiang Luo, Jiafeng Hu, Kai Huang

1889 - 1892

GRAPE: Parallelizing Sequential Graph Computations

Wenfei Fan, Jingbo Xu, Yinghui Wu, Wenyuan Yu, Jiaxin Jiang

1893 - 1896

Flower: A Data Analytics Flow Elasticity Manager

Alireza Khoshkbarforoushha, Rajiv Ranjan, Qing Wang, Carsten Friedrich

1897 - 1900

STEED: An Analytical Database System for TrEE-structured Data

Zhiyi Wang, Dongyan Zhou, Shimin Chen

1901 - 1904

LocLok: Location Cloaking with Differential Privacy via Hidden Markov Model

Yonghui Xiao, Li Xiong, Si Zhang, Yang Cao

1905 - 1908

Strider: An Adaptive, Inference-enabled Distributed RDF Stream Processing Engine

Xiangnan Ren, Olivier Curé, Li Ke, Jérémy Lhez, Badre Belabbess, Tendry Randriamalala, Yufan Zheng, Gabriel Kepeklian

1909 - 1912

A Confidence-Aware Top-k Query Processing Toolkit on Crowdsourcing

Yan Li, Ngai Meng Kou, Hao Wang, Leong Hou U, Zhiguo Gong

1913 - 1916

Explaining and Querying Knowledge Graphs by Relatedness

Valeria Fionda, Giuseppe Pirrò

1917 - 1920

Thoth in Action: Memory Management in Modern Data Analytics

Mayuresh Kunjir, Shivnath Babu

1921 - 1924

Monopedia: Staying Single is Good Enough - The HyPer Way for Web Scale Applications

Maximilian Schüle, Pascal Schliski, Thomas Hutzelmann, Tobias Rosenberger, Viktor Leis, Dimitri Vorona, Alfons Kemper, Thomas Neumann

1925 - 1928

Dima: A Distributed In-Memory Similarity-Based Query Processing System

Ji Sun, Zeyuan Shang, Guoliang Li, Dong Deng, Zhifeng Bao

1929 - 1932

TeCoRe: Temporal Conflict Resolution in Knowledge Graphs

Melisachew Chekol, Giuseppe Pirrò, Joerg Schoenfisch, Heiner Stuckenschmidt

1933 - 1936

MLog: Towards Declarative In-Database Machine Learning

Xupeng Li, Bin Cui, Yiru Chen, Wentao Wu, Ce Zhang

1937 - 1940

Foresight: Recommending Visual Insights

Çağatay Demiralp, Peter Haas, Srinivasan Parthasarathy, Tejaswini Pedapati

1941 - 1944

A BAD Demonstration: Towards Big Active Data

Steven Jacobs, Md Yusuf Sarwar Uddin, Michael Carey, Vagelis Hristidis, Vassilis Tsotras, Nalini Venkatasubram, Yao Wu, Syed Safir, Purvi Kaul, Xikui Wang, Mohiuddin Abdul Qader, Yawei Li

1945 - 1948

ClaimBuster:The First-ever End-to-end Fact-checking System

Naeemul Hassan, Gensheng Zhang, Fatma Arslan, Josue Caraballo, Damian Jimenez, Siddhant Gawsane, Shohedul Hasan, Minumol Joseph, Aaditya Kulkarni, Anil Kumar Nayak, Vikas Sable, Chengkai Li, Mark Tremayne

1949 - 1952

QIRANA Demonstration: Real time Scalable Query Pricing

Shaleen Deep, Paris Koutris, Yash Bidasaria

1953 - 1956

DataTweener: A Demonstration of a Tweening Engine for Incremental Visualization of Data Transforms

Meraj Ahmed Khan, Larry Xu, Arnab Nandi, Joseph Hellerstein

1957 - 1960

ZaliQL: Causal Inference from Observational Data at Scale

Babak Salimi, Corey Cole, Dan Ports, Dan Suciu

1961 - 1964

A Demonstration of ST-Hadoop: A MapReduce Framework for Big Spatio-temporal Data

Louai Alarabi, Mohamed Mokbel

1965 - 1968

Creation and Interaction with Large-scale Domain-Specific Knowledge Bases

Shreyas Bharadwaj, Laura Chiticariu, Marina Danilevsky, Samarth Dhingra, Samved Divekar, Arnaldo Carreno-Fuentes, Himanshu Gupta, Nitin Gupta, Sang-Don Han, Mauricio Hernandez, Howard Ho, Parag Jain, Salil Joshi, Hima Karanam, Saravanan Krishnan, Rajasekar Krishnamurthy, Yunyao Li, Satishkumaar Manivannan, Ashish Mittal, Fatma Ozcan, Abdul Quamar, Poornima Raman, Diptikalyan Saha, Karthik Sankaranarayanan, Jaydeep Sen, Prithviraj Sen, Shivakumar Vaithyanathan, Mitesh Vasa, Hao Wang, Huaiyu Zhu

1969 - 1972

A Demonstration of Stella: A Crowdsourcing-Based Geotagging Framework

Christopher Jonathan, Mohamed Mokbel

1973 - 1976

Exploring big volume sensor data with Vroom

Oscar Moll, Aaron Zalewski, Sudeep Pillai, Samuel Madden, Michael Stonebraker, Vijay Gadepally

1977 - 1980

New Trends on Exploratory Methods for Data Analytics

Davide Mottin, Matteo Lissandrini, Yannis Velegrakis, Themis Palpanas

1981 - 1984

Summarizing Static and Dynamic Big Graphs

Arijit Khan, Sourav S Bhowmick, Francesco Bonchi

1985 - 1987

Geometric Approaches for Top-k Queries

Kyriakos Mouratidis

1988 - 1991

Spatial Crowdsourcing: Challenges, Techniques, and Applications

Yongxin Tong, Lei Chen, Cyrus Shahabi

1992 - 1995

The Era of Big Spatial Data

Ahmed Eldawy, Mohamed Mokbel

1996 - 1999

Complex Event Recognition in the Big Data Era

Nikos Giatrakos, Alexander Artikis, Antonios Deligiannakis, Minos Garofalakis

2000 - 2001

Blockchains and Databases

C. Mohan

2002 - 2005

Caching at the Web Scale

Victor Zakhary, Amr El Abbadi, Divyakant Agarwal

2006 - 2017

Human-in-the-loop Data Integration

Guoliang Li

2018 - 2019

The Data Center under your Desk - How Disruptive is Modern Hardware for DB System Design?

Wolfgang Lehner

2020 - 2020

7 Secrets That My Mother Didn't Tell Me

Tova Milo

2021 - 2024

Intelligent Probing for Locality Sensitive Hashing: Multi-Probe LSH and Beyond

Qin Lv, William Josephson, Zhe Wang, Moses Charikar, Kai Li

Volume 10, No. 13

: Front Matter i - vii

2025 - 2036

Scalable Replay-Based Replication For Fast Databases

Dai Qin, Ashvin Goel, Angela Brown

2037 - 2048

SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data

Kai Ren, Qing Zheng, Joy Arulraj, Garth Gibson

2049 - 2060

A Survey and Experimental Comparison of Distributed SPARQL Engines for Very Large RDF Data

Ibrahim Abdelaziz, Razen Harbi, Zuhair Khayyat, Panos Kalnis

2061 - 2072

BlockJoin: Efficient Matrix Partitioning Through Joins

Andreas Kunft, Asterios Katsifodimos, Sebastian Schelter, Tilmann Rabl, Volker Markl

2073 - 2084

Efficient Mining of Regional Movement Patterns in Semantic Trajectories

Dong-Wan Choi, Jian Pei, Thomas Heinis

2085 - 2096

Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models

Martin Kiefer, Max Heimel, Sebastian Breß, Volker Markl

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy