Volume 14, 2020-2021

Editors-in-Chief:
Xin Luna Dong and Felix Naumann
Publication Editors:
Thorsten Papenbrock and Hannes Mühleisen
Associate Editors:
Alon Halevy, Anastasia Ailamaki, Angela Bonifati, Arun Kumar, Ashraf Aboulnaga, Eugene Wu, Floris Geerts, Graham Cormode, Jeffrey Xu Yu, Jiannan Wang, Jingren Zhou, Jorge-Arnulfo Quiané-Ruiz, Juliana Freire, Jun Yang, Martin Theobald, Nesime Tatbul, Paolo Papotti, Rainer Gemulla, Stefan Manegold, Stratos Idreos, Surajit Chaudhuri, Xuemin Lin, Yi Chen, Yufei Tao, Zachary Ives, Zhifeng Bao

Volume 14, No. 1

Xin Luna Dong and Felix Naumann: Front Matter i - vi

1 - 13

Benchmarking Learned Indexes

Ryan C Marcus (MIT), Andreas Kipf (MIT), Alexander van Renen (TUM), Mihail Stoian (TUM), Sanchit Misra (Intel), Alfons Kemper (TUM), Thomas Neumann (TUM), Tim Kraska (MIT)

14 - 27

Tempura: A General Cost-Based Optimizer Framework for Incremental Data Processing

Zuozhi Wang (UC Irvine), Kai Zeng (Alibaba Group), Botong Huang (Alibaba), Wei Chen (Alibaba), Xiaozong Cui (Alibaba), Bo Wang (Alibaba), Ji Liu (Alibaba), Liya Fan (Alibaba), Dachuan Qu (Alibaba), Zhenyu Hou (Alibaba Group), Tao Guan (Alibaba Group), Chen Li (UC Irvine), Jingren Zhou (Alibaba Group)

28 - 36

Inspector Gadget: A Data Programming-based Labeling System for Industrial Images

Geon Heo (KAIST), Yuji Roh (KAIST), Seonghyeon Hwang (KAIST), Dayun Lee (KAIST), Steven Whang (KAIST)

37 - 49

Scaling Attributed Network Embedding to Massive Graphs

Renchi Yang (National University of Singapore), Jieming Shi (The Hong Kong Polytechnic University), Xiaokui Xiao (National University of Singapore), Yin Yang (Hamad bin Khalifa University), Juncheng Liu (National University of Singapore), Sourav S Bhowmick (Nanyang Technological University)

50 - 60

Deep Entity Matching with Pre-Trained Language Models

Yuliang Li (Megagon Labs), Jinfeng Li (Megagon Labs), Yoshihiko Suhara (Megagon Labs), AnHai Doan (University of Wisconsin-Madison), Wang-Chiew Tan (Facebook AI)

61 - 73

NeuroCard: One Cardinality Estimator for All Tables

Zongheng Yang (UC Berkeley), Amog Kamsetty (UC Berkeley), Sifei Luan (UC Berkeley), Eric Liang (UC Berkeley), Yan Duan (COVARIANT.AI), Peter Chen (COVARIANT.AI), Ion Stoica (UC Berkeley)

Volume 14, No. 2

Yufei Tao: Front Matter i - vi

74 - 86

Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads

Jialin Ding (MIT), Vikram Nathan (MIT), Mohammad Alizadeh (MIT CSAIL), Tim Kraska (MIT)

87 - 100

Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics

Daniel Kang (Stanford University), Ankit Mathur (Stanford University), Teja Veeramacheneni (Stanford University), Peter D Bailis (Stanford University), Matei Zaharia (Stanford and Databricks)

101 - 113

Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling

Prashanth Menon (Carnegie Mellon University), Amadou Ngom (Carnegie Mellon University), Todd Mowry (Carnegie Mellon University), Andrew Pavlo (Carnegie Mellon University), Lin Ma (Carnegie Mellon University)

114 - 127

EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs

Seung Won Min, Vikram Sharma Mailthody, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-Mei Hwu

128 - 140

On-Off Sketch: A Fast and Accurate Sketch on Persistence

Yinda Zhang (Peking University), Jinyang Li (Peking University), Yutian Lei (Xiangtan University), Tong Yang (Peking University), Zhetao Li (Xiangtan University), Gong Zhang (Huawei), Bin Cui (Peking University)

141 - 153

Real-Time Distance-Based Outlier Detection in Data Streams

Luan V Tran (University of Southern California), Min Mun (University of Southern California), Cyrus Shahabi (Computer Science Department, University of Southern California)

154 - 162

Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation

Olga Poppe (Microsoft), Tayo Amuneke (Microsoft), Dalitso Banda (Microsoft), Aritra De (Microsoft), Ari Green (Microsoft), Manon Knoertzer (Microsoft), Ehi Nosakhare (Microsoft), Karthik Rajendran (Microsoft), Deepak Shankargouda (Microsoft), Meina Wang (Microsoft), Alan Au (Microsoft), Carlo Curino (Microsoft -- GSL), Qun Guo (Microsoft), Alekh Jindal (Microsoft), Ajay Kalhan (Microsoft), Morgan Oslake (Microsoft), Sonia Parchani (Microsoft), Vijay Ramani (Microsoft), Raj Sellappan (Microsoft), Saikat Sen (Microsoft), Sheetal Shrotri (Microsoft), Soundararajan Srinivasan (Microsoft), Ping Xia (Microsoft), Shize Xu (Microsoft), Alicia Yang (Microsoft), Yiwen Zhu (Microsoft)

163 - 175

On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection

Sheng Wang (New York University), Yuan Sun (Monash University), Zhifeng Bao (RMIT University)

176 - 188

RapidMatch: A Holistic Approach to Subgraph Query Processing

Shixuan Sun (National University of Singapore), Xibo Sun (Hong Kong University of Science and Technology), Yulin Che (Hong Kong University of Science and Technology), Qiong Luo (Hong Kong University of Science and Technology), Bingsheng He (National University of Singapore)

189 - 201

Taurus: Lightweight Parallel Logging for In-Memory Database Management Systems

Yu Xia (MIT), Xiangyao Yu (University of Wisconsin-Madison), Andrew Pavlo (Carnegie Mellon University), Srinivas Devadas (MIT)

202 - 214

Improving Execution Efficiency of Just-in-time Compilation based Query Processing on GPUs

Johns Paul (NUS), Bingsheng He (National University of Singapore), Shengliang Lu (National University of Singapore), Chiew Tong Lau (Nanyang Technological University)

215 - 227

PPQ-Trajectory: Spatio-temporal Quantization for Querying in Large Trajectory Repositories

Shuang Wang (University of Warwick), Hakan Ferhatosmanoglu (University of Warwick)

228 - 240

Aggregated Deletion Propagation for Counting Conjunctive Query Answers

Xiao Hu (Duke University), Shouzhuo Sun (Duke University), Shweta Patwa (Duke University), Debmalya Panigrahi (Duke University), Sudeepa Roy (Duke University, USA)

Volume 14, No. 3

Anastasia Ailamaki: Front Matter i - vi

241 - 254

Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems

Chen Luo (Snowflake Inc.), Michael Carey (UC Irvine)

255 - 267

Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions

Bojan Karlaš (ETH Zürich), Peng Li (GATECH), Renzhi Wu (Georgia Institute of Technology), Nezihe Merve Gürel (ETH Zürich), Xu Chu (GATECH), Wentao Wu (Microsoft Research), Ce Zhang (ETH)

268 - 280

Elle: Inferring Isolation Anomalies from Experimental Observations

Peter Alvaro (UC Santa Cruz), Kyle Kingsbury (Jepsen)

281 - 293

Scotch: Generating FPGA-Accelerators for Sketching at Line Rate

Martin Kiefer (TU Berlin), Ilias Poulakis (TU Berlin), Sebastian Bress (TU Berlin), Volker Markl (Technische Universität Berlin)

294 - 306

ORBITS: Online Recovery of Missing Values in Multiple Time Series Streams

Mourad Khayati (University of Fribourg), Ines Arous (University of Fribourg), Zakhar Tymchenko (University of Fribourg), Philippe Cudre-Mauroux (Exascale Infolab, Fribourg University)

307 - 319

TURL: Table Understanding through Representation Learning

Xiang Deng (The Ohio State University), Huan Sun (Ohio State University), Alyssa Lees (Google), You Wu (Google), Cong Yu (Google)

320 - 328

EdgeDIPN: a Unified Deep Intent Prediction Network Deployed at the Edge

Long Guo (Alibaba Inc.), Lifeng Hua (Alibaba Group), Rongfei Jia (Alibaba Group), Fei Fang (Alibaba Group), Binqiang Zhao (Alibaba), Bin Cui (Peking University)

329 - 341

LOCATER: Cleaning WiFi Connectivity Datasets for Semantic Localization

Yiming Lin (University of California, Irvine), Daokun Jiang (University of California, Irvine), Roberto Yus (UC Irvine), Georgios Bouloukakis (Telecom SudParis), Andrew Chio (University of California, Irvine), Sharad Mehrotra (U.C. Irvine), Nalini Venkatasubramanian (University of California, Irvine)

342 - 350

Multi-Modal Transportation Recommendation with Unified Route Representation Learning

Hao Liu (Business Intelligence Lab, Baidu Research), Jindong Han (Baidu), Yanjie Fu (University of Central Florida), Jingbo Zhou (Baidu Inc.), Xinjiang Lu (Baidu Inc.), Hui Xiong (Rutgers University)

351 - 363

DISK: A Distributed Framework for Single-Source SimRank with Accuracy Guarantee

Yue Wang (Shenzhen Institute of Computing Sciences, Shenzhen University), Ruiqi Xu (University of Edinburgh), Zonghao Feng (Hong Kong University of Science and Technology), Yulin Che (Hong Kong University of Science and Technology), Lei Chen (Hong Kong University of Science and Technology), Qiong Luo (Hong Kong University of Science and Technology), Rui Mao (Shenzhen University)

364 - 377

Toward a Better Understanding and Evaluation of Tree Structures on Flash SSDs

Diego Didona (IBM Research Zurich), Nikolas Ioannou (IBM Research), Radu I Stoica (IBM), Kornilios Kourtis (Independent researcher)

378 - 390

Answering Multi-Dimensional Range Queries under Local Differential Privacy

Jianyu Yang (Beijing University of Posts and Telecommunications), Tianhao Wang (Purdue University), Ninghui Li (Purdue University), Xiang Cheng (Beijing University of Posts and Telecommunications), Sen Su (Beijing University of Posts and Telecommunications)

391 - 403

Ananke: A Streaming Framework for Live Forward Provenance

Dimitris Palyvos-Giannas (Chalmers University of Technology), Bastian Havers (Chalmers University of Technology and Volvo Car Corporation), Marina Papatriantafilou (Chalmers University of Technology), Vincenzo Gulisano (Chalmers University of Technology)

404 - 417

RECEIPT: REfine CoarsE-grained IndePendent Tasks for Parallel Tip decomposition of Bipartite Graphs

Kartik Lakhotia (USC), Rajgopal Kannan (USC), Viktor K Prasanna (University of Southern California), César De Rose (PUCRS)

418 - 430

Comprehensive and Efficient Workload Compression

Shaleen Deep (University of Wisconsin-Madison), Anja Gruenheid (Microsoft), Paraschos Koutris (University of Wisconsin-Madison), Jeff Naughton (Google), Stratis Viglas (University of Edinburgh)

431 - 444

CoroBase: Coroutine-Oriented Main-Memory Database Engine

Yongjun He (Simon Fraser University), Jiacheng Lu (Simon Fraser University), Tianzheng Wang (Simon Fraser University)

445 - 457

Scalable Querying of Nested Data

Jaclyn Smith (Oxford University), Michael Benedikt (Oxford University), Milos Nikolic (University of Edinburgh), Amir Shaikhha (University of Edinburgh)

Volume 14, No. 4

Angela Bonifati and Jorge-Arnulfo Quiané-Ruiz: Front Matter i - vi

458 - 470

Space- and Computationally-Efficient Set Reconciliation via Parity Bitmap Sketch (PBS)

Long Gong (Facebook), Ziheng Liu (Peking University), Liang Liu (Georgia Institute of Technology), Jun Xu (Georgia Tech), Mitsunori Ogihara (University of Miami), Tong Yang (Peking University)

471 - 484

Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning

Suraj Shetiya (The University of Texas at Arlington), Saravanan Thirumuruganathan (QCRI), Nick Koudas (University of Toronto), Gautam Das (U. of Texas Arlington)

485 - 497

Compact, Tamper-Resistant Archival of Fine-Grained Provenance

Nan Zheng (University of Pennsylvania), Zack Ives (University of Pennsylvania)

498 - 506

Rumble: Data Independence for Large Messy Data Sets

Ingo Müller (ETH Zürich), Ghislain Fourny (ETH Zurich), Stefan Irimescu (Beekeeper AG), Can Cikis (ETH Zurich), Gustavo Alonso (ETHZ)

507 - 520

Capturing and querying fine-grained provenance of preprocessing pipelines in data science

Adriane Chapman (University of Southampton), Paolo Missier (Newcastle University), Giulia Simonelli (Universita Roma Tre), Riccardo Torlone (Universita Roma Tre)

521 - 533

Local Dampening: Differential Privacy for Non-numeric Queries via Local Sensitivity

Victor Farias (Universidade Federal do Ceara), Felipe Brito (LSBD/UFC), Cheryl Flynn (AT&T Labs Research), Javam C Machado (LSBD/UFC), Subhabrata Majumdar (AT&T Labs Research), Divesh Srivastava (AT&T Labs Research)

534 - 546

Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats

Tianyu Li (Massachusetts Institute of Technology), Matthew Butrovich (Carnegie Mellon University), Amadou Ngom (Carnegie Mellon University), Wan Shen Lim (Carnegie Mellon University), Wes McKinney (Ursa Labs), Andrew Pavlo (Carnegie Mellon University)

547 - 559

Accelerating Exact Constrained Shortest Paths on GPUs

Shengliang Lu (National University of Singapore), Bingsheng He (National University of Singapore), Yuchen Li (Singapore Management University), Hao Fu (Tianjin University)

560 - 572

Towards an Efficient Weighted Random Walk Domination

Songsong Mo (Wuhan University), Zhifeng Bao (RMIT University), Ping Zhang (Huawei), Zhiyong Peng (Wuhan University, China)

573 - 585

Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach

Guimu Guo (The University of Alabama at Birmingham), Da Yan (University of Alabama at Birmingham), Tamer Özsu (University of Waterloo), Zhe Jiang (University of Alabama), Jalal Khalil (The University of Alabama at Birmingham)

586 - 599

CALYPSO: Private Data Management for Decentralized Ledgers

Eleftherios Kokoris Kogias (IST Austria), Enis Ceyhun Alp (EPFL), Linus Gasser (EPFL), Philipp Jovanovic (UCL), Ewa Syta (Trinity College), Bryan Ford (EPFL)

600 - 612

Stacked Filters: Learning to Filter by Structure

Kyle B Deeds (Harvard University), Brian N Hentschel (Harvard University), Stratos Idreos (Harvard)

613 - 625

Maximizing Social Welfare in a Competitive Diffusion Model

Prithu Banerjee (UBC), Laks V.S. Lakshmanan (The University of British Columbia), Wei Chen (Microsoft)

626 - 639

Understanding the Idiosyncrasies of Real Persistent Memory

Shashank Gugnani (The Ohio State University), Arjun Kashyap (The Ohio State University), Xiaoyi Lu (The Ohio State University)

640 - 652

Explaining Ranking Functions

Abraham Gale, Amelie Marian

653 - 667

ConnectIt: A Framework for Static and Incremental Parallel Graph Connectivity Algorithms

Laxman Dhulipala (MIT CSAIL), Changwan Hong (Massachusetts Institute of Technology), Julian Shun (MIT)

668 - 681

Quality of Sentiment Analysis Tools: The Reasons of Inconsistency

Wissam Mammar Kouadri (University of Paris), Mourad Ouziri (University of Paris), Salima Benbernou (Université Paris Descartes), Karima Echihabi (Mohammed VI Polytechnic University), Themis Palpanas (University of Paris), Iheb Benamor (Imba Consulting)

682 - 693

Hindsight Logging for Model Training

Rolando Garcia (UC Berkeley), Erick Liu (UC Berkeley), Vikram Sreekanti (UC Berkeley), Bobby Yan (UC Berkeley), Anusha Dandamudi (UC Berkeley), Joseph Gonzalez (UC Berkeley), Joseph M Hellerstein (UC Berkeley), Koushik Sen (University of California, Berkeley)

694 - 707

Scalable Structural Index Construction for JSON Analytics

Lin Jiang (University of California, Riverside), Junqiao Qiu (Michigan Technological University), Zhijia Zhao (University of California, Riverside)

708 - 720

Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment

Ran Rui (University of South Florida), Hao Li (University of South Florida), Yi-Cheng Tu (University of South Florida)

Volume 14, No. 5

Ashraf Aboulnaga: Front Matter

721 - 729

FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Shuyuan Yan (Alibaba), Bolin Ding (Data Analytics and Intelligence Lab, Alibaba Group), Wei Guo (Alibaba), Jingren Zhou (Alibaba Group), Zhewei Wei (Renmin University of China), Xiaowei Jiang (Alibaba Group), Sheng Xu (Alibaba Group)

730 - 742

Efficient Streaming Subgraph Isomorphism with Graph Neural Networks

Chi Thang Duong (Ecole Polytechnique Federale de Lausanne), Dung Trung Hoang (Hanoi University of Science and Technology), Hongzhi Yin (The University of Queensland), Matthias Weidlich (Humboldt-Universität zu Berlin), Quoc Viet Hung Nguyen (Griffith University), Karl Aberer (EPFL)

743 - 756

Epoch-based Commit and Replication in Distributed OLTP Databases

Yi Lu (MIT), Xiangyao Yu (University of Wisconsin-Madison), Lei Cao (MIT), Samuel Madden (MIT)

757 - 770

Hierarchical Core Maintenance on Large Dynamic Graphs

Zhe Lin (East China Normal University), Fan Zhang (Guangzhou University), Xuemin Lin (University of New South Wales), Wenjie Zhang (University of New South Wales), Zhihong Tian (Guangzhou University)

771 - 784

Analyzing and Mitigating Data Stalls in DNN Training

Jayashree Mohan (UT Austin), Amar Phanishayee (Microsoft Research), Ashish Raniwala (Microsoft), Vijay Chidambaram (UT Austin and VMWare)

785 - 798

Persistent Memory Hash Indexes: An Experimental Evaluation

Daokun Hu (College of Computer Science and Electronic Engineering, Hunan University, China), Zhiwen Chen (Hunan University), Jianbing Wu (Peking University Shenzhen Graduate School), Jianhua Sun (College of Computer Science and Electronic Engineering, Hunan University, China), Hao Chen (College of Computer Science and Electronic Engineering, Hunan University, China)

799 - 812

Optimizing An In-memory Database System For AI-powered On-line Decision Augmentation Using Persistent Memory

Cheng Chen, Jun Yang, Mian Lu, Taize Wang, Zhao Zheng, Yuqiang Chen, Wenyuan Dai, Bingsheng He, Weng-fai Wong, Guoan Wu, Yuping Zhao, Andy Rudoff

813 - 821

DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks

Arif Usta (Bilkent University), Akifhan Karakayalı (Bilkent University), Özgür Ulusoy (Bilkent University)

822 - 834

Improving Information Extraction from Visually Rich Documents using Visual Span Representations

Ritesh Sarkhel (Ohio State University), Arnab Nandi (The Ohio State University)

835 - 848

Zen: a High-Throughput Log-Free OLTP Engine for Non-Volatile Main Memory

Gang Liu (Chinese Academy of Sciences), Leying Chen (Chinese Academy of Sciences), Shimin Chen (Chinese Academy of Sciences)

849 - 862

Differentially Private Binary- and Matrix-Valued Data Query: An XOR Mechanism

Tianxi Ji (Case Western Reserve University), Pan Li (Case Western Reserve University), Emre Yilmaz (University of Houston-Downtown), Erman Ayday (Case Western Reserve University, Bilkent University), Yanfang Ye (Case Western Reserve University), Jinyuan Sun (The University of Tennessee, Knoxville)

Volume 14, No. 6

Hannes Mühleisen and Thorsten Papenbrock: Front Matter

863 - 863

Errata for "Cerebro: A Data System for Optimized Deep Learning Model Selection"

Supun C Nakandala (University of California, San Diego), Yuhao Zhang (University of California, San Diego), Arun Kumar (University of California, San Diego)

864 - 877

ParaX: Boosting Deep Learning for Big Data Analytics on Many-Core CPUs

Lujia Yin (NUDT), Yiming Zhang (NUDT), Zhaoning Zhang (NUDT), Yuxing Peng (NUDT), Peng Zhao (Intel)

878 - 889

Optimization of Threshold Functions over Streams

Walter Cai (University of Washington), Philip Bernstein (Microsoft Research), Wentao Wu (Microsoft Research), Badrish Chandramouli (Microsoft Research)

890 - 902

Budget Constrained Interactive Search for Multiple Targets

Xuliang Zhu (Hong Kong Baptist University), Xin Huang (Hong Kong Baptist University), Byron Choi (Hong Kong Baptist University), Jiaxin Jiang (Hong Kong Baptist University), Zhaonian Zou (Harbin Institute of Technology), Jianliang Xu (Hong Kong Baptist University)

903 - 915

On the String Matching with k Differences in DNA Databases

Yangjun Chen (University of Winnipeg), Hoang Hai Nguyen (University of Winnipeg)

916 - 928

Fast Algorithm for Anchor Graph Hashing

Yasuhiro Fujiwara (NTT Communication Science Laboratories), Sekitoshi Kanai (NTT Software Innovation Center), Yasutoshi Ida (NTT Software Innovation Center), Atsutoshi Kumagai (NTT Software Innovation Center), Naonori Ueda (NTT Communication Science Labs.)

929 - 942

Adaptive Code Generation for Data-Intensive Analytics

Wangda Zhang (Columbia University), Junyoung Kim (Columbia University), Kenneth A Ross (Columbia University), Eric Sedlar (Oracle), Lukas Stadler (Oracle Labs)

943 - 956

Materializing Knowledge Bases via Trigger Graphs

Efthymia Tsamoura (Samsung AI Research), David Carral (LIRMM, Inria, University of Montpellier, CNRS), Enrico Malizia (University of Bologna), Jacopo Urbani (Vrije Universiteit Amsterdam)

957 - 969

Dealer: An End-to-End Model Marketplace with Differential Privacy

Jinfei Liu (Emory University/Georgia Institute of Technology), Jian Lou (Emory University), Junxu Liu (Emory University), Li Xiong (Emory University), Jian Pei (Simon Fraser University), Jimeng Sun (CS)

970 - 983

NOAH: Interactive Spreadsheet Exploration with Dynamic Hierarchical Overviews

Sajjadur Rahman (Megagon Labs), Mangesh Bendre (VISA Research), Yuyang Liu (University of Illinois at Urbana-Champaign), Shichu Zhu (Google LLC), Zhaoyuan Su (University of California, Irvine), Karrie Karahalios (University of Illinois at Urbana-Champaign), Aditya Parameswaran (University of California, Berkeley)

984 - 996

Efficient Bi-triangle Counting for Large Bipartite Networks

Yixing Yang (University of New South Wales), Yixiang Fang (School of Data Science, The Chinese University of Hong Kong, Shenzhen), Maria Orlowska (Polish-Japanese Institute of Information Technology), Wenjie Zhang (University of New South Wales), Xuemin Lin (University of New South Wales)

997 - 1005

Glean: Structured Extractions from Templatic Documents

Sandeep Tata (Google, USA), Navneet Potti (Google), James B Wendt (Google), Lauro Beltrão Costa (Google), Marc Najork (Google), Beliz Gunel (Stanford University)

1006 - 1018

ICS-GNN: Lightweight Interactive Community Search via Graph Neural Network

Jun Gao (Peking University), Jiazun Chen (Peking University), Zhao Li (Alibaba Group), Ji Zhang (The University of Southern Queensland)

1019 - 1032

Building Enclave-Native Storage Engines for Practical Encrypted Databases

Yuanyuan Sun (Alibaba Group), Sheng Wang (Alibaba Group), Huorong Li (Alibaba Group), Feifei Li (Alibaba Group)

1033 - 1039

From Natural Language Processing to Neural Databases

James Thorne (University of Cambridge), Majid Yazdani (Facebook), Marzieh Saeidi (Facebook), Fabrizio Silvestri (Facebook), Sebastian Riedel, Alon Y Halevy (Facebook)

1040 - 1052

Randomized Error Removal for Online Spread Estimation in Data Streaming

Haibo Wang (University of Florida), Chaoyi Ma (University of Florida), Olufemi O Odegbile (University of Florida), Shigang Chen (University of Florida), Jih-Kwon Peir (University of Florida)

1053 - 1066

Teseo and the Analysis of Structural Dynamic Graphs

Dean De Leo (Centrum Wiskunde & Informatica), Peter Boncz (CWI)

1067 - 1079

Charting the Design Space of Query Execution using VOILA

Tim Gubner (CWI), Peter Boncz (CWI)

1080 - 1092

Heracles: An Efficient Storage Model And Data Flushing For Performance Monitoring Timeseries

Zhiqi Wang (The Chinese University of Hong Kong), Jin Xue (The Chinese University of Hong Kong), Zili Shao (The Chinese University of Hong Kong)

1093 - 1101

Fine-Grained Lineage for Safer Notebook Interactions

Stephen Macke (University of Illinois at Urbana-Champaign), Aditya Parameswaran (University of California, Berkeley), Hongpu Gong (University of California, Berkeley), Doris Lee (UC Berkeley), Doris Xin (UC Berkeley), Andrew Head (University of California, Berkeley)

1102 - 1110

FREDE: Anytime Graph Embeddings

Anton Tsitsulin (University of Bonn), Marina Munkhoeva (Skoltech), Davide Mottin (Aarhus University), Panagiotis Karras (Aarhus University), Ivan Oseledets (Skolkovo Institute of Science and Technology), Emmanuel Müller (University of Bonn & Fraunhofer IAIS)

1111 - 1123

On Analyzing Graphs with Motif-Paths

Xiaodong Li (The University of Hong Kong), Reynold Cheng (The University of Hong Kong, China), Kevin Chen-Chuan Chang (University of Illinois at Urbana-Champaign), Caihua Shan (The University of Hong Kong), Chenhao Ma (The University of Hong Kong), Hongtai Cao (University of Illinois at Urbana-Champaign)

Volume 14, No. 7

Arun Kumar, Alon Halevy, and Nesime Tatbul: Front Matter

1124 - 1136

Collective Influence Maximization for Multiple Competing Products with an Awareness-to-Influence Model

Dimitris Tsaras (HKUST), George Trimponias (Amazon Search), Lefteris Ntaflos (HKUST), Dimitris Papadias (HKUST)

1137 - 1149

Finding Group Steiner Trees in Graphs with both Vertex and Edge Weights

Yahui Sun (National University of Singapore), Xiaokui Xiao (National University of Singapore), Bin Cui (Peking University), Saman Halgamuge (University of Melbourne), Theodoros Lappas (Stevens Institute of Technology), Jun Luo (Nanyang Technological University)

1150 - 1158

Optimizing Bipartite Matching in Real-World Applications by Incremental Cost Computation

Tenindra Abeywickrama (GrabTaxi Holdings Pte Ltd), Victor Liang (GrabTaxi Holdings Pte Ltd), Kian-Lee Tan (National University of Singapore)

1159 - 1165

The Case for NLP-Enhanced Database Tuning: Towards Tuning Tools that "Read the Manual"

Immanuel Trummer

1166 - 1166

Errata for "Unifying Consensus and Atomic Commitment for Effective Cloud Data Management"

Sujaya A Maiyya (University Of California, Santa Barbara), Faisal Nawab (UC Santa Cruz), Divy Agrawal (University of California, Santa Barbara), Amr El Abbadi (UC Santa Barbara)

1167 - 1174

Software-Defined Data Protection: Low Overhead Policy Compliance at the Storage Layer is Within Reach!

Zsolt István (IT University Copenhagen), Soujanya Ponnapalli (UT Austin), Vijay Chidambaram (UT Austin and VMWare)

1175 - 1187

TRACE: Real-time Compression of Streaming Trajectories in Road Networks

Tianyi Li (Aalborg University), Lu Chen (Zhejiang University), Christian S Jensen (Aalborg University), Torben Bach Pedersen (Aalborg University)

1188 - 1201

Shortest Paths and Centrality in Uncertain Networks

Arkaprava Saha (Nanyang Technological University), Ruben Brokkelkamp (CWI Amsterdam), Yllka Velaj (University of Vienna), Arijit Khan (Nanyang Technological University), Francesco Bonchi (ISI Foundation, Turin)

1202 - 1214

Adaptive Data Augmentation for Supervised Learning over Missing Data

Tongyu Liu (Renmin University of China), Ju Fan (Renmin University of China), Yinqing Luo (Renmin University of China), Nan Tang (Qatar Computing Research Institute, HBKU), Guoliang Li (Tsinghua University), Xiaoyong Du (Renmin University of China)

1215 - 1227

KLL±: Approximate Quantile Sketches over Dynamic Datasets

Fuheng Zhao, Sujaya Maiyya, Ryan Weiner, Divy Agrawal, Amr El Abbadi

1228 - 1240

Distributed Numerical and Machine Learning Computations via Two-Phase Execution of Aggregated Join Trees

Dimitrije Jankov (Rice University), Binhang Yuan (Rice University), Shangyu Luo (Rice University), Chris Jermaine (Rice University)

1241 - 1253

An Inquiry into Machine Learning-based Automatic Configuration Tuning Services on Real-World Database Management Systems

Dana M Van Aken (Carnegie Mellon University), Dongsheng Yang (Princeton University), Sebastien Brillard (Societe Generale), Ari Fiorino (Carnegie Mellon University), Bohan Zhang (OtterTune), Christian Billian (Societe Generale), Andrew Pavlo (Carnegie Mellon University)

Volume 14, No. 8

Floris Geerts: Front Matter

1254 - 1261

RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation

Nan Tang (Qatar Computing Research Institute, HBKU), Ju Fan (Renmin University of China), Fangyi Li (Renmin University of China), Jianhong Tu (Renmin University of China), Xiaoyong Du (Renmin University of China), Guoliang Li (Tsinghua University), Samuel Madden (MIT), Mourad Ouzzani (Qatar Computing Research Institute, HBKU)

1262 - 1275

Lachesis: Automated Partitioning for UDF-Centric Analytics

Jia Zou, Amitabh Das, Pratik Barhate, Arun Iyengar, Binhang Yuan, Dimitrije Jankov, Chris Jermaine

1276 - 1288

Updatable Learned Index with Precise Positions

Jiacheng Wu (Tsinghua University), Yong Zhang (Tsinghua University, China), Shimin Chen (Chinese Academy of Sciences), Yu Chen (Tsinghua University), Jin Wang (UCLA), Chunxiao Xing (Tsinghua University)

1289 - 1297

MDTP: A Multi-source Deep Traffic Prediction Framework over Spatio-Temporal Trajectory Data

Ziquan Fang (Zhejiang University), Lu Pan (Zhejiang University), Lu Chen (Zhejiang University), Yuntao Du (Zhejiang University), Yunjun Gao (Zhejiang University)

1298 - 1310

Symmetric Continuous Subgraph Matching with Bidirectional Dynamic Programming

Seunghwan Min (Seoul National University), Sung Gwan Park (Seoul National University), Kunsoo Park (Seoul National University), Dora Giammarresi (Universita Roma Tor Vergata), Giuseppe F. Italiano (LUISS University), Wook-Shin Han (POSTECH)

1311 - 1324

Approaching DRAM performance by using microsecond-latency flash memory for small-sized random read accesses: a new access method and its graph applications

Tomoya Suzuki (Kioxia Corporation), Kazuhiro Hiwada (Kioxia Corporation), Hirotsugu Kajihara (Kioxia Corporation), Shintaro Sano (Kioxia Corporation), Shuou Nomura (Kioxia Corporation), Tatsuo Shiozawa (Kioxia Corporation)

1325 - 1337

CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs

Abdelghny Orogat (Carleton University), Isabelle Liu (Carleton University), Ahmed El-Roby (Carleton University)

1338 - 1350

Tensor Relational Algebra for Distributed Machine Learning System Design

Binhang Yuan (Rice University), Dimitrije Jankov (Rice University), Jia Zou (Arizona State University), Yuxin Tang (Rice University), Daniel Bourgeois (Rice University), Chris Jermaine (Rice University)

1351 - 1364

Parallel Discrepancy Detection and Incremental Detection

Wenfei Fan (Univ. of Edinburgh), Chao Tian (Alibaba Group), Yanghao Wang (University of Edinburgh), Qiang Yin (Alibaba Group)

1365 - 1377

Towards Crowd-aware Indoor Path Planning

Tiantian Liu (Aalborg University), Huan Li (Aalborg University), Hua Lu (Roskilde University), Muhammad Aamir Cheema (Monash University), Lidan Shou (Zhejiang University)

1378 - 1391

Procedural Extensions of SQL: Understanding their usage in the wild

Surabhi Gupta (Microsoft Research India), Karthik Ramachandra (Microsoft Research India)

1392 - 1400

Discovering Related Data At Scale

Sagar Bharadwaj K S (Microsoft), Praveen Gupta (Microsoft Research), Ranjita Bhagwan (Microsoft Research), Saikat Guha (Microsoft Research)

1401 - 1413

CGPTuner: a Contextual Gaussian Process Bandit Approach for the Automatic Tuning of IT Configurations Under Varying Workload Conditions

Stefano Cereda (Politecnico di Milano), Stefano Valladares (Akamas), Paolo Cremonesi (Politecnico di Milano), Stefano Doni (Akamas)

1414 - 1426

Language-Agnostic Integrated Queries in a Managed Polyglot Runtime

Filippo Schiavio (Università della Svizzera italiana (USI)), Daniele Bonetta (Oracle Labs), Walter Binder (Università della Svizzera italiana (USI))

1427 - 1440

Achieving High Throughput and Elasticity in a Larger-than-Memory Store

Chinmay Kulkarni (University of Utah), Badrish Chandramouli (Microsoft Research), Ryan Stutsman (University of Utah)

1441 - 1453

Efficient Size-Bounded Community Search over Large Networks

Kai Yao (The University of Sydney), Lijun Chang (The University of Sydney)

Volume 14, No. 9

Rainer Gemulla: Front Matter

1454 - 1466

Minimum Vertex Augmentation

Jianwen Zhao (Chinese University of Hong Kong), Yufei Tao (The Chinese University of Hong Kong)

1467 - 1480

Database Isolation By Scheduling

Kevin P Gaffney (University of Wisconsin-Madison), Robert K Claus (UW Madison), Jignesh Patel (UW - Madison)

1481 - 1488

SaS: SSD as SQL Database System

Jong-Hyeok Park (Sungkyunkwan University), Soyee Choi (SungKyunKwan University), Gihwan Oh (Sungkyunkwan University), Sang Won Lee (Sungkyunkwan University)

1489 - 1502

FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation

Rong Zhu (Alibaba Group), Ziniu Wu (Massachusetts Institute of Technology), Yuxing Han (Alibaba Group), Kai Zeng (Alibaba Group), Andreas Pfadler (Alibaba Group), Zhengping Qian (Alibaba Group), Jingren Zhou (Alibaba Group), Bin Cui (Peking University)

1503 - 1516

Fast Augmentation Algorithms for Network Kernel Density Visualization

Tsz Nam Chan (Hong Kong Baptist University), Zhe Li (The Hong Kong Polytechnic University), Leong Hou U (University of Macau), Jianliang Xu (Hong Kong Baptist University), Reynold Cheng (The University of Hong Kong, China)

1517 - 1530

AutoGR: Automated Geo-Replication with Fast System Performance and Preserved Application Semantics

Jiawei Wang (USTC), Cheng Li (USTC), Kai Ma (University of Science and Technology of China), Jingze Huo (USTC), Feng Yan (University of Nevada, Reno), Xinyu Feng (Nanjing University), Yinlong Xu (University of Science and Technology of China)

1531 - 1543

Local Algorithms for Distance-generalized Core Decomposition over Large Dynamic Graphs

Qing Liu (Hong Kong Baptist University), Xuliang Zhu (Hong Kong Baptist University), Xin Huang (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University)

1544 - 1556

Viper: An Efficient Hybrid PMem-DRAM Key-Value Store

Lawrence Benson (Hasso Plattner Institute, University of Potsdam), Hendrik Makait (Hasso Plattner Institute, University of Potsdam), Tilmann Rabl (HPI, University of Potsdam)

1557 - 1569

Estimating Spread of Contact-Based Contagions in a Population Through Sub-Sampling

Sepanta Zeighami (University of Southern California), Cyrus Shahabi (Computer Science Department, University of Southern California), John Krumm (Microsoft Research)

1570 - 1582

Trident: Task Scheduling over Tiered Storage Systems in Big Data Platforms

Herodotos Herodotou (Cyprus University of Technology), Elena Kakoulli (Cyprus University of Technology)

1583 - 1596

Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test

Zicun Cong (Simon Fraser University), Lingyang Chu (McMaster University), Yu Yang (City University of Hong Kong), Jian Pei (Simon Fraser University)

1597 - 1605

Accelerating Large Scale Real-Time GNN Inference using Channel Pruning

Hongkuan Zhou (University of Southern California), Ajitesh Srivastava (University of Southern California), Hanqing Zeng (USC), Rajgopal Kannan (University of Southern California), Viktor K Prasanna (University of Southern California)

1606 - 1612

Towards Cost-Optimal Query Processing in the Cloud

Viktor Leis (Friedrich-Alexander-Universität Erlangen-Nürnberg), Maximilian Kuschewski (Uni Augsburg, LMU, TUM)

1613 - 1625

Automating Incremental Graph Processing with Flexible Memoization

Shufeng Gong (NorthEastern University), Chao Tian (Alibaba Group), Qiang Yin (Alibaba Group), Wenyuan Yu (Alibaba Group), Yanfeng Zhang (NorthEastern University), Liang Geng (Alibaba Group), Song Yu (NorthEastern University), Ge Yu (Northeast University), Jingren Zhou (Alibaba Group)

1626 - 1639

In-Network Support for Transaction Triaging

Theo Jepsen (Stanford University), Alberto Lerner (University of Fribourg), Fernando Pedone (Università della Svizzera italiana), Robert Soule (Yale University), Philippe Cudre-Mauroux (Exascale Infolab, Fribourg University)

1640 - 1654

Are We Ready For Learned Cardinality Estimation?

Xiaoying Wang (Simon Fraser University), Changbo Qu (Simon Fraser University), Weiyuan Wu (Simon Fraser University), Jiannan Wang (Simon Fraser University), Qingqing Zhou (Tencent Inc.)

1655 - 1667

On the algebra of data sketches

Jakub Lemiesz (Wrocław University of Science and Technology)

1668 - 1680

Massively Parallel Algorithms for Personalized PageRank

Guanhao Hou (The Chinese University of Hong Kong), Xingguang Chen (The Chinese University of Hong Kong), Sibo Wang (The Chinese University of Hong Kong), Zhewei Wei (Renmin University of China)

1681 - 1693

GeCo: Quality Counterfactual Explanations in Real Time

Maximilian Schleich (University of Washington), Zixuan Geng (University of Washington), Yihong Zhang (University of Washington), Dan Suciu (University of Washington)

1694 - 1702

Automated Feature Engineering for Algorithmic Fairness

Ricardo Salazar (TU Berlin), Felix Neutatz (TU Berlin), Ziawasch Abedjan (Leibniz Universität Hannover)

Volume 14, No. 10

Stefan Manegold: Front Matter

1703 - 1716

How to Design Robust Algorithms using Noisy Comparison Oracle

Raghavendra Addanki (University of Massachusetts Amherst), Sainyam Galhotra (University of Chicago), Barna Saha (University of California, Berkeley)

1717 - 1729

SAND: Streaming Subsequence Anomaly Detection

Paul Boniol (Université de Paris), John Paparrizos (University of Chicago), Themis Palpanas (University of Paris), Michael Franklin (University of Chicago)

1730 - 1742

Optimizing Fitness-For-Use of Differentially Private Linear Queries

Yingtai Xiao (Pennsylvania State University), Zeyu Ding (Penn State), Yuxin Wang (Penn State), Danfeng Zhang (Penn State), Daniel Kifer (Penn State)

1743 - 1755

Cryptanalysis of An Encrypted Database in SIGMOD '14

Xinle Cao (Zhejiang University), Jian Liu (Zhejiang University), Hao Lu (Zhejiang University), Kui Ren (Zhejiang University)

1756 - 1768

Unconstrained Submodular Maximization with Modular Costs: Tight Approximation and Application to Profit Maximization

Tianyuan Jin (National University of Singapore), Yu Yang (City University of Hong Kong), Renchi Yang (National University of Singapore), Jieming Shi (The Hong Kong Polytechnic University), Keke Huang (Nanyang Technological University), Xiaokui Xiao (National University of Singapore)

1769 - 1782

Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches

Yuhao Zhang (University of California, San Diego), Frank McQuillan (VMware), Nandish Jayaram (Intuit), Nikhil Kak (VMware), Ekta Khanna (VMware), Orhan Kislal (VMware), Domino Valdano (VMware), Arun Kumar (University of California, San Diego)

1783 - 1796

PR-Sketch: Monitoring Per-key Aggregation of Streaming Data with Nearly Full Accuracy

Qun Huang (Peking University), Sa Wang (Chinese Academy of Sciences)

1797 - 1804

Tensors: An abstraction for general data processing

Dimitrios Koutsoukos (ETHZ), Supun C Nakandala (University of California, San Diego), Konstantinos Karanasos (Microsoft), Karla Saur (Microsoft), Gustavo Alonso (ETHZ), Matteo Interlandi (Microsoft)

1805 - 1817

Budget Sharing for Multi-Analyst Differential Privacy

David A Pujol (Duke University), Yikai Wu (Duke University), Brandon T Fain (Duke University), Ashwin Machanavajjhala (Duke)

1818 - 1831

In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All

Rudi Poepsel-Lemaitre (Technische Universität Berlin), Martin Kiefer (TU Berlin), Joscha von Hein (TU Berlin), Jorge Arnulfo Quiane Ruiz (TU Berlin), Volker Markl (Technische Universität Berlin)

1832 - 1844

Data Acquisition for Improving Machine Learning Models

Yifan Li (York University), Xiaohui Yu (York University), Nick Koudas (University of Toronto)

1845 - 1858

Efficiently Answering Reachability and Path Queries on Temporal Bipartite Graphs

Xiaoshuang Chen (University of New South Wales), Kai Wang (University of New South Wales), Xuemin Lin (University of New South Wales), Wenjie Zhang (University of New South Wales), Lu Qin (UTS), Ying Zhang (University of Technology Sydney)

1859 - 1871

Preference Queries over Taxonomic Domains

Paolo Ciaccia (Università di Bologna), Davide Martinenghi (Politecnico di Milano), Riccardo Torlone (Universita Roma Tre)

1872 - 1885

Revisiting the Design of LSM-tree Based OLTP Storage Engine with Persistent Memory

Baoyue Yan (Beihang University), Xuntao Cheng (AZFT), Bo Jiang (Beihang University), Shibin Chen (AZFT), Canfang Shang (AZFT), Jianying Wang (AZFT), Kenry Huang (Alibaba), Xinjun Yang (AZFT), Wei Cao (AZFT), Feifei Li (Alibaba)

1886 - 1899

Kamino: Constraint-Aware Differentially Private Data Synthesis

Chang Ge (University of Waterloo), Shubhankar Mohapatra (University of Waterloo), Xi He (University of Waterloo), Ihab F Ilyas (U. of Waterloo)

1900 - 1912

Towards Cost-Effective and Elastic Cloud Database Deployment via Memory Disaggregation

Yingqiang Zhang (Alibaba Group), Chaoyi Ruan (USTC), Cheng Li (USTC), Jimmy Yang (Alibaba Group), Wei Cao (Alibaba), Feifei Li (Alibaba Group), Bo Wang (Alibaba Group), Jing Fang (Alibaba Group), Yuhui Wang (Alibaba Group), Jingze Huo (USTC), Chao Bi (USTC)

1913 - 1921

Dual-Objective Fine-Tuning of BERT for Entity Matching

Ralph Peeters (University of Mannheim), Christian Bizer (University of Mannheim)

Volume 14, No. 11

Stratos Idreos and Zack Ives: Front Matter

1922 - 1936

GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra

Maciej Besta (ETH Zurich), Zur Vonarburg-Shmaria (ETH Zurich), Yannick Schaffner (ETH Zurich), Leonardo Schwarz (ETH Zurich), Grzegorz Kwasniewski (ETH Zurich), Lukas Gianinazzi (ETH Zurich), Jakub Beranek (VSB), Kacper Janda (AGH-UST), Tobias Holenstein (ETH), Sebastian Leisinger (ETHZ), Peter Tatkowski (ETH), Esref Özdemir (ETH Zürich), Adrian Balla (ETH Zurich), Marcin Copik (ETH Zurich), Philipp Lindenberger (ETH Zurich), Marek Konieczny (AGH-UST), Onur Mutlu (ETH Zurich), Torsten Hoefler (ETH Zurich)

1937 - 1949

PATSQL: Efficient Synthesis of SQL Queries from Example Tables with Quick Inference of Projected Columns

Keita Takenouchi (NTT DATA), Takashi Ishio (Nara Institute of Science and Technology), Joji Okada (NTT DATA), Yuji Sakata (NTT DATA)

1950 - 1963

Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation

Jie Liu (University of California, Merced), Wenqian Dong (University of California, Merced), Dong Li (University of California, Merced), Qingqing Zhou (Tencent Inc.)

1964 - 1978

A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search

Mengzhao Wang (Hangzhou Dianzi University), Xiaoliang Xu (Hangzhou Dianzi University), Qiang Yue (Hangzhou Dianzi University), Yuxiang Wang (Hangzhou Dianzi University)

1979 - 1991

Towards Plug-and-Play Visual Graph Query Interfaces: Data-driven Canned Pattern Selection for Large Networks

Zifeng Yuan, Huey Eng Chua, Sourav S Bhowmick, Zekun Ye, Wook-shin Han, Byron Choi

1992 - 2005

ThunderRW: An In-Memory Graph Random Walk Engine

Shixuan Sun (National University of Singapore), Yuhang Chen (National University of Singapore), Shengliang Lu (National University of Singapore), Bingsheng He (National University of Singapore), Yuchen Li (Singapore Management University)

2006 - 2018

Butterfly-Core Community Search over Labeled Graphs

Zheng Dong (Baidu), Xin Huang (Hong Kong Baptist University), Guorui Yuan (Baidu), Hengshu Zhu (Baidu), Hui Xiong (Rutgers University)

2019 - 2032

Flow-Loss: Learning Cardinality Estimates That Matter

Parimarjan Negi (MIT CSAIL), Ryan C Marcus (MIT), Andreas Kipf (MIT), Hongzi Mao (MIT CSAIL), Nesime Tatbul (Intel Labs and MIT), Tim Kraska (MIT), Mohammad Alizadeh (Massachusetts Institute of Technology)

2033 - 2045

On Querying Historical K-Cores

Michael R Yu (UNSW), Dong Wen (University of Technology Sydney), Lu Qin (UTS), Ying Zhang (University of Technology Sydney), Wenjie Zhang (University of New South Wales), Xuemin Lin (University of New South Wales)

2046 - 2058

Frequency Estimation under Local Differential Privacy

Graham Cormode (University of Warwick), Sam Maddock (University of Warwick), Carsten Maple (University of Warwick)

2059 - 2072

Doing More with Less: Characterizing Dataset Downsampling for AutoML

Fatjon Zogaj (ETH Zurich), Jose P Cambronero Sanchez (MIT), Martin Rinard (MIT), Jürgen Cito (TU Wien and MIT)

2073 - 2086

LES3: Learning-based exact set similarity search

Yifan Li (York University), Xiaohui Yu (York University), Nick Koudas (University of Toronto)

2087 - 2100

Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture

Seung Won Min (University of Illinois at Urbana-Champaign), Kun Wu (University of Illinois at Urbana-Champaign), Sitao Huang (University of Illinois at Urbana-Champaign), Mert Hidayetoglu (University of Illinois at Urbana-Champaign), Jinjun Xiong (IBM Thomas J. Watson Research Center), Eiman Ebrahimi (NVIDIA), Deming Chen (University of Illinois at Urbana-Champaign), Wen-mei Hwu (NVIDIA Corporation)

2101 - 2113

FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS

Yifei Yang (University of California, San Diego), Matt Youill (Burnian), Matthew Woicik (MIT), Yizhou Liu (University of Wisconsin, Madison), Xiangyao Yu (University of Wisconsin-Madison), Marco Serafini (University of Massachusetts Amherst), Ashraf Aboulnaga (QCRI), Michael Stonebraker (MIT)

2114 - 2126

Approximating Median Absolute Deviation with Bounded Error

Zhiwei Chen (Tsinghua University), Shaoxu Song (Tsinghua University), Ziheng Wei (Huawei Technologies Co., Ltd.), Jingyun Fang (Huawei Technologies Co., Ltd.), Jiang Long (Huawei Technologies Co., Ltd.)

2127 - 2140

An Experimental Evaluation and Guideline for Path Finding in Weighted Dynamic Network

Mengxuan Zhang (The University of Queensland), Lei Li (University of Queensland), Xiaofang Zhou (The Hong Kong University of Science and Technology)

2141 - 2153

Robustness against Read Committed for Transaction Templates

Brecht Vandevoort (Hasselt University), Bas Ketsman (Vrije Universiteit Brussel), Christoph Koch (EPFL, Switzerland), Frank Neven (Hasselt University)

2154 - 2166

LANCET: Labeling Complex Data at Scale

Huayi Zhang (WPI), Lei Cao (MIT), Samuel Madden (MIT), Elke A Rundensteiner (Worcester Polytechnic Institute)

2167 - 2176

VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space Decomposition

Yang Li (Peking University), Yu Shen (Peking University), Wentao Zhang (Peking University), Jiawei Jiang (ETH Zurich), Yaliang Li (Alibaba Group), Bolin Ding (Data Analytics and Intelligence Lab, Alibaba Group), Jingren Zhou (Alibaba Group), Zhi Yang (Peking University), Wentao Wu (Microsoft Research), Ce Zhang (ETH), Bin Cui (Peking University)

2177 - 2189

A Queueing-Theoretic Framework for Vehicle Dispatching in Dynamic Car-Hailing

Peng Cheng (East China Normal University), Jiabao Jin (East China Normal University), Lei Chen (Hong Kong University of Science and Technology), Xuemin Lin (University of New South Wales), Libin Zheng (Sun Yat-sen University)

2190 - 2202

Data Synthesis via Differentially Private Markov Random Field

Kuntai Cai (National University of Singapore), Xiaoyu Lei (University of Connecticut), Jianxin Wei (National Univ. of Singapore), Xiaokui Xiao (National University of Singapore)

2203 - 2215

Scaling Replicated State Machines with Compartmentalization

Michael J Whittaker (UC Berkeley), Ailidani Ailidani (Microsoft), Aleksey Charapko (University of New Hampshire), Murat Demirbas (University at Buffalo, SUNY), Neil Giridharan (UC Berkeley), Joseph M Hellerstein (UC Berkeley), Heidi Howard (University of Cambridge), Ion Stoica (UC Berkeley), Adriana Szekeres (VMware)

2216 - 2229

Constructing and Analyzing the LSM Compaction Design Space

Subhadeep Sarkar (Boston University), Dimitris Staratzis (Boston University), Zichen Zhu (Boston University), Manos Athanassoulis (Boston University)

2230 - 2243

ByShard: Sharding in a Byzantine Environment

Jelle Hellings (University of California Davis), Mohammad Sadoghi (University of California, Davis)

2244 - 2257

SetSketch: Filling the Gap between MinHash and HyperLogLog

Otmar Ertl (Dynatrace Research)

2258 - 2270

CGM: An Enhanced Mechanism for Streaming Data Collection with Local Differential Privacy

Ergute Bao (National University of Singapore), Yin Yang (Hamad bin Khalifa University), Xiaokui Xiao (National University of Singapore), Bolin Ding (Data Analytics and Intelligence Lab, Alibaba Group)

2271 - 2272

Errata for "Teseo and the Analysis of Structural Dynamic Graph"

Dean De Leo (Centrum Wiskunde & Informatica), Per Fuchs (Technische Universität München), Peter Boncz (Centrum Wiskunde & Informatica)

2273 - 2282

QARTA: An ML-based System for Accurate Map Services

Mashaal Musleh (University of Minnesota), Sofiane Abbar (Qatar Computing Research Institute), Rade Stanojevic (Qatar Computing Research Institute), Mohamed Mokbel (University of Minnesota - Twin Cities)

2283 - 2295

Real-World Trajectory Sharing with Local Differential Privacy

Teddy Cunningham (University of Warwick), Graham Cormode (University of Warwick), Hakan Ferhatosmanoglu (University of Warwick), Divesh Srivastava (AT&T Labs Research)

2296 - 2304

PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes

Phanwadee Sinthong (University of California, Irvine), Michael Carey (UC Irvine)

2305 - 2313

Scalable Community Detection via Parallel Correlation Clustering

Jessica Shi (MIT), Laxman Dhulipala (MIT CSAIL), David Eisenstat (Google), Jakub Łącki (Google), Vahab Mirrokni (Google)

2314 - 2326

SlimChain: Scaling Blockchain Transactions through Off-Chain Storage and Parallel Processing

Cheng Xu (Hong Kong Baptist University), Ce Zhang (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University), Jian Pei (Simon Fraser University)

2327 - 2340

Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning

Side Li (University of California, San Diego), Arun Kumar (University of California, San Diego)

2341 - 2354

Accelerating Approximate Aggregation Queries with Expensive Predicates

Daniel Kang (Stanford University), John Guibas (Stanford University), Peter D Bailis (Stanford University), Tatsunori Hashimoto (Stanford), Yi Sun (University of Chicago), Matei Zaharia (Stanford and Databricks)

2355 - 2368

A four-dimensional Analysis of Partitioned Approximate Filters

Tobias Schmidt (TUM), Maximilian Bandle (TUM), Jana Giceva (TU Munich)

2369 - 2382

SKT: A One-Pass Multi-Sketch Data Analytics Accelerator

Monica Chiosa (ETH Zurich), Thomas B Preußer (Accemic Technologies), Gustavo Alonso (ETHZ)

2383 - 2396

A Practical Approach to Groupjoin and Nested Aggregates

Philipp Fent (TUM), Thomas Neumann (TUM)

2397 - 2409

Robust Voice Querying with MUVE: Optimally Visualizing Results of Phonetically Similar Queries

Ziyun Wei (Cornell University), Immanuel Trummer (Cornell), Connor Anderson (Cornell University)

2410 - 2418

CHEF: A Cheap and Fast Pipeline for Iteratively Cleaning Label Uncertainties

Yinjun Wu (University of Pennsylvania), James Weimer (University of Pennsylvania), Susan B Davidson (University of Pennsylvania)

2419 - 2431

COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Tarique Siddiqui (Microsoft Research), Surajit Chaudhuri (Microsoft), Vivek Narasayya (Microsoft)

2432 - 2444

Crystal: A Unified Cache Storage System for Analytical Databases

Dominik Durner (TUM), Badrish Chandramouli (Microsoft Research), Yinan Li (Microsoft Research)

2445 - 2458

The Smallest Extraction Problem

Valerio Cetorelli (Roma Tre University), Paolo Atzeni (Univ. of Roma 3), Valter Crescenzi (Roma Tre University), Franco Milicchio (Roma Tre University)

2459 - 2472

Deep Learning for Blocking in Entity Matching: A Design Space Exploration

Saravanan Thirumuruganathan (QCRI), Han Li (Amazon Alexa AI), Nan Tang (Qatar Computing Research Institute, HBKU), Mourad Ouzzani (Qatar Computing Research Institute, HBKU), Yash Govind (UW - Madison), Derek Paulsen (University of Wisconsin-Madison), Glenn M Fung (American Family Insurance), AnHai Doan (University of Wisconsin-Madison)

2473 - 2482

Grain: Improving Data Efficiency of Graph Neural Networks via Diversified Influence Maximization

Wentao Zhang (Peking University), Zhi Yang (Peking University), YeXin Wang (Peking University), Yu Shen (Peking University), Yang Li (Peking University), Liang Wang (Alibaba Group), Bin Cui (Peking University)

2483 - 2490

Database Technology for the Masses: Sub-Operators as First-Class Entities

Maximilian Bandle (TUM), Jana Giceva (TU Munich)

2491 - 2504

Columnar Storage and List-based Processing for Graph Database Management Systems

Pranjal Gupta (University of Waterloo), Amine Mhedhbi (University of Waterloo), Semih Salihoglu (University of Waterloo)

2505 - 2518

Phoebe: A Learning-based Checkpoint Optimizer

Yiwen Zhu (Microsoft), Matteo Interlandi (Microsoft), Abhishek Roy (Microsoft), Krishnadhan Das (Microsoft), Hiren Patel (Microsoft), Malay Bag (Facebook), Hitesh Sharma (Google), Alekh Jindal (Microsoft)

2519 - 2532

Tailoring Data Source Distributions for Fairness-aware Data Integration

Fatemeh Nargesian (University of Rochester), Abolfazl Asudeh (University of Illinois at Chicago), H. V. Jagadish (University of Michigan)

2533 - 2545

Missing Value Imputation on Multidimensional Time Series

Parikshit Bansal (IIT Bombay), Prathamesh Deshpande (IIT Bombay), Sunita Sarawagi (Indian Institute of Technology)

2546 - 2554

Horizon: Scalable Dependency-driven Data Cleaning

El Kindi Rezig (MIT), Mourad Ouzzani (Qatar Computing Research Institute, HBKU), Walid G. Aref (Purdue University), Ahmed Elmagarmid (QCRI), Ahmed Mahmood (Purdue University), Michael Stonebraker (MIT)

2555 - 2562

Declarative Data Serving: The Future of Machine Learning Inference on the Edge

Ted Shaowang (University of Chicago), Nilesh Jain (Intel), Dennis Matthews (Intel), Sanjay Krishnan (U Chicago)

2563 - 2575

Auto-Pipeline: Synthesize Data Pipelines By-Target Using Reinforcement Learning and Search

Junwen Yang, Yeye He, Surajit Chaudhuri

2576 - 2585

Explaining Inference Queries with Bayesian Optimization

Brandon Lockhart (Simon Fraser University), Jinglin Peng (Simon Fraser University), Weiyuan Wu (Simon Fraser University), Jiannan Wang (Simon Fraser University), Eugene Wu (Columbia University)

2586 - 2598

Decomposed Bounded Floats for Fast Compression and Queries

Chunwei Liu (University of Chicago), Hao Jiang (University of Chicago), John Paparrizos (University of Chicago), Aaron J Elmore (University of Chicago)

2599 - 2612

Beyond Equi-joins: Ranking, Enumeration and Factorization

Nikolaos Tziavelis (Northeastern University), Wolfgang Gatterbauer (Northeastern University), Mirek Riedewald (Northeastern University)

2613 - 2626

Exathlon: A Benchmark for Explainable Anomaly Detection over Time Series

Vincent Jacob (Ecole Polytechnique), Fei Song (Ecole Polytechnique), Arnaud Stiegler (Ecole Polytechnique), Bijan Rad (Ecole Polytechnique), Yanlei Diao (Ecole Polytechnique), Nesime Tatbul (Intel Labs and MIT)

2627 - 2641

Progressive Compressed Records: Taking a Byte out of Deep Learning Data

Michael Kuchnik (Carnegie Mellon University), George Amvrosiadis (Carnegie Mellon University), Virginia Smith (Carnegie Mellon University)

2642 - 2654

TQEL: Framework for Query-Driven Linking of Top-K Entities in Social Media Blogs

Abdulrahman Alsaudi (University of California Irvine), Yasser Altowim (King Abdulaziz City for Science and Technology), Sharad Mehrotra (U.C. Irvine), Yaming Yu (University of California Irvine)

Volume 14, No. 12

Xin Luna Dong and Felix Naumann: Front Matter

2655 - 2658

KDV-Explorer: A Near Real-Time Kernel Density Visualization System for Spatial Analysis

Tsz Nam Chan (Hong Kong Baptist University), Pak Lon Ip (University of Macau), Leong Hou U (University of Macau), Weng Hou Tong (University of Macau), Shivansh Mittal (The University of Hong Kong), Ye Li (University of Macau), Reynold Cheng (The University of Hong Kong, China)

2659 - 2662

Refiner: A Reliable Incentive-Driven Federated Learning System Powered by Blockchain

Zhebin Zhang (Zhejiang University), Dajie Dong (Zhejiang University), Yuhang Ma (Zhejiang University), Yilong Ying (Zhejiang University), Dawei Jiang (Zhejiang University), Ke Chen (Zhejiang University), Lidan Shou (Zhejiang University), Gang Chen (Zhejiang University)

2663 - 2666

MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming

Valter Uotila (University of Helsinki), Jiaheng Lu (University of Helsinki), Dieter Gawlick (Oracle), Zhen Hua Liu (Oracle), Souripriya Das (Oracle), Gregory Pogossiants (Soulmates.ai)

2667 - 2670

Cquirrel: Continuous Query Processing over Acyclic Relational Schemas

Qichen Wang (Hong Kong University of Science and Technology), Chaoqi Zhang (Hong Kong University of Science and Technology), Danish Alsayed (Hong Kong University of Science and Technology), Ke Yi (Hong Kong Univ. of Science and Technology), Bin Wu (Alibaba), Feifei Li (Alibaba Group), Chaoqun Zhan (Alibaba Inc.)

2671 - 2674

DeFiHap: Detecting and Fixing HiveQL Anti-Patterns

Yuetian Mao (Shanghai Jiao Tong University), Shuai Yuan (Shanghai Jiao Tong University), Nan Cui (Shanghai Jiao Tong University), Tianjiao Du (Shanghai Jiao Tong University), Beijun Shen (Shanghai Jiao Tong University), Yuting Chen (Shanghai Jiao Tong University)

2675 - 2678

A Demonstration of KGLac: A Data Discovery and Enrichment Platform for Data Science

Ahmed Helal (Concordia University), Mossad Helali (Concordia University), Khaled Ammar (BorealisAI), Essam Mansour (Concordia University)

2679 - 2682

Assessing the Existence of a Model in your Data with ADESIT

Pierre Faure-Giovagnoli, Marie Le Guilly, Vasile-Marian Scuturici, Jean-Marc Petit

2683 - 2686

Path Advisor: A Multi-Functional Campus Map Tool for Shortest Path

Yinzhao Yan (Hong Kong University of Science and Technology), Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)

2687 - 2690

Intermittent Human-in-the-Loop Model Selection using Cerebro: A Demonstration

Liangde Li (UC San Diego), Supun C Nakandala (University of California, San Diego), Arun Kumar (University of California, San Diego)

2691 - 2694

Low-Latency Compilation of SQL Queries to Machine Code

Henning Funke (TU Dortmund University), Jens Teubner (TU Dortmund University)

2695 - 2698

Sound of Databases: Sonification of a Semantic Web Database Engine

Sven Groppe (University of Lübeck), Rico Klinckenberg (University of Lübeck), Benjamin Warnke (University of Lübeck)

2699 - 2702

HyMAC: A Hybrid Matrix Computation System

Zihao Chen (East China Normal University), Zhizhen Xu (East China Normal University), Chen Xu (East China Normal University), Juan Soto (TU Berlin), Volker Markl (Technische Universität Berlin), Weining Qian (East China Normal University), Aoying Zhou (East China Normal University)

2703 - 2706

GraphScope: A One-Stop Large Graph Processing System

Jingbo Xu (Peking University & Alibaba Group), Zhanning Bai (Ant Group), Wenfei Fan (Univ. of Edinburgh), Longbin Lai (Alibaba Group), Xue Li (Alibaba Group), Zhao Li (Alibaba Group), Zhengping Qian (Alibaba Group), Lei Wang (Alibaba Group), Yanyan Wang (Ant Group), Wenyuan Yu (Alibaba Group), Jingren Zhou (Alibaba Group)

2707 - 2710

Just Move It! Dynamic Parameter Allocation in Action

Alexander Renz-Wieland (Technische Universität Berlin), Tobias Drobisch (TU Berlin), Zoi Kaoudi (TU Berlin), Rainer Gemulla (Universität Mannheim), Volker Markl (Technische Universität Berlin)

2711 - 2714

CBench: Demonstrating Comprehensive Evaluation of Question Answering Systems over Knowledge Graphs Through Deep Analysis of Benchmarks

Abdelghny Orogat (Carleton University), Ahmed El-Roby (Carleton University)

2715 - 2718

PostCENN: PostgreSQL with Machine Learning Models for Cardinality Estimation

Lucas Woltmann (Technische Universität Dresden), Dominik Olwig (Technische Universität Dresden), Claudio Hartmann (Technische Universität Dresden), Dirk Habich (TU Dresden), Wolfgang Lehner (TU Dresden)

2719 - 2722

DENOUNCER: Detection of Unfairness in Classifiers

Jinyang Li (University of Michigan), Yuval Moskovitch (University of Michigan), H. V. Jagadish (University of Michigan)

2723 - 2726

A Demonstration of QARTA: An ML-based System for Accurate Map Services

Sofiane Abbar (Qatar Computing Research Institute), Rade Stanojevic (Qatar Computing Research Institute), Mashaal Musleh (University of Minnesota), Mohamed M Elshrif (Qatar Computing Research Institute), Mohamed Mokbel (University of Minnesota - Twin Cities)

2727 - 2730

TraNCE: Transforming Nested Collections Efficiently

Jaclyn Smith (Oxford University), Michael Benedikt (Oxford University), Brandon Moore (Oxford University), Milos Nikolic (University of Edinburgh)

2731 - 2734

Debugging Missing Answers for Spark Queries over Nested Data with Breadcrumb

Ralf Diestelkämper (University of Stuttgart), Seokki Lee (University of Cincinnati), Boris Glavic (Illinois Institute of Technology), Melanie Herschel (Universität Stuttgart)

2735 - 2738

Demonstration of Panda: A Weakly Supervised Entity Matching System

Renzhi Wu (Georgia Institute of Technology), Prem Sakala (Georgia Institute of Technology), Peng Li (GATECH), Xu Chu (GATECH), Yeye He (Microsoft Research)

2739 - 2742

Automatic Data Acquisition for Deep Learning

Jiabin Liu (Tsinghua University), Fu Zhu (Tsinghua University), Chengliang Chai (Tsinghua University), Yuyu Luo (Tsinghua University), Nan Tang (Qatar Computing Research Institute, HBKU)

2743 - 2746

DBMind: A Self-Driving Platform in openGauss

Xuanhe Zhou (Tsinghua), Lianyuan Jin (Tsinghua University), Ji Sun (Tsinghua University), Xinyang Zhao (Tsinghua University), Xiang Yu (Tsinghua University), Shifu Li (Huawei), Tianqing Wang (Huawei), Kun Li (Huawei), Luyang Liu (Huawei)

2747 - 2750

Demonstration of Dealer: An End-to-End Model Marketplace with Differential Privacy

Qiongqiong Lin (Zhejiang University), Jiayao Zhang (Zhejiang University), Jinfei Liu (Zhejiang University), Kui Ren (Zhejiang University), Jian Lou (Emory University), Junxu Liu (Renmin University of China), Li Xiong (Emory University), Jian Pei (Simon Fraser University), Jimeng Sun (UIUC)

2751 - 2754

Assassin: an Automatic claSSificAtion system baSed on algorithm SelectIoN

TianYu Mu (Harbin Institute of Technology), Hongzhi Wang (Harbin Institute of Technology), ShengHe Zheng (Harbin Institute of Technology), ShaoQing Zhang (Harbin Institute of Technology), Cheng Liang (Harbin Institute of Technology), HaoYun Tang (HIT)

2755 - 2758

ATLANTIC: Making Database Differentially Private and Faster with Accuracy Guarantee

Lei Cao (MIT), Dongqing Xiao (Google), Yizhou Yan (Worcester Polytechnic Institute), Samuel Madden (MIT), Guoliang Li (Tsinghua University)

2759 - 2762

Demo of Marius: A System for Large-scale Graph Embeddings

Anze Xie (University of Wisconsin-Madison), Anders Carlsson (University of Wisconsin-Madison), Jason M Mohoney (University of Wisconsin-Madison), Roger Waleffe (University of Wisconsin-Madison), Shanan Peters (University of Wisconsin-Madison), Theodoros Rekatsinas (University of Wisconsin-Madison), Shivaram Venkataraman (University of Wisconsin, Madison)

2763 - 2766

From Papers to Practice: The openclean Open-Source Data Cleaning Library

Heiko Mueller (NYU), Sonia Castelo (New York University), Munaf A Qazi (New York University), Juliana Freire (New York University)

2767 - 2770

Demonstration of Apperception: A Database Management System for Geospatial Video Data

Vanessa Lin (University of California Berkeley), Yongming Ge (University of California Berkeley), Maureen Daum (University of Washington), Alvin Cheung (University of California, Berkeley), Brandon Haynes (Microsoft), Magdalena Balazinska (UW)

2771 - 2774

Automated energy consumption forecasting with EnForce

Mary Karatzoglidi (National Technical University of Athens), Paraskevas Kerasiotis (National Technical University of Athens), Verena Kantere (National Technical University of Athens)

2775 - 2778

RealGraph-Web: A Graph Analysis Platform on the Web

Myung-Hwan Jang (Hanyang University), Yong-Yeon Jo (Hanyang University), Sang-Wook Kim (Hanyang University, Korea)

2779 - 2782

Interactive Demonstration of SQLCHECK

Arthita Ghosh (Georgia Institute Of Technology), Deven Bansod (Georgia Institute of Technology), Arpit Narechania (Georgia Institute of Technology), Visweswara Sai Prashanth Dintyala (Georgia Institute of Technology), Su Timurturkan (Georgia Institute of Technology), Joy Arulraj (Georgia Tech)

2783 - 2786

T-Cove: An exposure tracing System based on Cleaning Wi-Fi Events on Organizational Premises

Yiming Lin (University of California, Irvine), Pramod Khargonekar, Sharad Mehrotra (U.C. Irvine), Nalini Venkatasubramanian (University of California, Irvine)

2787 - 2790

Demonstration of Generating Explanations for Black-Box Algorithms Using Lewis

Paul Y Wang (University of California, San Diego), Sainyam Galhotra (University of Chicago), Romila Pradhan (University of California San Diego), Babak Salimi (University of California at San Diego)

2791 - 2794

Auctus: A Dataset Search Engine for Data Discovery and Augmentation

Sonia Castelo (New York University), Remi Rampin (NYU), Aécio Santos (New York University), Aline Bessa (New York University), Fernando Chirigati (NYU), Juliana Freire (New York University)

2795 - 2798

A Demonstration of Relic: A System for REtrospective Lineage InferenCe of Data Workflows

Mohammed Suhail Rehman (University of Chicago), Silu Huang (Microsoft Research), Aaron J Elmore (University of Chicago)

2799 - 2802

SChain: A Scalable Consortium Blockchain Exploiting Intra- and Inter-Block Concurrency

Zhihao Chen (East China Normal University), Haizhen Zhuo (Ant Group), Quanqing Xu (Ant Group), Xiaodong Qi (East China Normal University), Chengyu Zhu (East China Normal University), Zhao Zhang (East China Normal University), Cheqing Jin (East China Normal University), Aoying Zhou (East China Normal University), Ying Yan (Ant Group), Hui Zhang (Ant Group)

2803 - 2806

EPICGen: An Experimental Platform for Indoor Congestion Generation and Forecasting

Chrysovalantis Anastasiou (University of Southern California), Constantinos Costa (University of Pittsburgh), Panos K. Chrysanthis (University of Pittsburgh), Cyrus Shahabi (Computer Science Department, University of Southern California)

2807 - 2810

Wikinegata: a Knowledge Base with Interesting Negative Statements

Hiba Arnaout (Max-Planck-Institut für Informatik), Simon Razniewski (Max-Planck-Institut für Informatik, Germany), Gerhard Weikum (Max-Planck-Institut für Informatik), Jeff Z. Pan (The University of Edinburgh)

2811 - 2814

Full Encryption: An end to end encryption mechanism in GaussDB

Liang Guo (Huawei Technologies Co., Ltd.), Jinwei Zhu (Huawei Technologies Co., Ltd.), Jiayang Liu (Huawei Technologies Co., Ltd.), Kun Cheng (Huawei Technologies Co., Ltd.)

2815 - 2818

DatAgent: The Imminent Age of Intelligent Data Assistants

Antonis Mandamadiotis (Athena Research Center), Stavroula Eleftherakis (Athena Research Center), Apostolos Glenis (Athena Research Center), Dimitrios Skoutas (Athena Research Center), Yannis Stavrakas (Athena Research Center), Georgia Koutrika (Athena Research Center)

2819 - 2822

DICE: Data Discovery by Example

El Kindi Rezig (MIT), Anshul Bhandari (National Institute of Technology Hamirpur), Anna Fariha (Microsoft), Benjamin Price (MIT Lincoln Laboratory), Allan Vanterpool (United States Air Force), Vijay Gadepally (MIT Lincoln Laboratory), Michael Stonebraker (MIT)

2823 - 2826

AnyOLAP: Analytical Processing of Arbitrary Data-Intensive Applications without ETL

Felix M Schuhknecht (Johannes Gutenberg-University Mainz), Aaron Priesterroth (Johannes Gutenberg-University Mainz), Justus Henneberg (Johannes Gutenberg-University Mainz), Reza Salkhordeh (Johannes Gutenberg-University Mainz)

2827 - 2830

A Demonstration of the Exathlon Benchmarking Platform for Explainable Anomaly Detection

Vincent Jacob (Ecole Polytechnique), Fei Song (Ecole Polytechnique), Arnaud Stiegler (Ecole Polytechnique), Bijan Rad (Ecole Polytechnique), Yanlei Diao (Ecole Polytechnique), Nesime Tatbul (Intel Labs and MIT)

2831 - 2834

An Intermediate Representation for Hybrid Database and Machine Learning Workloads

Amir Shaikhha (University of Edinburgh), Maximilian Schleich (University of Washington), Dan Olteanu (University of Zurich)

2835 - 2838

How Divergent Is Your Data?

Eliana Pastor (Politecnico di Torino), Andrew Gavgavian (University of California, Santa Cruz), Elena Baralis (Dipartimento di Automatica e Informatica Politecnico di Torino), Luca de Alfaro (University of California, Santa Cruz)

2839 - 2842

An Extensible and Reusable Pipeline for Automated Utterance Paraphrases

Auday Berro (Université Claude Bernard Lyon 1), Mohammad-Ali Yaghub Zade Fard (University of New South Wales), Marcos Baez (Université Claude Bernard Lyon 1), Boualem Benatallah (UNSW Sydney, Australia & LIRIS-Lyon1, France), Khalid Benabdeslem (Université Claude Bernard Lyon 1)

2843 - 2846

Compliant Geo-distributed Data Processing in Action

Kaustubh Beedkar (TU Berlin), David Brekardin (TU Berlin), Jorge Arnulfo Quiane Ruiz (TU Berlin), Volker Markl (Technische Universität Berlin)

2847 - 2850

Query-Driven Video Event Processing for the Internet of Multimedia Things

Piyush Yadav (Insight SFI Research Centre for Data Analytics), Dhaval Salwala (Insight SFI Centre for Data Analytics), Felipe Pontes (Insight SFI Centre for Data Analytics), Praneet Dhingra (Insight SFI Centre for Data Analytics), Edward Curry (Insight SFI Centre for Data Analytics)

2851 - 2854

A Demonstration of NoDA: Unified Access to NoSQL Stores

Nikolaos Koutroumanis (University of Piraeus), Kousathanas Nikolaos (University of Piraeus), Christos Doulkeridis (University of Piraeus), Akrivi Vlachou (University of the Aegean)

2855 - 2858

AutoExecutor: Predictive Parallelism for Spark SQL Queries

Rathijit Sen (Microsoft), Abhishek Roy (Microsoft), Alekh Jindal (Keebo), Rui Fang (Microsoft), Jeff Zheng (Microsoft), Xiaolei Liu (Microsoft), Ruiping Li (Microsoft)

2859 - 2862

Catch a Blowfish Alive: A Demonstration of Policy-Aware Differential Privacy for Interactive Data Exploration

Jiaxiang Liu (University of Waterloo), Karl Knopf (University of Waterloo), Yiqing Tan (University of Waterloo), Bolin Ding (Data Analytics and Intelligence Lab, Alibaba Group), Xi He (University of Waterloo)

2863 - 2866

RONIN: Data Lake Exploration

Paul Ouellette (University of Rochester), Aidan Sciortino (University of Rochester), Fatemeh Nargesian (University of Rochester), Bahar Ghadiri Bashardoost (University of Toronto), Erkang Zhu (Microsoft Research), Ken Pu (Ontario Tech University), Renée J. Miller (Northeastern University)

2867 - 2870

SAND in Action: Subsequence Anomaly Detection for Streams

Paul Boniol (Université de Paris), John Paparrizos (University of Chicago), Themis Palpanas (University of Paris), Michael Franklin (University of Chicago)

2871 - 2874

Valentine in Action: Matching Tabular Data at Scale

Christos Koutras (TU Delft), Kyriakos Psarakis (TU Delft), George Siachamis (TU Delft), Andra Ionescu (TU Delft), Marios Fragkoulis (TU Delft), Angela Bonifati (Univ. of Lyon), Asterios Katsifodimos (TU Delft)

2875 - 2878

GEDet: Detecting Erroneous Nodes with A Few Examples

Sheng Guan (Case Western Reserve University), Hanchao Ma (Case Western Reserve University), Sutanay Choudhury (Pacific Northwest National Laboratory), Yinghui Wu (Case Western Reserve University)

2879 - 2892

GraphScope: A Unified Engine For Big Graph Processing

Wenfei Fan (Univ. of Edinburgh), Tao He (Alibaba Group), Longbin Lai (Alibaba Group), Xue Li (Alibaba Group), Yong Li (Alibaba Group), Zhao Li (Alibaba Group), Zhengping Qian (Alibaba Group), Chao Tian (Alibaba Group), Lei Wang (Alibaba Group), Jingbo Xu (Peking University & Alibaba Group), Youyang Yao (Alibaba Group), Qiang Yin (Alibaba Group), Wenyuan Yu (Alibaba Group), Jingren Zhou (Alibaba Group), Diwen Zhu (Alibaba), Rong Zhu (Alibaba Group)

2893 - 2905

Davos: A System for Interactive Data-Driven Decision Making

Zeyuan Shang (Einblick Analytics), Emanuel Zgraggen (Einblick Analytics), Benedetto Buratti (Einblick Analytics), Philipp Eichmann (Einblick Analytics), Navid Karimeddiny (Einblick Analytics), Charlie Meyer (Einblick Analytics), Wesley Runnels (Einblick Analytics), Tim Kraska (Einblick Analytics)

2906 - 2917

Mixer: Efficiently Understanding and Retrieving Visual Content at Web-Scale

An Qin (Baidu Inc.), Mengbai Xiao (Shandong University), Yongwei Wu (Baidu Inc.), Xinjie Huang (Baidu Inc.), Xiaodong Zhang (Ohio State U.)

2918 - 2931

Towards A Polyglot Framework for Factorized ML

David A Justo (UC San Diego), Shaoqing Yi (UC San Diego), Lukas Stadler (Oracle Labs), Nadia Polikarpova (University of California, San Diego), Arun Kumar (University of California, San Diego)

2932 - 2944

The End of Moore's Law and the Rise of The Data Processor

Niv Dayan (Pliops), Moshe Twitto (Pliops), Yuval Rochman (Pliops), Uri Beitler (Pliops), Itai Ben Zion (Pliops), Edward Bortnikov (Pliops), Shmuel Dashevsky (Pliops), Ofer Frishman (Pliops), Evgeni Ginzburg (Pliops), Igal Maly (Pliops), Avraham (Poza) Meir (Pliops), Mark Mokryn (Pliops), Iddo Naiss (Pliops), Noam Rabinovich (Pliops)

2945 - 2958

tf.data: A Machine Learning Data Processing Framework

Derek Murray (Microsoft), Jiri Simsa (Google), Ana Klimovic (ETH Zurich), Ihor Indyk (Google)

2959 - 2971

Not Black-Box Anymore! Enabling Analytics-Aware Optimizations in Teradata Vantage

Mohamed Eltabakh (Teradata), Anantha Subramanian (Teradata Labs), Awny AlOmari (Teradata Labs), Mohammed Al-kateb (Teradata), Sanjay Nair (Teradata), Mahbub Hasan (Teradata Labs), Wellington Cabrera (Teradata Labs), Charles Zhang (Teradata Labs), Amit Kishore (Teradata Labs), Snigdha Prasad (Teradata Labs)

2972 - 2985

Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters

Yingda Chen (Alibaba Group), Jiamang Wang (Alibaba), Yifeng Lu (Alibaba Group), Ying Han (Alibaba Group), Zhiqiang Lv (Alibaba Group), Xuebin Min (Alibaba Group), Hua Cai (Alibaba Group), Wei Zhang (Alibaba Group), Haochuan Fan (Alibaba Group), Chao Li (Alibaba Group), Tao Guan (Alibaba Group), Wei Lin (Alibaba Group), Yangqing Jia (Alibaba Group), Jingren Zhou (Alibaba Group)

2986 - 2998

Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google

Ankur Agiwal (Google Inc), Kevin Lai (Google Inc), Gokul Nath Babu Manoharan (Google), Indrajit Roy (Google Inc), Jagan Sankaranarayanan (Google), Hao Zhang (Google Inc), Tao Zou (Google Inc), Min Chen (Google Inc), Jim Chen (Google Inc), Ming Dai (Google Inc), Thanh Do (Google, LLC), Haoyu Gao (Google Inc), Haoyan Geng (Google Inc), Raman Grover (Google Inc), Bo Huang (Google Inc), Yanlai Huang (Google Inc), Adam Li (Google Inc), Jianyi Liang (Google Inc), Tao Lin (Google Inc), Li Liu (Google Inc), Yao Liu (Google Inc), Xi Mao (Google Inc), Maya Meng (Google Inc), Prashant Mishra (Google Inc), Jay Patel (Google Inc), Rajesh SR (Google Inc), Vijayshankar Raman (Google), Sourashis Roy (Google Inc), Mayank Singh Shishodia (Google Inc), Tianhang Sun (Google Inc), Justin Tang (Google Inc), Jun Tatemura (Google), Sagar Trehan (Google Inc), Ramkumar Vadali (Google Inc), Prasanna Venkatasubramanian (Google Inc), Joey Zhang (Google Inc), Kefei Zhang (Google Inc), Yupu Zhang (Google Inc), Zeleng Zhuang (Google Inc), Goetz Graefe (Google), Divy Agrawal (Google), Jeff Naughton (Google), Sujata Kosalge (Google Inc), Hakan Hacigumus (Google)

2999 - 3013

The Art of Balance: A RateupDB Experience of Building a CPU/GPU Hybrid Database Product

Rubao Lee (Rateup Inc.), Minghong Zhou (Rateup Inc.), Chi Li (Rateup Inc.), Shenggang Hu (Rateup Inc.), Jianping Teng (Rateup Inc.), Dongyang Li (Rateup Inc.), Xiaodong Zhang (Ohio State U.)

3014 - 3027

RAMP-TAO: Layering Atomic Transactions on Facebook's Online TAO Data Store

Audrey Cheng (UC Berkeley), Xiao Shi (Facebook, Inc.), Lu Pan (Facebook, Inc.), Anthony Simpson (Facebook, Inc.), Neil Wheaton (Facebook, Inc.), Shilpa Lawande (Facebook, Inc.), Nathan Bronson (Rockset), Peter Bailis, Natacha Crooks (UC Berkeley), Ion Stoica (UC Berkeley)

3028 - 3041

openGauss: An Autonomous Database System

Guoliang Li (Tsinghua University), Xuanhe Zhou (Tsinghua), Ji Sun (Tsinghua University), Xiang Yu (Tsinghua University), Yue Han (Tsinghua University), Lianyuan Jin (Tsinghua University), Wenbo Li (Tsinghua University), Tianqing Wang (Huawei), Shifu Li (Huawei)

3043 - 3055

Hyperspace: The Indexing Subsystem of Azure Synapse

Rahul Potharaju (Microsoft), Terry Kim (Microsoft), Eunjin Song (Microsoft), Wentao Wu (Microsoft Research), Lev Novik (Microsoft), Apoorve Dave (Microsoft), Pouria Pirzadeh (Microsoft), Andrew Fogarty (Microsoft), Jiying Li (Microsoft), Vidip Acharya (Microsoft), Sinduja Ramanujam (Microsoft), Nico Bruno (Microsoft), Cesar Galindo-Legaria (Microsoft), Vivek Narasayya (Microsoft), Surajit Chaudhuri (Microsoft), Anil Nori (Microsoft), Tomas Talius (Microsoft), Raghu Ramakrishnan (Microsoft)

3056 - 3068

SpeakNav: Voice-based Route Description Language Understanding for Template Driven Path Search

Bolong Zheng (Huazhong University of Science and Technology), Lei Bi (Huazhong University of Science and Technology), Juan Cao (Huazhong University of Science and Technology), Hua Chai (Didi Chuxing), Jun Fang (Didi Chuxing), Lu Chen (Zhejiang University), Yunjun Gao (Zhejiang University), Xiaofang Zhou (The Hong Kong University of Science and Technology), Christian S Jensen (Aalborg University)

3069 - 3082

Railgun: managing large streaming windows under MAD requirements

Ana Sofia Gomes (Feedzai), João Oliveirinha (Feedzai), Pedro Cardoso (Feedzai), Pedro Bizarro (Feedzai)

3083 - 3095

Big Metadata: When Metadata is Big Data

Pavan Edara (Google), Mosha Pasumansky (Google)

3096 - 3109

Tanium Reveal: A Federated Search Engine for Querying Unstructured File Data on Large Enterprise Networks

Joshua F Stoddard (Tanium), Adam Mustafa (Tanium), Naveen Goela (Tanium)

3110 - 3121

Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile

Can Gencer (Hazelcast Inc.), Marko Topolnik (Hazelcast Inc.), Viliam Ďurina (Hazelcast Inc.), Emin Demirci (Layer Co), Basri Kahveci, Ali Gürbüz (Hazelcast Inc.), Ondřej Lukas (Hazelcast Inc.), Jozsef Bartok (Hazelcast Inc.), Grzegorz Gierlach (Hazelcast Inc), František Hartman (Hazelcast Inc.), Ufuk Yilmaz (Hazelcast Inc.), Mehmet Doğan (Layer Co), Mohamed Mandouh (Mansoura University), Marios Fragkoulis (TU Delft), Asterios Katsifodimos (TU Delft)

3122 - 3134

SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft

Abhishek Roy (Microsoft), Alekh Jindal (Keebo), Priyanka Gomatam (Microsoft), Xiating Ouyang (University of Wisconsin-Madison), Ashit Gosalia (Microsoft), Nishkam Ravi (Microsoft), Swinky Mann (Microsoft), Prakhar Jain (Microsoft)

3135 - 3147

Watermarks in Stream Processing Systems: Semantics and Comparative Analysis of Apache Flink and Google Cloud Dataflow

Tyler Akidau (Snowflake Inc), Edmon Begoli (Oak Ridge National Laboratory), Slava Chernyak (Google Inc.), Fabian Hueske (Ververica GmbH), Kathryn Knight (Oak Ridge National Laboratory), Kenneth Knowles (Google Inc.), Daniel Mills (Google Inc.), Dan Sotolongo (Snowflake Inc.)

3148 - 3161

The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward

Conor Power (Microsoft), Hiren Patel (Microsoft), Alekh Jindal (Keebo), Jyoti Leeka (Microsoft), Bob Jenkins (Microsoft), Michael Rys (Microsoft), Ed Triou (Microsoft), Dexin Zhu (Microsoft), Lucky Katahanas (Microsoft), Chakrapani Bhat Talapady (Microsoft), Josh Rowe (Microsoft), Fan Zhang (Microsoft), Rich Draves (Microsoft), Marc Friedman (Microsoft), Ivan Santa (Microsoft), Amrish Kumar (Microsoft)

3162 - 3163

The evolution of Amazon Redshift

Ippokratis Pandis

3240 - 3252

Using VDMS to Index and Search 100M Images

Luis Remis (ApertureData), Chaunte W Lacewell (Intel Corporation)

3175 - 3177

On the Limits of Machine Knowledge: Completeness, Recall and Negation in Web-scale Knowledge Bases

Simon Razniewski (Max-Planck-Institut für Informatik, Germany), Hiba Arnaout (Max-Planck-Institut für Informatik), Shrestha Ghosh (Max-Planck-Institut für Informatik), Fabian Suchanek (Télécom ParisTech)

3178 - 3181

Managing ML Pipelines: Feature Stores and the Coming Wave of Embedding Ecosystems

Laurel Orr (Stanford University), Atindriyo Sanyal (Uber AI), Xiao Ling (Apple), Karan Goel (Stanford), Megan Leszczynski (Stanford)

3182 - 3185

Data Augmentation for ML-driven Data Preparation and Integration

Yuliang Li (Megagon Labs), Xiaolan Wang (Megagon Labs), Zhengjie Miao (Duke University), Wang-Chiew Tan (Facebook AI)

3186 - 3189

Array DBMS: Past, Present, and (Near) Future

Ramon Antonio Rodriges Zalipynis (National Research University Higher School of Economics)

3190 - 3193

Machine Learning for Databases

Guoliang Li (Tsinghua University), Xuanhe Zhou (Tsinghua), Lei Cao (MIT)

3194 - 3197

Extending the Lifetime of NVM: Challenges and Opportunities

Saeed Kargar (UCSC), Faisal Nawab (University of California at Irvine)

3198 - 3201

New Trends in High-D Vector Similarity Search: AI-driven, Progressive, and Distributed

Karima Echihabi (Mohammed VI Polytechnic University), Themis Palpanas (University of Paris), Kostas Zoumpatianos (Snowflake Computing)

3202 - 3205

Machine Learning for Cloud Data Systems: the Promise, the Progress, and the Path Forward

Alekh Jindal, Matteo Interlandi

3206 - 3206

It’s not just Cookies and Tea

Susan Davidson

3207 - 3210

Evolution of a Compiling Query Engine

Thomas Neumann (Technische Universität München)

3211 - 3221

Make Your Database System Dream of Electric Sheep: Towards Self-Driving Operation

Andy Pavlo (Carnegie Mellon University), Matthew Butrovich (Carnegie Mellon University), Lin Ma (Carnegie Mellon University), Prashanth Menon (Carnegie Mellon University), Wan Shen Lim (Carnegie Mellon University), Dana Van Aken (Carnegie Mellon University), William Zhang (Carnegie Mellon University)

3222 - 3232

Towards instance-optimized data systems

Tim Kraska (MIT)

3233 - 3238

Knowledge Graphs 2021: A Data Odyssey

Gerhard Weikum (Max Planck Institute for Informatics and Saarland University, Germany)

3239 - 3240

The future of data(base) education: Is the "cow book" dead?

Zachary Ives

Volume 14, No. 13

Yi Chen: Front Matter

3253 - 3266

TSCache: An Efficient Flash-based Caching Scheme for Time-series Data Workloads

Jian Liu (Louisiana State University), Kefei Wang (Louisiana State University), Feng Chen (Louisiana State University)

3267 - 3280

MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS-L_1

Huayi Wang (Georgia Institute of Technology), Jingfan Meng (Georgia Institute of Technology), Long Gong (Facebook), Jun Xu (Georgia Tech), Mitsunori Ogihara (University of Miami)

3281 - 3294

View Selection over Knowledge Graphs in Triple Stores

Theofilos Mailis (Kapodistrian University of Athens, Greece), Yannis Kotidis (Athens University of Economics and Business), Yannis Ioannidis (University of Athens)

3295 - 3307

Frequency-Hiding Order-Preserving Encryption with Small Client Storage

Dongjie Li (Nankai University), Siyi Lv (Nankai University), Yanyu Huang (Nankai University), Yijing Liu (Nankai University), Tong Li (Nankai University), Zheli Liu (Nankai University), Liang Guo (Huawei Technologies Co., Ltd.)

3308 - 3321

Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms

Dimitrios Koutsoukos (ETHZ), Ingo Müller (ETH Zürich), Renato Marroquín (Oracle Labs), Ana Klimovic (ETH Zurich), Gustavo Alonso (ETHZ)

3322 - 3334

Time-Topology Analysis

Yunkai Lou (Tsinghua University), Chaokun Wang (Tsinghua University), Tiankai Gu (Tsinghua University), Hao Feng (Tsinghua University), Jun Chen (Baidu Inc), Jeffrey Xu Yu (Chinese University of Hong Kong)

3335 - 3347

Quantifying identifiability to choose and audit epsilon in differentially private deep learning

Daniel Bernau (SAP), Günther Eibl (FH Salzburg), Philip-William Grassal (Visual Learning Lab, Heidelberg University), Hannah Keller (SAP SE), Florian Kerschbaum (University of Waterloo)

3348 - 3361

Data Management in Microservices: State of the Practice, Challenges, and Research Directions

Rodrigo N Laigner (University of Copenhagen), Yongluan Zhou (University of Copenhagen), Marcos Antonio Vaz Salles (University of Copenhagen (DIKU)), Yijian Liu (University of Copenhagen), Marcos Kalinowski (PUC-Rio)

3362 - 3375

PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost!

H M Sajjad Hossain (Microsoft), Marc T Friedman (Microsoft), Hiren Patel (Microsoft), Shi Qiao (Microsoft), Soundar Srinivasan (Microsoft), Markus Weimer (Microsoft), Remmelt Ammerlaan (Microsoft), Lucas Rosenblatt (NYU), Gilbert Antonius (Microsoft), Peter Orenberg (Microsoft), Vijay Ramani (Microsoft), Abhishek Roy (Microsoft), Irene Shaffer (Microsoft), Alekh Jindal (Microsoft)

3376 - 3388

DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems

Bailu Ding, Surajit Chaudhuri, Johannes Gehrke, Vivek Narasayya

3389 - 3401

Computing How-Provenance for SPARQL Queries via Query Rewriting

Daniel Hernández (Aalborg University), Luis Galárraga (INRIA), Katja Hose (Aalborg University)

3402 - 3414

UDO: Universal Database Optimization using Reinforcement Learning

Junxiong Wang (Cornell University), Immanuel Trummer (Cornell), Debabrota Basu (Inria)

3415 - 3415

Internet Traffic Analysis at Scale

Anja Feldmann

3416 - 3416

The Power of Summarization in Graph Mining and Learning: Smaller Data, Faster Methods, More Interpretability

Danai Koutra (University of Michigan)

3417 - 3417

Summarizing Patients Like Mine via an On-demand Consultation Service

Nigam Shah (Stanford)

3418 - 3418

Towards Scalable Online Machine Learning Collaborations with OpenML

Joaquin Vanschoren (Eindhoven University of Technology)

3419 - 3419

From ML Models to Intelligent Applications: The Rise of MLOps

Manasi Vartak (Verta)

3420 - 3420

Designing Production-Friendly Machine Learning

Matei Zaharia (Stanford and Databricks)

PVLDB is part of the VLDB Endowment Inc.