VLDB 2024: Schedule of Papers and Tutorials

Find Session: A1  A3  A4  A7  A8  C1  C2  C3  C4  C5  C6  C7  C8  D7  D8  E1  E2  E3  E4  E5  E6  E7  E8  F1  F2  F3  F4  F5  F6  F7  F8  G1  G2  G3  G4  G5  G6  G7  G8  H1  H2  H3  H4  H5  H6  B6  B3  B1  B2  B4  B5  Tutorial-1  Tutorial-2  Tutorial-3  Tutorial-4  Tutorial-5  Tutorial-6  Tutorial-7  Tutorial-8  Tutorial-9  Tutorial-10  Demo-Group-A  Demo-Group-B  Demo-Group-C  Demo-Group-D 


A1

Data management and support for ML/AI

Chair: Ashraf Aboulnaga (University of Texas at Arlington)

ElasticNotebook: Enabling Live Migration for Computational Notebooks Zhaoheng Li (University of Illinois at Urbana-Champaign)*; Pranav Gor (University of Illinois Urbana Champaign); Rahul Prabhu (University of Illinois Urbana-Champaign); Hui Yu (University of Illinois at Urbana-Champaign); Yuzhou Mao (University of Michigan); Yongjoo Park (University of Illinois at Urbana-Champaign)

How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses Vraj Shah (IBM Research)*; Thomas J Parashos (csun); Arun Kumar (University of California, San Diego)

RALF: Accuracy-Aware Scheduling for Feature Store Maintenance Sarah Wooders (UC Berkeley)*; Xiangxi Mo (UC Berkeley); Amit Narang (UC Berkeley); Kevin Lin (UC Berkeley); Ion Stoica (UC Berkeley); Joseph M Hellerstein (UC Berkeley); Natacha Crooks (UC Berkeley); Joseph E Gonzalez (UC Berkeley)

MetaStore: Analyzing Deep Learning Meta-Data at Scale Huayi Zhang (WPI); Binwei Yan (Binwei Yan); Lei Cao (University of Arizona/MIT)*; Samuel Madden (MIT); Elke A Rundensteiner (Worcester Polytechnic Institute)

Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask Zhaoyuan Su (University of Virginia)*; Ammar Ahmed (University of Minnesota); Zirui Wang (University of Virginia); Ali Anwar (University of Minnesota); Yue Cheng (University of Virginia)

Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems Naili Xing (national university of singapore)*; Shaofeng Cai (National University of Singapore); Gang Chen (Zhejiang University); ZHAOJING LUO (Beijing Institute of Technology); Beng Chin Ooi (NUS); Jian Pei (Simon Fraser University)

A3

ML/AI for data management

Chair: Eric Lo (Chinese University of Hong Kong)

ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads Pengfei Li (Alibaba Group)*; Wenqing Wei (Alibaba Group); Rong Zhu (Alibaba Group); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Jingren Zhou (Alibaba Group); Hua Lu (Roskilde University)

Oasis: An Optimal Disjoint Segmented Learned Range Filter Guanduo Chen (Fudan University); Zhenying He (Fudan University); Meng Li (Nanjing University); Siqiang Luo (Nanyang Technological University)*

PilotScope: Steering Databases with Machine Learning Drivers Rong Zhu (Alibaba Group)*; weng lianggui (alibaba); Wenqing Wei (Alibaba Group); Di Wu (Alibaba); Jiazhen Peng (ALIBABA); Yifan Wang (Alibaba Inc); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Defu Lian (University of Science and Technology of China); Bolong Zheng (Huazhong University of Science and Technology); Jingren Zhou (Alibaba Group)

Leveraging Dynamic and Heterogeneous Workload Knowledge to Boost the Performance of Index Advisors zijia Wang (Xiamen University); haoran Liu (Xiamen University); Chen Lin (Xiamen University)*; Zhifeng Bao (RMIT University); Guoliang Li (Tsinghua University); Tianqing Wang (Huawei)

The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions William Zhang (Carnegie Mellon University)*; Wan Shen Lim (Carnegie Mellon University); Matthew Butrovich (Carnegie Mellon University); Andrew Pavlo (Carnegie Mellon University)

Eraser: Eliminating Performance Regression on Learned Query Optimizer weng lianggui (alibaba); Rong Zhu (Alibaba Group)*; Di Wu (Alibaba); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Bolong Zheng (Huazhong University of Science and Technology); Jingren Zhou (Alibaba Group)

Refactoring Index Tuning Process with Benefit Estimation Tao Yu (Harbin Institute of Technology); Zhaonian Zou (Harbin Institute of Technology)*; Weihua Sun (Harbin Institute of Technology); Yu Yan (Harbin Institute of Technology )

Accelerating String-key Learned Index Structures via Memoization-based Incremental Training Minsu Kim (KAIST); Jinwoo Hwang (KAIST); Guseul Heo (KAIST); Seiyeon Cho (KAIST); Divya Mahajan (Georgia Institute of Technology); Jongse Park (KAIST)*

A4

Data management and support for ML/AI

Chair: Fatemeh Nargesian (University of Rochester)

Saturn: An Optimized Data System for Multi-Large-Model Deep Learning Workloads (Information System Architectures) Kabir Nagrecha (UC San Diego)*; Arun Kumar (University of California, San Diego)

Experimental Analysis of Large-scale Learnable Vector Storage Compression [Experiment, Analysis & Benchmark (EA&B)] Hailin Zhang (Peking University)*; Penghao Zhao (Peking University); Xupeng Miao (Carnegie Mellon University); Yingxia Shao (BUPT); Zirui Liu (Peking University); Tong Yang (Peking University); Bin Cui (Peking University)

nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems (Vision) Ye Yuan ( Beijing Institute of Technology); Bo Tang (Southern University of Science and Technology)*; Tianfei Zhou (Beijing Institute of Technology); Zhiwei Zhang (Beijing Institute of Technology); Jianbin Qin (Shenzhen Institute of Computing Sciences, Shenzhen University )

How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study [Experiment, Analysis & Benchmark] Alexander Erben (Technical University of Munich)*; Ruben Mayer (University of Bayreuth); Hans-Arno Jacobsen (University of Toronto)

Optimizing Data Pipelines for Machine Learning in Feature Stores [Scalable Data Science] Rui Liu, Kwanghyun Park, Fotis Psallidas, Xiaoyong Zhu, Jinghui Mo, Rathijit Sen, Matteo Interlandi, Konstantinos Karanasos, Yuanyuan Tian, Jesús Camacho-Rodríguez

D3-GNN: Dynamic Distributed Dataflow for Streaming Graph Neural Networks Rustam Guliyev (University of Warwick)*; Aparajita Haldar (University of Warwick); Hakan Ferhatosmanoglu (University of Warwick and Amazon Web Services)

A7

Data management and support for ML/AI

Chair: Arun Kumar (University of California, San Diego)

Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines Chaokun Chang (The Chinese University of Hong Kong); Chunxiao Ye (Chinese University of Hong Kong); Eric Lo (Chinese University of Hong Kong)*

Optimizing Data Acquisition to Enhance Machine Learning Performance Tingting Wang (RMIT University); Shixun Huang (University of Wollongong); Zhifeng Bao (RMIT University)*; Shane Culpepper (The University of Queensland); Volkan Dedeoglu (CSIRO); Reza Arablouei (CSIRO)

DAHA: Accelerating GNN Training with Data and Hardware Aware Execution Planning Zhiyuan Li (The Hong Kong University of Science and Technology)*; Xun Jian (Hong Kong University of Science and Technology); Yue Wang (Shenzhen Institute of Computing Sciences); Yingxia Shao (BUPT); Lei Chen (Hong Kong University of Science and Technology)

FusionFlow: Accelerating Data Preprocessing for Machine Learning with CPU-GPU Cooperation Taeyoon Kim (UNIST)*; Chanho Park (UNIST); Mansur Mukimbekov (UNIST); Heelim Hong (Ulsan National Institute of Science and Technology); Minseok Kim (UNIST); Ze Jin (ByteDance); Changdae Kim (ETRI); Ji-Yong Shin (Northeastern University); Myeongjae Jeon (UNIST)

Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses Jeongmin Brian Park (University of Illinois at Urbana-Champaign)*; Vikram Sharma Mailthody (NVIDIA); Zaid Qureshi (NVIDIA); Wen-Mei Hwu (NVIDIA/UIUC)

Efficiently Mitigating the Impact of Data Drift on Machine Learning Pipelines [Scalable Data Science] SIJIE DONG (University Paris Cite)*; Qitong Wang (Université Paris Cité); Sahri Soror (Paris Descartes University, France); Themis Palpanas (Université Paris Cité); Divesh Srivastava (AT&T Chief Data Office)

A8

LLM for data management

Chair: Arijit Khan (Aalborg University)

D-Bot: Database Diagnosis System using Large Language Models Xuanhe Zhou (Tsinghua); Guoliang Li (Tsinghua University)*; Zhaoyan Sun (Tsinghua University); Zhiyuan Liu (Tsinghua University); Weize Chen (Tsinghua University); Jianming Wu (TsingHua); Jiesi Liu (Tsinghua University); Ruohang Feng (PanJiYunShu); Guoyang Zeng (ModelBest)

Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes Simran Arora (Stanford University)*; Brandon Yang (Stanford University); Sabri Eyuboglu (Stanford University); Avanika Narayan (Stanford University); Andrew Hojel (Stanford University); Immanuel Trummer (Cornell University); Christopher Re (Stanford University)

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity Haojun Xia (University of Sydney)*; Zhen Zheng (Alibaba Group); Yuchao Li (Alibaba Group); Donglin Zhuang (The University of Sydney); Zhongzhu Zhou (The University of Sydney); Xiafei Qiu (Alibaba Group); Yong Li (Alibaba Group); Wei Lin (Alibaba Group); Shuaiwen Leon Song (University of Sydney)

Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL zihui gu (Renmin University of China); Ju Fan (Renmin University of China)*; Songyue Zhang (Renmin University of China); Yuxin Zhang (Renmin university of China); Zui Chen (Tsinghua University); Lei Cao (University of Arizona/MIT); Guoliang Li (Tsinghua University); Samuel Madden (MIT); Xiaoyong Du (Renmin University of China); Nan Tang (HKUST (GZ))

GPTuner: A Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization Jiale Lao (Cornell University); Yibo Wang (Sichuan University); Yufei Li (Sichuan University); Jianping Wang (Northwest Normal University); Yunjia Zhang (University of Wisconsin-Madison); Zhiyuan Cheng (Purdue University); Wanghu Chen (Northwest Normal University); Mingjie Tang (Sichuan University)*; Jianguo Wang (Purdue University)

Generating Succinct Descriptions of Database Schemata for Cost-Efficient Prompting of Large Language Models Immanuel Trummer (Cornell University)*

Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation [Experiment, Analysis & Benchmark] Dawei Gao (Alibaba-inc)*; Haibin Wang (Alibaba Group); Yaliang Li (Alibaba Group); Xiuyu Sun (Alibaba Group); Yichen Qian (Alibaba Group); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Jingren Zhou (Alibaba Group)

Can Large Language Models Predict Data Correlations from Column Names? [Experiment, Analysis & Benchmark] Immanuel Trummer (Cornell University)*

C1

Database Engines

Chair: Viktor Leis (TUM)

Flexible Resource Allocation for Relational Database-as-a-Service [Flavor: Systems] Pankaj Arora (Microsoft); Surajit Chaudhuri (Microsoft); Sudipto Das (Amazon Web Services); Junfeng Dong (Microsoft); Cyril George (Microsoft); Ajay Kalhan (Microsoft); Arnd Christian König (Microsoft); Willis Lang (Microsoft); Feng Li (Meta Platforms Inc.); Changsong Li (Microsoft); Jiaqi Liu (Microsoft); Lukas M Maas (Microsoft Research); Akshay Mata (Microsoft); Ishai Menache (Microsoft Research); Justin Moeller (Microsoft); Vivek Narasayya (Microsoft)*; Matthaios Olma (Microsoft Research); Morgan Oslake (Microsoft); Elnaz Rezai (Amazon); Yi Shan (Microsoft); Manoj Syamala (Microsoft); Shize Xu (Stripe Inc.); Vasileios Zois (Microsoft)

RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing Justus Henneberg (Johannes Gutenberg-University Mainz); Felix M Schuhknecht (Johannes Gutenberg University Mainz)*

DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search Jiuqi Wei (Institute of Computing Technology, Chinese Academy of Sciences)*; Botao Peng (Institute of Computing Technology, Chinese Academy of Sciences); Xiaodong Lee (ICT); Themis Palpanas (Université Paris Cité)

Towards Systematic Index Dynamization Douglas B Rumbaugh (Penn State University)*; Dong Xie (Penn State University); Zhuoyue Zhao (University at Buffalo)

POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance David Justen (Technische Universität Berlin)*; Daniel Ritter (SAP); Campbell B Fraser (Google); Andrew Lamb (InfluxData); Nga Tran (InfluxData); Allison Lee (Snowflake); Thomas Bodner (Hasso Plattner Institute, University of Potsdam); MHD Yamen Haddad (Inria and Institut Polytechnique de Paris); Steffen Zeuch (TU Berlin); Volker Markl (Technische Universität Berlin); Matthias Boehm (Technische Universität Berlin)

Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems Wan Shen Lim (Carnegie Mellon University)*; Lin Ma (University of Michigan); William Zhang (Carnegie Mellon University); Matthew Butrovich (Carnegie Mellon University); Samuel I Arch (Carnegie Mellon University); Andrew Pavlo (Carnegie Mellon University)

C2

Applied machine learning for data exploration and analytics

Chair: Jianzhong Qi (University of Melbourne)

CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics Qingpeng Cai (National University of Singapore)*; Kaiping Zheng (National University of Singapore); H. V. Jagadish (University of Michigan); Beng Chin Ooi (NUS); James WL Yip (National University Health System Singapore)

VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska

Automating the Enterprise with Foundation Models Michael Wornow (Stanford)*; Avanika Narayan (Stanford University); Krista Opsahl-Ong (Stanford); Quinn McIntyre (Stanford); Nigam Shah (Stanford); Christopher Re (Stanford University)

Ensemble Clustering based on Meta-Learning and Hyperparameter Optimization Dennis Treder-Tschechlov (Universität Stuttgart)*; Manuel Fritz (Universität Stuttgart); Holger Schwarz (Universität Stuttgart); Bernhard Mitschang (University of Stuttgart)

A Shapelet-based Framework for Unsupervised Multivariate Time Series Representation Learning Zhiyu Liang (Harbin Institute of Technology); Jianfeng Zhang (Huawei Noah's Ark Lab); Chen Liang (Harbin Institute of technology); Hongzhi Wang (Harbin Institute of Technology)*; Zheng Liang (Harbin Institute of Technology); Lujia Pan (Huawei Noah's Ark Lab)

ADF & TransApp: A Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series [SDS Paper] Adrien Petralia (EDF)*; Philippe Charpentier (EDF SA); Themis Palpanas (Université Paris Cité)

C3

Database Engines

Chair: Huanchen Zhang (Tsinghua University)

Bf-Tree: A Modern Read-Write-Optimized Concurrent Larger-Than-Memory Range Index Xiangpeng Hao (University of Wisconsin Madison)*; Badrish Chandramouli (Microsoft Research)

Aleph Filter: To Infinity in Constant Time Niv Dayan (University of Toronto)*; Ioana-Oriana Bercea (KTH Royal Institute of Technology); Rasmus Pagh (University of Copenhagen)

When Amnesia Strikes: Understanding and Reproducing Data Loss Bugs with Fault Injection Maria Ramos (INESC TEC & U. Minho)*; João Azevedo (INESC TEC); Kyle Kingsbury (Jepsen); José Pereira (U. Minho & INESCTEC); Tânia Esteves (INESC TEC & U. Minho); Ricardo Macedo (INESC TEC & University of Minho); João Paulo (INESC TEC & University of Minho)

Timestamp as a Service, not an Oracle Yishuai Li (Alibaba Group)*; Yunfeng Zhu (Alibaba Group); Chao Shi (Alibaba Group); Guanhua Zhang (Alibaba Group); Jianzhong Wang (Alibaba Group); Xiaolu Zhang (Alibaba Group)

DariusDB: Searching for Fast Transaction Schedules Audrey Cheng (UC Berkeley)*; Aaron N Kabcenell (Facebook); Jason Chan (UC Berkeley); Xiao Shi (Unaffiliated); Peter D Bailis (Stanford University); Natacha Crooks (UC Berkeley); Ion Stoica (UC Berkeley)

AMNES: Accelerating the computation of data correlation using FPGAs Monica Chiosa (ETH Zurich)*; Thomas Preusser (AMD Xilinx); Michaela Blott (AMD Xilinx); Gustavo Alonso (ETHZ)

Spectrum: Speedy and Strictly-Deterministic Smart Contract Transactions for Blockchain Ledgers Zhihao Chen (East China Normal University); Tianji Yang (East China Normal University); Yixiao Zheng (East China Normal University); Zhao Zhang (East China Normal University)*; Cheqing Jin (East China Normal University); Aoying Zhou (East China Normal University )

Rashnu: Data-Dependent Order-Fairness Heena Nagda (University of Pennsylvania); Shubhendra Pal Singhal (Georgia Institute of Technology); Mohammad Javad Amiri (Stony Brook University)*; Boon Thau Loo (Univ. of Pennsylvania)

C4

Memory and storage management

Chair: Wolfgang Lehner (Dresden University of Technology)

Blitzcrank: Fast Semantic Compression for In-memory Online Transaction Processing Yiming Qiao (Tsinghua University); Yihan Gao (None); Huanchen Zhang (Tsinghua University)*

An Empirical Evaluation of Columnar Storage Formats Xinyu Zeng (Tsinghua University)*; Yulong Hui (Tsinghua University); Jiahong Shen (Tsinghua University); Andrew Pavlo (Carnegie Mellon University); Wes McKinney (Voltron Data); Huanchen Zhang (Tsinghua University)

FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data Xinyu Chen (Washington State University); Jiannan Tian (Indiana University); Ian Beaver (Verint Systems Inc); Cynthia Freeman (Verint Intelligent Self-Service); Yan Yan (Washington State University); Jianguo Wang (Purdue University); Dingwen Tao (Indiana University)*

Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP (Flavor: Systems) Tobias Schmidt (TUM)*; Dominik Durner (CedarDB); Viktor Leis (Technische Universität München); Thomas Neumann (TUM)

The Art of Latency Hiding in Modern Database Engines Kaisong Huang (Simon Fraser University)*; Tianzheng Wang (Simon Fraser University); Qingqing Zhou (DB365000); Qingzhong Meng (Tencent Technology (Beijing) Co.Ltd)

Catalyst: Optimizing Cache Management for Large In-memory Key-value Systems Kefei Wang (Louisiana State University); Feng Chen (Louisiana State University)*

C5

Data management and support for ML/AI

Chair: Hazar Harmouch (University of Amsterdam)

SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments Qiuru Lin (Zhejiang University); Sai Wu (Zhejiang University)*; Junbo Zhao (Zhejiang University); Jian Dai (Alibaba Group); Meng Shi (Zhejiang University); Gang Chen (Zhejiang University); Feifei Li (Alibaba Group)

InferDB: In-Database Machine Learning Inference Using Indexes Ricardo Salazar-Díaz (Hasso Plattner Institute, University of Potsdam)*; Boris Glavic (Illinois Institute of Technology); Tilmann Rabl (HPI, University of Potsdam)

SepHash: A Write-Optimized Hash Index On Disaggregated Memory via Separate Segment Structure Xinhao Min (Huazhong University of Science and Technology); Kai Lu (Huazhong University of Science and Technology)*; Pengyu Liu (Huazhong University of Science and Technology); Jiguang Wan (Huazhong University of Science and Technology); Changsheng Xie ( Huazhong University of Science and Technology); Daohui Wang (Huawei Cloud Computing Technology Co., Ltd.); Ting Yao (Huawei Cloud Computing Technology Co., Ltd.); huatao wu (huawei)

FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training (Flavor: Systems) Kezhao Huang (Tsinghua University)*; Haitian Jiang (New York University); Minjie Wang (Amazon); Guangxuan Xiao (Massachusetts Institute of Technology); Wipf David (Amazon); Xiang Song (Amazon); Quan Gan (Amazon); Zengfeng Huang (Fudan University); Jidong Zhai (Tsinghua University); Zheng Zhang (Amazon)

Falcon: Fair Active Learning using Multi-armed Bandits Ki Hyun Tae (Samsung Research); Hantian Zhang (Georgia Tech); Jaeyoung Park (KAIST); Kexin Rong (Georgia Institute of Technology); Steven E Whang (KAIST)*

SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis Aarati Kakaraparthy (University of Wisconsin-Madison)*; Jignesh Patel (Carnegie Mellon University)

C6

ML/AI for data management

Chair: Ibrahim Sabek (University of Southern California)

Breaking It Down: An In-depth Study of Index Advisors [EA&B] Wei Zhou (Xiamen University); Chen Lin (Xiamen University); Xuanhe Zhou (Tsinghua); Guoliang Li (Tsinghua University)*

CHORUS: Foundation Models for Unified Data Discovery and Exploration (SDS Track) Moe Kayali (University of Washington)*; Anton Lykov (UW); Ilias Fountalis (RelationalAI); Nikolaos Vasiloglou (RelationalAI); Dan Olteanu (University of Zurich); Dan Suciu (University of Washington)

ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems Jinqing Lian, Xinyi Zhang, Yingxia Shao, Zenglin Pu, Qingfeng Xiang, Yawen Li, Bin Cui

A Comparative Study and Component Analysis of Query Plan Representation Techniques in ML4DB Studies [Experiment, Analysis & Benchmark] Yue Zhao (Nanyang Technological University)*; Zhaodonghui Li (Nanyang Technological University); Gao Cong (Nanyang Technological Univesity)

QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models David Campos (Aalborg University)*; Bin Yang (East China Normal University); Tung Kieu (Aalborg University); Miao Zhang (Harbin Institute of Technology (Shenzhen)); Chenjuan Guo (ECNU); Christian S. Jensen (Aalborg University)

Intelligent Pooling: Proactive Resource Provisioning in Large-scale Cloud Service Deepak Ravikumar (Purdue University); Alex Yeo (N/A); Yiwen Zhu (Microsoft)*; Aditya Lakra (Azure Data); Harsha N Nagulapalli (Microsoft); Santhosh Kumar Ravindran (Azure Data); Steve Suh (Microsoft); Niharika Dutta (Microsoft Corporation); Andrew F Fogarty (Microsoft); Yoonjae Park (Microsoft Corporation); Sumeet Khushalani (Microsoft); Arijit Tarafdar (Microsoft Corporation); Kunal Parekh (Microsoft); Subru Krishnan (Microsoft)

ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation Rong Gu, Han Li, Haipeng Dai, Wenjie Huang, Jie Xue, Meng Li, Jiaqi Zheng, Haoran Cai, Yihua Huang, Guihai Chen

C7

Applied machine learning for data exploration and analytics

Chair: Yanyan Shen (Shanghai Jiao Tong University)

METER: A Dynamic Concept Adaptation Framework for Online Anomaly Detection Jiaqi Zhu (Beijing Institute of Technology)*; Shaofeng Cai (National University of Singapore); Fang Deng (Beijing Institute of Technology); Beng Chin Ooi (NUS); Wenqiao Zhang (Zhejiang University)

DARKER: Efficient Transformer with Data-driven Attention Mechanism for Time Series Rundong Zuo (Hong Kong Baptist University)*; Guozhong Li (King Abdullah University of Science & Technology); Rui CAO (Hong Kong Baptist University); Byron Choi (Hong Kong Baptist University); Jianliang Xu (Hong Kong Baptist University); Sourav S Bhowmick (Nanyang Technological University)

Inductive Attributed Community Search: to Learn Communities across Graphs Shuheng Fang (The Chinese University of Hong Kong); Kangfei Zhao (Beijing Insitute of Technology)*; Yu Rong (Tencent AI Lab); Zhixun Li (The Chinese University of Hong Kong); Jeffrey Xu Yu (Chinese University of Hong Kong)

ReAcTable: Enhancing ReAct for Table Question Answering Yunjia Zhang (University of Wisconsin-Madison)*; Jordan Henkel (Microsoft); Avrilia Floratou (Microsoft); Joyce Cahoon (Microsoft); Shaleen Deep (Microsoft Gray Systems Lab); Jignesh Patel (Carnegie Mellon University)

FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language Mukul Singh (Microsoft)*; José P Cambronero Sanchez (Microsoft); Sumit Gulwani (Microsoft Research); Vu Le (Microsoft); Carina Negreanu (Microsoft Research); Elnaz Nouri (Microsoft Research); Mohammad Raza (Microsoft); Gust Verbruggen (Microsoft)

Observatory: Characterizing Embeddings of Relational Tables Tianji Cong (University of Michigan)*; Madelon Hulsebos (UC Berkeley); Zhenjie Sun (University of Michigan, Ann Arbor); Paul Groth (University of Amsterdam); H. V. Jagadish (University of Michigan)

C8

Heterogeneous and federated data management


A Blockchain System for Clustered Federated Learning with Peer-to-Peer Knowledge Transfer Honghu Wu (Nanjing University); XiangRong Zhu (Nanjing University); Wei Hu (Nanjing University)*

Uldp-FL: Federated Learning with Across Silo User-Level Differential Privacy Fumiyuki Kato (Preferred Networks)*; Li Xiong (Emory University); Shun Takagi (Kyoto University); Yang Cao (Tokyo Institute of Technology); Masatoshi Yoshikawa (Osaka Seikei University)

Contributions Estimation in Federated Learning: A Comprehensive Experimental Evaluation (EA&B) Yiwei Chen (Tsinghua University); Kaiyu Li (Tsinghua University)*; Guoliang Li (Tsinghua University); Yong Wang (Tsinghua University)

P-Shapley: Shapley Values on Probabilistic Classifiers Jinfei Liu (Zhejiang University)*; Haocheng Xia (University of Illinois Urbana-Champaign); Xiang Li (Zhejiang University); Junyuan Pang (Zhejiang University); Kui Ren (Zhejiang University); Li Xiong (Emory University)

Performance-Based Pricing for Federated Learning via Auction Zitao Li (Alibaba Group)*; Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Liuyi Yao (Alibaba Group); Yaliang Li (Alibaba Group); Xiaokui Xiao (National University of Singapore); Jingren Zhou (Alibaba Group)

D7

Database Engines

Chair: Vivek Narasayya

DEX: Scalable Range Indexing on Disaggregated Memory Baotong Lu (Microsoft Research)*; Kaisong Huang (Simon Fraser University); Chieh-Jan Mike Liang (Microsoft Research); Tianzheng Wang (Simon Fraser University); Eric Lo (Chinese University of Hong Kong)

Detecting Metadata-Related Logic Bugs in Database Systems via Raw Database Construction Jiansen Song (Institute of Software Chinese Academy of Sciences)*; Wensheng Dou (Institute of Software Chinese Academy of Sciences); Yu Gao (Institute of Software Chinese Academy of Sciences); Ziyu Cui (Institute of Software Chinese Academy of Sciences); Yingying Zheng (Institute of Software Chinese Academy of Sciences); Dong Wang (Institute of Software Chinese Academy of Sciences); Wei Wang (Institute of Software, Chinese Academy of Sciences); Jun Wei (Institute of Software, Chinese Academy of Sciences); Tao Huang (Institute of Software Chinese Academy of Sciences)

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting Otmar Ertl (Dynatrace Research)*

LITS: An Optimized Learned Index for Strings Yifan Yang (IInstitute of Computing Technology, Chinese Academy of Sciences); Shimin Chen (Chinese Academy of Sciences)*

Cloud-Native Database Systems and Unikernels: Reimagining OS Abstractions for Modern Hardware Viktor Leis (Technische Universität München)*; Christian Dietrich (Technische Universität Braunschweig)

SeLeP: Learning Based Semantic Prefetching for Exploratory Database Workloads Farzaneh Zirak (The University of Melbourne)*; Farhana Choudhury (The University of Melbourne); Renata Borovica-Gajic (University of Melbourne)

D8

Query Processing


CGgraph: An Ultra-fast Graph Processing System on Modern Commodity CPU-GPU Co-processor pengjie cui (Northeastern University); Haotian Liu (Southern University of Science and Technology); Bo Tang (Southern University of Science and Technology); Ye Yuan ( Beijing Institute of Technology)*

X-TED: Massive Parallelization of Tree Edit Distance Dayi Fan (The Ohio State University)*; Rubao Lee (Freelance); Xiaodong Zhang (Ohio State U.)

RTScan: Efficient Scan with Ray Tracing Cores Yangming Lv (Fudan University)*; Kai Zhang (Fudan University); Ziming Wang (Fudan University); Xiaodong Zhang (Ohio State U.); Rubao Lee (Freelance); Zhenying He (Fudan University); Yinan Jing (Fudan University); X. Sean Wang (Fudan University)

FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data Junhao Zhu (Zhejiang University); Yuren Mao (Zhejiang University); Lu Chen (Zhejiang University); Congcong Ge (Huawei Technologies Co., Ltd.); Ziheng Wei (Huawei Technologies Co., Ltd.); Yunjun Gao (Zhejiang University)*

Robust Join Processing with Diamond Hardened Joins Altan Birler (TUM)*; Alfons Kemper (TUM); Thomas Neumann (TUM)

Hardware-Efficient Data Imputation through DBMS Extensibility Hubert Mohr-Daurat (Imperial College London)*; Georgios R Theodorakis (Imperial College London); Holger Pirk (Imperial College)

PairwiseHist: Fast, Accurate, and Space-Efficient Approximate Query Processing with Data Compression Aaron Hurst (Aarhus University)*; Daniel E Lucani (Aarhus University); Qi Zhang (Aarhus University)

E1

Graph Analytics

Chair: Xiang Zhao (National University of Defense Technology)

Transforming Property Graphs Angela Bonifati (Univ. of Lyon); Filip Murlak (University of Warsaw, Poland); Yann Ramusat (Lyon 1 Univ., Liris CNRS)*

Demystifying Graph Sparsification Algorithms in Graph Properties Preservation [Experiment, Analysis & Benchmark] Yuhan Chen (University of Michigan)*; Haojie Ye (University of Michigan); Sanketh Vedula (Technion); Alex Bronstein (Technion); Ronald Dreslinski (University of Michigan); Trevor Mudge (U Michigan); Nishil Talati (University of Michigan/AMD Research)

Efficient and Accurate SimRank-based Similarity Joins: Experiments, Analysis, and Improvement Qian Ge (Peking University); Yu Liu (Beijing Jiaotong University)*; Yinghao Zhao (Beijing JiaoTong University); Yuetian Sun (Beijing Jiaotong University); Lei Zou (Peking University); Yuxing Chen (Tencent); Anqun Pan (Tencent Inc., China)

MOSER: Scalable Network Motif Discovery using Serial Test Mohammad Matin Najafi (The University of Hong Kong)*; Chenhao Ma (The Chinese University of Hong Kong, Shenzhen); Xiaodong Li (The University of Hong Kong); Reynold Cheng ("The University of Hong Kong, China"); Laks V.S. Lakshmanan (The University of British Columbia)

Capturing More Associations by Referencing External Graphs Wenfei Fan (Univ. of Edinburgh ); Muyang Liu (University of Edinburgh); Shuhao Liu (Shenzhen Institute of Computing Sciences); Chao Tian (Beihang University)*

Influence Maximization via Vertex Countering Jiadong Xie (The Chinese University of Hong Kong); ZeHua Chen (Guangzhou University); Deming Chu (University of New South Wales); Fan Zhang (Guangzhou University)*; Xuemin Lin (University of New South Wales); Zhihong Tian (Guangzhou University)

E2

Graph Analytics

Chair: Sibo Wang (CUHK)

Minimum Strongly Connected Subgraph Collection in Dynamic Graphs Xin CHEN (The Chinese University of Hong Kong)*; Jieming Shi (The Hong Kong Polytechnic University); You Peng (University of New South Wales); Wenqing Lin (Tencent); Sibo Wang (The Chinese University of Hong Kong); Wenjie Zhang (University of New South Wales)

BIRD: Efficient Approximation of Bidirectional Hidden Personalized PageRank Haoyu Liu (Nanyang Technological University)*; Siqiang Luo (Nanyang Technological University)

Improving Graph Compression for Efficient Resource-Constrained Graph Analytics Qian Xu (Renmin University of China)*; Juan Yang (Beijing HaiZhi XingTu Technology Co., Ltd.); Feng Zhang (Renmin University of China); Zheng Chen (Renmin University of China); Jiawei Guan (Renmin University of China); Kang Chen (Tsinghua University); Ju Fan (Renmin University of China); Youren Shen (Beijing HaiZhi XingTu Technology Co., Ltd.); Ke Yang (Beijing HaiZhi XingTu Technology Co., Ltd.); Yu Zhang (Renmin University of China); Xiaoyong Du (Renmin University of China)

Accelerating Maximal Clique Enumeration via Graph Reduction Wen Deng (Fudan University)*; Weiguo Zheng (Fudan University); Hong Cheng (Chinese University of Hong Kong)

Poligras: Policy-based Graph Summarization Jiyang Bai (Florida State University); Peixiang Zhao (Florida State University)*

Efficient Influence Minimization via Node Blocking Jinghao Wang (University of Technology Sydney); Yanping Wu (University of Technology Sydney)*; Xiaoyang Wang (University of New South Wales); Ying Zhang (University of Technology Sydney); Lu Qin (UTS); Wenjie Zhang (University of New South Wales); Xuemin Lin (Shanghai Jiaotong University)

E3

Temporal & Streaming Graph Data Management

Chair: Jilin Hu (East China Normal University)

Everest: GPU-Accelerated System For Mining Temporal Motifs Yichao Yuan (University of Michigan)*; Haojie Ye (University of Michigan); Sanketh Vedula (Technion); Wynn M Kaza (University of Michigan); Nishil Talati (University of Michigan/AMD Research)

Efficient Temporal Butterfly Counting and Enumeration on Temporal Bipartite Graphs Xinwei Cai (Zhejiang University); Xiangyu Ke (Zhejiang University, China); Kai Wang (Shanghai Jiao Tong University); Lu Chen (Zhejiang University); Tianming Zhang (Zhejiang University Of Technology); Qing Liu (Zhejiang University); Yunjun Gao (Zhejiang University)*

QTCS: Efficient Query-Centered Temporal Community Search Longlong Lin (Southwest University)*; Pingpeng Yuan (Huazhong University of Science & Technology); Rong-Hua Li (Beijing Institute of Technology); Chunxue Zhu (HuaZhong University of Science and Technology); Hongchao Qin (Beijing Institute of Technology); Hai Jin (Huazhong University of Science and Technology); Tao Jia (Southwest University)

Incremental Sliding Window Connectivity over Streaming Graphs Chao Zhang (University of Waterloo)*; Angela Bonifati (Univ. of Lyon); Tamer Özsu (University of Waterloo)

TC-Match: Fast Time-constrained Continuous Subgraph Matching Jianye Yang (Guangzhou University)*; sheng fang (GZHU); Zhaoquan Gu (Harbin Institute of Technology (Shenzhen)); Ziyi Ma (Hebei University of Technology); Xuemin Lin (Shanghai Jiaotong University); Zhihong Tian (Guangzhou University)

Efficient Index for Temporal Core Queries over Bipartite Graphs Anxin Tian (Hong Kong University of Science and Technology)*; Alexander Zhou (Hong Kong University of Science and Technology); Yue Wang (Shenzhen Institute of Computing Sciences); Xun Jian (HKUST); Lei Chen (HKUST)

Evolution Forest Index: Towards Optimal Temporal $k$-Core Component Search via Time-Topology Isomorphic Computation Junyong Yang (Wuhan University); Ming Zhong (Wuhan University)*; Yuanyuan Zhu (Wuhan University); Tieyun Qian (Wuhan University); Mengchi Liu (South China Normal University); Jeffrey Xu Yu (Chinese University of Hong Kong)

Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs Yanping Wu (University of Technology Sydney)*; Renjie Sun (East China Normal University); Xiaoyang Wang (University of New South Wales); Dong Wen (University of New South Wales); Ying Zhang (University of Technology Sydney); Lu Qin (UTS); Xuemin Lin (Shanghai Jiaotong University)

E4

Graph Analytics

Chair: Dong Wen (University of New South Wales)

Fast Local Subgraph Counting Qiyan Li (The Chinese University of Hong Kong)*; Jeffrey Xu Yu (Chinese University of Hong Kong)

FSM: A Fine-grained Splitting and Merging Framework for Dual-balanced Graph Partition Chengjun Liu (Fudan University); Zhuo Peng (复旦大学); Weiguo Zheng (Fudan University)*; Lei Zou (Peking University)

Distributed Shortest Distance Labeling on Large-Scale Graphs yuanyuan zeng (Chinese University of Hong Kong, Shenzhen)*; Chenhao Ma (The Chinese University of Hong Kong, Shenzhen); Yixiang Fang (The Chinese University of Hong Kong, Shenzhen)

Efficient Parallel D-core Decomposition at Scale Wensheng Luo (School of Data Science, The Chinese University of Hong Kong, Shenzhen)*; Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Chunxu Lin (The Chinese University of Hong Kong,Shenzhen); Yingli Zhou (The Chinese University of Hong Kong, Shenzhen)

Efficient Algorithms for Pseudoarboricity Computation in Large Static and Dynamic Graphs Yalong Zhang (Beijing Institute of Technology); Ronghua Li (Beijing Institute of Technology)*; Qi Zhang (Beijing Institute of Technology); Hongchao Qin (Beijing Institute of Technology); Lu Qin (UTS); Guoren Wang (Beijing Institute of Technology)

Efficient Betweenness Centrality Computation over Large Heterogeneous Information Networks Xinrui Wang (Shandong University); Wang Yiran (Shandong University); Xuemin Lin (Shanghai Jiaotong University); Jeffrey Xu Yu (Chinese University of Hong Kong); Hong Gao (Zhejiang Normal University); Xiuzhen Cheng (Shandong University); Dongxiao Yu (Shandong University)*

E5

Graph Analytics

Chair: Jieming Shi (Hong Kong Polytechnic University)

Efficient Algorithms for Density Decomposition on Large Static and Dynamic Graphs Yalong Zhang (Beijing Institute of Technology); Ronghua Li (Beijing Institute of Technology)*; Qi Zhang (Beijing Institute of Technology); Hongchao Qin (Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

Efficient Maximal Motif-Clique Enumeration over Large Heterogeneous Information Networks Yingli Zhou (The Chinese University of Hong Kong, Shenzhen)*; Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Chenhao Ma (The Chinese University of Hong Kong, Shenzhen); Tianci Hou (The Chinese University of Hong Kong, Shenzhen); Xin Huang (Hong Kong Baptist University)

Host Profit Maximization: Leveraging Performance Incentives and User Flexibility Xueqin Chang (Zhejiang University); Xiangyu Ke (Zhejiang University, China); Lu Chen (Zhejiang University); Congcong Ge (Huawei Technologies Co., Ltd.); Ziheng Wei (Huawei Technologies Co., Ltd.); Yunjun Gao (Zhejiang University)*

Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent Queries Zheng Chen (Renmin University of China); Feng Zhang (Renmin University of China)*; Yang Chen (Renmin University of China); Xiaokun Fang (Renmin University of China); Guanyu Feng (Tsinghua University); Xiaowei Zhu (Ant Group); Wenguang Chen (Tsinghua University); Xiaoyong Du (Renmin University of China)

RUSH: Real-time Burst Subgraph Discovery in Dynamic Graphs [Scalable Data Science] Yuhang Chen (National University of Singapore)*; Jiaxin Jiang (National University of Singapore); Shixuan Sun (Shanghai Jiao Tong University); Bingsheng He (National University of Singapore); Min Chen (Grab)

Efficient k-Clique Count Estimation with Accuracy Guarantee Lijun Chang (The University of Sydney)*; Rashmika Gamage (The University of Sydney); Jeffrey Xu Yu (Chinese University of Hong Kong)

E6

Graph Analytics & Systems

Chair: Long Yuan (Nanjing Univeristy of Science and Technology)

RAGraph: A Region-Aware Framework for Geo-Distributed Graph Processing Feng Yao (Northeastern University)*; Qian Tao (Alibaba Group); Wenyuan Yu (Alibaba Group); Yanfeng Zhang (Northeastern University); Shufeng Gong (NorthEastern University); Qiange Wang (National University of Singapore); Ge Yu (Northeastern University); Jingren Zhou (Alibaba Group)

CoroGraph: Bridging Cache Efficiency and Work Efficiency for Graph Algorithm Execution Xiangyu Zhi (Southern University of Science and Technology)*; Xiao Yan (Centre for Perceptual and Interactive Intelligence (CPII) ); Bo Tang (Southern University of Science and Technology); Ziyao Yin (Southern University of Science and Technology); yanchao zhu (huawei); Minqi Zhou (Huawei Company)

Mammoths Are Slow: The Overlooked Transactions of Graph Data Audrey Cheng (UC Berkeley)*; Jack Waudby (Neo4j); Hugo E Firth (Neo4j); Natacha Crooks (UC Berkeley); Ion Stoica (UC Berkeley)

FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework Junyi Mei (Shanghai Jiao Tong University)*; Shixuan Sun (Shanghai Jiao Tong University); Chao Li (Shanghai Jiao Tong University); Cheng Xu (Shanghai Jiao Tong University); cheng chen (bytedance); Yibo Liu (Shanghai Jiaotong University); Jing Wang (Shanghai Jiao Tong University); Cheng Zhao (ByteDance Inc.); Xiaofeng Hou (Shanghai Jiao Tong University); Minyi Guo (Shanghai Jiaotong University); Bingsheng He (National University of Singapore); cong xiaoliang (bytedance)

Extending Graph Rules with Oracles Xueli liu (Tianjin University)*; bowen dong (Tianjin University); wenzhi fu (UoE); Nannan Wu (Tianjin University); Xin Wang (Tianjin University); Wenjun Wang (Tianjin University)

AeonG: An Efficient Built-in Temporal Support in Graph Databases Jiamin Hou (Renmin University of China); Zhanhao Zhao (Renmin University of China); Zhouyu WANG (Renmin University of China); WEI LU (Renmin University of China)*; Guodong Jin (University of Waterloo); Dong Wen (University of New South Wales); Xiaoyong Du (Renmin University of China)

BYO: A Unified Framework for Benchmarking Large-Scale Graph Containers [Experiment, Analysis & Benchmark] Brian Wheatman (Johns Hopkins University)*; Xiaojun Dong (University of California, Riverside); Zheqi Shen (UC Riverside); Laxman Dhulipala (University of Maryland, College Park); Jakub Łącki (Google); Prashant Pandey (University of Utah); Helen Xu (Georgia Institute of Technology)

GraphOS: Towards Oblivious Graph Processing Javad Ghareh Chamani (Hong Kong University of Science and Technology)*; Ioannis Demertzis (UCSC); Dimitrios Papadopoulos (Hong Kong University of Science and Technology); Charalampos Papamanthou (Yale University); Rasool Jalili (Sharif University of Technology)

E7

Graph Search

Chair: Yixiang Fang (The Chinese University of Hong Kong, Shenzhen)

Efficient Unsupervised Community Search with Pre-trained Graph Transformer Jianwei Wang (University of New South Wales)*; Kai Wang (Shanghai Jiao Tong University); Xuemin Lin (Shanghai Jiaotong University); Wenjie Zhang (University of New South Wales); Ying Zhang (University of Technology Sydney)

Expanding Reverse Nearest Neighbors Wentao Li (The Hong Kong University of Science and Technology (Guangzhou)); Maolin Cai (Chongqing University); Min Gao (Chongqing University); Dong Wen (University of New South Wales); Lu Qin (UTS); Wei Wang (Hong Kong University of Science and Technology (Guangzhou))*

Querying Structural Diversity in Streaming Graphs Kaiyu Chen (University of New South Wales); Dong Wen (University of New South Wales)*; Wenjie Zhang (University of New South Wales); Ying Zhang (University of Technology Sydney); Xiaoyang Wang (University of New South Wales); Xuemin Lin (Shanghai Jiaotong University)

LM-SRPQ: Efficiently Answering Regular Path Query in Streaming Graphs Xiangyang Gou (The Chinese University of Hong Kong); Xinyi Ye (Peking University); Lei Zou (Peking University)*; Jeffrey Xu Yu (Chinese University of Hong Kong)

Cardinality Estimation of Subgraph Matching: A Filtering-Sampling Approach Wonseok Shin (Seoul National University); Siwoo Song (Seoul National University); Kunsoo Park (Seoul National University)*; Wook-Shin Han (POSTECH)

Efficient Regular Simple Path Queries under Transitive Restricted Expressions Qi Liang (Guangzhou University); Dian Ouyang (Guangzhou University)*; Fan Zhang (Guangzhou University); Jianye Yang (Guangzhou University); Xuemin Lin (Shanghai Jiaotong University); Zhihong Tian (Guangzhou University)

E8

Graph Search

Chair: Chenhao Ma (The Chinese University of Hong Kong, Shenzhen)

Truss-based Community Search over Streaming Directed Graphs Xuankun Liao (Hong Kong Baptist University); Qing Liu (Zhejiang University); Xin Huang (Hong Kong Baptist University); Jianliang Xu (Hong Kong Baptist University)*

Efficient Exact Subgraph Matching via GNN-based Path Dominance Embedding Yutong Ye (East China Normal University)*; Xiang Lian (Kent State University); Mingsong Chen (East China Normal University)

I/O Efficient Label-Constrained Reachability Queries in Large Graphs Long Yuan (Nanjing University of Science and Technology)*; Xia Li (University of New South Wales); Zi Chen (Nanjing University of Aeronautics and Astronautics); Xuemin Lin (Shanghai Jiaotong University); Xiang Zhao (National University of Defense Technology); Wenjie Zhang (University of New South Wales)

A Sampling-based Framework for Hypothesis Testing on Large Attributed Graphs [Scalable Data Science] Yun Wang (The University of Hong Kong)*; Chrysanthi Kosyfaki (The University of Hong Kong ); Sihem Amer-Yahia (CNRS); Reynold Cheng ("The University of Hong Kong, China")

Densest Multipartite Subgraph Search in Heterogeneous Information Networks Lu Chen (Swinburne University of Technology)*; Chengfei Liu (Swinburne University of Technology); Rui Zhou (Swinburne University of Technology); Kewen Liao (Australian Catholic University); Jiajie Xu (Soochow University); Jianxin Li (Deakin University)

Efficient kNN Search in Public Transportation Networks Qingshuai Feng (University of New South Wales)*; Junhua Zhang (University of New South Wales); Wenjie Zhang (University of New South Wales); Lu Qin (UTS); Ying Zhang (University of Technology Sydney); Xuemin Lin (Shanghai Jiaotong University)

F1

Data Mining and Analytics

Chair: Xiangyu Ke (Zhejiang University)

Scaling Package Queries to a Billion Tuples via Hierarchical Partitioning and Customized Optimization Anh Le Xuan Mai (New York University Abu Dhabi)*; Pengyu Wang (New York University Abu Dhabi); Azza Abouzied (New York University Abu Dhabi); Matteo Brucato (Microsoft Research); Peter Haas (University of Massachusetts Amherst); Alexandra Meliou (University of Massachusetts Amherst)

Cache-Efficient Top-k Aggregation over High Cardinality Large Datasets Tarique Siddiqui (Microsoft Research)*; Vivek Narasayya (Microsoft); Marius Dumitru (Microsoft); Surajit Chaudhuri (Microsoft)

Maximum Balanced $(k, \epsilon)$-Bitruss Detection in Signed Bipartite Graph Kai Hiu Chung (Hong Kong University of Science and Technology)*; Alexander Zhou (Hong Kong University of Science and Technology); Yue Wang (Shenzhen Institute of Computing Sciences); Lei Chen (Hong Kong University of Science and Technology)

TERI: An Effective Framework for Trajectory Recovery with Irregular Time Intervals Yile Chen (Nanyang Technological University)*; Gao Cong (Nanyang Technological Univesity); Cuauhtemoc Anda (DataSpark)

Data-Driven Insight Synthesis for Multi-Dimensional Data Junjie Xing (University of Michigan )*; Xinyu Wang (University of Michigan); H. V. Jagadish (University of Michigan)

DynaHB: A Communication-Avoiding Asynchronous Distributed Framework with Hybrid Batches for Dynamic GNN Training Zhen Song (Northeastern University)*; Yu Gu (Northeastern University); Qing Sun (Northeastern University); Tianyi Li (Aalborg University); Yanfeng Zhang (Northeastern University); Yushuai Li (Aalborg University); Christian S. Jensen (Aalborg University); Ge Yu (Northeastern University)

F2

Data Mining and Analytics

Chair: Chrysanthi Kosyfaki (The University of Hong Kong)

TSGBench: Time Series Generation Benchmark Yihao Ang (National University of Singapore); Qiang Huang (National University of Singapore)*; Yifan Bao (National University of Singapore); Anthony K. H. Tung (NUS); Zhiyong Huang (NUS School of Computing)

An Experimental Evaluation of Anomaly Detection in Time Series [Experiment, Analysis & Benchmark] Aoqian Zhang (Beijing Institute of Technology)*; Shuqing Deng (Beijing Institute of Technology); Dongping Cui (Beijing Institute of Technology); Ye Yuan ( Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

Raising the ClaSS of Streaming Time Series Segmentation Arik Ermshaus (Humboldt-Universität zu Berlin)*; Patrick Schäfer (Humboldt-Universität zu Berlin); Ulf Leser (Humboldt-Universität zu Berlin)

DIDS: Double Indices and Double Summarizations for Fast Similarity Search Han Hu (Harbin Institute of Technology); Jiye Qiu (Harbin Institute of Technology); Hongzhi Wang (Harbin Institute of Technology)*; Bin Liang (Harbin Institute of Technology); Songling Zou (Harbin Institute of Technology)

Efficient Stochastic Routing in Path-Centric Uncertain Road Networks Chenjuan Guo (ECNU); Ronghui Xu (East China Normal University)*; Bin Yang (East China Normal University); Yuan Ye (Aalborg University); Tung Kieu (Aalborg University); Yan Zhao (Aalborg University); Christian S. Jensen (Aalborg University)

AutoTSAD: Unsupervised Holistic Anomaly Detection for Time Series Data Sebastian Schmidl (Hasso Plattner Institute, University of Potsdam)*; Felix Naumann (Hasso Plattner Institute, University of Potsdam); Thorsten Papenbrock (Philipps University of Marburg)

F3

Data Mining and Analytics


A Multi-Scale Decomposition MLP-Mixer for Time Series Analysis Shuhan Zhong (The Hong Kong University of Science and Technology)*; Sizhe SONG (The Hong Kong University of Science and Technology); Weipeng Zhuo (BNU-HKBU United International College); Guanyao Li (Guangzhou Urban Planning and Design Survey Research Institute); Yang Liu (Guangzhou Urban Planning and Design Survey Research Institute); S.-H. Gary Chan (The Hong Kong University of Science and Technology)

BigST: Linear Complexity Spatio-Temporal Graph Neural Network for Traffic Forecasting on Large-Scale Road Networks [Scalable Data Science (SDS)] Jindong Han (The Hong Kong University of Science and Technology)*; Weijia Zhang (HKUST(GZ)); Hao Liu (HKUST); Tao Tao (DiDi); Naiqiang Tan (Didi Chuxing); Hui Xiong (Hong Kong University of Science and Tech)

Multiple Time Series Forecasting with Dynamic Graph Modeling kai zhao (AAU)*; Chenjuan Guo (ECNU); Peng Han (Aalborg University); Miao Zhang (Aalborg University); Yunyao Cheng (Aalborg University); Bin Yang (Aalborg University)

Weakly Guided Adaptation for Robust Time Series Forecasting Yunyao Cheng (Aalborg University)*; Peng Chen (East China Normal University); Chenjuan Guo (ECNU); kai zhao (AAU); Qingsong Wen (Alibaba Group U.S.); Bin Yang (Aalborg University); Christian S. Jensen (Aalborg University)

ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection Yuhang Chen (Peking University); Chaoyun Zhang (Microsoft)*; Minghua Ma (Microsoft Research); Yudong Liu (Peking University); Ruomeng Ding (Microsoft); Bowen Li (Tsinghua University); Shilin He (Microsoft); Saravan Rajmohan (Microsoft); Qingwei Lin (Microsoft Research); Dongmei Zhang (Microsoft Research Asia)

Co-movement Pattern Mining from Videos Dongxiang Zhang (Zhejiang University)*; Teng Ma (Zhejiang University); junnan hu (zhejiang university); Yijun Bei (Zhejiang University); Kian-Lee Tan (National University of Singapore); Gang Chen (Zhejiang University)

Efficient Discovery of Significant Patterns with Few-Shot Resampling Leonardo Pellegrina (University of Padova); Fabio Vandin (University of Padova)*

DAFDiscover: Robust Mining Algorithm for Dynamic Approximate Functional Dependencies on Dirty Data Xiaoou Ding (Harbin Institute of Technology); Yixing Lu (Harbin Institute of Technology); Hongzhi Wang (Harbin Institute of Technology)*; Chen Wang (" Tsinghua University, China"); Yida Liu (Harbin Institute of Technology); Jianmin Wang ("Tsinghua University, China")

F4

Data Mining and Analytics

Chair: Zhongle Xie (Zhejiang University)

Saving Money for Analytical Workloads in the Cloud Tapan Srivastava (The University of Chicago)*; Raul Castro Fernandez (The University of Chicago)

Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang (University of California, Riverside)*; Xiangyun Ding (University of California, Riverside); Yan Gu (UC Riverside); Yihan Sun (University of California, Riverside)

Optimal Matrix Sketching over Sliding Windows Hanyan Yin (Renmin University of China)*; Dongxie Wen (Renmin University of China); Jiajun Li (Renmin University of China); Zhewei Wei (Renmin University of China); Xiao Zhang (Renmin University of China ); Zengfeng Huang (Fudan University); Feifei Li (Alibaba Group)

OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams [Experiment, Analysis & Benchmark] Yiqun Diao (National University of Singapore)*; Yutong Yang (National University of Singapore); Qinbin Li (UC Berkeley); Bingsheng He (National University of Singapore); mian lu (4Paradigm Inc.)

Efficient Differential Dependency Discovery Shulei Kuang (Fudan University); Honghui Yang (Fudan University); Zijing Tan (Fudan University)*; Shuai Ma (Beihang University)

Nuhuo: An Effective Estimation Model for Traffic Speed Histogram Imputation on A Road Network Haitao Yuan (Nanyang Technological University)*; Gao Cong (Nanyang Technological Univesity); Guoliang Li (Tsinghua University)

F5

Information Integration and Data Quality

Chair: Ziawasch Abedjan (TU Berlin)

LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes Yuhao Deng (Beijing Institute of Technology); Chengliang Chai (Beijing Institute of Technology)*; Lei Cao (University of Arizona/MIT); Qin Yuan (Beijing Institute of Technology); Siyuan Chen (Beijing Institute of Technology); Yanrui Yu (Beijing Institute of Technology); Zhaoze Sun (Beijing Institute of Technology); Junyi Wang (Beijing Institute of Technology); Jiajun Li (Beijing Institute of Technology); Ziqi Cao (Beijing Institute of Technology); Kaisen Jin (Beijing Institute of Technology); Chi Zhang (Beijing Institute of Technology); Yuqing Jiang (Beijing Institute of Technology); Yuanfang Zhang (Beijing Institute of Technology); Yu-Ping Wang (Beijing Institute of Technology); Ye Yuan ( Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology); Nan Tang (HKUST (GZ))

Automatic Data Repair: Are We Ready to Deploy? [Experiment, Analysis & Benchmark] Wei Ni (Zhejiang University); Xiaoye Miao (Zhejiang University)*; Xiangyu Zhao (City University of Hong Kong); Yangyang Wu (Zhejiang University); Shuwei Liang (Zhejiang University); Jianwei Yin (Zhejiang University)

Efficient Validation of SHACL Shapes with Reasoning JIN KE (Ruhr-Universtät Bochum); Zenon G Zacouris (Ruhr Universität Bochum); Maribel Acosta (Technische Universität München)*

ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models Benjamin Feuer (New York University); Yurong Liu (New York University); Chinmay Hegde (New York University); Juliana Freire (New York University)*

Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution Shiwen Wu (The Hong Kong University of Science and Technology)*; Qiyu Wu (The University of Tokyo); Honghua Dong (University of Toronto); Wen Hua (The Hong Kong Polytechnic University); Xiaofang Zhou (Hong Kong University of Sci and Tech)

ReCG: Bottom-Up JSON Schema Discovery Using a Repetitive Cluster-and-Generalize Framework Joohyung Yun (POSTECH); Byungchul Tak (Kyungpook National University); Wook-Shin Han (POSTECH)*

F6

Information Integration and Data Quality


Sparcle: Boosting the Accuracy of Data Cleaning Systems through Spatial Awareness Yuchuan Huang (University of Minnesota - Twin Cities); Mohamed Mokbel (University of Minnesota - Twin Cities)*

Win-Win: On Simultaneous Clustering and Imputing over Incomplete Data Yu Sun (Nankai University); Jingyu Zhu (Nankai university); Xiao Xu (Ant Group); xian xu (Ant Group); Yuyao Sun (Ant group); Shaoxu Song (Tsinghua University)*; Xiang Li (Ping An Technology); Xiaojie Yuan (Nankai Univeristy)

Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation Xiao Li (Roskilde University)*; Huan Li (Zhejiang University); Hua Lu (Roskilde University); Christian S. Jensen (Aalborg University); Varun Pandey (TU Berlin); Volker Markl (Technische Universität Berlin)

MisDetect: Iterative Mislabel Detection using Early Loss Yuhao Deng (Beijing Institute of Technology)*; Chengliang Chai (Beijing Institute of Technology); Lei Cao (University of Arizona/MIT); Nan Tang (Qatar Computing Research Institute, HBKU); Jiayi Wang (Tsinghua University); Ju Fan (Renmin University of China); Ye Yuan ( Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

Rapidash: Efficient Detection of Constraint Violations Zifan Liu (University of Wisconsin-Madison); Shaleen Deep (Microsoft Gray Systems Lab)*; Anna Fariha (University of Utah); Fotis Psallidas (Microsoft); Ashish Tiwari (Microsoft); Avrilia Floratou (Microsoft)

ZIP: Lazy Imputation during Query Processing Yiming Lin (University of California, Berkeley)*; Sharad Mehrotra (U.C. Irvine)

Enriching Relations with Additional Attributes for ER Mengyi Yan (Beihang University); Wenfei Fan (Univ. of Edinburgh ); Yaoshu Wang (Shenzhen Institute of Computing Sciences, Shenzhen University); Min Xie (Shenzhen Institute of Computing Sciences )*

SparqLog: A System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog [Experiment, Analysis and Benchmark] Renzo Angles (Universidad de Talca); Georg Gottlob (Oxford University); Aleksandar Pavlovic (TU Wien); Reinhard Pichler (TU Wien); Emanuel Sallinger (TU Wien)*

F7

Distributed Database Systems

Chair: Peter Pietzuch (Imperial College London)

Caerus: Low-Latency Distributed Transactions for Geo-Replicated Systems Joshua T Hildred (University of Waterloo )*; Michael Abebe (University of Waterloo); Khuzaima Daudjee (University of Waterloo)

Partition, Don't Sort! Compression Boosters for Cloud Data Ingestion Pipelines Patrick Hansert (RPTU Kaiserslautern-Landau)*; Sebastian Michel (RPTU Kaiserslautern-Landau)

Agile-Ant: Self-managing Distributed Cache Management for Cost Optimization of Big Data Applications Hani Al-Sayeh (TU Ilmenau)*; Muhammad Attahir Jibril (TU Ilmenau); Kai-Uwe Sattler (TU Ilmenau)

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD Geoffrey X. Yu (Massachusetts Institute of Technology)*; Ziniu Wu (Massachusetts Institute of Technology); Ferdinand Kossmann (MIT); Tianyu Li (MIT CSAIL); Markos Markakis (Massachusetts Institute of Technology); Amadou L Ngom (MIT); Samuel Madden (MIT); Tim Kraska (MIT)

Fast Commitment for Geo-Distributed Transactions via Decentralized Co-coordinators Zihao Zhang (East China Normal University); Huiqi Hu (East China Normal University)*; Xuan Zhou (East China Normal University); Yaofeng Tu (ZTE Corporation); Weining Qian (East China Normal University); Aoying Zhou (East China Normal University )

Solver-In-The-Loop Cluster Resource Management for Database-as-a-Service Arnd Christian König (Microsoft)*; Yi Shan (Microsoft); Karan Newatia (University of Pennsylvania ); Luke Marshall (Microsoft Research); Vivek Narasayya (Microsoft)

F8

Information Integration and Data Quality

Chair: Hongzhi Wang (Harbin Institute of Technology)

ZeroEA: A Zero-Training Entity Alignment Framework via Pre-Trained Language Model Nan Huo (The University of Hong Kong)*; Reynold Cheng ("The University of Hong Kong, China"); Ben Kao (University of Hong Kong); Wentao Ning (The University of Hong Kong); Nur Al Hasan Haldar (The University of Western Australia); Xiaodong Li (The University of Hong Kong); Jinyang LI (The University of Hong Kong); Mohammad Matin Najafi (The University of Hong Kong); Tian Li (TCL Corporate Research (Hong Kong) Co., Limited); Ge Qu (THE UNIVERSITY OF HONG KONG)

Are Large Language Models a Good Replacement of Taxonomies? [Experiment, Analysis & Benchmark] Yushi Sun (Hong Kong University of Science and Technology)*; XIN Hao (HKUST); Kai Sun (Meta Reality Labs); Yifan Xu (Meta); Xiao Yang (Meta Platforms); Xin Luna Dong (amazon.com, fb.com); Nan Tang (HKUST (GZ)); Lei Chen (Hong Kong University of Science and Technology)

VeriDKG: A Verifiable SPARQL Query Engine for Decentralized Knowledge Graphs Enyuan Zhou (The Hong Kong Polytechnic University)*; Song Guo (The Hong Kong University of Science and Technology); Zicong Hong (The Hong Kong Polytechnic University ); Christian S. Jensen (Aalborg University); Yang Xiao (Xidian University); Dalin Zhang (Aalborg University); Jinwen Liang (The Hong Kong Polytechnic University); Qingqi Pei (Xidian University)

Efficient and Reliable Estimation of Knowledge Graph Accuracy Stefano Marchesin (Università di Padova)*; Gianmaria Silvello (University of Padova)

Outlier Summarization via Human Interpretable Rules Yuhao Deng (Beijing Institute of Technology)*; Yu Wang (University of California, San Diego); Lei Cao (University of Arizona/MIT); Lianpeng N/A Qiao (Beijing Institute of Technology); Yu-Ping Wang (Beijing Institute of Technology); Xu Jingzhe (Beijing Institute of Technology ); Yizhou Yan (Worcester Polytechnic Institute); Samuel Madden (MIT)

The Dawn of Natural Language to SQL: Are We Fully Ready? [Experiment, Analysis & Benchmark (EA&B)] Boyan Li (HKUST(GZ)); Yuyu Luo (HKUST (GZ))*; Chengliang Chai (Beijing Institute of Technology); Guoliang Li (Tsinghua University); Nan Tang (HKUST (GZ))

Searching Data Lakes for Nested and Joined Data Yi Zhang (AWS AI Labs)*; Peter Chen (Massachusetts Institute of Technology); Zack Ives (University of Pennsylvania)

Fainder: A Fast and Accurate Index for Distribution-Aware Dataset Search Lennart Behme (Technische Universität Berlin)*; Sainyam Galhotra (Cornell University); Kaustubh Beedkar (Indian Institute of Technology Delhi); Volker Markl (Technische Universität Berlin)

G1

User Interfaces

Chair: Yuyu Luo (HKUST (Guangzhou))

Counterfactual Explanation of the Shapley Value in Data Coalitions Michelle Si (Duke University)*; Jian Pei (Simon Fraser University)

LION: Fast and High-Resolution Network Kernel Density Visualization Tsz Nam Chan (Shenzhen University)*; Rui Zang (Hong Kong Baptist University ); Bojian Zhu (Hong Kong Baptist University); Leong Hou U (University of Macau); Dingming Wu (Shenzhen University); Jianliang Xu (Hong Kong Baptist University)

Visualization-aware Time Series Min-Max Caching with Error Bound Guarantees Stavros Maroulis (ATHENA Research Center)*; Vassilis Stamatopoulos (Athena Research Center); George Papastefanatos (ATHENA Research Center); Manolis Terrovitis (Athena Research Center)

ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems [Benchmark Paper] Yi Zhang (zurich university of applied sciences); Jan M Deriu (ZHAW); George Katsogiannis-Meimarakis (Athena Research Center); Catherine M Kosten (ZHAW); Georgia Koutrika (ATHENA Research Center); Kurt Stockinger (ZHAW Zurich University of Applied Sciences)*

HAIChart: Human and AI Paired Visualization System Yupeng Xie (The Hong Kong University of Science and Technology (Guangzhou)); Yuyu Luo (HKUST (GZ))*; Guoliang Li (Tsinghua University); Nan Tang (HKUST (GZ))

Texera: A System for Collaborative and Interactive Data Analytics Using Workflows [Scalable Data Science] Zuozhi Wang (U C IRVINE)*; Yicong Huang (UC Irvine); Shengquan Ni (U C Irvine); Avinash Kumar (U C IRVINE); Sadeem Alsudais (King Saud University); Xiaozhen Liu (University of California, Irvine); Xinyuan Lin (University of California Irvine); Yunyan Ding (UCI); Chen Li (UC Irvine)

G2

Novel Database Architectures

Chair: Tianzheng Wang (Simon Fraser University)

CXL and the Return of Scale-Up Database Engines Alberto Lerner (University of Fribourg)*; Gustavo Alonso (ETHZ)

Breathing New Life into An Old Tree: Resolving Logging Dilemma of B+-tree on Modern Computational Storage Drives Kecheng HUANG (The Chinese University of Hong Kong)*; Zhaoyan Shen (Shandong University); Zili Shao (The Chinese University of Hong Kong); Tong Zhang (Rensselaer Polytechnic Institute, ScaleFlux Inc.); Feng Chen (Louisiana State University)

FluidKV: Seamlessly Bridging the Gap between Indexing Performance and Memory-Footprint on Ultra-Fast Storage Ziyi Lu (Huazhong University of Science and Technology); Qiang Cao (Huazhong University of Science and Technology)*; Hong Jiang (Department of Computer Science and Engineering, University of Texas at Arlington, USA); Yuxing Chen (Tencent); JIE YAO (Huazhong University of Science and Technology); Anqun Pan (Tencent Inc., China)

DDS: DPU-optimized Disaggregated Storage Qizhen Zhang (University of Toronto)*; Philip A Bernstein (Microsoft Research); Badrish Chandramouli (Microsoft Research); Jason Hu (University of Toronto); yiming zheng (univetsity of toronto)

OLAP on Modern Chiplet-Based Processors Alessandro Fogli (Imperial College London)*; Bo Zhao (Aalto University); Peter Pietzuch (Imperial College London); Maximilian Bandle (TUM); Jana Giceva (TU Munich)

Index Advisors on Quantum Platforms Manish Kesarwani (IBM Research Lab)*; Jayant R Haritsa (Indian Institute of Science)

G3

Query Processing

Chair: Rihan Hai (Delft University of Technology)

Sample-Efficient Cardinality Estimation Using Geometric Deep Learning Silvan Reiner (University of Konstanz)*; Michael Grossniklaus (University of Konstanz)

BOSS - An Architecture for Database Kernel Composition Hubert Mohr-Daurat (Imperial College London)*; Xuan Sun (Imperial College London); Holger Pirk (Imperial College)

Quantum-Inspired Digital Annealing for Join Ordering Manuel Schönberger (Technical University of Applied Sciences Regensburg)*; Immanuel Trummer (Cornell University); Wolfgang Mauerer (Technical University of Applied Sciences Regensburg / Siemens AG)

Window Function Expression: Let the Self-join Enter Radim Baca (VSB - Technical University of Ostrava)*

Sampling Methods for Inner Product Sketching Majid Daliri (New York University ); Juliana Freire (New York University); Christopher Musco (New York University)*; Aécio Santos (New York University); Haoxiang Zhang (New York University)

Robust Best Point Selection under Unreliable User Feedback Qixu Chen (HKUST)*; Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)

Efficient Enumeration of Recursive Plans in Transformation-based Query Optimizers Amela Fejza (Inria); Pierre Genevès (CNRS)*; Nabil Layaïda (Inria)

TenGraph: A Tensor-Based Graph Query Engine Guanghua Li (The Hong Kong University of Science and Technology (Guangzhou))*; Hao Zhang (The Chinese University of Hong Kong); Xibo Sun (Hong Kong University of Science and Technology); Qiong Luo (Hong Kong University of Science and Technology); Yuanyuan Zhu (Wuhan University)

G4

Novel Database Architectures

Chair: Holger Pirk (Imperial College London)

BonsaiKV: Towards Fast, Scalable, and Persistent Key-Value Stores with Tiered, Heterogeneous Memory System Miao Cai (Nanjing University of Aeronautics and Astronautics)*; Junru Shen (Hohai University); Yifan Yuan (Intel Labs); Zhihao Qu (Hohai University); Baoliu Ye (NJU)

GPU Database Systems Characterization and Optimization [Experiment, Analysis & Benchmark] Jiashen Cao (Georgia Tech)*; Rathijit Sen (Microsoft); Matteo Interlandi (Microsoft); Joy Arulraj (Georgia Tech); Hyesoon Kim (Georgia Tech)

Sorting on Byte-Addressable Storage: The Resurgence of Tree Structure Ying Zheng (National University of Singapore)*; Kian-Lee Tan (National University of Singapore)

OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates Wieger R. Punter (TU Eindhoven)*; Odysseas Papapetrou (TU Eindhoven); Minos Garofalakis (ATHENA Research Centre & Technical University of Crete)

CIVET: Exploring Compact Index for Variable-Length Subsequence Matching on Time Series Haoran Xiong (Fudan University); Hang Zhang (Fudan University); Zeyu Wang (Fudan University); Zhenying He (Fudan University)*; Peng Wang (" Fudan University, China"); X. Sean Wang (Fudan University)

Single Update Sketch with Variable Counter Structure Dimitrios Melissourgos (University of Florida); Haibo Wang (University of Kentucky)*; Shigang Chen (University of Florida); Chaoyi Ma (University of Florida); Shiping Chen (University of Shanghai for Science and Technology)

G5

Languages


Relational Query Synthesis ⋈ Decision Tree Learning Aaditya Naik (University of Pennsylvania)*; Aalok Thakkar (University of Pennsylvania); Adam Stein (University of Pennsylvania); Rajeev Alur (University of Pennsylvania ); Mayur Naik (University of Pennsylvania)

Complex Event Recognition with Symbolic Register Transducers Elias Alevizos (NCSR'D)*; Alexander Artikis (University of Piraeus); Georgios Paliouras (NCSR "Demokritos")

QED: A Powerful Query Equivalence Decider for SQL Shuxian Wang (University of California, Berkeley)*; Sicheng Pan (UC Berkeley); Alvin Cheung (University of California, Berkeley)

Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates Zhuoxing Zhang (The University of Auckland); Sebastian Link (University of Auckland)*

G6

Graph Learning

Chair: Xin Wang (Tianjin University)

TIGER: Training Inductive Graph Neural Network for Large-scale Knowledge Graph Reasoning Kai Wang (Nanyang Technological University); Yuwei XU (Nanyang Technological University); Siqiang Luo (Nanyang Technological University)*

Billion-Scale Bipartite Graph Embedding: A Global-Local Induced Approach [Scalable Data Science] Xueyi Wu (East China Normal University); Yuanyuan Xu (University of New South Wales)*; Wenjie Zhang (University of New South Wales); Ying Zhang (University of Technology Sydney)

NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams Chaoyi Chen (Northeastern University)*; Dechao Gao (Northeastern University); Yanfeng Zhang (Northeastern University); Qiange Wang (National University of Singapore); Zhenbo Fu (Northeastern University); Xuecang Zhang (Huawei); Junhua Zhu (Huawei); Yu Gu (Northeastern University); Ge Yu (Northeastern University)

ETC: Efficient Training of Temporal Graph Neural Networks over Large-scale Dynamic Graphs Shihong Gao (The Hong Kong University of Science and Technology)*; Yiming Li (Hong Kong University of Science and Technology); Yanyan Shen (Shanghai Jiao Tong University); Yingxia Shao (BUPT); Lei Chen (Hong Kong University of Science and Technology)

XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store Dahai Tang (Hunan University); Jiali Wang (Shanghai Jiao Tong University); Rong Chen (Shanghai Jiao Tong University)*; Lei Wang (Alibaba Group); Wenyuan Yu (Alibaba Group); Jingren Zhou (Alibaba Group); Kenli Li (Hunan University)

A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphs [Experiment, Analysis & Benchmark] LIANG Zhicheng (City University of Hong Kong); Yu Yang (City University of Hong Kong); Xiangyu Ke (Zhejiang University)*; Xiaokui Xiao (National University of Singapore); Yunjun Gao (Zhejiang University)

LightDiC: A Simple yet Effective Approach for Large-scale Digraph Representation Learning Xunkai Li (Beijing Institute of Technology)*; Meihao Liao (Beijing Institute of Technology); Zhengyu Wu (Beijing Institute of Technology); Daohan Su (Beijing Institute of technology); Wentao Zhang (Peking University); Ronghua Li (Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

NeutronOrch: Rethinking Sample-based GNN Training under CPU-GPU Heterogeneous Environments Xin Ai (Northeastern University)*; Qiange Wang (National University of Singapore); Chunyu Cao (Northeastern University); Yanfeng Zhang (Northeastern University); Chaoyi Chen (Northeastern University); Hao Yuan (Northeastern University); Yu Gu (Northeastern University); Ge Yu (Northeastern University)

G7

Graph Learning

Chair: Yanfeng Zhang (Northeastern University, China)

Fight Fire with Fire: Towards Robust Graph Neural Networks on Dynamic Graphs via Actively Defense Haoyang Li (The Hong Kong University of Science and Technology)*; Shimin Di (The Hong Kong University of Science and Technology); Calvin Li (Evernorth); Lei Chen (Hong Kong University of Science and Technology); Xiaofang Zhou (The Hong Kong University of Science and Technology)

GENTI: GPU-powered Walk-based Subgraph Extraction for Scalable Representation Learning on Dynamic Graphs [Scalable Data Science] Zihao Yu (Nanyang Technological University)*; Ningyi Liao (Nanyang Technological University ); Siqiang Luo (Nanyang Technological University)

Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective [Experiment, Analysis & Benchmark] Hao Yuan (Northeastern University)*; Yajiong Liu (Northeastern University); Yanfeng Zhang (Northeastern University); Xin Ai (Northeastern University); Qiange Wang (National University of Singapore); Chaoyi Chen (Northeastern University); Yu Gu (Northeastern University); Ge Yu (Northeastern University)

Eliminating Data Processing Bottlenecks in GNN Training over Large Graphs via Two-level Feature Compression Yuxin Ma (University of Science and Technology of China); Ping Gong (UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA)*; Tianming Wu (University of Science and Technology of China); Jiawei Yi (University of Science and Technology of China); Chengru Yang (USTC); Cheng Li (USTC); Qirong Peng (OPPO); Guiming Xie (OPPO); Yongcheng Bao (Oppo Co., Ltd.); Haifeng Liu (OPPO); Yinlong Xu (University of Science and Technology of China )

OUTRE: An OUT-of-core De-REdundancy GNN Training Framework for Massive Graphs within A Single Machine Zeang Sheng (Peking University)*; Wentao Zhang (Peking University); Yangyu Tao (Tencent); Bin Cui (Peking University)

FedGTA: Topology-aware Averaging for Federated Graph Learning Xunkai Li (Beijing Institute of Technology)*; Zhengyu Wu (Beijing Institute of Technology); Wentao Zhang (Mila); Yinlin Zhu (Sun Yat-sen University); Ronghua Li (Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

G8

Novel Database Architectures


Efficient Placement of Decomposable Aggregation Functions for Stream Processing over Large Geo-Distributed Topologies Xenofon Chatziliadis (Technische Universität Berlin)*; Eleni Tzirita Zacharatou (IT University of Copenhagen); Alphan Eracar (Technical University of Berlin); Steffen Zeuch (TU Berlin); Volker Markl (Technische Universität Berlin)

Distance-based Outlier Query Optimization in Apache IoTDB Yunxiang Su (Tsinghua University); Shaoxu Song (Tsinghua University)*; Xiangdong Huang (Tsinghua University); Chen Wang (Timecho Limited); Jianmin Wang ("Tsinghua University, China")

Enhancing Accuracy for Super Spreader Identification in High-Speed Data Streams Haibo Wang (University of Kentucky)*

Accelerating Merkle Patricia Trie with GPU Yangshen Deng (Southern University of Science and Technology); Muxi Yan (Southern University of Science and Technology); Bo Tang (Southern University of Science and Technology)*

Optimizing Video Selection LIMIT Queries With Commonsense Knowledge Wenjia He (University of Michigan)*; Ibrahim Sabek (University of Southern California); Yuze Lou (University of Michigan); Michael Cafarella (MIT CSAIL)

Optimizing Video Queries with Declarative Clues Daren Chao (University of Toronto); Yueting Chen (York University ); Nick Koudas (University of Toronto)*; Xiaohui Yu (York University)

TVM: A Tile-based Video Management Framework Tianxiong Zhong (Beijing Institute of Technology); Zhiwei Zhang (Beijing Institute of Technology)*; Guo Lu (Shanghai Jiao Tong University); Ye Yuan ( Beijing Institute of Technology); Yu-Ping Wang (Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology)

Spatialyze: A Geospatial Video Analytics System with Spatial-Aware Optimizations Chanwut Kittivorawong (University of California, Berkeley)*; Yongming Ge (University of California Berkeley); Yousef Helal (University of California, Berkeley); Alvin Cheung (University of California, Berkeley)

H1

Data Privacy and Security

Chair: Yang Cao (Tokyo Institute of Technology)

Cryptographically Secure Private Record Linkage Using Locality-Sensitive Hashing RUIDI Wei (University of Waterloo); Florian Kerschbaum (University of Waterloo)*

Confidential Consortium Framework: Secure Multiparty Applications with Confidentiality, Integrity, and High Availability Heidi Howard (Microsoft)*; Fritz Alder (imec-DistriNet, KU Leuven); Edward Ashton (Microsoft); Amaury Chamayou (Microsoft); Sylvan Clebsch (Microsoft); Manuel Costa (Azure Research, Microsoft); Antoine Delignat-Lavaud (Microsoft); Cedric Fournet (Microsoft Research); Andrew Jeffery (University of Cambridge); Matthew Kerner (Microsoft); FOTIOS KOUNELIS (Imperial College London); Markus A Kuppe (Microsoft Research); Julien Maffre (Microsoft); Mark Russinovich (Microsoft); Christoph M. Wintersteiger (Microsoft)

Privacy Amplification via Shuffling: Unified, Simplified, and Tightened Shaowei Wang (Guangzhou University)*; Yun PENG (Guangzhou University); Jin Li (Guangzhou University); Zikai Alex Wen (Hong Kong University of Science and Technology (Guangzhou)); Zhipeng Li (Guangzhou University); Shiyu Yu (Guangzhou University); Di Wang (KAUST); Wei Yang (University of Science and Technology of China)

From Zero to Hero: Detecting Leaked Data through Synthetic Data Injection and Model Querying Biao Wu (National University of Singapore); Qiang Huang (National University of Singapore)*; Anthony K. H. Tung (NUS)

Differentially Private Data Generation with Missing Data Shubhankar Mohapatra (University of Waterloo)*; Jianqiao Zong (University of Waterloo); Florian Kerschbaum (University of Waterloo); Xi He (University of Waterloo)

DP-PQD: Privately Detecting Per-Query Gaps In Synthetic Data Generated By Black-Box Mechanisms Shweta Patwa (Duke University)*; Danyu Sun (Duke University); Amir Gilad (The Hebrew University); Ashwin Machanavajjhala (Duke); Sudeepa Roy (Duke University, USA)

H2

Data Privacy and Security

Chair: Primal Pappachan (Portland State University)

Secure and Verifiable Data Collaboration with Low-Cost Zero-Knowledge Proofs Yizheng Zhu (National University of Singapore); Yuncheng Wu (Renmin University of China); ZHAOJING LUO (Beijing Institute of Technology); Beng Chin Ooi (NUS); Xiaokui Xiao (National University of Singapore)*

Doquet: Differentially Oblivious Range and Join Queries with Private Data Structures Lina Qiu (Boston University)*; Georgios Kellaris (Lerna AI); Nikos Mamoulis (University of Ioannina); Kobbi Nissim (Georgetown University); George Kollios (Boston University)

PriPL-Tree: Accurate Range Query for Arbitrary Distribution under Local Differential Privacy Leixia Wang (Renmin University of China)*; Qingqing Ye (Hong Kong Polytechnic University); Haibo Hu (Hong Kong Polytechnic University); Xiaofeng Meng (Renmin University of China)

HRNet: Differentially Private Hierarchical and Multi-Resolution Network for Human Mobility Data Synthesization Shun Takagi (Kyoto University)*; Li Xiong (Emory University); Fumiyuki Kato (Preferred Networks); Yang Cao (Tokyo Institute of Technology); Masatoshi Yoshikawa (Osaka Seikei University)

Communication Efficient and Provable Federated Unlearning Youming Tao (Shandong University); Cheng-Long Wang (KAUST); Miao Pan (University of Houston); Dongxiao Yu (Shandong University); Xiuzhen Cheng (Shandong University); Di Wang (KAUST)*

LLM-PBE: Assessing Data Privacy in Large Language Models [Experiment, Analysis & Benchmark] Qinbin Li (UC Berkeley)*; Junyuan Hong (University of Texas, Austin); Chulin Xie (University of Illinois at Urbana-Champaign); Jeffrey Tan (UC Berkeley); Rachel Xin (UC Berkeley); Junyi Hou (National University of Singapore); Xavier Yin (UC Berkeley); Zhun Wang (University of California, Berkeley); Dan Hendrycks (UC Berkeley); Zhangyang Wang (University of Texas at Austin); Bo Li (University of Illinois at Urbana–Champaign); Bingsheng He (National University of Singapore); Dawn Song (UC Berkeley)

H3

Data Privacy and Security

Chair: Qinbin Li (National University of Singapore)

Confidence Intervals for Private Query Processing Dajun Sun (Hong Kong University of Science and Technology)*; Wei DONG (Hong Kong University of Science and Technology, Hong Kong); Ke Yi (Hong Kong Univ. of Science and Technology)

AAA: an Adaptive Mechanism for Locally Differential Private Mean Estimation Fei Wei (Alibaba Group)*; Ergute Bao (National University of Singapore); Xiaokui Xiao (National University of Singapore); Yin Yang (Hamad bin Khalifa University); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group")

Privately Answering Queries on Skewed Data via Per-Record Differential Privacy Jeremy Seeman (Tumult Labs); William Sexton (Tumult Labs); David A Pujol (Tumult Labs)*; Ashwin Machanavajjhala (Tumult Labs)

DPSUR: Accelerating Differentially Private Stochastic Gradient Descent Using Selective Update and Release Jie Fu (East China Normal University)*; Qingqing Ye (Hong Kong Polytechnic University); Haibo Hu (Hong Kong Polytechnic University); Zhili Chen (East China Normal University); Lulu Wang (East China Normal University); Kuncan Wang (East China Normal University); Xun Ran (The Hong Kong Polytechnic University (PolyU))

SWAT: A System-Wide Approach to Tunable Leakage Mitigation in Encrypted Data Stores Leqian Zheng (City University of Hong Kong )*; Lei Xu (Nanjing University of Science and Technology); Cong Wang (City University of Hong Kong); Sheng Wang (Alibaba Group); Yuke Hu (Zhejiang University); Feifei Li (Alibaba Group); Kui Ren (Zhejiang University); Zhan Qin (Zhejiang University)

Utility-aware Payment Channel Network Rebalance Wangze Ni (Hong Kong University of Science and Technology); Pengze Chen (Hong Kong University of Science and Technology); Lei Chen (Hong Kong University of Science and Technology); Peng Cheng (East China Normal University)*; Chen Zhang (The Hong Kong Polytechnic University); Xuemin Lin (Shanghai Jiaotong University)

Towards Full Stack Adaptivity in Permissioned Blockchains [Vision Paper] Chenyuan Wu (University of Pennsylvania)*; Mohammad Javad Amiri (Stony Brook University); Haoyun Qin (University of Pennsylvania); Bhavana Mehta (University of Pennsylvania); Ryan Marcus (University of Pennsylvania); Boon Thau Loo (Univ. of Pennsylvania)

Algorithmic Complexity Attacks on Dynamic Learned Indexes Rui Yang (University of Virginia)*; Evgenios M. Kornaropoulos (George Mason University); Yue Cheng (University of Virginia)

H4

Specialized and Domain-Specific Data Management

Chair: Senjuti Basu Roy (New Jersey Institute of Technology)

Query Refinement for Diversity Constraint Satisfaction Jinyang Li (University of Michigan)*; Yuval Moskovitch (Ben Gurion University); Julia Stoyanovich (New York University); H. V. Jagadish (University of Michigan)

Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities Mahdi Erfanian (University of Illinois Chicago)*; H. V. Jagadish (University of Michigan); Abolfazl Asudeh (University of Illinois Chicago)

VeLP: Vehicle Loading Plan Learning from Human Behavior in Nationwide Logistics System [Scalable Data Science] Sijing Duan (Central South University)*; Feng Lyu (Central South University ); Xin Zhu (JD Logistics); Yi Ding (The University of Texas at Dallas); Haotian Wang (JD Logistics); Desheng Zhang (Rutgers University); Xue Liu (McGill University); Yaoxue Zhang (Tsinghua University); Ju Ren (Tsinghua University)

Efficient Dynamic Weighted Set Sampling and Its Extension Fangyuan Zhang (The Chinese University of Hong Kong); Mengxu Jiang (The Chinese University of Hong Kong); Sibo Wang (The Chinese University of Hong Kong)*

Optimizing Collections of Bloom Filters within a Space Budget Gabriel Mersy (University of Chicago)*; Zhuo Wang (university of Chicago); Stavros Sintos (University of Illinois Chicago); Sanjay Krishnan (U Chicago)

RoarGraph: A Projected Bipartite Graph for Efficient Cross-Modal Approximate Nearest Neighbor Search Meng Chen (Fudan University)*; Kai Zhang (Fudan University); Zhenying He (Fudan University); Yinan Jing (Fudan University); X. Sean Wang (Fudan University)

H5

Specialized and Domain-Specific Data Management

Chair: Jianqiu Xu (Nanjing University of Aeronautics and Astronautics)

DecLog: Decentralized Logging in Non-Volatile Memory for Time Series Database Systems Bolong Zheng (Huazhong University of Science and Technology)*; yongyong gao (Huazhong University of Science and Technology); Jingyi Wan (Huazhong University of Science and Technology); lingsen yan (Huazhong University of Science and Technology); Long Hu (Huazhong University of Science and Technology); bo liu ( Tongji Hospital, Huazhong University of Science and Technology); Yunjun Gao (Zhejiang University); Xiaofang Zhou (The Hong Kong University of Science and Technology); Christian S. Jensen (Aalborg University)

Real-time Insertion Operator for Shared Mobility on Time-Dependent Road Networks Zengyang Gong (The Hong Kong University of Science and Technology ); Yuxiang Zeng (Beihang University)*; Lei Chen (Hong Kong University of Science and Technology)

Trajectory Similarity Measurement: An Efficiency Perspective [Experiment, Analysis & Benchmark] Yanchuan Chang (The University of Melbourne); Egemen Tanin (University of Melbourne); Gao Cong (Nanyang Technological Univesity); Christian S. Jensen (Aalborg University); Jianzhong Qi (The University of Melbourne)*

PCSP: Efficiently Answering Label-Constrained Shortest Path Queries in Road Networks Libin Wang (Hong Kong University of Science and Technology)*; Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)

KAMEL: A Scalable BERT-based System for Trajectory Imputation Mashaal Musleh (University of Minnesota)*; Mohamed Mokbel (University of Minnesota - Twin Cities)

TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods xiangfei qiu (East China Normal University); Jilin Hu (Aalborg University)*; Lekui Zhou (Huawei Cloud Computing Technologies); Xingjian Wu (East China Normal University); Junyang Du (East China Normal University); Buang Zhang (ECNU); Chenjuan Guo (ECNU); Aoying Zhou (East China Normal University ); Christian S. Jensen (Aalborg University); Zhenli Sheng (Huawei Technologies Co., Ltd.); Bin Yang (Aalborg University)

H6

Database Performance and Manageability

Chair: Gao Cong (Nanyang Technological University, Singapore)

On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB Chenguang Fang (Tsinghua University); zijie chen (Tsinghua University); Shaoxu Song (Tsinghua University)*; Xiangdong Huang (Tsinghua University); Chen Wang (Timecho Limited); Jianmin Wang ("Tsinghua University, China")

An Efficient Transfer Learning Based Configuration Adviser for Database Tuning Xinyi Zhang (Peking University)*; HONG WU (Alibaba); Yang Li (Peking University); Zhengju Tang (Peking University); Jian Tan (Alibaba); Feifei Li (Alibaba Group); Bin Cui (Peking University)

A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning Chenghao Lyu (University of Massachusetts Amherst)*; Qi Fan (Ecole Polytechnique); Philippe Guyard (University College London); Yanlei Diao (Ecole Polytechnique)

Is Your Learned Query Optimizer Behaving As You Expect? A Machine Learning Perspective [Experiment, Analysis & Benchmark] Claude Lehmann (Zurich University of Applied Sciences)*; Pavel Sulimov (ZHAW); Kurt Stockinger (ZHAW Zurich University of Applied Sciences)

Testing Graph Database Systems via Graph-Aware Metamorphic Relations Zeyang Zhuang (The Chinese University of Hong Kong)*; Penghui Li (The Chinese University of Hong Kong); Pingchuan Ma (HKUST); Wei Meng (The Chinese University of Hong Kong); Shuai Wang (HKUST)

HyBench: A New Benchmark for HTAP Databases Chao Zhang (Tsinghua University); Guoliang Li (Tsinghua University)*; Tao Lv (China Software Testing Center)

Why TPC Is Not Enough: An Analysis of the Amazon Redshift Fleet Alexander van Renen (UTN); Dominik Horn (Amazon Web Services); Pascal Pfeil (Amazon Web Services); Kapil Vaidya (Amazon Web Services); Wenjian Dong (Amazon Web Services); Balakrishnan (Murali) Narayanaswamy (Amazon Web Services); Zhengchun Liu (Amazon Web Services); Gaurav Saxena (Amazon Web Services)*; Andreas Kipf (UTN); Tim Kraska (Amazon Web Services)

B6

Data Mining,Privacy and Security

Chair: Ke Yi (HKUST)

Complex-Path: Effective and Efficient Node Ranking with Paths in Billion-Scale Heterogeneous Graphs [Industry] Jinquan Hang (Rutgers University)*; Zhiqing Hong (Rutgers University); Xinyue Feng (Rutgers University); Guang Wang (Florida State University); Dongjiang Cao (JD Logistics); Jiayang qiao (jingdong); Haotian Wang (JD Logistics); Desheng Zhang (Rutgers University)

Large-Scale Metric Computation in Online Controlled Experiment Platform Tao Xiong (Tencent Inc.)*; Yong Wang (Tencent Inc.)

SecuDB: An In-enclave Privacy-preserving and Tamper-resistant Relational Database Xinying Yang (ByteDance)*; Cong Yue (National University of Singapore); Wenhui Zhang (Bytedance ); Yang Liu (ByteDance); Beng Chin Ooi (NUS); Jianjun Chen (Bytedance)

SecretFlow-SCQL: A Secure Collaborative Query pLatform Wenjing Fang (Ant group )*; Shunde Cao (Antgroup); Guojin Hua (Ant Group); Junming Ma (Ant Group); yongqiang yu (ant group); Qunshan Huang (Ant Group); Jun Feng (Ant Group); Jin Tan (Ant Group); Xiaopeng Zan (antgroup); Pu Duan (Ant Group); Yang Yang (Ant Group); Li Wang (Ant Group); Ke Zhang (Ant Group); Lei Wang (Ant Group)

Differentially Private Stream Processing at Scale [Industry] Bing Zhang (Google)*; Vadym Doroshenko (Google); Peter Kairouz (Google); Thomas Steinke (Google); Abhradeep Thakurta (Google); Ziyin Ma (Google); Eidan Cohen (Google); Himani Apte (Google, LLC); Jodi Spacek (Google)

Membrane - Safe and Performant Data Access Controls in Apache Spark in the Presence of Imperative Code [Industry] Andrei Paduroiu (Amazon Web Services)*; Sungheun Wi (Amazon Web Services); Yan Yan (Amazon AWS); Yaron Burd (Amazon Web Services); Ruhollah A Farchtchi (Amazon); Giovanni Matteo Fumarola (Amazon WebServices)

B3

Database Engines

Chair: Jianbin Qin (Shenzhen University)

Apache TsFile: An IoT-native Time Series File Format [Industry] Xin Zhao (Tsinghua University); Jialin Qiao (Timecho Ltd); Xiangdong Huang (Tsinghua University); Chen Wang (Timecho Limited); Shaoxu Song (Tsinghua University)*; Jianmin Wang ("Tsinghua University, China")

LavaStore: ByteDance’s Purpose-built, High-performance, Cost-effective Local Storage Engine for Cloud Services Hao Wang (ByteDance Inc.); Jiaxin Ou (Bytedance); 明 赵 (Bytedance Inc.); Sheng Qiu (ByteDance); Yizheng Jiao (ByteDance)*; Yi Wang (bytedance); Qizhong Mao (ByteDance); Zhengyu Yang (ByteDance Inc.); Yang Liu (Bytedance); Jianyang Hu (ByteDance); Jingwei Zhang (ByteDance); Jinrui Liu (ByteDance); Jiaqiang Chen (ByteDance); Yong Sheng (ByteDance); Cao Lixun (ByteDance); Heng Zhang (Bytedance); Hongde Li (Bytedance); Ming Li (bytedance); Yue Ma (Bytedance); Lei Zhang (ByteDance); Jian Liu (ByteDance); Guanghui Zhang (ByteDance Inc.); Fei Liu (ByteDance); Jianjun Chen (Bytedance)

Petabyte-Scale Row-Level Operations in Data Lakehouses [Industry] Anton Okolnychyi (Apple)*; Chao Sun (OpenAI); kazuyuki tanimura (Apple); Russell A Spitzer (Apple); Ryan Blue (Tabular); Szehon Ho (Apple); Yufei Gu (Apple); Vishwanath Lakkundi (Apple Inc.); DB Tsai (Apple)

Adaptive and Robust Query Execution for Lakehouses At Scale [Industry] Maryann Xue (Databricks); Yingyi Bu (Databricks)*; Abhishek Somani (Databricks); Wenchen Fan (Databricks); Ziqi Liu (Databricks); Steven Chen (Databricks); Herman van Hovell (Databricks); Bart Samwel (Databricks); Mostafa Mokhtar (Databricks); RK Korlapati (Databricks); Andy Lam (Databricks); Yunxiao Ma (Databricks); Vuk Ercegovac (Databricks); Jiexing Li (Databricks); Alexander Behm (Databricks); Yuanjian Li (Databricks); Xiao Li (Databricks); Sriram Krishnamurthy (Databricks); Amit Shukla (Databricks); Michalis Petropoulos (Databricks); Sameer Paranjpye (Databricks); Reynold Xin (Databricks); Matei Zaharia (Databricks)

Presto’s History-based Query Optimizer Pranjal Shankhdhar (Meta Platforms)*; Feilong Liu (Meta); Jay Narale (Uber Technologies); James Sun (Meta Platform, Inc); Rebecca Schlussel (Meta); Lyublena Antova (Meta)

Simple (yet Efficient) Function Authoring for Vectorized Engines Pedro Pedreira (Meta Platforms Inc.)*; Laith S Sakka (Meta)

B1

Distributed Database Systems

Chair: Jianguo Wang (Purdue University)

PALF: Replicated Write-ahead Logging for Distributed Databases Fusheng Han (OceanBase); Hao Liu (OceanBase); Bin Chen (OceanBase); Debin Jia (OceanBase); Jianfeng Zhou (OceanBase); xuwang Teng (oceanbase); Chuanhui Yang (OceanBase); huafeng xi (oceanbase.com); Wei Tian (OceanBase); Shuning Tao (OceanBase); Sen Wang (OceanBase); Quanqing Xu (OceanBase, Ant Group )*; Zhenkun YANG (OceanBase)

GaussDB: A Cloud-Native Multi-Master Database with Compute-Memory-Storage Disaggregation Guoliang Li (Tsinghua University)*; wengang tian (huawei); Jinyu Zhang (huawei); Ronen Grosman (Huawei); Zongchao Liu (Huawei Technologies Co., Ltd.); Sihao Li (Huawei Tech. Co Ltd)

TDSQL: Tencent Distributed Database System [Industry] Yuxing Chen (Tencent)*; Anqun Pan (Tencent Inc., China); hailin lei (Tencent Inc.); anda ye (Tencent Inc.); Shuo Han (Tencent Inc.); Yan Tang (Tencent Inc.); WEI LU (Renmin University of China); yunpeng chai (renmin university of china); Feng Zhang (Renmin University of China); Xiaoyong Du (Renmin University of China)

Galaxybase: A High Performance Native Distributed Graph Database for HTAP [Industry] Bing Tong (HKUST(GZ))*; Yan Zhou (Zhejiang CreateLink Technology); Chen Zhang (Zhejiang CreateLink Technology); Jianheng Tang (Hong Kong University of Science and Technology); Jing Tang (The Hong Kong University of Science and Technology); Leihong Yang (Zhejiang CreateLink Technology); Qiye Li (Zhejiang CreateLink Technology); Zhongxin Bao (Zhejiang CreateLink Technology); Jia Li (Hong Kong University of Science and Technology); Lei Chen (Hong Kong University of Science and Technology)

Towards Millions of Database Transmission Services in the Cloud [Industrial] Hua Fan (Alibaba Group)*; Dachao Fu (Alibaba Group); Xu Wang (Alibaba Cloud Intelligence); Jiachi Zhang (Alibaba Group); Chaoji Zuo (Alibaba Group); Zhengyi Wu (Alibaba); Miao Zhang (Alibaba Cloud); KANG YUAN (alibaba-inc.com); Xizi Ni (Alibaba Group); Huo Guocheng (Alibaba Group); Wenchao Zhou (Alibaba Group); Feifei Li (Alibaba Group); Jingren Zhou (Alibaba Group)

X-Stor: A Cloud-native NoSQL Database Service with Multi-model Support [Industry] Hongyu Lei (Huazhong University of Science and Technology)*; Chunhua Li (Huazhong University of Science and Technology); Ke Zhou (Huazhong University of Science and Technology); Jianping Zhu (Tencent Inc.); kezhou yan (Tencent); Fen Xiao (Tencent Inc.); Ming Xie (Tencent Inc.); Jiang Wang (Huazhong University of Science and Technology); Shiyu Di (Huazhong University of Science and Technology)

B2

Distributed Database Systems


Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance [Industry] YiXin Wu (ByteDance); Xiuqi Huang (Shanghai Jiao Tong University); wei zhongjia (Bytedance); Hang Cheng (Bytedance); Chaohui Xin (ByteDance Inc.); Zuzhi Chen (ByteDance Inc.); Binbin Chen (ByteDance); Yufei WU (Bytedance Inc.); Hao Wang (ByteDance Inc.); Tieying Zhang (Bytedance); Rui Shi (ByteDance Inc.)*; Xiaofeng Gao (Shanghai Jiaotong University); Yuming Liang (ByteDance Inc.); Pengwei Zhao (Bytedance); Guihai Chen (Shanghai Jiao Tong University)

ResLake: Towards Minimum Job Latency and Balanced Resource Utilization in Geo-distributed Job Scheduling Xin-Chun Zhang (ByteDance Inc.); Aqsa Kashaf (ByteDance)*; Yihan Zou (ByteDance); Wei Zhang (Bytedance); Weibo Liao (ByteDance); Song Haoxiang (Bytedance); Jintao Ye (ByteDance); yakun li (bytedance); Rui Shi (ByteDance Inc.); Yong Tian (ByteDance); FENG WEI (ByteDance); Binbin Chen (ByteDance); Zuzhi Chen (ByteDance Inc.); Tieying Zhang (Bytedance); Yongping Tang (Bytedance)

Transparent Migration from Datastore to Firestore [Industry] Ed Davisson (Google, Inc.)*; Tilo Dickopp (Google); David Gay (Google, Inc.); Eric Karasuda (Google); Ram Kesavan (Google Inc.); Vadim Yushprakh (Google)

DLRover: Resource Optimization for Deep Recommendation Models Training at AntGroup wang qinlong (AntGroup); Tingfeng Lan (Sichuan University); Yinghao Tang (Sichuan University); BO SANG (Ant Group); Ziling Huang (Sichuan University); yiheng du (Sichuan University); Haitao Zhang (Ant Group); Jian Sha (Ant Group); Hui Lu (The University of Texas at Arlington); Yuanchun Zhou (Computer Network Information Center, Chinese Academy of Sciences); Ke Zhang (Ant Group); Mingjie Tang (Sichuan University)*

Cloud Actor-Oriented Database Transactions in Orleans [Application & Experience] Tamer Eldeeb (Columbia University)*; Sebastian C Burckhardt (Microsoft Research); Reuben Bond (Microsoft); Asaf Cidon (Columbia University); Junfeng Yang (Columbia University); Philip A Bernstein (Microsoft Research)

ClickHouse - Lightning Fast Analytics for Everyone [Industry] Robert Schulze (ClickHouse)*; Tom Schreiber (ClickHouse); Ilya Yatsishin (ClickHouse BV); Ryadh Dahimene (ClickHouse); Alexey Milovidov (ClickHouse)

B4

Machine Learning, AI, and Databases

Chair: Zhaojing Luo (Beijing Institute of Technology)

OptScaler: A Collaborative Framework for Robust Autoscaling in the Cloud [Industry] aaron zou (Ant Group); Wei Lu (Ant Group)*; Zhibo Zhu (Ant Group); Xingyu Lu (Ant Group ); Jun Zhou (Ant Services Group ); Xiaojin Wang (Alipay.com Co., Ltd); Kangyu Liu (ant group); Kefan Wang (Ant Group); Renen Sun (Ant Group); wang hai qing (ant group)

SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines [Industry] Shreya Shankar (University of California Berkeley)*; Haotian Li (The Hong Kong University of Science and Technology); Parth Asawa (UC Berkeley); Madelon Hulsebos (UC Berkeley); Yiming Lin (University of California, Berkeley); J.D. Zamfirscu-Pereira (UC Berkeley); Harrison Chase (LangChain); William I Fu-Hinthorn (LangChain); Aditya Parameswaran (University of California, Berkeley); Eugene Wu (Columbia University)

SingleStore-V: An Integrated Vector Database System in SingleStore [Industry] Cheng Chen (SingleStore)*; Chenzhe Jin (Purdue University); Yunan Zhang (Purdue University - West Lafayette); Sasha Podolsky (SingleStore); Chun Wu (SingleStore); Szu-Po Wang (SingleStore); Eric N Hanson (SingleStore); Zhou Sun (SingleStore); Robert Walzer (SingleStore); Jianguo Wang (Purdue University)

A Flexible Forecasting Stack [Industry] Tim Januschowski (Zalando); Yuyang Wang (Amazon); Jan Gasthaus (Meta); Syama Sundar Rangapuram (Amazon); Caner Turkmen (Amazon); Jasper Zschiegner (None); Lorenzo Stella (Amazon Research); Michael Bohlke-Schneider (Amazon Research); Danielle Maddix (Amazon Research ); Konstantinos Benidis (Amazon Research); Alexander Alexandrov (Unaffiliated); Christos Faloutsos (CMU); Sebastian Schelter (BIFOLD & TU Berlin)*

Db2une: Tuning Under Pressure via Deep Learning [Industry] Alexander Bianchi (York University)*; Andrew Chai (York University); Vincent Corvinelli (IBM); Parke Godfrey (York University); Jaroslaw Szlichta (York University and IBM CAS); Calisto Zuzarte (IBM)

AutoTQA: Towards Autonomous Tabular Question Answering through Multi-Agent Large Language Models [Industry] Jun-Peng Zhu (East China Normal University)*; Peng Cai (East China Normal University); Kai Xu (PingCAP); Li Li (PingCAP); Yishen Sun (PingCAP); Shuai Zhou (PingCAP); Haihuang Su (PingCAP); Liu Tang (PingCAP); Qi Liu (PingCAP)

B5

Database Technologies and Management

Chair: Divesh Srivastava (AT&T)

Resource Management in Aurora Serverless [Industry] Bradley Barnhart (Amazon Web Services); Marc Brooker (Amazon); Daniil Chinenkov (Amazon); Tony Hooper (AWS Aurora); Jihoun Im (Amazon); Prakash Chandra Jha (Amazon); Tim Kraska (AWS); Ashok Kurakula (AWS Aurora); Alexey Kuznetsov (Amazon); Grant McAlister (Amazon Web Services (AWS)); Arjun Muthukrishnan (Amazon); Aravinthan Narayanan (Amazon); Douglas Terry (Amazon); Bhuvan Urgaonkar (Amazon and Penn State)*; Jiaming Yan (AWS)

An Examination of CXL Memory Use Cases for In-Memory Database Management Systems using SAP HANA MINSEON AHN (SAP Labs Korea)*; Thomas Willhalm (Intel Deutschland GmbH); Norman May (SAP SE); Donghun Lee (SAP Labs Korea); Suprasad Mutalik Desai (Intel); Daniel Booss (SAP SE); Jungmin Kim (SAP); Navneet Singh (Intel Technology India Pvt Ltd); Daniel Ritter (SAP); Oliver Rebholz (SAP SE)

Lindorm-UWC: An Ultra-Wide-Column Database for Internet of Vehicles [Industry] Qianyu Ouyang (Tsinghua University); chunhui shen (alibaba); Wenlong Yang (Alibaba Group); Peng Yu (Alibaba Group); qiang xiao (Alibaba Group); Jianhui Lei (Alibaba Group); Yadong Chen (Alibaba Group); Qilu Zhong (Alibaba Group); Xiang Wang (AlibabaCloud); yong lin (Alibaba Group); qingyi meng (alibaba); Zhicheng Ji (Alibaba Group); Wei Meng (Alibaba Group); Cen Zheng (Alibaba Group)*; Sheng Wang (Alibaba Group); Dan Pei (Tsinghua University); Wei Zhang (Alibaba Inc.); Feifei Li (Alibaba Group); Jingren Zhou (Alibaba Group)

Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching [Application and Experience] Joshua J Wu (UC Berkeley); Dixin Tang (The University of Texas, Austin)*; Nithin V Chalapathi (UC Berkeley); Tristan Chambers (Berkeley Institute for Data Science at UC Berkeley); Julie Ciccolini (NACDL); Cheryl E Phillips (Stanford University Department of Communication); Lisa Pickoff-White (The California Reporting Project); Aditya Parameswaran (University of California, Berkeley)

SQL has problems. We can fix them: Pipe syntax in SQL [Industry] Jeff Shute (Google, Inc.)*; Shannon Bales (Google, Inc.); Matthew Brown (Google, Inc.); Jean-Daniel Browne (Google, Inc.); Brandon Dolphin (Google, Inc.); Romit Kudtarkar (Google, Inc.); Andrey Litvinov (Google, Inc.); Jingchi Ma (Google, Inc.); John D Morcos (Google Inc.); Michael Shen (Google, Inc.); David Wilhite (Google, Inc.); Xi Wu (Google, Inc.); Lulan Yu (Google, Inc.)

KGFabric: A Scalable Knowledge Graph Warehouse for Enterprise Data Interconnection [Industry] Peng Yi (Ant Group)*; Lei Liang (Ant Group); Zhang Da (Ant Group); Chen Yong (ant group); Jinye Zhu (Ant Group); Xiangyu Liu (Ant Group); Kun Tang (antgroup); Jialin Chen (Ant Group); LIN HAO (AntGroup); Leijie Qiu (Ant Group); Jun Zhou (Ant Services Group )

Tutorial-1


Time-Series Anomaly Detection: Overview and New Trends [tutorial] Qinghua Liu (The Ohio State University)*; Paul Boniol (Inria, Ecole normale supérieure); Themis Palpanas (Université Paris Cité); John Paparrizos (The Ohio State University)

Tutorial-2


Efficient Training of Graph Neural Networks on Large Graphs [tutorial] Yanyan Shen (Shanghai Jiao Tong University)*; Lei Chen (Hong Kong University of Science and Technology); Jingzhi Fang (HKUST); Xin Zhang (Hong Kong University of Science and Technology); Shihong Gao (The Hong Kong University of Science and Technology); Hongbo Yin (HKUST(GZ))

Tutorial-3


A Reproducible Tutorial on Reproducibility in Database Systems Research [tutorial] Tim Fischer (Universität Tübingen); Denis Hirn (Universität Tübingen)*; Gokhan Kul (University of Massachusetts Dartmouth)

Tutorial-4


Fairness in Preference Queries: Social Choice Theories Meet Data Management [tutorial] Senjuti Basu Roy (New Jersey Institute of Technology)*; Baruch Schieber (NJIT); Nimrod Talmon (Ben Gurion University)

Tutorial-5


LLM for Data Management [tutorial] Guoliang Li (Tsinghua University)*; Xuanhe Zhou (Tsinghua); xinyang zhao (Tsinghua university)

Tutorial-6


Consensus in Data Management With Use Cases in Edge-Cloud and Blockchain Systems [tutorial] Faisal Nawab (University of California at Irvine)*; Mohammad Sadoghi (University of California, Davis)

Tutorial-7


Native Distributed Databases: Problems, Challenges and Opportunities [tutorial] Quanqing Xu (OceanBase, Ant Group )*; Chuanhui Yang (OceanBase); Aoying Zhou (East China Normal University )

Tutorial-8


Workload Placement on Heterogeneous CPU-GPU Systems [tutorial] Marcos N. L. Carvalho (Universitat Politècnica de Catalunya)*; Alkis Simitsis (Athena Research Center); Anna Queralt (UPC); Oscar Romero (Universitat Politècnica de Catalunya)

Tutorial-9


Spatial Query Optimization With Learning [tutorial] Xin Zhang (University of California, Riverside)*; Ahmed Eldawy (University of California, Riverside)

Tutorial-10


Composable Data Management: An Execution Overview [tutorial] Pedro Pedreira (Meta Platforms Inc.)*; Deepak Majeti (Ahana); Orri Erling (Meta Platforms)

Demo-Group-A


Spade: A Real-Time Fraud Detection Framework Jiaxin Jiang (National University of Singapore)*; Zhen Zhang (National University of Singapore); Bingqiao Luo (National University of Singapore); Bingsheng He (National University of Singapore); Min Chen (Grab); Wei Yang Wang (Grab); Jia Chen (Grab)

Data-driven Spatiotemporal Simulator for Reinforcement Learning Methods Dingyuan Shi (Beihang University)*; Bingchen Song (First Research Institute of China Aerospace Science and Technology Corporation (First overall design department)); Yuanyuan Zhang (Beihang University); Haolong Yang (Beihang University); Ke Xu (Beihang University)

BFTGym: An Interactive Playground for BFT Protocols Haoyun Qin (University of Pennsylvania)*; Chenyuan Wu (University of Pennsylvania); Mohammad Javad Amiri (Stony Brook University); Ryan Marcus (University of Pennsylvania); Boon Thau Loo (Univ. of Pennsylvania)

DTGraph: Declarative Transformations of Property Graphs Angela Bonifati (Univ. of Lyon); Yann Ramusat (Lyon 1 Univ., Liris CNRS)*; Filip Murlak (University of Warsaw, Poland); Amela Fejza (Inria); Rachid Echahed (CNRS)

MLOS in Action: Bridging the Gap Between Experimentation and Auto-Tuning in the Cloud Brian Kroth (Microsoft)*; Sergiy Matusevych (Microsoft Gray Systems Lab); Rana Alotaibi (Microsoft Gray Systems Lab); Yiwen Zhu (Microsoft); Anja Gruenheid (Microsoft); Yuanyuan Tian (Microsoft Gray Systems Lab)

Snapcase - Regain Control over Your Predictions with Low-Latency Machine Unlearning Sebastian Schelter (University of Amsterdam)*; Stefan Grafberger (University of Amsterdam); Maarten de Rijke (University of Amsterdam)

DiversiNews: Enriching News Consumption with Relevant yet Diverse News Articles Retrieval Yiqun Sun (National University of Singapore); Qiang Huang (National University of Singapore)*; Yanhao Wang (East China Normal University); Anthony K. H. Tung (NUS)

SpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow Dean Light (Technion)*; Ahmad Aiashi (Technion); Mahmoud Diab (Technion); Daniel Nachmias (Technion); Stijn Vansummeren (Hasselt University); Benny Kimelfeld (Technion)

Demonstrating TabEE: Tabular Embedding Explanations Roni Copul (Tel Aviv University); Nave Frost (eBay); Tova Milo (Tel Aviv University); Kathy Razmadze (Tel Aviv University)*

Navigating Data Repositories: Utilizing Line Charts to Discover Relevant Datasets Daomin Ji (RMIT); Hui Luo (University of Wollongong); Zhifeng Bao (RMIT University)*; Shane Culpepper (The University of Queensland)

Graph Association Analyses for Early Drug Discovery Wenfei Fan (Univ. of Edinburgh ); Daji Li (Shenzhen Institute of Computing Science); Peiyu Liang ( Shenzhen Institute of Computing Sciences); Shuhao Liu (Shenzhen Institute of Computing Sciences); Yaoshu Wang (Shenzhen Institute of Computing Sciences, Shenzhen University); Yiming Wang (Shenzhen Institute of Computing Sciences); Min Xie (Shenzhen Institute of Computing Sciences )*; Runjie Zhang ( Shenzhen Institute of Computing Sciences)

Demonstration of MaskSearch: Efficiently Querying Image Masks for Machine Learning Workflows Lindsey Linxi Wei (University of Washington); Chung Yik Edward Yeung (University of Washington); Hongjian Yu (University of Washington); Jingchuan Zhou (University of Washington); Dong He (University of Washington)*; Magdalena Balazinska (UW)

Looking Deeply into the Magic Mirror: An Interactive Analysis of Database Index Selection Approaches Stefan Halfpap (TU Berlin)*; Jan Kossmann (Snowflake Inc.); Rainer Schlosser (Hasso Plattner Institute); Volker Markl (Technische Universität Berlin)

UTOPIA: Automatic Pivot Table Assistant Whanhee Cho (University of Utah)*; Anna Fariha (University of Utah)

TSGAssist: An Interactive Assistant Harnessing LLMs and RAG for Time Series Generation Recommendations and Benchmarking Yihao Ang (National University of Singapore); Yifan Bao (National University of Singapore); Qiang Huang (National University of Singapore)*; Anthony K. H. Tung (NUS); Zhiyong Huang (NUS School of Computing)

CMixing: An Efficient Coin Mixing Platform to Enhance Anonymity in Cryptocurrency Transactions Wangze Ni (Hong Kong University of Science and Technology)*; Yiwei Zhao (Hong Kong Polytechnic University); Pengze Chen (Hong Kong University of Science and Technology); Lei Chen (Hong Kong University of Science and Technology); Peng Cheng (East China Normal University); Chen Zhang (The Hong Kong Polytechnic University)

Demo-Group-B


LucidScript: Bottom-up Standardization for Data Preparation Eugenie Y. Lai (Massachusetts Institute of Technology )*; Yuze Lou (University of Michigan); Brit Youngmann (Technion - Israel institute of technology); Michael Cafarella (MIT CSAIL)

OSSInsight: Scalable GitHub Analysis Ahmad Ghazal (Facebook)*; Zhiyuan Liang (PingCAP); Sunny Bains (PingCAP); Hanumath Maduri (Workday)

IsoVista: Black-box Checking Database Isolation Guarantees Long Gu (Nanjing University); Si Liu (ETH Zurich)*; Tiancheng Xing (Nanjing University); Hengfeng Wei (Nanjing University); Yuxing Chen (Tencent); David A Basin (ETH Zurich)

ImputeVIS: An Interactive Evaluator to Benchmark Imputation Techniques for Time Series Data Mourad Khayati (University of Fribourg)*; Quentin Nater (University of Fribourg); Jacques Pasquier (Dept. of Informatics; University of Fribourg-CH)

An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models Mengzhao Wang (Zhejiang University); Haotian Wu (Zhejiang University); Xiangyu Ke (Zhejiang University, China); Yunjun Gao (Zhejiang University)*; Xiaoliang Xu (Hangzhou Dianzi University); Lu Chen (Zhejiang University)

DBG-TP: A Large Language Model Assisted Query Performance Regression Debugger Victor Giannakouris (Cornell University)*; Immanuel Trummer (Cornell University)

Rodeo: Making Refinements for Diverse Top-k Queries Felix S Campbell (Ben-Gurion University of the Negev)*; Julia Stoyanovich (New York University); Yuval Moskovitch (Ben Gurion University)

QPJVis Demo: Quality-boost Progressive Join Query Processing System Xin Zhang (University of California, Riverside)*; Ahmed Eldawy (University of California, Riverside)

Counterfactual Explanation Analytics: Empowering Lay Users to Take Action Against Consequential Automated Decisions Peter M VanNostrand (WPI)*; Dennis M Hofmann (Worcester Polytechnic Institute); Lei Ma (WPI); Belisha Genin (Worcester Polytechnic Institute ); Randy Huang (Worcester Polytechnic Institute); Elke A Rundensteiner (Worcester Polytechnic Institute)

UniView: A Unified Autonomous Materialized View Management System for Various Databases Zhenrong Xu (Zhejiang University); Pengfei Wang (Zhejiang University); Guoze Xue (Zhejiang University); Qitong Yan (Zhejiang University); Shenghao Gong (Zhejiang University, China); Yelan Jiang (Zhejiang University); Yuren Mao (Zhejiang University); Yunjun Gao (Zhejiang University); Shu Shen (Huawei); Wei Zhang (Huawei); Dan Luo (Huawei); Lu Chen (Zhejiang University)*

A Demonstration of TENDS: Time Series Management System based on Model Selection Yuanyuan Yao (Zhejiang University); Shenjia Dai (Zhejiang University); Yilin Li (Zhejiang University); Lu Chen (Zhejiang University)*; Dimeng Li (Alibaba Group); Yunjun Gao (Zhejiang University); Tianyi Li (Aalborg University)

SEER: An End-to-End Toolkit for Benchmarking Time Series Database Systems in Monitoring Applications Luca Althaus (University of Fribourg); Mourad Khayati (University of Fribourg)*; Abdelouahab Khelifati (University of Fribourg); Anton Dignös (Free University of Bozen-Bolzano, Italy); Djellel Difallah (NYU); Philippe Cudre-Mauroux (University of Fribourg)

Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models Siqiao Xue (Ant Group); Danrui Qi (Simon Fraser University); caigao jiang (HKUST); Fangyin Cheng (JD); Keting Chen (AntGroup); zhiping zhang (alibaba); hongyang zhang (Southwest University of Finance and Economics); Ganglin Wei (Ant Group); Wang Zhao (RUC); Fan Zhou (AntGroup); hong yi (Vmware); Shaodong Liu (meituan); HongJun Yang (Ant Group); faqiang chen (antgroup)*

DeepSketch: A Query Sketching Interface for Deep Time Series Similarity Search Zheng Zhang (Northwestern University); Joey Shao (Northwestern University); Andrew Crotty (Northwestern University)*

Rock: Cleaning Data with both ML and Logic Rules Zian Bao (SICS); bie binbin (Shenzhen Institute of Computing Science); Wenfei Fan (Univ. of Edinburgh ); Daji Li (Shenzhen Institute of Computing Science); Mengyun Li ( Shenzhen Institute of Computing Sciences); Kaiwen Lin ( Shenzhen Institute of Computing Sciences); Wei Lin (Shenzhen Institute of Computing Science); Peijie Liu ( Shenzhen Institute of Computing Sciences); peng liu (Shenzhen Institute of Computing Science); Lv Zhicong (Shenzhen Institute of Computing Science); Mingliang Ouyang (Shenzhen Institute of Computing Science); Chenyang Sun ( Shenzhen Institute of Computing Sciences); tang shuai (Shenzhen Institute of Computing Science); Yaoshu Wang (Shenzhen Institute of Computing Sciences, Shenzhen University)*; Qiyuan Wei (Shenzhen Institute of Computing Sciences); Xiangqian Wu ( Shenzhen Institute of Computing Sciences); Min Xie (Shenzhen Institute of Computing Sciences ); Jing Zhang (Shenzhen Institute of Computing Science); zhao runxiao (Shenzhen Institute of Computing Science); Jie Zhu (Shenzhen Institute of Computing Sciences); Yilin Zhu ( Shenzhen Institute of Computing Sciences)

Clean4TSDB: A Data Cleaning Tool for Time Series Databases Xiaoou Ding (Harbin Institute of Technology); Song YiChen (Harbin Institute of Technology); Hongzhi Wang (Harbin Institute of Technology)*; Donghua Yang (Harbin Institute of Technology); Chen Wang (" Tsinghua University, China"); Jianmin Wang ("Tsinghua University, China")

Demo-Group-C


LakeCompass: An End-to-End System for Table Maintenance, Search and Analysis in Data Lakes Chengliang Chai (Beijing Institute of Technology)*; Yuhao Deng (Beijing Institute of Technology); Yutong Zhan (Beijing Institute of Technology); Ziqi Cao (Beijing Institute of Technology); Yuanfang Zhang (Beijing Institute of Technology); Lei Cao (University of Arizona/MIT); Yu-Ping Wang (Beijing Institute of Technology); Zhiwei Zhang (Beijing Institute of Technology); Ye Yuan ( Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology); Nan Tang (HKUST (GZ))

DOP-SQL: A General-purpose, High-utility, and Extensible Private SQL System Jianzhe Yu (Hong Kong University of Science and Technology)*; Wei Dong (CMU); Juanru FANG (HKUST); Dajun Sun (Hong Kong University of Science and Technology); Ke Yi (Hong Kong Univ. of Science and Technology)

Catcher: A Cache Analysis System for Top-k Pub/Sub Service Baolong Mei (Zhengzhou University); Yafei Li (Zhengzhou University)*; Wei Chen (Zhengzhou University); Linshen Luan (Zhengzhou University); Guanglei Zhu (Zhengzhou University); Yuanyuan Jin (Zhengzhou University); Jianliang Xu (Hong Kong Baptist University)

Optimizing Distributed Tiered Data Storage Systems with DITIS Sotiris Vasileiadis (Cyprus University of Technology); Matthew Paraskeva (Cyprus University of Technology); George Savva (Cyprus University of Technology); Andreas Efstathiou (Cyprus University of Technology); Edson Ramiro Lucas Filho (Cyprus University of Technology); Jianqiang Shen (Huawei Technologies Co., Ltd.); Lun Yang (huawei); Kebo Fu (Huawei Technologies Co., Ltd.); Herodotos Herodotou (Cyprus University of Technology)*

MLN-Dashboard: Modeling, Analysis, Drill-Down, and Visualization of Complex Data Sets using Multilayer Networks Amey Shinde (UT Arlington); Viraj Sabhaya (UT Arlington); Kevin Farokhrouz (UT Arlington); Fariba Irany (University of North Texas); Ali Khan (University of North Texas); Sanjukta Bhowmick (University of North Texas); Abhishek Santra (The Department of Computer Science and Engineering, University of Texas at Arlington )*; Sharma Chakravarthy (University of Texas at Arlington)

VQFT: A Visual Query Approach Based on Full-Text Search for Knowledge Graphs ZhaoZhuo Li (Tianjin University); Xin Wang (Tianjin University)*; Meng Wang (Tongji University); Yajun Yang (Tianjin University); Bohan Li (Nanjing University of Aeronautics and Astronautics); Dong Han (Tianjin academy of fine arts)

CORAL: Collaborative Automatic Labeling System based on Large Language Models Zhen Zhu (Zhejiang University)*; Yibo Wang (Zhejiang University); Shouqing Yang (Zhejiang University); Lin Long (Zhejiang University); Runze Wu (NetEase Fuxi AI Lab); Xiu Tang (Zhejiang University); Junbo Zhao (Zhejiang University); Haobo Wang (Zhejiang University)

DoppelGanger++ in Action: A Database Replay System with Fast Dependency Graph Generation Wonseok Lee (POSTECH); Jaehyun Ha (POSTECH); Wook-Shin Han (POSTECH)*; Changgyoo Park (Databricks); Myunggon Park (SAP Labs Korea); Juhyeng Han (SAP)

CyNetDiff: A Python Library for Accelerated Implementation of Network Diffusion Models Eilot W Robson (UIUC); Dhemath R Reddy (University of Illinois Urbana-Champaign); Abhishek Kumar Umrawal (University of Illinois Urbana-Champaign)*

EncChain: Enhancing Large Language Model Applications with Advanced Privacy Preservation Techniques Zhe Fu (Alibaba Group)*; Mo Sha (Alibaba Group); Yiran Li (Alibaba Group); Huorong Li (Alibaba Group); Yubing Ma (Alibaba Group); Sheng Wang (Alibaba Group); Feifei Li (Alibaba Group)

FairEM360: A Suite for Responsible Entity Matching Nima Shahbazi (University of Illinois at Chicago)*; Mahdi Erfanian (University of Illinois Chicago); Abolfazl Asudeh (University of Illinois Chicago); Fatemeh Nargesian (University of Rochester); Divesh Srivastava (AT&T Chief Data Office)

Retrieval-Based Tabular Data Cleaning Using LLMs and Data Lakes Mohamed Eltabakh (Qatar Foundation)*; Zan Naeem (Qatar Computing Research Institute); Mohammad Shahmeer Ahmad (Qatar Computing Research Institute ); Mourad OUZZANI (Qatar Computing Research Institute, HBKU); Nan Tang (HKUST (GZ))

Mach: Firefighting Time-Critical Issues in Complex Systems Using High-Frequency Telemetry Franco Solleza (Brown University)*; Shihang Li (University of Washington); William H Sun (Brown University); Richard X Tang (Brown University); Malte Schwarzkopf (Brown University); Nesime Tatbul (Intel Labs and MIT); Andrew Crotty (Northwestern University); David E Cohen (Intel); Stan Zdonik (Brown University)

SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches Renzhi Wu (Georgia Institute of Technology)*; Pramod Chunduri (Georgia Institute of Technology); Dristi Shah (Georgia Institute of Technology); Ashmitha Julius Aravind (Georgia Institute of Technology); Ali Payani (Cisco Systems Inc.); Xu Chu (GATECH); Joy Arulraj (Georgia Tech); Kexin Rong (Georgia Institute of Technology)

DataPrice: An Interactive System for Pricing Datasets in Data Marketplaces Yiding Zhu (Zhejiang University); Hongwei Zhang (Zhejiang University); Jiayao Zhang (Zhejiang University); Jinfei Liu (Zhejiang University)*; Kui Ren (Zhejiang University)

Demonstration of the VeriEQL Equivalence Checker for Complex SQL Queries Pinhan Zhao (University of Michigan)*; Yang He (Simon Fraser University); Xinyu Wang (University of Michigan); Yuepeng Wang (Simon Fraser University)

Demo-Group-D


FedSQ: A Secure System for Federated Vector Similarity Queries Zeqi Zhu (Beihang University)*; Zeheng Fan (Beihang University); Yuxiang Zeng (Beihang University); Yexuan Shi (Beihang University); Yi Xu (Beihang University); Mengmeng Zhou (Beijing Academy of Blockchain and Edge Computing); Jin Dong (Beijing Academy of Blockchain and Edge Computing)

FedSM: A Practical Federated Shared Mobility System Shuyue Wei (Beihang University)*; Yuanyuan Zhang (Beihang University); Zimu Zhou (City University of Hong Kong); Tianlong Zhang (Beihang University); Ke Xu (Beihang University)

DataLoom: Simplifying Data Loading with LLMs Alexander van Renen (UTN)*; Andreas Kipf (UTN); Mihail Stoian (UTN)

Demonstration of VCR: A Tabular Data Slicing Approach to Understanding Object Detection Model Performance Jie J Xu (Georgia Institute of Technology)*; Saahir Dhanani (Georgia Institute of Technology); Jorge H Piazentin Ono (Bosch Research North America); Wenbin He (Robert Bosch Research and Technology Center); Liu Ren (BOSCH Research North America); Kexin Rong (Georgia Institute of Technology)

ModsNet: Performance-aware Top-k Model Search using Exemplar Datasets Mengying Wang (Case Western Reserve University )*; Hanchao Ma (Case Western Reserve University); Sheng Guan (Case Western Reserve University); Yiyang Bian (Case Western Reserve Univerisity); Haolai Che (Case Western Reserve University); Abhishek A Daundkar (Case Western Reserve University); Alp Sehirlioglu (Case Western Reserve University); Yinghui Wu (Case Western Reserve University)

OFL-W3: A One-shot Federated Learning System on Web 3.0 Linshan Jiang (National University of Singapore)*; Moming Duan (National University of Singapore); Bingsheng He (National University of Singapore); Yulin Sun (Shanghai Jiao Tong University); Peishen Yan (Shanghai Jiao Tong University); Yang Hua (Queen's University Belfast); Tao Song (Shanghai Jiao Tong University)

Swift: A Data-Driven Flight Planning System at Scale Chang Gao (Beihang University); Tianlong Zhang (Beihang University); Yuxiang Zeng (Beihang University)*; Yi Xu (Beihang University); Shuyuan Li (Beihang University); Yuanyuan Zhang (Beihang University)

Pyneapple-G: Scalable Spatial Grouping Queries Laila Abdelhafeez (University of California, Riverside)*; Andres Calderon (University of California, Riverside); Amr Magdy (University of California Riverside); Vassilis J. Tsotras (UC Riverside)

PD-Explain: A Unified Python-native Framework for Query Explanations Over DataFrames Itay Elyashiv (Bar-Ilan University); Amir Gilad (The Hebrew University)*; Edna Isakov (Bar-Ilan University ); Tal Tikochinsky (Bar-Ilan University); Amit Somech (Bar-Ilan University)

HocoPG: A Database System with Homomorphic Compression for Text Processing Jiawei Guan (Renmin University of China)*; Feng Zhang (Renmin University of China); Yuxin Tang (School of Information, Renmin University of China); Weitang Ye (Renmin University of China ); Xiaoyong Du (Renmin University of China)

Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs xinyang zhao (Tsinghua university); Xuanhe Zhou (Tsinghua); Guoliang Li (Tsinghua University)*

PrismX: A Single-Machine System for Querying Big Graphs Shuhao Liu (Shenzhen Institute of Computing Sciences)*; Yang Liu (Beihang University); Wenfei Fan (Univ. of Edinburgh )

TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis Zhiyu Liang (Harbin Institute of Technology); Chen Liang (Harbin Institute of Technology); Zheng Liang (Harbin Institute of Technology); Hongzhi Wang (Harbin Institute of Technology)*; Bo Zheng (CnosDB Inc.)

HSAP: A Human-in-the-loop Social Media-based Situation Awareness Platform Xiangmin Zhou (RMIT University)*; Chengkun He (RMIT University); xi chen (Soochow university); Yanchun Zhang (Victoria University)

DB-MAGS: Multi-Anomaly Data Generation System for Transactional Databases Yiqi Shen (East China Normal University)*; Miaodong Shen (East China Normal University); Sijia Li (East China Normal University ); Peng Cai (East China Normal University); Weiyuan Xu (Meituan); Li Kai (Meituan); jinlong cai (meituan)

QuoteInspector: Gaining Insight about Social Media Discussions Peizhi Wu (University of Pennsylvania)*; Yi Zhang (AWS AI Labs); Wang-Chiew Tan (Meta); Zack Ives (University of Pennsylvania)