Proceedings of Workshops at the 50th International Conference on Very Large Data Bases (VLDB 2024)
VLDBW 2024
VLDB 2024 Workshop Chairs
Themis Palpanas, Philippe Bonnet
VLDB 2024 Workshop Proceedings Chair
Haixun Wang
Accepted Workshops
ADMS: 15th Workshop on Accelerating Analytics and Data Management Systems using Modern Processor and Storage Architectures
- Workshop Chairs: Rajesh Bordawekar and Tirthankar Lahiri
BigVis: 7th International Workshop on Big Data Visual Exploration and Analytics
- Workshop Chairs: Nikos Bikakis, Panos K. Chrysanthis, Guoliang Li, George Papastefanatos
DATAI: 1st International Workshop on Data-driven AI
- Workshop Chairs: Hongzhi Wang, Nan Tang, Lei Cao, Chengliang Chai, Xiaoou Ding
FAB: 6th International Workshop on Foundations and Applications of Blockchain
- Workshop Chairs: Mohammad Javad Amiri
LLM+KG: 1st International Workshop on Data Management Opportunities in Unifying Large Language Models + Knowledge Graphs
- Workshop Chairs: Arijit Khan, Tianxing Wu, Xi Chen
QDB: 13th International Workshop on Quality in Databases
- Workshop Chairs: Lisa Ehrlinger, Hazar Harmouch, Sourav S Bhowmick
QDSM: 2nd International Workshop on Quantum Data Science and Management
- Workshop Chairs: Sven Groppe, Jiaheng Lu, Wolfgang Mauerer, Le Gruenwald
TaDA: 2nd International Workshop on Tabular Data Analysis
- Workshop Chairs: Vasilis Efthymiou, Sainyam Galhotra, Oktie Hassanzadeh, Chuan Lei, Kavitha Srinivas
LSGDA: 3rd International Workshop on Large-Scale Graph Data Analytics
- Workshop Chairs: Wenjie Zhang, Lu Qin, Ying Zhang, Long Yuan, Zhengyi Yang
CloudDB: 2nd Workshop on Cloud Databases
- Workshop Chairs: Jiannan Wang, Kai Zeng, Guoliang Li, and Sanjay Krishnan
ADMS
15th Workshop on Accelerating Analytics and Data Management Systems using Modern Processor and Storage Architectures
Bandwidth Expansion via CXL: A Pathway to Accelerating In-Memory Analytical Processing
Wentao Huang, Mo Sha, Mian Lu, Yuqiang Chen, Bingsheng He, Kian-Lee Tan
Can Delta Compete with Frame-of-Reference for Lightweight Integer Compression?
Julia Spindler, Philipp Fent, Adrian Riedl, Thomas Neumann
Optimizing Sorting for Chiplet-Based CPUs
Alessandro Fogli, Peter Pletzuch, Jana Giceva
Ghostwriter: a Distributed Message Broker on RDMA and NVM
Hendrik Makait, Bonaventura Del Monte, Tilmann Rabl
BigVis
7th International Workshop on Big Data Visual Exploration and Analytics
Techniques for interactive visual examination of autonomous vessel performance
Natalia Andrienko, Gennady Andrienko, Dimitris Zissis, Alexandros Troupiotis-Kapeliaris and Giannis Spiliopoulos
A Unified Visual Exploration Framework for (Semi-)structured Data
Théo Bouganim, Ioana Manolescu and Emmanuel Pietriga
QPV: An Input Control Component For Progressive Visualization Analytics
Xin Zhang and Ahmed Eldawy
InterpretStack: Interpretable Exploration and Interactive Visualization Construction of Stacking Algorithm
Yu Wang, Jing Lu, Le Liu, Junping Zhang and Siming Chen
Vizard: Improving Visual Data Literacy with Large Language Models
Rubab Zahra Sarfraz and Samar Haider
Tangible Progress: Employing Visual Metaphors and Physical Interfaces in AI-based English Language Learning
Mei Wang, Hai-Ning Liang, Yu Liu, Chengtao Ji and Lingyun Yu
EvalGPT: A Visual Analytic Framework for Enhancing Trust in Large Language Models
Xu Yang, Yiheng Liang, Le Liu, Lianwei Wu and Xiaoru Yuan
Partial Adaptive Indexing for Approximate Query Answering
Stavros Maroulis, Nikos Bikakis, Vasileios Stamatopoulos and George Papastefanatos
Enhancing Geographic Information Visualization: A Comparative Analysis of Digital Maps and Projection Augmented Relief Maps
Chang Yuan Lang Teng, Zhiwei Shi, Lingyun Yu and Yu Liu
DATAI
1st International Workshop on Data-driven AI
Addressing Data Management Challenges for Interoperable Data Science
Ilin Tolovski, Tilmann Rabl
Missing Value Imputation via Pre-trained Language Models with Trainable Prompt and Retrieval Augmentation
Xiang Huang, Shuang Hao
LLM-assisted Labeling Function Generation for Semantic Type Detection
Chenjie Li, Dan Zhang, Jin Wang
Approximate Functional Dependencies Discovery Using Markov Blanket
Jinqi Liu, Anzhen Zhang, Jiajia Li, Na Guo, Jing Zhang
FAB
6th International Workshop on Foundations and Applications of Blockchain
Proceedings of the Sixth International Workshop on Foundations and Applications of Blockchain (FAB)
Mohammad Javad Amiri
Benefits and Challenges of Decentralization in Data Systems: Opportunities for Data Management Research
Ruben Mayer
Practical Declarative Smart Contracts Optimization
Lan Lu, Tao Luo, Jingyi Li, Hongxun Ding, Brendan Massey, Haoxian Chen, Boon Thau Loo
CroCRPC: Cross-Chain Remote Procedure Calls Framework for dApps
Avishek De, Divyakant Agrawal, Amr El Abbadi
From On-chain to Macro: Assessing the Importance of Data Source Diversity in Cryptocurrency Market Forecasting
Giorgos Demosthenous, Chryssis Georgiou, Eliada Polydorou
SOK: Blockchain for Provenance
Asma Jodeiri Akbarfam, Hoda Maleki
LLM+KG
1st International Workshop on Data Management Opportunities in Unifying Large Language Models + Knowledge Graphs
LLM+KG: Data Management Opportunities in Unifying Large Language Models + Knowledge Graphs
Arijit Khan, Tianxing Wu, Xi Chen
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System
Ningyu Zhang, Zekun Xi, Yujie Luo, Peng Wang, Bozhong Tian, Yunzhi Yao, Jintian Zhang, Shumin Deng, Mengshu Sun, Lei Liang, Zhiqiang Zhang, Xiaowei Zhu, Jun Zhou, Huajun Chen
Knowledge Graph Efficient Construction: Embedding Chain-of-Thought into LLMs
Jixuan Nie, Xia Hou, Wenfeng Song, Xuan Wang, Xingliang Jin, Xinyu Zhang, ShuoZhe Zhang, Jiaqi Shi
Benchmarking and Analyzing In-context Learning, Fine-tuning and Supervised Learning for Biomedical Knowledge Curation: a focused study on chemical entities of biological interest
Yusuf Abdulle, Emily Groves, Minhong Wang, Holger Kunz, Jason Hoelscher-Obermaier, Ronin Wu, Honghan Wu
Leveraging LLMs Few-shot Learning to Improve Instruction-driven Knowledge Graph Construction
Yongli Mou, Li Liu, Sulayman Sowe, Diego Collarana, Stefan Decker
Research Trends for the Interplay between Large Language Models and Knowledge Graphs
Hanieh Khorashadizadeh
Enhancing Large Language Models with Multimodality and Knowledge Graphs for Hallucination-free Open-set Object Recognition
Xinfu Liu, Wu Yirui, Yuting Zhou, Junyang Chen, Huan Wang, Ye Liu, Shaohua Wan
SPIREX: Improving LLM-based relation extraction from RNA-focused scientific literature using graph machine learning
Emanuele Cavalleri, Mauricio Soto-Gomez, Ali Pashaeibarough, Dario Malchiodi, Harry Caufield, Justin Reese, Chris Mungall, Peter Robinson, Elena Casiraghi, Giorgio Valentini, Marco Mesiti
InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration
Fali Wang, Runxue Bao, Suhang Wang, Wenchao Yu, Yanchi Liu, Wei Cheng, Haifeng Chen
From Instructions to ODRL Usage Policies: An Ontology Guided Approach
Daham M. Mustafa, Abhishek Nadgeri, Diego Collarana, Benedikt T. Arnold, Christoph Quix, Christoph Lange, Stefan Decker
QDB
13th International Workshop on Quality in Databases
13th International Workshop on Quality in Databases: Preface
Sourav S Bhowmick, Lisa Ehrlinger, Hazar Harmouch
Accelerating the Data Cleaning Systems Raha and Baran through Task and Data Parallelism
Fatemeh Ahmadi, Yusuf Mandirali, Ziawasch Abedjan
Towards Semi-Supervised Data Quality Detection In Graphs
Rubab Zahra Sarfraz
Valuation-based Data Acquisition for Machine Learning Fairness
Ekta Pradhan, Romila Pradhan
AutoFAIR: Automatic Data FAIRification via Machine Reading
Tingyan Ma, Wei Liu, Bin Lu, Xiaoying Gan, Yunqiang Zhu, Luoyi Fu, Chenghu Zhou
Compute Engine Testing with Privacy-Compliant Production-Like Synthetic Data
Yu Liu, Jiangnan Cheng, Steve Chuck, Lyublena Antova, Yurgis Baykshtis, Matt David, Ge Gao, Mehrdad Honarkhah, Kuan-Sung Huang, Chen-Kuei Lee, Usman Muhammad, Shihao Peng, Andrii Rosa, Rebecca Schlussel, Michael Shang, Kelvin Silva, Brandon Vo, Zac Wen, Yihao Zhou
Process Model-based Access Control Policies for Cross-Organizational Data Sharing
Liam Tirpitz, Leon Gentges
Tracking Consistency over Data Streams with InkStream [Demo]
Samuele Langhi, Angela Bonifati, Riccardo Tommasini
A Data Generator to Explore the Interactions Between Concept Drifts and Anomalies [Demo]
Jongjun Park, Akanksha Nehete, Tammy Zeng, Fei Chiang
QDSM
2nd International Workshop on Quantum Data Science and Management
Workshop Summary of the Second International Workshop on Quantum Data Science and Management (QDSM)
Valter Uotila, Sven Groppe, Le Gruenwald, Jiaheng Lu, Wolfgang Mauerer
Quantum Storage Design for Tables in RDBMS
Tuodu Li, Gongsheng Yuan, Chang Yao, Meng Shi, Ziyue Wang, Ling Qian and Jiaheng Lu
Supervised Learning on Relational Databases with Quantum Graph Neural Networks
Martin Vogrin, Rok Vogrin, Sven Groppe and Jinghua Groppe
Quantum Data Structures for Enhanced Database Performance
Tim Littau, Ziyu Li and Rihan Hai
Graphs on Qubits: Demonstrating Three Graph Algorithms on Quantum Computers
Lauri Vuorenkoski and Valter Uotila
Is Quantum-Based SQL Query Execution Viable?
Manish Kesarwani and Jayant Haritsa
TaDA
2nd International Workshop on Tabular Data Analysis
2nd International Workshop on Tabular Data Analysis (TaDA)
Vasilis Efthymiou, Sainyam Galhotra, Oktie Hassanzadeh, Chuan Lei, Kavitha Srinivas
ALT-GEN: Benchmarking Table Union Search using Large Language Models
Koyena Pal, Aamod Khatiwada, Roee Shraga, Renée J. Miller
LLMs for Data Engineering on Enterprise Data
Jan-Micha Bodensohn, Ulf Brackmann, Liane Vogel, Matthias Urban, Anupam Sanghi, Carsten Binnig
Fast and Accurate Regional Effect Plots for Automated Tabular Data Analysis
Vasilis Gkolemis, Theodore Dalamagas, Eirini Ntoutsi, Christos Diou
Transform Table to Database Using Large Language Models
Zezhou Huang, Jia Guo, Eugene Wu
GFS: Graph-based Feature Synthesis for Prediction over Relational Databases
Han Zhang, Quan Gan, David Wipf, Weinan Zhang
Schema Matching with Large Language Models: an Experimental Study
Marcel Parciak, Brecht Vandevoort, Frank Neven, Liesbet M. Peeters, Stijn Vansummeren
Finding Support for Tabular LLM Outputs
Grace Fan, Roee Shraga, Renée J. Miller
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs
Minjie Wang, Quan Gan, David Wipf, Zhenkun Cai, Ning Li, Jianheng Tang, Yanlin Zhang, Zizhao Zhang, Zunyao Mao, Yakun Song, Yanbo Wang, Jiahang Li, Han Zhang, Guang Yang, Xiao Qin, Chuan Lei, Muhan Zhang, Weinan Zhang, Christos Faloutsos, Zheng Zhang
Large Language Models as Data Preprocessors
Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada
DEMA: Enhancing Causal Analysis through Data Enrichment and Discovery in Data Lakes
Kayvon Heravi, Saathvik Dirisala, Babak Salimi
Data Quality Management for Responsible AI in Data Lakes
Carolina Cortes, Camila Sanz, Lorena Etcheverry, Adriana Marotta
Humboldt: Metadata-Driven Extensible Data Discovery
Alex Bäuerle, Çağatay Demiralp, Michael Stonebraker
LSGDA
3rd International Workshop on Large-Scale Graph Data Analytics
Report on the 3rd International Workshop on Large-Scale Graph Data Analytics (LSGDA 2024)
Long Yuan, Zhengyi Yang, Qingqiang Sun, Alexander Zhou
XCrowd: Real-Time Dynamic Crowd Movement Simulation on Graph Networks
Jan Appel, Andreas Weiler
MRG-SER: Self-supervised Spatial Entity Resolution Based on Multi-Relational Graph
Hanchen Qiu, Haojia Zhu, Zhicheng Li, Jiahui Jin
Enhancing Neo4j Query Efficiency with Seamless Integration of the GOpt Optimization Framework
Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Yongfei Liu
Designing Graph Neural Networks in Compliance with the European Artificial Intelligence Act
Barbara Hoffmann, Jana Vatter, Ruben Mayer
Size Does (Not) Matter? Sparsification and Graph Neural Network Sampling for Large-scale Graphs
Jana Vatter, Maurice L. Rochau, Ruben Mayer, Hans-Arno Jacobsen
HyperFedNet: Communication-Efficient Personalized Federated Learning Via Hypernetwork
Xingyun Chen, Yan Huang, Zhenzhen Xie, Junjie Pang
Parallel Higher-order Truss Decomposition
Chen Chen, Jingya Qian, Hui Luo, Yongye Li, Xiaoyang Wang
Text to Graph Query Using Filter Condition Attributes
Yang Liu, Xin Wang, Jiake Ge, Hui Wang, Dawei Xu, Yongzhe Jia
CloudDB
2nd Workshop on Cloud Databases
Corra: Correlation-Aware Column Compression
Hanwen Liu, Mihail Stoian, Alexander van Renen, Andreas Kipf
MetaHive: A Cache-Optimized Metadata Management for Heterogeneous Key-Value Stores
Alireza Heidari, Amirhossein Ahmadi, Zefeng Zhi, Wei Zhang