VLDB 2021: Industrial Track Papers

All times are given in Copenhagen local time at the time of the conference (CEST).


09:00 – 10:15 CEST Industrial Session 1: AI and ML Meets DB

openGauss: An Autonomous Database System [Download Paper] Guoliang Li (Tsinghua University), Xuanhe Zhou (Tsinghua University), Ji Sun (Tsinghua University), Xiang Yu (Tsinghua University), Yue Han (Tsinghua University), Lianyuan Jin (Tsinghua University), Wenbo Li (Tsinghua University), Tianqing Wang (Huawei), Shifu Li (Huawei)

Towards A Polyglot Framework for Factorized ML [Download Paper] David A Justo (UC San Diego), Shaoqing Yi (UC San Diego), Lukas Stadler (Oracle Labs), Nadia Polikarpova (University of California, San Diego), Arun Kumar (University of California, San Diego)

tf.data: A Machine Learning Data Processing Framework [Download Paper] Derek Murray (Microsoft), Jiri Simsa (Google), Ana Klimovic (ETH Zurich), Ihor Indyk (Google)

SpeakNav: Voice-based Route Description Language Understanding for Template Driven Path Search [Download Paper] Bolong Zheng (Huazhong University of Science and Technology), Lei Bi (Huazhong University of Science and Technology), Juan Cao (Huazhong University of Science and Technology), Hua Chai (Didi Chuxing), Jun Fang (Didi Chuxing), Lu Chen (Zhejiang University), Yunjun Gao (Zhejiang University), Xiaofang Zhou (The Hong Kong University of Science and Technology), Christian S Jensen (Aalborg University)

Mixer: Efficiently Understanding and Retrieving Visual Content at Web-Scale [Download Paper] An Qin (Baidu Inc.), Mengbai Xiao (Shandong University), Yongwei Wu (Baidu Inc.), Xinjie Huang (Baidu Inc.), Xiaodong Zhang (Ohio State U.)


09:00 – 10:15 CEST Industrial Session 2: Databases for Data Warehouse and Invited Talk

Not Black-Box Anymore! Enabling Analytics-Aware Optimizations in Teradata Vantage [Download Paper] Mohamed Eltabakh (Teradata), Anantha Subramanian (Teradata Labs), Awny AlOmari (Teradata Labs), Mohammed Al-kateb (Teradata), Sanjay Nair (Teradata), Mahbub Hasan (Teradata Labs), Wellington Cabrera (Teradata Labs), Charles Zhang (Teradata Labs), Amit Kishore (Teradata Labs), Snigdha Prasad (Teradata Labs)

Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google [Download Paper] Ankur Agiwal (Google Inc), Kevin Lai (Google Inc), Gokul Nath Babu Manoharan (Google), Indrajit Roy (Google Inc), Jagan Sankaranarayanan (Google), Hao Zhang (Google Inc), Tao Zou (Google Inc), Jim Chen (Google Inc), Min Chen (Google Inc), Ming Dai (Google Inc), Thanh Do (Google, LLC), Haoyu Gao (Google Inc), Haoyan Geng (Google Inc), Raman Grover (Google Inc), Bo Huang (Google Inc), Yanlai Huang (Google Inc), Adam Li (Google Inc), Jianyi Liang (Google Inc), Tao Lin (Google Inc), Li Liu (Google Inc), Yao Liu (Google Inc), Xi Mao (Google Inc), Maya Meng (Google Inc), Prashant Mishra (Google Inc), Jay Patel (Google Inc), Rajesh SR (Google Inc), Vijayshankar Raman (Google), Sourashis Roy (Google Inc), Mayank Singh Shishodia (Google Inc), Tianhang Sun (Google Inc), Justin Tang (Google Inc), Jun Tatemura (Google), Sagar Trehan (Google Inc), Ramkumar Vadali (Google Inc), Prasanna Venkatasubramanian (Google Inc), Joey Zhang (Google Inc), Kefei Zhang (Google Inc), Yupu Zhang (Google Inc), Zeleng Zhuang (Google Inc), Goetz Graefe (Google), Divy Agrawal (Google), Jeff Naughton (Google), Sujata Kosalge (Google Inc), Hakan Hacigumus (Google)

Invited Talk - The evolution of Amazon Redshift [Download Paper] Ippokratis Pandis (Amazon Web Services)

16:15 – 17:45 CEST Industrial Session 3: Indexing, Transaction, and Hardware-Software Co-Design

The End of Moore's Law and the Rise of The Data Processor [Download Paper] Niv Dayan (Pliops), Yuval Rochman (Pliops), Iddo Naiss (Pliops), Shmuel Dashevsky (Pliops), Noam Rabinovich (Pliops), Edward Bortnikov (Pliops), Igal Maly (Pliops), Ofer Frishman (Pliops), Itai Ben Zion (Pliops), Avraham (Poza) Meir (Pliops), Moshe Twitto (Pliops), Uri Beitler (Pliops), Evgeni Ginzburg (Pliops), Mark Mokryn (Pliops)

The Art of Balance: A RateupDB Experience of Building a CPU/GPU Hybrid Database Product [Download Paper] Rubao Lee (Rateup Inc.), Minghong Zhou (Rateup Inc.), Chi Li (Rateup Inc.), Shenggang Hu (Rateup Inc.), Jianping Teng (Rateup Inc.), Dongyang Li (Rateup Inc.), Xiaodong Zhang (Ohio State U.)

RAMP-TAO: Layering Atomic Transactions on Facebook's Online TAO Data Store [Download Paper] Audrey Cheng (UC Berkeley), Xiao Shi (Facebook, Inc.), Lu Pan (Facebook, Inc.), Anthony Simpson (Facebook, Inc.), Neil Wheaton (Facebook, Inc.), Shilpa Lawande (Facebook, Inc.), Nathan Bronson (Rockset), Peter Bailis (Sisu Data), Natacha Crooks (UC Berkeley), Ion Stoica (UC Berkeley)

Hyperspace: The Indexing Subsystem of Azure Synapse [Download Paper] Rahul Potharaju (Microsoft), Terry Kim (Microsoft), Eunjin Song (Microsoft), Wentao Wu (Microsoft Research), Lev Novik (Microsoft), Apoorve Dave (Microsoft), Pouria Pirzadeh (Microsoft), Andrew Fogarty (Microsoft), Gurleen Dhody (Microsoft), Jiying Li (Microsoft), Vidip Acharya (Microsoft), Sinduja Ramanujam (Microsoft), Nico Bruno (Microsoft), Cesar Galindo-Legaria (Microsoft), Vivek Narasayya (Microsoft), Surajit Chaudhuri (Microsoft), Anil Nori (Microsoft), Tomas Talius (Microsoft), Raghu Ramakrishnan (Microsoft)

Big Metadata: When Metadata is Big Data [Download Paper] Pavan Edara (Google), Mosha Pasumansky (Google)


09:00 – 10:15 CEST Industrial Session 4: Databases for Streaming and Unstructured Data

Railgun: managing large streaming windows under MAD requirements [Download Paper] Ana Sofia Gomes (Feedzai), João Oliveirinha (Feedzai), Pedro Cardoso (Feedzai), Pedro Bizarro (Feedzai)

Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile [Download Paper] Can Gencer (Hazelcast Inc.), Marko Topolnik (Hazelcast Inc.), Viliam Ďurina (Hazelcast Inc.), Emin Demirci (Hazelcast Inc.), Ensar Basri Kahveci (Hazelcast Inc.), Ali Gürbüz (Hazelcast Inc.), Jozsef Bartok (Hazelcast Inc.), Grzegorz Gierlach (Hazelcast Inc.), František Hartman (Hazelcast Inc.), Ufuk Yilmaz (Hazelcast Inc.), Ondřej Lukas (Hazelcast Inc.), Mehmet Doğan (Hazelcast Inc.), Mohamed Mandouh (Hazelcast Inc.), Marios Fragkoulis (TU Delft), Asterios Katsifodimos (TU Delft)

Watermarks in Stream Processing Systems: Semantics and Comparative Analysis of Apache Flink and Google Cloud Dataflow [Download Paper] Edmon Begoli (Oak Ridge National Laboratory), Tyler Akidau (Snowflake Inc.), Slava Chernyak (Google Inc.), Fabian Hueske (Ververica GmbH), Kathryn Knight (Oak Ridge National Laboratory), Kenneth Knowles (Google Inc.), Daniel Mills (Google Inc.), Dan Sotolongo (Snowflake Inc.)

Tanium Reveal: A Federated Search Engine for Querying Unstructured File Data on Large Enterprise Networks [Download Paper] Joshua F Stoddard (Tanium), Adam Mustafa (Tanium), Naveen Goela (Tanium)

Using VDMS to Index and Search 100M Images [Download Paper] Luis Remis (ApertureData), Chaunte W Lacewell (Intel Corporation)

11:00 – 12:15 CEST Industrial Session 5: Data-Intensive Computing

The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward [Download Paper] Conor Power (Microsoft), Hiren Patel (Microsoft), Alekh Jindal (Microsoft), Jyoti Leeka (Microsoft), Bob Jenkins (Microsoft), Michael Rys (Microsoft), Ed Triou (Microsoft), Dexin Zhu (Microsoft), Lucky Katahanas (Microsoft), Chakrapani Bhat Talapady (Microsoft), Josh Rowe (Microsoft), Fan Zhang (Microsoft), Rich Draves (Microsoft), Ivan Santa (Microsoft), Amrish Kumar (Microsoft)

SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft [Download Paper] Abhishek Roy (Microsoft), Alekh Jindal (Microsoft), Priyanka Gomatam (Microsoft), Xiating Ouyang (University of Wisconsin-Madison), Ashit Gosalia (Microsoft), Nishkam Ravi (Microsoft), Swinky Mann (Microsoft), Prakhar Jain (Microsoft)

Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters [Download Paper] Yingda Chen (Alibaba Group), Jiamang Wang (Alibaba), Yifeng Lu (Alibaba Group), Ying Han (Alibaba Group), Zhiqiang Lv (Alibaba Group), Xuebin Min (Alibaba Group), Hua Cai (Alibaba Group), Wei Zhang (Alibaba Group), Haochuan Fan (Alibaba Group), Chao Li (Alibaba Group), Tao Guan (Alibaba Group), Wei Lin (Alibaba Group), Yangqing Jia (Alibaba Group), Jingren Zhou (Alibaba Group)

Davos: A System for Interactive Data-Driven Decision Making [Download Paper] Zeyuan Shang (Einblick Analytics), Emanuel Zgraggen (Einblick Analytics), Benedetto Buratti (Einblick Analytics), Philipp Eichmann (Einblick Analytics), Navid Karimeddiny (Einblick Analytics), Charlie Meyer (Einblick Analytics), Wesley Runnels (Einblick Analytics), Tim Kraska (Einblick Analytics)

GraphScope: A Unified Engine For Big Graph Processing [Download Paper] Wenfei Fan (Alibaba Group), Tao He (Alibaba Group), Longbin Lai (Alibaba Group), Xue Li (Alibaba Group), Yong Li (Alibaba Group), Zhao Li (Alibaba Group), Zhengping Qian (Alibaba Group), Chao Tian (Alibaba Group), Lei Wang (Alibaba Group), Jingbo Xu (Peking University & Alibaba Group), Youyang Yao (Alibaba Group), Qiang Yin (Alibaba Group), Wenyuan Yu (Alibaba Group), Kai Zeng (Alibaba Group), Kun Zhao (Alibaba Group), Jingren Zhou (Alibaba Group), Diwen Zhu (Alibaba), Rong Zhu (Alibaba Group)