
Volume 18, No. 12

From Scale-Up to Scale-Out: PolarDB’s Journey to Achieving 2 Billion tpmC

Authors:
Xinjun Yang, Feifei Li, Yingqiang Zhang, Hao Chen, Qingda Hu, Panfeng Zhou, Qiang Zhang, Shuai Li, Zongzhi Chen, Zheyu Miao, Rongbiao Xie, Chuan Sun, Zetao Wei, Jing Fang, Xingxuan Zhou, Xiaofei Wu

Abstract

In the past decade, cloud databases have experienced rapid development and growth. PolarDB, Alibaba’s cloud-native OLTP database, has evolved significantly to meet the increasing demand for cloud-native architectures and now serves hundreds of thousands of customers across various industries. This paper presents PolarDB’s evolution over the past eight years, with a focus on scalability, performance, and cost-efficiency. Initially, PolarDB adopted a primary-replica architecture based on disaggregated storage, with an emphasis on enhancing single-node performance for scale-up in modern many-core systems. To achieve this, we co-designed PolarDB with cutting-edge hardware, including RDMA, to improve performance. Meanwhile, we refined the internal architecture, including improvements to B+ tree concurrency control and transaction management, ensuring high scalability in scale-up scenarios. More recently, our focus has shifted to scaling out PolarDB to meet the performance and scalability needs of ultra-large-scale applications. By leveraging RDMA, we optimized distributed transaction processing, transforming PolarDB into a high-performance, high-scalability, and cost-effective distributed database. In the TPC-C benchmark, PolarDB scaled out to 2340 nodes and achieved over 2 billion tpmC, with a jitter rate of no more than 0.16% during the 8-hour stress test. Compared to the second- and third-highest-performing databases in public TPC-C results, PolarDB’s tpmC is 2.52× and 2.91× higher, respectively. In terms of cost-effectiveness, PolarDB’s per-tpmC cost is 37% and 79.5% lower than that of the other two systems, respectively.

PVLDB is part of the VLDB Endowment Inc.
