Affiliation:
1. Department of Computer Science and Engineering, Southern University of Science and Technology
2. ByteDance
Abstract
In this work, we focus on the problem of benchmarking the performance of storage services in cloud-native database systems, which are widely used in cloud applications. The core idea of these systems is to separate the computation and storage that are tightly coupled in traditional monolithic OLTP databases. Specifically, we first present the characteristics of two representative real I/O workloads at the storage tier of ByteDance's cloud-native database veDB. We then elaborate on the limitations of using standard benchmarks such as TPC-C and YCSB to reproduce these workloads. To overcome these limitations, we devise a learning-based I/O workload benchmark called CDSBen. We demonstrate the superiority of CDSBen by deploying it at ByteDance and showing that its generated I/O traces accurately resemble the real I/O traces in production. Additionally, we verify the accuracy and flexibility of CDSBen by generating a wide range of I/O workloads with different I/O characteristics.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences, Water Science and Technology, Geography, Planning and Development