CockroachDB: The Resilient Geo-Distributed SQL Database

Rebecca Taft,Irfan Sharif,Andrei Matei,Nathan VanBenschoten,Jordan Lewis,Tobias Grieger,Kai Niemi,Andy Woods,Anne Birzin,Raphael Poss,Paul Bardea,Amruta Ranade,Ben Darnell,Bram Gruneir,Justin Jaffray,Lucy Zhang,Peter Mattis
DOI: https://doi.org/10.1145/3318464.3386134
2020-06-11
Abstract:We live in an increasingly interconnected world, with many organizations operating across countries or even continents. To serve their global user base, organizations are replacing their legacy DBMSs with cloud-based systems capable of scaling OLTP workloads to millions of users. CockroachDB is a scalable SQL DBMS that was built from the ground up to support these global OLTP workloads while maintaining high availability and strong consistency. Just like its namesake, CockroachDB is resilient to disasters through replication and automatic recovery mechanisms. This paper presents the design of CockroachDB and its novel transaction model that supports consistent geo-distributed transactions on commodity hardware. We describe how CockroachDB replicates and distributes data to achieve fault tolerance and high performance, as well as how its distributed SQL layer automatically scales with the size of the database cluster while providing the standard SQL interface that users expect. Finally, we present a comprehensive performance evaluation and share a couple of case studies of CockroachDB users. We conclude by describing lessons learned while building CockroachDB over the last five years.
What problem does this paper attempt to address?