
NoSQL Benchmark: MongoDB vs ScyllaDB (Part II)

In our previous study of MongoDB and ScyllaDB, we analyzed the technical characteristics of these two important NoSQL databases.

Now, we have added 133 performance measurement results in an in-depth benchmarking study of performance and scalability.

Let's dive into the extensive results!

Executive Summary

NoSQL databases promise to provide high performance and horizontal scalability for data-intensive workloads. In this report, we benchmark the performance and scalability of the market-leading general-purpose NoSQL database MongoDB and its performance-oriented challenger ScyllaDB.

  • The benchmarking study is carried out against the DBaaS offers of each database vendor to ensure a fair comparison.
  • The DBaaS clusters range from 3 to 18 nodes, classified in three scaling sizes that are comparably priced.
  • The benchmarking study comprises three workload types that cover read-heavy, read-update and write-heavy application domains with data set sizes from 250GB to 10TB.
  • The benchmarking study compares a total of 133 performance measurements that range from throughput (per cost) to latencies to scalability.

Results

ScyllaDB outperforms MongoDB in 132 of 133 measurements.

  • For all the applied workloads, ScyllaDB provides up to 20 times higher throughput than MongoDB.
  • Regarding the latency results, ScyllaDB achieves P99 latencies below 10 ms for insert, read and update operations for almost all scenarios. In contrast, MongoDB achieves P99 latencies below 10 ms only for certain read operations, while the MongoDB insert and update latencies are up to 68 times higher compared to ScyllaDB.
  • The scalability results demonstrate that ScyllaDB achieves up to near linear scalability, while MongoDB shows less efficient horizontal scalability.
  • The price-performance ratio clearly shows the strong advantage of ScyllaDB with up to 19 times better price-performance ratios, depending on the workload and data set size.

In summary, this benchmarking study shows that ScyllaDB provides a great solution for applications that operate on data sets in the terabyte range and that require high throughput (e.g., over 50,000 operations per second) while providing predictable low latency for read and write operations. It also needs to be considered that this study does not target advanced data models such as time series or complex operation types such as aggregates or scans, which are subject to future benchmark studies.

Introduction

The NoSQL database landscape is continuously evolving. Over the last 15 years, it has introduced many options and tradeoffs when it comes to selecting a high-performance and scalable NoSQL database. In this report, we address the challenge of selecting a high-performance database by evaluating two popular NoSQL databases: MongoDB, the market-leading general-purpose NoSQL database, and ScyllaDB, the high-performance NoSQL database for large-scale data. See our technical comparison article for an in-depth analysis of MongoDB’s and ScyllaDB’s data models, query languages and distributed architectures. In addition, ScyllaDB has its own page comparing MongoDB and ScyllaDB.

For this project, we benchmarked both database technologies to get a detailed picture of their performance, price-performance and scalability capabilities under different workloads. For creating the workloads, we use the Yahoo! Cloud Serving Benchmark (YCSB), an open source and industry standard benchmarking tool. Database benchmarking is often said to be non-transparent and to compare apples with oranges. To address these concerns, this benchmark comparison is based on benchANT’s scientifically validated Benchmarking-as-a-Service platform. The platform ensures a reproducible benchmark process (for more details, see the associated research papers on Mowgli and benchANT) which follows established guidelines for database benchmarking.

This benchmarking project was carried out by benchANT and sponsored by ScyllaDB with the goal of providing a fair, transparent and reproducible comparison of both database technologies. For this purpose, all benchmarks were carried out on the database vendors’ DBaaS offers, namely MongoDB Atlas and ScyllaDB Cloud, to ensure a comparable, production-ready database deployment. Further, the applied benchmarking tool was the standard Yahoo! Cloud Serving Benchmark, and all applied configuration options are exposed. In addition, all results are publicly available on GitHub. In consequence, interested readers are able to reproduce the results on their own, even without the benchANT platform.

Benchmark Setup

DBaaS Setup

As outlined in our technical comparison, MongoDB and ScyllaDB follow different distributed architecture approaches. Consequently, a fair comparison can only be achieved by selecting cluster sizes that are comparable in terms of costs per month or total compute power. Our comparison selects comparably priced cluster sizes with comparable instance types, with the goal of comparing the provided performance (throughput and latency) as well as scalability across three cluster sizes under growing workloads.

The following table describes the selected database cluster sizes, classified into the scaling sizes small, medium and large. All benchmarks are run on AWS in the us-west region and the prices are based on the us-west-1 (N. California) region; the DBaaS instances and the VMs running the benchmark instances are deployed in the same region. VPC peering is not activated for MongoDB or ScyllaDB. For MongoDB, all benchmarks were run against version 6.0; for ScyllaDB, against version 2022.2. The benchmarks were carried out between March and June 2023.

| DBaaS | Cluster Type | Version | Instance Type | Instance Specs | Total Data Nodes | Storage Capacity | Monthly Costs |
|---|---|---|---|---|---|---|---|
| MongoDB Atlas (small) | replica set | 6.0 | M60_NVME | 8 vCPU / 61 GB RAM | 3 | 1.6 TB | $3,880 |
| ScyllaDB Cloud (small) | n/a | 2022.2.0 | i4i.2xlarge | 8 vCPU / 64 GB RAM | 3 | 1.3 TB | $4,620 |
| MongoDB Atlas (medium) | sharded | 6.0 | M80_NVME | 16 vCPU / 122 GB RAM | 9 | 4.8 TB | $19,180 |
| ScyllaDB Cloud (medium) | n/a | 2022.2.0 | i4i.4xlarge | 16 vCPU / 128 GB RAM | 6 | 5.2 TB | $17,870 |
| MongoDB Atlas (large) | sharded | 6.0 | M200_NVME | 32 vCPU / 244 GB RAM | 18 | 18.6 TB | $69,120 |
| ScyllaDB Cloud (large) | n/a | 2022.2.0 | i4i.8xlarge | 32 vCPU / 256 GB RAM | 12 | 20.9 TB | $73,932 |

Workload Setup

In order to simulate realistic workloads, we use YCSB in the latest available version 0.18.0-SNAPSHOT from the original GitHub repository. Based on YCSB, we define three workloads that map to real world use cases. The key parameters of each workload are shown in the following table, and the full set of applied parameters is available in the GitHub repository.

| Workload | Distribution | Reads [%] | Updates [%] | Inserts [%] | Data Size small/medium/large [TB] | Record Size [KB] |
|---|---|---|---|---|---|---|
| caching (YCSB A) | uniform & hotspot | 50 | 50 | 0 | 0.5 / 1 / 10 | 1 |
| social (YCSB B) | uniform & hotspot | 95 | 5 | 0 | 0.5 / 1 | 1 |
| sensor | latest | 10 | 0 | 90 | 0.25 / 0.5 | 1 |

The caching and social workloads are executed with two different request patterns: The uniform request distribution simulates a close-to-zero cache hit ratio workload, and the hotspot distribution simulates an almost 100% cache hit workload.
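
The difference between the two request patterns can be pictured with a minimal Python sketch. The hot-set parameters below are illustrative values only, not the study's actual YCSB distribution settings (those are in the GitHub repository):

```python
import random

RECORD_COUNT = 500_000_000  # e.g., 500 GB of 1 KB records (small scaling size)
HOT_FRACTION = 0.01         # illustrative: 1% of the records form the hot set
HOT_OPS = 0.99              # illustrative: 99% of requests target the hot set

def uniform_key():
    # uniform: every record is equally likely -> close-to-zero cache hit ratio
    return random.randrange(RECORD_COUNT)

def hotspot_key():
    # hotspot: almost all requests hit a small hot set -> ~100% cache hits
    hot_set = int(RECORD_COUNT * HOT_FRACTION)
    if random.random() < HOT_OPS:
        return random.randrange(hot_set)
    return random.randrange(hot_set, RECORD_COUNT)
```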

All benchmarks are defined to run with a comparable client consistency. For MongoDB, the client consistency writeConcern=majority and readPreference=primary is applied. For ScyllaDB, writeConsistency=QUORUM and readConsistency=QUORUM are used. For more details on the client consistencies, see the technical nuggets below and our detailed comparison of the two databases. In addition, we have also analyzed the performance impact of weaker consistency settings for the social and caching workloads.
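
For illustration, here is a minimal sketch of how these consistency settings map onto the Python drivers (pymongo, and cassandra-driver, which also speaks to ScyllaDB). The benchmarks themselves used YCSB's own database bindings; connection strings and schema names below are placeholders:

```python
from pymongo import MongoClient, ReadPreference, WriteConcern
from cassandra import ConsistencyLevel
from cassandra.cluster import Cluster
from cassandra.query import SimpleStatement

# MongoDB: writeConcern=majority, readPreference=primary
mongo = MongoClient("mongodb+srv://user:pass@cluster.example.net")  # placeholder URI
collection = mongo["ycsb"]["usertable"].with_options(
    write_concern=WriteConcern(w="majority"),
    read_preference=ReadPreference.PRIMARY,
)

# ScyllaDB: writeConsistency=QUORUM, readConsistency=QUORUM
cluster = Cluster(["node1.example.net"])  # placeholder contact point
session = cluster.connect("ycsb")         # placeholder keyspace
read = SimpleStatement(
    "SELECT * FROM usertable WHERE y_id = %s",
    consistency_level=ConsistencyLevel.QUORUM,
)
row = session.execute(read, ("user42",)).one()

# Weaker read settings analyzed in the technical nuggets below:
#   MongoDB:  ReadPreference.SECONDARY_PREFERRED
#   ScyllaDB: ConsistencyLevel.ONE
```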

Benchmark Process

In order to achieve a realistic utilization of the benchmarked database, each workload is scaled with the target database cluster (i.e. small, medium and large). For this, the data set size, the number of YCSB instances and the number of client threads are scaled accordingly to achieve 70-80% load with a stable throughput on the target database cluster.

Each benchmark run is carried out by the benchANT platform, which handles the deployment of the required number of EC2 c5.4xlarge VMs (16 vCPUs, 32 GB RAM) to run the YCSB instances, the deployment of the DBaaS instances, and the orchestration of the LOAD and RUN phases of YCSB. After loading the data into the database, the cluster is given a 15-minute stabilization time before the RUN phase executes the actual workload. In addition, we configured one workload to run for 12 hours to ensure the benchmark results are also valid for long-running workloads.

For additional details on the benchmarking methodology (for example, how we identified the optimal throughput per database and scaling size), see “Benchmarking Process” in the Appendix.

Limitations of the Comparison

The goal of this benchmark comparison focuses on performance and scalability in relation to the costs. It is by no means an all-encompassing guide on when to select MongoDB or ScyllaDB. Yet, by combining the insights of the technical comparison with the results of this benchmark article, the reader gets a comprehensive decision base.

YCSB is a great benchmarking tool for relative performance comparisons. However, when it comes to measuring absolute latencies under steady throughput, it is affected by the coordinated omission problem. The latest release of YCSB introduces an approach to address this problem, yet during the benchmark dry runs it turned out that this feature is not working as expected (unrealistically high latencies were reported).

In the early (and also later) days of cloud computing, the performance of cloud environments was known to be volatile. This required experimenters to repeat experiments several times at different times of the day. Only then were they able to gather statistically meaningful results. Recent studies such as the one by Scheuner and Leitner show that this has changed. AWS provides particularly stable service quality. Due to that, all experiments presented here were executed once.

Benchmarking Results

In the following sections, we present the benchmark results per workload. Each workload results section contains the following subsections:

  • Benchmark characteristics provide a brief outline of the applied workload
  • Key insights provide a high level summary of the results
  • Throughput results compare the throughput per request distribution
  • Scalability results present the throughput scaling
  • Throughput per cost ratio provides analysis of costs per month in relation to the provided throughput
  • Latency results compare the 99th percentile of measured latencies
  • Technical nuggets analyze additional configuration options and how they impact the resulting performance. It needs to be highlighted that these measurements are not 1:1 comparable to the standard measurements presented earlier, because a simplified benchmark process was applied that differs in the following aspects: (i) we do not apply a target throughput but run YCSB with unlimited throughput and an identical number of threads for both databases; (ii) a single YCSB instance is applied.

Performance Results - Social Workload

The social workload is based on the YCSB Workload B and creates a read-heavy workload, with 95% read operations and 5% update operations. We use two shapes of this workload, which differ in terms of the request distribution patterns, namely uniform and hotspot distribution. These workloads are executed against the small database scaling size with a data set of 500GB and against the medium scaling size with a data set of 1TB.

MongoDB vs ScyllaDB - Key Insights - Social Workload

Throughput Results

The throughput results for the social workload with the uniform request distribution show that the small ScyllaDB cluster is able to serve 60 kOps/s with a cluster CPU utilization of ~85% while the small MongoDB cluster serves only 10 kOps/s under a comparable cluster utilization of 80-90%. For the medium cluster sizes, ScyllaDB achieves an average throughput of 232 kOps/s showing ~85% cluster utilization while MongoDB achieves 42 kOps/s at a CPU utilization of ~85%.

MongoDB vs ScyllaDB - Throughput - Social Uniform

The throughput results for the social workload with the hotspot request distribution show a similar trend, but with higher throughput numbers since the data is mostly read from the cache. The small ScyllaDB cluster serves 152 kOps/s while the small MongoDB serves 14 kOps/s. For the medium cluster sizes, ScyllaDB achieves an average throughput of 587 kOps/s and MongoDB achieves 48 kOps/s.

MongoDB vs ScyllaDB - Throughput - Social Hotspot

Scalability Results

These results also enable us to compare the theoretical throughput scalability with the actually achieved throughput scalability. For this, we consider a simplified scalability model that focuses on compute resources. It assumes the scalability factor is reflected by the increased compute capacity from the small to medium cluster size. For ScyllaDB, this means we double the cluster size from 3 to 6 nodes and also double the instance size from 8 cores to 16 cores per instance, resulting in a theoretical scalability of 400%. For MongoDB, we move from one replica set of three data nodes to a cluster with three shards and nine data nodes and increase the instance size from 8 cores to 16 cores, resulting in a theoretical scalability factor of 600%.
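
A quick Python sketch of this model, using the node and core counts from the DBaaS setup table and the social/uniform throughput numbers reported above, reproduces the stated factors:

```python
def theoretical_scaling(nodes_from, cores_from, nodes_to, cores_to):
    """Simplified compute-only scalability model: relative compute capacity in %."""
    return 100 * (nodes_to * cores_to) / (nodes_from * cores_from)

print(theoretical_scaling(3, 8, 6, 16))   # ScyllaDB small -> medium: 400.0 %
print(theoretical_scaling(3, 8, 9, 16))   # MongoDB small -> medium: 600.0 %

# Achieved scalability relates the measured throughputs instead,
# here for the social workload with uniform distribution:
print(100 * 232_000 / 60_000)             # ScyllaDB: ~386.7 % of the possible 400 %
print(100 * 42_000 / 10_000)              # MongoDB: 420.0 % of the possible 600 %
```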

The ScyllaDB scalability results for the uniform and hotspot distributions both show that ScyllaDB comes close to linear scalability, achieving a throughput scalability of 386% (of the theoretically possible 400%).

ScyllaDB - Throughput Scalability - Social Uniform
ScyllaDB - Throughput Scalability - Social Hotspot

With MongoDB, the gap between theoretical throughput scalability and the actually achieved throughput scalability is significantly higher. For the uniform distribution, MongoDB achieves a scaling factor of 420% (of the theoretically possible 600%). For the hotspot distribution, we measure 342% (of the theoretically possible 600%).

MongoDB - Throughput Scalability - Social Uniform
MongoDB - Throughput Scalability - Social Hotspot

Throughput per Cost Results

In order to compare the costs/month in relation to the provided throughput, we take the MongoDB Atlas throughput/$ as baseline (i.e. 100%) and compare it with the provided ScyllaDB Cloud throughput/$.

The results for the uniform distribution show that ScyllaDB provides five times more operations/$ compared to MongoDB Atlas for the small scaling size and 5.7 times more operations/$ for the medium scaling size.

For the hotspot distribution, the results show an even better throughput/cost ratio for ScyllaDB, providing 9 times more operations/$ for the small scaling size and 12.7 times more for the medium scaling size.

MongoDB vs ScyllaDB - Throughput/Cost - social uniform
MongoDB vs ScyllaDB - Throughput/Cost - social hotspot
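
As a back-of-the-envelope check, combining the measured average throughputs with the monthly prices from the DBaaS setup table reproduces the reported factor for the small scaling size:

```python
# Throughput per dollar, social workload, uniform distribution, small scaling size
mongodb = 10_000 / 3_880    # ~2.6 ops/s per $ of monthly cost (baseline, 100%)
scylladb = 60_000 / 4_620   # ~13.0 ops/s per $ of monthly cost

print(f"{scylladb / mongodb:.1f}x")  # ~5.0x -> the reported factor of five
```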

Latency Results

For the uniform distribution, ScyllaDB provides stable and low P99 latencies for the read and update operations for the scaling sizes small and medium. MongoDB generally has higher P99 latencies. Here, the read latencies are 2.8 times higher for the small scaling size and 5.5 times higher for the medium scaling size. The update latencies show an even more distinct difference; MongoDB’s P99 update latency in the small scaling size is 47 times higher compared to ScyllaDB and 12 times higher in the medium scaling size.

MongoDB vs ScyllaDB - P99 Latency - social uniform

For the hotspot distribution, the results show a similar trend of stable and low ScyllaDB latencies. For MongoDB, read and update latencies increase from the small to the medium scaling size. Interestingly, in contrast to the uniform distribution, the read latency increases only by a factor of 2.8 while the update latency increases by 970%.

MongoDB vs ScyllaDB - P99 Latency - social hotspot

Technical Nugget 1 - Data Model Performance Impact

The default YCSB data model is composed of a primary key and a data item with 10 fields of strings, which results in a document with 10 attributes for MongoDB and a table with 10 columns for ScyllaDB. We analyze how performance changes if a pure key-value data model is applied for both databases: a table with only one column for ScyllaDB and a document with only one field for MongoDB.
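
Sketched as MongoDB documents, the two data models look roughly like this (assuming YCSB's defaults of 10 fields with 100-byte values for the 1 KB record; the key is a made-up example):

```python
# Default YCSB data model: primary key plus 10 string fields (~1 KB in total)
default_record = {"_id": "user1000385178204227360"}
default_record.update({f"field{i}": "x" * 100 for i in range(10)})

# Pure key-value variant: a single field carrying the full 1 KB payload
key_value_record = {"_id": "user1000385178204227360", "field0": "x" * 1000}
```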

The results show that for ScyllaDB the throughput improves by 24% while for MongoDB the throughput increase is only 5%.

MongoDB vs ScyllaDB - Data Model Impact Throughput - social uniform

Technical Nugget 2 - Client Consistency Performance Impact

All standard benchmarks are run with the MongoDB client consistency writeConcern=majority/readPreference=primary and for ScyllaDB with writeConsistency=QUORUM/readConsistency=QUORUM. Besides these client consistency configurations, we also analyze the performance impact of weaker read consistency settings. For this, we enable MongoDB to read from the secondaries (readPreference=secondaryPreferred) and set readConsistency=ONE for ScyllaDB.

The results show an expected increase in throughput: for ScyllaDB 56% and for MongoDB 49%.

MongoDB vs ScyllaDB - Client Consistency Impact Throughput - social uniform

Performance Results - Caching Workload

The caching workload is based on the YCSB Workload A and creates a read-update workload, with 50% read operations and 50% update operations. The workload is executed in two versions, which differ in terms of the request distribution patterns (namely uniform and hotspot distribution). This workload is executed against the small database scaling size with a data set of 500GB, the medium scaling size with a data set of 1TB and a large scaling size with a data set of 10TB.

In addition to the regular benchmark runtime of 30 minutes, a long-running benchmark over 12 hours is executed.

MongoDB vs ScyllaDB - Key Insights - Caching Workload

Throughput Results

The throughput results for the caching workload with the uniform request distribution show that the small ScyllaDB cluster is able to serve 77 kOps/s with a cluster utilization of ~87% while the small MongoDB cluster serves only 5 kOps/s under a comparable cluster utilization of 80-90%. For the medium cluster sizes, ScyllaDB achieves an average throughput of 306 kOps/s at ~89% cluster utilization while MongoDB achieves 17 kOps/s. For the large cluster size, ScyllaDB achieves 894 kOps/s compared to MongoDB's 45 kOps/s.

Note that client-side errors occurred when inserting the initial 10TB on MongoDB large; as a result, only 5TB of the specified 10TB were inserted. However, this does not affect the results of the caching workload because the applied YCSB version only operates on the key range 1 - 2,147,483,647 (INTEGER_MAX_VALUE); for more details, see Support for Data Sets of >2.1 TB. This fact leads to an advantage for MongoDB, because MongoDB’s cache only had to deal with the 2,100,000,000 accessed records (i.e. 2.1TB) while ScyllaDB’s cache had to deal with the full 10,000,000,000 records (i.e. 10TB).
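
The 2.1 TB boundary follows directly from the key range and the 1 KB record size:

```python
INT_MAX = 2_147_483_647              # Java Integer.MAX_VALUE, YCSB's key range limit
RECORD_SIZE = 1_000                  # 1 KB records in this study

print(INT_MAX * RECORD_SIZE / 1e12)  # ~2.15 -> at most ~2.1 TB of addressable data
```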

MongoDB vs ScyllaDB - Throughput - Caching Uniform

The caching workload with the hotspot distribution is only executed against the small and medium scaling size. The throughput results for the hotspot request distribution show a similar trend, but with higher throughput numbers since the data is mostly read from the cache. The small ScyllaDB cluster serves 153 kOps/s while the small MongoDB only serves 8 kOps/s. For the medium cluster sizes, ScyllaDB achieves an average throughput of 559 kOps/s and MongoDB achieves 28 kOps/s.

MongoDB vs ScyllaDB - Throughput - Caching Hotspot

Scalability Results

Analogous to the social workload, the throughput results allow us to compare the theoretical throughput scalability with the actually achieved scalability. For ScyllaDB, the maximum theoretical scaling factor for throughput for the uniform distribution is 1600% when scaling from small to large. For MongoDB, the theoretical maximal throughput scaling factor is 2400% when scaling from small to large.
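
With the node and core counts from the DBaaS setup table, the same compute-only model as in the social workload section yields these factors:

```python
# Compute-only scaling model: (nodes_to * cores_to) / (nodes_from * cores_from)
print(100 * (12 * 32) / (3 * 8))  # ScyllaDB small -> large: 1600.0 %
print(100 * (18 * 32) / (3 * 8))  # MongoDB small -> large: 2400.0 %
```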

The ScyllaDB scalability results show that scaling from small to medium comes very close to linear scalability, achieving a throughput scalability of 397% of the theoretically possible 400%. Considering the maximal scaling factor from small to large, ScyllaDB achieves 1161% of the theoretical 1600%.

For the hotspot distribution, the small and medium cluster sizes are benchmarked. ScyllaDB achieves a throughput scalability of 365% of the theoretical 400%.

ScyllaDB - Throughput Scalability - Caching Uniform
ScyllaDB - Throughput Scalability - Caching Hotspot

The MongoDB scalability results for the uniform distribution show that MongoDB scaled from small to medium achieves a throughput scalability of 340% of the theoretical 600%. Considering the maximal scaling factor from small to large, MongoDB achieves only 900% of the theoretically possible 2400%.

MongoDB achieves a throughput scalability of 350% of the theoretical 600% for the hotspot distribution.

MongoDB - Throughput Scalability - Caching Uniform
MongoDB - Throughput Scalability - Caching Hotspot

Throughput per Cost Results

In order to compare the costs/month in relation to the provided throughput, we take the MongoDB Atlas throughput/$ as baseline (i.e. 100%) and compare it with the provided ScyllaDB Cloud throughput/$.

The results for the uniform distribution show that ScyllaDB provides 12 times more operations/$ compared to MongoDB Atlas for the small scaling size and 18 times more operations/$ for the scaling sizes medium and large.

MongoDB vs ScyllaDB - Throughput Cost small - caching uniform

For the hotspot distribution, the results show a similar trend, with ScyllaDB providing 16 times more operations/$ for the small scaling size and 20 times more for the medium scaling size.

MongoDB vs ScyllaDB - Throughput Cost - caching hotspot

Latency Results

The P99 latency results for the uniform distribution show that ScyllaDB and MongoDB provide stable P99 read latencies. Yet, the values for ScyllaDB are consistently lower than the MongoDB latencies. An additional insight is that the ScyllaDB read latency doubles from medium to large (from 8.1 ms to 16.1 ms), while the MongoDB read latency decreases by 1 ms (from 23.3 ms to 22.3 ms) but remains well above ScyllaDB's.

For the update latencies, the results show a similar trend as for the social workload, where ScyllaDB provides stable and low update latencies while MongoDB provides up to 73 times higher update latencies.

For the hotspot distribution, the results show a similar trend as for the uniform distribution. Both databases provide stable read latencies for the small and medium scaling size, with ScyllaDB providing the lower latencies.

For updates, the ScyllaDB latencies are stable across the scaling sizes and slightly lower than for the uniform distribution. Compared to ScyllaDB, the MongoDB update latencies are 25 times higher for the small scaling size and 44 times higher for the medium scaling size.

MongoDB vs ScyllaDB - P99 Latency - caching uniform
MongoDB vs ScyllaDB - P99 Latency - caching hotspot

Technical Nugget 1 - 12h Run

In addition to the default 30 minute benchmark run, we also select the scaling size large with the uniform distribution for a long-running benchmark of 12 hours.

For MongoDB, we use the previously determined 8 YCSB instances with 100 threads per YCSB instance and run the caching workload in uniform distribution for 12 hours with a target throughput of 40 kOps/s.

The throughput results show that MongoDB provides the 40 kOps/s constantly over time as expected.

MongoDB - Throughput - 12h run

The P99 read latencies over the 12 hours show some peaks that reach 20 ms and 30 ms, and an increase in spikes after 4 hours of runtime. On average, the P99 read latency for the 12h run is 8.7 ms; for the regular 30-minute run, it is 5.7 ms.

The P99 update latencies show a spiky pattern over the entire 12 hours with peak latencies of 400 ms. On average, the P99 update latency for the 12h run is 163.8 ms, while for the regular 30-minute run it is 35.7 ms (see also Latency Results).

MongoDB - Read Latency 99th - 12h run
MongoDB - Update Latency 99th - 12h run

For ScyllaDB, we use the previously determined 16 YCSB instances with 200 threads per YCSB instance and run the caching workload in uniform distribution for 12 hours with a target throughput of 500 kOps/s.

The throughput results show that ScyllaDB provides the 500 kOps/s constantly over time as expected.

ScyllaDB - Throughput - 12h run

The P99 read latencies over the 12 hours stay consistently below 10 ms except for one peak of 12 ms. On average, the P99 read latency for the 12h run is 7.8 ms.

The P99 update latencies over the 12 hours show a stable pattern over the entire 12 hours with an average P99 latency of 3.9 ms.

ScyllaDB - Read Latency 99th - 12h run
ScyllaDB - Update Latency 99th - 12h run

Technical Nugget 2 - Insert Performance

In addition to the three defined workloads, we also measured the plain insert performance for the small scaling size (500 GB), medium scaling size (1 TB) and large scaling size (10 TB) into MongoDB and ScyllaDB. It needs to be emphasized that batch inserts were enabled for MongoDB but not for ScyllaDB (since YCSB does not support it for ScyllaDB).
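
To illustrate what batching changes, here is a small pymongo sketch (reusing the collection handle from the consistency example above; this is illustrative, not the YCSB code path):

```python
# Batched: one bulk write carries many documents per network round trip
batch = [{"_id": f"user{i}", "field0": "x" * 1000} for i in range(1_000)]
collection.insert_many(batch, ordered=False)

# Unbatched (the ScyllaDB-side situation in YCSB): one round trip per record
singles = [{"_id": f"user{i}", "field0": "x" * 1000} for i in range(1_000, 2_000)]
for doc in singles:
    collection.insert_one(doc)
```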

The following results show that for the small scaling size, the achieved insert throughput is at a comparable level. For the larger data sets, however, ScyllaDB achieves a 3 times higher insert throughput for the medium scaling size. For the large scaling size, MongoDB was not able to ingest the full 10TB of data due to client-side errors, resulting in only 5TB of inserted data (for more details, see Throughput Results); here, ScyllaDB outperforms MongoDB by a factor of 5.

MongoDB vs ScyllaDB - Insert Throughput - caching uniform

Technical Nugget 3 - Client Consistency Performance Impact

In addition to the standard benchmark configurations, we also run the caching workload in the uniform distribution with weaker consistency settings. Namely, we enable MongoDB to read from the secondaries (readPreference=secondaryPreferred) and for ScyllaDB we set readConsistency=ONE.

The results show an expected increase in throughput: for ScyllaDB 23% and for MongoDB 14%. This throughput increase is lower compared to the client consistency impact for the social workload since the caching workload is only a 50% read workload and only the read performance benefits from the applied weaker read consistency settings. It is also possible to further increase the overall throughput by applying weaker write consistency settings.

MongoDB vs ScyllaDB - Client Consistency Impact Throughput - caching uniform

Performance Results - Sensor Workload

The sensor workload is also based on YCSB and its default data model, but with an operation distribution of 90% insert operations and 10% read operations that simulates a real-world IoT application. The workload is executed with the latest request distribution pattern. This workload is executed against the small database scaling size with a data set of 250GB and against the medium scaling size with a data set of 500GB.

MongoDB vs ScyllaDB - Key Insights - Sensor Workload

Throughput Results

The throughput results for the sensor workload show that the small ScyllaDB cluster is able to serve 60 kOps/s with a cluster utilization of ~89% while the small MongoDB cluster serves only 8 kOps/s under a comparable cluster utilization of 85-90%. For the medium cluster sizes, ScyllaDB achieves an average throughput of 236 kOps/s with ~88% cluster utilization and MongoDB 21 kOps/s with a cluster utilization of 75%-85%.

MongoDB vs ScyllaDB - Throughput - Sensor Latest

Scalability Results

Analogous to the previous workloads, the throughput results allow us to compare the theoretical throughput scalability with the actually achieved scalability. For ScyllaDB, the maximal theoretical throughput scaling factor is 400% when scaling from small to medium. For MongoDB, the theoretical maximal throughput scaling factor is 600% when scaling from small to medium.

The ScyllaDB scalability results show that ScyllaDB nearly achieves linear scalability, with a throughput scalability of 393% of the theoretically possible 400%.

The scalability results for MongoDB show that it achieves a throughput scalability factor of 262% out of the theoretically possible 600%.

ScyllaDB - Throughput Scalability - sensor latest
MongoDB - Throughput Scalability - sensor latest

Throughput per Cost Ratio

In order to compare the costs/month in relation to the provided throughput, we take the MongoDB Atlas throughput/$ as baseline (i.e. 100%) and compare it with the provided ScyllaDB Cloud throughput/$.

The results show that ScyllaDB provides 6 times more operations/$ compared to MongoDB Atlas for the small scaling size and 11 times more operations/$ for the medium scaling size. Similar to the caching workload, MongoDB is able to scale its throughput with growing instance/cluster sizes, but the provided operations/$ decrease.

MongoDB vs ScyllaDB - Throughput/Cost - sensor latest

Latency Results

The P99 latency results for the sensor workload show that ScyllaDB and MongoDB provide consistently low P99 read latencies for the small and medium scaling sizes. MongoDB provides the lower read latency for the small scaling size, while ScyllaDB provides the lower read latency for the medium scaling size.

For the insert latencies, the results show a similar trend as for the previous workloads: ScyllaDB provides stable and low insert latencies, while MongoDB experiences up to 21 times higher insert latencies.

MongoDB vs ScyllaDB - P99 Latency - sensor latest

Technical Nugget 1 - Performance Impact of the Data Model

The default YCSB data model is composed of a primary key and a data item with 10 fields of strings, which results in documents with 10 attributes for MongoDB and a table with 10 columns for ScyllaDB. We analyze how performance changes if a pure key-value data model is applied for both databases: a table with only one column for ScyllaDB and a document with only one field for MongoDB, keeping the same record size of 1 KB.

Compared to the data model impact for the social workload, the throughput improvements for the sensor workload are clearly lower. ScyllaDB improves the throughput by 8%, while MongoDB shows no throughput improvement. In general, this indicates that a pure key-value data model improves the performance of read-heavy workloads rather than write-heavy workloads.

MongoDB vs ScyllaDB - Data Model Impact Throughput - sensor latest

Conclusion

This benchmarking study comprises 133 performance and scalability measurements that compare MongoDB against ScyllaDB. The results show that ScyllaDB outperforms MongoDB for 132 of the 133 measurements.

For all the applied workloads, namely caching, social and sensor, ScyllaDB provides higher throughput (up to 20 times) and better throughput scalability results compared to MongoDB. Regarding the latency results, ScyllaDB achieves P99 latencies below 10 ms for insert, read and update operations for almost all scenarios. In contrast, MongoDB only achieves P99 latencies below 10 ms for certain read operations, while the insert and update latencies are up to 68 times higher compared to ScyllaDB. These results validate the claim that ScyllaDB’s distributed architecture is able to provide predictable performance at scale (for more details, see the technical comparison).

The scalability results show that both database technologies scale horizontally with growing workloads. However, ScyllaDB achieves nearly linear scalability, while MongoDB shows less efficient horizontal scalability. ScyllaDB's results were to a certain degree expected given its multi-primary distributed architecture, although near-linear scalability is still an outstanding result. MongoDB's less efficient scalability is likewise expected due to its different distributed architecture (for more details, see the technical comparison).

When it comes to price/performance, the results show a clear advantage for ScyllaDB with up to 19 times better price/performance ratio depending on the workload and data set size. In consequence, achieving comparable performance to ScyllaDB would require a significantly larger and more expensive MongoDB Atlas cluster.

In summary, this benchmarking study shows that ScyllaDB provides a great solution for applications that operate on data sets in the terabyte range and that require high throughput (e.g., over 50K OPS) and predictable low latency for read and write operations. This study does not consider the performance impact of advanced data models (e.g., time series or vectors) or complex operation types (e.g., aggregates or scans); these are subject to future benchmark studies. Even for these aspects, however, the current results show that carrying out an in-depth benchmark before selecting a database technology helps to select a database that significantly lowers costs and prevents future performance problems.

Appendix - Raw Benchmark Results

In order to ensure full transparency and reproducibility of the presented results, all benchmark results are publicly available on GitHub. This data contains the raw performance measurements as well as additional metadata such as DBaaS instance details and details of the VMs running the YCSB instances.

The structure of the repository is aligned to the structure of this report.

Appendix - Benchmarking Process

Database benchmarking remains an active topic in both research and industry. With benchANT’s roots in research, the concepts of transparent and reproducible database benchmarking are applied in this report’s context to ensure a fair comparison.

When benchmarking database systems for their performance (i.e. throughput and latency), there are two fundamental objectives:

  1. Finding the maximum achievable throughput for different database instances
  2. Finding the best read and write latency for different database instances

For objective 1, the workload is executed without a throughput limitation and with sufficient workload instances to push the database instance to the maximum available throughput. In this scenario, the latency results are only secondary. Latencies are expected to fluctuate over the benchmark runtime due to the continuously high load on the database system.

For objective 2, the workload is run with a target throughput that ensures stable load on the cluster, and the latency results are the primary focus. The challenge in this approach lies in identifying the required number of benchmark instances and client threads per instance to achieve the desired cluster utilization. It is noteworthy that many benchmarking tools are affected by the coordinated omission problem. In a nutshell, affected benchmarking suites might report embellished latencies that are still valid for a relative comparison but might not hold as absolute values. YCSB does not yet resolve the coordinated omission problem (find more details in the YCSB changelog and our reported issue). Consequently, the measured latencies should be taken with a grain of salt when it comes to absolute latencies.
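
A tiny sketch of the effect with invented numbers: at a fixed target rate, each operation has an intended start time; if a stall delays the actual send, measuring from the actual start hides the queueing delay the request experienced:

```python
TARGET_INTERVAL = 0.001                   # 1 ms between intended operation starts

ops = [(0.000, 0.0005), (0.010, 0.0105)]  # (actual_start, finish); 2nd op stalled
for i, (actual_start, finish) in enumerate(ops):
    intended_start = i * TARGET_INTERVAL
    measured = finish - actual_start      # what an affected client reports:
                                          # 0.5 ms for both operations
    corrected = finish - intended_start   # includes the wait: 9.5 ms for op 1
    print(f"op {i}: measured {measured*1e3:.1f} ms, "
          f"corrected {corrected*1e3:.1f} ms")
```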

In this comparison, we follow approach 2. Since the benchANT framework fully automates the benchmark process, it is easy to run multiple benchmarks with varying workload intensities to identify the optimal throughput of each database, i.e. a constant throughput rate that still provides stable latencies. For MongoDB and ScyllaDB, this results in ~80-85% CPU cluster utilization.

The following figure illustrates on a high level the technical benchmarking tasks that are automated by the benchANT platform.

benchANT's Benchmarking Process
  • The DEPLOY DBaaS INSTANCE step creates a new ScyllaDB/MongoDB Atlas instance based on the provided deployment parameters. In addition, this step also enables sharding on the YCSB collection level for MongoDB and creates the table YCSB for ScyllaDB.
  • The DEPLOY YCSB INSTANCE step deploys the specified number of YCSB instances on AWS EC2 with a mapping of one YCSB instance to one VM.
  • The EXECUTE BENCHMARK step orchestrates the YCSB instances to load the database instances with the initial data set, ensuring at least 15 minutes of stabilization time before executing the actual workload (caching/social/sensor).
  • The PROCESS RESULTS step collects all YCSB benchmark results from the YCSB instances and processes them into one data frame. In addition, monitoring data from MongoDB Atlas is automatically collected and processed to analyze the cluster utilization in relation to the applied workload. For ScyllaDB, it is currently not possible to extract the monitoring data, so we manually inspect the monitoring dashboard on ScyllaDB Cloud. In general, both DBaaS offerings come with powerful monitoring dashboards, as shown in the following example screenshots. The goal of the analysis is to determine a workload intensity for MongoDB and ScyllaDB that results in ~85% database utilization.
  • In the UPDATE BENCHMARK CONFIG step, the results of the previous analysis step are taken into account and the number of YCSB instances and client threads per instance is aligned to approach the target cluster utilization.

This process is carried out for each benchmark configuration and database until the target cluster utilization is achieved. In total, we run 133 benchmarks for MongoDB Atlas and ScyllaDB to determine the optimal constant throughput for each cluster size and each workload. The full data set of the results presented in this report is available on GitHub.

MongoDB Atlas & ScyllaDB Cloud Monitoring Dashboard

Appendix - Determined Workload Intensities

Based on the outlined benchmarking process, the following workload intensities for the target database scaling size have been determined in order to achieve ~80-90% CPU utilization across the cluster:

| DB | Scaling Size | Workload | Request Distribution | YCSB Instances | Threads per YCSB Instance |
|---|---|---|---|---|---|
| ScyllaDB | small | social | uniform | 2 | 100 |
| ScyllaDB | medium | social | uniform | 4 | 150 |
| ScyllaDB | small | social | hotspot | 2 | 150 |
| ScyllaDB | medium | social | hotspot | 4 | 300 |
| ScyllaDB | small | caching | uniform | 2 | 125 |
| ScyllaDB | medium | caching | uniform | 4 | 250 |
| ScyllaDB | large | caching | uniform | 16 | 200 |
| ScyllaDB | small | caching | hotspot | 2 | 150 |
| ScyllaDB | medium | caching | hotspot | 8 | 150 |
| ScyllaDB | small | sensor | latest | 2 | 100 |
| ScyllaDB | medium | sensor | latest | 4 | 150 |
| MongoDB | small | social | uniform | 2 | 50 |
| MongoDB | medium | social | uniform | 4 | 100 |
| MongoDB | small | social | hotspot | 2 | 50 |
| MongoDB | medium | social | hotspot | 4 | 100 |
| MongoDB | small | caching | uniform | 2 | 50 |
| MongoDB | medium | caching | uniform | 4 | 100 |
| MongoDB | large | caching | uniform | 8 | 100 |
| MongoDB | small | caching | hotspot | 2 | 50 |
| MongoDB | medium | caching | hotspot | 4 | 100 |
| MongoDB | small | sensor | latest | 2 | 50 |
| MongoDB | medium | sensor | latest | 4 | 100 |

Appendix - YCSB Extension

During the benchmarking project, we encountered two technical problems with the applied YCSB 0.18.0-SNAPSHOT version that are described in the following.

Coordinated Omission Feature

As described above, YCSB is affected by the coordinated omission problem, which we tried to address in this benchmarking project. However, there is not yet a proposed solution to the issue we created in the YCSB GitHub repository.

We analyzed this problem by running benchmarks with the parameter measurement.interval=both not only for MongoDB and ScyllaDB but also for other databases such as PostgreSQL and Couchbase; these runs also resulted in unrealistically high latencies of >100 s.

Due to time and resource constraints, we have not debugged the current YCSB code for further details.

Support for Data Sets of >2.1 TB

While running the scaling size large benchmarks with a target data set size of 10 TB, it turned out that YCSB is currently limited to data sets of 2.1 TB because the recordCount attribute uses the data type int in several places in the code. This limitation is also reported in a YCSB issue.

We have resolved this problem in the benchANT YCSB fork that has been applied for the ScyllaDB caching large benchmarks. The changes will be contributed back to the original YCSB repository.

About Dr. Daniel Seybold

Dr. Daniel Seybold is the co-founder and CTO of benchANT, a company that specializes in benchmarking and performance testing of databases. Daniel started his career as a researcher with a focus on distributed systems and databases. He has extensive experience in the field of database performance testing and has been working with NoSQL databases such as MongoDB, Cassandra and ScyllaDB for more than a decade. During his academic career he published over 20 papers on cloud and database performance-related topics at renowned scientific conferences and completed his PhD with the thesis "An automation-based approach for reproducible evaluations of distributed DBMS on elastic infrastructures".

These research results are the technical foundation of benchANT which pursues the goal of supporting organizations in selecting the right database for their use case.

From his point of view, there is no "best" database, but only a better-suited and more efficient database solution for each use case.