MongoDB Adapted TPC-C Benchmark Using Python 3

The goal of this repository is to benchmark MongoDB using the TPC-C benchmark.

Benchmark results are available in ./results.

For more information about the TPC-C benchmark adaptation for MongoDB, please see this paper.

Prerequisites

Install MongoDB.
Install Python 3.
Install the required pip modules:
```
pip install pymongo
pip install execnet
```

Setup

To use multi-document transactions on MongoDB, the database must be deployed as either a sharded cluster or a replica set.

In this test, I used a non-sharded replica set with one node to keep the configuration simple.

To run 'mongod' as a replica set on localhost, follow these steps:

Terminate all running mongo and mongod processes.

Run the MongoDB server.

mongod --port 27017 --dbpath YOUR_DB_DIRECTORY_PATH --replSet rs0 --bind_ip localhost

Initialize the replica set from the client side.

mongo # start mongo shell
> rs.initiate()
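
To confirm from Python that the server is really running as a replica set before starting a benchmark, a check along the following lines may help. This is a minimal sketch and not part of the repository: both function names are hypothetical, and it assumes pymongo 3.11 or newer (for `directConnection`) and a mongod listening on localhost:27017 as configured above. It relies on the `isMaster` command, which MongoDB 4.2 supports.

```python
def describe_replica_set(reply):
    """Summarize an isMaster reply (a plain dict). Hypothetical helper."""
    set_name = reply.get("setName")
    if set_name is not None:
        role = "primary" if reply.get("ismaster") else "not primary"
        return f"replica set {set_name!r}, this node is {role}"
    if reply.get("isreplicaset"):
        # mongod was started with --replSet, but rs.initiate() has not run.
        return "started with --replSet but not yet initiated"
    return "standalone (no replica set configured)"


def check_local_server(host="localhost", port=27017):
    """Ask a running mongod for its replica set state."""
    # Imported lazily so describe_replica_set stays usable without pymongo.
    from pymongo import MongoClient

    # directConnection=True talks to this one node without topology discovery.
    client = MongoClient(host, port, directConnection=True,
                         serverSelectionTimeoutMS=5000)
    try:
        return describe_replica_set(client.admin.command("isMaster"))
    finally:
        client.close()
```

Calling `check_local_server()` after `rs.initiate()` should report the set name given to `--replSet`, which is `rs0` in the command above.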

Run the benchmark

In TPC-C tests, the amount of data is determined by the number of warehouses.

In this test, I used 100 warehouses and the benchmarks ran for 600 seconds.
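
To give a rough sense of scale, the sketch below estimates the initial row counts from the warehouse count, using the cardinalities of the standard TPC-C specification (10 districts per warehouse, 3,000 customers per district, 100,000 items, and so on). It assumes that this repository's loader follows those standard values without a scale factor, which is not confirmed here, and the function name is hypothetical. Order lines are left out because their number per order is randomized.

```python
# Standard TPC-C initial cardinalities (assumed to match this loader).
ITEMS = 100_000
DISTRICTS_PER_WAREHOUSE = 10
CUSTOMERS_PER_DISTRICT = 3_000
STOCK_PER_WAREHOUSE = 100_000
NEW_ORDERS_PER_DISTRICT = 900


def estimate_initial_rows(warehouses):
    """Return approximate initial row counts per TPC-C table."""
    districts = warehouses * DISTRICTS_PER_WAREHOUSE
    customers = districts * CUSTOMERS_PER_DISTRICT
    return {
        "ITEM": ITEMS,                 # fixed, independent of warehouses
        "WAREHOUSE": warehouses,
        "DISTRICT": districts,
        "CUSTOMER": customers,
        "HISTORY": customers,          # one initial history row per customer
        "ORDERS": customers,           # one initial order per customer
        "NEW_ORDER": districts * NEW_ORDERS_PER_DISTRICT,
        "STOCK": warehouses * STOCK_PER_WAREHOUSE,
    }


for table, rows in estimate_initial_rows(100).items():
    print(f"{table:<10}{rows:>12,}")
```

With 100 warehouses this gives 3,000,000 customers and 10,000,000 stock rows. The number of MongoDB documents may differ from these row counts if the driver embeds some tables inside others.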

The number of clients (threads) varied from 1 to 48.

If you just want to run the benchmarks as I did, execute the run.bat file in the root directory.

./run.bat

Otherwise, you can manually set up the benchmarks.

First, generate a MongoDB configuration file.

python ./tpcc.py --print-config mongodb > mongodb.config

Load the data.

# It takes about an hour and a half when there are 100 warehouses.
python ./tpcc.py --no-execute --warehouses=100 --config=mongodb.config mongodb

To measure how long the command takes on Windows, wrap it in PowerShell's Measure-Command instead.

Measure-Command {python ./tpcc.py --no-execute --warehouses=100 --config=mongodb.config mongodb}

Once loaded, the data can be reused across multiple benchmark runs.

Execute the benchmark

python ./tpcc.py --no-load --warehouses=100 --duration=600 --clients=1 --config=mongodb.config mongodb
# If you would like to measure the elapsed time
Measure-Command {python ./tpcc.py --no-load --warehouses=100 --duration=600 --clients=1 --config=mongodb.config mongodb}
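
`Measure-Command` is specific to PowerShell. As a portable alternative, the elapsed time can be measured from Python itself. The following is a minimal sketch and not part of the repository: `build_command`, `timed_run`, and `sweep` are hypothetical helpers, the flags are the ones shown above, and the client counts are illustrative values within the 1 to 48 range rather than the exact ones used for the published results.

```python
import subprocess
import sys
import time


def build_command(clients, warehouses=100, duration=600, config="mongodb.config"):
    """Assemble the benchmark command line shown above as an argument list."""
    return [
        sys.executable, "./tpcc.py", "--no-load",
        f"--warehouses={warehouses}",
        f"--duration={duration}",
        f"--clients={clients}",
        f"--config={config}",
        "mongodb",
    ]


def timed_run(command):
    """Run a command to completion; return (exit code, elapsed seconds)."""
    start = time.perf_counter()
    completed = subprocess.run(command)
    return completed.returncode, time.perf_counter() - start


def sweep(client_counts=(1, 2, 4, 8, 16, 32, 48)):
    """Run the benchmark once per client count (illustrative values)."""
    for clients in client_counts:
        code, seconds = timed_run(build_command(clients))
        print(f"clients={clients} exit={code} elapsed={seconds:.1f}s")
```

Calling `sweep()` from the repository root, after the data has been loaded, runs the benchmark once per client count and prints the wall-clock time of each run.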

Delete the data after finishing the tests.

mongo
> use tpcc
> db.dropDatabase()
> use local
> db.dropDatabase() // The mongo server has to stop running to drop local.

Arguments

--no-execute: Load the data only.
--no-load: Execute the benchmark only.
--warehouses=int: The number of warehouses to use. Default: 4.
--duration=int: How long to run the execution phase of the benchmark (in seconds). Default: 60.
--clients=int: The number of client processes on a single node (for concurrent execution). Default: 1.
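
The flags and defaults listed above can be mirrored with `argparse` as follows. This is an illustrative sketch only, not the parser that `tpcc.py` actually uses. The `system` positional argument is an assumption based on the trailing `mongodb` in the commands above.

```python
import argparse


def make_parser():
    """Build a parser that mirrors the documented flags and defaults."""
    parser = argparse.ArgumentParser(description="Sketch of the benchmark flags")
    parser.add_argument("system", help="target system, e.g. mongodb")
    parser.add_argument("--no-execute", action="store_true",
                        help="load the data only")
    parser.add_argument("--no-load", action="store_true",
                        help="execute the benchmark only")
    parser.add_argument("--warehouses", type=int, default=4)
    parser.add_argument("--duration", type=int, default=60,
                        help="length of the execution phase in seconds")
    parser.add_argument("--clients", type=int, default=1)
    return parser


args = make_parser().parse_args(
    ["--no-load", "--warehouses=100", "--clients=8", "mongodb"]
)
# --duration was not given, so it falls back to the default of 60.
print(args.warehouses, args.duration, args.clients, args.no_load, args.no_execute)
# prints: 100 60 8 True False
```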

Environment

OS: Windows 10, version 2004
CPU: AMD Ryzen 5 3600
RAM: 16 GB
Storage: Crucial MX500 1 TB
MongoDB: 4.2.8
Python: 3.8.2
