TSPERF is a tool for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's Time Series Benchmark Suite (TSBS).
» Documentation | Changelog | PyPI | Issues | Source code | License
The `tsperf` program is a database workload generator covering two different domains, one for writing data and another one for reading.
- Data generator: Generate time series data and feed it into a database. Use `tsperf write --help` to explore its options.
- Query timer: Probe the responsiveness of a database on the read path. Use `tsperf read --help` to explore its options.
For the purpose of capacity testing, both domains simulate the generation and querying of time series data. Because the program is easy to use, it provides instant results without the need to set up a whole data ingestion chain.
- Generate random data which follows a statistical model, in order to better reflect real-world scenarios; real-world data is almost never truly random.
- The "steady load"-mode can simulate a constant load of a defined number of messages per second.
- Ready-made to deploy and scale data generators with Docker containers. In order to maximize performance, multiple instances of the data generator can be run in parallel, for example on Kubernetes; see the sketch after these lists for a plain-Docker variant.
- Metrics are exposed for consumption by Prometheus.
- Data generator features
- Easy to define your own schema.
- Full control over how many values will be inserted.
- Scaling out to multiple clients is a core concept.
- Huge data sets can be inserted without creating intermediate files for staging.
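For illustration, a minimal sketch of running multiple data generator instances in parallel with plain Docker; it assumes the `tsperf` Docker image and the CrateDB setup from the usage section below, and the instance count is arbitrary:
# Launch four detached data generator instances against the same database.
for i in 1 2 3 4; do
  docker run -d --rm --network=host tsperf \
    tsperf write --adapter=cratedb --schema=tsperf.schema.basic:environment.json
done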
Supported databases:
- CrateDB
- InfluxDB
- Microsoft SQL Server
- MongoDB
- PostgreSQL
- TimescaleDB
- Timestream
The Time Series Benchmark Suite (TSBS) is a collection of Go programs that are used to generate datasets and then benchmark read and write performance of various databases.
`mkjson` combined with `cr8 insert-json` makes it easy to generate random records and insert them into a table. See "generate data sets using mkjson" for an example of how to use `cr8` together with `mkjson`.
# Install tsperf.
pip install --user tsperf
# Alternatively, use the Docker image.
docker run -it --rm --network=host tsperf tsperf write --help
This section outlines the usage of `tsperf` on different databases. Please note that using Docker here is just for demonstration purposes. In reality, you will want to run the database workload against a database instance running on a decently powered machine.
- For increasing concurrency, try `--concurrency=8`.
- For enabling Prometheus metrics export, try `--prometheus-enable=true` and maybe `--prometheus-listen=0.0.0.0:8000`.
- For increasing concurrency and the number of iterations when querying, try `--concurrency=10 --iterations=2000`.
- For displaying the list of built-in schemas, run `tsperf schema --list`.
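Putting those options together, a sketch of a tuned write and read run; all flags are documented above, and the adapter, schema, and query are the ones used throughout this section:
# Write with eight concurrent workers, exposing Prometheus metrics on port 8000.
tsperf write --adapter=cratedb --schema=tsperf.schema.basic:environment.json \
  --concurrency=8 --prometheus-enable=true --prometheus-listen=0.0.0.0:8000
# Query with ten concurrent workers and 2000 iterations.
tsperf read --adapter=cratedb --concurrency=10 --iterations=2000 \
  --query="SELECT * FROM environment LIMIT 10;"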
# Run CrateDB
docker run -it --rm --publish=4200:4200 --publish=5432:5432 crate:4.5.1
# Feed data into CrateDB table.
# Adjust write parameters like `--partition=day --shards=6 --replicas=3`.
tsperf write --adapter=cratedb --schema=tsperf.schema.basic:environment.json
# Alternatively, address a remote CrateDB instance.
tsperf write --schema=tsperf.schema.basic:environment.json --adapter=cratedb --address=cratedb.example.org:4200
# Use Docker.
docker run -it --rm --network=host tsperf tsperf write --schema=tsperf.schema.basic:environment.json --adapter=cratedb
# Query data from CrateDB table.
tsperf read --adapter=cratedb --query="SELECT * FROM environment LIMIT 10;"
# Run CrateDB workload via PostgreSQL protocol.
tsperf write --adapter=cratedbpg --schema=tsperf.schema.basic:environment.json
tsperf read --adapter=cratedbpg --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
# Run PostgreSQL workload on CrateDB.
tsperf write --adapter=postgresql --schema=tsperf.schema.basic:environment.json
tsperf read --adapter=postgresql --username=crate --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
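To verify the ingest, a quick sanity check against CrateDB's HTTP endpoint; this is a sketch assuming the local Docker instance from above and the `environment` table created by the schema:
# Count the rows written to the `environment` table.
curl -sS -H "Content-Type: application/json" -X POST "http://localhost:4200/_sql" \
  -d '{"stmt": "SELECT COUNT(*) FROM environment;"}'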
# Run and configure InfluxDB
docker run -it --rm --publish=8086:8086 influxdb:2.0
influx setup --name=default --username=root --password=12345678 --org=acme --bucket=environment --retention=0 --force
# Display the authentication token generated by `influx setup`.
cat ~/.influxdbv2/configs
# Configure tsperf
export ADAPTER=influxdb
export ADDRESS=http://localhost:8086/
export INFLUXDB_ORGANIZATION=acme
export INFLUXDB_TOKEN="X1kHPaXvS...p1IAQ=="
# Feed data into InfluxDB bucket.
tsperf write --schema=tsperf.schema.basic:environment.json
# Query data from InfluxDB bucket.
tsperf read --query='from(bucket:"environment") |> range(start:-2h, stop:2h) |> limit(n: 10)'
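For cross-checking the written data, the same Flux query can be run with the `influx` CLI; a sketch assuming the CLI configuration created by `influx setup` above:
# Read a sample of the written records directly.
influx query 'from(bucket:"environment") |> range(start:-2h) |> limit(n: 10)'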
On InfluxDB Cloud, after generating an "All Access Token", configure `tsperf` like:
export ADAPTER=influxdb
export ADDRESS="https://eu-central-1-1.aws.cloud2.influxdata.com/"
export INFLUXDB_ORGANIZATION=a05test6edtest2d
export INFLUXDB_TOKEN="wpNtestfeNUveYitDLk8Ld47vrSVUTKB_vEaEwWC7qXj_ZqvOwYCRhQTB4EDty3uLFMXWP2C195gtestt4XGFQ=="
# Run Microsoft SQL Server
docker run -it --rm --name=mssql --publish=1433:1433 --env="ACCEPT_EULA=Y" --env="SA_PASSWORD=yayRirr3" mcr.microsoft.com/mssql/server:2019-latest
docker exec -it mssql /opt/mssql-tools/bin/sqlcmd -S localhost -U sa -P yayRirr3 -Q "select @@Version"
# Install the Microsoft ODBC driver for SQL Server.
# See: https://docs.microsoft.com/en-us/sql/connect/odbc/download-odbc-driver-for-sql-server
# Configure tsperf
export ADAPTER=mssql
# Feed data into MSSQL table.
tsperf write --schema=tsperf.schema.basic:environment.json
# Query data from MSSQL table.
tsperf read --iterations=3000 --query="SELECT TOP 10 * FROM environment;"
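For a quick sanity check with `sqlcmd`, a sketch reusing the container name and credentials from above; the `environment` table name follows from the schema:
# Count the rows written to the `environment` table.
docker exec -it mssql /opt/mssql-tools/bin/sqlcmd -S localhost -U sa -P yayRirr3 -Q "SELECT COUNT(*) FROM environment;"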
# Run and configure MongoDB
docker run -it --rm --publish=27017:27017 mongo:4.4
# Feed data into MongoDB collection.
tsperf write --adapter=mongodb --schema=tsperf.schema.basic:environment.json
# Query data from MongoDB collection.
tsperf read --adapter=mongodb --schema=tsperf.schema.basic:environment.json
# For connecting to MongoDB Atlas, use:
export ADDRESS="mongodb+srv://username:password@testdrive.fkpkw.mongodb.net/tsperf?retryWrites=true&w=majority"
# Run PostgreSQL
docker run -it --rm --env="POSTGRES_HOST_AUTH_METHOD=trust" --publish=5432:5432 postgres:13.3
# Configure tsperf
export ADAPTER=postgresql
# Feed data into PostgreSQL table.
tsperf write --schema=tsperf.schema.basic:environment.json
# Query data from PostgreSQL table.
tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
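To verify, a sketch using `psql` from the same image; the `environment` table name follows from the schema, while the target database is assumed to be the default `postgres`:
# Count the rows written to the `environment` table (target database is an assumption).
docker run -it --rm --network=host postgres:13.3 psql --host=localhost --username=postgres --command="SELECT COUNT(*) FROM environment;"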
# Run TimescaleDB
docker run -it --rm --env="POSTGRES_HOST_AUTH_METHOD=trust" --publish=5432:5432 timescale/timescaledb:2.3.0-pg13
# Configure tsperf
export ADAPTER=timescaledb
# Feed data into TimescaleDB hypertable.
# Adjust write parameters like `--timescaledb-distributed --timescaledb-pgcopy`.
tsperf write --schema=tsperf.schema.basic:environment.json
# Query data from TimescaleDB hypertable.
tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
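To confirm that a hypertable has been created, a sketch querying TimescaleDB's information view (available in TimescaleDB 2.x) with `psql` from the same image:
# List the hypertables known to TimescaleDB.
docker run -it --rm --network=host timescale/timescaledb:2.3.0-pg13 psql --host=localhost --username=postgres --command="SELECT hypertable_name FROM timescaledb_information.hypertables;"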
# Run Timestream
# There is no way to run Amazon Timestream on premises.
# - https://aws.amazon.com/timestream/
# - https://docs.aws.amazon.com/timestream/
# Configure tsperf
export ADAPTER=timestream
export ADDRESS=ingest-cell1.timestream.us-west-2.amazonaws.com
export AWS_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE
export AWS_SECRET_ACCESS_KEY=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
export AWS_REGION_NAME=us-west-2
# Feed data into Timestream table.
tsperf write --schema=tsperf.schema.basic:environment.json
# Query data from Timestream table.
tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
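To verify on the AWS side, a sketch using the AWS CLI; it assumes the credentials exported above and an installed AWS CLI:
# List Timestream databases in the configured region.
aws timestream-write list-databases --region=us-west-2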
We are always happy to receive code contributions, ideas, suggestions and problem reports from the community.
So, if you’d like to contribute, you’re most welcome. Spend some time taking a look around, locate a bug, design issue, or spelling mistake, and then send us a pull request or open an issue on GitHub.
Thanks in advance for your efforts, we really appreciate any help or feedback.
Thanks to all the contributors who helped to co-create and conceive `tsperf` in one way or another, and kudos to all authors of the foundational libraries.
This project is licensed under the terms of the Apache 2.0 license.