title | summary |
---|---|
Configure TiDB for Optimal Performance |
Learn how to optimize TiDB performance by configuring key settings and addressing edge cases. |
This guide describes how to optimize the performance of TiDB, including:
- Best practices for common workloads.
- Strategies for handling challenging performance scenarios.
Note:
The optimization techniques in this guide can help achieve optimal performance in TiDB. However, performance tuning often involves balancing multiple factors, and no single solution addresses all performance needs. Some techniques in this guide use experimental features, which are marked accordingly. Although these optimizations can significantly improve performance, they might not be suitable for production environments and require careful evaluation before implementation.
Optimizing TiDB for peak performance requires careful tuning of various settings. In many cases, achieving optimal performance involves adjusting configurations beyond their default values.
The default settings prioritize stability over performance. To maximize performance, you might need to use more aggressive configurations and, in some cases, experimental features. These recommendations are based on production deployment experience and performance optimization research.
This guide describes the non-default settings, including their benefits and potential trade-offs. Use this information to optimize TiDB settings for your workload requirements.
The following settings are commonly used to optimize TiDB performance:
- Enhance execution plan cache, such as SQL Prepared Execution Plan Cache, Non-prepared plan cache, and Instance-level execution plan cache.
- Optimize the behavior of the TiDB optimizer by using Optimizer Fix Controls.
- Use the Titan storage engine more aggressively.
These settings can significantly improve performance for many workloads. However, as with any optimization, thoroughly test them in your environment before deploying to production.
Execute the following SQL commands to apply the recommended settings:
SET GLOBAL tidb_enable_instance_plan_cache=on;
SET GLOBAL tidb_instance_plan_cache_max_size=2GiB;
SET GLOBAL tidb_enable_non_prepared_plan_cache=on;
SET GLOBAL tidb_ignore_prepared_cache_close_stmt=on;
SET GLOBAL tidb_analyze_column_options='ALL';
SET GLOBAL tidb_stats_load_sync_wait=2000;
SET GLOBAL tidb_opt_derive_topn=on;
SET GLOBAL tidb_runtime_filter_mode=LOCAL;
SET GLOBAL tidb_opt_enable_mpp_shared_cte_execution=on;
SET GLOBAL tidb_rc_read_check_ts=on;
SET GLOBAL tidb_guarantee_linearizability=off;
SET GLOBAL pd_enable_follower_handle_region=on;
SET GLOBAL tidb_opt_fix_control = '44262:ON,44389:ON,44823:10000,44830:ON,44855:ON,52869:ON';
The following table outlines the impact of specific system variable configurations:
System variable | Description | Note |
---|---|---|
tidb_enable_instance_plan_cache and tidb_instance_plan_cache_max_size |
Use instance-level plan cache instead of session-level caching. This significantly improves performance for workloads with high connection counts or frequent prepared statement usage. | This is an experimental feature. Test in non-production environments first and monitor memory usage as the plan cache size increases. |
tidb_enable_non_prepared_plan_cache |
Enable the Non-prepared plan cache feature to reduce compile costs for applications that do not use prepared statements. | N/A |
tidb_ignore_prepared_cache_close_stmt |
Cache plans for applications that use prepared statements but close the plan after each execution. | N/A |
tidb_analyze_column_options |
Collect statistics for all columns to avoid suboptimal execution plans due to missing column statistics. By default, TiDB only collects statistics for predicate columns. | Setting this variable to 'ALL' can cause more resource usage for the ANALYZE TABLE operation compared with the default value 'PREDICATE' . |
tidb_stats_load_sync_wait |
Increase the timeout for synchronously loading statistics from the default 100 milliseconds to 2 seconds. This ensures TiDB loads the necessary statistics before query compilation. | Increasing this value leads to a longer synchronization wait time before query compilation. |
tidb_opt_derive_topn |
Enable the optimization rule of Deriving TopN or Limit from window functions. | This is limited to the ROW_NUMBER() window function. |
tidb_runtime_filter_mode |
Enable Runtime Filter in the local mode to improve hash join efficiency. | The variable is introduced in v7.2.0 and is disabled by default for safety. |
tidb_opt_enable_mpp_shared_cte_execution |
Enable non-recursive Common Table Expressions (CTE) pushdown to TiFlash. | This is an experimental feature. |
tidb_rc_read_check_ts |
For the read-committed isolation level, enabling this variable avoids the latency and cost of getting the global timestamp and optimizes transaction-level read latency. | This feature is incompatible with the Repeatable Read isolation level. |
tidb_guarantee_linearizability |
Improve performance by skipping the commit timestamp fetch from the PD server. | This sacrifices linearizability in favor of performance. Only causal consistency is guaranteed. It is not suitable for scenarios requiring strict linearizability. |
pd_enable_follower_handle_region |
Activate the PD Follower feature, allowing PD followers to process Region requests. This helps distribute load evenly across all PD servers and reduces CPU pressure on the PD leader. | This is an experimental feature. Test in non-production environments. |
tidb_opt_fix_control |
Enable advanced query optimization strategies to improve performance through additional optimization rules and heuristics. | Test thoroughly in your environment, as performance improvements vary by workload. |
The following describes the optimizer control configurations that enable additional optimizations:
44262:ON
: Use Dynamic pruning mode to access the partitioned table when the GlobalStats are missing.44389:ON
: For filters such asc = 10 and (a = 'xx' or (a = 'kk' and b = 1))
, build more comprehensive scan ranges forIndexRangeScan
.44823:10000
: To save memory, plan cache does not cache queries with parameters exceeding the specified number of this variable. Increase plan cache parameter limit from200
to10000
to make plan cache available for query with long in-lists.44830:ON
: Plan cache is allowed to cache execution plans with thePointGet
operator generated during physical optimization.44855:ON
: The optimizer selectsIndexJoin
when theProbe
side of anIndexJoin
operator contains aSelection
operator.52869:ON
: The optimizer chooses index merge automatically if the optimizer can choose the single index scan method (other than full table scan) for a query plan.
Add the following configuration items to the TiKV configuration file:
[server]
concurrent-send-snap-limit = 64
concurrent-recv-snap-limit = 64
snap-io-max-bytes-per-sec = "400MiB"
[pessimistic-txn]
in-memory-peer-size-limit = "32MiB"
in-memory-instance-size-limit = "512MiB"
[rocksdb.titan]
enabled = true
[rocksdb.defaultcf.titan]
min-blob-size = "1KB"
blob-file-compression = "zstd"
[storage.flow-control]
l0-files-threshold = 60
Configuration item | Description | Note |
---|---|---|
concurrent-send-snap-limit , concurrent-recv-snap-limit , and snap-io-max-bytes-per-sec |
Set limits for concurrent snapshot transfer and I/O bandwidth during TiKV scaling operations. Higher limits reduce scaling time by allowing faster data migration. | Adjusting these limits affects the trade-off between scaling speed and online transaction performance. |
in-memory-peer-size-limit and in-memory-instance-size-limit |
Control the memory allocation for pessimistic lock caching at the Region and TiKV instance levels. Storing locks in memory reduces disk I/O and improves transaction performance. | Monitor memory usage carefully. Higher limits improve performance but increase memory consumption. |
rocksdb.titan , rocksdb.defaultcf.titan , min-blob-size , and blob-file-compression |
Enable the Titan storage engine to reduce write amplification and alleviate disk I/O bottlenecks. Particularly useful when RocksDB compaction cannot keep up with write workloads, resulting in accumulated pending compaction bytes. | Enable it when write amplification is the primary bottleneck. Trade-offs include: 1. Potential performance impact on primary key range scans. 2. Increased space amplification (up to 2x in the worst case). 3. Additional memory usage for blob cache. |
storage.flow-control.l0-files-threshold |
Control when write flow control is triggered based on the number of kvDB L0 files. Increasing the threshold reduces write stalls during high write workloads. | Higher thresholds might lead to more aggressive compactions when many L0 files exist. |
Add the following configuration items to the TiFlash configuration file:
[raftstore-proxy.server]
snap-io-max-bytes-per-sec = "300MiB"
Configuration item | Description | Note |
---|---|---|
snap-io-max-bytes-per-sec |
Control the maximum allowable disk bandwidth for data replication from TiKV to TiFlash. Higher limits accelerate initial data loading and catch-up replication. | Higher bandwidth consumption might impact online transaction performance. Balance between replication speed and system stability. |
This section compares performance between default settings (baseline) and optimized settings based on the preceding key settings for common loads.
The test environment is as follows:
- 3 TiDB servers (16 cores, 64 GiB)
- 3 TiKV servers (16 cores, 64 GiB)
- TiDB version: v8.4.0
- Workload: sysbench oltp_read_only
The following table compares throughput, latency, and plan cache hit ratio between baseline and optimized settings.
Metric | Baseline | Optimized | Improvement |
---|---|---|---|
QPS | 89,100 | 100,128 | +12.38% |
Average latency (ms) | 35.87 | 31.92 | -11.01% |
P95 latency (ms) | 58.92 | 51.02 | -13.41% |
Plan cache hit ratio (%) | 56.89% | 87.51% | +53.82% |
Plan cache memory usage (MiB) | 95.3 | 70.2 | -26.34% |
The instance plan cache demonstrates significant performance improvements over the baseline configuration:
-
Higher hit ratio: increases by 53.82% (from 56.89% to 87.51%).
-
Lower memory usage: decreases by 26.34% (from 95.3 MiB to 70.2 MiB).
-
Better performance:
- QPS increases by 12.38%.
- Average latency decreases by 11.01%.
- P95 latency decreases by 13.41%.
Instance plan cache improves performance through these mechanisms:
- Cache execution plans for
SELECT
statements in memory. - Share cached plans across all connections (up to 200) on the same TiDB instance.
- Can effectively store plans for up to 5,000
SELECT
statements across 1,000 tables. - Cache misses primarily occur only for
BEGIN
andCOMMIT
statements.
Although the benchmark using simple sysbench oltp_read_only
queries (14 KB per plan) shows modest improvements, you can expect greater benefits in real-word applications:
- Complex queries can run up to 20 times faster.
- Memory usage is more efficient compared to session-level plan cache.
Instance plan cache is particularly effective for systems with:
- Large tables with many columns.
- Complex SQL queries.
- High concurrent connections.
- Diverse query patterns.
Instance plan cache provides better memory efficiency than session-level plan cache because:
- Plans are shared across all connections
- No need to duplicate plans for each session
- More efficient memory utilization while maintaining higher hit ratios
In scenarios with multiple connections and complex queries, session-level plan cache would require significantly more memory to achieve similar hit ratios, making instance plan cache the more efficient choice.
The following sysbench oltp_read_only prepare
command loads data:
sysbench oltp_read_only prepare --mysql-host={host} --mysql-port={port} --mysql-user=root --db-driver=mysql --mysql-db=test --threads=100 --time=900 --report-interval=10 --tables=1000 --table-size=10000
The following sysbench oltp_read_only run
command runs workload:
sysbench oltp_read_only run --mysql-host={host} --mysql-port={port} --mysql-user=root --db-driver=mysql --mysql-db=test --threads=200 --time=900 --report-interval=10 --tables=1000 --table-size=10000
For more information, see How to Test TiDB Using Sysbench.
The test environment is as follows:
- 3 TiDB servers (16 cores, 64 GiB)
- 3 TiKV servers (16 cores, 64 GiB)
- TiDB version: v8.4.0
- Workload: go-ycsb workloada
The following table compares throughput (operations per second) between the baseline and optimized settings.
Item | Baseline (OPS) | Optimized (OPS) | Improvement |
---|---|---|---|
load data | 2858.5 | 5074.3 | +77.59% |
workloada | 2243.0 | 12804.3 | +470.86% |
Titan is enabled by default starting from v7.6.0 and the default min-blob-size
of Titan in TiDB v8.4.0 is 32KiB
. The baseline configuration uses a record size of 31KiB
to ensure data is stored in RocksDB. In contrast, for the key settings configuration, set min-blob-size
to 1KiB
, causing data to be stored in Titan.
The performance improvement observed in the key settings is primarily attributed to Titan's ability to reduce RocksDB compactions. As shown in the following figures:
- Baseline: The total throughput of RocksDB compaction exceeds 1 GiB/s, with peaks over 3 GiB/s.
- Key settings: The peak throughput of RocksDB compaction remains below 100 MiB/s.
This significant reduction in compaction overhead contributes to the overall throughput improvement seen in the key settings configuration.
The following go-ycsb load
command loads data:
go-ycsb load mysql -P /ycsb/workloads/workloada -p {host} -p mysql.port={port} -p threadcount=100 -p recordcount=5000000 -p operationcount=5000000 -p workload=core -p requestdistribution=uniform -pfieldcount=31 -p fieldlength=1024
The following go-ycsb run
command runs workload:
go-ycsb run mysql -P /ycsb/workloads/workloada -p {host} -p mysql.port={port} -p mysql.db=test -p threadcount=100 -p recordcount=5000000 -p operationcount=5000000 -p workload=core -prequestdistribution=uniform -p fieldcount=31 -p fieldlength=1024
This section shows you how to optimize TiDB for specific scenarios that need targeted adjustments beyond basic optimizations. You will learn how to tune TiDB for your particular use cases.
To identify edge cases, perform the following steps:
- Analyze query patterns and workload characteristics.
- Monitor system metrics to identify performance bottlenecks.
- Gather feedback from application teams about specific issues.
The following lists some common edge cases:
- High TSO wait for high-frequency small queries
- Choose the proper max chunk size for different workloads
- Tune coprocessor cache for read-heavy workloads
- Optimize chunk size for workload characteristics
- Optimize transaction mode and DML type for different workloads
- Optimize
GROUP BY
andDISTINCT
operations with TiKV pushdown - Mitigate MVCC version accumulation using in-memory engine
- Optimize statistics collection during batch operations
- Optimize thread pool settings for different instance types
The following sections explain how to handle each of these cases. You need to adjust different parameters or use specific TiDB features for each scenario.
Note:
Apply these optimizations carefully and test thoroughly, as their effectiveness might vary based on your use case and data patterns.
If your workload involves frequent small transactions or queries that frequently request timestamps, TSO (Timestamp Oracle) can become a performance bottleneck. To check if TSO wait time is impacting your system, check the Performance Overview > SQL Execute Time Overview panel. If TSO wait time constitutes a large portion of your SQL execution time, consider the following optimizations:
- Use low-precision TSO (enable
tidb_low_resolution_tso
) for read operations that do not need strict consistency. For more information, see Solution 1: use low-precision TSO. - Combine small transactions into larger ones where possible. For more information, see Solution 2: parallel mode for TSO requests.
You can reduce TSO wait time by enabling the low-precision TSO feature (tidb_low_resolution_tso
). After this feature is enabled, TiDB uses the cached timestamp to read data, reducing TSO wait time at the expense of potentially stale reads.
This optimization is particularly effective in the following scenarios:
- Read-heavy workloads where slight staleness is acceptable.
- Scenarios where reducing query latency is more important than absolute consistency.
- Applications that can tolerate reads that are a few seconds behind the latest committed state.
Benefits and trade-offs:
- Reduce query latency by enabling stale reads with a cached TSO, eliminating the need to request new timestamps.
- Balance performance against data consistency: this feature is only suitable for scenarios where stale reads are acceptable. It is not recommended to use it when strict data consistency is required.
To enable this optimization:
SET GLOBAL tidb_low_resolution_tso=ON;
The tidb_tso_client_rpc_mode
system variable switches the mode in which TiDB sends TSO RPC requests to PD. The default value is DEFAULT
. When the following conditions are met, you can consider switching this variable to PARALLEL
or PARALLEL-FAST
for potential performance improvements:
- TSO waiting time constitutes a significant portion of the total execution time of SQL queries.
- The TSO allocation in PD has not reached its bottleneck.
- PD and TiDB nodes have sufficient CPU resources.
- The network latency between TiDB and PD is significantly higher than the time PD takes to allocate TSO (that is, network latency accounts for the majority of TSO RPC duration).
- To get the duration of TSO RPC requests, check the PD TSO RPC Duration panel in the PD Client section of the Grafana TiDB dashboard.
- To get the duration of PD TSO allocation, check the PD server TSO handle duration panel in the TiDB section of the Grafana PD dashboard.
- The additional network traffic resulting from more TSO RPC requests between TiDB and PD (twice for
PARALLEL
or four times forPARALLEL-FAST
) is acceptable.
To switch the parallel mode, execute the following command:
-- Use the PARALLEL mode
SET GLOBAL tidb_tso_client_rpc_mode=PARALLEL;
-- Use the PARALLEL-FAST mode
SET GLOBAL tidb_tso_client_rpc_mode=PARALLEL-FAST;
You can improve query performance for read-heavy workloads by optimizing the coprocessor cache. This cache stores the results of coprocessor requests, reducing repeated computations of frequently accessed data. To optimize cache performance, perform the following steps:
- Monitor the cache hit ratio using the metrics described in Coprocessor Cache.
- Increase the cache size to improve hit rates for larger working sets.
- Adjust the admission threshold based on query patterns.
The following lists some recommended settings for a read-heavy workload:
[tikv-client.copr-cache]
capacity-mb = 4096
admission-max-ranges = 5000
admission-max-result-mb = 10
admission-min-process-ms = 0
The tidb_max_chunk_size
system variable sets the maximum number of rows in a chunk during the execution process. Adjusting this value based on your workload can improve performance.
-
For OLTP workloads with large concurrency and small transactions:
- Set the value between
128
and256
rows (the default value is1024
). - This reduces memory usage and makes limit queries faster.
- Use case: point queries, small range scans.
SET GLOBAL tidb_max_chunk_size = 128;
- Set the value between
-
For OLAP or analytical workloads with complex queries and large result sets:
- Set the value between
1024
and4096
rows. - This increases throughput when scanning large amounts of data.
- Use case: aggregations, large table scans.
SET GLOBAL tidb_max_chunk_size = 4096;
- Set the value between
TiDB provides different transaction modes and DML execution types to optimize performance for various workload patterns.
You can set the transaction mode using the tidb_txn_mode
system variable.
-
Pessimistic transaction mode (default):
- Suitable for general workloads with potential write conflicts.
- Provides stronger consistency guarantees.
SET SESSION tidb_txn_mode = "pessimistic";
-
- Suitable for workloads with minimal write conflicts.
- Better performance for multi-statement transactions.
- Example:
BEGIN; INSERT...; INSERT...; COMMIT;
.
SET SESSION tidb_txn_mode = "optimistic";
You can control the execution mode of DML statements using the tidb_dml_type
system variable, which is introduced in v8.0.0.
To use the bulk DML execution mode, set tidb_dml_type
to "bulk"
. This mode optimizes bulk data loading without conflicts and reduces memory usage during large write operations. Before using this mode, ensure that:
- Auto-commit is enabled.
- The
pessimistic-auto-commit
configuration item is set tofalse
.
SET SESSION tidb_dml_type = "bulk";
TiDB pushes down aggregation operations to TiKV to reduce data transfer and processing overhead. The performance improvement varies based on your data characteristics.
-
Ideal scenarios (high performance gain):
- Columns containing few distinct values (low NDV).
- Data containing frequent duplicate values.
- Example: status columns, category codes, date parts.
-
Non-ideal scenarios (potential performance loss):
- Columns containing mostly unique values (high NDV).
- Unique identifiers or timestamps.
- Example: User IDs, transaction IDs.
Enable pushdown optimizations at the session or global level:
-- Enable regular aggregation pushdown
SET GLOBAL tidb_opt_agg_push_down = ON;
-- Enable distinct aggregation pushdown
SET GLOBAL tidb_opt_distinct_agg_push_down = ON;
Excessive MVCC versions can cause performance bottlenecks, particularly in high read/write areas or due to issues with garbage collection and compaction. You can use the in-memory engine introduced in v8.5.0 to mitigate this issue. To enable it, add the following configuration to your TiKV configuration file.
Note:
The in-memory engine helps reduce the impact of excessive MVCC versions but might increase memory usage. Monitor your system after enabling this feature.
[in-memory-engine]
enable = true
You can optimize performance during batch operations while maintaining query optimization by managing statistics collection. This section describes how to manage this process effectively.
You can disable auto analyze by setting the tidb_enable_auto_analyze
system variable to OFF
in the following scenarios:
- During large data imports.
- During bulk update operations.
- For time-sensitive batch processing.
- When you need full control over the timing of statistics collection.
-
Before the batch operation:
-- Disable auto analyze SET GLOBAL tidb_enable_auto_analyze = OFF;
-
After the batch operation:
-- Manually collect statistics ANALYZE TABLE your_table; -- Re-enable auto analyze SET GLOBAL tidb_enable_auto_analyze = ON;
To improve TiKV performance, configure the thread pools based on your instance's CPU resources. The following guidelines help you optimize these settings:
-
For instances with 8 to 16 cores, the default settings are typically sufficient.
-
For instances with 32 or more cores, increase the pool sizes for better resource utilization. Adjust the settings as follows:
[server] # Increase gRPC thread pool grpc-concurrency = 10 [raftstore] # Optimize for write-intensive workloads apply-pool-size = 4 store-pool-size = 4 store-io-pool-size = 2