Concurrency control (MVTO) discussion

In trying to improve the performance of the CC system, I've identified several design decisions that I'd like to discuss here and possibly change in the future.

1. We maintain and rely on a doubly-linked version chain. This is at odds with both the MVCC paper:

> Since it is not possible to maintain a latch-free doubly linked list, the version chain only points in one direction.

...and our [own wiki entry on our CC protocol](https://github.com/cmu-db/peloton/wiki/Concurrency-Control-Protocol):

> The last meta-data field is the 64-bit pointer that stores the address of the neighboring (previous or next) version (if any) of a particular version.

2. ~~We add READs to a TransactionContext's RWSet. One of #1402's changes tries to optimize around this, but really it [shouldn't be done in the first place](https://github.com/cmu-db/peloton/blob/master/src/concurrency/timestamp_ordering_transaction_manager.cpp#L325). I understand the need to add READ_OWNs since we need to release the write lock on a tuple at transaction end, but a normal READ shouldn't be there.~~ Addressed in #1425.

3. Our tuple header is "full" at 64 bytes. We could always make the reserved field larger if we needed to, but spilling such a heavily-accessed structure beyond a single cache line sounds bad to me. Removing the doubly-linked version chain would buy us some space, as would exploring atomic operations on timestamps to remove the SpinLatch. I'm not convinced the latter is possible, but I am experimenting with 128-bit CAS implementations.

4. Reusing owned tuple slots for updates and deletes seems like a premature optimization that sacrifices correctness. As currently implemented, if we repeatedly update a tuple within a single transaction and then abort, we do not have enough information to reconstruct the old versions that need to be pruned from the indexes. If we stick with reusing tuple slots, then it seems like we need more info added to the TransactionContext, like index write sets (or wait for logging).

5. ~~Also related to reusing owned tuple slots, I can generate an assertion failure in the master branch (debug mode) with the following SQL:~~ Addressed in #1429.

> CREATE TABLE test(a INT PRIMARY KEY, b INT);
BEGIN;
INSERT INTO test VALUES (3, 30);
UPDATE test SET a=5, b=40;

> Assertion failed: ((GetLastReaderCommitId(tile_group_header, old_location.offset) == current_txn->GetCommitId())), function PerformDelete, file .../peloton/src/concurrency/timestamp_ordering_transaction_manager.cpp, line 525.

This is similar to #1336 and #1337.

6. ~~We don’t set the last-reader timestamp of a new version to the writer's timestamp. Instead it is 0. This goes against all of the MVTO references I've found, and may be a contributor to issue 5 listed above.~~ Addressed in #1429.

7. [Hybrid index scan is fiddling with CC stuff](https://github.com/cmu-db/peloton/blob/master/src/executor/hybrid_scan_executor.cpp#L422). This strikes me as legacy code from before we had a proper CC system since it's from 2016. Based on Coveralls it [doesn't seem like we exercise this codepath anyway](https://coveralls.io/builds/17582473/source?filename=src/executor/hybrid_scan_executor.cpp#L422), but anyone outside of the CC system changing tuple headers jumps out at me as concerning.

I'd like to have a discussion on these topics and if necessary come to a conclusion on how to address them.

@yingjunwu @gandeevan 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Concurrency control (MVTO) discussion #1420

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Concurrency control (MVTO) discussion #1420

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions