feat: add basic wal implementation for Edge #24570

pauldix · 2024-01-11T19:54:16Z

This WAL implementation uses some of the code from the wal crate, but departs from it significantly in many ways. For now it uses simple JSON encoding for the serialized ops, but we may want to switch to Protobuf at some point in the future. This version of the WAL doesn't have its own buffering. That will be implemented higher up in the BufferImpl, which will use the WAL and SegmentWriter to make data in the buffer durable.

In the planned write flow, writes will come into the buffer and be validated against (and update) an in-memory Catalog. Once validated, writes will be buffered in memory and then flushed to the WAL periodically (likely every 10-20ms). After being flushed to the WAL, the entire batch of writes will be put into the in-memory queryable buffer. Only after that will responses be sent back to the clients. This should considerably reduce the write lock pressure on the in-memory buffer.
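The flow described above could be sketched roughly as follows. This is only an illustration under stated assumptions: `Catalog`, `Write`, `FakeWal`, and `Buffer` are hypothetical stand-ins rather than the types in this PR, and the WAL is modeled as an in-memory list of batches instead of a file.

```rust
use std::collections::HashSet;

/// A single validated write (hypothetical shape).
#[derive(Clone, Debug, PartialEq)]
struct Write {
    table: String,
    line: String,
}

/// Hypothetical stand-in for the in-memory catalog: a set of known table names.
#[derive(Default)]
struct Catalog {
    tables: HashSet<String>,
}

impl Catalog {
    /// Validate a write and record its table; rejects empty table names.
    fn validate_and_update(&mut self, write: &Write) -> Result<(), String> {
        if write.table.is_empty() {
            return Err("empty table name".to_string());
        }
        self.tables.insert(write.table.clone());
        Ok(())
    }
}

/// Hypothetical stand-in for the WAL: each flush appends one batch.
#[derive(Default)]
struct FakeWal {
    batches: Vec<Vec<Write>>,
}

#[derive(Default)]
struct Buffer {
    catalog: Catalog,
    pending: Vec<Write>,   // validated, but not yet durable
    queryable: Vec<Write>, // durable and visible to queries
    wal: FakeWal,
}

impl Buffer {
    /// Step 1: validate against the catalog, then hold the write in memory.
    fn accept(&mut self, write: Write) -> Result<(), String> {
        self.catalog.validate_and_update(&write)?;
        self.pending.push(write);
        Ok(())
    }

    /// Steps 2-3: called periodically. Flushes the pending batch to the WAL,
    /// then moves the whole batch into the queryable buffer in one step.
    /// Returns how many writes can now be acknowledged to clients.
    fn flush(&mut self) -> usize {
        if self.pending.is_empty() {
            return 0;
        }
        let batch = std::mem::take(&mut self.pending);
        self.wal.batches.push(batch.clone());
        let acknowledged = batch.len();
        self.queryable.extend(batch);
        acknowledged
    }
}
```

In this sketch the queryable buffer is touched once per flush rather than once per write, which is presumably where the reduced lock pressure would come from.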

In this PR:

- Update the Wal, WalSegmentWriter, and WalSegmentReader traits to line up with new design/understanding
- Implement wal (mainly just a way to identify segment files in a directory)
- Implement WalSegmentWriter (write header, op batch with crc, and track sequence number in segment, re-open existing file)
- Implement WalSegmentReader
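To make the writer and reader items above more concrete, here is a minimal sketch of one way a segment could be framed: a file type identifier header, followed by op batches that each carry a length and a CRC. The frame layout (big-endian length, then CRC-32, then payload), the `SegmentWriter` and `read_batches` names, and the hand-rolled `crc32` are assumptions for illustration; only the `idb3.001` identifier comes from this PR's diff.

```rust
use std::io::{self, Read, Write};

/// File type identifier written at the start of every segment.
const FILE_TYPE_IDENTIFIER: &[u8] = b"idb3.001";

/// Bitwise CRC-32 (IEEE, reflected polynomial 0xEDB88320).
fn crc32(data: &[u8]) -> u32 {
    let mut crc = 0xFFFF_FFFFu32;
    for &byte in data {
        crc ^= byte as u32;
        for _ in 0..8 {
            // All ones if the low bit is set, zero otherwise.
            let mask = (crc & 1).wrapping_neg();
            crc = (crc >> 1) ^ (0xEDB8_8320 & mask);
        }
    }
    !crc
}

fn invalid(msg: &str) -> io::Error {
    io::Error::new(io::ErrorKind::InvalidData, msg.to_string())
}

/// Hypothetical writer: emits the header once, then framed batches.
struct SegmentWriter<W: Write> {
    out: W,
    sequence_number: u32,
}

impl<W: Write> SegmentWriter<W> {
    fn new(mut out: W) -> io::Result<Self> {
        out.write_all(FILE_TYPE_IDENTIFIER)?;
        Ok(Self { out, sequence_number: 0 })
    }

    /// Write one serialized op batch and return its sequence number.
    fn write_batch(&mut self, payload: &[u8]) -> io::Result<u32> {
        self.out.write_all(&(payload.len() as u32).to_be_bytes())?;
        self.out.write_all(&crc32(payload).to_be_bytes())?;
        self.out.write_all(payload)?;
        self.out.flush()?;
        self.sequence_number += 1;
        Ok(self.sequence_number)
    }
}

/// Hypothetical reader: checks the header, then returns every batch whose
/// CRC matches. A clean end of input at a frame boundary ends the loop.
fn read_batches<R: Read>(mut input: R) -> io::Result<Vec<Vec<u8>>> {
    let mut header = [0u8; 8];
    input.read_exact(&mut header)?;
    if &header[..] != FILE_TYPE_IDENTIFIER {
        return Err(invalid("unexpected file type identifier"));
    }
    let mut batches = Vec::new();
    loop {
        let mut len_bytes = [0u8; 4];
        match input.read_exact(&mut len_bytes) {
            Ok(()) => {}
            Err(e) if e.kind() == io::ErrorKind::UnexpectedEof => break,
            Err(e) => return Err(e),
        }
        let mut crc_bytes = [0u8; 4];
        input.read_exact(&mut crc_bytes)?;
        let mut payload = vec![0u8; u32::from_be_bytes(len_bytes) as usize];
        input.read_exact(&mut payload)?;
        if crc32(&payload) != u32::from_be_bytes(crc_bytes) {
            return Err(invalid("crc mismatch"));
        }
        batches.push(payload);
    }
    Ok(batches)
}
```

The sketch writes to any `Write` and reads from any `Read`, so it can be exercised against a `Vec<u8>` without touching disk. Re-opening an existing file, which the real writer supports, is not modeled here.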

Closes #24557


mgattozzi

Mostly I just had a few questions that would help me with the review before I feel comfortable to approve it, but otherwise to me this looks really solid.

influxdb3/src/commands/serve.rs

influxdb3_write/src/lib.rs

influxdb3_write/src/wal.rs

mgattozzi · 2024-01-11T20:16:12Z

influxdb3_write/src/wal.rs

+const FILE_TYPE_IDENTIFIER: &[u8] = b"idb3.001";
+
+/// File extension for segment files
+const SEGMENT_FILE_EXTENSION: &str = "wal";


I think I might have been confused about our previous offline conversations. Are the segment files and wal one and the same? My understanding was that the wal just contained data that had not been persisted yet (with maybe 1 or more segments of nonpersisted data) and a segment that was persisted to disk was a blob of data containing the list of all the parquet files for that segment. To me the terminology feels a bit fuzzy and intermingled.

pauldix · 2024-01-11

The WAL contains all data that is in the in-memory buffer for a given segment. So there are a few different things here:

- A Buffer Segment (an in-memory collection of writes)
- A WAL Segment (a file on locally attached disk that holds the durable record of what is in a buffer segment)
- A Segment File (a file in object store that holds the summary information of which parquet files were persisted for a given buffer segment)

The Segment File could arguably be renamed to something more like segment_persist_info.
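One way to keep the three concepts apart is to give each its own type. The sketch below is purely illustrative: the struct names, their fields, and the zero-padded `.wal` file naming are assumptions, not the definitions in this PR. Only the `wal` extension comes from the diff above.

```rust
use std::path::{Path, PathBuf};

/// Identifier shared by all three representations of a segment (hypothetical).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct SegmentId(u32);

/// In-memory collection of writes.
struct BufferSegment {
    id: SegmentId,
    writes: Vec<String>,
}

/// File on locally attached disk: the durable record of a buffer segment.
struct WalSegment {
    id: SegmentId,
    path: PathBuf,
}

/// File in object store: summary of the parquet files persisted for a
/// buffer segment (the "Segment File" in the discussion above).
struct SegmentPersistInfo {
    id: SegmentId,
    parquet_files: Vec<String>,
}

impl BufferSegment {
    /// Where this segment's WAL file would live, assuming a zero-padded name.
    fn wal_segment(&self, wal_dir: &Path) -> WalSegment {
        WalSegment {
            id: self.id,
            path: wal_dir.join(format!("{:010}.wal", self.id.0)),
        }
    }

    /// Summary to write to object store once the segment has been persisted.
    fn persist_info(&self, parquet_files: Vec<String>) -> SegmentPersistInfo {
        SegmentPersistInfo { id: self.id, parquet_files }
    }
}
```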

mgattozzi · 2024-01-12

Okay, that makes more sense to me. I think when I work on the persister I'll give Segment Files a name like that to make it clearer.

influxdb3_write/src/wal.rs

Turn wal and write buffer references into a concrete type, rather than dyn.

mgattozzi

LGTM thanks for making those changes @pauldix!

pauldix added the v3 label Jan 11, 2024

pauldix requested a review from mgattozzi January 11, 2024 19:54

mgattozzi reviewed Jan 11, 2024


pauldix added 4 commits January 11, 2024 16:49

refactor: make Wal return impl reader/writer

8244e18

refactor: clean up wal segment open

f325127

fix: WriteBuffer and Wal usage

0ee3376

Turn wal and write buffer references into a concrete type, rather than dyn.

fix: have wal loading ignore invalid files

c47e202

pauldix requested review from dgnorton and mgattozzi January 12, 2024 16:23

mgattozzi approved these changes Jan 12, 2024


pauldix merged commit 02b4d28 into main Jan 12, 2024
12 checks passed

pauldix deleted the pd/v3_wal branch January 12, 2024 16:52
