Refactor parallel writer #608

Open
MarkWolters wants to merge 18 commits into main from refactor_parallel_writer
@MarkWolters
Contributor

Sequence diagram of current OnDiskGraphIndexWriter usage by Cassandra:
Cassandra_OnDiskGraphIndexWriter_CurrentState_SequenceDiagram.md

Sequence diagram of proposed future OnDiskParallelGraphIndexWriter usage:
OnDiskParallelGraphIndexWriter_SequenceDiagram.md

Perf test results:
refactor_parallel.tar.gz

Refactoring of the parallelization of the graph index writer.

This PR splits the parallel writer into a separate class rather than maintaining if-based branches throughout a single class (OnDiskGraphIndexWriter). A large amount of common code has been abstracted into the new RandomAccessOnDiskGraphIndexWriter, making the hierarchy cleaner and easier to understand and maintain.
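
To illustrate the general shape of this kind of refactoring, here is a minimal sketch that assumes nothing about the actual classes in this PR. `BaseWriter`, `SequentialWriter`, `ParallelWriter`, `encode`, and `writeAll` are hypothetical names invented for the example. The shared record-encoding logic sits in an abstract base class, and each subclass supplies only its own write strategy, so no `if (parallel)` branch is needed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class WriterHierarchySketch {
    /** Hypothetical base class: shared logic lives here, not in if-branches. */
    static abstract class BaseWriter {
        // Common code: every strategy produces the same record layout.
        protected String encode(int ordinal) {
            return "node-" + ordinal;
        }

        // The only strategy-specific step.
        abstract List<String> writeAll(int nodeCount);
    }

    /** Hypothetical sequential strategy: a plain loop over ordinals. */
    static final class SequentialWriter extends BaseWriter {
        @Override
        List<String> writeAll(int nodeCount) {
            List<String> records = new ArrayList<>();
            for (int i = 0; i < nodeCount; i++) {
                records.add(encode(i));
            }
            return records;
        }
    }

    /** Hypothetical parallel strategy: same records, built concurrently. */
    static final class ParallelWriter extends BaseWriter {
        @Override
        List<String> writeAll(int nodeCount) {
            // An ordered parallel stream keeps encounter order when collected to a list.
            return IntStream.range(0, nodeCount)
                    .parallel()
                    .mapToObj(this::encode)
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) {
        List<String> sequential = new SequentialWriter().writeAll(4);
        List<String> parallel = new ParallelWriter().writeAll(4);
        System.out.println(sequential.equals(parallel));
    }
}
```

Both strategies go through the same `encode` method, so a layout change in this toy model would be made in one place only.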

Previously it was discovered that calling write() after calling writeInline() would result in the features from writeInline() being overwritten with zeroes. This PR resolves that by checking whether the feature provider is null, mirroring how sequential writes already handle it.
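
The overwrite problem and the null check can be modeled with a toy writer. This is a sketch under stated assumptions, not the code in this PR: `ToyWriter`, `featureAt`, the 4-byte feature slot, and the simplified `writeInline`/`write` signatures are all invented for illustration. The final `write` pass skips a node's feature slot when no provider is given, instead of filling it with zeroes.

```java
import java.nio.ByteBuffer;
import java.util.function.IntFunction;

public class FeatureOverwriteSketch {
    // Assumed size of one feature slot, chosen only for this example.
    static final int FEATURE_BYTES = 4;

    /** Hypothetical stand-in for a random-access writer; not the real API. */
    static final class ToyWriter {
        private final ByteBuffer out;

        ToyWriter(int nodeCount) {
            out = ByteBuffer.allocate(nodeCount * FEATURE_BYTES);
        }

        // Writes one node's feature ahead of the final write() pass.
        void writeInline(int ordinal, int featureValue) {
            out.putInt(ordinal * FEATURE_BYTES, featureValue);
        }

        // Final pass. A null provider is treated as "already written inline".
        void write(int nodeCount, IntFunction<Integer> featureProvider) {
            for (int ordinal = 0; ordinal < nodeCount; ordinal++) {
                if (featureProvider == null) {
                    continue; // leave the slot alone rather than zero-filling it
                }
                out.putInt(ordinal * FEATURE_BYTES, featureProvider.apply(ordinal));
            }
        }

        int featureAt(int ordinal) {
            return out.getInt(ordinal * FEATURE_BYTES);
        }
    }

    public static void main(String[] args) {
        ToyWriter writer = new ToyWriter(2);
        writer.writeInline(0, 7);
        writer.write(2, null); // no provider: the inline value must survive
        System.out.println(writer.featureAt(0));
    }
}
```

In this model, a `write` that zero-filled every slot whenever the provider was null would reproduce the bug described above, since the value stored by `writeInline` would be lost.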
