Re-design execution engine's output pipeline. #1270

pmenon · 2020-10-24T23:00:07Z

Feature Request

Summary

The execution engine currently buffers query output into local buffers and invokes a callback once these buffers are full. These intermediate buffers add overhead because we materialize "fat" SQL values, and they also prevent the consumer of the query from pushing logic down into generated code.

Side note: The current OutputCallback can be invoked by multiple threads, but this isn't documented anywhere.

Solution

I propose adding an ExecutionTarget interface type that participates in the translation infrastructure. Subclasses have the opportunity to generate code as part of the push-based data flow. This will replace the existing OutputTranslator. It is the responsibility of the ExecutionTarget to consume the query's output. This also allows execution targets to be aware of parallel query execution and to specialize their handling for it.
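To make the idea concrete, here is a rough sketch of what such an interface could look like. Everything except the name ExecutionTarget is an assumption made for illustration: `CodeBuffer`, the three `Emit*` hooks, `DirectCallExecutionTarget`, `GenerateOutputPipeline`, and the pseudo-code strings being emitted are all hypothetical, and a real translator would build an AST rather than concatenate strings.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the engine's code builder.
class CodeBuffer {
 public:
  void Emit(const std::string &line) { code_ += line + "\n"; }
  const std::string &Str() const { return code_; }

 private:
  std::string code_;
};

// Hypothetical shape of the proposed interface: the target contributes code
// to the pipeline instead of receiving materialized buffers at runtime.
class ExecutionTarget {
 public:
  virtual ~ExecutionTarget() = default;
  // Emitted once before the row loop; 'parallel' lets a target specialize.
  virtual void EmitSetup(CodeBuffer *code, bool parallel) const = 0;
  // Emitted inside the row loop; 'cols' are the output column expressions.
  virtual void EmitConsumeRow(CodeBuffer *code,
                              const std::vector<std::string> &cols) const = 0;
  // Emitted once after the row loop.
  virtual void EmitTeardown(CodeBuffer *code, bool parallel) const = 0;
};

// Example target: passes each row's columns straight to a (made-up) writeRow
// function, with no intermediate buffer of materialized SQL values.
class DirectCallExecutionTarget : public ExecutionTarget {
 public:
  void EmitSetup(CodeBuffer *code, bool parallel) const override {
    code->Emit(parallel ? "var sink = openThreadLocalSink()"
                        : "var sink = openSink()");
  }

  void EmitConsumeRow(CodeBuffer *code,
                      const std::vector<std::string> &cols) const override {
    assert(!cols.empty());
    std::string call = "  writeRow(sink";
    for (const std::string &col : cols) call += ", " + col;
    code->Emit(call + ")");
  }

  void EmitTeardown(CodeBuffer *code, bool parallel) const override {
    code->Emit(parallel ? "mergeAndCloseSink(sink)" : "closeSink(sink)");
  }
};

// Sketch of how the translation layer might drive a target for one pipeline.
std::string GenerateOutputPipeline(const ExecutionTarget &target,
                                   const std::vector<std::string> &cols,
                                   bool parallel) {
  CodeBuffer code;
  target.EmitSetup(&code, parallel);
  code.Emit("for (row in scan) {");
  target.EmitConsumeRow(&code, cols);
  code.Emit("}");
  target.EmitTeardown(&code, parallel);
  return code.Str();
}
```

Calling `GenerateOutputPipeline(DirectCallExecutionTarget{}, {"row.a", "row.b"}, false)` yields a loop whose body is a single `writeRow(sink, row.a, row.b)` call; passing `true` swaps in the thread-local setup and merge steps, which is the kind of parallel specialization described above.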

A trivial policy duplicates the current OutputTranslator logic into a BufferingExecutionTarget type that buffers results and dispatches them to an injected callback. Another example would be a LibpqxxExecutionTarget that directly invokes functions to serialize rows into network buffers.
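For illustration, the runtime behavior of the buffering policy might look roughly like the sketch below. It models only what runs at execution time, not the code generation side. The `Row` type, the `OutputCallback` signature, and the method names are assumptions rather than the existing API, and serializing callback invocations with a mutex is one possible answer to the multi-threading note above, not necessarily what the engine does.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <mutex>
#include <utility>
#include <vector>

// Hypothetical row representation; the real engine uses its own value types.
using Row = std::vector<int>;
// Hypothetical callback signature: receives one full (or final partial) batch.
using OutputCallback = std::function<void(const std::vector<Row> &)>;

class BufferingExecutionTarget {
 public:
  using Batch = std::vector<Row>;

  BufferingExecutionTarget(std::size_t capacity, OutputCallback callback)
      : capacity_(capacity), callback_(std::move(callback)) {
    assert(capacity_ > 0);
  }

  // Each worker thread passes its own Batch, so appends need no locking.
  void Consume(Batch *local, Row row) {
    local->push_back(std::move(row));
    if (local->size() >= capacity_) Flush(local);
  }

  // Hands the batch to the callback and empties it. Workers call this once
  // more at the end of the pipeline to drain a partially filled batch.
  void Flush(Batch *local) {
    if (local->empty()) return;
    {
      // Assumption: callback invocations are serialized so that the injected
      // callback does not have to be thread-safe itself.
      std::lock_guard<std::mutex> guard(callback_mutex_);
      callback_(*local);
    }
    local->clear();
  }

 private:
  std::size_t capacity_;
  OutputCallback callback_;
  std::mutex callback_mutex_;
};
```

With a capacity of 2, consuming five rows from one worker invokes the callback twice with full batches, and the final `Flush` delivers the remaining row. A target that instead documents "the callback must be thread-safe" could drop the mutex entirely.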

This is the approach I've taken in my TPL repo, and it seems to work pretty well.


apavlo assigned tanujnay112 Oct 27, 2020

lmwnshn unassigned tanujnay112 May 25, 2021

lmwnshn added the performance label (Performance related issues or changes.) May 25, 2021
