ESQL: Make PhysicalPlan.output() include all physically emitted columns/attributes #121549

For `PhysicalPlan` inheritors that represent commands that add columns, the `.output()` method returns the _logical_ output, which in case of name conflicts does not include the conflicting attributes from the upstream plan. This affects, for example:

- `EvalExec`
- `DissectExec`
- `GrokExec`
- `EnrichExec`
- `LookupJoinExec`

E.g. for an index with a single field `idx_field`, the plan for `FROM idx | EVAL idx_field = to_upper(idx_field)` will have an `EvalExec` whose `.output()` method includes only the newly evaluated `idx_field`, not the original field from the index. This behavior comes from the helper method [mergeOutputAttributes](https://github.com/elastic/elasticsearch/blob/bb67a7c4abad248e28661d3d03bec4e8e51948c2/x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/expression/NamedExpressions.java#L26).
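To make the name-conflict behavior concrete, here is a minimal sketch of a logical merge. It is a simplified model, not the actual `mergeOutputAttributes` code: the `Attr` record, the `LogicalOutputSketch` class and the `mergeOutput` method are hypothetical names, and attributes are reduced to a name plus an id.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class LogicalOutputSketch {
    // Hypothetical stand-in for an attribute: a column name plus a unique id.
    record Attr(String name, int id) {}

    // Simplified model of a logical merge: upstream attributes whose name is
    // redefined by the command are dropped, then the new attributes are appended.
    static List<Attr> mergeOutput(List<Attr> upstream, List<Attr> added) {
        Set<String> addedNames = new HashSet<>();
        for (Attr a : added) {
            addedNames.add(a.name());
        }
        List<Attr> out = new ArrayList<>();
        for (Attr a : upstream) {
            if (!addedNames.contains(a.name())) {
                out.add(a);
            }
        }
        out.addAll(added);
        return out;
    }
}
```

With an upstream output of `idx_field` (id 1) and an `EVAL` that defines a new `idx_field` (id 2), this model returns only the attribute with id 2, which mirrors the example above.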

However, the corresponding, actual physical `EvalOperator` does not implement any handling of name conflicts; it simply [appends blocks to incoming pages](https://github.com/elastic/elasticsearch/blob/92d1d31eea496a014bd400dea727fe572f74a521/x-pack/plugin/esql/compute/src/main/java/org/elasticsearch/compute/operator/EvalOperator.java#L46). The same is true for the physical operators corresponding to the other `...Exec` classes from above.
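The mismatch can be illustrated with a small append-only model of an operator. This is only a sketch under simplifying assumptions: `Page` here is a hypothetical record holding lists of strings, and `evalToUpper` is an invented helper, not the real `EvalOperator` or compute-engine `Page` API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class AppendOnlyEvalSketch {
    // Hypothetical stand-in for a page: an ordered list of columns ("blocks"),
    // addressed by position only. Names play no role at this level.
    record Page(List<List<String>> blocks) {}

    // Evaluates to_upper over the block at the given channel and appends the
    // result as a new block. The input block is kept, even if the logical plan
    // considers it shadowed by the new column.
    static Page evalToUpper(Page in, int channel) {
        List<String> result = new ArrayList<>();
        for (String value : in.blocks().get(channel)) {
            result.add(value.toUpperCase(Locale.ROOT));
        }
        List<List<String>> blocks = new ArrayList<>(in.blocks());
        blocks.add(result);
        return new Page(blocks);
    }
}
```

In this model, a page with one block for `idx_field` leaves the operator with two blocks, while the logical output of the corresponding plan node lists a single attribute.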

The fact that name conflict logic spills into physical plans leads to complications. For instance, physical plans with remote `ENRICH`s sometimes require the presence of two columns with the same name: https://github.com/elastic/elasticsearch/issues/118531

It also makes `PhysicalPlan`s harder to reason about, because their declared output does not correctly represent what the physical operations actually emit.

To simplify our query plans, we should change the contract of `PhysicalPlan.output()` so that it returns the actual physical output of the plan rather than its _logical_ output. Then https://github.com/elastic/elasticsearch/issues/118531 will be solved, too.

This is also required if we later want to go one step further and remove name conflict handling from `LogicalPlan`s (however that might look). Name conflicts only strictly need to be handled in the Analyzer, to resolve what each name in a query refers to.
