latest #2

tooptoop4 · 2020-03-31T21:24:33Z

No description provided.

Adds a new Optimiser rule PruneApplySourceColumns. This rule, and a project-off rule PruneApplyColumns, togrther provide the same column-pruning capability as PruneUnreferencedOutputs#visitApply().

…mic filters

Changed Domain#simplify to avoid copying internal structures to get size of Ranges or DiscreteValues

Things declared in Hive connector are for hive, so `@ForHive` does not add more information.

Previously the code assumed that every conjunct of `newJoinPredicate` is a `ComparisonExpression`. It's no longer the case.

It allows extending with new group providers by mounting different group-provider.properties files.

This is a fix for #2730. When merging small reads, if the first range and second range are more than 2 GB apart, mergeAdjacentDiskRanges() throw sn ArithmeticException because merging those two ranges is too big to fit in a DiskRange. The correct behavior is to not merge those ranges because this implies the ranges are farther apart than maxReadSizeBytes.

File.getPath() is the canonical way to convert File to its String representation suitable for passing to new File().

Some connectors need better control over various parts of the query.

Deprecate the overload that doesn't take `dynamicFilter` parameter.

`TupleDomain` is not none, so `Domain` is not none too.

Handle none `TupleDomain` on the calling side.

The original condition is equivalent to `isAll`, handled above.

Adds utility classes that enable explicit initialization and management of antlr parser and lexer ATN caches. Without them, these fields are static constants that can grow to retain multiple GB of heap space depending on input query strings.

`PredicatePushDown` may determine a join needs to be inner because of dynamic filter function call. Since dynamic filters pushdown is delayed after join reordering, when this happens the distribution type may be fixed already. For INNER joins we are able to completely remove join condition.

Nested loops operator for cross join does not support column pruning. Before this change, there was a check in the JoinNode constructor that did not allow symbol pruning in the case of a cross join. This change removes the constraint from the JoinNode constructor and adds a sanity check in ValidateDependenciesChecker to make sure that output symbols of cross join contain all input symbols. Delaying the check until post-optimization gives the convenience of creating a column-pruning cross join on intermediate steps of optimization. It simplifies changing JoinNode's parameters such as filter, criteria and type. This change will be completed by set of optimizer rules to ensure that the optimized plan does not contain pruning cross join nodes.

kasiafi and others added 30 commits March 17, 2020 14:16

Pass Context to Project-off rules

8dc234b

Add Project-off rule for ApplyNode

073e627

Add rule for pruning ApplyNode's subquery columns

810f704

Adds a new Optimiser rule PruneApplySourceColumns. This rule, and a project-off rule PruneApplyColumns, togrther provide the same column-pruning capability as PruneUnreferencedOutputs#visitApply().

Add support for Alluxio Hive metastore

cbf0d04

Quote table and partition name in error messages

ca99a3f

Remove some ambiguous static imports

2fde0f1

Prefer JDK method where applicable

47b26c2

Use updatable page source when effectivePredicate is None due to dyna…

b343cc9

…mic filters

Refactor toCompactTupleDomain to reuse Domain#simplify

c683e77

Changed Domain#simplify to avoid copying internal structures to get size of Ranges or DiscreteValues

Allow subsets of columns in SHOW STATS FOR (SELECT)

c2e37a4

Check SELECT permissions for SHOW STATS FOR

8167b73

Make access denied messages consistent for SHOW STATS

1f9088d

Fix expected error messages

f638ff5

Allow setting permissions for new directories

ff2ee77

Update to airbase 98

80fdb38

Rubix integration

540e14c

Support multiple event listener plugins

72a7511

Use testcontainers for kudu unit tests

ce59c62

Validate configuration in noop connector

87c1a80

Fix documentation for presto-server-rpm

574f5fe

Prefer variable over its value

13fa220

Add TupleDomain#simplify overload accepting threshold

10855a0

Remove redundant Guice annotation

e59b82d

Things declared in Hive connector are for hive, so `@ForHive` does not add more information.

Introduce common CatalogName

7300158

Perform cheap check first

fc2d08e

Use simpler always-false expression

5b546ba

Previously the code assumed that every conjunct of `newJoinPredicate` is a `ComparisonExpression`. It's no longer the case.

Create group provider configuration for product tests

7d20860

It allows extending with new group providers by mounting different group-provider.properties files.

Extract common configuration for Kerberos KMS

52e19be

Remove unused profiles in kudu's maven

c9cd1df

findepi and others added 29 commits March 29, 2020 09:59

Report formatted SQL when does not parse

d3eb25b

Ensure tests are run with appropriate QueryRunner

7ecf238

Avoid File.toString for getting file's path

5aa29ae

File.getPath() is the canonical way to convert File to its String representation suitable for passing to new File().

Allow custom projection and relation in QueryBuilder

0970e62

Some connectors need better control over various parts of the query.

Use meaningful identifier in test

2dc5691

Deprecate createPageSource overload

f6ad3e5

Deprecate the overload that doesn't take `dynamicFilter` parameter.

Remove redundant condition

99223ab

`TupleDomain` is not none, so `Domain` is not none too.

Remove return value from addConstraintPredicates method

b214009

Handle none `TupleDomain` on the calling side.

Fix condition

8384c18

The original condition is equivalent to `isAll`, handled above.

Record referenced routines in query event

4f9846f

Allow JDBC connectors to provide system tables

d19365e

Fix formatting

a6dc7fd

Extract createTableSql

789a305

Fix to documentation of BigQuery connector.

6176de1

Update Alluxio documentation in Hive connector docs

275db6a

Cleanup validation errors in HiveWriterFactory

cb98a8a

Remove result set from google sheets SplitInfo

9521cf3

Report actual query when DDL fails

a15aac2

Extract short-hand execute() method

ba3bb4f

Reject impossible rename

51d1eb0

Extract charReadFunction

4e1393d

Simplify test

ebb87a2

Test OUTER to INNER join normalization

fa80b8d

Support cross join in join pruning columns rules

ee6b6ee

Remove dedicated rule for pruning cross join columns

fbf4064

Support cross join outputs pruning in PruneUnreferencedOutputs

54946a1

tooptoop4 merged commit 9737f56 into tooptoop4:master Mar 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

latest #2

latest #2

tooptoop4 commented Mar 31, 2020

latest #2

latest #2

Conversation

tooptoop4 commented Mar 31, 2020