join statement with regex causes panic #16252

@PPL143

Describe the bug

Using regex functions or regex operators (such as ~*) in a join condition causes a panic with the message "overflow".

To Reproduce

> CREATE EXTERNAL TABLE bigdata STORED AS CSV LOCATION 'bigdata.csv' OPTIONS ('has_header' 'true');
0 row(s) fetched.
Elapsed 0.007 seconds.

> CREATE EXTERNAL TABLE find STORED AS CSV LOCATION 'find.csv' OPTIONS ('has_header' 'true');
0 row(s) fetched.
Elapsed 0.001 seconds.

> select count(1) from bigdata;
+-----------------+
| count(Int64(1)) |
+-----------------+
| 2278791         |
+-----------------+
1 row(s) fetched.
Elapsed 0.023 seconds.

> select count(1) from find;
+-----------------+
| count(Int64(1)) |
+-----------------+
| 27              |
+-----------------+
1 row(s) fetched.
Elapsed 0.001 seconds.

> select * from bigdata join find ON bigdata.a ~* find.regex;

thread 'tokio-runtime-worker' panicked at ..\index.crates.io-1949cf8c6b5b557f\arrow-select-55.1.0\src\take.rs:480:45:
overflow
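
A possible diagnostic step, sketched here as a suggestion and not part of the original report: since the join condition contains no equality predicate, it may help to look at the plan DataFusion chooses and at how many row pairs the regex has to be checked against. The table and column names are the ones from the session above; the idea that the panic is related to the size of the intermediate result is an assumption, not something the report confirms.

```sql
-- Show the plan chosen for the failing join (EXPLAIN only plans the query,
-- it does not execute it).
EXPLAIN
SELECT *
FROM bigdata
JOIN find ON bigdata.a ~* find.regex;

-- Upper bound on the row pairs to evaluate when every row of bigdata is
-- compared with every row of find, using the counts reported above:
-- 2278791 * 27 = 61527357.
SELECT 2278791 * 27 AS candidate_pairs;
```

If the plan shows a join operator that pairs all rows of both tables, roughly 61.5 million candidate pairs are materialized for a `select *`, which would be one place to look for the overflow in take.rs.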

Expected behavior

No response

Additional context

data.zip

Labels

bug (Something isn't working)