On running the query below on the Clickbench multi file dataset,
SELECT REGEXP_REPLACE("Referer", '^https?://(?:www\.)?([^/]+)/.*$', '\1') AS k, AVG(length("Referer")) AS l, COUNT(*) AS c, MIN("Referer") FROM hits WHERE "Referer" <> '' GROUP BY k HAVING COUNT(*) > 100000 ORDER BY l DESC LIMIT 25;
we get this error,
Internal error: The "regex_replace" function can only accept strings.. This was likely caused by a bug in DataFusion's code and we would welcome that you file an bug report in our issue tracker
To Reproduce
Download the data using,
./benchmarks/bench.sh data clickbench_partitioned
A hits_multi directory with the parquet files will be created. Then execute the query above.
This looks similar to #7039 which @jonahgao fixed by adding a coercion from binary --> UTF8 for comparison. I think we could do something similar here.
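To make the suggested coercion concrete, here is a minimal Python sketch of the idea, not DataFusion's actual implementation. It assumes the "Referer" column arrives as binary rather than UTF8; `coerce_to_utf8` and `regexp_replace` are hypothetical stand-ins, with `regexp_replace` imitating a function that rejects non-string input.

```python
import re

# Pattern mirrors the query; r"\1" keeps only the captured host name.
PATTERN = r"^https?://(?:www\.)?([^/]+)/.*$"


def coerce_to_utf8(value):
    """Hypothetical helper: decode binary input to str, pass str through."""
    if isinstance(value, (bytes, bytearray)):
        return bytes(value).decode("utf-8")
    return value


def regexp_replace(value, pattern, replacement):
    """Stand-in for a string-only function that rejects binary arguments."""
    if not isinstance(value, str):
        raise TypeError("regexp_replace can only accept strings")
    return re.sub(pattern, replacement, value)


# Assumption: the reader hands back the column value as raw bytes.
referer = b"https://www.example.com/some/page"

# Without coercion the call fails; with coercion it succeeds.
host = regexp_replace(coerce_to_utf8(referer), PATTERN, r"\1")
print(host)
```

Applying the coercion before the function runs means the function itself can stay string-only, which is the same shape as the comparison fix mentioned above.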
Expected behavior
The queries should run successfully without erroring.
Additional context
DataFusion 29.0.0