[BUG] cuDF.read_json fails with cudaErrorInvalidValue invalid argument #17068
Closed
Description
Describe the bug
cudf.read_json fails on one specific file in my dataset.
Steps/Code to reproduce bug
import cudf
cudf.read_json("/path/to/file.json.gz", lines=True)
RuntimeError: CUDA error encountered at: /__w/cudf/cudf/cpp/src/io/json/read_json.cu:318: 1 cudaErrorInvalidValue invalid argument
Expected behavior
import pandas as pd
pd.read_json("/path/to/file.json.gz", lines=True) # works
Environment overview (please complete the following information)
- Environment location: Docker
- Method of cuDF install: Conda
- If method of install is [Docker], provide docker pull & docker run commands used
Environment details
cudf 24.08 and 24.12 (nightly). I haven't checked 24.10, but since both 24.08 and 24.12 fail, I suspect the issue applies there too.
Additional context
Data here: 2022-33_1303_en_all.json.gz
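To help narrow down what is different about this one file, here is a minimal sketch that summarizes a gzipped JSON Lines file using only the Python standard library. The helper name summarize_jsonl_gz and the statistics it collects are my own choices for illustration; they are not part of cuDF or pandas, and nothing here is confirmed to be related to the root cause.

```python
import gzip
import json


def summarize_jsonl_gz(path):
    """Collect basic statistics about a gzipped JSON Lines file.

    Hypothetical helper for triage only; not a cuDF or pandas API.
    """
    stats = {
        "lines": 0,
        "uncompressed_bytes": 0,
        "max_line_bytes": 0,
        "first_invalid_line": None,  # 1-based line number, or None if every line parses
    }
    with gzip.open(path, "rb") as fh:
        for lineno, raw in enumerate(fh, start=1):
            stats["lines"] += 1
            stats["uncompressed_bytes"] += len(raw)
            stats["max_line_bytes"] = max(stats["max_line_bytes"], len(raw))
            # Blank lines are skipped; only the first unparsable line is recorded.
            if stats["first_invalid_line"] is None and raw.strip():
                try:
                    json.loads(raw)
                except ValueError:  # covers JSONDecodeError and UnicodeDecodeError
                    stats["first_invalid_line"] = lineno
    return stats
```

Running summarize_jsonl_gz("/path/to/file.json.gz") on the failing file and on a file that loads fine may show whether total uncompressed size, a very long line, or a malformed record sets the failing file apart.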