[BUG] cuDF.read_json fails with cudaErrorInvalidValue invalid argument #17068

Closed
@ayushdg

Description

Describe the bug
cudf.read_json fails on one specific file in my dataset with the CUDA error shown below.

Steps/Code to reproduce bug

import cudf

cudf.read_json("/path/to/file.json.gz", lines=True)

RuntimeError: CUDA error encountered at: /__w/cudf/cudf/cpp/src/io/json/read_json.cu:318: 1 cudaErrorInvalidValue invalid argument

Expected behavior

import pandas as pd
pd.read_json("/path/to/file.json.gz", lines=True) # works

Environment overview (please complete the following information)

  • Environment location: Docker
  • Method of cuDF install: Conda

Environment details
cudf 24.08 and 24.12 (nightly). I haven't checked 24.10, but since both 24.08 and 24.12 fail, I suspect the issue applies there too.

Additional context
Data here: 2022-33_1303_en_all.json.gz
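
To help narrow down what is unusual about this file, here is a minimal diagnostic sketch that streams the gzip-compressed JSON Lines file with the Python standard library only and reports the decompressed size, the line count, the longest line, and any lines that fail to parse. The helper name `summarize_json_lines` and the returned keys are hypothetical, chosen for illustration; this does not reproduce or explain the cuDF failure, it only gathers facts about the input.

```python
import gzip
import json


def summarize_json_lines(path):
    """Stream a gzip-compressed JSON Lines file and collect basic statistics.

    Hypothetical helper for triage; not part of cuDF or pandas.
    """
    stats = {"lines": 0, "total_bytes": 0, "max_line_bytes": 0, "bad_lines": []}
    with gzip.open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            # Sizes are measured on the decompressed bytes, newline included.
            stats["lines"] += 1
            stats["total_bytes"] += len(raw)
            stats["max_line_bytes"] = max(stats["max_line_bytes"], len(raw))
            if not raw.strip():
                continue  # blank lines are counted but not parsed
            try:
                json.loads(raw)
            except ValueError:
                # Covers both malformed JSON and invalid UTF-8.
                stats["bad_lines"].append(lineno)
    return stats
```

Calling `summarize_json_lines("/path/to/file.json.gz")` on the failing file and on a file that loads correctly, then comparing the two results, may show whether total size, a single very long line, or malformed records set the failing file apart.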
