Skip to content

Croissant Builder does not have the DatasetBuilder name #11045

Open
@stefanches7

Description

@stefanches7

/!\ PLEASE INCLUDE THE FULL STACKTRACE AND CODE SNIPPET

Short description
Description of the bug.

Environment information

  • Operating System: win 11 pro

  • Python version: 3.12.9

  • tensorflow-datasets/tfds-nightly version: 4.9.8

  • tensorflow/tf-nightly version: 2.19.0

  • Does the issue still exists with the last tfds-nightly package (pip install --upgrade tfds-nightly) ? yes

Reproduction instructions

import tensorflow_datasets as tfds
import mlcroissant as mlc


builder = tfds.dataset_builders.CroissantBuilder(
    jsonld="export-dara.jsonld"
 )
builder.download_and_prepare()

where export-dara.jsonld is link

If you share a colab, make sure to update the permissions to share it.

Link to logs

Image

Expected behavior
Info of the dataset gets created

Additional context
Facing this issue with multiple dataset croissants on the web, e.g. https://www.scidb.cn/en/detail?dataSetId=067dfc6234b140ef9e3ba7b91c27684d# too

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions