Skip to content

Data loading is slow due to grabbing frames from huggingface dataset #93

@alexander-soare

Description

@alexander-soare

I ran an experiment where I benchmarked times for running a dataloader on xarm_lift_medium_replay with batch size 256 and 0 workers.

8 batches takes ~ 15s. Here's the breakdown:

  • dataset.__getitem__: 14.707566491064426
  • load_previous_and_future_frames: 13.887129129978348
  • hf_dataset.select_columns(key)[data_ids][key]: 9.562710228981814

Metadata

Metadata

Assignees

Labels

performanceIssues aimed at improving speed or resource usage

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions