I ran an experiment where I benchmarked times for running a dataloader on `xarm_lift_medium_replay` with batch size 256 and 0 workers. 8 batches takes ~ 15s. Here's the breakdown: - `dataset.__getitem__`: 14.707566491064426 - `load_previous_and_future_frames`: 13.887129129978348 - `hf_dataset.select_columns(key)[data_ids][key]`: 9.562710228981814