Data loading is slow due to grabbing frames from huggingface dataset

I ran an experiment where I benchmarked times for running a dataloader on `xarm_lift_medium_replay` with batch size 256 and 0 workers.

8 batches takes ~ 15s. Here's the breakdown:
- `dataset.__getitem__`: 14.707566491064426
- `load_previous_and_future_frames`: 13.887129129978348
- `hf_dataset.select_columns(key)[data_ids][key]`: 9.562710228981814