Skip to content

Releases: Lightning-AI/litData

v0.2.48

24 May 14:38
742fb52
Compare
Choose a tag to compare

What's Changed

  • readme: update Maintainers by @Borda in #594
  • chore: Add Benchmark Scripts and Performance Comparison of LitData vs FFCV for Streaming ImageNet by @bhimrazy in #572
  • fix: Move cache warning under debug by @bhimrazy in #598
  • Add support for torch.uint16 data type by @bhimrazy in #597
  • fix: Add error handling for empty Parquet files while indexing and corresponding tests by @bhimrazy in #601
  • fix: boto3 session options by @deependujha in #604
  • bump version 0.2.48 by @deependujha in #605

Full Changelog: v0.2.47...v0.2.48

v0.2.47

13 May 07:01
44ce484
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.2.46...v0.2.47

Release v0.2.46

03 May 07:58
96238b6
Compare
Choose a tag to compare

What's Changed

  • Feat: Add per_stream batching method to CombinedStreamingDataset by @schopra8 in #438
  • Fix parquet cache by @philgzl in #560
  • refactor: StreamingDataset variable names for better readability by @deependujha in #557
  • feat: Add GitHub Actions workflow for @benchmark bot by @deependujha in #561
  • fix: @benchmark bot fixes by @deependujha in #565
  • Fix IndexError when resuming after some workers are done by @philgzl in #567
  • ref: simplify cache dir creation and remove repeated parts by @bhimrazy in #568
  • fix: suppress FileNotFoundError when acquiring file lock for count file by @bhimrazy in #570
  • fix: Consolidate Cache Handling + Fix DDP Multi-Indexing for huggingface datasets by @bhimrazy in #569
  • update readme to include best practices for image data optimization by @bhimrazy in #577

New Contributors

Full Changelog: v0.2.45...v0.2.46

v0.2.45

14 Apr 16:34
68d23cd
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.2.44...v0.2.45

Release v0.2.44

26 Mar 16:40
cfac30a
Compare
Choose a tag to compare

What's Changed

  • Remove .lock download skipping, skip locks on force download by @JackUrb in #519
  • pre-release bump 0.2.44 by @tchaton in #530

Full Changelog: v0.2.43...v0.2.44

v0.2.43

25 Mar 18:23
65ba5a7
Compare
Choose a tag to compare

What's Changed

  • Fix: resume issues with resuming in combined streaming dataset in dataloader by @bhimrazy in #507
  • fix: s3 error by @deependujha in #510
  • Fix: unsigned s5cmd requests and also add option to disable s5cmd by @bhimrazy in #513
  • Turn on DEBUG logging based on DEBUG_LITDATA environment variable by @ouj in #518
  • Feat: Update indexing of parquet dataset and also add streaming support to huggingface datasets by @bhimrazy in #505
  • feat: correctly propagate storage_options by @deependujha in #514
  • fix: remove warnings for Streaming Dataset with hf dataset and shuffle enabled by @bhimrazy in #520
  • Revert '#506 Add s5cmd' – as boto3 Outperforms s5cmd in Latest Benchmarks by @bhimrazy in #521
  • Upd/hf-dataset-get-format by @bhimrazy in #522
  • Update documentation on Streaming Parquet Datasets from Huggingface and other cloud providers by @bhimrazy in #523
  • Bump version to 0.2.43 by @bhimrazy in #525
  • fix package config by @Borda in #526
  • example: sine function model prediction with litdata & pytorch-lightning by @deependujha in #517
  • fixing package & releasing by @Borda in #529

Full Changelog: v0.2.42...v0.2.43

Release v0.2.42

11 Mar 15:43
a8fc6a8
Compare
Choose a tag to compare

What's Changed

New Contributors

  • @ouj made their first contribution in #496

Full Changelog: v0.2.41...v0.2.42

v0.2.41

07 Mar 06:02
b4fd4ba
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.2.40...v0.2.41

v0.2.40

04 Mar 06:07
311e767
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.2.39...v0.2.40

Release 0.2.39

14 Feb 12:48
a6c50f4
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.2.38...v0.2.39