Tags: VectorInstitute/vectorlm
Add revised benchmarking logic and results (#9)

* Revised the estimation of batch count by retrieving it directly from `len(train_dataloader)`. Deleted the unused `timer_handle` argument in `Trainer`. Revised handling of the `max_seq_len` override in benchmarking. Added support for automatically switching between the LoRA and full-rank sharding schemes in benchmarking.
* Revised handling of an unspecified `max_seq_length`. Added Llama-3 to the benchmark `model_list`.
* Benchmarking: revised the benchmark script to ensure a consistent per-device train batch size.
* Benchmarking: replaced `trainer.step` with `trainer.train_step` to avoid eval overhead during benchmarking. Revised the benchmark parsing logic to display the optimal batch size for each context width value.
* Benchmarking: updated the reference throughput based on the updated logic.
* Benchmarking: updated the reference throughput descriptions.
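The revised parsing logic that reports an optimal batch size per context width could be sketched roughly as follows. This is a minimal illustration, not the repo's actual parser: the `records` structure and the `optimal_batch_sizes` helper are hypothetical, assuming each benchmark run yields a (context width, batch size, throughput) triple.

```python
def optimal_batch_sizes(records):
    """For each context width, pick the batch size with the highest throughput.

    `records` is an iterable of (max_seq_len, batch_size, throughput) triples;
    this shape is an assumption for illustration.
    """
    best = {}
    for seq_len, batch_size, throughput in records:
        # Keep the highest-throughput configuration seen for this context width.
        if seq_len not in best or throughput > best[seq_len][1]:
            best[seq_len] = (batch_size, throughput)
    return {seq_len: bs for seq_len, (bs, _) in best.items()}

# Example with made-up throughput numbers (tokens/sec):
records = [
    (1024, 8, 1500.0),
    (1024, 16, 1750.0),
    (2048, 4, 900.0),
    (2048, 8, 870.0),
]
print(optimal_batch_sizes(records))  # -> {1024: 16, 2048: 4}
```

Grouping by context width first, then maximizing throughput within each group, matches the described output format: one optimal batch size per context width value.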