-
Notifications
You must be signed in to change notification settings - Fork 188
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #66 from AnswerDotAI/avh_dev
Add profiling to train.py
- Loading branch information
Showing
4 changed files
with
625 additions
and
487 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
# This generates uses export_stacks to generate profiling output | ||
# /tmp/profile_0.txt, /tmp/profile_1.txt, etc. (1 file per process.) | ||
# | ||
# Output files are generated using export_stacks(), note there are some | ||
# outstanding issues to be aware of: | ||
# https://github.com/pytorch/pytorch/issues/100253 | ||
# | ||
# Profiling output files can be used with speedscope or other tools. | ||
# | ||
# For additional information, see: | ||
# https://pytorch.org/tutorials/recipes/recipes/profiler_recipe.html | ||
|
||
python train.py \ | ||
--model_name meta-llama/Meta-Llama-3-8B \ | ||
--train_type hqq_dora \ | ||
--n_bits 4 \ | ||
--precision bf16 \ | ||
--dataset orca_math \ | ||
--dataset_samples 8 \ | ||
--batch_size 2 \ | ||
--context_length 512 \ | ||
--gradient_accumulation_steps 2 \ | ||
--use_gradient_checkpointing False \ | ||
--use_cpu_offload False \ | ||
--use_activation_cpu_offload False \ | ||
--save_model False \ | ||
--profiling_output /tmp/profile | ||
|
Oops, something went wrong.