Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add GitHub Workflow for Linting , Formatting and Test. Activate Workflow for Framework #7

Merged
merged 4 commits into from
May 17, 2024

Conversation

fabianlim
Copy link
Contributor

@fabianlim fabianlim commented May 15, 2024

This PR does the following:

  • Add linting, formatting and test as part of the github workflow
  • Activate workflow matrix for plugins/framework
  • Fix linting and formatting issues for plugins/framework

There should be a separate task to fix files for

  • scripts/
  • plugins/accelerated-peft

@fabianlim fabianlim force-pushed the lint branch 2 times, most recently from 91b35e0 to 2826ad2 Compare May 15, 2024 03:41
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
@fabianlim fabianlim changed the title Add Linting Github Workflow Add GitHub Workflow for Linting , Formatting and Test May 15, 2024
@fabianlim fabianlim requested a review from achew010 May 15, 2024 05:17
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
@fabianlim fabianlim changed the title Add GitHub Workflow for Linting , Formatting and Test Add GitHub Workflow for Linting , Formatting and Test. Activate Workflow for Framework May 15, 2024
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
@fabianlim fabianlim merged commit 6c189e8 into dev May 17, 2024
1 check passed
@fabianlim fabianlim deleted the lint branch May 17, 2024 02:33
fabianlim added a commit that referenced this pull request May 27, 2024
…or GPTQ-LoRA (#20)

* Add GitHub Workflow for Linting , Formatting and Test. Activate Workflow for Framework (#7)

* add lint workflow

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* add pylintrc, update .tox fix files

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* activate test and minor fix

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* lint benchmarks.py and add workflow to dev

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

---------

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Improvements to Benchmark Scripts and Config Generation Workflow (#13)

* fix benches and add verify configs

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* update readme and add workflow

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* add packaging dep

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* update torch dep in framework and run-benches

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* take host env in run-benches

* add display bench results script

* rename summary.csv to raw_summary.csv and update run_benchmarks.sh

* export environment variables in shell command

* dump out pip requirements for repro, and add default FHT_branch

---------

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Added support for running official HF baseline FSDP-QLoRA benchmark (#16)

* new baseline scenario

* rename variables

* added warning when plugin allows SFTTrainer to handle PEFT on single device

* Fix FSDP when performing GPTQ-LoRA with Triton V2  (#15)

* wrap in parameters and torch view to correct dtype

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* refactor to apply patch only on FSDP and simplify

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

---------

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Provide Memory Benchmarking Feature to Benchmarking Code (#14)

* add gpu memory logging support

* made improvements to GPU reference and result collation

* Renamed memory logging argument to reflect its readings as reserved me
mory using nvidia-smi and changed aggregation function in result collation

* variable renames

* manual linting

* added memory logging functionality via HFTrainer

* added support to benchmark memory using HFTrainer and updated READMEwith explanation of the 2 memory benchmarking options

* addressed changes requested in PR #14

* fix bug and smplify gpu logs aggregation logic

* fixes to calculation of HFTrainer Mem Logging values

* fix calculations

* more fixes

* fix to ignore including  stage inside max calculation of alloc memory

* more comments and README updates

* added fix to keyerror due to empty output dict from OOM

* manual linting

* added benchmark results to refs

* remove unnecessary columns in results gathering

* made changes to results gathering

---------

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
Co-authored-by: achew010 <165894159+achew010@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants