Add githash to nm-vllm #299

dhuangnm · 2024-06-11T13:07:44Z

Add git hash information to nm-vllm:

>>> import vllm
>>> vllm.githash()
'106796861914146372aba9386aeff9361edfb34d'

vllm/__init__.py

csrc/cpu/pybind.cpp

robertgshaw2-redhat · 2024-06-11T15:31:21Z

We might want to add this info to collect_env.py
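A rough sketch of what such a helper could look like, assuming `vllm.githash()` behaves as in the PR description. The name `get_git_hash` and the fall-back to `None` are assumptions for illustration, not code from this PR:

```python
import types


def get_git_hash(module):
    """Return module.githash() if the module exposes it, else None.

    Hypothetical helper: the attribute name `githash` comes from the PR
    description; everything else here is an assumption.
    """
    getter = getattr(module, "githash", None)
    if not callable(getter):
        # Older builds, or a shadowing local package, may lack the attribute.
        return None
    try:
        return getter()
    except Exception:
        # An environment report should not crash because the hash is missing.
        return None


# Stand-in objects so the sketch runs without vllm installed.
with_hash = types.SimpleNamespace(githash=lambda: "0123abc")
without_hash = types.SimpleNamespace()
print(get_git_hash(with_hash), get_git_hash(without_hash))
```

In a real script the argument would be the imported `vllm` module.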

dhuangnm · 2024-06-11T19:42:14Z

> We might want to add this info to collect_env.py

Good idea, thanks for the suggestion. I added the git hash. However, I found that I cannot run the script from the repo root directory; if I do, I hit the following error:

$ python vllm/collect_env.py 
Collecting environment information...
WARNING 06-11 19:28:07 _custom_ops.py:11] Failed to import from vllm._C with ModuleNotFoundError("No module named 'vllm._C'")
Traceback (most recent call last):
  File "/home/dhuang/vllm/collect_env.py", line 735, in <module>
    main()
  File "/home/dhuang/vllm/collect_env.py", line 714, in main
    output = get_pretty_env_info()
  File "/home/dhuang/vllm/collect_env.py", line 709, in get_pretty_env_info
    return pretty_str(get_env_info())
  File "/home/dhuang/vllm/collect_env.py", line 545, in get_env_info
    vllm_git_hash=get_vllm_git_hash(),
  File "/home/dhuang/vllm/collect_env.py", line 144, in get_vllm_git_hash
    return vllm.githash()
AttributeError: module 'vllm' has no attribute 'githash'

I think this is because the local vllm/ folder in the repo gets imported instead of the installed package. If, however, I move collect_env.py to a different location and run it from there, it runs fine once the nm-vllm wheel is installed:

$ python collect_env.py 
Collecting environment information...
...
vllm git hash: 106796861914146372aba9386aeff9361edfb34d
Python version: 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] (64-bit runtime)
Python platform: Linux-5.19.0-1010-nvidia-lowlatency-x86_64-with-glibc2.35

Solved this issue by making the installed vllm take precedence over the local vllm/ directory.
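One way to get this precedence, as a minimal sketch and not necessarily what this PR does, is to drop the script's own directory from `sys.path` before importing vllm, so that the import resolves to the installed wheel. The helper name `drop_script_dir_from_path` is hypothetical:

```python
import os
import sys


def drop_script_dir_from_path(script_dir, path):
    """Return a copy of `path` without entries that resolve to `script_dir`.

    Hypothetical helper for illustration; not taken from the PR.
    """
    target = os.path.realpath(script_dir)
    kept = []
    for entry in path:
        # An empty entry stands for the current working directory.
        resolved = os.path.realpath(entry or os.getcwd())
        if resolved != target:
            kept.append(entry)
    return kept


# Intended use at the top of a script such as collect_env.py (assumption):
#   here = os.path.dirname(os.path.abspath(__file__))
#   sys.path = drop_script_dir_from_path(here, sys.path)
#   import vllm  # resolved from the installed package, if there is one
```

The function takes the path list as an argument rather than mutating `sys.path` directly, which keeps it easy to check in isolation.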

dhuangnm · 2024-06-11T19:47:22Z

I filed this PR as a reference for nm-vllm. Since we're planning to make this change upstream, I'll file another PR against upstream so people can compare the code between the two PRs. There are some slight differences between them.

csrc/punica/punica_pybind.cpp

bnellnm · 2024-06-11T19:58:56Z

This looks good to me but the pybind stuff has just been switched over to torch library so that will probably require a few changes/merges.

dbarbuzzi · 2024-06-11T20:19:58Z

> I found that I cannot run the script from the repo root directory; if I do, I hit the following error:

@dhuangnm This is probably fine, as the end user typically would not have the vllm source or run the script from that directory. Instead, their support process has you download just that file (collect_env.py) and run it within your environment, which for an end user would likely be an environment with vllm installed from PyPI.

They're actually likely to run it from the source directory, since this Python script lives in the repo and they may just check out the repo first and then run it from there. Anyway, the hack lets it work whether or not it is called from the repo directory.

dhuangnm · 2024-06-12T02:28:24Z

> This looks good to me but the pybind stuff has just been switched over to torch library so that will probably require a few changes/merges.

Yes, I'll make those changes in the PR against upstream. I'll post that PR shortly.

dhuangnm · 2024-06-14T16:55:28Z

Can I get an approval? I reran the failed 38 job and it passes now.

dhuangnm · 2024-06-14T17:40:08Z

Thanks, Bill. It looks like I need another approval?

collect_env.py

vllm/__init__.py

andy-neuma

thanks

dhuangnm · 2024-06-18T23:20:02Z

It looks like there are two failures due to OOM across the Python versions:

FAILED tests/models_core/test_magic_wand.py::test_magic_wand[5-32-half-model_format_extrablocks0] - torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 896.00 MiB. GPU
FAILED tests/models_core/test_magic_wand.py::test_magic_wand[5-32-half-model_format_extrablocks1] - torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 252.00 MiB. GPU

These are also failing in the nightly, so they do not seem to be caused by this PR.

Add git hash information to nm-vllm:

```
>>> import vllm
>>> vllm.githash()
'106796861914146372aba9386aeff9361edfb34d'
```

---------

Co-authored-by: dhuangnm <dhuang@MacBook-Pro-2.local>

dhuangnm added 3 commits June 10, 2024 12:13

githash

0c034be

fix issue and add more places

9f38095

fix clang error

3ffcd0b

ProExpertProg reviewed Jun 11, 2024

vllm/__init__.py (outdated, resolved)

csrc/cpu/pybind.cpp (outdated, resolved)

dhuangnm added 2 commits June 11, 2024 10:12

fix namespace error

d975432

fix yapf error

1067968

clean up and add git hash to collect_env.py

cb7170e

dhuangnm requested review from andy-neuma and robertgshaw2-redhat June 11, 2024 19:45

dhuangnm changed the title ~~[WIP] add githash to nm-vllm~~ Add githash to nm-vllm Jun 11, 2024

dhuangnm requested review from tlrmchlsmth, mgoin and bnellnm June 11, 2024 19:46

bnellnm reviewed Jun 11, 2024

csrc/punica/punica_pybind.cpp (outdated, resolved)

fix import issue due to local vllm directory

bc88303

dhuangnm added 3 commits June 11, 2024 22:57

move git hash to where vLLM info locates

e4c1d6f

punica is not needed

84e1f1c

restore file

91b5bb0

bnellnm approved these changes Jun 14, 2024

dhuangnm added 2 commits June 17, 2024 09:42

merge main

38cb3fa

fix issue

0e9fd22

bnellnm reviewed Jun 17, 2024

collect_env.py (outdated, resolved)

bnellnm reviewed Jun 17, 2024

vllm/__init__.py (resolved)

dhuangnm added 2 commits June 17, 2024 12:27

address comment

ec1715a

return None

1112728

andy-neuma approved these changes Jun 18, 2024

dhuangnm merged commit d8da97b into main Jun 19, 2024
33 of 37 checks passed

dhuangnm deleted the githash branch June 19, 2024 13:13
