-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update readme file (adding usage of HuggingFace model) #413
update readme file (adding usage of HuggingFace model) #413
Conversation
Thanks for the PR! I have a quick question, where do the models get saved? I can't find them. Thank you 🙏
When I clone using git the huggingface repo under a folder
but no model is there, just Then when I specify this folder in the
|
@wm-mask The model-path here I pass is the huggingface model path |
@mao-code Thanks for your reply. Sorry it's not clear. Also when you say "load_from_pretrained" do you mean you change some function in the code somewhere? Or just to load the model the first time for huggingface to cache it? I followed your updated README |
@wm-mask Yes, the model path I used is just the hugging face model path. For the example model, it is just “liuhaotian/LLaVA-Lightning-MPT-7B-preview”. And the load_from_pretrain is a function I import within this repository. I didn’t change anything. |
@wm-mask For the second part of the content that I update in the read me file, I import the eval function as I mentioned in the readme. By doing so, you can use it to “inference” directly using the eval function. For more detail, you can check these functions in the repository. |
…code snippets in readme
Sorry for the late response. I like the idea of adding the quick start scripts, made a few mods to the language and change it the llava-v1.5-7b model in the example. Also used the collapse toggle to avoid displaying large amount of code in README. Thank you! |
…e-usage update readme file (adding usage of HuggingFace model)
Based on this issue
I found that the contents of documents that describe the usage of LLaVA series model via HuggingFace are not that clear.
Therefore, I would like to add these information directly in the readme file. Hope that it can help someone who struggle with it.
If there are any mistakes, please correct me!
Thanks!