-
Notifications
You must be signed in to change notification settings - Fork 735
Simple short-form Self-RAG Pack #907
Simple short-form Self-RAG Pack #907
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
@anoopshrma could you please review this? |
Hey! Sure will do by tomorrow. Currently away from machine. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is fantastic! some comments but afterwards excited to land and promote this
from llama_cpp import Llama # noqa: F401 | ||
except ImportError: | ||
raise ImportError(_IMPORT_ERROR_MSG) | ||
self.llm = Llama(model_path=model_path, verbose=verbose, **model_kwargs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we do have a llamacpp wrapper in llamaindex. out of curiosity is there a reason you are using the llama_cpp lib directly? (it's fine either way, just curious)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oh nvm seems like its for logprobs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome @MarouaneMaatouk!
Hey @jerryjliu , since all the comments have been resolved. Can I go ahead and merge it or do you want to take another look. |
Description
Simple short-form Self-RAG implementation using llama_cpp adapted from the author's code
Paper: https://arxiv.org/abs/2310.11511
Model: https://huggingface.co/m4r1/selfrag_llama2_7b-GGUF
Fixes #8502 (run-llama/llama_index#8502)
Type of Change
Please delete options that are not relevant.
How Has This Been Tested?
Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration
Suggested Checklist:
make format; make lint
to appease the lint gods