-
Notifications
You must be signed in to change notification settings - Fork 260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add resources to input downloader in the ollama plugin #2754
Conversation
Signed-off-by: Samhita Alla <aallasamhita@gmail.com>
plugins/flytekit-inference/flytekitplugins/inference/sidecar_template.py
Outdated
Show resolved
Hide resolved
plugins/flytekit-inference/flytekitplugins/inference/sidecar_template.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Samhita Alla <aallasamhita@gmail.com>
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #2754 +/- ##
============================================
- Coverage 100.00% 74.60% -25.40%
============================================
Files 5 194 +189
Lines 122 19762 +19640
Branches 0 4121 +4121
============================================
+ Hits 122 14744 +14622
- Misses 0 4296 +4296
- Partials 0 722 +722 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For my understanding, did this work before without setting the memory?
plugins/flytekit-inference/flytekitplugins/inference/ollama/serve.py
Outdated
Show resolved
Hide resolved
@@ -138,6 +140,16 @@ def __init__( | |||
V1VolumeMount(name="shared-data", mount_path="/shared"), | |||
V1VolumeMount(name="tmp", mount_path="/tmp"), | |||
], | |||
resources=V1ResourceRequirements( | |||
requests={ | |||
"cpu": 1, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can see the need for setting the CPU as well. From my testing, downloads are usually faster if you give it more than one CPU.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i made it configurable.
Signed-off-by: Samhita Alla <aallasamhita@gmail.com>
@thomasjpfan it worked on the local flyte cluster. i haven't tested it on the demo cluster earlier because i needed to deploy a backend change to test it. |
Signed-off-by: Samhita Alla <aallasamhita@gmail.com>
Tracking issue
Why are the changes needed?
To prevent this error from occurring:
failed quota: project-quota: must specify limits.cpu for: input-downloader; limits.memory for: input-downloader
error.What changes were proposed in this pull request?
This PR adds resources to the input downloader init container, without which it results in
failed quota: project-quota: must specify limits.cpu for: input-downloader; limits.memory for: input-downloader
error.How was this patch tested?
On a prod cluster.
Setup process
Screenshots
Check all the applicable boxes
Related PRs
Docs link