-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add weekly tests for memory growth #3101
Conversation
Dockerfile.QA
Outdated
@@ -191,6 +191,12 @@ RUN if [ -d qa/L0_model_control_stress ]; then \ | |||
cp -r qa/L0_model_control_stress/. qa/L0_model_control_stress_valgrind_massif; \ | |||
fi | |||
|
|||
# Create L0_client_memory_growth_weekly and L0_memory_growth_weekly for weekly test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You shouldn't need to copy or rename the tests. We are just going to run the existing tests weekly (instead of nightly). We may make changes to the existing tests but we can to that in the existing code.
@@ -35,7 +35,11 @@ | |||
|
|||
if __name__ == '__main__': | |||
today = date.today().strftime("%Y-%m-%d") | |||
subject = "Triton Client Memory Growth Summary: " + today | |||
# Set the subject for weekly and nightly tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can keep this, but for now we will just run this test weekly.
@@ -35,7 +35,11 @@ | |||
|
|||
if __name__ == '__main__': | |||
today = date.today().strftime("%Y-%m-%d") | |||
subject = "Triton Server Memory Growth Summary: " + today | |||
# Set the subject for weekly and nightly tests | |||
if (sys.argv[1] == "weekly"): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can keep this, but for now we will just run this test weekly.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since the argument is now a string, you don't need the conditional, instead, you can just set the subject
variable based on the argument subject = "Triton Server Memory Growth " + sys.argv[1] + " Summary: " + today
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated.
qa/L0_memory_growth/test.sh
Outdated
RET=1 | ||
fi | ||
set -e | ||
# The busy op model causes issues when running the CI. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Call out PTX here so we can quickly re-enable tests that were disabled due to PTX fsilures
Also make this a TODO
# TODO Re-enable after PTX issues are resolved.
1bcf6e9
to
8e5783b
Compare
@@ -37,10 +37,12 @@ | |||
today = date.today().strftime("%Y-%m-%d") | |||
subject = "Triton Client Memory Growth " + sys.argv[1] + " Summary: " + today | |||
memory_graphs = glob.glob("client_memory_growth*.log") | |||
html_content = "<html><head></head><body><pre style=\"font-size:11pt;font-family:Consolas;\">" | |||
write_up = "<p>This test is run for both HTTP and GRPC protocols using C++ and Python test scripts. The thresholds of memory growth are set to 10MB and 1MB for C++ and Python tests individually as the max-allowed difference between mean and maximum memory usage.</p>" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The threshold is not 10MB, the max difference between mean and max memory usage is 10MB
for mem_graph in sorted(memory_graphs): | ||
html_content += "\n" + mem_graph + "\n" | ||
with open(mem_graph, "r") as f: | ||
html_content += f.read() + "\n" | ||
html_content += "</pre></body></html>" | ||
nightly_email_helper.send(subject, html_content, is_html=True) | ||
nightly_email_helper.send(subject, html_content, is_html=True) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add back empty line at end
for mem_graph in sorted(memory_graphs): | ||
memory_graphs_resnet = glob.glob("memory_growth_resnet*.log") | ||
memory_graphs_busyop = glob.glob("memory_growth_busyop.log") | ||
write_up = "<p>This test uses perf_analyzer as clients running on 4 different models. The threshold of memory growth is set to 150MB as the max allowed difference between mean and maximum memory usage.</p>" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same fix here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor changes, otherwise LGTM
Dockerfile.QA
Outdated
@@ -1,4 +1,4 @@ | |||
# Copyright (c) 2018-2021, NVIDIA CORPORATION. All rights reserved. | |||
# Copyright (c) 2018-2021, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove changes for copyright in files that were not modified.
edfbdcc
to
ab05849
Compare
No description provided.