Update the limitation of multiple servers binding to the same http/grp… #4991

krishung5 · 2022-10-18T00:26:19Z

…c port

dyastremsky · 2022-10-18T00:50:36Z

Are we sure this is the best place for this documentation? I think users usually check our documentation (e.g. .MD files) rather than the command line arguments help screen. Especially since it's quite long. If we want users to know about and understand these options, it might make sense to also include them in documentation.

krishung5 · 2022-10-18T01:01:36Z

@dyastremsky I might be wrong but I couldn't find any documentation regarding anything about http/grpc port except in main.cc. It seems like we ask users to use tritonserver --help to see those options that are not in the MD files. That's why I only updated main.cc, but I'm willing to add those options to the document if this makes more sense.

Tabrizian · 2022-10-18T14:53:33Z

src/main.cc

@@ -401,7 +401,9 @@ std::vector<Option> options_
       "The port for the server to listen on for HTTP requests."},
      {OPTION_REUSE_HTTP_PORT, "reuse-http-port", Option::ArgBool,
       "Allow multiple servers to listen on the same HTTP port when every "
-       "server has this option set."},
+       "server has this option set. The same set of models/same model "


"If you plan to use this option as a way to load-balance between different triton servers, the same model repository or set of models must be used for every server."

Note that this feature only supports stateless models.

I think it might be better to remove this sentence since the customers may figure out a way to control which server gets the requests and address this limitation.

I see. Updated the document, thanks for the comment!

dyastremsky · 2022-10-18T16:55:54Z

@dyastremsky I might be wrong but I couldn't find any documentation regarding anything about http/grpc port except in main.cc. It seems like we ask users to use tritonserver --help to see those options that are not in the MD files. That's why I only updated main.cc, but I'm willing to add those options to the document if this makes more sense.

I see, okay. Sounds like we're being consistent here then.

dyastremsky · 2022-10-18T18:08:09Z

src/main.cc

@@ -401,7 +401,9 @@ std::vector<Option> options_
       "The port for the server to listen on for HTTP requests."},
      {OPTION_REUSE_HTTP_PORT, "reuse-http-port", Option::ArgBool,
       "Allow multiple servers to listen on the same HTTP port when every "
-       "server has this option set."},
+       "server has this option set. If you plan to use this option as a way to "
+       "load-balance between different triton servers, the same model "


Here and below:

Triton should be capitalized.

Load balance should not have a dash.

Thanks for the catch! Updated.

Update the limitation of multiple server binding to the same http/grp…

6ca630d

…c port

krishung5 requested review from Tabrizian and dyastremsky October 18, 2022 00:26

krishung5 changed the title ~~Update the limitation of multiple server binding to the same http/grp…~~ Update the limitation of multiple servers binding to the same http/grp… Oct 18, 2022

Tabrizian reviewed Oct 18, 2022

View reviewed changes

Address comment

b433eb7

krishung5 requested a review from Tabrizian October 18, 2022 17:17

dyastremsky reviewed Oct 18, 2022

View reviewed changes

Address comment

1fe1659

krishung5 requested a review from dyastremsky October 18, 2022 18:10

dyastremsky approved these changes Oct 18, 2022

View reviewed changes

Tabrizian approved these changes Oct 18, 2022

View reviewed changes

krishung5 merged commit dfa101c into main Oct 18, 2022

krishung5 deleted the krish-port-doc branch October 18, 2022 18:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the limitation of multiple servers binding to the same http/grp… #4991

Update the limitation of multiple servers binding to the same http/grp… #4991

krishung5 commented Oct 18, 2022

dyastremsky commented Oct 18, 2022

krishung5 commented Oct 18, 2022

Tabrizian Oct 18, 2022

krishung5 Oct 18, 2022

dyastremsky commented Oct 18, 2022

dyastremsky Oct 18, 2022

krishung5 Oct 18, 2022

Update the limitation of multiple servers binding to the same http/grp… #4991

Update the limitation of multiple servers binding to the same http/grp… #4991

Conversation

krishung5 commented Oct 18, 2022

dyastremsky commented Oct 18, 2022

krishung5 commented Oct 18, 2022

Tabrizian Oct 18, 2022

Choose a reason for hiding this comment

krishung5 Oct 18, 2022

Choose a reason for hiding this comment

dyastremsky commented Oct 18, 2022

dyastremsky Oct 18, 2022

Choose a reason for hiding this comment

krishung5 Oct 18, 2022

Choose a reason for hiding this comment