RLCSE: streaming SPMI mode #397

AndyAyersMS · 2024-02-16T19:43:57Z

Add support to use streaming SPMI servers. When enabled, RLCSE will launch one SPMI server process per core. Work dispatched to servers picks an idle server from the pool of servers. This greatly reduces the overhead of doing single-method evaluations.

SPMI streaming mode requires the changes in dotnet/runtime#98440.

Update the main Policy graident work loop and the greedy status loop to use the streaming mode, if enabled. MCMC can leverage this too, but I haven't done that just yet.

Add the ability to track server status.

Add the ability to log the server activity. The intent is that these logs can be replayed by a server instance to diagnose problems. This was useful during bring-up.

Add a running log of the Policy Gradient parameter values, so we can see how they change over time.

Refactor out some of the code in the main Policy Gradient loops.

Add support to use streaming SPMI servers. When enabled, RLCSE will launch one SPMI server process per core. Work dispatched to servers picks an idle server from the pool of servers. This greatly reduces the overhead of doing single-method evaluations. SPMI streaming mode requires the changes in dotnet/runtime#98440. Update the main Policy graident work loop and the greedy status loop to use the streaming mode, if enabled. MCMC can leverage this too, but I haven't done that just yet. Add the ability to track server status. Add the ability to log the server activity. The intent is that these logs can be replayed by a server instance to diagnose problems. This was useful during bring-up. Add a running log of the Policy Gradient parameter values, so we can see how they change over time. Refactor out some of the code in the main Policy Gradient loops.

AndyAyersMS · 2024-02-16T19:44:13Z

@EgorBo PTAL
cc @dotnet/jit-contrib

AndyAyersMS mentioned this pull request Feb 16, 2024

JIT: Streaming mode for SPMI dotnet/runtime#98440

Merged

EgorBo approved these changes Feb 17, 2024

View reviewed changes

AndyAyersMS merged commit 96aae83 into dotnet:main Feb 17, 2024

AndyAyersMS mentioned this pull request Feb 21, 2024

Investigate improving JIT heuristics with machine learning dotnet/runtime#92915

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RLCSE: streaming SPMI mode #397

RLCSE: streaming SPMI mode #397

Uh oh!

AndyAyersMS commented Feb 16, 2024

Uh oh!

AndyAyersMS commented Feb 16, 2024

Uh oh!

Uh oh!

RLCSE: streaming SPMI mode #397

RLCSE: streaming SPMI mode #397

Uh oh!

Conversation

AndyAyersMS commented Feb 16, 2024

Uh oh!

AndyAyersMS commented Feb 16, 2024

Uh oh!

Uh oh!