[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow #1854

a-ys · 2024-04-30T23:53:47Z

Description

Neo Updates

This PR adds additional JumpStart integration in the Neo Neuron partitioning scripts. When a JumpStart model is passed in, the script will output Neuron subgraphs to be consumed by JumpStart. When JumpStart metadata files __model_info__.json and __script_info__.json files are found & and the environment variable SM_CACHE_JUMPSTART_FORMAT is set, the Neo partitioning script will output the Neuron Cache subgraphs for the current model under the directory PRE_COMPILED_NEURON_GRAPH_INFERin the partitioning output.

The subgraphs will also be saved to a secondary location in the Neuron cache: e.g:

/<cache dir>/JUMPSTART_COMPILED_GRAPHS/neuronxcc-2.13.68.0+6dfecc895/<JumpStart model id>/inference/PRE_COMPILED_NEURON_GRAPH_INFER/neuronxcc-2.13.68.0+6dfecc895/<Module folders>

this secondary location will be used by Neo service to better organize its Neuron cache.

Changes to DJL-Serving Code

There is one change to shared djl-serving code outside of Neuron scripts. The PartitionService is changed to use POpen from subprocess.run() so that standard output can be captured and returned from PartitionService.run_partition(). This output is used to capture the exact Neuron subgraphs associated with the model being compiled so that if there are extraneous subgraphs existing in the Neuron cache directory, only the subgraphs associated with the current model are returned.

…deepjavalibrary#1854) (cherry picked from commit 3f35e3a)

Co-authored-by: Andrew Song <40076917+a-ys@users.noreply.github.com>

[Neo] JumpStart integration to Neo Neuron flow

1b2f31c

a-ys requested review from zachgk, frankfliu and a team as code owners April 30, 2024 23:53

tosterberg approved these changes Apr 30, 2024

View reviewed changes

tosterberg merged commit 3f35e3a into deepjavalibrary:master May 1, 2024
8 checks passed

tosterberg pushed a commit to tosterberg/djl-serving that referenced this pull request May 1, 2024

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow (…

f6b3a1d

…deepjavalibrary#1854) (cherry picked from commit 3f35e3a)

tosterberg added a commit that referenced this pull request May 1, 2024

[0.27.0-DLC][cherrypick][tnx] 0.27.0 neo script update (#1854) (#1856)

ad77111

Co-authored-by: Andrew Song <40076917+a-ys@users.noreply.github.com>

a-ys mentioned this pull request May 6, 2024

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow a-ys/djl-serving-1#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow #1854

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow #1854

a-ys commented Apr 30, 2024

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow #1854

[Neo] Add JumpStart Integration to SM Neo Neuron AOT compilation flow #1854

Conversation

a-ys commented Apr 30, 2024

Description

Neo Updates

Changes to DJL-Serving Code