[AUTO] Filter device when compile_model with file path #27019
base: master
Conversation
support_devices = filter_device_by_model(support_devices_by_property, cloned_model, load_config);
} else {
    auto_s_context->m_model_path = model_path;
}
@wangleis this will degrade model compile latency.
Do you see any better solutions? I am not sure whether it is possible to get the stateful info from the cache or the model file instead.
@wangleis All of the OV HW plugins, including CPU, GPU and NPU, have only one compile model API, and it accepts a model object rather than a model path as its input parameter. The compile latency may therefore not change when AUTO calls read_model()
before trying to compile the model on the HW plugins. However, Core provides a virtual compile model API whose default path creates the model object by calling read_model(),
which means a HW plugin can override this API (see the sketch after the list below).
- compile model API in CPU plugin:
  std::shared_ptr<ov::ICompiledModel> compile_model(const std::shared_ptr<const ov::Model>& model,
- compile model API in GPU plugin:
  std::shared_ptr<ov::ICompiledModel> compile_model(const std::shared_ptr<const ov::Model>& model,
- compile model API in NPU plugin:
  std::shared_ptr<ov::ICompiledModel> compile_model(const std::shared_ptr<const ov::Model>& model,
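To make the point concrete, here is a minimal sketch of that pattern. PluginBase, get_core() and the exact read_model() signature used below are illustrative assumptions, not the literal OpenVINO declarations:

// Sketch only: illustrates the virtual path-based overload described above.
// PluginBase and the helper signatures are assumptions for this example.
std::shared_ptr<ov::ICompiledModel> PluginBase::compile_model(const std::string& model_path,
                                                              const ov::AnyMap& properties) const {
    // Default behaviour: materialize the model from disk, then forward it to the
    // model-object overload that CPU, GPU and NPU actually implement.
    std::shared_ptr<const ov::Model> model = get_core()->read_model(model_path, /*bin_path=*/{});
    return compile_model(model, properties);
}

A HW plugin that can consume the file directly (for example to hit its own cache) is free to override this path-based overload instead of relying on the default.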
I don't think so. If core.compile_model("test.xml", "GPU") with cache enabled can work, why can't AUTO benefit in the same way?
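For reference, a minimal usage sketch of the comparison being made here; the file name test.xml comes from the comment above, and the cache directory name is an arbitrary placeholder:

#include <openvino/openvino.hpp>

int main() {
    ov::Core core;
    // Enable the model cache; "model_cache" is an arbitrary directory name.
    core.set_property(ov::cache_dir("model_cache"));

    // Compiling from a file path with the cache enabled works for a single device...
    auto on_gpu = core.compile_model("test.xml", "GPU");

    // ...so the question is why AUTO, given the same file path, should not
    // benefit from the cache in the same way.
    auto on_auto = core.compile_model("test.xml", "AUTO");
    return 0;
}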
No performance gap was observed across 12 models of different scales when compiling with the cache enabled.
We still need to check whether disabling the compile_model-with-model_path path in AUTO is acceptable.
…cache_dir [PR#24726]. 2. Enable the model type filter logic with cache enabled for AUTO. 3. Add a test case for when the cache is enabled.
This PR will be closed in a week because of 2 weeks of no activity.
Details:
Tickets: