Add file mapping for windows platform. #12183

Merged: 7 commits from ticao/winfilemap into master on Jul 18, 2022
Conversation

caoting-dotcom (Contributor)

Description: Add the MapFileIntoMemory function to windows/env.cc.

Motivation and Context
MapFileIntoMemory maps the file into memory rather than loading it eagerly; the file's pages are loaded only when they are actually used.
This can help reduce memory cost. We have tested it for Fluency model inference on Win32 and observed the memory cost reduction.

MapFileIntoMemory is already supported in posix/env.cc, but not on Windows. Our implementation in this PR is essentially the same as the one in posix/env.cc.
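For readers new to the technique, below is a minimal standalone sketch of read-only file mapping using the underlying Win32 primitives (CreateFileW, CreateFileMappingW, MapViewOfFile). It illustrates the general approach a windows/env.cc implementation builds on, not this PR's exact code; the file name is a placeholder.

```cpp
// Minimal sketch of read-only file mapping on Win32. Pages are faulted in
// lazily on first access, so untouched regions of the file cost no RAM.
#include <windows.h>
#include <cstdio>

int main() {
  HANDLE file = CreateFileW(L"model_weights.bin", GENERIC_READ, FILE_SHARE_READ,
                            nullptr, OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, nullptr);
  if (file == INVALID_HANDLE_VALUE) return 1;

  HANDLE mapping = CreateFileMappingW(file, nullptr, PAGE_READONLY,
                                      /*max size hi/lo = 0 -> whole file*/ 0, 0, nullptr);
  if (mapping == nullptr) { CloseHandle(file); return 1; }

  // Note: a nonzero view offset must be a multiple of the system allocation
  // granularity (see GetSystemInfo), which is why a misaligned offset needs a
  // clear error message. Length 0 maps the entire file.
  const char* data = static_cast<const char*>(
      MapViewOfFile(mapping, FILE_MAP_READ, /*offset hi*/ 0, /*offset lo*/ 0, 0));
  if (data != nullptr) {
    std::printf("first byte: %d\n", data[0]);  // first access pages the data in
    UnmapViewOfFile(data);
  }
  CloseHandle(mapping);
  CloseHandle(file);
  return 0;
}
```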

@pranavsharma (Contributor)

Do you have a test case?

@caoting-dotcom (Contributor, Author)

> Do you have a test case?

Hi Pranav, I have added the unit test. Please check.

pranavsharma previously approved these changes Jul 15, 2022

@RandySheriffH (Contributor) left a comment

@caoting-dotcom:
Please fix the errors/warnings reported by the pipeline ASAP; we are working on the final round of cherry-picks for the release.

@RandySheriffH merged commit 4d38b84 into master on Jul 18, 2022
@RandySheriffH deleted the ticao/winfilemap branch on Jul 18, 2022 at 16:24
@chausner (Contributor)

Interesting, but I couldn't find any documentation on MapFileIntoMemory. Could you explain how one would leverage this function in practice? I didn't find any code in onnxruntime calling this function, so I assume users are supposed to call it themselves.

Can it be used to reduce the memory usage during loading of ONNX models?

RandySheriffH pushed a commit that referenced this pull request Jul 18, 2022
* Add file mapping for windows platform.

* Add unit test for file mapping for windows. Also add an error message for mis-aligned offset

* Add unit test for file mapping for windows. Also add an error message for mis-aligned offset

* Update data type to avoid warnings

* Compatible data type to avoid warnings. Update CreateFileMapping2 condition for winml compiling.

* Add type conversion to avoid warnings for X86 release build.

Co-authored-by: Ting Cao <ticao@microsoft.com>
@snnn (Member) commented Jul 18, 2022

> Interesting, but I couldn't find any documentation on MapFileIntoMemory. Could you explain how one would leverage this function in practice? I didn't find any code in onnxruntime calling this function, so I assume users are supposed to call it themselves.
>
> Can it be used to reduce the memory usage during loading of ONNX models?

See: https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/core/framework/tensorprotoutils.cc#L567

It might need more work. The original design was: if you had a large model, you could split the weights into an external file, then use the GetFileContent function to load the weights, leveraging memory mapping when possible. For example, if you have multiple processes running on the same machine with the same ML model, you may be able to reduce memory usage by keeping only one copy of the model in memory.
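To make the intended usage concrete, here is a hedged sketch of the pattern described above: prefer a memory-mapped view of the external-weights file and fall back to reading the bytes when mapping is unavailable. The helper names (TryMapFile, LoadWeights, ReadAllBytes) are illustrative, not onnxruntime's actual API.

```cpp
// Sketch of "map when possible, read otherwise". TryMapFile stands in for a
// platform call such as Env::MapFileIntoMemory; here it is stubbed out.
#include <cstddef>
#include <fstream>
#include <iterator>
#include <vector>

const char* TryMapFile(const char* path, size_t& length) {
  // A real implementation would use MapViewOfFile (Win32) or mmap (POSIX).
  // Returning nullptr simulates a platform without mapping support.
  (void)path; (void)length;
  return nullptr;
}

std::vector<char> ReadAllBytes(const char* path) {
  std::ifstream in(path, std::ios::binary);
  return std::vector<char>(std::istreambuf_iterator<char>(in), {});
}

// Returns a pointer to the weight bytes. When the file is mapped, multiple
// processes loading the same model share one physical copy via the page cache;
// otherwise each process keeps its own private buffer.
const char* LoadWeights(const char* path, std::vector<char>& fallback_storage) {
  size_t length = 0;
  if (const char* mapped = TryMapFile(path, length)) return mapped;
  fallback_storage = ReadAllBytes(path);
  return fallback_storage.data();
}

int main() {
  std::vector<char> storage;
  const char* weights = LoadWeights("weights.bin", storage);
  return weights == nullptr;  // zero exit when something was loaded
}
```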

RandySheriffH added a commit that referenced this pull request Jul 19, 2022
* support optimizer opt for deepspeed 0.5.9

* resolve comments

* resolve comments

* FP16_Optimizer Support for more Deepspeed Versions (#12046)

* fp16_optimizer for more ds versions

* change ds version

* bugfix

* fix bug

* Fix unused function warning for decodeMIDR(). (#12069)

Changed from static function defined in header to function declared in header and defined in separate .cc file.

* pin protobuf version to be compatible with onnx (#12132)

Co-authored-by: Ashwini Khade <askhade@microsoft.com@orttrainingdev10.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>

* RoiAlign CPU EP add warning for max mode with samples != 1 (#12136)

* RoiAlign add warning about incorrect max summation when sample size not 1

* include coreml_provider_factory.h in macos build instead of coreml_ex… (#12138)

include coreml_provider_factory.h in macos build instead of coreml_execution_provider.h

* List 3.10 as supported python version and remove 3.6 (#12141)

list 3.10 as supported python version and remove 3.6

Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* Use updated symbolic_helper.check_training_mode (#11900)

Co-authored-by: Jingyan Wang, Baiju Meswani

* Fix GH issue 12151 by using inverse perms for updating DQ axis attribute (#12158)

* Fix GH issue 12151.

Need to use inverse perms for updating that axis to what is used for transposing the input. This only applies if the DQ node is doing per-axis dequantization.

* fixing positions for beam search gpt2 (#12156)

* fixing positions for beam search gpt2
Co-authored-by: Tianlei Wu <tlwu@microsoft.com>

* remove wrong placed libs (#12201)

* Add file mapping for windows platform. (#12183)

* Add file mapping for windows platform.

* Add unit test for file mapping for windows. Also add an error message for mis-aligned offset

* Add unit test for file mapping for windows. Also add an error message for mis-aligned offset

* Update data type to avoid warnings

* Compatible data type to avoid warnings. Update CreateFileMapping2 condition for winml compiling.

* Add type conversion to avoid warnings for X86 release build.

Co-authored-by: Ting Cao <ticao@microsoft.com>

* Fix bug where onnxruntime_USE_NCCL flag would default to ON (#12195)

Fix bug where onnxruntime_USE_NCCL flag would default to ON, causing ORT to not build properly. New functionality: flag is ON when training is enabled and NCCL is not disabled. Flag is OFF otherwise

Co-authored-by: zhijxu <zhijxu@microsoft.com>
Co-authored-by: zhijxu <zhijxu>
Co-authored-by: Vincent Wang <wangwchpku@outlook.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
Co-authored-by: Ashwini Khade <askhade@microsoft.com>
Co-authored-by: Ashwini Khade <askhade@microsoft.com@orttrainingdev10.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Co-authored-by: Dwayne Robinson <dwayner@microsoft.com>
Co-authored-by: Carson Swope <carsonswope@users.noreply.github.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
Co-authored-by: jingyanwangms <47403504+jingyanwangms@users.noreply.github.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>
Co-authored-by: Viswanath Boga <44417868+viboga@users.noreply.github.com>
Co-authored-by: leqiao-1 <61653207+leqiao-1@users.noreply.github.com>
Co-authored-by: caoting-dotcom <71617901+caoting-dotcom@users.noreply.github.com>
Co-authored-by: Ting Cao <ticao@microsoft.com>
Co-authored-by: Sean Murray <59740888+seanmurr1@users.noreply.github.com>
@caoting-dotcom (Contributor, Author)

> Interesting, but I couldn't find any documentation on MapFileIntoMemory. Could you explain how one would leverage this function in practice? I didn't find any code in onnxruntime calling this function, so I assume users are supposed to call it themselves.
> Can it be used to reduce the memory usage during loading of ONNX models?

> See: https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/core/framework/tensorprotoutils.cc#L567
>
> It might need more work. The original design was: if you had a large model, you could split the weights into an external file, then use the GetFileContent function to load the weights, leveraging memory mapping when possible. For example, if you have multiple processes running on the same machine with the same ML model, you may be able to reduce memory usage by keeping only one copy of the model in memory.

Right, this is where the function gets called, and the purpose is to reduce memory cost. It was previously available only for POSIX; now it is implemented for Win32 as well.
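For completeness, a hedged sketch of calling the new Windows implementation through the Env interface. This assumes the MapFileIntoMemory declaration in core/platform/env.h around the time of this PR (path, offset, length, and a MappedMemoryPtr out-parameter); verify the exact types against your onnxruntime source tree, and note that weights.bin is a placeholder.

```cpp
// Sketch only: signature assumed from core/platform/env.h at the time of this PR.
#include "core/platform/env.h"

#include <cstdio>

int main() {
  onnxruntime::Env& env = onnxruntime::Env::Default();
  onnxruntime::MappedMemoryPtr mapped;  // unmaps the view when it goes out of scope

  // The offset must satisfy the platform's alignment rules (allocation
  // granularity on Windows); a misaligned offset triggers the error message
  // added in this PR. ORTCHAR_T is wchar_t on Windows, hence the L"" literal.
  auto status = env.MapFileIntoMemory(L"weights.bin", /*offset*/ 0,
                                      /*length*/ 4096, mapped);
  if (!status.IsOK()) {
    std::printf("mapping failed: %s\n", status.ErrorMessage().c_str());
    return 1;
  }
  std::printf("first mapped byte: %d\n", mapped[0]);
  return 0;
}
```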
