
Running AI inference of Phi-3 and other LLMs from C# using NPU + GPU in coming processors? #7162

Open

Description

Intel, AMD, Qualcomm, etc. are shipping powerful NPUs (40+ TOPS) for inference.

Is there any plan to include functionality in ML.NET to run inference on these models easily from C#, offloading to the NPU, the GPU, or both? Upcoming Intel processors will have a 40 TOPS NPU and roughly 60 TOPS across the CPU/GPU.

How can we, from C#, easily make the most of all these TOPS and run inference across the NPU + GPU?
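For context, the closest building block I know of today is choosing an ONNX Runtime execution provider from C#. A minimal sketch (each provider requires the matching ONNX Runtime package/build, and the QNN `backend_path` option shown is an assumption that varies per release):

```csharp
using System.Collections.Generic;
using Microsoft.ML.OnnxRuntime;

var options = new SessionOptions();

// GPU via DirectML (Windows; works across Intel/AMD/NVIDIA GPUs).
options.AppendExecutionProvider_DML(0);

// Or target the Intel NPU through the OpenVINO execution provider:
// options.AppendExecutionProvider_OpenVINO("NPU");

// Or a Qualcomm Hexagon NPU through the QNN execution provider:
// options.AppendExecutionProvider("QNN",
//     new Dictionary<string, string> { ["backend_path"] = "QnnHtp.dll" });

// Nodes the chosen provider cannot run fall back to the CPU provider.
using var session = new InferenceSession("phi3-mini.onnx", options);
```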

Most of the samples I see about this require Python; it would be great to have all of this available directly in .NET/C#.
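The pre-release Microsoft.ML.OnnxRuntimeGenAI package hints at what this could look like in pure C#. A minimal sketch for Phi-3 (the model path and prompt template are illustrative, and the API surface is pre-1.0 and may change):

```csharp
using System;
using Microsoft.ML.OnnxRuntimeGenAI;

// Assumes a Phi-3 model folder already exported to ONNX (path is illustrative).
using var model = new Model(@"models\phi3-mini-4k-instruct");
using var tokenizer = new Tokenizer(model);

var prompt = "<|user|>\nWhat is an NPU?<|end|>\n<|assistant|>";
using var inputTokens = tokenizer.Encode(prompt);

using var generatorParams = new GeneratorParams(model);
generatorParams.SetSearchOption("max_length", 256);
generatorParams.SetInputSequences(inputTokens);

// Token-by-token generation loop.
using var generator = new Generator(model, generatorParams);
while (!generator.IsDone())
{
    generator.ComputeLogits();
    generator.GenerateNextToken();
}

Console.WriteLine(tokenizer.Decode(generator.GetSequence(0)));
```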

Maybe by including a C# wrapper around https://github.com/intel/intel-npu-acceleration-library, but what about AMD and Qualcomm?
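Wrapping a vendor library by hand would end up as a thin P/Invoke layer per vendor, which is exactly what I'd hope ML.NET could abstract away. A purely illustrative sketch (the `intel_npu_backend` library name and `npu_*` exports below are made up, not that library's real C API; AMD's Ryzen AI and Qualcomm's QNN stacks would each need their own equivalent):

```csharp
using System;
using System.Runtime.InteropServices;

// Hypothetical bindings; real entry points depend on whatever C ABI
// the vendor's native library actually exports.
internal static class NpuNative
{
    [DllImport("intel_npu_backend")]
    internal static extern IntPtr npu_compile_model(string onnxPath);

    [DllImport("intel_npu_backend")]
    internal static extern int npu_run(IntPtr model,
        float[] input, int inputLength, float[] output, int outputLength);

    [DllImport("intel_npu_backend")]
    internal static extern void npu_release(IntPtr model);
}
```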


Metadata

Assignees

No one assigned

    Labels

    Deep Learning
    Hardware Support (issues and requests related to CPU / GPU / NPU)
    NLP (issues / questions around text processing)
    enhancement (new feature or request)
    untriaged (new issue has not been triaged)
