LFX Workspace: A Rust library crate for MediaPipe models for WasmEdge NN

# Motivation

[MediaPipe](https://developers.google.com/mediapipe) is a collection of ML models for streaming data. The official website provides Python, iOS, Android, and TFLite-JS SDKs for using those models. As WasmEdge is increasingly used in data streaming applications, we would like to build a Rust library crate that makes it easy to integrate MediaPipe models into WasmEdge applications.

# Details

Each MediaPipe model has [a description page](https://google.github.io/mediapipe/solutions/face_detection.html) that describes its input and output tensors. The [models](https://google.github.io/mediapipe/solutions/models.html) are available in TensorFlow Lite format, which is supported by the **WasmEdge TensorFlow Lite plugin**.

We need at least one set of library functions for each model in MediaPipe. Each library function takes in a media object and returns the inference result. The function performs the following tasks:

- Process the input media object (e.g., a byte array for a JPEG image) into a tensor for the model. As an example, you could use the Rust [imageproc](https://crates.io/crates/imageproc) crate to process the image into a vector.
- Use WasmEdge NN to run inference of the input tensor on the model.
- Collect and interpret the result tensor.
  - The function should at least return a struct containing the output parameters described in the model description page. For example, a face detection function should return a vector of structs, where each struct contains the coordinates of a detected face.
  - The function should also return a visual representation of the inference results. For example, we should overlay detected face boundaries and landmarks on the original image. As an example, the [draw_hollow_rect_mut()](https://docs.rs/imageproc/0.23.0/imageproc/drawing/fn.draw_hollow_rect_mut.html) function in [imageproc](https://crates.io/crates/imageproc) could be used to draw detected boundaries.
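
The pre- and post-processing steps above can be sketched in plain Rust as follows. This is a minimal illustration, not the crate's actual API: the `Detection` struct, the function names, the `[-1.0, 1.0]` input scaling, and the `[x_min, y_min, x_max, y_max, score]` output layout are all assumptions made for the example. Real MediaPipe models define their own tensor layouts on their description pages. The WasmEdge NN inference call that would sit between the two steps is omitted.

```rust
/// One detected face, in pixel coordinates of the original image.
/// Hypothetical type for illustration only.
#[derive(Debug, Clone, PartialEq)]
pub struct Detection {
    pub x_min: u32,
    pub y_min: u32,
    pub x_max: u32,
    pub y_max: u32,
    pub score: f32,
}

/// Pre-processing: convert interleaved 8-bit RGB pixels into an f32 tensor.
/// Assumption: the model expects values scaled to [-1.0, 1.0].
pub fn rgb_to_tensor(rgb: &[u8]) -> Vec<f32> {
    rgb.iter().map(|&v| f32::from(v) / 127.5 - 1.0).collect()
}

/// Post-processing: interpret a flat output tensor.
/// Assumption: each detection is five floats,
/// [x_min, y_min, x_max, y_max, score], with box coordinates in [0, 1].
pub fn decode_detections(
    output: &[f32],
    width: u32,
    height: u32,
    min_score: f32,
) -> Vec<Detection> {
    // Map a normalized coordinate to a pixel index inside the image.
    let to_px = |v: f32, size: u32| -> u32 {
        let max = size.saturating_sub(1) as f32;
        (v.clamp(0.0, 1.0) * max).round() as u32
    };
    output
        .chunks_exact(5)
        .filter(|d| d[4] >= min_score)
        .map(|d| Detection {
            x_min: to_px(d[0], width),
            y_min: to_px(d[1], height),
            x_max: to_px(d[2], width),
            y_max: to_px(d[3], height),
            score: d[4],
        })
        .collect()
}

/// Visual output: draw the outline of a detection into an RGB buffer.
/// Pixels that fall outside the buffer are skipped.
pub fn draw_hollow_rect(rgb: &mut [u8], width: u32, det: &Detection, color: [u8; 3]) {
    let mut put = |x: u32, y: u32| {
        let i = ((y * width + x) * 3) as usize;
        if let Some(px) = rgb.get_mut(i..i + 3) {
            px.copy_from_slice(&color);
        }
    };
    // Top and bottom edges.
    for x in det.x_min..=det.x_max {
        put(x, det.y_min);
        put(x, det.y_max);
    }
    // Left and right edges.
    for y in det.y_min..=det.y_max {
        put(det.x_min, y);
        put(det.x_max, y);
    }
}
```

A library function for a detection task would chain these: decode the media object into RGB pixels, call `rgb_to_tensor`, run inference, call `decode_detections` on the output, and then draw each result onto a copy of the original image. In practice a crate such as imageproc could replace the hand-written drawing routine.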


# Milestones

* [x] Create a list of models, and for each model, list the pre- and post-processing functions needed.
* [x] Implement the tasks: image classification (no video support) and object detection (no video support). (1 week)
* [x] Implement the tasks: text classification and audio classification. (2 weeks)
* [x] Find the functions we need in OpenCV, and try to implement video support for vision tasks. (2 weeks)
* [x] Implement all other vision tasks, such as hand landmark detection. (2 weeks)
* [x] Build a new TFLite library that includes [MediaPipe custom operators](https://github.com/google/mediapipe/blob/master/mediapipe/tasks/cc/core/mediapipe_builtin_op_resolver.cc). (1 week)
* [x] Try to implement GPU support for MediaPipe models. (1 week)
* [ ] Write the documentation, then publish the library to `crates.io`. (1 week)

## Repository URL: originally https://github.com/yanghaku/mediapipe-rs-dev, now being transferred to https://github.com/WasmEdge/mediapipe-rs


## MediaPipe tasks progress:

* [x] Object Detection
* [x] Image Classification
* [x] Image segmentation
* [x] Gesture Recognition
* [x] Hand Landmark Detection
* [x] Image embedding
* [x] Face Detection
* [x] Audio Classification
* [x] Text Classification

## Appendix
[feat: A Rust library crate for MediaPipe models for WasmEdge NN](https://github.com/WasmEdge/WasmEdge/issues/2229)

