poseidon is an ONNX inference server based on ONNX Runtime and Flask.
We need an ONNX inference server for an academic environment that can serve many models in multiple versions. Currently, we use TensorFlow Serving for network-based inference of TensorFlow SavedModels. This lets our researchers access any model in any version over the network without having to set up a local machine learning environment. TensorFlow Serving is well suited to this task, as it unloads unused models and frees their resources. There is currently no inference server with the same properties for the ONNX file format.

We expect poseidon to be an interim solution until a more mature software package becomes available. For example, the ONNX Runtime server might be suitable in the future, but it currently supports only a single model file.
Design goals for poseidon:

- Keep it as simple as possible.
- Minimal code base.
- HTTP/REST interface.
- Minimize resource footprint when not in use (see the sketch after this list).
- Ability to serve many models in different versions.
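The Flask and ONNX Runtime combination named at the top lends itself to lazy loading. The following Python sketch illustrates the idea under those assumptions; it is not poseidon's actual implementation, and the session cache, the `get_session` helper, and the `:info` response format are invented for illustration:

```python
# A minimal sketch of the approach these goals suggest: Flask in front of
# ONNX Runtime, with sessions created lazily so unused models cost nothing.
# NOT poseidon's actual code; cache and response format are assumptions.
import os

import onnxruntime as ort
from flask import Flask, jsonify

# Model base path convention, as described in the next section.
MODEL_BASE_PATH = os.environ.get("POSEIDON_MODEL_PATH", "./models")

app = Flask(__name__)
# Sessions are created on first use, so models that are never requested
# consume no memory. A real server would also evict idle sessions to
# free their resources.
_sessions = {}


def get_session(name, version):
    """Lazily load the single .onnx file under {base}/{name}/{version}."""
    key = (name, version)
    if key not in _sessions:
        model_dir = os.path.join(MODEL_BASE_PATH, name, version)
        # The convention allows exactly one ONNX file per version directory.
        onnx_file = next(f for f in os.listdir(model_dir) if f.endswith(".onnx"))
        _sessions[key] = ort.InferenceSession(os.path.join(model_dir, onnx_file))
    return _sessions[key]


@app.route("/model/<name>/<version>:info")
def model_info(name, version):
    session = get_session(name, version)
    # Hypothetical response: the model's input names and shapes.
    return jsonify({i.name: i.shape for i in session.get_inputs()})


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=80)
```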
poseidon uses the environment variable `POSEIDON_MODEL_PATH` to set the model base path. If `POSEIDON_MODEL_PATH` is not set, poseidon defaults to `./models`. The models must be placed in the model base path according to the following convention:

`{POSEIDON_MODEL_PATH}/{NAME}/{VERSION}/{FN}`

For example: `/models/mobilenetv2/7/mobilenetv2-7.onnx`. Only a single ONNX file is allowed per `{NAME}/{VERSION}` directory.
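For illustration, a model base path serving several models in several versions could look like this (the second mobilenetv2 version and the `resnet50` entry are made-up examples):

```
models/
├── mobilenetv2/
│   ├── 7/
│   │   └── mobilenetv2-7.onnx
│   └── 10/
│       └── mobilenetv2-10.onnx
└── resnet50/
    └── 1/
        └── resnet50-1.onnx
```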
You may deploy poseidon using Docker. Copy or mount the model base path into the Docker environment and start the server:

```sh
docker run --rm \
    -p 80:80 \
    hbwinther/poseidon
```
To serve your own models, mount your local model base path into the Docker environment with `-v "/MYMODELDIR:/models"`:
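```sh
docker run --rm \
    -p 80:80 \
    -v "/MYMODELDIR:/models" \
    hbwinther/poseidon
```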
poseidon exposes the following HTTP/REST endpoints:

- `/list`
- `/model/{NAME}/{VERSION}:info`
- `/model/{NAME}/{VERSION}:inference`
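As a usage sketch, assuming the server from the Docker example above is reachable on localhost port 80: `/list` and `:info` are assumed here to be GET endpoints, `:inference` is assumed to accept a POST with a JSON body, and the payload shown is purely hypothetical, not poseidon's documented request schema:

```sh
# List all models found under the model base path (assumed GET)
curl http://localhost/list

# Show information about version 7 of mobilenetv2 (assumed GET)
curl http://localhost/model/mobilenetv2/7:info

# Run inference; this JSON body is a hypothetical placeholder only,
# check the :info output for the model's real input names and shapes
curl -X POST http://localhost/model/mobilenetv2/7:inference \
    -H "Content-Type: application/json" \
    -d '{"input": [0.1, 0.2, 0.3]}'
```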