OneFlow Backend For Triton Inference Server

Currently, we have implemented a OneFlow backend for the Triton Inference Server that enables model serving.
This tutorial shows how to export a model and deploy it with Triton. Follow the instructions below to get started; note that you need to build the Docker image before you begin.
- Download and save model
```shell
cd examples/resnet50/
python3 export_model.py  # writes the model into the Triton model repository layout
```
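For reference, the export script wraps the model in a static graph and saves it in the layout Triton expects. The sketch below is only a rough approximation, assuming a pretrained ResNet-50 from flowvision; the dummy input shape, the output path, and the use of oneflow.save on a compiled nn.Graph are assumptions, so treat examples/resnet50/export_model.py as the authoritative version.

```python
# Hypothetical sketch of a model export script; see export_model.py for the real one.
import oneflow as flow
import flowvision

model = flowvision.models.resnet50(pretrained=True)
model.eval()

class ResNet50Graph(flow.nn.Graph):
    def __init__(self):
        super().__init__()
        self.model = model

    def build(self, x):
        return self.model(x)

graph = ResNet50Graph()
# Run once with a dummy batch to trigger graph compilation.
graph(flow.randn(1, 3, 224, 224))

# Triton expects <repository>/<model_name>/<version>/model; path is an assumption.
# Assumption: oneflow.save can serialize a compiled nn.Graph for the serving backend.
flow.save(graph, "resnet50/1/model")
```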
- Launch the Triton server
```shell
cd ../../  # back to the root of the serving repository
docker run --rm --runtime=nvidia --network=host -v $(pwd)/examples:/models \
  serving:final
curl -v localhost:8000/v2/health/ready  # readiness check
```
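If you prefer to probe readiness from Python, the snippet below does the same as the curl check through the tritonclient package (installed in the next step). The model name resnet50 is an assumption based on the examples/resnet50 directory.

```python
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")
print("server ready:", client.is_server_ready())
# "resnet50" is assumed from the examples/resnet50 directory name.
print("model ready:", client.is_model_ready("resnet50"))
```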
- Send an image and get a prediction
```shell
pip3 install 'tritonclient[all]'  # quoted so the shell does not expand the brackets
cd examples/resnet50/
curl -o cat.jpg https://images.pexels.com/photos/156934/pexels-photo-156934.jpeg
python3 client.py --image cat.jpg
```
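To give an idea of what the client does, here is a hedged sketch of a minimal Triton HTTP client: it preprocesses the image into an NCHW float32 batch and sends an inference request. The tensor names INPUT_0 and OUTPUT_0, the model name resnet50, and the normalization constants are assumptions; the real names come from the model's config.pbtxt, and client.py is the reference implementation.

```python
# Hypothetical minimal client; requires pillow in addition to tritonclient[all].
import numpy as np
from PIL import Image
import tritonclient.http as httpclient

# Preprocess: resize to 224x224, normalize, convert to NCHW float32 with batch dim.
img = Image.open("cat.jpg").convert("RGB").resize((224, 224))
x = np.asarray(img, dtype=np.float32) / 255.0
mean = np.array([0.485, 0.456, 0.406], dtype=np.float32)
std = np.array([0.229, 0.224, 0.225], dtype=np.float32)
x = ((x - mean) / std).transpose(2, 0, 1)[np.newaxis, ...]

client = httpclient.InferenceServerClient(url="localhost:8000")
# Tensor and model names below are assumptions; check config.pbtxt for the real ones.
inputs = [httpclient.InferInput("INPUT_0", list(x.shape), "FP32")]
inputs[0].set_data_from_numpy(x)
outputs = [httpclient.InferRequestedOutput("OUTPUT_0")]
result = client.infer("resnet50", inputs, outputs=outputs)
logits = result.as_numpy("OUTPUT_0")
print("predicted class id:", int(np.argmax(logits)))
```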
- Tutorial (Chinese)
- Build
- Model Configuration
- OneFlow Cookies: Serving (Chinese)
- OneFlow Cookies: Serving (English)
- Command Line Tool: oneflow-serving
The current version of OneFlow does not support concurrent execution of multiple instances of the same model. You can launch multiple containers (which is easy to do with Kubernetes) to work around this limitation.