How a customer can run the online Web API interface (the llama.cpp HTTP server) - https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md
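
As a rough illustration of calling the Web API once the server is running, here is a minimal Python sketch. It assumes the server listens on `http://127.0.0.1:8080` and exposes a `POST /completion` endpoint that takes `prompt` and `n_predict` and returns the generated text in a `content` field, as the linked README describes. Check these against the README for the version you run. The helper names `build_completion_request` and `complete` are hypothetical and not part of llama.cpp.

```python
import json
import urllib.request

# Assumption: the server was started locally on its default host and port.
DEFAULT_BASE_URL = "http://127.0.0.1:8080"


def build_completion_request(prompt, n_predict=128, base_url=DEFAULT_BASE_URL):
    """Build (but do not send) a POST request for the /completion endpoint."""
    # Assumption: "prompt" and "n_predict" are the request fields to use.
    payload = {"prompt": prompt, "n_predict": n_predict}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, n_predict=128, base_url=DEFAULT_BASE_URL, timeout=60):
    """Send the prompt to a running server and return the generated text."""
    request = build_completion_request(prompt, n_predict, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    # Assumption: the generated text is returned in the "content" field.
    return body["content"]
```

With a server already running, `print(complete("Building a website can be done in 10 simple steps:", n_predict=64))` would print the model's continuation. Building the request separately from sending it lets you inspect the URL and payload without a live server.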