Streaming enabled inferencing of ChatGLM2-6B on AWS SageMaker
chatglm2-6b-djl.ipynb
is the notebook to prepare and deploy ChatGLM2-6B to AWS SageMakerlambda
directory is the Lambda Function source code for accpeting API calls and invoking SageMaker inference endpoint