C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Updated Jul 31, 2024 - C++
A spoken-English chatbot, based on an LLM, that runs offline and in real time.
This project speeds up local deployment of ChatGLM and vector inference using PyTorch compiled in C++, and includes a mock OpenAI API script for quickly setting up a local speed-testing service. It is aimed at high-performance applications and development testing.