https://github.com/huggingface/text-generation-inference https://github.com/THUDM/GLM-130B https://github.com/THUDM/ChatGLM-6B