This is a modified version of LLaMA-Factory that supports training ChatTS models.
- 2025/08/01: We have updated the data preprocessing code. The dataset no longer needs to be preprocessed before training!
- Please download the latest datasets (we have updated them) from ChatTS-Training-Dataset; a download sketch follows this list.
- If you want to generate the datasets yourself, please use `noencoding` instead of `spencoding` when generating the data.
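
If you prefer to fetch the updated datasets from the command line, a minimal sketch with the Hugging Face CLI is shown below. The repository id and local directory are placeholders (assumptions, not confirmed by this repo), so substitute the actual ChatTS-Training-Dataset location.

```bash
# Placeholder repo id and target directory -- replace with the actual
# ChatTS-Training-Dataset repository and your preferred local path.
huggingface-cli download --repo-type dataset <org>/ChatTS-Training-Dataset \
    --local-dir data/chatts_training_dataset
```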
Follow the installation steps in LLaMA-Factory.
Make sure that flash-attention and DeepSpeed are installed.
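
If either dependency is missing, a typical setup looks like the sketch below; exact versions and CUDA builds depend on your environment and are not pinned by this repo.

```bash
# DeepSpeed and FlashAttention-2; flash-attn needs a CUDA toolchain to build.
pip install deepspeed
pip install flash-attn --no-build-isolation

# Quick sanity check that both packages import correctly.
python -c "import deepspeed, flash_attn; print(deepspeed.__version__, flash_attn.__version__)"
```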
- Put your training data in `data/`.
- Set your training data path in `data/dataset_info.json`.
- Configure your base model (see the instructions below), output model, training datasets, and training parameters in `scripts/train_chatts.sh`.
- Run `bash scripts/train_chatts.sh` for full SFT. Run `bash scripts/train_lora.sh` for LoRA SFT. A configuration and launch sketch follows this list.
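
A rough end-to-end sketch of the steps above follows. The file name, dataset key, and column mapping are illustrative assumptions only; check the entries that already exist in `data/dataset_info.json` and the variables actually defined in `scripts/train_chatts.sh` before copying anything.

```bash
# 1) Put your SFT file under data/ (file name is an example).
cp /path/to/chatts_sft.jsonl data/chatts_sft.jsonl

# 2) Register it in data/dataset_info.json. The entry below follows the usual
#    LLaMA-Factory format; the exact keys/columns for ChatTS data may differ,
#    so mirror the entries already present in that file:
#      "chatts_sft": {
#        "file_name": "chatts_sft.jsonl",
#        "formatting": "sharegpt",
#        "columns": { "messages": "conversations" }
#      }

# 3) Edit scripts/train_chatts.sh: base model path, the dataset name registered
#    above, output directory, and training hyperparameters, then launch:
bash scripts/train_chatts.sh   # full-parameter SFT
bash scripts/train_lora.sh     # LoRA SFT
```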
- Download the base models (Qwen2 series) from Hugging Face.
- Replace `*.py`, `added_tokens.json`, `config.json`, `special_tokens_map.json`, and `tokenizer_config.json` in the base model folder with the corresponding files from the ChatTS model folder (https://huggingface.co/bytedance-research/ChatTS-14B).
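
Putting the two steps together, a minimal sketch of the base-model preparation is shown below. The local directories and the specific Qwen2-series checkpoint are placeholders (assumptions); the file list matches the step above.

```bash
BASE_MODEL_DIR=./models/qwen2_base   # placeholder: your Qwen2-series base checkpoint
CHATTS_DIR=./models/ChatTS-14B       # placeholder: local copy of bytedance-research/ChatTS-14B

# Fetch both models (substitute the Qwen2-series repo id you are training from).
huggingface-cli download <qwen2-base-repo-id> --local-dir "$BASE_MODEL_DIR"
huggingface-cli download bytedance-research/ChatTS-14B --local-dir "$CHATTS_DIR"

# Overwrite the listed files in the base model folder with the ChatTS versions.
cp "$CHATTS_DIR"/*.py \
   "$CHATTS_DIR"/added_tokens.json \
   "$CHATTS_DIR"/config.json \
   "$CHATTS_DIR"/special_tokens_map.json \
   "$CHATTS_DIR"/tokenizer_config.json \
   "$BASE_MODEL_DIR"/
```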