feat: Add multi-turn SFT support #195

xingyaoww · 2025-02-04T02:47:06Z

Multi-turn Conversation Fine-tuning Support

Overview

This PR adds support for fine-tuning models on multi-turn conversations, including proper chat template handling and loss masking for assistant responses. The implementation includes support for the OpenHands SFT dataset and handles conversations up to 32k tokens.

Key Features

New MultiTurnSFTDataset class for handling multi-turn conversations
Proper chat template integration using HuggingFace's apply_chat_template
Smart loss masking that targets only assistant responses
Support for both single-turn and multi-turn training in the same trainer
Token length limiting and conversation filtering

Implementation Details

Dataset:
- Uses proper chat templates from the model's tokenizer
- Handles system, user, and assistant messages
- Supports conversations with multiple turns
- Filters conversations exceeding token limits
Training:
- Added use_multiturn flag in config
- Added messages_key for multi-turn data format
- Maintains backward compatibility with single-turn training
- Works with existing features (FSDP, sequence parallel, etc.)
Examples and Tests:
- Added OpenHands SFT dataset preprocessing script
- Added multi-turn training example
- Added comprehensive unit tests
- Moved tests to appropriate locations

Usage Example

# Config for multi-turn training
data:
  use_multiturn: true
  messages_key: messages
  max_length: 32000
  truncation: right

##Testing

Unit tests for dataset functionality
Integration with existing training pipeline
Example scripts tested with OpenHands dataset
Coverage for both single-turn and multi-turn modes

Documentation

Added comments explaining multi-turn specific features
Updated config defaults with multi-turn options
Added example scripts with documentation
Added preprocessing script with dataset-specific handling

ok.. this is another PR mostly done by OpenHands with me messaging it about 10 times..

It is still WIP as I'm testing it on a training job now, will report back if it works

- Add MultiTurnSFTDataset class for handling multi-turn conversations - Support different roles (system, user, assistant) with role-specific prefixes - Set loss mask to 1 for assistant responses only - Add comprehensive test suite for the new dataset class

- Replace custom chat formatting with HuggingFace chat template - Use Qwen tokenizer for testing - Fix tensor indexing and loss mask generation - Update test to verify proper tokenization

- Use HuggingFace chat template instead of custom formatting - Add comprehensive tests for loss mask behavior - Verify both assistant and non-assistant content - Add debug output for test failures

- Add separate workflow for unit tests - Run tests in tests/soft directory - Generate and upload coverage reports - Use same container as e2e tests

- Move tests from tests/soft to tests/sft/unit for consistency - Update CI workflow paths - Keep all SFT-related tests under tests/sft

- Update trainer to support both single-turn and multi-turn datasets - Add example script for multi-turn training - Add data preprocessing script for multi-turn conversations - Use proper chat template for multi-turn data

- Add use_multiturn flag (default: false) - Add messages_key for multi-turn mode (default: messages) - Group single-turn and multi-turn settings

- Add OpenHands SFT dataset preprocessing script - Add token length limit (32k) for conversations - Move multi-turn example to tests/sft - Add train/test split and statistics

CLAassistant · 2025-02-26T00:32:53Z

All committers have signed the CLA.

xingyaoww · 2025-03-17T23:38:27Z

Actually i think this PR is ready. I've been using this PR for a while and trained couple model without issue. Would love review here!

Also, do we really need to sign CLA for openhands-agent? 🤣

eric-haibin-lin · 2025-03-30T17:44:14Z

Oh sorry I missed this PR. Will take a look

examples/data_preprocess/multiturn.py

verl/utils/dataset/multiturn_sft_dataset.py

xingyaoww · 2025-03-30T20:06:29Z

Sorry! Must be some copy-pasta :(

@OpenHands can you help me first merge from main - then help me address these review comments?

eric-haibin-lin

one last comment. otherwise looks good to me!

verl/trainer/config/sft_trainer.yaml

…config object

eric-haibin-lin · 2025-04-03T00:31:22Z

Is there any benchmark result you want to share on specific datasets?

eric-haibin-lin · 2025-04-03T01:21:11Z

Could u merge main for a fix of megatron tests. And also fix lint with format.sh?

xingyaoww · 2025-04-04T18:32:22Z

@eric-haibin-lin Actually, OpenHands LM 32B was trained using this PR :)
and it got decent performance on SWE-Bench Verified

https://www.all-hands.dev/blog/introducing-openhands-lm-32b----a-strong-open-coding-agent-model

openhands-agent and others added 10 commits February 3, 2025 15:52

fix: Use proper chat template for multi-turn dataset

55cc8df

- Replace custom chat formatting with HuggingFace chat template - Use Qwen tokenizer for testing - Fix tensor indexing and loss mask generation - Update test to verify proper tokenization

fix: Use proper chat template and improve tests

62e11a3

- Use HuggingFace chat template instead of custom formatting - Add comprehensive tests for loss mask behavior - Verify both assistant and non-assistant content - Add debug output for test failures

ci: Add unit tests workflow

0dbd4dd

- Add separate workflow for unit tests - Run tests in tests/soft directory - Generate and upload coverage reports - Use same container as e2e tests

refactor: Move unit tests to tests/sft/unit

60e7862

- Move tests from tests/soft to tests/sft/unit for consistency - Update CI workflow paths - Keep all SFT-related tests under tests/sft

feat: Add multi-turn training support

3c3be7a

- Update trainer to support both single-turn and multi-turn datasets - Add example script for multi-turn training - Add data preprocessing script for multi-turn conversations - Use proper chat template for multi-turn data

chore: Remove old test file location

46d08d2

feat: Add multi-turn config defaults

9e90427

- Add use_multiturn flag (default: false) - Add messages_key for multi-turn mode (default: messages) - Group single-turn and multi-turn settings

feat: Update multi-turn examples

8432ca1

- Add OpenHands SFT dataset preprocessing script - Add token length limit (32k) for conversations - Move multi-turn example to tests/sft - Add train/test split and statistics

move file

d4685ba

xingyaoww changed the title ~~feat: Add multi-turn SFT support~~ [WIP] feat: Add multi-turn SFT support Feb 4, 2025

openhands-agent added 3 commits February 13, 2025 15:03

Apply code formatting

8f3e5c6

Add license headers to multiturn-related files

e34b932

Apply formatting changes to multi-turn related files

8fb9c3b

Merge branch 'main' into feature/multi-turn-sft-dataset

934b573

seanexp mentioned this pull request Mar 17, 2025

support multiturn prompt for SFT #635

Closed

xingyaoww changed the title ~~[WIP] feat: Add multi-turn SFT support~~ feat: Add multi-turn SFT support Mar 17, 2025

xingyaoww marked this pull request as ready for review March 17, 2025 23:37

eric-haibin-lin reviewed Mar 30, 2025

View reviewed changes

examples/data_preprocess/multiturn.py Outdated Show resolved Hide resolved

eric-haibin-lin reviewed Mar 30, 2025

View reviewed changes

verl/utils/dataset/multiturn_sft_dataset.py Outdated Show resolved Hide resolved

openhands-agent added 3 commits March 30, 2025 20:14

Merge main into feature/multi-turn-sft-dataset

67125fb

Fix: Remove duplicate code in multiturn files

3016b47

Merge main into feature/multi-turn-sft-dataset and resolve conflicts

d34e769

eric-haibin-lin reviewed Apr 2, 2025

View reviewed changes

verl/trainer/config/sft_trainer.yaml Outdated Show resolved Hide resolved

openhands-agent added 2 commits April 2, 2025 18:01

refactor: Restructure multi-turn configuration to use nested structure

51821be

refactor: Remove default parameters from MultiTurnSFTDataset and use …

034cb42

…config object

openhands-agent added 4 commits April 2, 2025 18:29

refactor: Simplify MultiTurnSFTDataset config handling

11b2184

refactor: Update multiturn example script to use new config structure

eb8d7b2

feat: Add simple multi-turn dataset creation and verification scripts

dfdb77a

refactor: Simplify multiturn.py to create test datasets directly

4d318ff

openhands-agent added 2 commits April 4, 2025 15:20

Merge main branch to get Megatron tests fix

12926f9

Fix lint issues with format.sh

b457a94

eric-haibin-lin approved these changes Apr 4, 2025

View reviewed changes

eric-haibin-lin merged commit fb03941 into volcengine:main Apr 4, 2025
25 checks passed

yuchenwang3 pushed a commit to yuchenwang3/verl that referenced this pull request Apr 25, 2025

feat: Add multi-turn SFT support (volcengine#195)

3bcad08

histmeisah pushed a commit to SJTU-IAAR/verl that referenced this pull request Apr 27, 2025

feat: Add multi-turn SFT support (volcengine#195)

12356e4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add multi-turn SFT support #195

feat: Add multi-turn SFT support #195

Uh oh!

xingyaoww commented Feb 4, 2025

Uh oh!

CLAassistant commented Feb 26, 2025 •

edited

Loading

Uh oh!

xingyaoww commented Mar 17, 2025

Uh oh!

eric-haibin-lin commented Mar 30, 2025

Uh oh!

Uh oh!

Uh oh!

xingyaoww commented Mar 30, 2025

Uh oh!

eric-haibin-lin left a comment

Uh oh!

Uh oh!

eric-haibin-lin commented Apr 3, 2025

Uh oh!

eric-haibin-lin commented Apr 3, 2025

Uh oh!

xingyaoww commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

feat: Add multi-turn SFT support #195

feat: Add multi-turn SFT support #195

Uh oh!

Conversation

xingyaoww commented Feb 4, 2025

Multi-turn Conversation Fine-tuning Support

Overview

Key Features

Implementation Details

Usage Example

Documentation

Uh oh!

CLAassistant commented Feb 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xingyaoww commented Mar 17, 2025

Uh oh!

eric-haibin-lin commented Mar 30, 2025

Uh oh!

Uh oh!

Uh oh!

xingyaoww commented Mar 30, 2025

Uh oh!

eric-haibin-lin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

eric-haibin-lin commented Apr 3, 2025

Uh oh!

eric-haibin-lin commented Apr 3, 2025

Uh oh!

xingyaoww commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

CLAassistant commented Feb 26, 2025 •

edited

Loading