Merged
22 changes: 22 additions & 0 deletions openapi.yaml
@@ -484,6 +484,18 @@ paths:
type: boolean
default: auto
description: Whether to mask the user messages in conversational data or prompts in instruction data.
+training_method:
+  type: string
+  enum:
+    - sft
+    - dpo
+  default: sft
+  description: The training method to use. 'sft' for Supervised Fine-Tuning or 'dpo' for Direct Preference Optimization.
+dpo_beta:
+  type: number
+  format: float
+  default: 0.1
+  description: The beta parameter for DPO training. Only applicable when training_method is 'dpo'.
training_type:
type: object
oneOf:
@@ -2337,6 +2349,16 @@ components:
enum:
- auto
default: auto
+training_method:
+  type: string
+  enum:
+    - sft
+    - dpo
+  default: sft
+dpo_beta:
+  type: number
+  format: float
+  default: 0.1
training_type:
type: object
oneOf:
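For reviewers, here is a minimal sketch of how a client might assemble the two new fields before sending a fine-tuning request. The helper name `build_training_fields` is hypothetical and not part of this change. The enum values and defaults are taken from the schema above. Rejecting `dpo_beta` when `training_method` is `sft` is an assumption: the schema only says the parameter is "only applicable" for `dpo` and does not state how the server treats it otherwise.

```python
from typing import Any, Dict, Optional

# Values copied from the schema in this diff.
ALLOWED_METHODS = ("sft", "dpo")
DEFAULT_METHOD = "sft"
DEFAULT_DPO_BETA = 0.1


def build_training_fields(
    training_method: str = DEFAULT_METHOD,
    dpo_beta: Optional[float] = None,
) -> Dict[str, Any]:
    """Return the training_method / dpo_beta portion of a request body.

    Hypothetical client-side helper; the server may validate differently.
    """
    if training_method not in ALLOWED_METHODS:
        raise ValueError(
            f"training_method must be one of {ALLOWED_METHODS}, got {training_method!r}"
        )

    fields: Dict[str, Any] = {"training_method": training_method}

    if training_method == "dpo":
        # Fall back to the schema default when the caller gives no beta.
        fields["dpo_beta"] = DEFAULT_DPO_BETA if dpo_beta is None else float(dpo_beta)
    elif dpo_beta is not None:
        # Assumption: treat a beta on a non-DPO job as a caller error.
        raise ValueError("dpo_beta only applies when training_method is 'dpo'")

    return fields
```

The returned dictionary can be merged into the rest of the request body. Omitting `dpo_beta` for `sft` jobs keeps the payload consistent with the field description.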