Skip to content

[BUG] Reversal of convergence for warm-clones with DPO #124

@anay-rfai

Description

@anay-rfai

Bug Description

Convergence is disrupted at chunk boundaries when using warm clones, causing inflection points and convergence graph reversals. This issue occurs consistently across different model types and quantization settings, but only affects warm-started configurations.

To Reproduce

Steps to reproduce the behavior:

  1. Configure a DPO notebook experiment with chunks=4
  2. Set up warm clone configurations (as shown in runs 3, 5, 6, 7 from the attached screenshot)
  3. Execute the training run
  4. Monitor convergence plots during training
  5. Observe inflection points and convergence reversals at each chunk boundary

Expected Behavior

Convergence should remain smooth and monotonic across chunk boundaries, similar to the behavior observed in initial configs and clone modify configs (non-warm-started configurations). The convergence graph should not show inflection points or reversals at chunk transitions.

Screenshots

Screenshots show:

  • Configuration definitions for various runs (first screenshot)
  • Comparison between affected warm clone runs (3, 5, 6, 7) and unaffected configurations for various metrics
Image Image Image Image Image

Environment

  • OS: Ubuntu
  • Python version: 3.12
  • RapidFire AI version: 0.12.6
  • Browser (if applicable): Chrome

Additional Context

  • Issue occurs with both quantized and non-quantized models
  • Affects both RapidFire model and Mistral base model
  • Problem is isolated to warm clone configurations only
  • Initial configurations and clone modify configurations (non-warm-started) converge smoothly without this issue
  • Chunk size used: 4

Error Logs

No error logs

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions