Conversation

@yashuatla
Owner

This PR contains changes from a range of commits from the original repository.

Commit Range: 516843d..c88d236
Files Changed: 47 (22 programming files)
Programming Ratio: 46.8%

Commits included:

ryangyuan and others added 13 commits April 24, 2025 14:25
* feat: replace tutorial link

* replace video link

---------

Co-authored-by: kevin-mindverse <kevin@mindverse.ai>
* Add CUDA support

- CUDA detection
- Memory handling
- Ollama model release after training
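
A minimal sketch of what the CUDA detection and post-training Ollama release could look like; the function names and the localhost endpoint are assumptions for illustration, not the PR's actual code.

```python
import requests
import torch


def cuda_available() -> bool:
    # Detect whether a CUDA-capable GPU is visible to PyTorch.
    return torch.cuda.is_available()


def release_ollama_model(model_name: str, host: str = "http://localhost:11434") -> None:
    # Ask Ollama to unload the model right away by requesting keep_alive=0,
    # freeing GPU/host memory before the training run starts.
    requests.post(
        f"{host}/api/generate",
        json={"model": model_name, "keep_alive": 0},
        timeout=30,
    )
```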

* Fix logging issue

Added a CUDA support flag so the log accurately reflects the CUDA toggle

* Update llama.cpp rebuild

Changed llama.cpp to check whether CUDA support is enabled and, if so, rebuild only during the first build rather than on each run

* Improved VRAM management

Enabled memory pinning and optimizer state offload
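
One common way to combine memory pinning with optimizer-state offload is a DeepSpeed ZeRO config along these lines; whether the PR uses DeepSpeed or another mechanism is an assumption.

```python
# Hypothetical DeepSpeed ZeRO-2 settings: optimizer state lives in host RAM
# ("offload_optimizer") and uses pinned (page-locked) buffers for faster
# host<->device transfers. Batch sizes are placeholders.
deepspeed_config = {
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {
            "device": "cpu",
            "pin_memory": True,
        },
    },
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 8,
}
```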

* Fix CUDA check

Rewrote the llama.cpp rebuild logic and added a manual y/n prompt asking whether the user wants to enable CUDA support

* Added fast restart and fixed CUDA check command

Added make docker-restart-backend-fast to restart the backend and reflect code changes without causing a full llama.cpp rebuild

Fixed the make docker-check-cuda command to correctly report CUDA support

* Added docker-compose.gpu.yml

Added docker-compose.gpu.yml to fix an error on machines without an NVIDIA GPU, and made sure a "\n" is added before the .env modification

* Fixed cuda toggle

The last push accidentally broke the CUDA toggle

* Code review fixes

Fixed errors resulting from removed code:
- Added return save_path to the end of the save_hf_model function
- Rolled back the download_file_with_progress function
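
A rough sketch of the save_hf_model fix, assuming a Transformers-style model/tokenizer pair; the real signature in the repo may differ.

```python
def save_hf_model(model, tokenizer, save_path: str) -> str:
    # Persist the fine-tuned model and tokenizer in Hugging Face format.
    model.save_pretrained(save_path)
    tokenizer.save_pretrained(save_path)
    # The code-review fix: return the path so callers still receive it.
    return save_path
```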

* Update Makefile

Use CUDA by default when running docker-restart-backend-fast

* Minor cleanup

Removed an unnecessary Makefile command and fixed GPU logging

* Delete .gpu_selected

* Simplified CUDA training code

- Removed the dtype setting to let torch handle it automatically
- Removed VRAM logging
- Removed unnecessary/old comments

* Fixed gpu/cpu selection

Made "make docker-use-gpu/cpu" command work with .gpu_selected flag and changed "make docker-restart-backend-fast" command to respect flag instead of always using gpu

* Fix Ollama embedding error

Added a custom exception class for Ollama embeddings, which appeared to be raised with keyword arguments while the base Python exception class only accepts positional ones
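
A minimal sketch of the kind of exception wrapper described above; the class name and stored fields are assumptions.

```python
class OllamaEmbeddingError(Exception):
    # The Ollama embedding path appeared to raise errors with keyword
    # arguments, which a plain Exception subclass rejects, so accept and
    # store them explicitly.
    def __init__(self, message: str = "", **kwargs):
        super().__init__(message)
        self.details = kwargs

    def __str__(self) -> str:
        base = super().__str__()
        return f"{base} {self.details}" if self.details else base
```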

* Fixed model selection & memory error (mindverse#279)

Fixed training defaulting to the 0.5B model regardless of selection, and fixed the "free(): double free detected in tcache 2" error caused by the CUDA flag being passed incorrectly

* feature: use uv to set up the Python environment

* TrainProcessService: add singleton method get_instance
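
A plausible shape for the get_instance singleton on TrainProcessService; the lock and constructor pass-through are assumptions about the implementation.

```python
import threading


class TrainProcessService:
    _instance = None
    _lock = threading.Lock()

    @classmethod
    def get_instance(cls, *args, **kwargs) -> "TrainProcessService":
        # Lazily create one shared service object; later calls return the
        # same instance regardless of the arguments they pass.
        if cls._instance is None:
            with cls._lock:
                if cls._instance is None:
                    cls._instance = cls(*args, **kwargs)
        return cls._instance
```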

* feat: fix code

* Added CUDA support (mindverse#228)

* fix: train service singleton

---------

Co-authored-by: Zachary Pitroda <30330004+zpitroda@users.noreply.github.com>
* fix: adjust status order

* fix: adjust train status

* fix: split the status of service and train (see the sketch below)
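
One way to picture splitting the status of service and train is two separate enums instead of a single shared status field; the names below are illustrative, not the PR's.

```python
from enum import Enum


class ServiceStatus(Enum):
    STOPPED = "stopped"
    STARTING = "starting"
    RUNNING = "running"


class TrainStatus(Enum):
    PENDING = "pending"
    IN_PROGRESS = "in_progress"
    FAILED = "failed"
    COMPLETED = "completed"
```
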
* Update README.md

Changed to the updated tutorial link

* Update README.md with FAQ

Added a new section for the FAQ doc
* fix: adjust status order

* fix: adjust train status

* fix: split the status of service and train

* feat: adjust train rule
* feat: what? no llama.cpp

* add cache