Tags · strangedove/mergekit

v0.1.2

Fix plumbing of apply_chat_template/fewshot_as_multiturn in mergekit-evolve (arcee-ai#535)

Mar 14, 2025
48b0d48
zip
tar.gz

v0.1.1

LoRA extraction fixes (arcee-ai#522)

Addresses arcee-ai#521.

Also adds:
* `--lora-merge-dtype` to specify dtype to use when applying LoRA adapters to models
* `--gpu-rich` alias for convenience
* Organize display of options in `--help`

Feb 28, 2025
efd4ea0
zip
tar.gz

v0.1.0

Compute-graph based `mergekit-extract-lora` (arcee-ai#505)

Now with better embedding handling, multi-GPU execution, and lazy
loading/saving of tensors.

When extracting a LoRA from an 8B model, execution time goes from ~6
minutes down to 40 seconds with `--cuda --multi-gpu` on an 8-GPU
machine.

Additionally, the `--sv-epsilon` flag can be used to set a tolerance for
singular values, opportunistically reducing the rank when the fine-tuned
difference is inherently lower rank.
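
The idea behind a singular-value tolerance can be sketched in a few lines. This is an illustration only, not mergekit's implementation: the function name `choose_rank` is hypothetical, and treating the tolerance as relative to the largest singular value is an assumption (the actual flag may use an absolute threshold or a different rule).

```python
def choose_rank(singular_values, max_rank, sv_epsilon=0.0):
    """Pick how many singular values to keep for a low-rank factorization.

    singular_values: non-negative values sorted in descending order.
    max_rank: the requested rank, used as an upper bound.
    sv_epsilon: ASSUMED to be a relative tolerance; values at or below
        sv_epsilon * (largest singular value) are treated as negligible.
    """
    if not singular_values or max_rank <= 0:
        return 0
    threshold = sv_epsilon * singular_values[0]
    # Count the singular values that rise above the tolerance.
    significant = sum(1 for s in singular_values if s > threshold)
    # Never exceed the requested rank, and keep at least rank 1.
    return max(1, min(max_rank, significant))


# Singular values of a hypothetical weight difference that is effectively rank 2.
svals = [5.0, 2.0, 1e-6, 1e-9]
print(choose_rank(svals, max_rank=16, sv_epsilon=1e-4))
print(choose_rank(svals, max_rank=16, sv_epsilon=0.0))
```

With a nonzero tolerance the two tiny trailing values are dropped, so the extracted adapter for that tensor would need only rank 2 instead of the requested 16; with a tolerance of zero every nonzero singular value is kept, up to `max_rank`.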

Also reimplement a couple of merge methods using the `@easy_define`
decorator and add some missing tests.

Feb 7, 2025
a2dda31
zip
tar.gz

v0.0.6

Update copyright (arcee-ai#497)

Also bump dependencies.

Jan 25, 2025
9017715
zip
tar.gz

v0.0.4

Update README.md

Feb 4, 2024
db30a71
zip
tar.gz

v0.0.3.2

Reorganize slightly

Jan 14, 2024
206102b
zip
tar.gz

v0.0.3.1

Bump copyright date to 2024

Jan 2, 2024
f5c1876
zip
tar.gz

legacy

Copy tokenizer

Sep 26, 2023
f9a4d51
zip
tar.gz
