Conversation

@thiswillbeyourgithub
Contributor

Hi!

I'm the dev who made a research fork of repeng to explore some questions about what's possible with repeng and repeng-like techniques. I first mentioned my ideas in these messages.

The project is still ongoing, but as I kept adding more and more little enhancements, it felt like a good idea to try to upstream some of them.

Notably, I added a utils.py file with a make_dataset function, joblib caching for the activations, support for chat templates (with autocorrection), L1/L2 norm selection, documentation, more control over which layers to modify, and NumPy v2 compatibility.

Note that I couldn't run the test file because of import issues with flash attention.

I also did not add any modules to the requirements yet.

I mainly wanted your feedback on this.

Also thanks a lot for making repeng, it's really nice and really allowed me to get my hands dirty!

Commits:

  • add file settings.py to store settings
  • add a utils.py file that contains a make_dataset function
  • move DatasetEntry to utils.py
  • feat: give more controls to how the user can select the layers to modify
  • feat: add joblib memory for caching model activations
  • fix: numpy v2 compatibility
  • feat: add arg norm_type when training that allows setting l1 or l2 (or other) norm
  • feat: add function to accept chat template as inputs + autocorrect
  • fix: forgot an import
  • minor: add tqdm for getting activations and applying the new directions
  • perf: potentially faster one liner to tokenize
  • feat: use flag LOW_MEMORY to reduce the amount of memory needed when storing the arrays before computing directions
  • fix: imports
  • update default model in example from mistral 7B v0.1 to v0.3
  • docs: add more details on how to load the model, including quantization, avoiding OOM, etc
  • docs: use chat messages in example
  • docs: in example, show how to login for models that require it
  • docs: add missing declaration of tokenizer
  • fix: in example, move the tokens directly to the correct device
  • minor: changed default in the example
  • docs: add link to github issue about OOM for gguf files
  • docs: add more examples of models
  • fix: autocorrect chat templates

allows setting verbosity, low_memory flags, etc.

Signed-off-by: thiswillbeyourgithub <26625900+thiswillbeyourgithub@users.noreply.github.com>

The layer_ids arg can now be either a list of ints (the indices of the layers), "all" for all layers, "middle" for the middle half, "only_middle" for the single layer at the middle, or a range in the form "0.3-0.7" (= layers from 30% to 70% of the depth).

Should close vgel#51.

'Autocorrect' means that we check that each message is indeed present in the output after applying the chat template. This was needed because there are many cases where models (including official releases from GAFAM!) have sketchy implementations that silently drop system prompts, etc.
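The autocorrect check described in that commit could be sketched like this (a minimal, hypothetical standalone version with made-up names; repeng's actual implementation may differ):

```python
# Sketch of the "autocorrect" idea: after applying a tokenizer's chat
# template, verify that every message's content actually appears in the
# rendered output. Hypothetical helper, not repeng's real API.
def check_template_output(rendered: str, messages: list[dict]) -> list[dict]:
    """Return the messages whose content was dropped by the template."""
    dropped = []
    for msg in messages:
        if msg["content"] not in rendered:
            dropped.append(msg)
    return dropped

messages = [
    {"role": "system", "content": "You are terse."},
    {"role": "user", "content": "Hi!"},
]
# A template that silently drops the system prompt:
rendered = "<s>[INST] Hi! [/INST]"
assert check_template_output(rendered, messages) == [messages[0]]
```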
@vgel
Owner

vgel commented Dec 14, 2024

This is really cool! My first thought is there's probably too much going on here for one PR, but let me look it over more closely and figure out how we can split this up. I've also been considering adding a utils file for a similar reason as you--I might make a PR for that soon-ish and tag you in.

@vgel
Owner

vgel commented Dec 14, 2024

I'm also currently migrating the library from poetry to uv, so no rush on adding dependencies!

@thiswillbeyourgithub
Contributor Author

thiswillbeyourgithub commented Dec 14, 2024

This is really cool! My first thought is there's probably too much going on here for one PR, but let me look it over more closely and figure out how we can split this up. I've also been considering adding a utils file for a similar reason as you--I might make a PR for that soon-ish and tag you in.

Glad you like it! I take this opportunity to thank you once again for making repeng, it's been super helpful to get me started on many ideas I had.

I'm fine with splitting this up, but I do care about my contribution being credited, as I'm a self-taught medical student (soon a psychiatry resident!) and I want my interests and contributions to appear on my GitHub for visibility :)

In any case, I keep breaking and unbreaking my fork as I'm continuously experimenting (since yesterday I've been playing around with filtering samples that are retrievable after a 2-cluster k-means on the UMAP projection, and I feel good about this one!). Apart from the read_representation method, I think the code is mostly stable. I'm sometimes unsure about my chat templating thingy, but that might be because of errors in the original templates of the models' repos too, so I'd appreciate a second look on this if you have the bandwidth.

Also, I couldn't run the tests on my machine; I think it would really help to know whether my modifications are stable and have no side effects.

edit: actually, I was able to run the tests. But as I made extensive changes, I'll keep them in my fork for now.

@vgel
Owner

vgel commented Jan 8, 2025

definitely, will absolutely make sure you're credited on anything that gets merged :-)

@vgel
Owner

vgel commented Jan 8, 2025

here, a good initial thing to split off would be the ability to pass a string for layer_ids in ControlModel.__init__. i think we should have three options:

  • a list / range object, like we currently have
  • "all"
  • "middle", which should keep the middle 2/3 (instead of the middle half as it does now)

then in the docstring, we should suggest people use "middle" as the default. i don't think we should support None, because people should be explicit about what layers are being controlled. and i think "only_middle" and the range stuff are marginal enough that we don't need to support them by default--people can always calculate it themselves if they need it, or we can add it later.
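The three options above could be sketched like this (a hypothetical helper for illustration, not repeng's actual API; "middle" keeps the middle two-thirds, as proposed):

```python
def resolve_layer_ids(spec, num_layers: int) -> list[int]:
    """Resolve a layer spec into concrete layer indices.

    spec can be a list/range of ints, "all", or "middle"
    ("middle" keeps the middle two-thirds of the layers).
    """
    if spec == "all":
        return list(range(num_layers))
    if spec == "middle":
        # drop roughly one-sixth of the layers from each end
        sixth = num_layers // 6
        return list(range(sixth, num_layers - sixth))
    return list(spec)  # already a list or range of indices

assert resolve_layer_ids("all", 6) == [0, 1, 2, 3, 4, 5]
assert resolve_layer_ids("middle", 30) == list(range(5, 25))
assert resolve_layer_ids(range(2, 4), 6) == [2, 3]
```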

if you could split that off into a new PR, i can help make sure the tests are passing etc so we can get it merged, if that sounds good?

@thiswillbeyourgithub
Contributor Author

Sounds good to me, thanks! I'll do that when I have some time in the next week

@thiswillbeyourgithub
Contributor Author

thiswillbeyourgithub commented Jul 28, 2025

Hi, nearly 8 months later I just wanted to tell you that I have not forgotten about all this. I refined the idea in my head and have upped my Python skills, so the PR project is still on.

I also wanted to share this new idea I had:

  • I intend to make a clean PR to repeng to improve it
  • ask the model to estimate its IQ. As IQ is defined by Gaussian statistics, it would be nice to see that value move depending on which layer was repeng'ed on a dumb<->smart vector. This would be a way to identify which depths most affect a model's behavior.
    • Other idea: on a young<->old vector, we could ask it to estimate its age.
    • At least those two ideas would allow automatically and objectively measuring the impact of repeng on model behavior.
  • then map out the parameter space of how well the repeng vector works (i.e. visually show how much the vector at depth X moves the IQ/age estimation if we give the vector a strength of 1).
  • then do so for each of the other techniques (PCA, UMAP, with or without scaling, etc.).
  • then do so for a few other models for comparison
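The evaluation loop above could be sketched like this (everything here is a hypothetical placeholder, not repeng's API; dummy stand-ins are included so the sketch runs without a model):

```python
import contextlib

def sweep_depths(depths, strengths, steer, estimate_iq):
    """Map (depth, strength) -> the model's self-reported IQ estimate."""
    results = {}
    for depth in depths:
        for strength in strengths:
            with steer(depth, strength):  # apply the control vector
                results[(depth, strength)] = estimate_iq()
    return results

# Dummy stand-ins so the sketch runs without a model:
state = {}

@contextlib.contextmanager
def fake_steer(depth, strength):
    state["bias"] = depth * strength
    yield
    state.pop("bias")

def fake_estimate_iq():
    return 100 + state["bias"]

grid = sweep_depths([0.3, 0.7], [1.0], fake_steer, fake_estimate_iq)
assert grid[(0.3, 1.0)] == 100.3
assert grid[(0.7, 1.0)] == 100.7
```

The resulting grid is what you'd plot to "map out the parameter space" per depth and strength.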

Let me know what you think of this!

@thiswillbeyourgithub
Contributor Author

Hello everyone, I created PR #65, which hopefully implements most of the features I needed. It mainly brings 3 features:

  1. The Hugging Face model's chat template is respected when possible, making it easier to try a different model without having to dig into the intricacies of its template.
  2. It is now possible to supply the examples in the form of familiar chat messages (list of dicts, with role and content).
  3. It is now possible to specify layer_zones instead of layer_ids. For example [[0.1, 0.5]] means we control the layers whose relative depth is between 0.1 (included) and 0.5 (not included).
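The layer_zones semantics in point 3 could map to indices like this (an illustrative sketch with a made-up helper name, not the PR's actual implementation):

```python
def zones_to_layer_ids(zones, num_layers):
    """Convert relative-depth zones like [[0.1, 0.5]] into layer indices.

    A layer i is selected when start <= i / num_layers < end,
    i.e. the zone start is included and the end excluded.
    """
    ids = []
    for start, end in zones:
        for i in range(num_layers):
            depth = i / num_layers
            if start <= depth < end:
                ids.append(i)
    return sorted(set(ids))

# For a 10-layer model, [[0.1, 0.5]] selects layers 1 through 4:
assert zones_to_layer_ids([[0.1, 0.5]], 10) == [1, 2, 3, 4]
```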

Hence I'm closing this.
