Full DynaTemp implementation + UI #600

kalomaze · 2024-01-04T23:49:32Z

Fully implemented in the UI + backend. Allows the user to use a Dynamic Temperature that scales based on the entropy of token probabilities (normalized by the maximum possible entropy for a distribution so it scales well across different K values).

You can set a minimum Temperature and a maximum Temperature.

Allows for more unique and varied outputs, especially when paired with Min P and/or a reasonable Top K.
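As a rough illustration of the idea described above, here is a minimal sketch of entropy-scaled temperature. The function names `normalized_entropy` and `dynamic_temperature` are hypothetical, and the linear mapping from normalized entropy to temperature is an assumption made for clarity; the PR's actual `llama_sample_entropy` may map entropy to temperature differently.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Shannon entropy of `probs` divided by log(n), the entropy of a uniform
// distribution over the same n candidates. The result lies in [0, 1], which
// keeps the scale comparable across different Top K sizes.
// (Hypothetical helper, not a function from the PR.)
float normalized_entropy(const std::vector<float> & probs) {
    assert(!probs.empty());
    if (probs.size() == 1) {
        return 0.0f; // a single candidate carries no uncertainty
    }
    float entropy = 0.0f;
    for (float p : probs) {
        if (p > 0.0f) {
            entropy -= p * std::log(p);
        }
    }
    const float max_entropy = std::log(static_cast<float>(probs.size()));
    return std::min(1.0f, std::max(0.0f, entropy / max_entropy));
}

// Assumed linear mapping: a confident (low-entropy) distribution gets a
// temperature near min_temp, an uncertain one gets a temperature near max_temp.
float dynamic_temperature(const std::vector<float> & probs, float min_temp, float max_temp) {
    return min_temp + (max_temp - min_temp) * normalized_entropy(probs);
}
```

With this sketch, a uniform distribution over the candidates yields `max_temp`, and a distribution that puts all mass on one token yields `min_temp`.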

ycros · 2024-01-05T01:14:21Z

llama.h

@@ -723,6 +723,15 @@ extern "C" {
                           float   p,
                          size_t   min_keep);

+    /// @details DYNATEMP! #TODO KALO


I think the assumption he made is that there'd be documentation / explanation in line with the other samplers.

Yes, that is the assumption I made.

ycros · 2024-01-05T01:15:58Z

gpttype_adapter.cpp

@@ -1943,7 +1958,7 @@ generation_outputs gpttype_generate(const generation_inputs inputs, generation_o

            id = SampleLogits(logitsPtr, nctx, n_vocab, last_n_size, repeat_penalty, presence_penalty,
            top_k, top_a, top_p, min_p, typical_p, tfs_z, temp, rng,
-            kcpp_params->mirostat, kcpp_params->mirostat_tau, kcpp_params->mirostat_eta, sampler_order, grammar);
+            kcpp_params->mirostat, kcpp_params->mirostat_tau, kcpp_params->mirostat_eta, sampler_order, grammar, dynatemp, min_temp, max_temp);


@LostRuins Not related to this PR but it feels like kcpp_params should just be passed in instead.

good point, will probably refactor it soon.
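One way to picture the refactor suggested in this thread is sketched below: a single settings struct is passed by const reference, so adding a sampler option no longer lengthens the call. `SamplerSettings` and `pick_temperature` are hypothetical names, not the real `kcpp_params` layout or `SampleLogits` signature. The mirostat defaults follow the PR's `llama_sampling_params` hunk, the `min_temp`/`max_temp` defaults follow the `llama_sample_entropy` signature, and the `temp` default is an assumption.

```cpp
#include <cassert>

// Hypothetical stand-in for the sampler-related fields of kcpp_params.
struct SamplerSettings {
    int   mirostat     = 0;      // 0 = disabled, 1 = mirostat, 2 = mirostat 2.0
    float mirostat_tau = 5.00f;  // target entropy
    float mirostat_eta = 0.10f;  // learning rate
    float temp         = 0.8f;   // assumed default, not taken from the PR
    bool  dynatemp     = false;  // dynamic temperature toggle
    float min_temp     = 0.0f;
    float max_temp     = 2.0f;
};

// The callee reads whatever fields it needs from the struct. New options
// become new fields rather than new positional arguments.
// `entropy_ratio` is assumed to be a normalized entropy in [0, 1].
float pick_temperature(const SamplerSettings & s, float entropy_ratio) {
    assert(entropy_ratio >= 0.0f && entropy_ratio <= 1.0f);
    if (!s.dynatemp) {
        return s.temp; // static temperature path
    }
    return s.min_temp + (s.max_temp - s.min_temp) * entropy_ratio;
}
```

The trade-off is that the callee can now see every setting, so the struct is best passed as `const` to make clear it is read-only.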

ycros · 2024-01-05T01:19:40Z

llama.cpp

 void llama_sample_temperature(struct llama_context * ctx, llama_token_data_array * candidates_p, float temp) {
    llama_sample_temp(ctx, candidates_p, temp);
 }

+void llama_sample_entropy(struct llama_context * ctx, llama_token_data_array * candidates_p, float temp, float min_temp = 0, float max_temp = 2.0f) {


It'd be good to hide all of the debugging printfs in this function behind the debugmode flag.
... although that's not accessible from here given that this is llama.cpp

ycros · 2024-01-05T06:27:31Z

llama.cpp

 void llama_sample_temperature(struct llama_context * ctx, llama_token_data_array * candidates_p, float temp) {
    llama_sample_temp(ctx, candidates_p, temp);
 }

+void llama_sample_entropy(struct llama_context * ctx, llama_token_data_array * candidates_p, float temp, float min_temp = 0, float max_temp = 2.0f) {


I guess temp just isn't used at all?

Yeah, it isn't run when dynamic temp is turned on, to avoid obvious conflicts.

LostRuins · 2024-01-05T10:15:59Z

common/sampling.h

@@ -25,6 +25,9 @@ typedef struct llama_sampling_params {
    int32_t     mirostat              = 0;        // 0 = disabled, 1 = mirostat, 2 = mirostat 2.0
    float       mirostat_tau          = 5.00f;    // target entropy
    float       mirostat_eta          = 0.10f;    // learning rate
+    bool        dynatemp              = false;    // dynamic temperature


The first thing that stands out is that I don't think a bool for toggling dynatemp is necessary. I think this can be simplified, and there are two good ways of doing it.

The first way is to make it part of the sampler: having dynatemp_min and dynatemp_max both be 0 would leave dynatemp inactive; otherwise it would be active and applied (overriding temperature). That puts it in line with all the other samplers, which have an "inactive" value and an "active" value.

The second option, which I strongly support, is a single value, dynatemp_range. This is a float that represents the allowed deviation above and below temperature when using dynatemp, so the effective minimum is temperature - dynatemp_range and the effective maximum is temperature + dynatemp_range.

Thus, if we want dynatemp_min=0.3 and dynatemp_max=0.5, we would simply set temperature=0.4 and dynatemp_range=0.1.

If you want dynatemp_min=0.8 and dynatemp_max=2.0, then you set temperature=1.4 and dynatemp_range=0.6.

It will work for any value of min and max.

To disable, set dynatemp_range=0.

Thoughts?

I like this idea and it's nicely intuitive.
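The arithmetic behind the dynatemp_range proposal in this thread can be sketched as follows. `TempWindow` and `dynatemp_window` are hypothetical names, and clamping the lower bound at 0 is an assumption added here, not something stated in the review.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical result type: the temperature window implied by the two settings.
struct TempWindow {
    float min_temp;
    float max_temp;
    bool  dynamic;   // false when dynatemp_range disables dynamic temperature
};

// Derives the min/max temperature from a base temperature and a range.
// A range of 0 (or less) leaves dynatemp inactive and keeps the plain temperature.
TempWindow dynatemp_window(float temperature, float dynatemp_range) {
    assert(temperature >= 0.0f);
    if (dynatemp_range <= 0.0f) {
        return {temperature, temperature, false};
    }
    // Assumption: never let the lower bound drop below zero.
    const float lo = std::max(0.0f, temperature - dynatemp_range);
    const float hi = temperature + dynatemp_range;
    return {lo, hi, true};
}
```

For example, `dynatemp_window(0.4f, 0.1f)` gives a window of roughly 0.3 to 0.5, and `dynatemp_window(1.4f, 0.6f)` gives roughly 0.8 to 2.0, matching the two cases in the comment.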

LostRuins

@kalomaze please take a look and see if I've missed anything. This is dynatemp refactored into a single value, dynatemp_range: a float that represents the allowed deviation above and below temperature when using dynatemp. Thus, if we want dynatemp_min=0.3 and dynatemp_max=0.5, we would simply set temperature=0.4 and dynatemp_range=0.1. Functionally dynatemp operates the same, but this simplifies usage by making it a single, easy-to-adjust value.

kalomaze and others added 20 commits December 31, 2023 17:55

2b476d4  move Dynatemp changes to new branch
c4b4781  fix float header
b983ae0  Properly reintroduce variable expert count
         Controllable through experts.txt
b8551b4  first pass at DynaTemp UI
         Checkbox partial implemented, Min and Max Temp implemented
1d0dec4  DynaTemp UI Checkbox
         Trigger DynaTemp on checkbox
8cf96ff  DynaTemp UI checkbox edition
         Hell Yeah! DynaTemp!
7f11b7e  DynaTemp UI integration
303a32e  Remove greedy dynatemp
77e0f6c  Fix race condition caused by debug print
69e293c  Fixed broken presets and miro
         Fixes broken presets and mirostat
ce2e738  Merge pull request #11 from AAbushady/dynatemp-mainline
         Fixed broken presets and miro
c2d14ab  Remove debug function + HHI temp
         Also removed unnecessary softmax double precision
b753bb4  Merge branch 'dynatemp-pr-upstream' into dynatemp-mainline
351262a  Fix whitespace (?) for generate function
1470209  epic upstream renaming scheme fix
597f80d  fix stupid indents
a4f8ff4  Other cleanup
         Reintroduce unused rep pen function, move temp functions first before entropy dynamic temp
1d9c9b5  Slight indent fix
f61a441  revert batch pyinstaller maker to mainline
         and also delete experts.txt since adjustable routing is also being removed for the PR
79aeadf  Merge Dynamic Temp UI + cleanups into dynatemp-pr-upstream

ycros reviewed Jan 5, 2024

kalomaze mentioned this pull request Jan 5, 2024

Dynamic Temperature HF loader support oobabooga/text-generation-webui#5174

Merged

LostRuins reviewed Jan 5, 2024

LostRuins approved these changes Jan 5, 2024

LostRuins changed the base branch from concedo to concedo_experimental January 6, 2024 03:12

LostRuins merged commit 123bff9 into LostRuins:concedo_experimental Jan 6, 2024

DutchEllie mentioned this pull request Jan 9, 2024

[Feature Request] Dynamic temperature sampling for better coherence / creativity ggerganov/llama.cpp#3483

Closed

IdiotSandwichTheThird mentioned this pull request Jan 17, 2024

[Feature Request] Support Koboldcpp Dynamic Temperature lmg-anon/mikupad#15

Closed
