Add initial solution with ttc values as attacker penalties #178

mrkickling · 2025-09-19T11:04:13Z

Implement attacker ttc penalty
Write tests

if setting ttc_values_as_attacker_penalty is enabled,
attackers get rewards as before but additionally gets negative rewards for attack step TTCs.

Attack steps are successfully executed instantly even though they have a TTC value set, but the TTC value is given as a penalty to the attacker agent.

Requires TTCMode PRE_SAMPLE or EXPECTED_VALUE so there are ttc values pre calculated.

…en ttc_values_as_attacker_penalty is enabled

…enalty' is enabled

sandorstormen · 2025-09-19T12:15:00Z

Can I set ttc_values_as_attacker_penalty to True and still select how TTC values are sampled?

mrkickling · 2025-09-19T12:58:32Z

Can I set ttc_values_as_attacker_penalty to True and still select how TTC values are sampled?

Yes, but the current implementation on this branch requires you to pick either PRE_SAMPLE or EXPECTED_VALUE, since there otherwise isn't any pre-calculated ttc value. I explained a bit more detail in the top.

Note: this is not necessarily the final solution, we could chose the other approach that just penalizes attackers with 1 point for each step they attempt.

sandorstormen · 2025-09-22T07:28:41Z

this is not necessarily the final solution, we could chose the other approach that just penalizes attackers with 1 point for each step they attempt.

But isn't this solution already implemented? You won't be able to train an attacker RL agent with ttc_values_as_attacker_penalty=False otherwise.

mrkickling · 2025-09-22T08:40:43Z

No, an attacker agent is not penalized with 1 point per step it takes, and never was. It is not penalized at all as it is now.

mrkickling linked an issue Sep 19, 2025 that may be closed by this pull request

TTCs as Reward #165

Open

mrkickling marked this pull request as draft September 19, 2025 11:04

mrkickling requested a review from sandorstormen September 19, 2025 11:05

Add initial solution with ttc values as attacker rewards

25f5e66

mrkickling force-pushed the ttcs-as-reward branch from bc691d7 to 25f5e66 Compare September 19, 2025 11:05

mrkickling marked this pull request as ready for review September 19, 2025 11:06

mrkickling added 2 commits September 19, 2025 13:19

reward->penalty, also add scenario rewards to attacker reward even wh…

0385920

…en ttc_values_as_attacker_penalty is enabled

Let attacker compromise stept right away if 'ttc_values_as_attacker_p…

e3de9cf

…enalty' is enabled

sandorstormen changed the title ~~Add initial solution with ttc values as attacker rewards~~ Add initial solution with ttc values as attacker penalties Sep 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add initial solution with ttc values as attacker penalties #178

Add initial solution with ttc values as attacker penalties #178

Uh oh!

mrkickling commented Sep 19, 2025 •

edited

Loading

Uh oh!

sandorstormen commented Sep 19, 2025

Uh oh!

mrkickling commented Sep 19, 2025 •

edited

Loading

Uh oh!

sandorstormen commented Sep 22, 2025

Uh oh!

mrkickling commented Sep 22, 2025

Uh oh!

Uh oh!

Add initial solution with ttc values as attacker penalties #178

Are you sure you want to change the base?

Add initial solution with ttc values as attacker penalties #178

Uh oh!

Conversation

mrkickling commented Sep 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sandorstormen commented Sep 19, 2025

Uh oh!

mrkickling commented Sep 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sandorstormen commented Sep 22, 2025

Uh oh!

mrkickling commented Sep 22, 2025

Uh oh!

Uh oh!

mrkickling commented Sep 19, 2025 •

edited

Loading

mrkickling commented Sep 19, 2025 •

edited

Loading