-
Notifications
You must be signed in to change notification settings - Fork 1
Labels
enhancementNew feature or requestNew feature or request
Description
One way you can represent cost for RL algorithms is to adjust the reward function. We should implement an option to represent TTCs in terms of reward. The reward function (or more aptly named utility function) should give the negative TTC when performing an attack step that always succeeds.
There are two formulations for this:
such that
Let me note here that this has already been done by Sandor and Manuel with a simplified coreLang
.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request