Internal Node Id Dependent on Order of Action Execution

The order of vulnerability execution defines the order that `__discovered_nodes` are added internally. There is functionality in place that uses this integer value for downstream tasks. ([1](https://github.com/microsoft/CyberBattleSim/blob/main/cyberbattle/_env/cyberbattle_env.py#L568C28-L568C28), [2](https://github.com/microsoft/CyberBattleSim/blob/main/cyberbattle/_env/cyberbattle_env.py#L588), [3](https://github.com/microsoft/CyberBattleSim/blob/main/cyberbattle/_env/cyberbattle_env.py#L668), [4](https://github.com/microsoft/CyberBattleSim/blob/main/cyberbattle/_env/cyberbattle_env.py#L690), etc)

This also means that the action masking is dependent on the order of vulnerability execution ([repro](https://github.com/forrestmckee/CyberBattleSim/blob/internal_id_error/notebooks/internal_id_error.ipynb))


Doesn't that mean that during training, the state-action value approximations are based on an integer encoding that changes according to the order of actions at every reset? Checkpointing or transfer learning would also suffer.

For ports, local/remote vulnerabilities, etc you use `model.Environment.identifiers`  when retrieving via integer encoding, so they're fixed.

Let me know if you need any clarification, or if I'm missing something here. 

Thanks!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Internal Node Id Dependent on Order of Action Execution #117

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Internal Node Id Dependent on Order of Action Execution #117

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions