Describe the bug
When logging scalars, the wandb logger passes the provided step under a "trainer/step" key instead of using wandb's global step:
```python
if step is not None:
    self.experiment.log({name: value, "trainer/step": step})
else:
    self.experiment.log({name: value})
```
This is an odd choice: because the `step` kwarg is never passed, every call triggers wandb's internal auto-incrementing of its global step: https://github.com/wandb/wandb/blob/v0.20.1/wandb/sdk/wandb_run.py#L1812
This can cause a real headache, particularly when multiple components log through the same logger during a training loop: each call advances wandb's global step, so metrics logged at the same trainer step land on different wandb steps. A minimal sketch reproducing this is below.
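A hedged reproduction sketch (the `WandbLogger` constructor arguments and `log_scalar` signature are assumed from the current torchrl API; offline mode is used so the script runs without a wandb account):

```python
# Minimal reproduction sketch. Assumes torchrl's WandbLogger accepts
# exp_name/offline and exposes log_scalar(name, value, step=...).
from torchrl.record.loggers.wandb import WandbLogger

logger = WandbLogger(exp_name="step-mismatch-repro", offline=True)

# Two components logging through the same logger at the same trainer steps.
for step in range(3):
    logger.log_scalar("loss/actor", 0.1 * step, step=step)
    logger.log_scalar("loss/critic", 0.2 * step, step=step)

# Each log_scalar call becomes experiment.log({name: value, "trainer/step": step})
# with no step kwarg, so wandb's global step advances on every call:
# "loss/actor" lands on wandb steps 0, 2, 4 and "loss/critic" on 1, 3, 5,
# even though both were logged at trainer steps 0, 1, 2.
```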
Expected behavior
I think the simplest approach for torchrl would be to pass the step as wandb intended:
```python
self.experiment.log({name: value}, step=step)
```
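For concreteness, a sketch of what the patched method could look like (the method shape is assumed from the snippet above, not copied from torchrl's source):

```python
def log_scalar(self, name: str, value: float, step: int = None) -> None:
    if step is not None:
        # Pass the step through wandb's own step kwarg so the metric is
        # pinned to the caller's step rather than wandb's auto-incremented one.
        self.experiment.log({name: value}, step=step)
    else:
        self.experiment.log({name: value})
```

With an explicit `step`, metrics logged by different callers at the same step share a single point on the step axis instead of drifting apart.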
Checklist
- I have checked that there is no similar issue in the repo (required)
- I have read the documentation (required)
- I have provided a minimal working example to reproduce the bug (required)