
[Performance] Prioritised TensorDict replay buffers use for loops over the batch dimension #1574

@matteobettini

Description


In prioritised tensordict replay buffers, the `update_tensordict_priority` method performs a for loop over the batch dimension:

```python
priority = torch.tensor(
    [self._get_priority(td) for td in data],
    dtype=torch.float,
    device=data.device,
)
```
This causes significant slowdowns, since this is the vectorised dimension used in training pipelines and it can grow very large.

This method is called every time the buffer is extended or the priorities are updated.
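To illustrate the cost, here is a minimal sketch contrasting the reported per-item loop with a fully vectorised computation. The proportional-priority formula `(|td_error| + eps) ** alpha` and the function names `looped_priority` / `batched_priority` are illustrative assumptions, not TorchRL's actual `_get_priority` implementation:

```python
import torch

def looped_priority(td_error: torch.Tensor, alpha: float = 0.6, eps: float = 1e-8) -> torch.Tensor:
    # Analogous to the reported pattern: a Python loop over the batch
    # dimension, building one scalar priority per sample.
    return torch.tensor(
        [((e.abs() + eps) ** alpha).item() for e in td_error],
        dtype=td_error.dtype,
        device=td_error.device,
    )

def batched_priority(td_error: torch.Tensor, alpha: float = 0.6, eps: float = 1e-8) -> torch.Tensor:
    # Vectorised alternative: a single tensor op over the whole batch,
    # avoiding per-sample Python overhead entirely.
    return (td_error.abs() + eps) ** alpha

td_error = torch.randn(1024)
assert torch.allclose(looped_priority(td_error), batched_priority(td_error))
```

Both versions produce identical priorities; the difference is that the loop pays Python-interpreter and per-element tensor-construction overhead proportional to the batch size, which is exactly the dimension that grows large in training pipelines.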

Labels: bug (Something isn't working)