This repository was archived by the owner on Mar 21, 2024. It is now read-only.

Multi-node training jobs for LightningContainer models can get stuck at inference time #493

@ant0nsc

Description

It appears that using any of the PyTorch Lightning metrics in the `test_step` can cause multi-node jobs to hang indefinitely: the metrics try to synchronize with the other GPUs, but those processes have already terminated.

AB#4121
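A minimal sketch of the pattern that appears to trigger the hang, assuming a recent `torchmetrics` version; the module, field, and metric names below are illustrative and not taken from this repository:

```python
import torch
import pytorch_lightning as pl
from torchmetrics import Accuracy


class ToyTestModule(pl.LightningModule):
    """Hypothetical module, for illustration only."""

    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(8, 2)
        # torchmetrics metrics participate in the DDP process group.
        self.test_acc = Accuracy(task="multiclass", num_classes=2)

    def forward(self, x):
        return self.layer(x)

    def test_step(self, batch, batch_idx):
        inputs, targets = batch
        logits = self(inputs)
        # Updating and logging the metric issues a collective call
        # (all_gather / reduce) across ranks. If the other ranks have
        # already exited after training, this call blocks indefinitely
        # on a multi-node job.
        self.test_acc(logits, targets)
        self.log("test_acc", self.test_acc)
```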
