Hello, I was playing with DreamBooth training and noticed something [here](https://github.com/huggingface/diffusers/blob/0025626cd9feca260e05f295f272af4f7b9ce44a/examples/dreambooth/train_dreambooth.py#L606). Why did the author compute the mean of the loss this way? Is there a particular technical reason to do so, compared to [this](https://github.com/huggingface/diffusers/blob/0025626cd9feca260e05f295f272af4f7b9ce44a/examples/dreambooth/train_dreambooth.py#L609)?