There are two ways to calculate accuracy:
- calculate the accuracy of each batch, then take the mean in validation_epoch_end
- accumulate each batch's correct and total counts, then compute accuracy once in validation_epoch_end
Which one does AccuracyMetric do?
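
For context, the two versions give different numbers whenever the batch sizes are unequal (for example, a smaller last batch). Here is a minimal sketch in plain Python. The per-batch counts are made up for illustration, and none of this is taken from AccuracyMetric itself:

```python
# Hypothetical (correct, total) counts per validation batch.
# The last batch is smaller than the others.
batches = [(8, 10), (9, 10), (1, 2)]

# Version 1: accuracy of each batch, then the mean over batches.
per_batch_acc = [correct / total for correct, total in batches]
mean_of_batch_acc = sum(per_batch_acc) / len(per_batch_acc)

# Version 2: accumulate correct and total, then divide once at the end.
total_correct = sum(correct for correct, _ in batches)
total_seen = sum(total for _, total in batches)
accumulated_acc = total_correct / total_seen

print(f"mean of per-batch accuracy: {mean_of_batch_acc:.4f}")
print(f"accumulated accuracy:       {accumulated_acc:.4f}")
```

When every batch has the same size the two versions agree, so the difference only matters when the batches are uneven.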