It seems the new `gradient` function based on `ForwardDiff.gradient` is very slow, e.g. on the model `Turing/benchmarks/naive.bayes.run.jl`.