Deprecate MultiHeadAttention now that tensorflow/tensorflow@f32c80b has been merged.
I'm not sure what our roadmap is here. Should we alias/wrap the functionality onto core TF, or just remove it? In the gelu case, the default argument changed, and for MultiHeadAttention the input signature is entirely different.
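If we go the alias/wrap route, one option is a thin deprecation shim that forwards to the core TF symbol while warning users to migrate. A minimal sketch (the `deprecated_alias` helper and its messages are hypothetical, not an existing TFA utility; in practice `new_fn` would be e.g. `tf.nn.gelu`):

```python
import functools
import warnings


def deprecated_alias(new_fn, old_name, new_name):
    """Hypothetical helper: wrap a core-TF callable so the old Addons
    name keeps working but emits a DeprecationWarning pointing at the
    replacement symbol."""
    @functools.wraps(new_fn)
    def wrapper(*args, **kwargs):
        warnings.warn(
            f"{old_name} is deprecated; use {new_name} instead. "
            "Note that defaults or signatures may differ.",
            DeprecationWarning,
            stacklevel=2,
        )
        return new_fn(*args, **kwargs)
    return wrapper


# Stand-in callable for illustration; a real shim would forward to
# the core TF implementation (e.g. tf.nn.gelu).
gelu = deprecated_alias(lambda x: x, "tfa.activations.gelu", "tf.nn.gelu")
```

This kind of shim only helps when the new API is call-compatible; it doesn't resolve the MultiHeadAttention case, where the input signature differs, so removal (with a migration note) may be the only clean option there.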