-
-
Notifications
You must be signed in to change notification settings - Fork 5.7k
Make Float16(::BigFloat) go through Float64
#40245
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make Float16(::BigFloat) go through Float64
#40245
Conversation
Doesn't fix double rounding issues, but makes them occur 2^29 times less frequently. Should have minimal performance effects.
|
Can we get a review and merge on this? It's very low impact and a strict improvement. |
|
Bump on this. |
|
Thanks! |
|
I think the best solution to this is probably to write our own version of |
|
I think the following is a correct implementation. It is 3x slower for |
|
The Ivy Bridge and newer x86 processors have See also #37510 |
Doesn't fix double rounding issues, but makes them occur 2^29 times less frequently. Should have minimal performance effects.