There's a C++26 paper about compile-time math optimizations with a good overview and discussion of some of these issues [P1383]. The paper explicitly states:
1. It is acceptable for evaluation of mathematical functions to differ between translation time and runtime.
2. It is acceptable for constant evaluation of mathematical functions to differ between platforms.
So C++ has very much accepted the fact that floating-point functions should not be presumed to give identical results in all circumstances.
Now, it is of course possible to ensure that floating-point functions give identical results on all your target machines, but it's usually not worth the hassle.
[P1383]: https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2023/p13...
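To make the first point concrete, here's a minimal sketch (assuming GCC, which folds constant-argument calls like `std::sin` at translation time using MPFR, while the `volatile` forces the second call through libm at runtime):

```cpp
#include <cmath>
#include <cstdio>

int main() {
    double folded = std::sin(1.0);   // constant arg: may be folded at translation time
    volatile double x = 1.0;         // volatile defeats folding
    double runtime = std::sin(x);    // goes through libm at runtime
    std::printf("folded:  %.17g\n", folded);
    std::printf("runtime: %.17g\n", runtime);
    // The two are *permitted* to differ; whether they actually do
    // depends on the compiler's folding and the libm you link against.
    std::printf("identical: %s\n", folded == runtime ? "yes" : "no");
}
```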
The Intel compiler, for example, uses less than IEEE 754 precision for floating-point ops by default.
It is commutative (except for NaN). It isn't associative, though.
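A quick sketch of the associativity failure, using overflow to make the grouping difference obvious (IEEE 754 double assumed; the `volatile`s just keep the compiler from folding everything):

```cpp
#include <cstdio>

int main() {
    volatile double a = 1e308, b = 10.0, c = 0.1;
    // Grouping changes intermediate rounding, and here, overflow:
    double left  = (a * b) * c;  // 1e308 * 10 overflows to inf; inf * 0.1 = inf
    double right = a * (b * c);  // 10 * 0.1 rounds to exactly 1.0; result 1e308
    std::printf("left  = %g\n", left);   // inf
    std::printf("right = %g\n", right);  // 1e+308
}
```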
Why? This is well specified by IEEE 754. Many runtimes (e.g. for JavaScript) use NaN boxing. Treating floats as a semi-arbitrary selection of rational numbers plus a handful of special values is /more/ correct than treating them as real numbers, but treating them as actually specified does give more flexibility and power.
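For the curious, here's a minimal NaN-boxing sketch; the tag layout and the 48-bit-pointer assumption are illustrative, not any real engine's format:

```cpp
#include <cstdint>
#include <cstring>
#include <cstdio>

// Quiet-NaN prefix: exponent all ones, top mantissa bit set.
constexpr uint64_t QNAN    = 0x7FF8000000000000ULL;
// Hypothetical tag bit marking "this NaN carries a pointer".
constexpr uint64_t TAG_PTR = 0x0001000000000000ULL;
constexpr uint64_t LOW48   = 0x0000FFFFFFFFFFFFULL;

// Real doubles are stored as their own bit pattern.
uint64_t box_double(double d) {
    uint64_t bits;
    std::memcpy(&bits, &d, sizeof bits);
    return bits;
}

// Pointers ride in the low 48 bits of a quiet NaN.
uint64_t box_pointer(const void* p) {
    return QNAN | TAG_PTR | (reinterpret_cast<uintptr_t>(p) & LOW48);
}

bool is_pointer(uint64_t v) {
    return (v & (QNAN | TAG_PTR)) == (QNAN | TAG_PTR);
}

int main() {
    int x = 42;
    uint64_t v = box_pointer(&x);
    if (is_pointer(v)) {
        const int* p = reinterpret_cast<const int*>(v & LOW48);
        std::printf("unboxed: %d\n", *p);
    }
    std::printf("boxed 3.14 is pointer? %s\n",
                is_pointer(box_double(3.14)) ? "yes" : "no");
}
```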
But I've never seen them depend on those NaNs surviving the FPU. Hell, they could use the same trick on bit patterns that overlap with valid float values if they really wanted to.
My understanding is the exact opposite - that it allows implementations to return any NaN value at all. It need not be any that were inputs.
It may be that JavaScript relies on it and that has become more binding than the actual spec, but I don't think the spec actually guarantees this.
Edit: actually, it turns out NaN boxing does not involve arithmetic, which is why it works. I think my original point stands: if you are doing something that relies on how the bit patterns of NaNs are propagated through arithmetic, you are on shaky ground.
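Here's a small experiment illustrating that shakiness; it hand-builds a quiet NaN with a distinctive payload and pushes it through one add (64-bit IEEE double assumed):

```cpp
#include <cstdint>
#include <cstring>
#include <cstdio>

int main() {
    // Quiet NaN carrying payload 0x12345 in the low mantissa bits.
    uint64_t in_bits = 0x7FF8000000012345ULL;
    double nan_in;
    std::memcpy(&nan_in, &in_bits, sizeof nan_in);

    volatile double one = 1.0;      // volatile defeats constant folding
    double nan_out = nan_in + one;  // one arithmetic op

    uint64_t out_bits;
    std::memcpy(&out_bits, &nan_out, sizeof out_bits);
    std::printf("in:  %016llx\n", (unsigned long long)in_bits);
    std::printf("out: %016llx\n", (unsigned long long)out_bits);
    // x86 typically propagates the payload; RISC-V returns the
    // canonical NaN 0x7ff8000000000000 (payload dropped).
}
```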
> 6.2.3 NaN propagation
> An operation that propagates a NaN operand to its result and has a single NaN as an input should produce a NaN with the payload of the input NaN if representable in the destination format.
> If two or more inputs are NaN, then the payload of the resulting NaN should be identical to the payload of one of the input NaNs if representable in the destination format. This standard does not specify which of the input NaNs will provide the payload.
https://rust-lang.github.io/rfcs/3514-float-semantics.html
See also this section of Wikipedia: https://en.wikipedia.org/wiki/NaN#Canonical_NaN
"On RISC-V, most floating-point operations only ever generate the canonical NaN, even if a NaN is given as the operand (the payload is not propagated)."
And from the same article:
"IEEE 754-2008 recommends, but does not require, propagation of the NaN payload." (Emphasis mine)
I call bullshit on the statement "specifically binary operations combining two NaN inputs must result in one of the input NaNs." It is definitely not in the spec.
> For an operation with quiet NaN inputs, other than maximum and minimum operations, if a floating-point result is to be delivered the result shall be a quiet NaN which should be one of the input NaNs.
The same document says:
> shall -- indicates mandatory requirements strictly to be followed in order to conform to the standard and from which no deviation is permitted (“shall” means “is required to”)
> should -- indicates that among several possibilities, one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain course of action is deprecated but not prohibited (“should” means “is recommended to”)
i.e. the result is required to be a quiet NaN, and recommended to be one of the input NaNs.
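Which of two input NaN payloads comes back is exactly the kind of thing left open. Here's a sketch that makes the "shall"/"should" split visible (the `qnan` helper is illustrative):

```cpp
#include <cstdint>
#include <cstring>
#include <cstdio>

// Illustrative helper: a quiet NaN carrying a chosen payload.
double qnan(uint64_t payload) {
    uint64_t bits = 0x7FF8000000000000ULL | (payload & 0x0007FFFFFFFFFFFFULL);
    double d;
    std::memcpy(&d, &bits, sizeof d);
    return d;
}

uint64_t bits_of(double d) {
    uint64_t b;
    std::memcpy(&b, &d, sizeof b);
    return b;
}

int main() {
    volatile double a = qnan(0x111), b = qnan(0x222);
    // "shall": both products are quiet NaNs.
    // "should": each payload is recommended to come from an input,
    // but which input is unspecified, so a*b and b*a may differ
    // bitwise (on x86 SSE the first operand's payload typically wins).
    std::printf("a*b: %016llx\n", (unsigned long long)bits_of(a * b));
    std::printf("b*a: %016llx\n", (unsigned long long)bits_of(b * a));
}
```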
I'm wondering if we couldn't re-think "bit" as the computer-science usage instead of the thing that goes in the horse's mouth, and what it would mean for an AI agent to "champ at the bit"?
What new sayings will we want?
a * b = b * a holds for all "normal" (non-NaN) floating-point numbers.