Unfortunately this website doesn't talk much about the shortcomings of DEC64. Ju...

windsurfer · on June 23, 2019

I checked out the reference implementation and although there are some heuristics to make comparing numbers faster, you're completely right that the worst case scenario is quite complicated: https://github.com/douglascrockford/DEC64/blob/master/dec64....

bonzini · on June 23, 2019

That code is certainly not written by someone with x86 optimization experience. Using high-byte registers (e.g. ah which he rewrites as r0_h) is an awful idea on any Pentium Pro or more recent processor.

pwdisswordfish2 · on June 23, 2019

Alternatively: it was written by someone with obsolete x86 optimization experience.

oso2k · on June 23, 2019

IEEE754 also has multiple representations along with imprecise representations of integers, it’s limits (+Inf, -Inf, NaN) and simple decimals (0.1 + 0.2 != 0.3). For now, the biggest benefit is that we have hardware acceleration for IEEE754 everywhere, not that it is particularly good for any use case or that it is without weakness. But when we get FPGAs in commodity CPUs, the pros and cons value of IEEE754 change rapidly. Custom arithmetic formats could become more common.

saagarjha · on June 23, 2019

And it’s reasons like this why I think IEEE 754 is beautiful in its own right.

stilley2 · on June 23, 2019

The nan==nan bothered me

kazinator · on June 23, 2019

nan != nan severely bothers me, when it's the same object on both sides with exactly the same bit pattern. It's the work of imbeciles who are either unfamiliar with the concept of identity, or else don't understand why it's important and should be respected.

hakfoo · on June 23, 2019

It feels like an argument between useful and correct.

Correct is "These two items are not numbers, so the numeric concept of equality can't be conclusively applied." This argument gives you NaN != NaN.

Useful is "We consider everything that's escaped the plane of numbers to be equivalent because they all require the same special handling." This gives you Nan = NaN.

kazinator · on June 23, 2019

> This argument gives you NaN != NaN.

That looks conclusive to me, though.

> This gives you Nan = NaN.

Problem is, there are multiple representations identified as "NaN". The question whether one NaN should be considered equal to another NaN is different from whether a NaN should equal to itself.

Whether a NaN is equal to a different NaN depends on the kind of equality; some equalities don't care about certain differences.

The C == equality allows the integer 1 to equal 1.0, in spite of a different representation; it is not simply a bitwise equality. (This happens by way of conversion of 1 to 1.0).

However, a thing should equal to itself under every applicable equality. That is non-negotiable.

There is a sense in which eq(x, x) can be false, if we consider argument passing to work by copy, and argument position to be part of identity. Then x is not x, because one is the left x and one is the right x they are in different stack locations or machine registers or whatever. Common Lisp allows for this kind of chicanery in its definition of "implementation identity" (embodied in its eq function). Numbers are not required to be eq, even to themselves. (eq 0 0) could yield false, the idea being that each copy of 0 could be a distinct object. Though that sort of stinks, it's not what is at pay here. These NaNs have type double, and ordinary values of that type are equal to themselves when copied to different argument positions of the equality operation.

zeroxfe · on June 23, 2019

Mentioned on the site:

> There are 255 possible representations of zero. They are all considered to be equal.

nickodell · on June 23, 2019

It's more than just zero. Every number can be represented in multiple ways in DEC64. For example, you could represent 3 as

3 x 10^0

or

30 x 10^-1

Veedrac · on June 23, 2019

No, most numbers have only a single representation. A tenth have two, a hundredth have three, etc.

lvh · on June 23, 2019

Small numbers with many representations are much more common than the full width ones.

Veedrac · on June 23, 2019

This is true, but hardly important. Multiple representations hurt because they waste space, not because they're transiently evil.

If you're scared of normalizing before comparisons, or somesuch, nothing stops you normalizing around operations instead, like standard binary floats. Only you'd probably be shooting yourself in the foot, since you'd probably end up normalizing more and making the ‘integers are fast’ claim fail.

hedora · on June 23, 2019

If you have to normalize DEC64 values, then the whole point of this format (1 cycle overhead for common “integer” operations on x64) evaporates.

(In fairness, for small integers, the operators over normalized values are closed — they always return normalized values)

Veedrac · on June 23, 2019

That was my point, yes. Being unnormalized is a good thing. IEEE does the same thing for their decimals.

> (In fairness, for small integers, the operators over normalized values are closed — they always return normalized values)

Normally normalization refers to using the smallest possible exponent; in this light small integers are unnormalized.