The Desert of the Reals: Floating-Point Arithmetic on Deterministic Systems

N. E. Davis ~lagrev-nocfep
Urbit Foundation

Abstract

Floating-point calculations are critical to a number of special domains in modern computing, including machine learning, graphics, and scientiﬁc computing. Numerical calculations are particularly susceptible to opaque and system-local optimizations, which can break certain guarantees for deterministic computers. We consider the background and implementation of ieee 754 ﬂoating-point arithmetic and options for implementing mathematics compatibly with fully reproducible and portable computing. We consider hardware-based and software-based proposals.

1 Introduction
2 A Derivation of the Real Numbers
2.1 ieee 754 Basics
3 Urbit’s Implementation of ieee 754
4 Deterministic Computation with a Fractional Part
4.1 Hardware-supported ﬂoating-point arithmetic
4.1.1 Control the stack
4.1.2 Simulate the hardware
4.1.3 Support a single hardware platform
4.1.4 Dock ﬂoating-point results
4.1.5 Consistency checks
4.2 Software-deﬁned ﬂoating-point library
4.3 Opaque calculations
4.4 Stored results
4.5 Proscribing ieee 754
4.6 Irregularities
5 Linear Algebra in Hoon
6 Conclusion
References

1 Introduction

Floating-point operations are a technically complex subject and the extent to which developers or source code alter or test this information will depend on many factors. Apart from the general exhortation to developers to be careful and to make sure they know what they are doing, there is little of practical use that can be recommended. ( Jones (2008), p. 197)

Modern digital computers deal, at their root, in binary representation, entirely zeros and ones.¹ These are often formally considered to be whole numbers in a number base of two. However, numerical calculations very frequently require the use of numbers with a fractional part to adequately represent the elements of a computation.

Early numeric computing tended to focus on problems of interest to military and national security applications, such as the solution of diﬀerential equations and numerical optimization. Such calculations typically involve arrays, and linear algebra was elaborated hand-in-hand with digital computing techniques in software and hardware. Numerics assumed prominence for a wider audience with the rise of gaming on personal computers, although these algorithms emphasized speed over exactness.² To this point in the history of computing, most software either ran on a single platform for its lifetime (as with supercomputing) or did not require portably deterministic algorithms (as with gaming).³

On the other hand, deterministic computing describes the ability of a given computation to be reproduced exactly. Such reproducibility permits referential transparency and more powerful reasoning about a program’s results and dependencies. For Urbit as a state machine, this includes the requirement that event log replay be portable across platforms and yield the same result. Conceptual guarantees must be backstopped by actual implementation guarantees for determinism to hold.

2 A Derivation of the Real Numbers

Binary computer values are at root easily represented as non-negative integers. However, it is frequently convenient when working with human applications to either use other numeric bases (notably decimal and hexadecimal) or to permit non-integer mathematics.

In the historical development of mathematics, logical problems in each set of numbers drove the discovery and elaboration of richer algebras. For instance, in the set of natural numbers $\mathbb {N}$, the operation of addition $+$ or multiplication $\times $ produces a value within the set; however, permitting subtraction $-$ of a larger number from a smaller number can result in a value inexpressible in $\mathbb {N}$. This motivated the introduction of the integers $\mathbb {Z}$, augmenting the numbers from zero to (positive) infinity with the negative numbers. Division $/$ similarly produced a crisis when applied to values which did not have a whole-number ratio between them, a situation resolved by the Pythagorean⁴ innovation of the rational numbers or fractions as a class $\mathbb {Q}$. Ultimately, the common reference set for engineering mathematics (and the human understanding of the continuum such as measurement) is the set of real numbers. The set of real numbers, denoted by $\mathbb {R}$, is characterized by its continuity: it has no gaps, so that between any two distinct values, however small their difference, there lies another value in the set.

Since the operations and conventions of $\mathbb {R}$ have been found to be so useful, it is desirable to extend the semantics to computer programming. However, digital computers, by virtue of their binary representation, eﬀectively use natural numbers $\mathbb {N}$ to represent numbers (to the limit of memory rather than positive inﬁnity $+\infty $). Several schemes permit a computer integer to be interpreted as if it were a number with a fractional part, including a scaling factor, ﬁxed-point representation,⁵ and pairs as rational numbers.

The basic concept of ﬂoating point arithmetic is that it permits the representation of a discrete subset of $\mathbb {R}$ by composing a signiﬁcand, a base, and an exponent. The signiﬁcand is the set of signiﬁcant digits, possibly including the sign; the base is the understood number base (typically 2); and the exponent is the power to which that base is put before multiplying by the signiﬁcand to yield the result.⁶ The most ubiquitous ﬂoating-point format today is deﬁned by the ieee 754 standard, but certain hardware platforms such as gpus utilize alternative ﬂoating-point arithmetic representations.⁷

To summarize, given an abstract description of a ﬂoating point system, there are several practical implementations that can be derived. We need to specify at least four quantities: sign,⁸ signiﬁcand, base, and exponent.⁹ The base is presumably ﬁxed by the protocol, leaving three free values for the implementation to economically encode.

2.1 IEEE 754 Basics

Early computer systems with floating-point units chose bespoke but incompatible representations, ultimately leading to the ieee 754 standard (primarily architected by William Kahan). ieee 754 reconciled considerations from many floating-point implementations across hardware manufacturers into an internally consistent set of fixed-width representations.¹⁰ For instance, the 32-bit “single precision” C float/Fortran REAL*4 specification denotes particular bit positions as meaningful,

SEEE.EEEE.EFFF.FFFF.FFFF.FFFF.FFFF.FFFF

where S is the sign bit, 0 for positive ($+$) and 1 for negative ($-$); E is the exponent in base-2 (8 bits); and F is the significand (23 bits). The exponent is stored with an offset bias of 127 ($2^{7} - 1$) so that both large and small orders of magnitude can be covered without a separate exponent sign. The significand has an implied leading 1 bit unless the exponent bits are all zero. To wit,

$(-1)^{\texttt {S}} \times 2^{\texttt {E}-127} \times 1.\texttt {F}$
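As a minimal sketch of this layout (in Python, using only the standard `struct` module), the three fields can be pulled out of a bit pattern and recombined per the formula. The helper names `decode_f32` and `value_from_fields` are ours for illustration, and the reconstruction assumes a normal number (exponent field neither all zeros nor all ones).

```python
import struct

def decode_f32(x):
    """Split a single-precision value into its (S, E, F) bit fields."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    s = bits >> 31            # 1 sign bit
    e = (bits >> 23) & 0xFF   # 8 exponent bits, biased by 127
    f = bits & 0x7FFFFF       # 23 stored significand bits
    return s, e, f

def value_from_fields(s, e, f):
    """Evaluate (-1)^S * 2^(E-127) * 1.F for a normal number (0 < E < 255)."""
    return (-1) ** s * 2.0 ** (e - 127) * (1 + f / 2 ** 23)

s, e, f = decode_f32(0.15625)
print(s, e, hex(f))
print(value_from_fields(s, e, f))
```

The value 0.15625 is $1.25 \times 2^{-3}$, so the stored exponent is $127 - 3 = 124$ and the stored significand bits encode the fractional part $0.25$.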

ieee 754 speciﬁes operations between numbers, including of diﬀerent magnitudes. The standard dictates behavior and provides outlines for arithmetic, but leaves algorithmic details to the implementation. Numbers are normalized by adjusting the exponent of the smaller operand and aligning the signiﬁcands, then the operations are carried out. In practice, extended precision values are used in the intermediate steps of many algorithms, leading to greater accuracy than would otherwise be expected.¹¹

Since the ieee 754 ﬂoating-point format packs values of diﬀerent kind together bitwise, conventional integer operations such as left shift $<<$ and addition $+$ do not trivially apply.¹²

Floating-point addition (add) proceeds per the following algorithm:

1. Compare exponents of the two numbers. Shift the smaller number rightwards until its exponent matches the larger exponent.
2. Add the significands together.
3. Normalize the sum by either shifting right and incrementing the exponent, or shifting left and decrementing the exponent.
4. If an overflow or an underflow occurs, yield an exception.
5. Round the significand to the appropriate number of bits.
6. Renormalize as necessary (back to step 3).
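The steps above can be sketched in Python with integer arithmetic. This is an illustrative simplification, not the algorithm of any particular fpu or library: it handles only positive normal single-precision inputs, always rounds to nearest with ties to even, and (because Python integers are unbounded) aligns by shifting the larger significand left rather than shifting the smaller one right past a fixed number of guard bits. The names `split_f32` and `add_f32` are hypothetical.

```python
import math
import struct

def split_f32(x):
    """Return (exponent, significand) with x == significand * 2**exponent."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    e = (bits >> 23) & 0xFF
    f = bits & 0x7FFFFF
    if bits >> 31 or not 0 < e < 255:
        raise ValueError("sketch handles positive normal numbers only")
    return e - 127 - 23, f | (1 << 23)   # restore the implied leading 1

def add_f32(x, y):
    (ex, sx), (ey, sy) = split_f32(x), split_f32(y)
    if ex < ey:
        ex, sx, ey, sy = ey, sy, ex, sx
    # Step 1: align exponents. Shifting the larger operand left is equivalent
    # to shifting the smaller one right while keeping every shifted-out bit.
    sx <<= ex - ey
    exp = ey
    # Step 2: add the significands.
    total = sx + sy
    # Steps 3 and 5: normalize to 24 bits, rounding to nearest, ties to even.
    extra = total.bit_length() - 24      # always >= 1 for two normal inputs
    dropped = total & ((1 << extra) - 1)
    half = 1 << (extra - 1)
    total >>= extra
    exp += extra
    if dropped > half or (dropped == half and total & 1):
        total += 1
    # Step 6: rounding may carry out of 24 bits, so renormalize.
    if total == 1 << 24:
        total >>= 1
        exp += 1
    # Step 4: report overflow past the largest finite exponent.
    if exp + 23 > 127:
        raise OverflowError("result exceeds single-precision range")
    return math.ldexp(total, exp)

print(add_f32(1.0, 2.0 ** -24))   # the tie rounds to even, leaving 1.0
```

Underflow cannot occur in this restricted sketch, since the sum of two positive normal numbers is at least as large as either operand.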

ieee 754 ﬂoating-point arithmetic and its predecessors have some signiﬁcant mathematical compromises even in its formal speciﬁcation. For instance, as a result of the discrete nature of the bitwise representation in E and F, ﬂoating-point mathematics are actually a subset of discrete mathematics masquerading as real mathematics. This has non-trivial consequences for certain aspects of calculations, including error accrual. In particular, three facts dominate the resolution:

The distance between two adjacent values changes based on the magnitude of the exponent and the distance from zero. (The signiﬁcand resolution stays the same but the exponent changes.)
There is a relative approximation error for a given bitwidth in ieee 754, called the machine epsilon.¹³
Operations between numbers of diﬀerent magnitudes are particularly aﬀected by their relative numerical horizon.

Variable precision and truncation error. For most values of the exponent E, the difference between two adjacent discrete values is determined by the least significant bit of the significand F. The difference between serial values is
\begin{align}
\Delta \texttt{F} &= \texttt{000.0000.0000.0000.0000.0001}_{2} \nonumber \\
&= 1.00000011920928955078125 - 1.0 \nonumber \\
&= 0.00000011920928955078125 \nonumber \\
&= 2^{-23}\textrm{.}
\end{align}

This is multiplied by the power of two selected by the exponent E less the bias, $2^{\texttt {E}-127}$, meaning that the difference between subsequent values changes with each exponent value. (Figure 1 represents this schematically.)

Figure 1: Schematic representation of granularity and variable precision of ﬂoating-point values and their relationship to the (continuous) set of real numbers.

However, for normalized numbers, or numbers that are left-shifted or right-shifted in order to carry out a calculation, values are determined by the relative shift in exponent $\Delta \texttt {E}$. For an unbiased exponent of 2 ($\texttt {E} - 127 = 2$), for instance, the difference between serial values is $2^{-23} \times 2^{2} = 2^{-21}$. This variable precision means that the absolute resolution of floating-point values varies across the range of representable numbers when operations take place. Operations between two numbers of fairly different magnitudes (and hence resolutions) are particularly vulnerable to accuracy loss, although some numerical techniques can be employed to mitigate this.
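The change in spacing with magnitude can be observed directly by stepping the bit pattern of a single-precision value by one. The following Python sketch assumes positive finite inputs below the largest representable value; `next_up_f32` and `spacing_f32` are illustrative names.

```python
import struct

def next_up_f32(x):
    """Next representable single-precision value above a positive finite x."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits + 1))[0]

def spacing_f32(x):
    """Gap between x and its upper neighbour."""
    return next_up_f32(x) - x

for x in (1.0, 4.0, 1024.0):
    print(x, spacing_f32(x))
```

The gap is $2^{-23}$ at 1.0, $2^{-21}$ at 4.0, and $2^{-13}$ at 1024.0: the significand resolution is constant while the exponent scales it.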

Truncation error results from terminating repeating “binaries” (by analogy with “decimals”). Just as $\frac {1}{3} = 0.\bar {3} = 0.3333\ldots {}$ can only be written to finite precision in base-10, numbers that are not finite sums of powers of two (such as $0.1$) result in repeating binary fractions. These necessarily terminate at the resolution of the significand. The number and nature of truncation and rounding can significantly affect the accuracy of floating-point arithmetic and algorithms (Izquierdo and Polhill, 2006).
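A short Python illustration, using the standard `fractions` module and Python's native 64-bit floats (rather than the 32-bit format discussed above), shows the exact value that is stored when a repeating binary fraction is cut off:

```python
from fractions import Fraction

stored = Fraction(0.1)            # exact rational value of the double nearest 0.1
print(stored)                     # numerator / power-of-two denominator
print(stored == Fraction(1, 10))  # False: the repeating expansion was cut off
print(Fraction(0.75) == Fraction(3, 4))  # True: 0.75 = 2**-1 + 2**-2
```

The stored denominator is always a power of two, which is why 0.75 survives exactly while 0.1 does not.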

Machine epsilon. The machine epsilon, or smallest difference from 1.0 that can be represented, is determined by the precision of the floating-point representation. The machine epsilon for a particular bit width is determined by setting two to the negative power of the number of bits of precision in the significand, less one to account for the implicit leading 1 bit; for 32-bit single-precision float, with 24 bits of precision (23 of them stored), this is $2^{-(24-1)} = 2^{-23}$. Differences from 1.0 smaller than this cannot be represented in this bit width.
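As a hedged check in Python, single-precision rounding can be imitated by a round trip through `struct` (the helper `f32` is ours). An increment of $2^{-23}$ survives next to 1.0, whereas half that increment is absorbed under the default round-to-nearest, ties-to-even mode:

```python
import struct

def f32(x):
    """Round a Python float to the nearest single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

eps = 2.0 ** -23
print(f32(1.0 + eps) > 1.0)        # True: distinguishable from 1.0
print(f32(1.0 + eps / 2) == 1.0)   # True: the tie rounds back to 1.0
```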

Sequence ordering. In situations in which floating-point operations may occur in different orders, even the basic guarantee of associativity breaks. For instance, in 64-bit floating point arithmetic, the following holds true (example in Python):

In [1]: (1.1-0.3)-0.8
Out[1]: 0.0

In [2]: (1.1-0.8)-0.3
Out[2]: 5.551115123125783e-17

This occurs since operands of different magnitude can affect the resulting significand, a sort of horizon of resolution leading to differences in the outcome. Sequence order can be changed (and thus the result altered) by many common circumstances, including compiler optimizations, race conditions, and parallelization.

Another problem in numerical analysis, error accrual, is likewise due to the horizon of resolution. Error accrues when summing sequences of numbers (whether in parallel or serially), since the error term can grow as $n$. Kahan–Babuška compensated summation can be used to track a separate error term (de facto extending the precision during the operations) and add it back in to the sum before yielding the final result (Kahan (1965), Babuška (1969)).
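A minimal sketch of compensated summation in Python follows. It implements the classic Kahan variant, which folds the correction back in at every step, and compares it against a naive running sum. Python's `math.fsum` serves as the correctly rounded reference; the function names are ours.

```python
import math

def naive_sum(values):
    total = 0.0
    for v in values:
        total += v
    return total

def kahan_sum(values):
    total = 0.0
    comp = 0.0                   # running estimate of the lost low-order bits
    for v in values:
        y = v - comp             # apply the correction to the next term
        t = total + y            # low-order bits of y may be lost here
        comp = (t - total) - y   # recover what was lost
        total = t
    return total

values = [0.1] * 1_000_000
exact = math.fsum(values)
print(abs(naive_sum(values) - exact))
print(abs(kahan_sum(values) - exact))
```

Note that compensated summation relies on the compiler or interpreter not algebraically simplifying `(t - total) - y` to zero, which is itself an instance of the reordering hazard discussed here.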

Formally non-associative for the above reasons (although individual additions and multiplications remain commutative), floating-point arithmetic can break our mathematical intuitions in interesting ways. However, this is a consistent and well-understood phenomenon. For our purposes as designers of deterministic computers, the most damning indictment has to do not with ieee 754 itself but with manufacturer deviation in hardware implementation. In 1997, William Kahan himself complained (justly) about the compromises inherent in the standard for compiler implementers:

Most computer linguists ﬁnd ﬂoating-point arithmetic too disruptive [due to] [t]heir predilection for “referential transparency” …. Computer linguists also dislike functions with side-eﬀects and functions aﬀected by implicit variables not explicit in argument lists. But ﬂoating-point operations can raise ieee 754 exception ﬂags as side-eﬀects, and operations are aﬀected implicitly by exception-handling and rounding modes eligible at run-time according to ieee 754. Alas, that standard omitted to bind ﬂags and modes to locutions in standard programming languages, and this omission grants computer linguists a licence for inaction. ( Kahan, 1997)

There are several sources of trouble for even single-threaded deterministic computation using hardware ieee 754 ﬂoating-point units (fpus):¹⁴

Optional, discretionary, or advisory aspects.
Gaps or omissions in the speciﬁcation.
Failure to implement the speciﬁcation exactly.
Out-of-sequence computations.

Optional aspects. Several aspects of ieee 754 are optional or advisory, including:

Exception handling means that the hardware may specify rounding via an overﬂow ﬂag.
Extended-precision formats are not a huge deal to leave out, but extended-precision arithmetic (used for intermediate results) can materially change results.
Subnormals are optional;¹⁵ some platforms may ﬂush them to zero or (worse) allow subnormal support to be disabled in certain cases.¹⁶

Omissions. Whether something is a gap or optional is something of a philosophical question for us, but some parts are underspeciﬁed in a way that makes portability impossible. E. g., mixed-precision operations can yield unpredictable results depending on the compiler and hardware. This is a function of rounding modes, precision loss, precision of intermediate results, and the presence or absence of dedicated hardware support for certain precision combinations.

Inexact implementation. Failure to implement ieee 754 correctly may happen inadvertently, as with the Pentium fdiv bug in the 1990s ( Edelman, 1997). Alternatively, chipset designers may deviate from the speciﬁcation for reasons of performance or limitations in the architecture.

For instance, ieee 754 deﬁnes a range of numbers as “not-a-number” values, or NaNs. Per the speciﬁcation, a NaN can be a signalling NaN, meaning that it intends to ﬂag and possibly disrupt a problematic computation;¹⁷ or a quiet NaN, which does not raise such an exception and merely yields a result with the NaN propagated to the ﬁnal result.¹⁸ Not all processors implement this part of ieee 754 correctly: “The Motorola dsp563ccc does not support NaNs or inﬁnities. Floating-point arithmetic operations do not overﬂow to inﬁnity; they saturate at the maximum representable value” (Jones (2008), p. 338).

As a further example, fused multiply-add (fma) ($a \times b + c$) is implemented on certain hardware to favor double operations and not quadruple-precision operations ( Kahan (1997), p. 5).

Out-of-sequence computations. A modern compiler using optimization ﬂags or even modest parallelism can easily cause a ﬂoating-point calculation to rely on operands that were produced in an order diﬀerent than that speciﬁed in the code. This is largely opaque to the programmer, aside from some simple heuristics, and makes it diﬃcult to reproduce or reason about the ﬁne details of computations.

As demonstrated above, out-of-sequence or resequenced computations can aﬀect results due to rounding behavior and the “numerical horizon” which results between values. These can happen due to multithreaded computation or an optimizing compiler.

Rounding mode. ieee 754 ﬂoating-point operations take place using one of several rounding modes, for instance,

Round to nearest, ties to even. Set ties to the last bit as zero (even). The default.
Round to zero. Truncate, eﬀectively rounding positive numbers down and negative numbers up.
Round away from zero. The opposite of truncation, effectively rounding positive numbers up and negative numbers down.
Round toward positive inﬁnity. Up regardless of sign.
Round toward negative inﬁnity. Down regardless of sign.

The rounding mode can aﬀect the result of computations, and if other processes are changing the mode (which can even be set per-thread), results may not be reliably reproducible.
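Python's built-in floats do not expose the binary rounding mode, so the following sketch uses the standard `decimal` module as a decimal analogue of the five modes listed above. The mapping of mode names to `decimal` constants is our own pairing, and the example rounds to whole numbers rather than to a binary significand.

```python
from decimal import (Decimal, ROUND_HALF_EVEN, ROUND_DOWN, ROUND_UP,
                     ROUND_CEILING, ROUND_FLOOR)

MODES = {
    "nearest, ties to even": ROUND_HALF_EVEN,
    "toward zero": ROUND_DOWN,
    "away from zero": ROUND_UP,
    "toward positive infinity": ROUND_CEILING,
    "toward negative infinity": ROUND_FLOOR,
}

def round_all(text):
    """Round a decimal string to an integer under every mode."""
    value = Decimal(text)
    return {name: int(value.quantize(Decimal("1"), rounding=mode))
            for name, mode in MODES.items()}

print(round_all("2.5"))
print(round_all("-2.5"))
```

The same input yields 2 or 3 (and -2 or -3) depending only on the mode, which is why a mode changed elsewhere in the process can silently alter results.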

“Obtaining the correctly rounded result of an addition or subtraction operation requires an additional bit in the signiﬁcand (as provided by the iec 60559 guard bit) to hold the intermediate result” ( Jones (2008), p. 65).

3 Urbit’s Implementation of IEEE 754

Urbit implements a subset of ieee 754 functionality in /sys/hoon, the Hoon language speciﬁcation. The Nock operations formally take place on integers. In practice, we could imagine several ways of implementing such operations: bitmasking the integers or breaking them apart into three components, for instance. We take Urbit’s implementation of @rs (single-precision ﬂoat) as representative.¹⁹

++rs is a wrapper core to instrument arithmetic arms like ++add using the ++ff ﬂoating-point functionality core. Ultimately this resolves to breaking out the components (sign, exponent, and signiﬁcand) into separate numbers for the actual operation.²⁰

The ++fn core oﬀers a generalized interface for a superset of ieee 754-style ﬂoating-point implementations, permitting bit width, precision, bias, and rounding mode to be freely speciﬁed.²¹ The actual implementation on +$fn-typed values is rather dense and features numerous rounding and overﬂow checks:

++  add
  |=  [a=[e=@s a=@u] b=[e=@s a=@u] e=?]  ^-  fn
  =+  q=(dif:si e.a e.b)
  |-  ?.  (syn:si q)  $(b a, a b, q +(q))
  ?:  e
    [%f & e.b (^add (lsh [0 (abs:si q)] a.a) a.b)]
  =+  [ma=(met 0 a.a) mb=(met 0 a.b)]
  =+  ^=  w  %+  dif:si  e.a  %-  sun:si
    ?:  (gth prc ma)  (^sub prc ma)  0
  =+  ^=  x  %+  sum:si  e.b  (sun:si mb)
  ?:  =((cmp:si w x) --1)
    ?-  r
      %z  (lug %fl a &)  %d  (lug %fl a &)
      %a  (lug %lg a &)  %u  (lug %lg a &)
      %n  (lug %na a &)
    ==
  (rou [e.b (^add (lsh [0 (abs:si q)] a.a) a.b)])

There is, of course, a feint in the foregoing discussion. Nock is a virtual machine speciﬁcation, and in practice operations that would beneﬁt from more direct expression in C are jetted.²² Thus the actual call in this case will correspond to some C code using the SoftFloat library:²³

u3_noun u3qet_add(u3_atom a, u3_atom b, u3_atom r) {
  union sing c, d, e;
  // set IEEE 754 rounding mode
  _set_rounding(r);
  // unwrap nouns into C-typed values
  c.c = u3r_word(0, a);
  d.c = u3r_word(0, b);
  // perform addition and unify NaN
  e.s = _nan_unify(f32_add(c.s, d.s));

  // wrap C value back into noun
  return u3i_words(1, &e.c);
}
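The NaN unification step matters for determinism because many distinct bit patterns are all NaNs. As a sketch of the idea in Python, operating on raw 32-bit patterns: we assume here that unification means collapsing every NaN to one canonical quiet NaN, and the constant `0x7FC00000` and the function names are our illustrative choices rather than details taken from the listing above.

```python
CANONICAL_QNAN_F32 = 0x7FC00000   # assumed canonical quiet NaN pattern

def is_nan_bits(bits):
    """A single-precision NaN has all exponent bits set and a nonzero fraction."""
    return ((bits >> 23) & 0xFF) == 0xFF and (bits & 0x7FFFFF) != 0

def nan_unify_bits(bits):
    """Map every NaN pattern to one canonical pattern; pass others through."""
    return CANONICAL_QNAN_F32 if is_nan_bits(bits) else bits

for pattern in (0x7F800001, 0xFFC12345, 0x7F800000, 0x3F800000):
    print(hex(pattern), hex(nan_unify_bits(pattern)))
```

Without such a step, two platforms could both return "a NaN" yet yield different atoms, since the sign and payload bits of a NaN result are not fully pinned down across hardware.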

Why SoftFloat? Enter, stage left, the problem of platform-portable determinism.

4 Deterministic Computation with a Fractional Part

Non-integer arithmetic is less significant for many of the core operations of Urbit as a personal server platform. However, gaming, machine learning, graphics, and other applications rely on floating-point calculations, preferably as fast as possible. In fact, it is not only applications-oriented processes that rely on determinism: guarantees in cryptography and contractual correctness for web3; verification and validation; accounting and legal compliance; and code correctness analysis all require reproducible determinism.²⁴

Why can’t we just allow diﬀerent results in the last binary places of the signiﬁcand? Philosophically, Urbit holds to the following statements ( ~wicdev-wisryt, 2020):

A10. Correctness is more important than performance.
A12. Correctness is more important than optimality.
A14. Deterministic beats heuristic.
F1. If it’s not deterministic, it isn’t real.

Urbit makes much of avoiding the “ball of mud” “standard software architecture” ( Foote and Yoder, 1999). In this design anti-pattern, a lack of guarantees and predictable behavior leads inevitably to haphazard and illegible software bloat. We can thus understand why Urbit as a platform considers even deviations in the last bit of a signiﬁcand to be threads fraying the edge of sanity ( ~wicdev-wisryt, 2020):

If you do the same thing twice, your computer should react the same way. This is comforting. This is also what makes it easy to reason about and use eﬀectively. If you’re not sure what your computer will do, you’ll be afraid of it and act defensively toward it. This inevitably leads to a big ball of mud.

For most purposes in the broader software world, tightly reproducible precision has not been a high priority. Precision having already been sacriﬁced, the gist of the calculation is more important than the fourth decimal place (e. g. in realtime 3D graphics). This leads to the phrase “implements the ieee 754 standard” being interpreted erroneously to imply full reproducibility ( Figueroa del Cid, 2000).

For example, consider the expression $(a \times b) + c$. If a compiler permits the two operations to be evaluated sequentially (a multiplication followed by an addition), then rounding occurs twice. If a compiler optimizes the operation into an fma, or fused multiply-add, then a single rounding occurs. Peters presents a pathological case for 32-bit single-precision floating-point values: $a = 1.00000011920929$, $b = 53400708$, and $c = -b$. In this case, the two-stage operation rounds the intermediate product to the nearest representable value (a multiple of 4 at this magnitude), discarding most of the contribution of the $0.00000011920929$ component of $a$. Then $c$ is added and the result is $8$. With fma as a single-step operation, the (correct) answer $6.365860462$ is obtained. The optimization is more correct than the naïve route in this case.

However, in another example due to Dawson, fma yields incorrect results: for $a \times b + c \times d$ with $a = c$ and $b = -d$, the answer should be zero, and calculated in two steps will typically be zero. With a fused multiply-add, however, the code becomes fmadd(a, b, c*d), rounding the multiplication of $c$ and $d$ but not that of $a$ and $b$; the answer will likely not be zero.
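Both cases can be reproduced without fma hardware by emulating single precision in Python. In this sketch, which is an emulation under stated assumptions and not how any fpu is implemented, `f32` rounds to single precision via `struct`; the two-step path rounds after the multiply and again after the add; and the fused path computes the exact rational result with `fractions.Fraction` and rounds once. All function names are ours. The final conversion passes through a 64-bit double, which is exact for the values shown but could in principle double-round for other inputs. Dawson's example does not fix particular operands, so the values 0.1 and 0.3 below are our choice.

```python
import struct
from fractions import Fraction

def f32(x):
    """Round to the nearest single-precision value (via a 64-bit double)."""
    return struct.unpack("<f", struct.pack("<f", float(x)))[0]

def mul_add_two_step(a, b, c):
    """Round the product, then round the sum: two roundings."""
    return f32(f32(a * b) + c)

def mul_add_fused(a, b, c):
    """Exact a*b + c followed by a single rounding, as an fma would do."""
    return f32(Fraction(a) * Fraction(b) + Fraction(c))

# Peters: the fused result keeps what the two-step product throws away.
a, b = f32(1.00000011920929), 53400708.0
print(mul_add_two_step(a, b, -b), mul_add_fused(a, b, -b))

# Dawson: a*b + c*d with a == c and b == -d, compiled as fma(a, b, c*d).
a2 = c2 = f32(0.1)
b2 = f32(0.3)
d2 = -b2
cd = f32(c2 * d2)
print(mul_add_two_step(a2, b2, cd), mul_add_fused(a2, b2, cd))
```

In the second case the fused path returns exactly the rounding error of the product $c \times d$, which is nonzero whenever that product is not representable.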

The situation grows more ambiguous across architectures. Jones (2008), p. 346, presents the pathological case of a compliant platform that may use extended precision bits in the calculation of a + b:

#include <stdio.h>

extern double a, b;

void f(void) {
    double x;
    x = a + b;
    if (x != a + b)
        printf("x != a + b\n");
}

In this hypothetical case, “any extended precision bits will be lost in the ﬁrst calculation of a+b when it is assigned to x. The result of the second calculation of a+b may be held in a working register at the extended precision and potentially contain additional value bits not held in x, the result of the equality test then being false.” Higham (2002) provides further examples of pathological cases.

K&R C permitted the compiler to re-order ﬂoating-point expressions by associativity, which could run afoul of our limitations. ansi C (C89), recognizing the issue introduced by this innocuous change, forbade such re-ordering ( MacDonald, 1991). Compiler optimizations (e. g. gcc’s -O3) can bypass this restriction, once again breaking determinism;²⁵ for instance, ﬂoating-point operations can be pipelined, leading to out-of-order execution.

The ﬂy in the ointment for Urbit’s deterministic computing is that jet-accelerated Nock equivalents must reliably produce the same results (both to each other and to Nock) regardless of the runtime on which it is being evaluated. Thus even small irregularities in ﬂoating-point implementations have macroscopic ramiﬁcations for deterministic computing. Any guarantee broken breaks them all, just as it would for a formal correctness proof.²⁶

The challenge of the lack of determinacy for certain critical applications has been acknowledged before, such as by James Demmel and the ReproBLAS team (Demmel et al. (2017), Ahrens, Nguyen, and Demmel (2018)) and by Dawson. Dawson makes much of the effect of rounding modes and the option to disable subnormals, both of which would have major effects on computational reproducibility. The situation is worse for transcendental functions, because there is necessarily truncation and/or rounding error (Dawson, 2013). Even conversion between bases for output and input is not necessarily reproducible, as Dawson continues: “Doing perfect conversions efficiently was an unsolved problem when the original ieee standard came out, and while it has since been solved, this doesn’t mean that everybody does correctly rounded printing.” Koenig (2018) expends much effort on the problem of computing a dot product exactly in hardware, given the contingencies of multicore processors.²⁷

The ﬁeld of debate for possible solutions for implementing ﬂoating-point arithmetic which is portable across platforms includes:

Hardware-supported ﬂoating-point arithmetic.
Software-deﬁned ﬂoating-point library.
Opaque calculations.
Stored results.
Proscribing ieee 754.

We consider each in turn, with its ramiﬁcations for a deterministic computing platform and in particular its prospects for adoption in Nock-based systems.

4.1 Hardware-supported ﬂoating-point arithmetic

As outlined above, execution of code-equivalent ﬂoating-point computations produced from source by diﬀerent compilers on diﬀerent hardware architectures may lead to small diﬀerences in outcome, non-negligible for a deterministic computer. Thus, for this and a constellation of related reasons, hardware-supported ﬂoating-point arithmetic seems to be prima facie unviable for deterministic computing.²⁸

We do not know the field of possible future hardware architectures which Nock as a deterministic computing platform may be called upon to execute.²⁹ Jet-accelerated code should be intelligently robust about its hardware, but Hoon and Nock code should be completely agnostic to the hardware.

That’s the problem. What are some possible hardware-targeted solutions?

Control the compiler and runtime stack top to bottom.
Store a hardware and compiler tag and simulate when not on that platform.
Support only a single hardware for the lifetime of a ship.
Dock ﬂoating-point results.
Check consistency of results.

4.1.1 Control the stack

If you controlled the compiler and runtime execution stack to a suﬃcient degree, could you yield deterministic ﬂoating-point arithmetic from the hardware? “A translator that generates very high performance code is of no use if the ﬁnal behavior is incorrect” ( Jones (2008), p. 189); that is, optimizations often come at the cost of correctness.

To start oﬀ, what must be considered part of the stack in this sense? At a minimum, the compiler and linker toolchain (including ﬂags and options) and the actual runtime must be included. (This explicitly introduces a dependence between Martian software and Earthling software, repugnant to the Urbit ethos.)

We also must decide what the target is. Do we aim for the most portable conﬁguration (as determined by number of consumer or enterprise users)? Do we aim for the “closest” to ieee 754 adherence? Do we aim for simplicity, or compilation speed, or any of a half-dozen other optimizable variables?

For instance, suppose that one intended to use the C keyword volatile to block certain common optimizations on a floating-point value.³⁰ The runtime at the level of Nock does not know if a value is considered floating-point or not. At the level of a jet, the use of volatile can correctly bar certain hardware optimizations, but these need to be carefully enumerated and understood in the light of the other toolchain concerns enumerated in this section. Strictly speaking, volatile only seeks to guarantee that stale calculations are not inadvertently reused due to optimization. Without hardware optimization, the utility of an fpu for fast floating-point computations is questionable. The risk of a jet mismatch remains high, as does the risk of a nonportable jet.³¹

Can the C-defined floating-point environment (as supplied by fenv.h) answer to this need? This affords the ability to specify rounding modes and to access floating-point exception status flags, but it is not clear whether this environmental control portably spans the entire output of floating-point computations.³²

Finally, “[a]n implementation is not required to provide a facility for altering the modes for translation-time arithmetic, or for making exception ﬂags from the translation available to the executing program” ( Jones (2008), p. 200). The information we purport to gain by controlling the stack in the manner above outlined is possibly not even available to the compiler and the runtime executable.

We suggest that deterministically correct stack control in the sense we have described here is impossible for an arbitrary conﬁguration of the modern hardware stack.³³

4.1.2 Simulate the hardware

If you knew what the compiler and execution stack behavior looked like when a calculation was performed, could you reproduce it in software at need on a diﬀerent platform?

Hardware simulation faces some diﬃculties in the same vein as controlling the stack. The proposal yields a combinatorial explosion when considering the combinations of hardware chips, compilers, and compiler ﬂags. Nor is it clear that hardware documentation can be accrued in suﬃcient quantity and detail to guarantee the success of such a project.

The Urbit runtime provides an epoch system, meaning that the event log is separated into snapshots and subsequent events ( ~mastyr-bottec, 2023). This is currently used to monitor the use of old binaries which could potentially have a jet mismatch. It would be moderately straightforward to extend this functionality to record the compilation ﬂags and architecture of that Vere binary, which could be useful in event playback. However, this remains an unsatisfactory solution because it would lead to Urbit runtime instances intentionally producing diﬀerent code (rather than a jet mismatch which would require correction).

4.1.3 Support a single hardware platform

Marriage is a ﬁne institution, but I’m not ready for an institution. (Mae West)

A permanent commitment to a single hardware platform—either for the Urbit platform as a whole or for a particular running instance—could solve the determinism problem. This conﬁguration would be tenable for single-purpose ships with lifetime control (likely moons or comets), but inconvenient for the “hundred-year computer” model touted for planets and superior ranks in Urbit.

To make a lifelong commitment to a particular hardware platform when the lifetime of a deterministic computer is unknown is therefore deemed foolhardy.

4.1.4 Dock ﬂoating-point results

What about trimming ﬂoating-point values of their least signiﬁcant bits? When would this take place—at each step of a multi-step computation? At the level of single-bit rounding errors, this would potentially work, and amounts to selecting a rounding mode towards even (last digit 0). Accrual across multiple calculations could potentially render this unreliable, particularly if diﬀerent computational paths are supposed to lead to the same result and do not as a result of docking.

One could also envision docking more than the last bit. This introduces a step to check and adjust the ﬂoating-point value, and in addition breaks ieee 754 compliance, at which point the attempt to reconcile ieee 754 with determinism has already been abandoned.
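As a minimal sketch of why docking is unreliable, the following Python masks off the low bits of a double's significand. The helper name `dock` is hypothetical, and truncation by masking is only one of several ways docking might be defined.

```python
import struct

def dock(x: float, bits: int = 1) -> float:
    """Clear the `bits` least significant bits of a float64 bit pattern."""
    (raw,) = struct.unpack("<Q", struct.pack("<d", x))
    mask = ~((1 << bits) - 1) & 0xFFFFFFFFFFFFFFFF
    (docked,) = struct.unpack("<d", struct.pack("<Q", raw & mask))
    return docked

# Two algebraically equal sums that land on adjacent doubles.
left = (0.1 + 0.2) + 0.3
right = 0.1 + (0.2 + 0.3)

for n in (1, 2, 3):
    # Report whether docking n bits makes the two paths agree.
    print(n, dock(left, n) == dock(right, n))
```

The two sums differ by a single unit in the last place, yet they straddle a boundary in the low bits, so docking one or two bits leaves them unequal and only docking three bits reconciles them. How many bits suffice depends on the operands, which is the unreliability noted above.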

In general, we cannot assign a high degree of signiﬁcance to ﬁgures beyond the ﬁrst few, but accruals across large data sets (such as large language models) can become signiﬁcant (as attested to by the need for compensated summation).
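The compensated summation mentioned above can be sketched as follows. This is the textbook Kahan algorithm written in Python for illustration, not code taken from Urbit. The function names are hypothetical. The naive sum is written as an explicit loop because some Python versions already compensate inside the built-in `sum`.

```python
import math

def naive_sum(values):
    """Plain left-to-right accumulation; rounding error accrues at each step."""
    total = 0.0
    for v in values:
        total += v
    return total

def kahan_sum(values):
    """Kahan compensated summation: carry the lost low-order part forward."""
    total = 0.0
    compensation = 0.0
    for v in values:
        y = v - compensation            # apply the correction from the last step
        t = total + y                   # low-order bits of y may be lost here
        compensation = (t - total) - y  # recover what was lost
        total = t
    return total

data = [0.1] * 1_000_000
exact = math.fsum(data)                 # correctly rounded sum, for comparison
naive = naive_sum(data)
compensated = kahan_sum(data)
print(abs(naive - exact), abs(compensated - exact))
```

Note that compensation improves accuracy but does not by itself guarantee reproducibility: the result still depends on the order of summation.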

A related technique could pack bits of larger ﬂoating-point values into smaller ones, but this is functionally a software-deﬁned solution (see, e. g., Brun (2018)).

4.1.5 Consistency checks

Another option is to compare Nock and jet code for every computation and only accept the C code if it is “correct”. This immediately runs into a very undesirable characteristic: every ﬂoating-point calculation is run twice, which makes one of the two calculations redundant and destroys any eﬃciency gains from jetting the code.

One could cache ﬂoating-point computations somewhere in the system.³⁴ This is liable to become prohibitively large for systems as every individual ﬂoating point calculation of all time becomes archived against future need.

We conclude that, at the current time, naïve hardware-deﬁned ﬂoating point is not viable for deterministic systems.

4.2 Software-deﬁned ﬂoating-point library

In the absence of a dedicated ﬂoating-point unit (fpu) and ﬂoating-point assembly instructions, ﬂoating-point computations are carried out in software. The value can be decomposed into its bit ﬁelds, operated on, then packed back into a single value of the appropriate type. For instance, prior to the widespread advent of 64-bit consumer hardware, applications that needed double values on 32-bit PC architecture utilized software emulation using two 32-bit numbers together.
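The decompose, operate, repack cycle can be illustrated with a short Python sketch for ieee 754 double precision. The function names are hypothetical, and the sketch handles only normal finite values. A real software library such as SoftFloat must also treat zeros, subnormals, infinities, NaNs, and rounding.

```python
import struct

def unpack_double(x: float):
    """Split a float64 into (sign, biased exponent, fraction) bit fields."""
    (raw,) = struct.unpack(">Q", struct.pack(">d", x))
    sign = raw >> 63
    exponent = (raw >> 52) & 0x7FF
    fraction = raw & ((1 << 52) - 1)
    return sign, exponent, fraction

def pack_double(sign: int, exponent: int, fraction: int) -> float:
    """Reassemble the three bit fields into a float64."""
    raw = (sign << 63) | (exponent << 52) | fraction
    (x,) = struct.unpack(">d", struct.pack(">Q", raw))
    return x

def double_it(x: float) -> float:
    """Double a normal finite value by incrementing its biased exponent."""
    sign, exponent, fraction = unpack_double(x)
    if not 0 < exponent < 0x7FE:
        raise ValueError("sketch handles normal values with headroom only")
    return pack_double(sign, exponent + 1, fraction)

print(unpack_double(1.0))
print(double_it(0.1))
```

Everything here is integer arithmetic on bit fields, which is why a software-defined implementation can be made bit-for-bit portable.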

Urbit’s current solution for ﬂoating-point computation is to utilize a software-deﬁned ﬂoating-point library, the SoftFloat library by Hauser. SoftFloat is an implementation in software of a subset of ieee 754 for ﬁve ﬂoating-point types.³⁵ Urbit statically links the library into its runtime binary so it is always available for Nock to utilize as a jet.

While formally correct, software FP is slower than hardware ﬂoating point, and likely prohibitively slow for many large matrix applications such as llms. (“Correctness is more important than performance.”) Performance is the dolorous stroke against software-deﬁned ﬂoating point. (On the other hand, some early versions of the Apple–ibm–Motorola PowerPC risc architecture did not have dedicated hardware ﬂoating-point units (fpus) or ﬂoating-point assembler instructions at all, requiring full software implementation.³⁶ ) gcc has supported a software ﬂoating-point mode using fp-bit.c ( gnu Project, 2008); this was particularly used to accommodate the PowerPC limitations rather than to provide either speed or determinism (cf. Sidwell and Myers (2006)).

An optimized portable deterministic software library for ﬂoating-point calculations may be a suﬃciently fast solution to meet Urbit’s needs even for vector computations. A diﬀerent avenue worthy of investigation is to take ieee 754 compliant ﬂoating-point values as inputs and outputs, then transform into a local representation for an optimized portable deterministic calculation. For instance, Thall (2007) presents the concept of “unevaluated sums”, a generalized technique for accruing error in situations where additional precision is necessary for accuracy. However, even with an agreed-upon standard library like SoftFloat, it is important to keep in mind that exact ﬂoating-point results for transcendental functions are still not correctly known in many cases.³⁷ This poses a particular problem for functions like $\sin $ which may be calculated by diﬀerent routes in Hoon/Nock and in C/Rust. For the time being, we conclude that Urbit’s discipline requires only using Hoon/Nock implementations of transcendental functions.
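The “unevaluated sum” idea mentioned above rests on error-free transformations. A minimal sketch of one such building block, Knuth's TwoSum, follows. We do not claim it matches Thall's exact formulation, and it assumes round-to-nearest arithmetic without overflow.

```python
from fractions import Fraction

def two_sum(a: float, b: float):
    """Return (s, e) with s = fl(a + b) and s + e equal to a + b exactly."""
    s = a + b
    b_virtual = s - a
    a_virtual = s - b_virtual
    error = (a - a_virtual) + (b - b_virtual)
    return s, error

s, e = two_sum(0.1, 0.2)
# The pair (s, e) represents the exact sum as an unevaluated sum of two floats.
print(s, e)
print(Fraction(s) + Fraction(e) == Fraction(0.1) + Fraction(0.2))
```

The pair `(s, e)` carries more precision than either float alone while using only ordinary ieee 754 additions and subtractions, which is what makes it attractive as an intermediate representation.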

4.3 Opaque calculations

When a request for data is made over the network, one is not certain what the resulting data will be. Their value is epistemically opaque. In Urbit’s event log, the results of network calls are persisted as eﬀects in the modiﬁed state (for successful events).

What if Urbit treated a call that had a ﬂoating-point computation as if it were a network call, that is, as if it were a referentially opaque injection into Urbit’s state? One diﬀerence is that network calls result as side eﬀects from hints to the runtime which then handles the plumbing, as it were, and injects the resulting gift task back into Arvo as if a deus ex machina, from Arvo’s perspective. (It should of course know how to handle such a contingency.) There are two main objections that can be made here:

From the programmer’s standpoint, every ﬂoating-point computation would need to be bundled as if it were a network call, and the result treated as if it were a new move passed back into the kernel. This destroys synchronicity and changes ﬂoating-point computations from lightweight programmer choices into heavy and occasional calls.
The storage of every result of every ﬂoating-point computation could become prohibitively large. Work on large matrices in numerical analysis or machine learning could rapidly balloon the event log since every intermediate state would also become part of the ship’s immutable history.

To the ﬁrst objection, we can point to the current design pattern utilized in scrying (or the request for values from the bound scry namespace). Local scry values (such as values exposed by a system service or vane) are accessed synchronously using the .^ dotket operator. This is straightforward and easy to integrate into a program. Remote scry values must be requested asynchronously from another ship, and return at an indeterminate future time as gifts to be processed in another part of the vane or application.

To the second, we observe that although Urbit is a state machine whose history is part of its state, in practice we can mitigate event log growth either by chopping the event log (storing its state and permitting replay forward from that point) or by tombstoning data which should never be available again.³⁸

In a successful implementation of this scenario, one could imagine distinguishing slower software execution (treated synchronously) from faster hardware acceleration (treated asynchronously).

4.4 Stored results

Instead of repeating computations that have been made in the past, what if we cached the result of all of them, so that any new computations with the same values are guaranteed to result in the same value via a cache lookup instead of a calculation? Urbit uses memoization frequently in Arvo and in the runtime, so this is an aesthetically compatible option; we consider its feasibility.

A recently proposed hardware acceleration technique is to store the results of previous multiplication and division operations in a cache, reusing rather than recalculating the result whenever possible. (Dynamic proﬁling has found that a high percentage of these operations share the same operands as previous operations.) ( Jones (2008), p. 1148)³⁹

On Urbit, this introduces an $O(1)$ average-case/$O(n)$ worst-case cache lookup from a MurmurHash3 hash key calculation (what Urbit calls a ++mug). This must be weighed against the ﬂoating-point algorithm in consideration, as well as what is actually hashed (likely the Nock of the calculation contained in the dynamic hint).
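A result cache of this kind can be sketched in Python as follows. The class `FloatCache` is hypothetical. It relies on Python's built-in dictionary hashing rather than ++mug or MurmurHash3, and it keys on the operation name and operand bit patterns rather than on the Nock of the calculation.

```python
import operator
import struct

def bits(x: float) -> int:
    """Bit pattern of a float64, so that 0.0 and -0.0 hash differently."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]

class FloatCache:
    """Store each (operation, operands) result once and replay it thereafter."""

    def __init__(self):
        self.table = {}
        self.hits = 0
        self.misses = 0

    def apply(self, name, fn, *args):
        key = (name, tuple(bits(a) for a in args))
        if key in self.table:
            self.hits += 1
        else:
            self.misses += 1
            self.table[key] = fn(*args)   # computed once, then pinned
        return self.table[key]

cache = FloatCache()
first = cache.apply("div", operator.truediv, 1.0, 3.0)
second = cache.apply("div", operator.truediv, 1.0, 3.0)   # served from cache
cache.apply("mul", operator.mul, 0.0, 1.0)
cache.apply("mul", operator.mul, -0.0, 1.0)               # distinct key
print(cache.hits, cache.misses, len(cache.table))
```

Keying on bit patterns rather than numeric equality matters because signed zeros and NaN payloads compare unlike their bits. The table also only ever grows, so its size is part of the cost that must be weighed.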

This bears some similarities to aspects of the network call suggestion above, in that the second objection to that one holds here. Event log and state bloat (via the cache) are liabilities. Such a cache would be a feature of the Arvo instance, not the runtime VM. Unlike a truncated event log, the cache must be a permanent feature of the ship’s state rather than a convenience.

“Storing results” could also be met by the use of sparc-style logging. In that hardware platform, suspicious computations are ﬂagged and hashed into a lookup table by site in the originating program. Such events are logged not by timestamp or by computation hash but by callsite in the originating program ( Kahan (1997), p. 6).⁴⁰ Sun implemented this in sparc for “retrospective diagnostics” but the technique could allow a more lapidary operation for Urbit. (Follow-on considerations include whether such computations should now be considered “bound” in a sense like that of the scry namespace.)

4.5 Proscribing IEEE 754

What if the Scylla of ieee 754 is avoided for some Charybdis? We can approach this solution space at two levels: either by sector or entirely.

Proscribe by sector. One solution to the speed-vs.-reproducibility dilemma is to permit hardware-accelerated ieee 754 operations, but only in a veriﬁed subset permissible for jets. This would require careful vetting of the hardware stack and compiler options to deﬁne a permissible subset of ieee 754 operations as “known good”. Coupled with the epoch system, it may be a feasible solution.

What degree of vetting will reliably close the gap between ieee 754 and hardware implementation for any particular operation? Jones (2008), pp. 330ﬀ., and Goldberg (1991) provide careful analyses of the accuracy errors inherent to ieee 754 as a standard, but, owing to the variety of possible scenarios, say little about real compilers and chipsets.⁴¹ Trivially, as demonstrated above in the Python example, $(a + b) + c \neq a + (b + c)$ in general, and even a modest reordering of operations by a zealous compiler optimization can introduce nonportable and thus nondeterministic (in our sense) behavior.

Having identiﬁed an appropriate subset of operations, we may imagine that the use of #ifdef, Autotools’ conﬁgure, and a jetting library may answer to our need. Any jet library would have to be carefully constructed to avoid imposing tight discipline directly on the end user (modal Hoon author). We cannot recommend this path today but do not consider the way to be shut, especially given liberal use of volatile.

In particular, an optimizing compiler may contract a separate multiplication and addition into a single fused multiply-add (fma), which rounds once rather than twice and can therefore change the result. Avoiding this would require some discipline on the part of the jet developer, since code that does not explicitly call fma may yet reduce to it in a compiler pass. A jetting library would be advantageous in this case.
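The effect of fusing can be reproduced without special hardware by simulating the single rounding with exact rational arithmetic. The sketch below is illustrative only: `fused` is a hypothetical stand-in for a hardware fma, and the operands are chosen so that the product needs more than 53 bits.

```python
from fractions import Fraction

def unfused(a: float, b: float, c: float) -> float:
    """Multiply, round, add, round: two roundings."""
    return a * b + c

def fused(a: float, b: float, c: float) -> float:
    """Compute a*b + c exactly, then round once, as an fma would."""
    exact = Fraction(a) * Fraction(b) + Fraction(c)
    return float(exact)

a = 1.0 + 2.0 ** -30
b = 1.0 - 2.0 ** -30
c = -1.0

# a * b is exactly 1 - 2**-60, which rounds to 1.0 as a double.
print(unfused(a, b, c))   # the small term is lost before the addition
print(fused(a, b, c))     # the small term survives the single rounding
```

Both answers are defensible under ieee 754. The difficulty for a deterministic system is that the same source expression may compile to either one, depending on compiler and flags.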

As an example of a refactoring of ieee 754 operations for determinism, consider the ReproBLAS project (last update ~2016.2.21). ReproBLAS seeks to produce a set of reproducible deterministic algorithms reﬂecting the standard operations of blas ( Ahrens, Nguyen, and Demmel, 2018). It accomplishes this by introducing a binned data type and a set of basic operations carefully built on ieee 754 for the objective of completely portable reproducibility.⁴² This is similar to our proposal for a vetted jetting library and may be worth attention, particularly in association with requirements around -O0.

Proscribe by replacement. Finally, we face the possibility of jettisoning decades of ﬂoating-point libraries entirely and forging a new trail. We explicitly omit attempting to implement a new standard as hubristic, but would like to explore some alternatives.

Posits. In 2015, John Gustafson proposed a new standard for representing values drawn from $\mathbb {R}$ called universal numbers or unums ( Gustafson, 2015).⁴³ The current version supports interval arithmetic and greater resolution near 1.0, at the cost of decreased resolution for extremely large and extremely small values. Unums also guarantee associativity and distributivity of operations.

Gustafson’s criticisms of ieee 754 focused on determinism and exactness; underﬂow and overﬂow; ﬁxed bit widths for mantissa and exponent; rounding; and the large wasted block of NaNs ( Risse, 2016). Unums likewise must provide sign, exponent with bias, and signiﬁcand; they may additionally signal whether the value is an interval. Unlike ieee 754’s use of multiple bit widths, 32-bit “posits” (ﬁxed-size unums intended to facilitate hardware requirements) are argued to be suﬃcient for almost all applications.

The unum proposal appears to have settled somewhat after its initial state of relative ﬂux (as Type iii unums, cf. Posit Working Group (2022)). Although most implementations have been in software, the project has been speciﬁed in Verilog several times and implemented on fpgas ( Chen, Al-Ars, and Hofstee (2018), and VividSparks, https://vividsparks.tech/). Intriguingly, some initial work has been carried out towards a fundamental blas-like library built on posits ( van Dam et al., 2019).

A unum/posit implementation for Urbit would be as straightforward as its implementation of ieee 754. For jetting, there is a software library for posits available called SoftPosit based on the SoftFloat library ( Cerlane, 2018). (Until commercial hardware implementations become available, the eﬀect of optimizations on determinism cannot yet be assessed; it is presumed that the situation will be better than ieee 754 given the advantages of a clean slate.)
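To make the posit layout concrete, here is a hedged Python sketch of a decoder for small posits. It assumes the sign, regime, exponent, fraction layout with two exponent bits as we understand it from the 2022 posit standard. The function name and the use of exact fractions are our own choices, not taken from SoftPosit.

```python
from fractions import Fraction

def to_int(bit_list):
    """Interpret a list of 0/1 values, most significant first, as an integer."""
    value = 0
    for bit in bit_list:
        value = value * 2 + bit
    return value

def decode_posit(p: int, n: int = 8, es: int = 2):
    """Decode an n-bit posit pattern to an exact Fraction (None for NaR)."""
    mask = (1 << n) - 1
    p &= mask
    if p == 0:
        return Fraction(0)
    if p == 1 << (n - 1):
        return None                         # NaR, "not a real"
    negative = bool(p >> (n - 1))
    if negative:
        p = (-p) & mask                     # negation is two's complement
    body = [(p >> i) & 1 for i in range(n - 2, -1, -1)]
    run = 1                                 # length of the regime run
    while run < len(body) and body[run] == body[0]:
        run += 1
    regime = run - 1 if body[0] == 1 else -run
    rest = body[run + 1:]                   # skip the regime terminator bit
    exp_bits = rest[:es]
    exp_bits = exp_bits + [0] * (es - len(exp_bits))   # truncated bits read as 0
    frac_bits = rest[es:]
    fraction = Fraction(to_int(frac_bits), 1 << len(frac_bits))
    scale = regime * (1 << es) + to_int(exp_bits)
    magnitude = (1 + fraction) * Fraction(2) ** scale
    return -magnitude if negative else magnitude

for pattern in (0x40, 0x44, 0x48, 0x60, 0x7F, 0x01, 0xC0):
    print(f"{pattern:08b}", decode_posit(pattern))
```

The variable-length regime field is what gives posits more fraction bits near 1.0 and fewer at the extremes of the range.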

Hand-rolled ﬂoats. If ieee 754 presents too many diﬃculties to be viable at high speed, then hand-rolling a custom hybrid hardware–software scheme via bitmasking could be attractive. This returns to the more “Wild West” days before ieee 754’s introduction, but is presaged by the recent introduction of bﬂoat16, TensorFloat-32, and other types designed for machine learning applications. Without access to hardware manufacturers, however, this amounts in the end to software-deﬁned ﬂoating point and seems unlikely to be competitive speedwise. (We cite the idea put forth previously in this article to convert to an intermediate representation for computation, yielding ieee 754 as necessary.)

It may also be worth considering the use of a 3-tuple of sign, exponent, and signiﬁcand (with only software jetting), and leave details of jet implementation to library authors. Hoon provides such a primitive in ++fn, a tuple for base-2 ﬂoating-point arithmetic supplying ﬁelds for sign, a signed exponent without bias, and an arbitrary-precision signiﬁcand.

Fixed-point and $\mathbb {Q}$. A ﬁxed-point representation diﬀers from a ﬂoating-point scheme in that the exponent is ﬁxed by the protocol or metadata and thus only the sign and signiﬁcand need be included in the bit representation. (With an oﬀset, even the sign can be elided.) The advantage of such a scheme is that it aﬀords the beneﬁts of ﬂoating-point mathematics at near-integer operation speeds (e. g. left-shift to multiply by two). One disadvantage is that there is a smallest representable value; this lack of subnormals requires either an underﬂow handler or the possibility of inadvertent division by zero. Fixed-point operations could also be used as intermediates in calculations. (This echoes once again the idea of conversion to an intermediate representation then conversion back out to ieee 754.)⁴⁴
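A fixed-point scheme of this kind can be sketched in a few lines of Python. The Q16.16 layout (16 fractional bits) and the helper names are assumptions chosen for illustration. Python integers are unbounded, so the overflow handling a fixed-width implementation would need is omitted.

```python
FRAC_BITS = 16                 # exponent fixed by convention: values are n / 2**16
ONE = 1 << FRAC_BITS

def to_fixed(x: float) -> int:
    """Convert to fixed point, rounding to the nearest representable step."""
    return round(x * ONE)

def to_float(q: int) -> float:
    return q / ONE

def fx_add(a: int, b: int) -> int:
    return a + b               # plain integer addition

def fx_mul(a: int, b: int) -> int:
    return (a * b) >> FRAC_BITS   # rescale; the shift rounds toward -infinity

a, b, c = to_fixed(0.1), to_fixed(0.2), to_fixed(0.3)
print(fx_add(fx_add(a, b), c) == fx_add(a, fx_add(b, c)))   # integer addition is associative
print(to_float(fx_mul(to_fixed(1.5), to_fixed(2.25))))
print(to_float(to_fixed(1.5) << 1))                         # left shift doubles the value
print(to_fixed(2.0 ** -18))                                 # below the smallest step
```

Because every operation is an integer operation, results are identical on any platform. The price is the fixed resolution: any value smaller than half a step collapses to zero, as in the last line.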

If a rational number scheme is adopted, a variety of implementations are possible, ranging from bitpacked ﬁxed-width integers to pairs of arbitrary-width integers. Reduction to “simplest” values introduces some overhead; fractions are formally an ordered pair $(a, b)$ with $b \neq 0$, but there is an equivalence class of multiples. (That is, if we write $\frac {1}{2}$ as $(1, 2)$, we have also to consider $(2, 4)$, $(3, 6)$, indeed an inﬁnite sequence of such ordered pairs.) Rational numbers are a superset of ﬂoating-point numbers and ﬁxed-point numbers, but accrue processing overhead from dereferencing arbitrary-width integers and from reducing the result of each operation.
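The equivalence-class reduction and the superset relationship can be illustrated with Python's standard `fractions` module. The `normalize` helper is hypothetical and shows only the reduction step.

```python
from fractions import Fraction
from math import gcd

def normalize(a: int, b: int):
    """Reduce the pair (a, b) to its canonical representative, b > 0."""
    if b == 0:
        raise ZeroDivisionError("denominator must be nonzero")
    g = gcd(a, b)
    if b < 0:
        g = -g                 # move the sign into the numerator
    return a // g, b // g

print(normalize(2, 4), normalize(3, 6), normalize(3, -6))

# Rational arithmetic is exactly associative, unlike floating point.
x, y, z = Fraction(1, 10), Fraction(2, 10), Fraction(3, 10)
print((x + y) + z == x + (y + z))

# Every finite float is itself a rational, though not the one its decimal suggests.
print(Fraction(0.5) == Fraction(1, 2), Fraction(0.1) == Fraction(1, 10))
```

Each addition or multiplication implies a gcd computation, and numerators and denominators can grow without bound over a long calculation, which is the overhead referred to above.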

However, deviation from the proscription scheme, even inadvertently, would mean that a ship is considered invalid in a sense equivalent to double-booting or breaking the scry namespace. This option is deemed worth investigation, likely viable, but bearing unknown risks.

4.6 Irregularities

Any approach to modeling real numbers runs the risk that diﬀerent calculation pathways will yield a diﬀerent kind of inexactness in the result. These can be mitigated by some of the approaches suggested above, and also by checking the correspondence of the Hoon code and the underlying jet, particularly for known edge cases in behavior. While Hoon–jet compliance is an open research problem,⁴⁵ we can apply principles of unit testing together with a period of testing Nock and jet compliance.⁴⁶
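Such edge-case checking can be sketched as a small differential test. In this Python illustration `reference_add` plays the role of the slow reference code and `jet_add` plays the role of the fast jet. Both are hypothetical stand-ins rather than Urbit code, and results are compared by bit pattern rather than by numeric equality.

```python
import struct
from fractions import Fraction

def bits(x: float) -> int:
    """Bit pattern of a float64, for bit-for-bit comparison."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]

def reference_add(a: float, b: float) -> float:
    """Stand-in reference: exact rational sum, rounded once to a double."""
    return float(Fraction(a) + Fraction(b))

def jet_add(a: float, b: float) -> float:
    """Stand-in jet: the platform's native addition."""
    return a + b

# Zeros of both signs, smallest subnormal, smallest normal, and ordinary values.
EDGE_CASES = [0.0, -0.0, 5e-324, 2.2250738585072014e-308, 0.1, 1.0, 1e16]

def find_mismatches(reference, jet, cases):
    """Return every operand pair on which the two implementations differ."""
    return [(a, b) for a in cases for b in cases
            if bits(reference(a, b)) != bits(jet(a, b))]

mismatches = find_mismatches(reference_add, jet_add, EDGE_CASES)
print(mismatches)
```

On a platform with correctly rounded double addition, the only pair reported is `(-0.0, -0.0)`: the rational reference loses the sign of zero while native addition preserves it. A mismatch of this kind is invisible to a comparison using `==`, which is why the harness compares bit patterns.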

Jet mismatches have been rare in the current era.⁴⁷ Some jet “mismatches” occur because the runtime raises a diﬀerent error than the corresponding Hoon—these are relatively innocuous. Others may occur because actually diﬀerent results are produced for diﬀerent input. These are grave, and ultimately motivated the introduction of the epoch system so that event log replays can take into account the previous less-perfect jet version in the runtime ( ~mastyr-bottec, 2023).

5 Linear Algebra in Hoon

Lagoon⁴⁸ is an Urbit library to facilitate Hoon-native mathematical operations. It envisions six native types,

%real, an ieee 754 ﬂoating-point value
%uint, an unsigned integer
%int2, a twos-complement signed integer
%cplx, a blas-compatible ordered pair
%unum, a unum/posit value
%fixp, a ﬁxed-precision value

for which %real allows the rounding mode to be speciﬁed; %cplx consists of a pair of two values, real and imaginary parts; and %fixp requires the expected precision.

Lagoon implements an algorithmically correct reference implementation in Hoon with the expectation that /lib/lagoon will be jetted. Operations include basic arithmetic, vector and matrix row/column operations, matrix multiplication, and matrix inversion. The jetting scheme may take advantage of software libraries or appropriate hardware, but must hew to the dictum that “if it’s not deterministic, it isn’t real.”⁴⁹

Lagoon has passed through several implementations and remains in active development. The current implementation is the lagoon branch of the urbit/urbit repository ( Urbit Foundation, 2023).

6 Conclusion

To summarize, the most promising solutions for ﬂoating-point mathematics on Urbit per the above analysis include:

Hardware ﬂoating point on single machine for entire lifetime.
Optimized software ﬂoating point with vetted jetting library.
Opaque calculation as callback.
Cached results by callsite.
Utilizing a subset of ieee 754 in hardware.
Replacing ieee 754 with another approach of suﬃcient speed, ﬁxed-point and unum/posits chief among these.

Several recent eﬀorts on Urbit have encountered diﬃculties in producing reliably deterministic and suﬃciently fast ﬂoating-point calculations on a Nock-based system.⁵⁰ We anticipate that, water ﬁnding its own level, each will adopt a suitable deterministic solution for evaluation in Nock. We do not anticipate these to be the last foundational numerical libraries built on Urbit, but instead among the ﬁrst. Thus we have documented the paths we have explored as an annotated map for future travelers in search of a one true representation for continuous mathematics.

References

: @KloudKoder (2022) “Floating-point rounding mode control prototyping (WebAssembly Issue #1456)”. url: https://github.com/WebAssembly/design/issues/1456 (visited on ~2024.3.10).
: Ahrens, Peter, Hong Diep Nguyen, and James Demmel (2018) “ReproBLAS: Reproducible Basic Linear Algebra Subprograms”. url: https://bebop.cs.berkeley.edu/reproblas (visited on ~2024.3.10).
: Andrysco, Marc, Ranjit Jhala, and Sorin Lerner (2016). “Printing Floating-Point Numbers: A Faster, Always Correct Method.” In: Proceedings of the 43rd Annual acm sigplan-sigact Symposium on Principles of Programming Languages (popl ’16), pp. 555–567. doi: 10.1145/2837614.2837654. url: https://dl.acm.org/doi/10.1145/2837614.2837654 (visited on ~2024.3.10).
: Apple Computer, Inc. (1994). Inside Macintosh: PowerPC System Software. Boston: Addison-Wesley. url: https://developer.apple.com/library/archive/documentation/mac/pdf/PPC_System_Software/Intro_to_PowerPC.pdf (visited on ~2024.3.10).
: Babuška, Ivo (1969). “Numerical stability in mathematical analysis.” In: Information Processing 68, pp. 11–23.
: Brun, Laurent Le (2018) “Making ﬂoating point numbers smaller”. url: https://www.ctrl-alt-test.fr/2018/making-floating-point-numbers-smaller/ (visited on ~2024.3.10).
: C++ Reference (2023) “Floating-point environment”. url: https://en.cppreference.com/w/c/numeric/fenv (visited on ~2024.3.10).
: Cerlane, Leong (2018) “SoftPosit”. url: https://gitlab.com/cerlane/SoftPosit (visited on ~2024.3.10).
: Chen, Jianyu, Zaid Al-Ars, and H. Peter Hofstee (2018). “A matrix-multiply unit for posits in reconﬁgurable logic leveraging (open) capi.” In: Proceedings of the Conference for Next Generation Arithmetic, pp. 1–5.
: Chohra, Chemseddine, Philippe Langlois, and David Parello (2016). “Eﬃciency of Reproducible Level 1 blas.” In: Scientiﬁc Computing, Computer Arithmetic, and Validated Numerics. Ed. by Marco Nehmeier, Jürgen Wolﬀ von Gudenberg, and Warwick Tucker. Berlin: Springer International Publishing, pp. 99–108.
: Citron, D., D. Feitelson, and L. Rudolph (1998). “Accelerating multi-media processing by implementing memoing in multiplication and division units.” In: Proceedings of 8th International Conference on Architectural Support for Programming Languages and Operating Systems (asplos-viii), pp. 252–261.
: Dawson, Bruce (2013) “Floating-Point Determinism”. Random ascii – tech blog of Bruce Dawson. url: https://randomascii.wordpress.com/2013/07/16/floating-point-determinism/ (visited on ~2024.3.10).
: Demmel, James et al. (2017) “A Proposal for a Next-Generation blas”. url: https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdBDvtD5I14QHp9OE/edit#heading=h.jtgipeoidy9 (visited on ~2024.3.10).
: ~dozreg-toplud, K. Afonin (2023) “UrWasm”. url: https://github.com/Quodss/urwasm (visited on ~2024.3.10).
: — (2024). “The urwasm WebAssembly Interpreter Suite on Urbit.” In: Urbit Systems Technical Journal 1.1, pp. 133–150.
: Edelman, Alan (1997). “The Mathematics of the Pentium Division Bug.” In: siam Review 39.1, pp. 54–67. doi: 10.1137/S0036144595293959.
: Figueroa del Cid, S. A. (2000). “A Rigorous Framework for Fully Supporting the ieee Standard for Floating-Point Arithmetic in High-Level Programming Languages.” PhD thesis. New York University.
: Foote, Brian and Joseph Yoder (1999) “Big Ball of Mud”. url: http://laputan.org/mud/mud.html (visited on ~2024.3.10).
: gnu Project (2008) “Software Floating Point”. url: https://gcc.gnu.org/wiki/Software_floating_point (visited on ~2024.3.10).
: Goldberg, David (1991). “What Every Computer Scientist Should Know About Floating-Point Arithmetic.” In: acm Computing Surveys 23.1, pp. 5–48. url: https://dl.acm.org/doi/pdf/10.1145/103162.103163 (visited on ~2024.3.10).
: Gustafson, John L. (2015). The End of Error: Unum Computing. A. K. Peters/crc Press. isbn: 978-1-4822-3986-7.
: — (2017a) “Beyond Floating Point: Next Generation Computer Arithmetic (Stanford Seminar)”. url: https://www.youtube.com/watch?v=aP0Y1uAA-2Y (visited on ~2024.3.10).
: — (2017b) “Posit Arithmetic”. url: https://posithub.org/docs/Posits4.pdf (visited on ~2024.3.10).
: Gustafson, John L. and Isaac Yonemoto (2017). “Beating Floating Point at its Own Game: Posit Arithmetic.” In: Journal of Supercomputing Frontiers and Innovations 4.2, pp. 71–86. doi: 10.14529/jsfi170206.
: Hauser, John R. (2018) “Berkeley SoftFloat Release 3e”. url: http://www.jhauser.us/arithmetic/SoftFloat-3/doc/SoftFloat.html (visited on ~2024.3.10).
: Higham, Nicholas J. (2002). Accuracy and Stability of Numerical Algorithms. 2nd ed. Philadelphia: siam.
: Huﬀman, Carl (2024). “Pythagoras.” In: The Stanford Encyclopedia of Philosophy. Ed. by Edward N. Zalta and Uri Nodelman. Spring 2024. Metaphysics Research Lab, Stanford University.
: ieee (2008). 754-2008 ieee Standard for Floating-Point Arithmetic. Tech. rep. Institute of Electrical and Electronics Engineers. url: https://ieeexplore.ieee.org/document/4610935 (visited on ~2024.3.10).
: iso/iec (2018). iso/iec 9899:2018 Information technology – Programming languages – C. Tech. rep. International Organization for Standardization. url: https://www.iso.org/standard/74528.html (visited on ~2024.3.10).
: Izquierdo, Luis R. and J. Gary Polhill (2006). “Is your model susceptible to ﬂoating point errors?” In: Journal of Artiﬁcial Societies and Social Simulation 9.4. url: https://www.jasss.org/9/4/4.html (visited on ~2024.3.10).
: Jones, Derek M. (2008). The New C Standard: An Economic and Cultural Commentary. url: http://www.knosof.co.uk/cbook/cbook.html (visited on ~2024.3.10).
: Kahan, William (1965). “Further remarks on reducing truncation errors.” In: Communications of the acm 8.1, p. 40. doi: 10.1145/363707.363723.
: — (1997a) “Lecture Notes on the Status of ieee Standard 754 for Binary Floating-Point Arithmetic”. url: https://people.eecs.berkeley.edu/~wkahan/ieee754status/IEEE754.PDF (visited on ~2024.3.10).
: — (1997b) “The Baleful Eﬀect of Computer Languages and Benchmarks upon Applied Mathematics, Physics and Chemistry (John von Neumann Lecture)”. url: https://people.eecs.berkeley.edu/~wkahan/SIAMjvnl.pdf (visited on ~2024.3.10).
: Kharva, Paresh (2020) “TensorFloat-32 in the A100 gpu Accelerates AI Training, hpc up to 20×”. url: https://blogs.nvidia.com/blog/2020/05/14/tensorfloat-32-precision-format/ (visited on ~2024.3.10).
: Koenig, Jack (2018). A Hardware Accelerator for Computing an Exact Dot Product. Tech. rep. ucb/eecs-2018-51. University of California, Berkeley. url: https://www2.eecs.berkeley.edu/Pubs/TechRpts/2018/EECS-2018-51.pdf (visited on ~2024.3.10).
: MacDonald, Tom (1991). “C for Numerical Computing.” In: Journal of Supercomputing 5.1, pp. 31–48. doi: 10.1007/BF00155856.
: ~mastyr-bottec, Matthew Levan (2023) “Epoch System”. url: https://roadmap.urbit.org/project/epoch-system (visited on ~2024.3.10).
: Peters, Christoph (2021). “fma: A faster, more accurate instruction.” In: Moments in Graphics. url: https://momentsingraphics.de/FMA.html (visited on ~2024.3.10).
: Posit Working Group (2022). Standard for Posit™ Arithmetic. Tech. rep. Posit Working Group. url: https://posithub.org/docs/posit_standard-2.pdf (visited on ~2024.3.10).
: Risse, Thomas (2016). “It’s Time for Unums—an Alternative to ieee 754 Floats and Doubles.” In: Proceedings of the Fifth International Conference on Signal & Image Processing (sip-2016), pp. 50–51.
: Sidwell, Nathan and Joseph Myers (2006). Improving Software Floating Point Support. Tech. rep. CodeSourcery. url: https://hashingit.com/elements/research-resources/2006-01-improving_software_floating_point_support.pdf (visited on ~2023.3.9).
: Steele Jr., Guy L. and Jon L. White (1991). “How to print ﬂoating-point numbers accurately.” In: Proceedings of the 1990 acm sigplan Conference on Programming Language Design and Implementation (pldi ’90), pp. 372–389. url: http://kurtstephens.com/files/p372-steele.pdf (visited on ~2024.3.10).
: Thall, Andrew (2007). Extended-Precision Floating-Point Numbers for gpu Computation. Tech. rep. cim-007-01. The University of North Carolina at Chapel Hill. url: http://andrewthall.org/papers/df64_qf128.pdf (visited on ~2024.3.10).
: Tlon Corporation (2020) “The Last Network Breach”. url: https://roadmap.urbit.org/project/last-network-breach (visited on ~2024.3.10).
: Urbit Foundation (2023) “Urbit”. url: https://github.com/urbit/urbit (visited on ~2024.3.10).
: van Dam, Laurens et al. (2019). “An Accelerator for Posit Arithmetic Targeting Posit Level 1 blas Routines and Pair-hmm.” In: Proceedings of the Conference for Next Generation Arithmetic, pp. 1–10.
: Wang, Shibo and Pankaj Kanwar (2019) “BFloat16: The secret to high performance on Cloud tpus”. url: https://cloud.google.com/blog/products/ai-machine-learning/bfloat16-the-secret-to-high-performance-on-cloud-tpus (visited on ~2024.3.10).
: ~wicdev-wisryt, Philip C. Monk (2020a) “Urbit Precepts”. url: https://urbit.org/blog/precepts (visited on ~2024.3.10).
: — (2020b) “Urbit Precepts (Discussion)”. url: https://urbit.org/blog/precepts-discussion (visited on ~2024.3.10).

Footnotes

¹Analog computers may operate on a continuum of value. Computer logic systems may be architected on other numeric bases for their logic, such as the 1837 Analytical Engine’s decimal system and the 1958 Setun’s ternary system.⤴

²This is reﬂected in algorithms such as the “fast inverse square root”, which permits a degree of inaccuracy in exchange for a substantial speedup.⤴

³To be clear, 3D gaming algorithms are deterministic (assuming no random sources are used), but exact error values are often not reproducible across platforms, nor was such portability a design criterion. The reasons for this are discussed below.⤴

⁴A legendary attribution, alas, predicated on the Pythagorean discovery of the irrational numbers as a separate class thereby inducing a crisis ( Huﬀman, 2024).⤴

⁵Hand-in-hand with the development of linear algebra, machines such as eniac and maniac were employed in the 1940s for solving thermonuclear reaction calculations and neutron diﬀusion equations. Under the direction of John von Neumann, it appears that some calculations did experimentally involve a ﬂoating-point scheme, although this was later rejected deﬁnitively in favor of ﬁxed-point arithmetic. See Kahan (1997), p. 3, on ﬂoating-point representations.⤴

⁶Compare so-called “engineering” notation, such as 1e5 for 100,000, which compactly represents the signiﬁcand 1 and the exponent 5 with an understood base of 10 indicated by e and a sign, implicit for positive.⤴

⁷We cite bﬂoat16 ( Wang and Kanwar, 2019) and TensorFloat-32 ( Kharva, 2020), among others.⤴

⁸We could omit the sign by introducing an oﬀset or only allowing positive values.⤴

⁹The exponent has a bias so that it in turn does not need a sign.⤴

¹⁰We ignore the decimal representations introduced in ieee 754-2008, which do not materially change our argument.⤴

¹¹“Each of the computational operations that return a numeric result speciﬁed by this standard shall be performed as if it ﬁrst produced an intermediate result correct to inﬁnite precision and with unbounded range, and then rounded that intermediate result, if necessary, to ﬁt in the destination’s format.” ( ieee, 2008) Note that, per Risse, “there is no indication whether or not a computation with ieee 754 is exact even if all arguments are.” The iso/iec 9899 C standard confesses its own fallibility: “The ﬂoating-point model is intended to clarify the description of each ﬂoating-point characteristic and does not require the ﬂoating-point arithmetic of the implementation to be identical” ( iso/iec (2018), fn. 21).⤴

¹²For instance, left-shifting a ﬂoating-point value does not double it; we leave the mathematics of why as an exercise to the reader.⤴

¹³Note that this is diﬀerent from the smallest representable value for a given bit width; e. g., for 32-bit single-precision ﬂoat the smallest representable value is $\texttt {0000.0000.0000.0000.0000.0000.0000.0001} = 2^{-149} \approx 1.4\times 10^{-45}$.⤴

¹⁴We do not lay blame at the feet of any particular party; the facts are the facts. Indeed, a more recent revision of ieee 754 leads with a call for portability: “This standard provides a discipline for performing ﬂoating-point computation that yields results independent of whether the processing is done in hardware, software, or a combination of the two” ( ieee, 2008).⤴

¹⁵Subnormals are a convention that allows values smaller than the “normal” ieee 754 smallest non-zero value. They permit a graceful underﬂow behavior, and can prevent an unintentional division by zero.⤴

¹⁶“Some processors do not support subnormal numbers in hardware” ( Jones (2008), p. 338). (The risk is that this permits inadvertent division by zero.) Various chipsets solve this exceptional behavior diﬀerently.⤴

¹⁷“C support for signaling NaNs, or for auxiliary information that could be encoded in NaNs, is problematic. Trap handling varies widely among implementations. Implementation mechanisms may trigger signaling NaNs, or fail to, in mysterious ways. The iec 60559 ﬂoating-point standard recommends that NaNs propagate; but it does not require this and not all implementations do.” ( Jones (2008), p. 339)⤴

¹⁸In Urbit, the Vere runtime uniﬁes NaNs, meaning that any bitwise information which may be encoded in the signiﬁcand ﬁeld (the “NaN payload”) is thrown away. The single value that results is called the “canonical NaN”.⤴
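
NaN unification can be sketched on raw 32-bit patterns as follows. This is an illustration only, not the Vere source: the function names are hypothetical, and the choice of `0x7FC00000` (positive quiet NaN with an empty payload) as the canonical single-precision pattern is an assumption made for the example.

```python
# Assumed canonical pattern for this sketch: positive quiet NaN, empty payload.
CANONICAL_NAN32 = 0x7FC00000

EXPONENT_MASK32 = 0x7F800000
SIGNIFICAND_MASK32 = 0x007FFFFF

def is_nan32(bits: int) -> bool:
    """A binary32 pattern is a NaN when the exponent is all ones and the significand is non-zero."""
    return (bits & EXPONENT_MASK32) == EXPONENT_MASK32 and (bits & SIGNIFICAND_MASK32) != 0

def unify_nan32(bits: int) -> int:
    """Replace any NaN pattern with the canonical one; pass every other pattern through unchanged."""
    return CANONICAL_NAN32 if is_nan32(bits) else bits

payload_nan = unify_nan32(0x7FC12345)   # quiet NaN carrying a payload
negative_nan = unify_nan32(0xFFC00001)  # NaN with the sign bit set
infinity = unify_nan32(0x7F800000)      # infinity is not a NaN
one = unify_nan32(0x3F800000)           # 1.0 is not a NaN
```

Because every NaN collapses to one pattern, two computations that produce NaNs with different payloads or signs still yield bitwise-identical results.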

¹⁹The only signiﬁcant variation in the other real types in Hoon arises for quadruple-precision ﬂoating-point values @rq which are represented in the runtime by a pair of uint64_ts.⤴

²⁰The decimal output is produced using the traditional Steele–White Dragon4 algorithm ( Steele and White, 1991). It is worth considering upgrading Hoon from Dragon4 to Errol ( Andrysco, Jhala, and Lerner, 2016) for speed and accuracy.⤴

²¹In practice, of course, Urbit hews to recognized types, but the temptation to design new ﬂoating-point layouts is intriguing.⤴

²²A shorthand for jet-accelerated code.⤴

²³u3 functions are Urbit noun library functions. The sing union is a union of uint32_t and SoftFloat float32_t types.⤴

²⁴Support for ieee 754 is similar to support for the Markdown markup language. Many platforms support a subset of Markdown coupled with platform-speciﬁc extensions. (See also sql.) Internal references, html, inline LaTeX math mode, code block language speciﬁcation, and other features see varying levels of support with GitHub, Pandoc, Obsidian, and other editors and converters.⤴

²⁵This can be mitigated in turn by the use of the volatile designation, but this is suﬃcient to illustrate the problem.⤴

²⁶I was once asked by a retired computer science professor if such guarantees would make things easier. Well, at the end developer level!⤴

²⁷The problem of exactness is not exactly the same as reproducibility, but it is related. Exactness means that the result is the same as if the calculation were carried out to inﬁnite precision and then rounded to the appropriate number of bits. Reproducibility means that the result is the same across diﬀerent platforms. Reproducibility requires identical outcomes from the same inputs, while exactness requires the correct answer regardless of algorithmic path.⤴
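
The distinction can be made concrete with a small sketch in Python double precision (the variable names are illustrative). Two orderings of the same three additions give different results, so the result depends on the algorithmic path; only one of them matches the exact sum rounded once. The exact reference is computed with rational arithmetic from the standard `fractions` module.

```python
from fractions import Fraction

a, b, c = 0.1, 0.2, 0.3

# Two algorithmic paths through the same mathematical sum.
left_to_right = (a + b) + c
right_to_left = a + (b + c)

# Exact reference: sum the stored binary values as rationals, then round once to double.
exact_then_rounded = float(Fraction(a) + Fraction(b) + Fraction(c))

print(left_to_right)       # 0.6000000000000001
print(right_to_left)       # 0.6
print(exact_then_rounded)  # 0.6
```

A platform that always uses the first ordering is reproducible without being exact in this sense; an exact summation would give the same answer regardless of ordering.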

²⁸Although not of grave consequence, the C language (as of C23) does not implement at least two types speciﬁed by ieee 754-2019 and recent predecessors: binary128 quadruple precision and binary256 octuple precision. While neither is a signiﬁcant loss, we also note that Urbit does not currently support a C-style long double type. C’s long double is 80 bits wide on some common consumer hardware, such as the x86-64 architecture, but is 128 bits wide on the 64-bit arm architecture. (The situation is worse for Python, whose numpy.float128 type eponymously advertises itself as quadruple precision but is in fact a regular 80-bit long double.) Some compilers and libraries do support quadruple-precision ﬂoating-point mathematics, such as gcc’s __float128 type. We note that ieee 754 80-bit extended-precision could be implemented using the ++fn core should demand arise.⤴

²⁹Indeed, we do not know the future speciﬁcations which may be implemented to provide approximations of real values in either hardware or software.⤴

³⁰In any case, this assumes a legible and enumerable set of behaviors for volatile which is, alas, not the case. “volatile is a hint to the implementation to avoid aggressive optimization involving the object because the value of the object might be changed by means undetectable by an implementation” (Jones (2008), p. 472). “Actions on objects so declared shall not be ‘optimized out’ by an implementation or reordered except as permitted by the rules for evaluating expressions” (ibid, p. 1500). “The volatile qualiﬁer only indicates that the value of an object may change in ways unknown to the translator (therefore the quality of generated machine code may be degraded because a translator cannot make use of previous accesses to optimize the current access)” (ibid, p. 963). The same author provides examples of C code that is ambiguous in volatile’s semantics, pp. 1290–1291; and undeﬁned in volatile’s semantics, pp. 1482–1483.⤴

³¹“What constitutes an access to an object that has volatile-qualiﬁed type is implementation-deﬁned” (ibid, p. 1488). “Volatile-qualiﬁed objects can also be aﬀected by translator optimizations” (ibid, p. 1490). The C novice may at this point wonder what the intended utility of volatile in fact is: “[a] volatile declaration may be used to describe an object corresponding to a memory-mapped input/output port or an object accessed by an asynchronously interrupting function” (ibid, p. 1499).⤴

³²“The ﬂoating-point environment access and modiﬁcation is only meaningful when #pragma STDC FENV_ACCESS is set to ON. … In practice, few current compilers, such as HP aCC, Oracle Studio, and ibm XL, support the #pragma explicitly, but most compilers allow meaningful access to the ﬂoating-point environment anyway.” ( C++ Reference, 2023)⤴

³³The possibility of circumscribing the set of permissible ieee 754 operations, which may aﬀord a diﬀerent approach to this problem but seems similarly susceptible of shipwreck, is explored in a subsequent section, q.v.⤴

³⁴Indeed, something like this cache system was employed on Sun sparc architecture, as discussed in Section 4.4.⤴

³⁵“The current release supports ﬁve binary formats: 16-bit half-precision, 32-bit single-precision, 64-bit double-precision, 80-bit double-extended-precision, and 128-bit quadruple-precision” (Hauser, 2018).⤴

³⁶“There are several 680x0-based Macintosh computers that do not contain ﬂoating-point coprocessors” ( Apple Computer, 1994); on the other hand, “ﬂoating-point calculations are performed even faster under the … emulator than on a real 680x0-based Macintosh computer,” indicating that optimized software acceleration is possible, modulo chipset versions and tuned libraries. The PowerPC 601, introduced in 1993, had a native fpu.⤴

³⁷To correctly calculate a trigonometric function for double may take over a hundred bits of precision before correct rounding can be determined. Furthermore, the C math.h implementation of $\sin $ may or may not use fsin.⤴

³⁸The memory implications of these are not relevant here, but they take place in diﬀerent arenas: the runtime versus the Arvo noun arena.⤴

³⁹Cf. Jones’ citation of Citron, Feitelson, and Rudolph.⤴

⁴⁰What constitutes “suspicion” is only sparsely elaborated by Kahan in that article.⤴

⁴¹See particularly the note on “Common Implementations” on p. 346 of Jones (2008).⤴

⁴²“Using our default settings, reproducibly summing $n$ ﬂoating point types requires approximately $9n$ ﬂoating point operations (arithmetic, comparison, and absolute value). In theory, this count can be reduced to $5n$ using the new ‘augmented addition’ and ‘maximum magnitude’ instructions in the proposed ieee Floating Point Standard 754-2018.” (Ahrens, Nguyen, and Demmel, 2018)⤴

⁴³See also Gustafson (2017), Gustafson (2017), and Gustafson and Yonemoto (2017).⤴

⁴⁴“One solution to implementing ﬂoating-point types on processors that support ﬁxed-point types is to convert the source containing ﬂoating-point data operations to make calls to a ﬁxed-point library” ( Jones (2008), p. 346). Note that the sense of our current interest is reversed.⤴

⁴⁵Particularly as regards co-generation of Hoon and C/Rust, or formal proofs.⤴

⁴⁶The Vere runtime supports a debugging ﬂag which runs both the Nock and the jet and checks for identical results.⤴

⁴⁷By “current era”, we mean after the last global network breach on ~2020.12.8 ( Tlon Corporation, 2020).⤴

⁴⁸Linear AlGebra in hOON⤴

⁴⁹It is worth a ﬁnal digression to address reproducibility on parallel systems. We do not consider this a design goal for Lagoon at the current time. Operations like reduction take place on a single computer; while jets may in principle utilize parallelism, their points of entry and exit are unique. However, we note that the ReproBLAS project has addressed this issue in the context of reproducible parallelism ( Ahrens, Nguyen, and Demmel, 2018), as have Chohra, Langlois, and Parello (2016).⤴

⁵⁰Cf. UrWasm, which as a WebAssembly implementation directly relies on ﬂoating-point computations ( ~dozreg-toplud, 2023) (see e. g. @KloudKoder (2022) for a discussion of the issues involved, and in particular ~dozreg-toplud, pp. 133–150 in this issue), Quodss/urwasm; /lib/math, implemented purely in Hoon, sigilante/libmath; and Lagoon, which includes ieee 754 operations with its linear algebra library, urbit/numerics, pending development and integration into Urbit.⤴