Wadler's Blog: Haskell equations, thirty-eight years later

10.9.25

Haskell equations, thirty-eight years later

One night, while drifting off to sleep (or failing to), I solved a conundrum that has puzzled me since 1987.

Before Haskell there was Orwell. In Orwell equations were checked to ensure order is unimportant (similar to Agda today). When an equation was to match only if no previous equation applied, it was to be preceded by ELSE. Thus, equality on lists would be defined as follows:

    (==) :: Eq a => [a] -> [a] -> Bool
    [] == []          =  True
    (x:xs) == (y:ys)  =  x == y && xs == ys
    ELSE
    _ == _            =  False

We pondered whether to include this restriction in Haskell. Further, we wondered whether Haskell should insist that order is unimportant in a sequence of conditionals, unless ELSE was included. Thus, equality on an abstract type Shape would be defined as follows:

    (==) :: Shape -> Shape -> Bool
    x == y | circle x && circle y  =  radius x == radius y
           | square x && square y  =  side x == side y
    ELSE
           | otherwise             =  False

In Orwell and early Haskell, guards were written at the end of an equation and preceded by the keyword if or the end of an equation could be labelled otherwise. (Miranda was similar, but lacked the keywords.) Here I use the guard notation, introduced later by Paul Hudak, where otherwise is a variable bound to True.

Sometime two equations or two guards not separated by ELSE might both be satisfied. In that case, we thought the semantics should ensure that both corresponding right-hand sides returned the same value, indicating an error otherwise. Thus, the following:

    plus :: Thing -> Thing -> Thing
    plus x y | zero x     =  y
             | zero y     =  x
    ELSE
             | otherwise  =  ...

would be equivalent to:

    plus :: Thing -> Thing -> Thing
    plus x y | zero x && zero y && x == y    =  x
             | zero x && zero y && x /= y    =  error "undefined"
             | zero x && not (zero y)        =  y
             | not (zero x) && zero y        =  x
             | not (zero x) && not (zero y)  =  ...

Here the code checks that if x and y are both zero then they are the same. (I will consider a refinement to the check for sameness later.) Of course, the compiler would issue code that performs the tests zero x, zero y, and x == y at most once.

We didn’t pursue this design in Haskell for two reasons. First, because we thought it might be too unfamiliar. Second, because the ELSE on a line by itself was syntactically awkward. It would be especially annoying if one ever wanted the usual cascading behaviour:

    f :: Thing -> Thing
    f x | p x  =  ...
    ELSE
        | q x  =  ...
    ELSE
        | r x  =  ...

Here each guard is tested in turn, and we take the first that succeeds.

Today, the first problem is perhaps no longer quite so strong an issue. Many applications using Haskell would welcome the extra assurance from flagging any cases where order of the equations is significant. But the syntactic awkwardness of ELSE remains considerable. It was syntax about which I had an insight while tossing in bed.

Above otherwise is a variable bound to True in the standard prelude. But say we were to treat otherwise as a keyword, and to give it the meaning that the equation applies only if no previous equation applies, and to allow it to optionally be followed by a further guard. Then our first example becomes:

    (==) :: Eq a => [a] -> [a] -> Bool
    [] == []            =  True
    (x:xs) == (y:ys)    =  x == y && xs == ys
    _ == _ | otherwise  =  False

And our second example becomes:

    (==) :: Shape -> Shape -> Bool
    x == y | circle x && circle y  =  radius x == radius y
           | square x && square y  =  side x == side
           | otherwise             =  False

And our third example becomes:

    plus :: Thing -> Thing -> Thing
    plus x y | zero x     =  y
             | zero y     =  x
             | otherwise  =  ...

If one doesn’t want to invoke the equality test in the case that both zero x and zero y hold then one would instead write:

    plus :: Thing -> Thing -> Thing
    plus x y | zero x            =  y
             | otherwise zero y  =  x
             | otherwise         =  ...

Similarly, the cascading example becomes:

    f :: Thing -> Thing
    f x | p x            =  ...
        | otherwise q x  =  ...
        | otherwise r x  =  ...

That’s it! The syntactic awkwardness is greatly reduced.

The proposed notation depends upon Paul’s clever insight to move the guard from the end of the equation to the middle, so evaluation works strictly left to right. But we’ve had guards in that position for quite a while now. Goodness knows why none of us hit upon this proposal thirty-odd years ago.

Of course, the change is not backward compatible. Changes to guards could be made backward compatible (with added ugliness) by using a different symbol than ‘|’ to mark guards with the new semantics. But now the old definition of (==) should not be accepted without an otherwise, and I cannot think of how to introduce that new semantics with a backward compatible syntax.

The solution, as with so much of Haskell nowadays, is to activate the new semantics with a pragma. Manual porting of legacy code would not be hard in most cases, and it would also be easy to write a tool that adds otherwisewhenever the equations are not easily shown to be independent of order.

John Hughes suggests a further refinement to the above. Using equality to check that the value of two equations is the same may not be appropriate if the values are computed lazily. Instead, he suggests that the plus example should translates as follows:

    plus :: Thing -> Thing -> Thing
    plus x y | zero x && zero y              =  x `meet` y
             | zero x && not (zero y)        =  y
             | not (zero x) && zero y        =  x
             | not (zero x) && not (zero y)  =  ...

Here we presume a type class

    class Meet a where
      meet : a -> a -> a

which confirms that the two arguments are the same and returns a value that is the same as both the arguments. For strict data types, two arguments are the same if they are equal.

    instance Meet Integer where
      x `meet` y | x == y     =  x
                 | otherwise  =  error "undefined"

For lazy data types, we check that they are the meet lazily.

    instance Meet a => Meet [a] where
      [] `meet` []           =  []
      (x:xs) `meet` (y:ys)   =  (x `meet` y) :: (xs `meet` ys)
      meet _ _  | otherwise  =  error "undefined"

If the compiler could not verify that equations are disjoint, it would require that their right-hand sides have a type belonging to the class Meet.

In most cases, one would hope the compiler could verify that equations are disjoint, and hence would not have to resort to meet or additional checks. One might wish to allow a pragma to declare disjointness, permitting the compiler to assume, for instance, that x < y and x >= y are disjoint. An SMT solver could do much of the work of checking for disjointness.

In general, equations not separated with otherwise would be checked to ensure they are disjoint or all give equivalent results. For example,

    g :: Thing -> Thing
    g x | p x             =  a x
        | q x             =  b x
        | otherwise r x   =  c x
        | s x             =  d x
        | otherwise t x   =  e x

would be equivalent to

    g :: Thing -> Thing
    g x | p x && q x              =  a x `meet` b x
        | p x && not (q x)        =  a x
        | q x && not (p x)        =  b x
        | otherwise r x && s x    =  c x `meet` d x
        | r x && not (s x)        =  c x
        | s x && not (r x)        =  d x
        | otherwise t x           =  e x

On the other hand, if we declared that p x and q x are disjoint, and the same for s x and r x, then the first code would instead compile to something equivalent to Haskell’s current behaviour,

    g :: Thing -> Thing
    g x | p x             =  a x
        | otherwise q x   =  b x
        | otherwise r x   =  c x
        | otherwise s x   =  d x
        | otherwise t x   =  e x

One drawback of this proposal is that the source code doesn’t directly indicate when extra tests and the use of meet are required. An IDE might provide feedback to make explicit which tests are performed, or one might also add pragmas or additional syntax to reflect that information in the source.

I hope some reader might be keen to take this forward. What do you think?

9 comments:

Justin said...: I think ordering is a feature. It’s easy to reason about (and familiar). Being able to write out of order code with multiple otherwise statements looks like a nightmare. I read and write Haskell daily and am bit afraid of new, useful syntax. I think more justification is needed to pursue this. What does it buy? Criticism offered with best possible intentions.; 10/9/25 9:09 PM
Philip Wadler said...: It's helpful for writing high-assurance programs. If you wrote `plus`, you might want to confirm that returning `x` when `y` is zero and returning `y` when `x` is zero are equivalent, and that is what the code will do under the new semantics; and now it does the same for any overlapping equations. Of course, you could write that out explicitly but it is longer. Now you have to be explicit about where you think the order of equations doesn't matter and where you think it does.; 11/9/25 8:57 AM
AntC said...: (Following Wadler's Law ...) `otherwise` is not currently a keyword, so there might be backwards compatibility trouble; also it's a pain to type out. `else` is a keyword; in your proposed syntax following `|` is not a currently valid position for `else`, so I prefer that.; 12/9/25 1:27 AM
Philip Wadler said...: Good point. Indeed, replacing `| otherwise` by `else` reads well if it is to be followed by a guard, but less well if it stands on its own.; 14/9/25 3:27 PM
John Hughes said...: That does look nice! I had two further thoughts on it…

1. What about implementation? The code expansion you suggest can increase code size exponentially (you get exponentially many cases in the number of unordered equations). That seems bad. Of course, there is a linear space translation, involving making a list of the matching right hand sides, and using a foldr meet to combine them (if the list is non-empty). Perhaps a compiler could switch from one implementation to the other when the number of cases exceeded some small bound; perhaps the “linear” case could be made more efficient by collecting the right hand sides on the stack instead of in a list. I think there is a case that this could be too expensive, though, so a bit of thought is needed.
2. What about laziness/evaluation order? Consider
f _ True = a
f True False = b
f _ _ | otherwise = b
What is f undefined True? In Haskell as it is today, of course, the answer would be a, because the equations are used in order. But with your idea, we would have to exclude the second equation, and doing so using left-to-right pattern matching would match undefined against True first, leading the entire call to be undefined, I suppose. Now of course, in this case right-to-left pattern matching would work. But what about
or True _ = True
or _ True = True
or False False = False
As we know, this can’t be implemented in a way faithful to all the equations without parallel evaluation of the two arguments. That’s quite heavyweight machinery for the pattern-matching compiler to introduce. Alternatively, there needs to be a syntactic restriction to guarantee the existence of a sequential matching order. And since evaluation order affects space use, then the programmer should be able to reason about the matching order without too much difficulty. That weighs against using a global analysis of all the left hand sides to determine matching order, in which a change to the last equation might lead to different matching order in the first one… I suppose I’m thinking that a reasonable goal might be that a group of equations WITHOUT ‘otherwise’ should have a semantics independent of their order. That seems a bit tricky to achieve in a straightforward way.; 29/9/25 4:11 PM
John Hughes said...: I had a further thought on sequentiality and matching order.

Here it comes. Matching order remains left-to-right, like today, but there is an additional restriction that all the equations in an unordered group must share the same matching order. This would have been awkward to achieve thirty-five years ago, but is easy today, thanks to bang patterns! Thus or becomes

or True !_ = True
or !_ True = True
or _ _ | otherwise = False

I think this restriction ensures that the order of equations is irrelevant. And matching semantics, including evaluation order, is no harder to reason about than today.; 29/9/25 4:12 PM
Anonymous said...: f _ True = a
f True False = b
f _ _ | otherwise = b

What is f _|_ True?

- Haskell today: True

- With this proposal: matching second equation left-to-right results in _|_
Right-to-left matching would work in this case.

But consider

or True _ = True
or _ True = True
or False False = False

There IS no (sequential) matching order that makes all the equations hold.

Proposal: require all the equations in a commuting group to use the eame matching order. Can be achieved more easily nowadays using ! to force strict matching.

So the or definition above would not be allowed. Instead write

or True !_ = True
or !_ True = True
or False False = False

To recover the old behaviour, write

or True _ = True
or !_ True | otherwise = True
or False False = False

You gave this example to motivate choosing the matching order on a
case-by-case basis.

f xs [] = ...
f [] (y::ys) = ...
f (x :: xs) (y :: ys) = ...

Here right-to-left order looks natural. But consider:

or False False = False
or _ _ | otherwise = True

What is or _|_ True? Today: _|_. With left-to-right matching,
_|_. With right to left: True. Yet the patterns give no reason to
choose one order over the other.

Possibly consider a type of three-valued booleans:

data TVB = TT | FF | X

and this definition:

f FF FF = FF
f _ X = X
f _ _ | otherwise = TT

(this is not the or function, just an example to make a point).

What is f _|_ TT? If the answer changes when equation (2) is deleted,
then something very funny is going on.

Here are three principles I think are important:

1. The meaning of every legal program should be defined unambiguously.

2. The order of equations in a commuting group (with no otherwise)
should make no difference to the semantics.

3. If there is an otherwise between every pair of equations, then the
semantics should be the same as today.

And actually, I would suggest a fourth:

4. Reasoning about evaluation order (or at least, strictness) should
be easy for the developer.

Why? Because it's a necessary part of debugging space leaks, which is
hard enough already. For me, the predicability of left-to-right
matching is a great strength.; 16/10/25 10:33 AM
Philip Wadler said...: The previous comment is, in fact, not by "Anonymous" but by John Hughes. Thanks, John! I agree with your four principles. I want to add a fifth: just as the order of equations in a group without otherwise should not matter, the order of patterns should not matter. Thus:
```
orL False x = x
orL True x = True
```
should match the left argument first (so `orR True _|_` is `True`, but `orR _|_ True` is `_|_`, while
```
orR x False = x
orR x True = True
```
matches the right argument first, so `orR True _|_` is `_|_`, but `orR _|_ True` is `True`. Whether one can define the semantics in this way is consistent with principles 1 and 4 is an important question, to which I do not (yet) know the answer.; 16/10/25 11:45 AM
Philip Wadler said...: Thanks to all for the discussion at the Haskell Symposium at ICFP/SPLASH in Singapore. Following the discussion, I believe that the idea of checking at run-time that all right-hand sides are equal when the left-hand sides overlap fails to hit the sweet spot: it might be helpful in a specification language (as pointed out by Sam Lindley) but if you are an implementor interested in correctness you would probably prefer stronger guarantees and a simpler map from what is written to what is executed. So I now prefer an alternative that requires all of the patterns in a group between "otherwise" equations to be disjoint, which for patterns at least is easy to check. Note that `orL` and `orR` above are both disjoint.

Also, thanks to a Haskell Symposium referee for pointing me at the following paper, which is relevant and which I enjoyed reading:

Binder, David, and Lean Ermantraut. ‘The Algebra of Patterns (Extended Version)’. LIPIcs, Volume 333, ECOOP 2025 333 (2025): 2:1-2:28. https://doi.org/10.4230/LIPIcs.ECOOP.2025.2.; 22/10/25 11:44 AM