Errors by dabrahams · Pull Request #41 · stlab/better-code

dabrahams · 2026-01-28T20:40:40Z

No description provided.

camio

Hopefully this submits the comments I made here. They ma have gotten lost?

camio · 2026-01-28T22:53:14Z

Looks like you made a different PR? Here are my comments on the old one: #39 (review)

dabrahams · 2026-01-29T21:22:23Z

Looks like you made a different PR? Here are my comments on the old one: #39 (review)

Thanks; they were good. I mistakenly deleted the branch in the server, which closed the PR.

better-code/src/chapter-3-errors.md

RishabhRD · 2026-01-30T10:26:43Z

I was wondering if it is worthy to provide wisdom on some of common error handling (mis)conceptions widely spread in industry:

try/catch just to log and rethrow.
Enriching the error with "context". Rust's anyhow::Context is an example and thus is becoming common practice. Usually done by throwing a custom error type that wraps the error thrown by functions called by callee.
Adding StackTrace to the error.

I have noticed the above patterns are common for "backend" applications (applications running on server, interacting with DB, etc).

I know the points I mentioned has nothing to do with correctness of the program and might be very contextual but the reason for pointing this out is I have noticed multiple people exchanging multiple theories/libraries around similar ideas(an example from r/rust) without even talking about design by contract. Maybe addressing this would help them.

better-code/src/chapter-3-errors.md

tothambrus11 · 2026-02-02T13:11:52Z

better-code/src/chapter-3-errors.md

+A useful middle ground is to describe reported errors at the module
+level, e.g.
+
+> Any `ThisModule` function that `throws` may report a
+> `ThisModule.Error`.
+
+A description like the one above does not preclude reporting other
+errors, such as those thrown by a dependency like `Foundation`, but
+calls attention to the error type introduced by `ThisModule`.


I like this middle ground. Whatever error doesn't conform to the mentioned error protocol can be usually still handled by the same handling method, it just wouldn't be as helpful. E.g. if something doesn't conform to the LocalizedStringConvertible protocol as suggested by the method's documentation, we can still display the non-localized description to the user. In a robotics system, any non-conforming error can be regarded as a fatal subsystem failure, triggering the emergency stop procedure at the top level.

I think there would be also value to have a language feature that can add a similar annotation for the module: "All throwing functions in module/scope/file X, unless otherwise specified, must throw something of a given type." Then we can express our intents more explicitly, e.g. with a required isFatalSubsystemFailure flag.
When writing a compiler, we generally want to throw Diagnostics, but and it's useful to know if the function can throw anything else, or if the function only throws the expected Diagnostics, but otherwise it's not doing any sketchy stuff. When we have that information, we can wrap throwing a function into something that returns a partial AST and a set of diagnostics, without the need to rethrow any extra errors.

I don't understand what you're getting at with isFatalSubsystemFailure, but fatal errors are distinct from the recoverable kind we report by throwing. You don't want to unwind the stack if the condition is going to be fatal.

Also, is there a suggestion for the text here or are we just chatting?

Fatal means slightly different things in a robotics context than in a regular application. When an application experiences a bug, it is reasonable to trap and expect the user to restart the application if they want. However, in a robotics system, we may need to perform safety measures, such as lowering a motorized arm slowly to avoid falling and damaging components. Often, the emergency stop also doesn't just cut the power but e.g. keeps holding onto suction cups so the robot doesn't drop a 10kg glass window.

Also, fatal subsystem failures can likely happen due to broken sensors, failed communication with motors, and I think should be distinguished from our regular fatal failures caused by bugs/precondition violations. Subsystem failures can lead to the graceful termination of that specific subsystem, while the same controller may go on with finishing some remaining tasks of other subsystems.

(I don't have any suggestions to the text, just discussing.)

Well, this discussion put me off of trying to recommend emergency shutdown measures (which were part of our general philosophy), but your robotics example is excellent. I think you should post about it in that thread. The biggest problem that I have for the book is that there's no way in Swift to do emergency shutdown other than by a monitor process that runs the program as a subprocess (which is extremely limiting and not even available on some OSes like iOS). That said, I see no reason to perform unwinding in these cases; it seems to me the program should go directly to the emergency shutdown procedure before terminating.

Broken sensors aren't fatal to the program if it is going to continue, so I personally wouldn't use the word “fatal.” If that is the term of art in Robotics, so be it.

tothambrus11 · 2026-02-02T13:24:27Z

better-code/src/chapter-3-errors.md

+
+```swift
+extension Array {
+  /// Exchanges the first and last elements.


If we change the condition count() == 1 to count() <= 1, we can easily make this function safe even if the precondition is turned off. I think this is orthogonal here, but still slightly distracting/weakening the argument.

Maybe it would be worth mentioning this example still though. What's the tradeoff between writing the two different conditions, and what is the ideal precondition of this function?

That is really not the point of the example, so I don't want to get into that here. However, I would welcome a better example that doesn't raise the concern (which occurred to me too).

A simple out of bounds access check for an element accessor could be sufficient to illustrate the problem. Maybe something that's not a subscript is even better, in case someone didn't know about subscripts.

We can't hold back from using fundamental language features. We do expect to have an appendix that gives an introduction to the Swift features we use.

What kind of element accessor would you suggest? I thought of middleElement but that's got such a weird precondition (that the length is odd)…

tothambrus11 · 2026-02-02T13:36:48Z

better-code/src/chapter-3-errors.md

+In most cases, the only acceptable behavior at that point is to
+present an error report to the user and leave their data unchanged,
+i.e. the program must provide the strong guarantee. That in turn
+means—unless the data is all in a transactional database—a program


I wrote a transactional undo/redo framework that lets us compose commands similarly as functions, ensuring transactional guarantees on every level. It may be useful for more general programming tasks other than implementing undo/redo in editors, as we may often not want to discard the changes to the whole document, just the changes done in a specific scope.

See an example at: https://github.com/tothambrus11/undonete-swift/blob/master/Tests/UndoneteTests/UndoneteTests.swift

Transactional guarantees don't compose, so using them at every level is incredibly inefficient. Is this a general remark or do you think the text should change somehow?

No need to change the text, just discussing.

My composite command system achieves composable transactionality, assuming all low-level commands are correctly implemented to be transactional with an undo, a redo and initial execution method.

Higher level commands are composed of the execution of lower level commands (which may be themselves composite). Once any command fails to execute, it throws an exception, and the composite command undoes all previously executed subcommands, so that the composite command either fully succeeds or leaves the state in the original state.

There is a bit of syntactic overhead over regular programming, but I think explicit undoable commands can be generally useful when making a copy of the original state is unfeasible while requiring transactionality, and it makes code much easier to reason about, as it takes care of a semantically correct unwinding.

Undoing is usually much less efficient and much more error prone and much much more code than rolling back to a snapshot. You just use a persistent data structure to represent snapshots and then small changes don't take much storage.

Thank you, those are great observations! Undoing is indeed pretty error prone. It's enough for one of the low-level components to be incorretly implemented and it would make all its user dependant commands non-transactional. If the transactionality is too fine-grained, my model also adds extra layers of undo stacks, so it also adds large memory overhead. Persistent data structures sound like they abstract over all of these bits, I will need to study them more.

All you have to do is use CoW at a reasonably fine-grained level, and you get a persistent data structure with mutable value semantics. “Persistent data structure” is just a fancy word for that. There are such things as “persistent arrays” that make it more fine-grained than “the whole array” and preserve the big-O of arrays, which might be worth looking into. However, beware the constant factors. I would venture that a CoW deque would beat this data structure in many cases.

Co-authored-by: Rishabh Dwivedi <rishabhdwivedi17@gmail.com>

Co-authored-by: Ambrus Tóth <32463042+tothambrus11@users.noreply.github.com>

dabrahams · 2026-02-02T19:35:28Z

@RishabhRD I would need some suggestions of what specific wisdom to offer. I thought about all this and it seems mostly irrelevant to the core issues, so I didn't know what to say about it. If you want to wrap your errors with context information, go ahead; it can be useful… I guess the one advice I'd give is to use a higher order function, e.g.

try withContext("What I'm doing right now") {
  // the code
}

but even that seems like I'd have to set up a lot of context in the text to even mention it.

better-code/src/chapter-3-errors.md

camio · 2026-02-02T20:18:50Z

better-code/src/chapter-3-errors.md

+[Perhaps the earliest use
+](https://dl.acm.org/doi/10.1145/800028.808489) of the term “error
+recovery” was in the domain of compilers, where the challenge, after


I have no reason to believe that the term error recovery was initially used in compiler design. See, for example, this 1959 paper, which talks about error recovery (or failure recovery) in the context of hardware issues. A quick search in Google Scholar found the term used as early as 1937.

Your 1937 paper appears to be from 1971. It says so on the first page.
The other reference does not mention “error recovery,” and I used the word “perhaps.”
Hardware issues are beyond the scope of this chapter.
What would you suggest I do? That would be actionable feedback.

better-code/src/chapter-3-errors.md

camio · 2026-02-02T20:36:27Z

better-code/src/chapter-3-errors.md

+compromising security. If user data is quietly corrupted and
+subsequently saved, the damage becomes permanent.
+
+In any case, unless the program has no mutable state and no external


Suggested change

In any case, unless the program has no mutable state and no external

In any case, unless the program lacks mutable state and external

@camio This suggestion causes me mental friction because of what I can best describe as the passiveness of the word “lacks.” The properties of having no mutable state or external effects have to be actively upheld in order to make this strategy work. I realize this is a subtle thing and could potentially be convinced that my reaction is misplaced.

better-code/src/chapter-3-errors.md

camio · 2026-02-02T21:03:13Z

better-code/src/chapter-3-errors.md

+
+Assertions are checked only in debug builds, compiling to nothing in
+release builds, thereby encouraging liberal use of `assert` without
+concern for slowing down release builds.


Aside: Swift got the defaults wrong here. 1) I don't think preconditions and inline assertions are fundamentally different such that one is in release and one isn't 2) For assert I think there should be two variations (assert and always_assert), but I prefer Rust's spelling: assert and debug_assert.

Perhaps focusing on the desired properties in different contexts rather than taking Swift's naming and compilation scheme as given is more useful to the general audience. The precondition identifier serves also a documentation purpose, and it's a bit awkward to replace that to assert for saving performance in release builds.

It may be worth adding the explicit imaginary alternative precondition(condition, errorMessage, debugOnly=False), and disclose that different languages use different names and defaults with regards to tagging.

At the same time, it's useful to mention where to write assert and where to write precondition with regards to their semantic/documenting purpose.

@camio The rationale for pairing the name assert with the semantics it has go the other way: if you're going to have a check that turns off in release builds, it should be named the same as checks with the same semantics from other languages. Precedent matters. Internally to the standard library there's _debugPrecondition which is used for precondition checks upon which safety does not depend (it also needs to be implemented differently from assert for… reasons). We decided against exposing something with that name to users because there was already going to be assert with identical semantics.

Now, you might take issue with the implication that self-checks for soundness can reasonably be disabled in release builds. That, however, is a critique of the way the chapter is written, not of Swift.

@tothambrus11 I thought the text was “focusing on the desired properties in different contexts.” If you have an idea how that could be strengthened, perhaps suggest an edit. Likewise for “it's useful to mention where to write assert and where to write precondition with regards to their semantic/documenting purpose.”

I was conscious of the awkwardness of using assert for a precondition check, but I absolutely don't want to add anything imaginary and write examples in terms of that. People should be able to test the examples. To that end, I spent a bunch of time building an implementation of preconditionUncheckedInRelease (and postconditionUncheckedInRelease) to put in the text and found that they required implementation tricks that would need to be explained. In all, it seemed very heavy for the amount of benefit it was bringing so I replaced them with the use of assert with a message that you see in the text.

camio · 2026-02-02T21:18:06Z

better-code/src/chapter-3-errors.md

+code in an unfinished state):
+
+1. Something your function uses has a precondition that you can't
+   be sure would be satisfied:


I'm having a lot of trouble understanding these examples. I'll try to think of something that's more straightforward.

The first example is not great and I wanted to find something better. I'm surprised you had trouble with the 2nd one though.

camio · 2026-02-02T21:21:57Z

better-code/src/chapter-3-errors.md

+In general, when a condition *C* is necessary for fulfilling your
+postcondition, there are three possible choices:
+
+1. You can make *C* a precondition of your function


I think it is worth clarifying (unless this is covered later) that option 1 is to add a precondition D such that when D holds, C holds.

It is sometimes desirable to add a precondition that is a strict superset of C, e.g. when it simplifies the function's interface.

When you make D a precondition, you are making C a precondition. Isn't the value of simple contracts already sufficiently covered in the previous chapter?

sean-parent

Looking very good. My notes are relatively minor.

sean-parent · 2026-02-02T20:59:21Z

better-code/src/chapter-3-errors.md

+> has a bug.
+
+In the interest of progressive disclosure, we didn't look closely at
+the idea, because behind that simple word lies a chapter's worth of


I would emphasis error in the quote, because even though you mention the
concept of errors above, it isn't clear if error is what "that simple word" is
referencing.

Really? It's in italics in the sentence that introduces the quote!

better-code/src/chapter-3-errors.md

sean-parent · 2026-02-02T21:25:11Z

better-code/src/chapter-3-errors.md

+[^techniques]: Techniques for ensuring that restarting is seamless,
+such as saving incremental backup files, are well-known, but outside
+the scope of this book.


Since persistent transactions are a way to achieve both the strong guarantee and
to ensure restarting is seamless, they are probably worth a longer reference in
the text. Naming them as "persistent transactions" also gives the reader something
to research.

I don't know what you have in mind. Please suggest a specific edit.

better-code/src/chapter-3-errors.md

sean-parent · 2026-02-02T22:29:13Z

better-code/src/chapter-3-errors.md

+    proper `x.randomShuffle()` would, and is not guaranteed to
+    preserve the same randomness properties.  Perhaps more


We should state this more strongly than "not guaranteed." With quick or
introsort you get a high probability that the pivot element is near the center,
and the next pivot element is near either the 1/4 or 3/4 mark, and so on.

It's not plainly obvious to me how much those facts harm the randomness. If it's obvious to you, please suggest “highly unlikely to” or something as an edit and I'll accept it.

better-code/src/chapter-3-errors.md

instead of adding preconditions.

Co-authored-by: David Sankel <camior@gmail.com>

better-code/src/chapter-3-errors.md

@camio

Very useful; thanks @camio and @sean-parent! Co-authored-by: Sean Parent <sean.parent@stlab.cc> Co-authored-by: David Sankel <camior@gmail.com>

dabrahams added 30 commits September 17, 2025 16:54

Begin Errors.

fb7839d

Errors WIP

252e110

WIP

661e6cf

WIP

81f3f44

Errors WIP

919822e

Recoverd talk notes

2ad2aef

Merge remote-tracking branch 'origin/main' into errors

d105bc6

WIP

0a366a8

More WIP

aed127c

WIPPITY WIP WIP WOW

ab2f6b5

X

333b81e

Progress

32a543b

End section on bugs.

0dbc6b3

Fix levels

1904312

Whitespace

1c53ffa

Bugfix

797f128

X

85c9686

Merge origin/errors2 into errors (using imerge)

21feed2

Checkpoint

cc1b758

Remove treatment of emergency shutdown measures.

4f5d481

Preface caveat

642a4e5

Checkpoint

bb19270

Checkpoint

ce7e1e6

checkpt

cf5a223

Tweekz

91c80db

Edits

cca0e8a

Checkpoindexter

66303f0

Copy-pasta

a3abe80

X

b7b3d42

Simplicity!

1ab7851

dabrahams added 2 commits January 22, 2026 15:23

Onward

842b853

Finish 1st draft

9fb694f

camio reviewed Jan 28, 2026

View reviewed changes

typo

8cbdf6a

dabrahams added 6 commits January 29, 2026 13:26

Consistent terminology.

8147eb3

Intro tweaks

25249da

Conclusion + fleshing out.

9ab6665

Remove flotsam

d8c1a43

Edits based on David Sankel's feedback.

08fa18e

Let it flow.

e7e7879

RishabhRD reviewed Jan 30, 2026

View reviewed changes

better-code/src/chapter-3-errors.md Outdated Show resolved Hide resolved

tothambrus11 reviewed Feb 2, 2026

View reviewed changes

better-code/src/chapter-3-errors.md Outdated Show resolved Hide resolved

tothambrus11 reviewed Feb 2, 2026

View reviewed changes

better-code/src/chapter-3-errors.md Outdated Show resolved Hide resolved

tothambrus11 reviewed Feb 2, 2026

View reviewed changes

better-code/src/chapter-3-errors.md Show resolved Hide resolved

tothambrus11 reviewed Feb 2, 2026

View reviewed changes

dabrahams and others added 3 commits February 2, 2026 10:33

typo

dffbada

Co-authored-by: Rishabh Dwivedi <rishabhdwivedi17@gmail.com>

Better wording from Ambrus

1eaf05e

Fix logic error

21127c5

Co-authored-by: Ambrus Tóth <32463042+tothambrus11@users.noreply.github.com>

camio reviewed Feb 2, 2026

View reviewed changes

sean-parent requested changes Feb 2, 2026

View reviewed changes

dabrahams and others added 2 commits February 2, 2026 16:27

Take Ambrus' suggestion to mention using type information

f7e3bfe

instead of adding preconditions.

typo

519fd45

Co-authored-by: David Sankel <camior@gmail.com>

dabrahams commented Feb 6, 2026

View reviewed changes

better-code/src/chapter-3-errors.md Outdated Show resolved Hide resolved

Apply suggestions from code review

e55fc3d

Very useful; thanks @camio and @sean-parent! Co-authored-by: Sean Parent <sean.parent@stlab.cc> Co-authored-by: David Sankel <camior@gmail.com>

	In any case, unless the program has no mutable state and no external
	In any case, unless the program lacks mutable state and external

		proper `x.randomShuffle()` would, and is not guaranteed to
		preserve the same randomness properties. Perhaps more

Conversation

dabrahams commented Jan 28, 2026

Uh oh!

camio left a comment

Choose a reason for hiding this comment

Uh oh!

camio commented Jan 28, 2026

Uh oh!

dabrahams commented Jan 29, 2026

Uh oh!

Uh oh!

RishabhRD commented Jan 30, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dabrahams Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dabrahams commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dabrahams Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dabrahams Feb 2, 2026 •

edited

Loading

dabrahams commented Feb 2, 2026 •

edited

Loading

dabrahams Feb 6, 2026 •

edited

Loading