January Release Notes and Themes

2022-01-30

This post describes the last two Oil releases, and then elaborates on emerging project themes.

The most important one is thinking of the #shell-runtime as a state machine that receives asynchronous messages. This work is in progress, but it's worth describing the motivations, and what we've done so far.

Oil version 0.9.7 - Source tarballs and documentation.

Table of Contents

Background

Oil 0.9.6 on December 30th

User-Facing Changes

Under the Hood

Oil 0.9.7 on January 28th

Under the Hood

Closed Issues

The Shell Runtime As A State Machine

Whac-A-Mole With Signal Handling Bugs

Credits

A Model of the Runtime

Background

I wrote about the 0.9.5 release in November, in Winter Blog Backlog: Recent Progress.

The work in the the 0.9.6 and and 0.9.7 releases has a variety of motivations:

Project-based. Language features for Nix, and a bug fix found by running ble.sh tests.
Bug Reports. A clear pattern is that Oil doesn't handle signals correctly! This led me down a rabbit hole related to SIGWINCH, the signal for terminal window size change. The fixes and testing made me realize that we should "recast" the #shell-runtime as a state machine.
- I appreciate the great bug reports that led to this, and credit several people below.
Bug backlog. I went through almost every bug in the issue tracker, and fixed a couple old ones. (Zulip: Issue Triage). Even though Oil is a long-winded project, it feels like it can "converge" with enough help.
Infrastructure: Preparing for a "compiler engineer" to join the project. Our continuous build is in good shape.

The next two sections have details and credits. If you're casually following the project, you may want to skip to the last section.

Oil 0.9.6 on December 30th

User-Facing Changes

Full changelog: https://www.oilshell.org/release/0.9.6/changelog.html

Motivated by Nix usage, assignment builtins now support the += operator. These both work, and many new test cases pass (spec/append results).
- export foo+=bar (static)
- export foo${x}+=bar (dynamic)
  - Aside: this dynamic variant is now implemented with regular languages, consistent with our style of principled parsing.
- Thanks to Raphael Megzari for reporting this, and for lots of testing with Nix.
Contributions from Peter Debelak:
- Fix a crash when sourcing a directory. I then discovered and fixed a similar bug with redirects (by properly handling the EISDIR error).
- osh now supports -l and --login (no-ops for now)
Contributions from Brandon:
- Fixed a crash bug in the fg builtin. Even more importantly, he added terminal-based pexpect tests to verify the fix!
- I put these tests in the continuous build, and greatly expanded on them in the subsequent release. This was crucial to fixing the tricky signal handling bugs.
- Not released: some work on the pgen-native parser generator, the fast version of pgen2. We still need help with this!
I made a pretty code overview for the compiler engineer I want to fund and hire: https://www.oilshell.org/release/0.9.6/pub/metrics.wwz/line-counts/for-translation.html
- Somehow this makes the code look big, but the line counts are small. The job is to work with around 10K lines of code, not 100K!
- Background: #oil-dev > Compiler Engineer Job
- I will write more about this, but feel free to ask me about it on Zulip.

Under the Hood

As mentioned in the recent retrospective, the continuous build now uses #containers. It runs on both Github Actions and sourcehut, with both the Docker and podman runtimes.
- I started a #containers Zulip stream and wrote way too many messages about this. I'll use these notes to write future blog posts. Slogan: Containers are complementary to shell!
- The multi-cloud CI system is now called Soil (formerly Toil). I'd like to eventually lift some parts of it to the user level.
I refactored and fixed Oil's error messages (at parse time and runtime), but there's still more work to do. I want to remove some duplicate messages and add optional color!
1131 spec tests pass under oil-native, which is an all-time high: https://www.oilshell.org/release/0.9.6/test/spec.wwz/cpp/osh-summary.html

Oil 0.9.7 on January 28th

I released Oil 0.9.7 two days ago. Let's start with the infrastructure changes. Then we'll look at user-facing changes, which leads into the larger state machine theme.

Under the Hood

I upgraded the busybox version used in the spec tests, which meant upgrading one of the new CI #containers. It worked well, although docker build has caveats with regard to reproducibility and correct incremental builds. We should address this in future work.
- Thanks to Peter Debelak for reporting the busybox version issue. I want to improve the dev process for contributors, so reports like these are very welcome.
I revived the app-tests CI task, which runs the ble.sh tests with OSH.
- Thanks to author Koichi Murase for excellent debugging and help in several areas! I'd still like Oil to run ble.sh.
The oil-native "dev build" now wraps shell with Ninja, so it's incremental and parallel.
- In contrast, the release build shouldn't depend on Ninja, so we'll probably keep using a shell script forever. It compiles in around 30 seconds, so it should be fine.
Now 1137 spec tests pass under oil-native: https://www.oilshell.org/release/0.9.7/test/spec.wwz/cpp/osh-summary.html
- That is, the automatic translation has been doing work "for free", but there's still a lot left. This is what I want to hire the compiler engineer for.

Now let's look at closed issues, which leads into the state machine theme.

Closed Issues

#1077	Add interactive tests that match other shells (e.g. Ctrl-C is exit code 130)
#1072	Fix vm-baseline benchmark after optimization
#1067	Terminal resize causes wait to exit
#1064	Tab completion does not suggest aliases or functions
#743	$PATH is empty when it's not in the parent's environment, unlike other shells
#467	Ctrl-C in command substitution exits parent shell

Full changelog: https://www.oilshell.org/release/0.9.7/changelog.html

The Shell Runtime As A State Machine

Whac-A-Mole With Signal Handling Bugs

Let's focus on two of these issues:

#467 - Ctrl-C during a command sub exited the shell, but it should bring us back to the interactive prompt.
#1067 - Resizing the terminal causes the wait builtin to return, but it should be ignored.

They don't seem related, or even that interesting. But they led me to reconceptualize the shell runtime as a state machine.

They both relate to signals: Ctrl-C causes SIGINT, and resizing the terminal causes SIGWINCH. And they reminded me of past bug fixes, like running trap handlers when the read builtin is interrupted.

I realized I've been playing Whac-A-Mole with this class of bug, which is bad. So I started to work on the pexpect tests that Brandon added for the fg bug, expanding the harness and planning a test matrix.

I fixed these bugs, and found more bugs to fix (e.g. in the wait -n variant). Revelation: We were missing an important way of testing the shell! Now that we have test/interactive.py, we can make monotonic progress on a multi-dimensional test matrix. More on this below.

Credits

Thanks to Lukas Wurzinger for reporting #1067, and then nudging me to do the right thing.
Thanks to Josh Nelson for reporting #467 a couple years ago! I unearthed this bug during the triage. I think I subconsciously put it off because it required manual testing at the time.

I also tagged older bugs #signal-handling so I can make another pass over them. They should fall in specific cells of the test matrix, and "disappear" once those cells are filled in.

A Model of the Runtime

Based on this experience, I sketched an idea for a blog post:

#blog-ideas > The Shell runtime is a waitpid(-1) loop that converges to a state

The idea is that the shell interpreter walks the syntax tree and:

Makes syscalls like
- fork() to start a process, or
- read() for the read builtin
Receives two types of asynchronous messages:
- The result of waitpid(-1). This is how the kernel tells us that a process changed state (it finished, or suspend/resume).
- Signals: from the keyboard, from the kill command, ...
Updates its state based on these messages
And then there are five ways that the shell waits for state to converge:
- 2 synchronous constructs: processes and pipelines
- 3 async constructs: wait (all jobs), wait -n (next job), and wait %1 (specific job)

One part of this is the singleton Waiter abstraction, which has existed for years. What's new is integrating syscalls and signals into a single state machine model.

It reminds me of DJB's self-pipe trick. How do you wait on a child process and a async read() concurrently? By writing a byte to a "self pipe" in the signal handler, so it reduces to select().

We're not using the same mechanism, but we also have to unify disparate concurrency styles. (Linux has signalfd, but Oil should be portable to all Unixes.)

Exhaustive Test Matrix

I wrote a comment in test/interactive.py that describes this five dimensional test matrix. It should let us explore a large portion of the state space and prevent regressions.

What is the main loop doing?
- Computation, waitpid(), read(), ...
What message is received?
- Which signal, which process completion, ...
If it's a signal, is it trapped by the user, or untrapped?
- Example: trap 'echo X' SIGWINCH
Is the shell interactive or batch?
- An interactive shell is one where stdin is connected to a terminal, as opposed to a file. They shouldn't exit in certain circumstances.
- Terminals distribute signals to groups of processes.
As we do in spec tests, the interactive state machine tests should compare OSH with shells like bash, dash, mksh, zsh, ...

Again, the idea is to make monotonic progress rather than playing Whac-A-Mole. This will also help the code translate cleanly and automatically to C++.

Recap

I wrote about related problems in Technical Issues and Risks (August 2020) > Deferred Issues: What the Interactive Shell Depends On.

Despite recently declaring that the interactive shell is "punted", I'm thinking about all these issues again, and I have a plan for each one:

Line editors like GNU readline
Concurrency with yield or pipes
Cancellation

One way to clarify this: I'm limiting the scope of the interactive shell to my own usage. I want to punt customizations outside the project -- to the headless shell. In contrast, I very much care about making the Oil language useful for others. That is the core of the project.

I expect that the state machine model will improve Oil, and that future blog posts will make reference to it. I've written a lot about principled and exhaustive parsing (#parsing-shell and #ASDL), but not much about the #shell-runtime. That's because we were missing something!

More Themes

This post is now too long, so I moved these themes to the next post:

Fundraising / Hiring for a Compiler Engineer
- See https://github.com/oilshell/oil/wiki/Compiler-Engineer-Job
- I applied for an NLnet grant and expect to hear back this month.
- Talk to me on Zulip about this! Do you know somebody interested and qualified?
Zulip: Ideas for an Oil Logo. Some good connotations:
- Oil as in delicious food
- Oil as essential lubricant for software systems
- Oil as a smooth upgrade path from bash

What's Next?

I want to write these posts:

Backlog: Software Architecture. This is the last of five "Winter Backlog" posts which I published at the end of 2021 (#blog-topics).
- The last post, Retrospective: Software Architecture, is the lead-in.
Backlog: Push vs. Pull, State Machines vs. Coroutines. This is an unappreciated aspect of both programming and language design. I've encountered in multiple times in the project.
- lobste.rs recap and Zulip #blog-ideas thread

As far as coding, these issues are on my mind:

Color error messages like Clang (and GCC). Precise and pretty errors are one of Four Features That Justify a Unix Shell.
Fix drawing problems in the #interactive-shell. What happens when you type past the end of the line?
Continue work on signals: Fix the SIGINT handler like I did the SIGWINCH handler.
The Soil CI subsystem. It should usable locally (which containers help with), and more reproducible and incremental. It should have fine-grained security to safely receive results from multiple clouds.

Appendix: Metrics for the 0.9.7 Release

As usual, these metrics help me keep track of the project. I hope they'll also give the compiler engineer color on what needs to be done.

Let's compare this release with Oil 0.9.4 - User Feedback.

OSH spec tests:

OSH spec tests for 0.9.4: 1958 tests, 1738 passing, 84 failing
OSH spec tests for 0.9.7: 1994 tests, 1771 passing, 86 failing

Translation progress:

OSH C++ spec tests for 0.9.4: 1688 run, 1555 passing in Python, 937 passing in C++
OSH C++ spec tests for 0.9.7: 1774 run, 1626 passing in Python, 1137 passing in C++

I forgot to mention that I fixed a mycpp bug related to field inheritance, which led to the big jump in tests passing in C++.

#oil-dev > Translation Bugs That Survived the Compiler

The source code is getting more correct, but not much bigger:

oil-cpp for 0.9.4: 92,115 lines of C++
oil-cpp for 0.9.7: 93,239 lines of C++

Ditto for the binary:

ovm-build for 0.9.4: 1,350,544 bytes of native code (under GCC)
ovm-build for 0.9.7: 1,376,608 bytes of native code (under GCC)

January Release Notes and Themes

Background

Oil 0.9.6 on December 30th

User-Facing Changes

Under the Hood

Oil 0.9.7 on January 28th

Under the Hood

Closed Issues

The Shell Runtime As A State Machine

Whac-A-Mole With Signal Handling Bugs

Credits

A Model of the Runtime

Other Shells Have This Problem

Exhaustive Test Matrix

Recap

More Themes

What's Next?

Appendix: Metrics for the 0.9.7 Release