Oils 0.18.0 - Progress on All Fronts

2023-09-17

This is the latest version of Oils, a Unix shell. It's our upgrade path from bash to a better language and runtime:

Oils version 0.18.0 - Source tarballs and documentation.

We're moving toward the fast C++ implementation, so there are two tarballs:

The reference implementation in Python. See INSTALL.txt in oil-*.tar.gz.
The C++ translation. See README-native.txt in oils-for-unix-*.tar.gz.

If you're new to the project, see the Oils 2023 FAQ and posts tagged #FAQ.

What's in this release?

We're moving toward retiring the Python tarball: OSH differs by 2 test cases in C++, and YSH differs by 77, down from 157 in the last release.

We're deep in the middle of implementing YSH: particularly functions, procs, and data languages. We're constantly testing and revising the language design.

This release "checkpoints" that work, as well as reflecting progress on every other part of the project.

Another highlight is the new Oils Reference, organized into two tables of contents, and 13 chapters. As YSH stabilizes, we'll document its behavior here.

Let's summarize what we've done, highlighting contributions. As always, we can use more help!

Based on User Feedback

Implemented the is-main builtin, with feedback from bar-g. (Described below.)
Fixed a build issue reported by joelatschool.
Fixed the getting started doc, reported by Peter Debelak.

The docs still need a lot of renaming from Oil → YSH. Feel free to open bugs on specific docs that are confusing.

Thank you for testing Oils!

Interactive Shell (One Breaking Change)

Melvin did a big overhaul of job control, and fixed bugs like the lack of notification for sleep 1 &.
- Our spec/stateful tests with pexpect are now looking good. This completes a major milestone for our NLnet grant.
- I'll show screencasts of job control in the next post.
I fixed a long-standing escaping bug in shell auto-completion, issue #915.
- We can now run git-completion.bash, without patches. I'll also show a screencast of this.
- A few years ago, I wanted OSH to be "clean" with respect to escaping — not confusing the shell language with argv languages — but I now realize that's for YSH. OSH has to be compatible with a somewhat quirky bash API.
- Related Zulip thread: #shell-autocompletion > YSH Completion Roadmap (login required)
We register the name oils with GNU readline, not oil.
- This is the breaking change, issue #1695. I noticed it because somebody on Hacker News hooked into the oil name with their ~/.inputrc.

Headless Shell

We can now export shell completions as JSON, including the bash completions that OSH runs:

compexport -c 'echo $HO'  # completes shell language itself
                          # e.g. $HOME
compexport -c 'git ch'    # runs git's bash plugin for completion
                          # e.g. checkout, cherry-pick

I claim that this is a missing feature in bash. For example, the git plugin registers itself with this line:

complete -o bashdefault -o default -o nospace -F $wrapper

But programs that want to scrape bash completion can only see the logic in $wrapper, not the three -o options. See Projects Already Doing Something Like Shellac for such programs.

So OSH compexport may be the mechanism for other shells to reuse bash completion!

This work is also still in progress, and only lightly documented. Please join our #shell-gui channel, and help us test it.

It was motivated by both testing our completion logic (escaping) and the headless shell. We want to export shell completions to a GUI. But again, I think other programs can also use it, rather than scraping bash and re-implementing parts of it.

OSH

Fixed our own "dogfood" issues:

Respect globbing in word arguments to redirects, matching bash and zsh:

$ tar -x -z < Python*  # globbing occurs
                       # other shells disagree!

Fixed popd bug
Added is-main builtin

The is-main builtin returns 1 (false) if the current file was executed with the source builtin. It's designed to be used like Python:

if __name__ == '__main__':  # Python
  main(sys.argv)

OSH:

if is-main; then
  main "$@"
fi

YSH:

if is-main {
  main @ARGV
}

Related: #tools-for-oils > Tree Shaking - Static/Dynamic. Our bundling / tree shaking may work by dynamically walking dependencies, and cat-ting the result. It would have to rewrite source and is-main though.

Docs

As mentioned, we have the new Oils reference, which lives in doc/ref/.
Aidan noticed that the reference needs to link to source code. So I published almost all our code at https://www.oilshell.org/release/0.18.0/src-tree.wwz/, and you can link to specific lines. (The CSS links are unfortunately broken, which I've fixed for the next release.)
Rewrote the help builtin to use the reference.
- We want to document Oils once, not 2 or 3 times!
Translated the help builtin to C++.

Still TODO:

#oil-documentation > doc/ref Link and Topic Checker. We need to check the docs for link integrity, which should result in another counter like "146 of 450 topics documented". We can show this metric at the bottom of each release, like we do with our spec test counters.

Performance

Tuned the pool allocator that Chris Watkins implemented earlier this year. It was very pleasant to work with!

Instead of a single pool for objects under 32 bytes, we have two pools with thresholds of 24 and 48 bytes. In other words, these are members of our MarkSweepHeap:

Pool<682, 24> pool1_;
Pool<341, 48> pool2_;

16 KiB / 24 bytes = 682 cells (rounded)
16 KiB / 48 bytes = 341 cells (rounded)
These each multiply to 16,368 bytes, which is 16 bytes less than 16 KiB, conveniently leaving room for the glibc malloc() header.
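Here's a quick check of that arithmetic, as a minimal Python sketch. The `pick_pool` helper is hypothetical: it assumes the thresholds are inclusive and that larger objects go to some fallback allocator, neither of which is spelled out above.

```python
KIB = 1024
BLOCK_BYTES = 16 * KIB  # 16,384 bytes per pool block


def cells_per_block(cell_size):
    # Whole cells that fit in one 16 KiB block (rounded down)
    return BLOCK_BYTES // cell_size


def slack_bytes(cell_size):
    # Bytes left over in the block after all whole cells
    return BLOCK_BYTES - cells_per_block(cell_size) * cell_size


def pick_pool(num_bytes):
    # Hypothetical size-class dispatch; assumes inclusive thresholds
    if num_bytes <= 24:
        return 'pool1'
    if num_bytes <= 48:
        return 'pool2'
    return 'fallback'


for size in (24, 48):
    print(size, cells_per_block(size), slack_bytes(size))
# 24 682 16
# 48 341 16
```

Both cell sizes leave the same 16 bytes of slack, which is why the two template instantiations can share one block size.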

The 24 bytes comes from the fixed-sized "head" of List<T>, which is the most common type in the program. We should also be able to fit the common Token in 24 bytes.

TODO: Tune our growth factors for Slab<T> so the first one fits exactly in 48 bytes.

Note: we tried other general purpose allocators like tcmalloc and mimalloc, but it's easy to do better on our workloads, measured with uftrace, cachegrind, perf, etc. I like that we have ~800 lines of code for our entire allocator and garbage collector, not 30,000!

Some color on how we arrived at this:

benchmarks/uftrace shows that 92-97% of objects fit in the 2 pools.
benchmarks/gc-cachegrind shows that the pool allocator approximates the bump allocator. That is, for each workload, the two mut+alloc lines are close together:
- 45.6 and 49.6 million irefs for parse.abuild
- 62.2 and 65.0 million irefs for ex.compute-fib
- A bump allocator is incompatible with our non-moving GC, because it would be too expensive to fit new objects into holes created by deleted objects.
benchmarks/osh-runtime shows new columns: num in pool 1, num in pool 2
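To illustrate why a pool works with a non-moving collector where a bump allocator doesn't, here's a toy Python model of a fixed-size cell pool with a free list. It is not our C++ code: the `CellPool` class and its methods are made up for this sketch, and it raises when the block is full, where a real allocator would presumably grab another block.

```python
class CellPool(object):
    """Toy model of a fixed-size cell pool with a free list."""

    def __init__(self, num_cells):
        self.num_cells = num_cells
        self.next_unused = 0   # bump pointer for never-used cells
        self.free_list = []    # indices of cells released by the collector

    def allocate(self):
        if self.free_list:
            return self.free_list.pop()   # reuse a hole in O(1)
        if self.next_unused < self.num_cells:
            index = self.next_unused
            self.next_unused += 1
            return index
        raise MemoryError('pool block is full')

    def free(self, index):
        self.free_list.append(index)


pool = CellPool(4)
a, b, c = pool.allocate(), pool.allocate(), pool.allocate()
pool.free(b)              # the sweep phase frees a dead object
d = pool.allocate()       # lands in the hole left by b
print(a, b, c, d)         # 0 1 2 1
```

Because every cell in a pool has the same size, any hole fits any new object of that size class, so reuse is a constant-time pop. A pure bump allocator only ever advances its pointer, so it has no cheap way to reuse the hole left by `b`.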

So allocation is no longer a bottleneck, and GC rooting now sticks out as the next thing to optimize. There was also an unrelated slowdown due to more entries in our slow Dict, so we need to write a real Dict.

Some parts of the runtime are still "hilariously unoptimized", but we're making progress.

Under the hood: C++ Translation

As mentioned, the spec-cpp test delta for OSH is now 2, and for YSH it's 77.
Implemented osh --tool and ysh --tool, to expose various tools to the command line.
- For example, osh --tool cat-em stdlib/math.ysh prints a file from our nascent standard library.
Embed the git commit in the binary, and show it in --version.

Tightening up our metalanguage:

mycpp now disallows global class instances.
- This fixed a crash due to a global Token instance without a GC header. It was easy to reproduce and fix with ASAN!
It correctly translates dict literals, including global dicts.
- (It's arguably inconsistent that we allow global dicts and lists, but not global class instances. The design is "demand-driven" right now.)
It now disallows try/finally.
- Python has finally but C++ doesn't, and all of our usages were bugs!
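Issue #1163 (listed in the appendix) replaces try/finally with context managers. Here's a minimal sketch of that pattern in plain Python. The `ctx_Restore` class and the `state` dict are hypothetical names for illustration, not code from the Oils tree.

```python
class ctx_Restore(object):
    """Hypothetical context manager: sets a field, then restores it on exit."""

    def __init__(self, state, new_value):
        self.state = state
        self.saved = state['mode']
        state['mode'] = new_value

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        # Runs on normal exit and when an exception propagates, like finally
        self.state['mode'] = self.saved
        return False  # do not swallow exceptions


state = {'mode': 'normal'}
seen = []

try:
    with ctx_Restore(state, 'strict'):
        seen.append(state['mode'])
        raise ValueError('boom')
except ValueError:
    pass

seen.append(state['mode'])
print(seen)  # ['strict', 'normal']
```

The cleanup lives in one place, and the natural C++ counterpart is a constructor/destructor pair. Whether mycpp emits exactly that shape is an assumption of this sketch.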

YSH Language

Scrubbing the Semantics

I scrubbed the expression evaluator for consistency.

Unary operators like - and ~ do the same string/number conversions as binary operators.
- Remember that YSH has implicit conversions, but I claim we don't have "footguns" because operators aren't as polymorphic/overloaded. For example, + always means numeric addition, and ++ is concatenation, not + as in Python and JavaScript.
Ternary if and unary not are consistent with and or.

More details on issue #1710. There are still a few more behaviors to consider, like tightening up chained comparisons.
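As a rough illustration of the principle (not the actual evaluator), here's a toy Python model in which unary and binary operators share one conversion function, and concatenation is a separate operator. The conversion rule is an assumption: decimal strings become integers and anything else is an error. Floats and list concatenation are left out.

```python
def to_int(val):
    # Assumed conversion rule: ints pass through, decimal strings are parsed
    if isinstance(val, bool):
        raise TypeError('expected a number, got a bool')
    if isinstance(val, int):
        return val
    if isinstance(val, str):
        return int(val)  # raises ValueError for strings like 'abc'
    raise TypeError('expected a number, got %r' % (val,))


def add(left, right):       # models  left + right
    return to_int(left) + to_int(right)


def negate(operand):        # models  -operand
    return -to_int(operand)


def bit_not(operand):       # models  ~operand
    return ~to_int(operand)


def concat(left, right):    # models  left ++ right
    if not (isinstance(left, str) and isinstance(right, str)):
        raise TypeError('++ expects two strings')
    return left + right


print(add('3', 4))        # 7
print(negate('3'))        # -3
print(concat('3', '4'))   # 34
```

Since `add` never concatenates and `concat` never adds, the operator alone tells you which operation runs, regardless of the operand types.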

Top-Level / Interactive Syntax

YSH now allows shell-style myvar=x at the top level. I had disallowed it in favor of var myvar = 'x', but we now have a Language Design Principle of "not breaking the top level".

That is, these commands all work the same way in YSH, although with stricter error handling and no word splitting:

x=~/src

ls /tmp | wc -l 2>/dev/null

PYTHONPATH=. python3 -c 'print("hi")'

What distinguishes YSH is really the compound commands: proc func, if case, and for while, not the top level.

The shell style is easier to type interactively, and is also useful for tilde expansion x=~/src. (YSH expressions don't have an equivalent of tilde expansion, since all strings are single- or double-quoted.)

On the other hand, myvar=x is disallowed in funcs and procs:

$ proc p { myvar=x; echo hi }
           ^~~~~~~
[ interactive ]:1: Use var/setvar to assign in YSH

Remember that the var foo = {age: 10} style lets you use typed data on the right-hand side.

This change allowed us to remove shopt -s parse_sh_assign, which is good because fewer global options is better.

Functions and YSH "Standard Library"

Thanks to Aidan Olsen and Melvin Walls, all YSH functions and methods are now either:

Implemented in typed Python, and translated to C++ (no more CPython dependency, aka "metacircular" hack), or ...
Implemented in YSH itself
- Remember that we're aiming for the harder thing of making YSH reflective and extensible. We want the "whole enchilada", not just a cleaned-up bash.

Aidan:

Implemented the error builtin
Implemented ...rest args to functions (aka "varargs")
Put functions in the variable namespace, not the proc namespace.
Fixed bugs in case arm parsing. (This reminds me that we still have some "smells" around joining the hand-written CommandParser and generated ExprParser.)
Implemented source --builtin math.ysh
- After the change to embed stdlib/math.ysh in the binary
Implemented functions in YSH, like min() max() abs(). We're testing the design of YSH by writing the "standard library" in YSH.
Docs in the new Oils reference

Melvin:

Translated many functions and methods:
- List methods like extend()
- len() glob() split() join() maybe() ...
- Type conversion functions bool() int() str() ...
- Docs in the new Oils reference
Added 1:n range expression (which we plan to change to 1..n)

What's Next? Zulip threads

As mentioned, we're deep in the middle of designing and implementing YSH. Here are Zulip threads that may be good starting points — they link to other threads.

#language-design > Remaining Parsing Changes
- Some upcoming breakages, similar to Oils 0.16.0 - Breaking Renames and YSH
#oil-dev > YSH minimal Roadmap is the minimum we should do for YSH:
1. Translate all the code to C++. This is basically done, except for J8 Notation.
2. Write our flag parser and test framework in YSH. These are good tests of the language, and require #language-design > proc argument binding, among other things.
3. We have to document everything in the Oils reference.
These 3 milestones are a lot of work, but definitely doable. We'll at least have a small, composable, "algebraically closed" YSH.
#language-design > YSH 2023 lists even more YSH features to implement. One way to think of it is that we're stuffing features from all these language categories together:
- Python/JavaScript/Ruby/Lua
- R/awk (tables, and streams of rows)
- sed/Perl, Make
- Lisp, JSON/YAML - Hay declares data with the same syntax as code.

It all fits, and is useful! (Paradox: shell encourages polyglot programming, but YSH could make scripting more "monoglot". Reducing language cacophony seems overdue in shell.)

Design Issues In Flight

Here's a random sampling of more detailed Zulip threads. They reflect what we're thinking about.

We're nailing down design details as we implement features. I know it's dense, but informed feedback helps!

Decided:

#language-design > append: method, func, or builtin.
- append() is now a method on List, not a free function.
- We'll also wrap it with a builtin command, for "word" syntax: append :mylist *.py
#language-design > camelCase API naming convention
- We should have a short style guide for kebab-case procs, and snake_case vars, etc.
#oil-dev > Overflow Detection for N-bit signed integers
- We'll implement simple overflow detection, so that later we can compatibly introduce BigInt. This will knock off 2 remaining spec-cpp differences.
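To make the overflow idea concrete, here's a minimal sketch of checked addition. The 64-bit width, the `add_checked` name, and raising an exception are all assumptions for illustration; the thread doesn't fix those details here.

```python
def add_checked(a, b, bits=64):
    """Add two signed integers, raising OverflowError if the result
    does not fit in `bits` bits (two's complement range)."""
    lo = -(1 << (bits - 1))
    hi = (1 << (bits - 1)) - 1
    result = a + b  # Python ints are arbitrary precision, so this is exact
    if result < lo or result > hi:
        raise OverflowError('%d + %d overflows %d-bit signed' % (a, b, bits))
    return result


print(add_checked(1, 2))  # 3

try:
    add_checked(2**63 - 1, 1)
except OverflowError as e:
    print('error:', e)
```

The compatibility argument is that any program that would overflow fails today, so returning the exact big result later doesn't change the behavior of any program that currently works.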

Procs and funcs continue to be a difficult design issue. These threads cover a large chunk of our work in the next few months:

#language-design > Call split() strip() as both func and method?
- We may want some kind of "universal function call syntax", to support both strip(x) and chaining x -> strip() -> upper().
- Once we figure out the rules, it should be easy to implement.
#language-design > Unifying Proc and Func Params
- Unify the evaluation of default args. Avoid Python's mutable=[] pitfall.
- Procs have 4 sections of params, not 3: word, positional-typed, named-typed, block arg.
#language-design > value.IO ideas
- Aidan wrote a script to test the language, and it exposed the confusion with proc and func.
- We may go for a functional style, where you have to pass an instance of value.IO into functions to start processes, e.g. $(date) and ls | wc -l. This is a way of making funcs pure.
#language-design > func evaluator without redirects and $?
- There's an argument for implementing a separate, faster evaluator for func and Hay. It's more of a pure evaluator.
- On the other hand, funcs and procs share the same parser, and thus have the same syntax.
#language-design > N ways to "return" a value.
- It's helpful to write something about procs and funcs from the user point-of-view. Also see FAQ: proc main vs. func main
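For readers who haven't hit it, the mutable=[] pitfall mentioned under "Unifying Proc and Func Params" is this Python behavior. The function names are made up for the demo:

```python
def append_bad(item, bucket=[]):
    # The default list is created once, when the def statement runs
    bucket.append(item)
    return bucket


def append_good(item, bucket=None):
    # Create a fresh list on each call instead
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket


first = append_bad(1)
second = append_bad(2)
print(second)            # [1, 2]
print(first is second)   # True

print(append_good(1))    # [1]
print(append_good(2))    # [2]
```

The default is evaluated at definition time and shared across calls. Evaluating default args on each call instead is one way a language can avoid this.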

Also important:

#language-design > Things Oils Shipped Without
- Things Rust Shipped Without by Graydon Hoare is a useful way of thinking about language design, so we keep track of similar issues here.
#language-design > Fleshing out stdlib/ dir
- What belongs in the "standard library"?
#language-design > Issue Mixing for and const - Aidan ran into this.
- We decided dynamic const is "weak sauce" inherited from bash, and should be removed. Static const is better, though perhaps harder to implement in a shell.
- We may start with define at the top-level? This also relates to modules and tree-shaking.
#language-design > j"" and b"" prefixes for J8 notation?
- J8 notation has a j"" prefix, and you can consider a b"" prefix as well. I'm not sure this is worth it.
#language-design > Efficient Ninja/Make/HTML/URL escaping
- Is our string processing fast enough to implement these char-by-char functions in "user space"?

Threads related to modules / namespaces / tree-shaking:

#language-design > Deps at Top of File - inspired by Elixir.
#language-design > Arguments against namespaces - Use processes and files as namespaces? We provide ways to detect name conflicts.
#language-design > Avoiding name conflicts

Conclusion

We made progress on all fronts!

Interactive Shell, including auto-completion, job control, and the headless shell
OSH compatibility, YSH features
Documentation
Translation to C++, Performance

This release was delayed by about 2 weeks because I wanted to show Screencasts of an Interactive Shell.

That's the next post. We have concrete demos of things no other shell can do.

In the meantime, feel free to ask questions in the comments!

Appendix: Closed Issues

Some of the work we've done is reflected in these issues. You can also view the full changelog for this release.

#1722	Rewrite FUNCNAME BASH_LINENO BASH_SOURCE and fix minor bugs
#1716	Allow shell assignment foo=bar at the top level in YSH
#1707	Polish location info and fix bugs
#1703	Trying to build CPP release from website, and getting errors from ninja-rules-cpp.sh
#1695	[breaking] GNU readline app name is now "oils", not "oil"
#1687	Initialization with rc files section of getting started doc references old filepath
#1521	globs not expanded in redirects, and we use this in `deps/from-binary.sh`
#1500	popd directory stack empty differs from bash
#1415	Arguments to functions like eval_hay() aren't checked
#1362	Enable try/except/else and try/finally errors in mycpp
#1163	replace try/finally with context managers through the codebase
#1093	No notification when 'sleep 1 &' finishes
#1011	Add is-main builtin
#915	oilshell escapes completion candidates
#532	Improve the help builtin and toolchain

Appendix: Metrics for the 0.18.0 Release

These metrics help me keep track of the project. Let's compare this release with the previous one, version 0.17.0.

Spec Tests

We improved OSH, and the 2 extra failures are TODOs on completion and compexport:

OSH spec tests for 0.17.0: 2087 tests, 1858 passing, 88 failing
OSH spec tests for 0.18.0: 2100 tests, 1869 passing, 90 failing

Translating the help builtin brought the C++ tarball very close to parity:

OSH C++ spec tests for 0.17.0 - 1851 of 1861 passing - delta 10
OSH C++ spec tests for 0.18.0 - 1870 of 1872 passing - delta 2

YSH got a lot of new behavior:

YSH spec tests for 0.17.0: 561 tests, 514 passing, 47 failing
YSH spec tests for 0.18.0: 630 tests, 571 passing, 59 failing

And the C++ tarball is catching up rapidly:

YSH C++ spec tests for 0.17.0: 357 of 514 passing, delta 157
YSH C++ spec tests for 0.18.0: 492 of 569 passing, delta 77

When we write our own JSON library, the delta should be almost zero.

Benchmarks

The parser is faster due to the new pool allocator:

Parser Performance for 0.17.0: 18.2 thousand irefs per line
Parser Performance for 0.18.0: 16.3 thousand irefs per line

Parser memory usage increased slightly, which is a little surprising. My memory is that the pool allocator decreased memory usage, so this could have been something else:

benchmarks/gc for 0.17.0: parse.configure-coreutils 1.83 M objects comprising 62.1 MB, max RSS 68.9 MB
benchmarks/gc for 0.18.0: parse.configure-coreutils 1.83 M objects comprising 65.0 MB, max RSS 69.3 MB

A few more objects are allocated at runtime, partially due to redirect args supporting globs (mentioned above):

Runtime Performance for 0.17.0: 2.32 M and 2.36 M objects allocated running CPython's configure
Runtime Performance for 0.18.0: 2.36 M and 2.40 M objects allocated running CPython's configure

Our fib benchmark got slower, and I tracked down the cause of it. It's due to translating more functions to YSH, and our poor Dict implementation! (Update: Melvin just brought this back down with a simple change.)

benchmarks/gc-cachegrind for 0.17.0 - fib takes 61.6 million irefs, mut+alloc+free+gc
benchmarks/gc-cachegrind for 0.18.0 - fib takes 65.4 million irefs, mut+alloc+free+gc

We still have to catch up with bash here:

Runtime Performance for 0.17.0: 18.5 and 14.0 seconds running CPython's configure
Runtime Performance for 0.18.0: 17.2 and 13.5 seconds running CPython's configure
- bash: 15.1 and 11.4 seconds running CPython's configure

Code Size

OSH got a little bigger (and we still need to add YSH):

cloc for 0.17.0: 20,985 lines of Python and C, and 406 lines of ASDL
cloc for 0.18.0: 21,025 lines of Python and C, 416 lines of ASDL

Translating YSH led to more C++ code in the tarball:

oil-cpp for 0.17.0 - 100,757 lines
oil-cpp for 0.18.0 - 104,155 lines

And more executable code:

ovm-build for 0.17.0: 1.52 MB of native code (with GCC on Debian), 1.51 MB on Ubuntu
ovm-build for 0.18.0: 1.70 MB of native code (with GCC on Debian), 1.66 MB on Ubuntu

Optimizing the GC rooting should bring the binary size down a bit.