Oils 0.17.0 - YSH Is Becoming Real

OSH runs existing shell / bash scripts, often unmodified.
YSH is the shell with tYped data, influenced by pYthon.
Oils is the whole project.
- You can use OSH by itself, YSH by itself, or upgrade from OSH to YSH.

Release Highlights

What's new? The previous release was Breaking Renames and YSH, and it prepared the codebase to implement YSH. So this release is a checkpoint along the way.

Aidan Olsen implemented core YSH features.
- Evaluate the YSH case statement on typed data, as shown in Sketches of YSH Features.
- Evaluate "method" calls like mystr->strip().
- Parse and evaluate simple pure functions, e.g. the func keyword and return (expr). This is still in progress.
Peter Debelak tested the C++ tarball on OS X, and fixed several build issues. This is great work, and I hope to hear from people who have run Oils on OS X and BSDs! Please report bugs if it doesn't work.

It compiles in ~30 seconds, requiring only a C++ compiler and a shell (no Make tool). I plan to publish a screencast of this.
We translated the YSH expression evaluator to C++, which makes it more real. This led to bug fixes, and to tightening up language semantics (described below).

357 of 514 tests pass in C++, compared to 185 of 479 in the previous release.
Reduce the number of GC objects allocated by the interpreter, which made it faster (CPU usage) and smaller (memory usage).

For example, running CPython's configure went from allocating 3.37 M objects to 2.32M objects, a decrease of 31%. Compared to our December baseline, it's a decrease of 42%.

The biggest win was shortcutting the word evaluator in common cases like bare-word and 'single quoted'. We also introduced object pools for frequently created objects.

Clarity in the Design

What's happened lately?

June was supposed to be "the month of docs". I had planned to rewrite the help builtin and re-organize our documentation. We need a place to record all the changes we're making!

That didn't happen, but I did write five blog posts about the design of YSH. They clarified what exactly we should work on, out of the seven features in YSH:

Python-like functions, both builtin and user-defined.
Shell-like procs — mainly the rich argument binding.
JSON-based data languages. I put up scaffolding in the codebase, which I'm happy with. We will reuse some of the old QSN implementation, and rewrite some of it.

(Stupid slogan I thought of for Oils: Imagine if bash, Python, and JSON kissed.)

A Stable Core, with External Growth

After writing those posts, and seeing Melvin's work on YSH, I realized we need to write more code in YSH, as opposed to typed Python. It's less verbose, and it's a good test of the language.

That is, I hope there will be a small part of YSH that's stable, and a larger part can grow for years. Examples:

Builtin functions like max(), sum(), any() can be written in YSH. They are sugar on top of >, + and or.
- CPython implements these in C, although they could be implemented in Python.
- Thanks to Albin Otterhäll for trying to write YSH, and reporting experiences on Zulip. I feel like we're a bit "behind" some expectations, but concrete experiences help a lot. I also remember a few people who ran into the lack of functions back in December/January. We're working on the holes you bumped into!
On the other hand, functions like len() and Bool() are "intrinsic".

This layering also applies to builtin proc as well as func: We should be able to write both a test framework describe and a flag parser argparse in YSH itself. You can see examples in Sketches of YSH Features.

OSH and YSH Have More Distinct Data Structures

Another point of clarity is the runtime relationship between OSH and YSH: their data types are now more distinct.

This is mostly so we can continue to increase bash compatibility ("conceding to reality"), without messing up the semantics of YSH.

(Aside: We've long understood the relationship at parse time: YSH is largely a mutually recursive expression sublanguage weaved into shell. Though Aidan found some good bugs here while implementing the YSH case statement, so it may change a bit.)

Here's some technical detail on the core data types. Building on Melvin's work, I statically-typed and translated the YSH expression evaluator for this release. It had previously relied on PyObject*, i.e. the "metacircular hack".

It now uses a central value_t type, expressed with algebraic data types in Zephyr ASDL. For example, this is what POSIX shell looks like:

value = Undef       # for ${x:-default} etc.
      | Str(str s)  # Everything is a string

This is what bash looks like:

        ...
      | Str(str s)
      | BashArray(List[str] strs)
        # quirk: a bash array is more like Dict[int, str] !
      | BashAssoc(Dict[str, str] d)
      ...

This is what YSH looks like:

        ...
      | Null  # e.g. for JSON
      | Int(int i)
      | Float(float f)
      | List(List[value] items)
      | Dict(Dict[str, value] d)
      ...
      # omitted: Eggex, Func, Proc, etc.

The main change is that I thought we would unify sequences and maps:

BashArray and List
BashAssoc and Dict

But again, I don't want future OSH quirks to affect YSH. Practically speaking, what this means is that OSH and YSH have mostly separate types and operations. You use YSH operations with YSH types:

$ var mylist = ['README', 'foo.py']  # value.List

$ echo @mylist  # YSH splice works
README foo.py

$ echo "${mylist[@]}"  # bash splice doesn't apply
  echo "${mylist[@]}"
        ^~
[ interactive ]:15: fatal: Invalid type value.List: ...
... Can't substitute into word

You also use OSH operations with OSH types. This shouldn't be a big deal because the most common Str type is shared and thus seamlessly interoperable.

$ declare -a array=(README foo.py)  # value.BashArray

$ echo "${array[@]}"  # bash splice works
README foo.py

$ var item = array[0]  # YSH array indexing doesn't apply
  var item = array[0]
  ^~~
[ interactive ]:21: fatal: Invalid type value.BashArray: ...
... subscript expected Str, List, or Dict

Also, features like param passing will "just work". You can copy from bash arrays to YSH lists, and vice versa.

The exact set of valid operations on each type can be tweaked based on usage, but we're no longer aiming to "complete the matrix". The interactions are more controlled.

Note that you can write these two styles of syntax in the same file. It's not recommended for new programs, but it may be useful when upgrading from OSH to YSH.

Risks / Open Questions

So we're deep in the middle of implementing YSH, and it's taking a nice shape. What are the remaining risks?

Past Risks

Let's look back 3 years to Technical Issues and Risks (2020). We're past the issues I enumerated:

"The main risk is memory management". Funded by our first NLnet grant, I got our garbage-collected runtime working in January, with essential help from Jesse Hughes.

It's also fair to say that the project's small amount of C++ code was a big mess before the first grant. The whole translation process was an experiment — almost a research prototype. But we're now past that phase.
"I'm deferring all the issues related to the interactive shell". I was trying to reduce scope of the project, since I knew it was too big.

Starting late last year, Melvin Walls pretty much single-handedly revived it, again funded by NLnet. This reminds me that I need to write a blog post with screencasts showing our interactive shell in pure C++. It can run virtualenv, bash completion, git prompts, and more.
pgen2 parser generator. Melvin translated this to C++ a few months ago.
Remove the "metacircular hack". Melvin and I solved most of this problem. As of this release, it's more than halfway done. There's more work, but we'll finish it.

So it's clear that our two NLnet grants (April 2022 and February 2023) have been critical. The project really needs concentrated attention. I welcome casual contribution, and I want to increase it, but we also need sustained contribution.

(As always, you're welcome to join https://oilshell.zulipchat.com/ and ask questions!)

So the main risks are that we won't have enough help, or that our funding runs out. There's a lifetime limit of 4 grants from NLnet, which definitely seems like enough to get the project off the ground, but we shouldn't take it for granted.

A related issue is that I've been "heads down" for a couple months, deep in the design of YSH. And I expect to be deep in documentation for the next month. But I also want to work on finding more people to work on the project.

I'm thinking of writing a blog post How are programming languages funded? I've noticed a common misconception that Python was Guido van Rossum's hobby project. This isn't true, since it's had a small amount of funding for most of its life, including from the US government early on.

So the "administrative" parts of a project definitely matter. A little funding goes a long way.

Key Questions

Another question that's on my mind:

Can YSH be a bounded design?

That is, can there be a stable core that supports "infinite" growth? This is essentially the idea behind the narrow waist blog posts.

It seems like it, but the only way to find out is to implement YSH. Luckily, this seems very feasible. I'm happy that the ysh/expr_eval.py file is only 1435 lines after static typing! That means we no longer depend on the Python interpreter, so its weight "doesn't count".

This brings to mind another risk:

Is the language too big?

Does it make sense to stuff together all this functionality from shell, Python, JSON, and TSV together? Is it too big to document?

To be honest, it certainly feels big, because it's a lot of work.

But the whole program is still small! I would say it's really small for the amount of work it does.

In April 2019, OSH was 26K physical lines of code.
- Compare with bash at 142K lines.
Current line counts:
- OSH is now 40K physical lines
- YSH is under 6K lines, which may rise to ~10K lines when "done".
- Our C++ runtime is 7K lines now, which may rise to ~10K lines as well.

This is pretty surprising! We'll have a shell with much more power and functionality than bash, at less than half the weight. I'll publish updated line counts when the interpreter is fully translated to C++.

Programmers adopt platforms, not languages.

I want the project to be self-sustaining, and language projects rarely are. What we really care about is operating systems and platforms (Unix, the web, the cloud, etc.)

Shell is interesting because it's arguably the language that's closest to the Unix operating system.

I may write a separate post about this. I guess the bottom line is that we still need to do things with YSH. We are overflowing with ideas, but again short on people.

Again, feel free join us on Zulip. Most people find it "dense", but asking the right questions is a great way to spread knowledge. The codebase is taking its "final" shape as well, so it should be easier to change.

Closed Issues

The issues represent some of the work we did:

#1658	--gc-sections not supported by ld on macos
#1657	READLINE_DIR is not used in build/ninja-rules-cpp.sh
#1656	HOST_NAME_MAX doesn't appear to be defined in macos
#1643	case NEWLINE crashes because newline accepted as pattern
#1092	Crash in ${a[0]} array evaluation
#954	${x:-default} when x is an integer fails with NotImplementedError
#840	Bug in integer / string conversion
#741	Fully nested data structures
#636	Oil expression evaluator shouldn't be "metacircular"

What's Next?

As mentioned, I want to overhaul the help builtin and documentation. It's great to have contributors working on YSH while that happens. A few years ago, progress on the code would grind to a halt whenever I wrote docs or a blog post!

And I need to do more on the "administrative" side of the project, which is easy to neglect. Please sponsor us if you appreciate this work. We use the money to onboard contributors before they're added to the grant:

Blog Backlog

Screencasts of an Interactive Shell. showing off what Oils can do.

Lower priority:

Oils vs. Crafting Interpreters. A few contributors have requested an overview of the architecture. The codebase is more mature, and I want to attract more contributors, so this makes sense.
History of Unix on One Page: From AT&T to Clusters. I've noticed a whole bunch of misconceptions about the history of Unix and F/OSS, so it would be nice to write a short summary of "what Unix users should know".
- "Unix is our Gilgamesh epic" — Neal Stephenson
How are programming languages funded? Mentioned above.
- Somewhat related: many popular languages have Danish or Scandinavian origin: C++, PHP, TypeScript, and more.

Appendix: Metrics for the 0.17.0 Release

These metrics help me keep track of the project. Let's compare this release with the previous one, version 0.16.0 from June.

Spec Tests

Not much work on OSH:

OSH spec tests for 0.16.0: 2084 tests, 1856 passing, 86 failing
OSH spec tests for 0.17.0: 2087 tests, 1858 passing, 88 failing

But I did fix a slight regression in translation:

OSH C++ spec tests for 0.16.0 - 1847 of 1859 passing - delta 12
OSH C++ spec tests for 0.17.0 - 1851 of 1861 passing - delta 10

Lots of work on YSH:

YSH/Oil spec tests for 0.16.0: 525 tests, 479 passing, 46 failing
YSH/Oil spec tests for 0.17.0: 561 tests, 514 passing, 47 failing

Especially making it work in C++, mentioned above:

YSH/Oil C++ spec tests for 0.16.0: 185 of 479 passing
YSH/Oil C++ spec tests for 0.17.0: 357 of 514 passing

Benchmarks

Parsing speed remained the same, despite some changes for YSH:

Parser Performance for 0.16.0: 18.2 thousand irefs per line
Parser Performance for 0.17.0: 18.2 thousand irefs per line

Also no change:

benchmarks/gc for 0.16.0: parse.configure-coreutils 1.83 M objects comprising 62.1 MB, max RSS 69.1 MB
benchmarks/gc for 0.17.0: parse.configure-coreutils 1.83 M objects comprising 62.1 MB, max RSS 68.9 MB

Many fewer allocations on a real workload:

Runtime Performance for 0.16.0: 3.37 M objects allocated running CPython's configure
Runtime Performance for 0.17.0: 2.32 M objects allocated running CPython's

Which appeared as a big speedup on the ex.compute-fib benchmark:

benchmarks/gc-cachegrind for 0.16.0 - 83.7 million irefs, mut+alloc+free+gc
benchmarks/gc-cachegrind for 0.17.0 - 61.6 million irefs, mut+alloc+free+gc

Wall times:

Runtime Performance for 0.16.0: 32.1 and 19.3 seconds running CPython's configure
Runtime Performance for 0.17.0: 18.5 and 14.0 seconds running CPython's configure
- I changed machines, so these numbers are not comparable.
- bash: 15.3 and 11.2 seconds running CPython's configure. We still have to catch up with bash.

Code Size

I need to update these metrics to include YSH as well as OSH:

cloc for 0.16.0: 20,732 lines of Python and C, 396 lines of ASDL
cloc for 0.17.0: 20,985 lines of Python and C, and 406 lines of ASDL

We translated more of YSH, resulting in more C++ code in the tarball:

oil-cpp for 0.16.0 - 97,233 lines
oil-cpp for 0.17.0 - 100,757 lines

And more executable code:

ovm-build for 0.16.0: 1.42 MB of native code (under GCC)
ovm-build for 0.17.0: 1.52 MB of native code (under GCC)
- on Debian, vs. 1.51 MB on Ubuntu