Why Sponsor Oils? | blog | oilshell.org
This is an update of the popular 2018 FAQ. Details have changed, but the spirit of the project is the same.
Whenever I announce a new Oil release, some readers are confused by the project.
This post explains the project's motivation from several perspectives. Unix shell is an old and successful technology, so there are many ways of looking at it.
Before explaining why I created Oil, let's review what it is. You can think of a Unix shell in two ways:
#!/bin/sh
.In this document, we'll think of Unix shells as languages. The Oil project actually has two languages: OSH and Oil. Let's define these terms, along with two others for context:
bash
,
dash
, ksh
, etc.bash
. The goal is to run existing
shell scripts. It's done so since January 2018, and has matured
in many regular releases since then.x=1
different than x = 1
?)More about the Oil language:
The best description of the project is on the home page:
Oil is our upgrade path from bash to a better language and runtime.
This section paraphrases questions I've received and summarizes the answers. In most cases, I link to the original comment thread, which you can read for details.
bash
or zsh
?Oil is taking shell seriously as a programming language, rather than treating it as a text-based UI that can be abused to write programs.
To see why this is valuable, consider these two groups of shell users:
Oil is aimed at group 2. If you're in group 1, there's admittedly no reason to use it right now.
However, group 2 writes scripts for group 1 to use! So I believe the benefits of Oil will eventually bubble up.
In other words, I'm building a solid foundation for a few more decades of shell usage.
It's important to be compatible with existing code. You might not personally use shell as a programming language, but all Unix users still rely on big shell programs. It's often used at build time, but it's still used at runtime too, e.g. on embedded Linux devices.
Some of this code is old, but much of it is new. It's not a small amount of code, either. Examples:
Original question and answer (reddit.com)
There are three problems with that:
It won't work. It would be like trying to convince productive PHP programmers not to use PHP. Many people have wasted breath on that, but important sites like Wikipedia are still written in hundreds of thousands of lines of PHP.
Like PHP, the shell language is useful, ubiquitous, flawed, and being improved.
Even if a new line of shell never gets written, there will still be a huge installed base of shell scripts that you may need to understand (e.g. when they don't work).
Shell is still the best tool for many jobs. Most new "cloud" projects rely on Linux system images, in VMs or containers, and shell is the ideal language for creating such images. Even if you use a framework like Chef or Docker, you're still using bits of shell.
The biggest misconception about shell is that you write only in shell. Shell is about polyglot programming. It's a language that grows.
It's clear that I like Python because:
But I also use C, C++, JavaScript, R, and dozens of DSLs like SQL and HTML. They're often the best or only reasonable language for a particular task (for speed, available libraries, or runtime environment). I also use programs written in languages that I don't know how to read or modify.
Factoring into heterogeneous processes is a software design skill, particularly in the domain of distributed systems.
That said, rewriting a shell script is reasonable in some circumstances. If everyone on your dev team knows Python, maintaining a shell script can be more costly than maintaining a Python script.
But Python and Ruby aren't good shell replacements in general. Shell is a domain-specific language for dealing with concurrent processes and the file system. Python and Ruby have too much abstraction over these concepts, sometimes in the name of portability (e.g. to Windows). They hide what's really going on.
I encountered a nice blog post, Replacing Shell Scripts with Python, which, in my opinion, inadvertently proves the opposite point. The Python version is longer and has more dependencies. It's more difficult to write and maintain.
It's true that Perl is closer to shell than Python and Ruby are. For example,
the perl -pie
idiom can take the place of awk
and sed
. However, Perl isn't
an acceptable shell either:
my_shell_func 2> err.txt
, where my_shell_func
can invoke
both functions and external commands?grep()
, but the real grep
is better for many
problems.Also:
sed
and awk
either.However, it's true that, in some respects, Oil is retreading the same ground as Perl. But Oil is more faithful to shell, and its syntax uses fewer punctuation characters. In other words, it's less like "line noise".
Threads:
You might be angry because you had to maintain a nasty shell script written by a coworker.
If that's the case, you should be helping Oil succeed! The only way to "kill bash" is to:
This is analogous to how Facebook is moving away from PHP by developing a similar, but cleaner, language called Hack.
Perl, Python, and Ruby have all existed for over 25 years, but they haven't replaced shell. New shell scripts are being written every day.
(Oil also has some similarity to CoffeeScript, which smoothed over some of JavaScript's rough edges and added syntactic sugar, but didn't stray from its core execution model. CoffeeScript was a success because it influenced subsequent versions of JavaScript.)
I've seen this suggestion a lot, and there are entire books devoted to it. If your script is small, it may be a reasonable goal.
For bigger programs, limiting yourself to POSIX is not just inconvenient, it's also an ill-defined and virtually untestable concept. Evidence:
The Debian project created a minimal shell variant called dash, which
replaced bash. However, dash
and the Debian Policy on Shell
Scripts both allow non-POSIX features, like local
. Local
variables are essential for writing maintainable shell scripts, but POSIX
doesn't mention them.
Most shells parse assignment builtins differently than other builtins, but the POSIX shell grammar has no notion of an assignment. This issue surfaced as early as 2010, but as of 2018, it's not in a published spec.
This issue isn't theoretical — bash and dash differ in practice, but POSIX doesn't specify which is correct.
In other words, POSIX is incomplete and out of date. (However, I've discovered that shells are highly conformant with respect to things the standard does specify.)
As of 2021, I believe that OSH is a "better POSIX". POSIX is a descriptive specification and not a normative one. That means that it's an observation of how popular shells like ksh and bash happened to behave at a certain time. In other words, it's a compromise.
Similarly, OSH is based on extensive testing of the behavior of bash,
dash, mksh, zsh, and busybox ash
. That is, it uses the
same philosophy as POSIX, but it specifies more of the language. Roughly
speaking, spec tests are an executable specification.
POSIX shell has global options that affect parsing and execution like set -o noglob
. And bash extends them with options like shopt -s lastpipe
.
Oil continues in the same vein, adding several shell options under shopt
.
However, they are organized into 2 binaries so you don't have to remember each
one:
bin/osh
runs your existing shell scripts.
shopt -s oil:basic
, a
large group of options. It won't break too much.bin/oil
is a brand new language. It's equivalent to shopt -s oil:all
.Historical note: I prototyped an OSH-to-Oil translator in 2017, but it's on hold for now. Contributors are welcome to try to revive it.
Making shell a good programming language is a prerequisite for creating a good interactive shell. See posts tagged #interactive-shell for details on Oil's unique support:
Anyone who's written a bash completion script from scratch can also see this.
Shell doesn't have true functions, so bash's mechanism involves reading global
variables likes $COMP_CWORD
and mutating ones like $COMP_REPLY
.
Although I'm a Vim user, I'm sometimes jealous that Emacs has a better programming language for customizing the UI. Like Emacs, I expect that many of Oil's interactive features will be written in Oil, and not C or Python.
It's more accurate to say that Oil is written in a collection of DSLs based on Python:
In other words, the interpreter is written in high-level code for correctness, but compiled to native code for speed. You could say that Oil is an experiment in #metaprogramming. It's similar to PyPy in spirit, but not in the details.
This unusual style affects Oil developers, but not end users. End users get a source tarball that is compiled with a normal C++ toolchain.
These posts explain our use of Python in more detail:
In the distant future, Oil's metalanguages may be evolved into the Tea language, i.e. a form of bootstrapping. But you'll still get a "normal" source tarball.
The questions below are inspired by threads like this one from Reddit.
See Why Use Oil? — I keep it up to date.
As of February 2021, Oil is best used as a dev tool alongside bash or POSIX shell. It will help you write better programs.
You can use Oil in production, but it's slow in some cases. The upcoming C++ translation will fix that.
I encourage people to test Oil, and to give feedback on the Oil language! I also recommend it to sophisticated shell users who understand Four Features That Justify a New Unix Shell.
This program prints Hello world
:
var name = 'World'
echo "Hello $name"
Here's a more complex program inspired by a lobste.rs
thread.
It deletes git branches that are merged, except for master
:
git branch --merged | while read --line {
var line = _line.strip()
# The * prefix means the branch is unmerged
if (line != 'master' and not line.startswith('*')) {
echo $line
}
} | readarray -t :branches
if (len(branches) == 0) {
echo "No merged branches"
} else {
git branch -D @branches
}
# $ git branch --merged
# master
# merged1
# * in-progress
# merged2
Notice these differences between Oil and shell:
do
/ done
and if
/ fi
[[ ${#branches[@]} -eq 0 ]]
var
keyword introduces typed variablesread --line
, which fills the $_line
variablereadarray
works at the end of a pipe because shopt -s lastpipe
is on by default:branches
has a :
sigil to remind you that a variable is being mutated@branches
for splicing instead of "${branches[@]}"
(related blog
post)I plan to port this snippet to more languages and write a longer post about it.
(I also think readarray -t
should be read --lines
.)
More code examples:
Yes, this is a problem. I commented on it in the last post. Oil should have some combination of these two solutions:
Note that Python, Ruby, and JavaScript also have dependency problems, and a better shell will help us fix those!
This section will help you understand Oil vs. other shells. Here are the primary differences:
You may be happy using another shell! Feel free to let us know what you like, and maybe analogous features can be integrated into Oil.
Fish is a good interactive shell, but a poor language for automation. So it's the opposite of Oil, and complementary to it.
It would make sense to combine the projects in some way:
Related:
$(command sub)
.I love fish as a shell, I just don't think it's good as a scripting language
See this section of the previous post. PowerShell is more natural on Windows, and it's embedded in a large VM.
I link to dozens of other shells on this wiki page:
Oil has taken cues from some of these projects (and vice versa). Again, the biggest difference is that Oil is the only shell that's a smooth upgrade from bash.
This post about Linux distros gives color on why Oil aims to be compatible. Most distros are based on shell, with languages like Python being either nonexistent or playing an ancillary role.
I also maintain a list of dozens of shell DSLs in various languages:
A lot of effort was spent on these projects, so I have no doubt that they're useful. Embedding has advantages in some situations.
But I don't consider any of these libraries to be bash or shell replacements. A shell is a foundational, low-level component of a Unix system.
Leave a comment if there is something you don't understand, and I'll answer and possibly update the FAQ.
In addition to the simplest explanation and the idioms doc mentioned above, see:
Again, we're removing the problems with shell, preserving the good parts, and gracefully upgrading it!
Future blog posts can explain Oil in different ways.
libc
kernel interface. A sandboxed
interpreter can be used for evaluating config files and untrusted code.