Bash benchmarks
24 Mar 2022

When writing bash scripts, one often needs to find ways to do things that aren’t built into bash. String manipulation like lowercase conversion, parsing, removing whitespace… all of it relies on tools/binaries that ship with the OS, but are not part of the language itself.
I’m talking about tools like cut, awk, tr, sed, sort, …
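To give an idea of what I mean, here are some typical one-liners that shell out to these tools (illustrative examples, not taken from the benchmark suite):

```bash
# lowercase a string with tr
echo "Hello World" | tr '[:upper:]' '[:lower:]'           # hello world

# trim leading/trailing whitespace with sed
echo "  padded  " | sed 's/^[[:space:]]*//;s/[[:space:]]*$//'

# grab the 2nd comma-separated field with cut
echo "id,name,email" | cut -d',' -f2                      # name

# sum a column of numbers with awk
printf '1\n2\n3\n' | awk '{s += $1} END {print s}'        # 6
```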
Update: I have now developed benchmarks for both throughput (MB/s) and invocation (ops/sec) speed in my project, combined with all kinds of other improvements, so the content of this article was updated [2022-04-08].
Most of the time, there is more than one way to do something. So the question I would like to answer is: what is the fastest way to do it? Fastest either in the sense of “throughput”, expressed in lines/sec or MB/sec, or of “invocation” speed: how many times can I start up the program sequentially, expressed in operations/sec.
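To make that concrete: lowercase conversion alone can be done in at least four ways, each with different startup and throughput characteristics (a quick sketch; the actual benchmarked variants live in the repo):

```bash
text="Make Me Lowercase"

echo "$text" | tr '[:upper:]' '[:lower:]'     # external tool: tr
echo "$text" | awk '{print tolower($0)}'      # external tool: awk
echo "$text" | sed 's/.*/\L&/'                # external tool: GNU sed (\L is a GNU extension)
echo "${text,,}"                              # bash 4+ parameter expansion, no fork at all
```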
I’ve started a GitHub repo pforret/bash_benchmarks to collect these benchmarks.
In short (a sketch of this harness follows the list):
- I generate a big file of random text (e.g. 10000 lines of 1000 chars each)
- I then run that file through each algorithm 5 times
- I check the time this took, and <filesize in MB> x <invocations> / <totaltime> = MB/s
- I then invoke the program 2000 times on just 1 short string (1 line from the file above)
- I check the time this took, and <invocations> / <totaltime> = ops/sec
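Put together, a minimal harness along those lines could look like this. It is a sketch with hypothetical names, not the actual pforret/bash_benchmarks code, and it uses bash’s coarse $SECONDS timer where the real project measures more precisely:

```bash
#!/usr/bin/env bash
# Sketch of the two measurements: throughput (MB/s) and invocation speed (ops/sec)
lines=10000 width=1000 runs=5 invocations=2000

# 1. generate a big file of random text
LC_ALL=C tr -dc 'a-zA-Z0-9 ' < /dev/urandom | fold -w "$width" | head -n "$lines" > big.txt

algo() { tr '[:upper:]' '[:lower:]'; }    # the algorithm under test

# 2. throughput: push the whole file through the algorithm $runs times
size_mb=$(du -m big.txt | cut -f1)
start=$SECONDS
for ((i = 0; i < runs; i++)); do algo < big.txt > /dev/null; done
elapsed=$((SECONDS - start))
echo "throughput: $((size_mb * runs / (elapsed > 0 ? elapsed : 1))) MB/s"

# 3. invocation speed: start the program $invocations times on 1 short string
line=$(head -n 1 big.txt)
start=$SECONDS
for ((i = 0; i < invocations; i++)); do algo <<< "$line" > /dev/null; done
elapsed=$((SECONDS - start))
echo "invocation: $((invocations / (elapsed > 0 ? elapsed : 1))) ops/sec"
```

The `elapsed > 0 ? elapsed : 1` guard just avoids a division by zero when a run finishes within the one-second resolution of $SECONDS.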
Please find all the posts in this series under the bash-benchmark tag.