Jay Taylor's notes
back to listing indexBenefits of using Golang for ad-hoc code: Leatherman
[web search]Benefits of using Golang for ad-hoc code: Leatherman
I recently stumbled upon a pattern that motivates me to write little scripts in Go instead of my normal default. I was surprised at some of the benefits.
If you can’t be bothered to read this whole post, let me frontload the gist:
Building little tools with Go will:
- supply you with first class libraries
- supply you with powerful, well supported tools
- amortize the cost of non-core libraries (to zero)
- work at runtime on almost any machine you are likely to use
- support concurrency and parallelism in an intuitive and performant manner
- encourage and support testing
I have blogged before about my automated dotfiles setup, which I use constantly. With this setup I have almost 100 (!!!) little tools at my fingertips included by my dotfiles. The majority of them are either straight shell or Perl with no dependencies. Those scripts work well and I do not expect to change them any time soon.
On the other hand, there are a couple dozen scripts with non-core dependencies. So those scripts either do not work on a remote box or suddently break on my laptop if I install a new Perl version.
I wanted an excuse to get more experience writing Go, and this seemed like a possible path forward, except I didn’t want to have to pull down two dozen binaries, as that would almost surely be tens of megabytes. My coworker Aaron Hopkins gave me a cool solution to that and I’ve run with it.
🔗 The Multitool
Instead of building fifteen little tools, each ending up being about a megabyte
or two because of the static linking, I build one giant tool, that acts
differently based on $0
. This may sound unusual if you’ve never heard of it,
but it’s actually quite common. busybox
is a great example; if you run
busybox ls
it will run its ls
applet. In the same way, if you make an ls
symlink to busybox
and call that, it will know to act like ls
.
So I wrote a tiny little go dispatcher to handle my tools:
package main import ( "os" "path" ) var Dispatch map[string]func([]string) func main() { which := path.Base(os.Args[0]) args := os.Args Dispatch = map[string]func([]string){ "addrspec-to-tabs": AddrspecToTabs, "clocks": Clocks, "csv2json": CsvToJson, "debounce": Debounce, "render-mail": RenderMail, "help": Help, "explode": Explode, } if which == "leatherman" && len(args) > 1 { args = args[1:] which = args[0] } fn, ok := Dispatch[which] if !ok { Help(os.Args) os.Exit(1) } fn(args) }
As you can tell, at the time of writing I have migrated four tools
(addrspec-to-tabs
, clocks
, csv2json
, and render-mail
,) and added a
script that was too hard to keep in my kit before (debounce
, more on that
one later.)
The tool is called Leatherman and I plan on improving the documentation as I go.
🔗 Unexpected Benefits of Writing “Scripts” in Go
There were a few things that jumped out at me immediately while I pursued this project. The first was that the standard library in Go is top notch. Of course I am mostly comparing to Perl and Python here, which are both much older than Go and have far less commercial support.
A good example might be net/mail
package. In Perl, I used one module to parse
the email (Email::MIME
), another module to parse the address headers
(Email::Address
), and yet another module to parse the Date header
(Email::Date
.) None of those perl modules are built in. In Go these were all
built into the same core package and exposed in an obvious and performant way.
A second, related benefit is that Go has some incredible tooling. I suspect
that this is due to a combination of having commercial support and because good
tooling was a first class design concern. A fascinating example can be found in
this blog post about new trace features in the latest release of
Go. That’s really just one of many examples though. I am not sure
I can catalogue all of the various handy tools, though I have used some of the
ones mentioned in the post above when debugging deadlocks. Anyone who writes go
probably uses gofmt
, which arranges source in the canonical format so there
aren’t annoying discussions about whitespace. gofmt
also has syntax
manipulation for replacing syntax patterns (as opposed to string patterns) in
your entire source tree.
Another nice benefit of Go, and this could apply to Rust and C/C++ with the
right compiler flags, is that binaries are static, and thus do not need runtime
dependencies (aside from typically libc for resolvers.) For me this means that
some tools, like csv2json
, actually work on boxes that are not my laptop; when
it was a Perl script I had to have certain non-core modules installed.
Because of this the binaries produced by Go are actually more portable than many scripts. This means that some scripts that tend to depend on either overly new or rare packages can be rewritten in a way that is more likely to work on less up-to-date machines.
Another first class feature of Go (and hopefully any other language released in the current generation) is concurrency and parallelism. While Perl, Python, and other common languages can do asynchronous programming for concurrency or forking (and maybe threading) for parallelism, those features are either not included in the main language or are haphazardly bolted on. The combination of lightweight threads and channels makes parallelism and asynchrony more integrated and natural. I’ve written about the results of this before.
The final benefit I’ve found is the solid testing infrastructure. On the one
hand this only feels like a benefit because Go requires more ceremony than
Perl or Python; a Perl script written in the Go style is just as testable, but
because Go has to be run this way anyway, Go programs have better testability
out of the box. Basically with go you can have a file called ${foo}_test.go
and it has access to all of the private data of the $foo
file. Then you
basically write methods called TestWhatever
and those are run at test time,
able to test private methods. Very handy.
🔗 On the Other Hand…
Go is definitely not perfect. I understand that the designers of the language
have an emphasis on simplicity, and I appreciate that, but the lack of generics
is pretty painful. Go has first class functions, but does not have Map
,
Select
or other common functional constructs because of this.
The non-core Go libraries are of very uneven quality. I suspect that part of this is because there is no blessed index of Go libraries (like CPAN or PyPI.) With the CPAN inspired index-based model, people write code, get it to a state they are happy with, and release it. The Go model instead assumes that any Git (or I guess Mercurial or probably Bazaar or Subversion) repository referenced is OK to use. The upshot of this is that some modules are great, but some are incredibly bad.
🔗 Debounce
I mentioned debounce
before. Debounce is a tool I have written both in Perl
and in Go. The Perl version was a hassle to use because it required async deps
that are unlikely to exist universally. The Go version has worked fine (and
indeed I wrote it three years ago.)
If you have never heard of debouncing before, it is something you have to do to
avoid “chatter” in physical switches. The use case I tend to run into is that
you want to do something based on a discrete event, but you do not want to do
whatever per event, instead you want to do it after the events stop. Let me
give a concrete example: you could set up an inotify watcher on file system
events and run tests whenever an event comes through. You will quickly find
that even a simple :w
from vi
triggers multiple events.
So what you do is have debounce
watch for the lines, and it reads the first
line and waits up to one second until it releases that first line. Here’s how
you do it:
inotifywait -mr -e modify,move . | debounce | xargs -I{} make test
I have wanted this more than once on a server but it’s always been too much of a hassle to wire up. With the leatherman I have solved this handily.
🔗 Automate It All
First, I have TravisCI set up to both build and deploy the built binary to a github release. It was a little weird because I had to install a ruby gem to build the linked yaml file, but now that it’s set up I don’t think I’ll need to do it again for quite a while. So there are releases for each push. One really cool thing here is that the CI actually downloads all required libraries, so when you use the built binary, you need nothing other than libc.
That’s great, but it gets even better. I don’t want to have to remember to download and install the multitool, so in my dotfiles install script I automated the downloading and installation:
if test ! -e ~/bin/leatherman || older-than ~/bin/leatherman c 7d; then LMURL="$(curl -s https://api.github.com/repos/frioux/leatherman/releases/latest | grep browser_download_url | cut -d '"' -f 4)" curl -sL "$LMURL" > ~/bin/leatherman chmod +x ~/bin/leatherman ~/bin/leatherman explode fi
So the above will install the leatherman if it doesn’t exist, and install the
latest version if it’s over a week old. The leatherman explode
command
installs all of the relevant symlinks for each tool.
I hope demonstrated some of the benefits of using Go for little tools. I have enjoyed writing Go for the most part, and intend to continue writing Go when it makes the most sense. I hope this post inspires you to use Go, Rust, or any other modern, compiled language where it makes sense.
If you want to learn more about programming Go, you should check out The Go Programming Language. I haven’t finished it but intend to once I finish the current tech books I’m reading. At the very least it takes a more instructive approach than most programming books I’ve read, which is helpful for a language like Go that is so distinct from most other programming languages.
I have also heard that Go Systems Programming is very good, though I haven’t read or even purchased it myself.
Posted Fri, Jan 12, 2018