Things I Wish I'd Known About Bash (zwischenzugs.com)
781 points by zwischenzug | 2018-01-06 | 272 comments




My take on the same topic:

- use the unofficial strict mode: http://redsymbol.net/articles/unofficial-bash-strict-mode/

- use parameter substitutions like ${foo#prefix}, ${foo%suffix} instead of invoking sed/awk

- process substitution instead of named pipes: <(), >()

- know the difference between an inline group {} and a subshell ()

- use printf "%q" when passing variables to another shell (e.g. assembling a command locally and executing it via SSH)
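A few quick illustrations of these (host and file names are just placeholders):

    # parameter substitution instead of sed/awk
    file="report.tar.gz"
    echo "${file%.gz}"      # -> report.tar
    echo "${file##*.}"      # -> gz

    # process substitution instead of a named pipe
    diff <(sort old.txt) <(sort new.txt)

    # subshell () vs inline group {}: the cd below doesn't leak out
    ( cd /tmp && ls )

    # printf "%q" when handing a variable to another shell, e.g. over ssh
    arg='a file with spaces and $dollars'
    ssh somehost "touch $(printf '%q' "$arg")"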


While process substitution is great on the surface, it comes with a troubling downside: there is no way that I have found to reliably catch errors. That is, this:

  some-command <(some-failing-command)
...will succeed. And -e and pipefail do nothing here. I have not found any way to push the error up. Plenty of questions on Stackoverflow and elsewhere, no good answers.
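For instance, a minimal reproduction (using `false` as the failing command):

  $ set -e -o pipefail
  $ cat <(false)        # the substituted command fails...
  $ echo $?
  0                     # ...but the overall status is still success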

> pipefail do nothing here

Isn't that because the syntax provided doesn't use a pipe? Which is why actually using a pipe, rather than paren redirection, is easier on the eyes and the shell. It reads more like the flow of text:

    maybe_failing() {
        if ! some-failing-command; then
           echo "That did not shake out" >&2
           return 1
        fi
    }

    maybe_failing | some-command
Is that one of the things StackOverflow said? And if so, why is it not a "good" answer (verbosity concerns aside)?

This requires that some-command takes input from stdin (many commands don't) and that there's only a single input. What about:

  process-stuff --file <(make-input) \
    --extra-data <(make-more-stuff | grep blah)
This is the point at which one reaches for tempfiles, usually.

The point remains that <() was made for a purpose, yet lacks error handling, thus undermining its usefulness to the point of uselessness.

Edit: Isn't substitution using pipes, though? As in FIFO pipes? You get a file descriptor device which is closed at the end of the script. I don't know if the fd is wired directly to the command's stdout or whether the output is written in its entirety to a hidden tempfile first; the former sounds more natural and efficient.


>I don't know if the fd is wired directly to the command's stdout or whether the output is written in its entirety to a hidden tempfile first; the former sounds more natural and efficient.

Nope. <() and >() are equivalent to creating a named pipe with mkfifo and writing to it:

    tmp_pipe_dir="$(mktemp -d)"
    mkfifo "$tmp_pipe_dir/pipe"
    make-input >"$tmp_pipe_dir/pipe" &
    process-stuff --file "$tmp_pipe_dir/pipe"
That's the whole point of it, so that the data can be consumed in parallel with the producer.

Right, that's what I meant. The command is run in a child process with its stdout writing to the pipe. The script gets the read end.

Yes, you absolutely have me on the stdin part.

But, the multiple case example highlights that if (for argument's sake) `make-input` and `make-more-stuff` had run _prior_ to that line, writing their (possibly empty) output to a (file|FIFO), how then would you want bash to behave? It would still open file descriptors to those (file|FIFO)s, which would still be just as blank.

It seems to me that if one wishes for more fine-grained control over the error handling in a multi-subshell-fd-trickery situation, then creating the FIFO(s) and managing the sender's exit status is the supervising script's responsibility.
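For example, a rough sketch of doing that by hand, reusing the placeholder commands from above:

    fifo_dir="$(mktemp -d)"
    mkfifo "$fifo_dir/pipe"
    make-input >"$fifo_dir/pipe" &     # producer writes into the FIFO in the background
    producer_pid=$!
    process-stuff --file "$fifo_dir/pipe"
    consumer_status=$?
    wait "$producer_pid" || echo "make-input failed" >&2
    rm -rf "$fifo_dir"

(With the usual caveat that the producer blocks until something opens the FIFO for reading.)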

> a file descriptor device which is closed at the end of the script

I checked, and it's actually not even at the end of the script; those fds only exist for that one child, as `process-stuff` is exec-ed by bash.

> whether the output is written in its entirety to a hidden tempfile

It's not a temp-file, it's an actual file descriptor, which bash `dup2`s for the subprocess, then cheekily uses the `/dev/fd/63` syntax to make it appear as a file; you can peer into its brain a little:

    $ showme() {
        echo "showme.args=$@" >&2
        the_fd="${1##/dev/fd/}"
        cat <&${the_fd}
    }
    $ showme <(date -u)
    showme.args=/dev/fd/63
    Sat Jan  6 21:31:25 UTC 2018

Thanks so much for highlighting this scenario; I learned a ton about how that works researching this answer. That's why I like answering stuff on S.O., too: win-win

Good point about how to abort. Maybe it would be possible to short-circuit the fd somehow, so that the reader got an EOF, and that would in turn cause failure in the program reading the stream. At the same time, any success code from the program should be turned into an error exit code so the script could detect the failure. Not perfect, but arguably better than no failure at all.
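One workable compromise (losing the streaming that is the whole point of the FIFO) is to run the producer first and only then substitute its captured output:

    if ! output="$(some-failing-command)"; then
        echo "producer failed, aborting" >&2
        exit 1
    fi
    some-command <(printf '%s\n' "$output")

That gives up the parallelism, but the failure no longer goes unnoticed.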

Parameter substitution is far from intuitive. Every time I have to open one of my older shell scripts (>3 months ago), I'm thankful I wrote comments. Otherwise I'd have to man/google things again. I really wish they'd use function names or something, instead (sub, etc.).

My "mnemonic" is that # means prefix, because every shell script starts with a shebang too. From this I can deduce that ## means longest prefix, therefore % and %% mean suffix.
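For example:

    $ f=archive.tar.gz
    $ echo "${f#*.}"     # shortest prefix match removed
    tar.gz
    $ echo "${f##*.}"    # longest prefix match removed
    gz
    $ echo "${f%.*}"     # shortest suffix match removed
    archive.tar
    $ echo "${f%%.*}"    # longest suffix match removed
    archive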

> if [ x$(grep not_there /dev/null) = 'x' ]

This is still wrong if the command can output spaces or meta-characters. You should quote the left operand, and then you don't need to prepend x:

if [ "$(grep not_there /dev/null)" = '' ]


The legacy of autoconf! People look at autoconf's output to learn shell, but autoconf output is full of bad practices, working around bugs in shells that no one's ever heard of.

Autoconf output is expected to run on buggy shells with broken empty-string comparison. Your scripts probably aren't.


Because when displayed in a variable-width font, two single quotes can look very similar to one double quote, that last line is very easy to misread. Here it is in code mode:

  if [ "$(grep not_there /dev/null)" = '' ]

Does the second only work with bash? Because IIRC it didn't work with FreeBSD's /bin/sh, you needed the initial x too.

No, this should work in any POSIX-compliant shell.

#0: If you're writing scripts that are destined for other users, use POSIX sh instead.

Can you explain why?

For portability. bash is not available everywhere, but POSIX sh is a requirement of POSIX so you can expect it to be there.

Exactly: GNU bash is on most GNU/Linux systems, but:

* macOS uses an ancient version of bash

* Embedded systems use busybox sh

* Android uses its own version of sh

* None of the BSDs come with bash, since, y'know, GNU licensing

EDIT: Formatting


And to make things interesting, Ubuntu uses dash as sh.

https://wiki.ubuntu.com/DashAsBinSh


In fairness, that's just for sh; if for some reason you actually need bash you can just use `#!/usr/bin/env bash`

If you actually //need// bash, ask for it, not sh.

This comes from more amateur script authors not being precise enough with requirements, or not targeting the more common and portable spec.


`sh` is only supposed to be _a_ POSIX-compliant shell, it's not odd/interesting/confusing that Ubuntu uses dash any more than it is that something else uses bash.

The shebang line #!/bin/sh should only be used for POSIX-compliant scripts; if it needs {bash, dash, python, etc.} it should start #!/usr/bin/env {bash, dash, python, etc.}.


Not all systems have bash.

Not all systems have POSIX, perl, python, javascript, C++, rust, go, etc., so should we write all our programs and scripts in sh?

The relevant point is that all systems running bash should also be POSIX compliant. So, taking some care to avoid bash-only constructs in a shell script can reap substantial gains in portability at fairly minor cost.

Why should I care about all systems? If the targeted systems have bash 4.x and GNU tools, why should I write scripts in sh? If you care about all systems, ANSI C is a better choice, IMHO.

If you're targeting a limited set of systems and can reasonably assume the requirements won't change in the future, obviously use whatever tools you know will be available. I write Bash and Python all the time for those reasons.

But it's silly to think that there's nothing in between "complete control over target environment" and "use ANSI C so it can be compiled to any architecture and platform under the Sun." POSIX compliance will cover a lot. Try browsing /usr/bin on any Unix system and you'll see plenty of use cases. I just looked at /usr/bin on my laptop and saw that mysqld_safe is a Bourne shell script, for just one example.


Less magic (which you start to care about once you have to use other people's magic), easier to port stuff to zsh, etc.

Do you know of any learning resources that are strictly POSIX sh instead of a specific shell? I'm looking to learn and it will be very helpful.


No need to learn strict POSIX shell. If you learn bash from a good guide, it should explicitly flag bash-specific features. This way you can learn both at the same time.

On top of that you can use the `checkbashisms` tool to lint your script for bash-specific syntax.
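For example (myscript.sh being whatever script you want to check):

    checkbashisms myscript.sh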


Check out this presentation: https://youtu.be/olH-9b3VJfs

And the accompanying reference site: http://shellhaters.org/


Are you writing your scripts in sh instead of bash, perl, python, JavaScript?

I prefer sh over bash for most of my scripts, but I will sometimes use Python instead.

So you use python instead of sh, but you argue that we should use sh instead of bash, perl, or python. I'm not convinced.

IMHO, we should use the proper tool for the job instead of imposing artificial limitations. Bash is the proper tool in a lot of cases, unless an old proprietary OS or a very limited embedded OS is targeted.


>So you use python instead of sh

That's not what I said

>we should use proper tool for the job

This is the implication of what I said.

I do not think that bash is the proper tool in most cases.


So it boils down to your personal taste.

For most workloads where bash is suited, so is sh. For workloads where sh is insufficient, bash probably is too and you should just use a higher level language.

Chill. You don't have to hold bash so close to your heart.


I'm the author of the bash-modules project. It's my attempt to create a set of libraries for easier scripting in bash in strict mode. Most bash libraries are not designed for strict mode, so I created my own. If you prefer sh over bash so much, you can help me make it compatible with sh, or just fork it.

See https://github.com/vlisivka/bash-modules


Or option three: not care about your project because bash sucks and continue to write sh without it.

> old proprietary OS or very limited embedded OS

i.e. all the *BSDs? Or the many Linux distros that ship with only BusyBox, or FreeBSD/Linux type systems (AFAIK Debian and maybe Gentoo allowed FreeBSD userlands)?

POSIX sh is quite capable. I keep even my bashrc POSIX-compatible, in order to not have any problems when I want to run my config on some remote machine, or on some local BSD installation (and I did run FreeBSD for more than a year; it's quite a feasible option for a workstation IMO).


I see so many #!/bin/sh scripts with bashisms in them. These scripts are broken and will cause errors. /bin/sh is not always Bash even on Linux systems. It definitely is not on BSDs. There is no reason to use Bash for shell scripts when you can do the same thing portably.

Your example shows that the advice to "use /bin/sh" doesn't help but creates problems. If someone wants to write a portable script, then they must test it on a significant subset of target systems.

I disagree with this.

There's nothing wrong with writing bash scripts, with a shebang like `#!/usr/bin/env bash`, just like there's nothing wrong with writing python scripts (with `#!/usr/bin/env python`), or Haskell scripts (with `#!/usr/bin/env runhaskell`), or whatever other language you like/think is appropriate/etc.

It's true that bash is unavailable by default on (say) microcontrollers, but I don't see the relevance given that loads of common scripting languages aren't available by default on microcontrollers (Python, Ruby, JS, PHP, etc.).

PS: With this said, don't start your bash scripts with a `sh` path like `#!/bin/sh` or `#!/usr/bin/sh`. In fact, don't use a hard-coded path like `#!/bin/bash` or `#!/usr/bin/bash` either, always use `#!/usr/bin/env bash` unless you have a good reason not to. (Whilst `/usr/bin/env` is a hard-coded path, it can also be treated as a single special-case by systems which don't follow FHS, like NixOS, GuixSD and GoboLinux).


Bash is a special case where there exists a similar tool which is standardized in POSIX, and for most cases where sh is insufficient bash is probably insufficient too. Therefore POSIX sh is almost always the better choice.

Does fish-shell have an equivalent for '<()'?


Thanks a lot. Shells have so many features it's easy to miss one.

Watch out, there are some limitations to fish's psub preventing it from working like bash's >()

https://github.com/fish-shell/fish-shell/issues/1786


The sections on quoting and globbing suggest this author, though obviously trying to be helpful, isn't really knowledgeable enough to be writing such a guide.

I suggest reading this instead: http://www.grymoire.com/Unix/Sh.html

Granted, it's about the Bourne shell, but since Bash, Korn, and every other standard UNIX shell are supposed to be compatible with it, it's well worth learning.

And IMO, if you need more than what the Bourne shell provides, you should be using a proper programming language like Python instead.


...it'd be nice if what was actually happening was explained instead of just statements like "[ is the original form for tests, and then [[ was introduced, which is more flexible and intuitive"

The primary differences are:

* word splitting and pathname expansion don't happen in [[...]];

* "==" does pattern matching in [[...]], but it does string comparison in [...].
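A quick demonstration of both:

    $ f="two words"
    $ [ -n $f ] && echo yes            # word splitting: [ gets too many operands
    bash: [: too many arguments
    $ [[ -n $f ]] && echo yes          # no word splitting inside [[ ]]
    yes
    $ [[ $f == two* ]] && echo match   # == pattern-matches inside [[ ]]
    match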


I recommend reading the Conditional Expressions section of `man bash`. It's remarkably readable and useful for a man page. It also cleanly explains the difference between the two.

    rename -n 's/(.*)/new$1$2/' *
There's a chance this won't work on your system.

There are two incompatible versions of rename in the wild:

1) Perl one: https://metacpan.org/pod/distribution/File-Rename/rename.PL

2) from util linux: http://man7.org/linux/man-pages/man1/rename.1.html

Debian (and derivatives) ship the former; other Linux distros likely ship the latter.


I think this is not specific to the shell you are using, so it happens in bash, fish, ksh, zsh, and so on.

It’s misleading because rename is a command that accepts regex arguments. The shell isn’t doing anything more sophisticated with the quoted argument than passing it to rename as a positional parameter.

The shell of course does expand unquoted globs.


Can someone explain :h? The article's description didn't make it clear to me at all what was going on there.

Yeah, I couldn't make that work for me (on Linux or macOS)... although I'd love it if there were a way to quickly get 'just the directory' or 'just the filename' in bash with a shortcut, instead of having to resort to $(dirname blah) and so on... I'm sure there is some way, but :h doesn't look to be the shortcut as expected.

Assuming blah is in a var named name, you can use:

    dirname => "${name%/*}"
    basename => "${name##*/}"
Sadly you can't use it with !$ since it's not a variable. The closest you can do is:

    $ ls foo/bar/baz
    ls: foo/bar/baz: No such file or directory
    $ last=!$; echo "${last##*/}"
    baz
    $ echo "${last%/*}"
    foo/bar

    $ echo foo bar
    foo bar

    $ echo !$
    echo bar
    bar
The bang syntax is for history expansion, so it won't work on directories directly. But you could hack that with something like:

    $ls /root/my-dir
    [...]

    $echo !:t
    echo my-dir
    my-dir

Wild guess: perhaps OP confused bash syntax with vim syntax, where :h removes the last component?

http://vimdoc.sourceforge.net/htmldoc/cmdline.html#filename-...


It's !:h - not just a plain :h

    $ echo foo /bar
    foo /bar

    $ !:h
    echo foo
    foo
It's under `man bash` in the History Expansion section.

That's not quite what the author was talking about; your example does the same thing as !:0-

This is what the author meant:

  $ echo foo/bar
  foo/bar

  $ !:h
  echo foo
  foo

Yes, the article is flat out wrong there. Probably a typo or pasto though.

Instead of

    ls /long/path/to/some/file/or/other.txt:h
it should be

    ls !:$:h
which would produce

    other.txt

Hang out in #bash on Freenode IRC and you will be a Bash jedi. http://mywiki.wooledge.org/BashFAQ is the best resource IMO for quick Bash syntax lookups; I always need to refer to the BashFAQ to remember parameter expansion sub-string retrieval.

  parameter     result
  -----------   ------------------------------
  $name         polish.ostrich.racing.champion
  ${name#*.}           ostrich.racing.champion
  ${name##*.}                         champion
  ${name%%.*}   polish
  ${name%.*}    polish.ostrich.racing

Extending your example above, how would you get a result of racing.champion or polish.ostrich ?

You'd have to do it twice, using a temporary variable:

    $ name="polish.ostrich.racing.champion"
    $ temp="${name#*.}"
    $ echo "${temp#*.}"
    racing.champion
That's if you want it generic (i.e., splitting on "." characters). In one shot, you could make the pattern more specific:

    $ echo "${name#*ostrich.}"
    racing.champion
...or use something like sed or awk:

    $ awk 'BEGIN {FS=OFS="."} {print $(NF-1), $NF}' <<< "${name}"
    racing.champion

    ${name%.*.*}   polish.ostrich
    ${name#*.*.}                  racing.champion

They are needlessly rude and mean at #bash. A bunch of scumbags, actually.

I think it's a result of constantly dealing with people who ask for help, receive good advice, and then ignore it

If that annoys them, they can always quit. Being rude in this situation is either a choice or a lack of ability to cope with stress. Neither excuses being rude...

Complete opposite of my experience.

I understand "the hard way" is a commonly used phrase, but it does seem to infringe a bit on Zed Shaw's entire Learn Code the Hard Way series. Easily confusing. Other than that, good work.

Bash has a huge number of little shortcuts that are difficult to learn. When one encounters a sequence of symbols like $(...), it is difficult to Google for its meaning. The reason shells nevertheless have these shortcuts is of course because they are shells: from the commandline it can be very convenient to use shortcuts.

But, in my opinion, that's where it should stop: one shouldn't use a shell language for scripting. In scripts, it is simpler to use more verbose and clear constructs, because most editors are very powerful and provide shortcuts themselves.


Yeah, but why are you trying to use a search engine that cares less and less about exact matches, when there's a manual?

    >man bash
    /\$\(  # search pattern needs escaping
    ...
    value is evaluated as an arithmetic expression even if the $((...)) expansion is not used (see Arithmetic Expansion below).   Word  split-
    ...
    n      # go to next match
    Command Substitution
       Command substitution allows the output of a command to replace the command name.  There are two forms:

              $(command)
       or
              `command`
    (detailed description follows)
I'm still in favor of using more verbose and especially more clear constructs, but not because they are easier for Google, but because they ideally hold enough information on their own that you don't even need to look it up to know what it does.

Man pages are specifically reference documents, not tutorials or guidebooks. To say one should use a man page is to say that one must completely digest the entirety of the tool prior to ever actually using it. That’s just simply not feasible, nor should it be expected of anyone beyond trivial tools. Man pages simply don’t provide the context for solving a problem like a guidebook or tutorial would, which is why there are so many sites that start with a problem and then explain the tools.

Which is why the parent advised to treat the man page like a reference document, by searching in it. Some man pages are just badly written and are indigestible even when searching for a specific thing, but in general, that approach works quite often.

And even in the worst case, the man pages give you more context to use in your subsequent web search.

To use the original example, once you've identified that $(...) is Command Substitution, you have something that pulls up meaningful results in every search engine.


Is there some trick to searching man pages that I don’t know? Because my usual experience is:

  type man foo
  type /-p
  type n n n n n n n n n
as there are a bunch of matches like “...does bar when combined with -p...”

A presentation of man pages that used hypertext would make me a lot happier.


You might want to search with

    /   -p

Thanks, but I think this validates a demand for real hypertext.

While you’re waiting for the rest of the world to agree with you and then implement the true hypertext manuals, consider sharpening your regex saw.

In OpenBSD, we have true hypertext manuals today: https://news.ycombinator.com/item?id=16089300

    man man
    man less
Not a joke. Learn the simple tools that help you daily.

Did you have something in mind? I’ve read those man pages before, and I read them again today, and with the possible exception of tags in less (but I’m not sure about that), nothing seems relevant.

I'm sorry for sounding condescending! I did not understand your problem initially.

What helps me somewhat is the fact that definitions in man pages usually start on a new line and are indented by several spaces.

    man bash 
    / -o
This finds an inline mention, not very useful.

    man bash
    /^ +-o
This finds the definition: start of line, then some space, then the -o.

If foo has a Texinfo manual (GNU tools like bash usually do) then you can try `info foo` and search the index with i or I for -p. Texinfo manuals also have hyperlinks you can press enter on.

info is a greatly underused system and I'd recommend any *nix users to spend some time learning how to navigate it.


Thanks. Is there a good way to open that in a browser, rather than a console?

In general, the best way to read Info documentation is inside Emacs.

"If you are a bash newbie, you should read the bash manual. If you want proper search for the manual, you should use info. If you want proper use of info, you should use Emacs."

Kind of a deep rabbit hole, isn't it?


Not in this case; the bash manual is not in Info form, but a regular old-style Unix man page.

True. You can do just fine in general eschewing info wankery in favor of man pages. Stallman may disapprove, but I don’t lose sleep over it.

I don't know if you can locally, but they're often published online in HTML format, like at https://www.gnu.org/software/bash/manual/html_node/index.htm....

I get that they may be unfamiliar, but learning your system’s tools will pay off over the long run. Search facilities in less and info are generally much more powerful than in your browser.

In addition to the HTML info pages hosted by GNU[1], a variety of GUI texinfo readers are available, such as tkinfo[2].

[1] https://www.gnu.org/software/bash/manual/html_node/index.htm...

[2] http://math-www.uni-paderborn.de/~axel/tkinfo/


For bash: 'i' gives me "no indices found" and 'I' says "no index". I can do a '/' search, which finds some "-p" strings, but "n" doesn't work to find the next.

From some other comments, it sounds like "info" uses emacs at its core? So I suppose I'd have to learn some emacs commands, if that's the case.


info works without emacs, but some people prefer emacs' info viewer to the console one. The console viewer does have some idiosyncratic keybindings - for example, "n" means "next node at same level", and "}" means "search for next occurrence". "H" will give you a quick overview.

I'm surprised indexes aren't working for you - unfortunately I don't know of any suggestions to fix that.


Thank you for the comment! I've always been aware of the info pages, but it's honestly been years since I've tried to use them.

This thread is a good reminder to give it another go.


You can also do keyword searches across all man pages like:

    man -k <keyword>

> as there are a bunch of matches like "...does bar when combined with -p..."

It has been my experience that the definition of switches, unlike their use in examples or other text, occurs as the first thing on the line, so my search expression would be:

    /^ *-p 
(you likely can't see it, but there is a trailing space, too, to ensure it's just that flag, and not "-parallel" or whatever)

It's possible the text will be tab-indented, in which case:

    /^[ ^I]*-p[ ^I]
(most pagers will accept just pressing the tab key in the search string, and it may show up as ^I or the literal tab character)

You can use apropos to search names and descriptions inside man pages. For example:

    $ apropos timezone
    Date::Manip::DM5abbrevs (3pm) - A list of all timezone abbreviations
    dm_zdump (1p)        - timezone dumper
    Time::Zone (3pm)     - - miscellaneous timezone manipulations routines
    timezone (3)         - initialize time conversion information
    tzfile (5)           - timezone information
    tzselect (1)         - view timezones
    tzselect (8)         - select a timezone
    zdump (8)            - timezone dumper
    zic (8)              - timezone compiler
Can even search using regular expressions.

Usually options are typeset similar to

    -p

    Potrzebie mode ...
So search for it, e.g.,

    /^ *-p
The caret regex anchor matches at the beginning of line, and the Kleene star matches zero or more of the previous pattern. In English, the above pattern reads “match -p only when it occurs as the first nonblank characters on the line.”

The surrounding context may be different, so adapt your search pattern accordingly.


BSD man pages (and many Linux man pages, though a minority) are written in semantic “mdoc” macros, rather than the classic “man” macros that are strictly presentational.

If you’re using mandoc (http://mandoc.bsd.lv/) as your man(1) program—the default on OpenBSD and a couple of Linuxes like Void and Alpine—it will use these semantics to generate hyperlinks in the terminal using more(1) and less(1)’s ctags support.

So on my machine, your example becomes:

    type man foo
    type :t
    type p
This brings me to the first instance of a command-line flag named “-p” or environment variable named “p” in an itemized list.

It translates to HTML too—check out the links generated by the web viewer, which uses the same backend. https://man.openbsd.org/ls.1


That’s awesome! I think a table of contents could be nice addition for the HTML version (at least for pages that are longer than less).

I generally find things easier to learn with reference documents than tutorials unless I'm unfamiliar with the problem domain and have no mental model to map things to. That's very rare these days, so reference documents are the way to go.

Amusingly, reference documents are themselves increasingly rare, and what you usually get is a rough tutorial and a set of examples that cover perhaps 15% of the feature set, and you need to dive into the source to figure everything else out.


Look up the name of the concept in the manual and search for that. Compare Google or Stack Overflow searches for <() versus “process substitution.”

It is a reference, and searching for the pattern GP mentioned finds you references to arithmetic expression evaluation and to command substitution. Those are the terms one can Google for if further explanation is required.

This is why the less program has the / command — so you can search the man page for the syntax you are curious about.

It's not just from the command line that these shortcuts are useful. Once you have a decent baseline on shell scripting fundamentals, the constructs are often highly optimized towards efficient and readable scripts.

Not having to put every command in quotes, for example, is a huge advantage all by itself, especially if any argument lists have quoted arguments.

The $(...) construct is pretty fundamental to bash scripting and should be covered in any decent tutorial.
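For instance:

    # command substitution: capture a command's output into a variable
    today="$(date +%F)"
    echo "backup-${today}.tar.gz"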


explainshell.com is your friend here.

+1 for the parts that are portable to Bourne shell/ksh/zsh.

-1000 for the parts that are specific to Bash. Stuff like that has been huge pain in my ass over the years. Some clever programmer uses some bash-ism and the build breaks on some ancient hardware that doesn't have bash.

I realize my complaint sounds like Henry Spencer's Ten Commandments and perhaps it feels outdated but trust me, you don't want to wade into thousands of lines of shell to track down why something doesn't work on the stupid AIX box.


The article doesn't mention which are portable and which aren't.

It's a cultural thing. GNU is intended to replace Unix, so interoperability is (at best) not a priority. Same as Microsoft's “embrace, extend, extinguish”, which shows how important a good slogan is if you want your ambitions recognized.

A POSIX (or older Bourne-descended) shell can be distinguished from ‘bash --posix’ in a script with a fragment like “date&>F”, because Bash authors either didn't realize or didn't care that the sequence ‘&>’ already had meaning.


I wanted to argue that POSIX didn't define this behavior. But, reading the spec: I agree with you, Bash is non-compliant there.

According to POSIX (2016 edition)

    date&>F
should be equivalent to

    date &
    (exec >F)
where in Bash, it's equivalent to

    date >F 2>&1

>Stuff like that has been huge pain in my ass over the years. Some clever programmer uses some bash-ism and the build breaks on some ancient hardware that doesn't have bash.

Shouldn't the problem be the "ancient hardware that doesn't have bash" itself?


A brand new MacBook Pro will not have bash 4.

The machine may not be ancient but I would still argue that the problem there is the ~10 year old version of Bash that Apple has decided to ship rather than the programmers that use features added to the shell within the last 10 years.

(I should add that I don't know what version High Sierra ships with but Sierra seemed to ship with 3.2.5x-ish which was 9 years old at the time.)


Well, put it this way:

You can assume everyone has a modern bash, and make it the end user's problem if they don't, or you can write portable shell scripts and know they will work.

Honestly the things you can't do in posix shell compared to bash border on "use a fully featured language" anyway.


Thing is that Bash is something you can assume to be reasonably widely available[0] — like Perl or Python — but I wouldn't expect to have to avoid any features from the last 10 years of either of those two. Sure, a certain grace period is to be expected but I think 10 years is way past that.

[0]: I know POSIX is supposed to be even more widely available but depending on what you're targeting then it may not be the best option [source: https://en.wikipedia.org/wiki/POSIX#POSIX-oriented_operating...].


AFAIK neither perl nor python is GPL3-only licensed.

That's the blocker on macOS.


How do you write a portable shell script? The programs invoked by your shell script need to behave the same everywhere. Even fundamental things like cp, rm, etc. don’t universally behave the same across the various Unix and Unix-like systems.

The joke is that your shell is actually more portable than your shell script :)


Most of the shell utilities described by POSIX have standard flags, and then GNU/BSD extra flags.

If you use the standard ones (and use the "posix mode" flags when available) you're mostly ok.

Also, a shell script can have logic to handle different tools available (either different flavours of the same tool or even different tools that do similar things).
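For example, a rough sketch of that kind of branching (here telling GNU date apart from BSD date):

    # pick flags depending on which flavour of the tool is installed
    if date --version >/dev/null 2>&1; then
        yesterday="$(date -d yesterday +%F)"   # GNU date
    else
        yesterday="$(date -v -1d +%F)"         # BSD date
    fi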

If the basic syntax it uses (or the shebang) is bash-specific, then you need bash to run it.


And it takes all of 2 minutes to add it as your default shell (including installing brew itself).

  22:06  ~  $ bash --version
  GNU bash, version 4.4.12(1)-release (x86_64-apple-
  darwin16.3.0)
Much easier than constraining oneself about what to put in one's script (assuming one is indeed targeting Linux, OS X etc released in the last 10+ years and not some embedded etc platforms).

If your project says it requires something from brew, that's a blocker for a number of people.

A mythical kind of project that depends on macOS having a recent bash?

If your project depends on a bash script and that script depends on bash 4, it won't run out of the box on a brand new macOS machine.

A POSIX-compatible shell script has no such limitation. That's all I'm saying. I'm not arguing the merits of whether bash is better or what a project might need.


I wrote a book on Bash too. The most important thing for anyone to know about Bash is that it's intended as a command language, not a general purpose scripting language. If it's longer than 10 lines, or if it uses two or more variables, you should probably have written it in something other than Bash.

Amen to that. Now if only I could have gotten my Systems Programming professor to feel the same way... :)

Would you consider non-trivial install scripts as an exception to this general rule? I mean, you wouldn't write some install script in ruby or python, right?

I don't know why not?

At the least, rather than Bash, you might consider Perl as a default, lowest common denominator for scripts that need to run anywhere.

- It's nearly as ubiquitous as bash.

- It has approximately the same kinds of file/path operations built in.

- It has reasonably good support for strings/regexes/etc. all built-in, so you don't have to call out to tools like sed/awk/grep all the time and hope that they are available and compatible across your target platforms.

- It provides reasonably good arrays and hashes, which are horribly horrible in bash.[1]

- You can use syscalls very easily if you really need to, but usually you don't.

[1] Of course, no language can save you from the file system disaster (https://www.dwheeler.com/essays/fixing-unix-linux-filenames....), but being able to know that "foo bar" is a string instead of two array elements is a good start.

Mostly this all applies to Ruby or Python too, modulo perhaps the degree of ubiquity.


Great point about ubiquity.

Perl regexes are the best of breed that everyone else replicates — far better than “reasonably good.”

The Perl erasure in this HN thread is startling.


> "The Perl erasure in this HN thread is startling."

"Erasure" to me implies some active effort to remove Perl from discourse. I don't see anything like that here: indeed, there are a number of positive mentions, and no negative ones I see. Granted, Python and Ruby are both mentioned more often, but none of those is at Perl's expense. Am I misunderstanding what you mean by 'erasure'?


This is what Perl is designed to be. Perl unlike the others is almost certain to be on any Unix or Linux installation. Several commenters leaving out Perl in discussions of the next step up from bash scripts is truly strange.

I suppose being ignored beats the typical herp-derp anti-Perl bigotry, but I’d prefer all-around civility.


> "Several commenters leaving out Perl in discussions of the next step up from bash scripts is truly odd."

I'm having a hard time following you here. Do you think that they're doing so for any other reason than that Perl is no longer their go-to tool? There are communities where Perl is still used: PostgreSQL for example uses Perl for some of its scripting, as well as its build farm tool, in particular because of its portability on older systems.

That said, from what I've seen over the past 10 years or so, Perl hasn't had much of a presence in areas where a lot of computer work in tech is being done. For example, in cloud computing, or scientific computing, or machine learning, or web frameworks. Please don't read this to mean that Perl couldn't be or isn't being used in these cases or wouldn't be a better fit. (As an aside, I think Perl missed out a lot while a large portion of the community was focused on Perl 6: there's only so much energy in a community, and that absorbed on Perl 6 wasn't focusing on evangelism. But that's not something I'm interested in litigating here.) Or that there isn't something a bit frustrating in seeing the wheel reinvented time and time again. And so many examples on the web use bash as a common denominator. This puts Perl further out of mind if it's not already part of your everyday workflow. And how many developers today have come of age without seeing Perl in their everyday environments?

Consider the current forum. What's the percentage of front-page posts that are about Perl or tools where Perl is a part of the tool chain? It would be understandable for the people who frequent HN to not view Perl as their go-to. I don't consider it uncivil for people to neglect to mention some other language when it's not something they'd actually think of reaching for. It seems the solution would be to share examples of where Perl provides advantages, both in the comments here and in submissions to HN.


Well said. Perl might theoretically be the best match in this specific problem domain, but the thing is there are only so many programming languages one can learn.

If I had to choose only one of ruby/python or perl, I would choose the former, and it would be able to cover my bases both as glue code and for more substantial programs. Perl would maybe make the glue code a bit easier, but instead I would be much less employable and have a much harder time finding other people who can read the glue. I'm not qualified to have an opinion on Perl's capabilities for other programs, but I'm sure there are valid reasons most people prefer other alternatives.


Perl erasure?

After using the Linux command line (or its many «relatives» like cygwin, macOS, the unixes) for more than 10 years now, I've talked to exactly one person who used Perl to accomplish anything at all. He used it to edit text files, so he could have done the same in awk/sed/vim in my opinion.

I know more people who write Fortran 77 than Perl.

People just don’t seem to use (or like) Perl very much.


Yes, you would write non-trivial install scripts in ruby.

https://en.m.wikipedia.org/wiki/Chef_(software)


If any of my bash scripts get long enough to where I want a reusable class, I rewrite in Ruby.

Bash has too many surprising edge cases. A lot of my install scripts have at least snippets of Perl in them. The installation/upgrade/maintenance scripts for my employer's main product are Ruby-based.

We support a lot of different OSes, and there's usually less variance between the Perl deployed on them than there is in the shell.


I would not consider those an exception, and would typically prefer to see the use of some other language. Bash is excellent at dealing with semi-structured text and as part of a command pipeline, and if you have no alternative other than to write POSIX sh, well, it exists. Bash does not have niceties like typed variables or named function arguments, and arrays are best avoided. Even parsing command line arguments is more fun in other languages.

The problem is that Bash has about the lowest barrier to entry which can be found: just dump the things you were going to type anyway into a text file and mark it executable. It's simple, and then you bloody your nose on one of Bash's many idiosyncrasies, and people will say, "Oh yes, ']' is just a required last argument to '['. You gotta watch for that." And the true Bash master knows that my rule is silly and that anything may be written in Bash.

Just...please don't.


I say please don't write any goddamned software at all.

Isn't that what chef and Capistrano are at some level?

I disagree. Here's how I decide:

Do I need to manipulate rich data structures like hash-maps or nested lists? That sort of thing tends to stretch the capabilities of Bash to its limits and I tend to set the bar fairly low here.

Is the program oriented around commands? If I'm gluing executable scripts and binaries, using bash is often superior to a scripting language. Argument passing is more natural and convenient and the built-in support for the standard i/o streams makes it easy for the different commands to pass data between them. The number of variables or lines of logic is usually less important than the number of different installed commands I need to combine. (Or the number of variations on a single command)

Do I need to modify the environment in a significant way? In a bash script it's trivial to source in an environment script, which can be done conditionally or even interactively. This is usually more tedious to do in scripting languages.
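For instance, a tiny sketch of that last point (env.sh and build-tool are made-up names):

    # pull extra variables into the current shell only if the env file exists
    [ -f ./env.sh ] && . ./env.sh
    build-tool --target "${TARGET:-default}"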


Also: do I need to run a producer & consumer (and maybe some intermediary filters) on a stream of data, and want to run them in parallel? Shell pipelines are trivially easy to set up & test.
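E.g. something like this runs all three stages concurrently (the command names are placeholders):

    set -o pipefail
    generate-events | grep -v '^DEBUG' | consume-events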

As someone who has to occasionally modify 100+ line bash scripts written by Coworkers from Christmas Past which matched your spec in terms of what they had to do, please please just use Python (or similar).

Yes, you will have a few extra lines but it will be vastly more readable and maintainable.

And yes, I know I will get the standard "the person who wrote the script did a bad job" response, but at some point it should be okay to blame the tools instead of the workman if workmen disproportionately create worse results with a set of tools.


As someone who also has to semi-frequently modify 100+ line bash scripts written by others, I'd suggest every serious bash scripter read the bash man page. It is much smaller than any book on Python.

For scripts written in Python, I'd use a similar argument and suggest every serious Python scripter learn Python. As for Perl, or Ruby, or Julia, or anything really. It's just that learning bash from its man page is, IMO, much easier than learning any of those languages to the same degree.

Of course (and it goes without saying) some things are just not suited to bash, and in those cases a suitable language/framework must be used, and learned if not already known. As far as process calling, environment management, or stdio stream management are concerned, bash is better suited than (all the other languages I've tried) Python, Ruby, or Go.


The problem is most people don't want to consider themselves "serious bash scripters" but still think they can write bash, which always results in unstable and vulnerable scripts. Python on the other hand can be written by unserious python scripters and more often be at least accidentally correct. Another big issue is that the quoting, escaping and expansion rules in bash can be daunting even for serious bash scripters, with silent errors or completely different behaviour just because you forgot a : before (or was it after?) the $-

As a test, run shellcheck on any random shell-script written by these Coworkers from Christmas Past (or your own past) and it will spew serious warnings on almost every single line, run an equivalent analyzer on an equivalently unserious python file and you might in bad cases get 2 or 3 minor warnings per 100 lines.

The comparison is a bit unfair because just by getting these compiler errors and exceptions from a real language you force yourself into a more serious mental mode of programming instead of happy scripting, but that is just another argument in favor of not using bash, IMO.


I agree with you, mostly, and let me point out where I don't.

1. People who don't want to consider themselves "serious bash scripters" shouldn't be writing non-trivial bash scripts unless they're okay with them turning out buggy. I agree this is a personal standards thing, and reality is often less simple and more lenient than that.

2. The "compiler errors and exceptions" that you speak of in regards to Python also have equivalents in Bash. Agreed, they're still optional and non-"serious bash scripters" don't often know of them. Which is why I make my coworkers use them when I review their code.

3. There are classes of bugs that would happen in Python (and Go, from my experience) that wouldn't happen in bash, simply because they are in areas where bash shines. At work, I've seen process management and stdio management bugs — some of which have bit us in the field — simply because the non-bash language (Go) has a weird affinity to its child processes, or it (both Python and Go) defaults to not wiring up the stdio of child processes. In the latter case, the proper thing for the developer to do was the same as with bash: read the documentation. Most of our process management code is now in bash (because it's simple and safe) and systemd (because it's thorough and absolute).


In my experience, those kinds of shell scripts get written by people who needed to automate something quickly and didn't have (or at least perceive) any other language available besides maybe Perl. They're either Unix admins without much programming experience or they're part-time programmers who primarily work in some other language and just don't have the familiarity with Python or Ruby needed to write a good glue script.

But they know how to accomplish this task interactively in the shell, so scripting what they're already doing (or already know how to do) seems like the natural next step. So you wind up with an imperative shell script that's basically a long, flat sequence of commands with some logic and variables sprinkled in haphazardly as they realized they needed it.

Due to the organic way these scripts often emerge, it's not like advocating Python is an easy sell. By the time they think to consider alternatives, the shell version already exists.


I agree. And I agree with your bash 'whitelist'. I'd also add a hard blacklist for any serious string manipulation. A series of awks and seds looks clever but is quite annoying to deal with.

If it's just a bunch of cuts or tr, sure.


And it's SO MUCH MORE expensive to fork all those processes. Just use a real language, please.

"A series of awks and seds look clever but they are annoying to deal with."

Would the following be annoying for you to deal with?

   #!/bin/sh
   sed 's/#.*//' \
   | sed 's/:/#/g' \
   | cat AMD64 - \
   | ./qhasm-ops \
   | ./qhasm-regs \
   | ./qhasm-fp \
   | ./qhasm-as \
   | sed 's/%32/d/g' \
   | sed 's/%raxd/%eax/g' \
   | sed 's/%rbxd/%ebx/g' \
   | sed 's/%rcxd/%ecx/g' \
   | sed 's/%rdxd/%edx/g' \
   | sed 's/%rsid/%esi/g' \
   | sed 's/%rdid/%edi/g' \
   | sed 's/%rbpd/%ebp/g'
where qhasm-as and qhasm-fp are each awk scripts (222 and 427 lines, respectively).

source: http://cr.yp.to/qhasm/qhasm-20061116.tar.gz qhasm-20061116/qhasm-amd64


I'd argue that it's much easier to write bad Bash than bad Python.

Assuming that "bad" doesn't mean merely "ugly to glance at." I find that depends largely on the problem at hand.

I think this might help others, but ShellCheck[0] is a good place to start to help eliminate poor shell scripting.

And I would argue, though, that even large shell scripts in bash have their place.

I often write scripts in either Node or Python, but only when I need things bash is bad about (any sort of proper data structure beyond strings or arrays).

But there are just so many things bash makes insanely easy, especially with operating on files and directories.

And functions that are used as completions or need access to aliases or functions in the current process are also better in bash.

I wish there were a scripting language like bash, but enhanced with at least some hash maps and proper array manipulation, and maybe some formal IPC to allow scripts to request info from the parent process.



Whoops! Thanks for that

> As someone who has to occasionally modify 100+ line bash scripts written by Coworkers from Christmas Past which matched your spec in terms of what they had to do, please please just use Python (or similar).

As someone who has inherited thousand-line shell scripts, and had to debug many 3rd party scripts, I stand by my assertion.

> Yes, you will have a few extra lines but it will be vastly more readable and maintainable.

Readability is important but it's not the only aspect of maintainability, nor is maintainability the sole concern of a tool. A low bug rate helps maintainability, and actually having the features you need, in an acceptable timeframe, is also important.

For example, the OP mentioned the 'set -e' option that causes the script to exit if any command returns a non-zero exit code. In Python, you'd either have to remember to check the return code for every subprocess or define a wrapper, which adds complexity, reducing readability and can lead to bugs and errors. Nor is Python always the best answer for readability anyway. In many cases, it's not like it's just a few lines you're saving. Here are some functions I've used when scripting in Python

    import subprocess, shlex
    def process_run(cmd_string, stdin=None):
        return subprocess.Popen(shlex.split(cmd_string),
                                stdin=stdin,
                                stdout=subprocess.PIPE,
                                stderr=subprocess.PIPE)
    
    def process_results(process_object):
        (stdout, stderr)=process_object.communicate()
        return (process_object.returncode, stdout, stderr)
    
    def process(cmd_string, stdin=None):
        return process_results(process_run(cmd_string, stdin=stdin))
It's 10 lines of boilerplate to set up an approximation of behavior that is trivial to achieve in any shell language. There are actually 7 more functions I use to handle different common subprocess execution patterns. For example, the "stdin" in that process_run function needs to be a filehandle (at least in Python 2.7, I'm not sure about python 3). To pass a string to standard input you'll need something like this:

    from tempfile import SpooledTemporaryFile
    f=SpooledTemporaryFile()
    f.write(stdin_string)
    f.seek(0)
    results=process(cmd_string, stdin=f)
    f.close()
    return results
> And yes, I know I will get the standard the person who wrote the script did a bad job but at some point it should be okay to blame the tools instead of the workman if workmen disproportionately create worse results with a set of tools.

Actually what I'd say first is that it's quite possible the person writing the script knew what they were doing. I've inherited bad code in my life, I've inherited some real gems, and I've inherited a lot of code in between. One thing I've learned is that I tend to be unfairly critical of average code. It's hard to read unfamiliar code and easy to criticize inconvenient design choices when you have to adapt their code to some new problem that they never anticipated. Usually I'll be better off just buckling down and untangling the spaghetti.


subprocess.run does exactly what your wrapper does; subprocess.check_output returns stdout only and automatically raises an exception on a non-zero return code, and is the function you should be using 99% of the time. Both functions accept a string for stdin via the input= parameter.

Subprocess.run is Python 3+ only. If we're talking about replacing bash, Python 2.7 (possibly with 2.6 compatibility) is the more reasonable target. CentOS 7 and Debian 8 (I've not used 9 yet) still ship with Python 2.7.

Also, who is to say what I should be using "99% of the time?" Each problem has different constraints and different priorities.


Please have a look at https://pythonclock.org/ and stop riding dead horses.

The domain under discussion is scripting and specifically comparisons with Bash. Python 3 has not achieved anywhere close to the platform deployment that Python 2.7 has. When the common OS distributions you're likely to need to script on ship with Python 3 as the default rather than python 2.7, we can start using Python 3 in random comparisons with shell scripts. Until then, Python 2.7 is the language for comparison no matter what rhetoric you want to employ.

Most distros ship with python3 (though `python` will refer to python2).

Is there a reason you cannot just say `#!/usr/bin/env python3` in your scripts? I don't see why you require python3 to be the default, am I missing something here?


The first OS I've used that includes Python3 in the base install is Debian 8(Jessie) and I no longer use Debian in production. CentOS 7 does not include Python3 in the base install. You can install it, sure, but why bother when you can just use the python that is already there? Or better yet, /bin/bash...

Again, in the context of this discussion, the whole argument is yet another point in shell's favor. There's no major backwards-incompatible change in the language. With Bash, you just decide whether POSIX compliance is something you need, and that's basically it. Both versions are still supported and no one interrupts discussions to announce that beatings will continue until morale improves whenever the deprecated version of Python comes up.


If you are using ten-year-old software and can't install a package, that's your problem. Quit acting like it is the default situation.

Or you could put the wrapper functions in a module and call it a day, either way problem solved.


He's not acting like it's the default situation. For the distributions he mentioned, it is the default situation.

The problem is that it is difficult to anticipate; it may look like a simple 'glue the commands together' task, but it may turn out to be trickier.

My favorite advice was from a search giant's dev infra engineer who said that "any Python script over 100 lines should be rewritten in Bash, because at least that way you're not kidding yourself into thinking it's production quality"

Great arguments. Add one: bash is a more lightweight dependency, frequently available even on Windows, weird *nix versions and minimalistic environments (e.g. busybox linux, fresh arch install, ...).

[shameless]

> Do I need to manipulate rich data structures like hash-maps or nested lists? That sort of thing tends to stretch the capabilities of Bash

> If I'm gluing executable scripts and binaries, using bash is often superior to a scripting language

The two points above resonate with my view that there is a missing piece. On one hand we have bash, which is optimized for being the glue (second point). On the other hand we have Ruby, Python, Perl, Go, etc., which are good for the first point. What I think is missing is a newer, more powerful shell, which supports both use cases and more. I'm working on it:

https://github.com/ilyash/ngs

Please note that I'm not the only one that thinks there is room for more powerful shells. See the readme for links to other projects.


I disagree too. Bash has many downsides, but there are very few languages out there which have the ability to 'connect' different programs so easily.

Bash scripts are slow as hell (as most commands have to spawn new processes), it is hard to write "secure" code (if even possible), handling whitespaces can be a pain in the * and the amount of repetition is awful. If your kid has done something wrong, just tell it to write a bash script: it is the equivalent of writing a hundred times:

  x="$(...)"
Nevertheless, I enjoy writing bash scripts. Many times it starts with a simple curl command and by the end of the day you have a new OS installer. Granted, there are tasks which other languages can do better, but that's the real power of bash: it doesn't care. Then off you go to write your super complicated algorithm in Rust, Go, Python, R or whatever and just call the other program; that's what Bash is good at.

It is the glue which keeps everything working together.

Disclaimer: Please don't build a complete cathedral out of glue.


"Bash is the glue which keeps everything working together. Please don't build a complete cathedral out of glue."

--JepZ


The reason it can connect programs so "easily" is because it basically ignores errors and robustness - it really relies on there being a user looking at the output and going "hmm that looked like it failed".

Doing things properly in Python or Go may take a few more lines (not much more really) but it is 100 times more robust, and you need that if you are writing anything more than a 10-line one-off hack.


No, it doesn't. Unix error codes are extremely well understood. Tell the Git maintainers that their program ignores errors and robustness—they'll be surprised.

Even when well understood, that doesn't mean they're easy to work with. If you're lucky, the app you're calling only has two states: success+result, or failure+error message. But working with text commands, one day you'll get a "skipped file Xyz" somewhere in the output, because it's neither an error nor a success. If you're very lucky, you'll get it in stderr, otherwise it will be mixed with the output. If you're not lucky, the command will print out the error and exit with 0 anyway. What crazy app would do that? For example, standard initctl on Ubuntu: https://bugs.launchpad.net/ubuntu/+source/upstart/+bug/55278...

Exit codes are a poor substitute for proper error handling with verbose error reports. They do the job most of the time, as long as you remember exactly which command behaves which way. And that's a clear path to mistakes :-(


I'd rather use Perl for these use cases. It's almost as concise as bash for subprocess management, it's as ubiquitous as bash, and it doesn't have a Python-like 2 vs 3 version issue.

I think the 10 line limit seems a little harsh. Bash is great to prototype something, especially because of all the great commands at your disposal versus writing it yourself. e.g. grep, sed, cut are all commands I use frequently. Once you've got a working script and it proves useful or needs to scale - that's a good time to go the 'real language' route like Python.

> I think the 10 line limit seems a little harsh.

I think it's a fine guide. If I'm writing something that starts approaching a program rather than a few lines of utility-throw-away, it's time to at least immediately start co-developing in something more sane. My exception is for Makefiles, if one considers them "shell".


How do you feel about the use of bash for AWS userdata and the like?

You pretty much have to put all the init stuff in a bash script. I can't think of any real-world examples of userdata scripts being less than 10 lines.

The alternative of course would be a three line script that downloads a file and executes it. But what language would that file be in? Probably bash. And it would make the infrastructure-as-code tracking and deployment process much more complicated.


Not OP, but ...

You can use Go/Rust for static linked bootstrap instead of Bash. (See rustup.)

I'm thinking about combining GitLab (private token access + repository files: https://docs.gitlab.com/ce/api/repository_files.html#reposit... ) and a bootstrap script in Bash, that then launches something better. Plus self registration back into a GitLab repo (hey, it has the access token already, so it can push).

Yes, Ansible/SaltStack/Chef/Puppet reinvented, but you're now not bound to the horrible database, language, bootstrap, speed (slowness) and workflow of any of them.

If you want more access control, then creating a HTTP microservice that has the real private token (or proper OAuth2 access) to GitLab plus handles the self-registration and token handout/revocation is easy compared to the other parts. (And it can also store everything in GitLab.)


I believe in your userdata script you can write a line to download a Python script from S3, then execute it. Don't quote me on this since I don't use userdata.
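
Roughly something like this (the bucket and key are made up, and it assumes the AMI has the AWS CLI plus an instance role that can read the object):

    #!/bin/bash
    # hypothetical userdata: fetch the real bootstrap logic from S3, then run it
    aws s3 cp s3://example-bucket/bootstrap.py /tmp/bootstrap.py
    python /tmp/bootstrap.py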

However, note that userdata will only run once at launch time. If you ever need to re-run userdata, you have to stop the instance, remove the userdata from settings, and then add again. YMMV.

I do the Python way when launching a beanstalk instance (basically cloud init).


I'd make the limit a lot more than 10 lines.

I certainly prefer Perl for large scripts (Python is good too but I happen to know Perl better), but I often have to write scripts for targets that have bash but don't have Perl or Python. Since I've been forced to use bash, I've found it to be a better scripting language than I expected it to be.


10 lines is a rule which is meant to be broken. Rules exist to guide the novice, and to inform the expert. Bash is an extremely useful tool, but not one which should be employed carelessly or casually.

If you want all the args to the previous (or earlier) command just use !. (e.g. a common idiom for me is cat `!`). Or you tried a git mv out of habit but the dir isn’t being managed by git: !gi:

Oops, that's the asterisk (star character) that HN interpreted as italics. In Hacker News formatting my two examples are

  !* and !gi:*

Damn it this boils down to RTFM, specifically the bash man page.

The real 'Learn bash the hard way' is to read the man page top to bottom every 6 months. Ye gods, I've read it so many times…

Also: lol at HN downvoting advice to read a tool's docs. Bash is terrible for a lot of reasons, but not because of its lack of informative documentation.


From the HN guidelines:

Please don't comment about the voting on comments. It never does any good, and it makes boring reading.


I've read the guidelines; you can just downvote instead.

Yes, the bash man page is a treasure trove. Reading the article I kept thinking, "this is in the man page", over and over.

There seem to be things like "Testing" and "RTFM" that many people (including me) resist until they actually try it and see for themselves. I can remember the feeling of revelation when I finally learned to try "$ man foo" on everything...


Two important things missing:

1) This is a huge pet peeve of mine, but it kills me when my coworkers "bash" emacs and sing the praises of vim, then proceed to explain to me that ctrl-R in bash searches for the last command with a given pattern. They also often refuse to believe me that they're basically using emacs controls (because, you know, emacs is dirty). So, please, if you're in love with vim and use ksh or bash, learn about "set -o vi". Oh, and stop preaching!

2) "help xxx" for any xxx bash functionality, right there from the command line!


> they're basically using emacs controls

But not well implemented! You're supposed to be able to edit your search string, and I've never found a way.


Huh? Backspace works the same for me in Bash C-r as it does in Emacs C-r.

It never does anywhere for me. Good to know it's supposed to, though; maybe I've got something mapped weird, or the versions of the shell I'm using are old enough to be unwelcoming in this way, or I don't know what, but knowing it's not by design means there's a fix to be found for it. Thanks!

You should try hstr (https://github.com/dvorka/hstr). It replaces CTRL-R with a full page interactive history search that really works.

Demo GIF here: https://unix.stackexchange.com/a/375914


If you use either vim or emacs, there's something to be said about using that knowledge for your command line history.

hstr has a vi mode.

shellcheck (https://www.shellcheck.net) is an absolute godsend when writing Bash/POSIX sh scripts.

It catches so many errors that I think it's a must have in every programmer's toolbox. It even catches bash-isms when you are targeting POSIX sh. It saved me many many hours of grief trying to debug shell scripts I wrote and changed the way I write them for the better.


This is also my number one thing I wish I'd known about Bash. It saves on so many trivial bugs.

The documentation is especially great: for every problem it detects you get a unique reference which you can look up on the wiki, e.g. https://github.com/koalaman/shellcheck/wiki/SC2086 It then not only describes the problem but also shows different ways of solving it, with some great examples and reasoning. I think I learned more Bash from Shellcheck than anywhere else.


Targeting POSIX as much as possible is really important if you don't want to force Bash on people, especially with open-source public code. Many OSes don't have Bash in the default install, but many just assume that bash is available on all the target systems.

like which?

I'm hard pressed to think of a modern unix that doesn't include bash by default. Solaris maybe?


everything embedded - OpenWRT/LEDE uses busybox with ash; Android uses mksh (I think?), which only supports a subset of bash features. Every Debian/Ubuntu has dash as /bin/sh, which only supports POSIX.

I'd argue that writing scripts for an embedded target and for a regular server/workstation target are fundamentally different problems; almost anything I'd write for those platforms would be targeted for them, not for general-purpose unix.

Portability is only desirable if you need portability; otherwise it frequently adds complexity for little return benefit.


All the current BSD systems, and some GNU/Linux distributions. I'd guess that illumos does not have bash installed by default, too.

shellcheck is amazing. I write enough shell scripts that I remember most of the syntax and tricks, but am far from an expert. shellcheck has taught me so many things and makes me feel more safe writing scripts in the event I forget one of the million gotchas.

Using readline is a great thing to know about too.

My favourite little-known readline command is operate-and-get-next:

https://www.gnu.org/software/bash/manual/html_node/Miscellan...

You can use it to search back in history with C-r and then execute that command with C-o and keep pressing C-o to execute the commands that followed that one in history. Very helpful for executing a whole block of history.

For some reason, this documentation is hard to find! It's not here, for example:

http://readline.kablamo.org/emacs.html

I'm a bit saddened when readline replacements don't implement C-o. For example, the Python REPLs don't have it.


I've overridden ctrl-r in my local Bash to search with fzf[0] and I'm using my history so much more now.
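
For anyone curious, the override can be as small as this (a sketch, not fzf's official setup; the helper name is made up, the key-bindings.bash file that ships with fzf does this more robustly, and bind -x with READLINE_LINE needs bash 4+):

    __fzf_history_search() {
      local selected
      # strip the history numbers, pick a line with fzf (newest first)
      selected=$(history | sed 's/^ *[0-9]* *//' | fzf --tac) || return
      READLINE_LINE=$selected
      READLINE_POINT=${#READLINE_LINE}
    }
    bind -x '"\C-r": __fzf_history_search'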

Didn't know about ctrl-o though, it sounds great! I hope that my ctrl-r override doesn't somehow break it.

[0]: https://github.com/junegunn/fzf

E: Fixed link.


That link is a 404 for me?

It had a trailing >, https://github.com/junegunn/fzf

I have since gotten out of the habit, but when I was just starting out I gave myself CTS one Christmas doing a giant refactor with just vi, to remove a disastrous idiom from the codebase that was O(n^2).

I knew quite a few commands, but I didn't know the block indentation commands, and those were about a third of my typing over that week.

After that, part of my regular “I’m tired or there’s nothing to work on” routine included reading the accelerator key documentation for my editor of choice. I found all sorts of good stuff and it really made me faster for quite some time. Now I do more analysis work and the accelerators that help there are less numerous. Fast code nav being a critical one.

But some of these tools make it pretty hard to find all their features, which is a shame.


Another obscure readline feature is that ~/.inputrc accepts key sequences bound to arbitrary (quoted) macros, including macros that contain more key sequences.

    # a basic macro that types "foo" bound to META+"f" 
    "\ef": "foo"
The cool part is that bash recursively checks the macro output for more key sequences. For example, I use these standard bindings:

    # <META>-k
    "\ek": shell-kill-word
    # <CONTROL>-y
    "\C-y": yank
    # <SHIFT>-RIGHT
    "\e[c": shell-forward-word
    # <SHIFT>-LEFT
    "\e[d": shell-backward-word
...which are used by these macros that use s-LEFT and s-RIGHT to move the current command-line argument one position left or right:

    # <SUPER>-LEFT
    "\e[D": "\e[d\ek\e[d^Y"
    # <SUPER>-RIGHT
    "\e[C": "\e[d\ek\e[c^Y"
    #        ^   ^  ^   ^
    #        |   |  |   > yank
    #        |   |  > shell-forward-word
    #        |   > shell-kill-word
    #        > shell-backward-word
(the actual key sequences depend on the terminal. Check with C-v <KEY>)

ctrl-o doesn't seem to do anything for me.

ctrl-r, find a command, press ctrl-o -> it inserts ^o in the command and exits out of ctrl-r.

Running bash 4.4.12(1) on Linux.


This is the most helpful bash diagram ever... why didn't I search for this before! https://zwischenzugs.files.wordpress.com/2018/01/shell-start...

As given in the article, it's also the most annoying bash diagram ever...it needs an explanation of what the 7 different colors of arrows mean. All that was given is:

> It shows which scripts bash decides to run from the top, based on decisions made about the context bash is running in (which decides the colour to follow).

> So if you are in a local (non-remote), non-login, interactive shell (eg when you run bash itself from the command line), you are on the ‘green’ line [...]

With a bit of Googling I believe I found the origin of that diagram:

https://blog.flowblok.id.au/2013-02/shell-startup-scripts.ht...

The author there explains how the colors work:

> Fortunately, I’ve read the man pages for you, and drawn a pretty diagram. To read it, pick your shell, whether it's a login shell, whether it's interactive, and follow the same colour through the diagram. When the arrows split out to multiple files, it means that the shell will try to read each one in turn (working left to right), and will use the first one it can read


I'd imagine they don't include it because the descriptions are very intertwined and complicated. I don't even know what they all mean, but I usually just google this mess.

This diagram is not entirely correct.

The remote bash startup order is further complicated by the existence of a compile time flag SSH_SOURCE_BASHRC. This flag determines if a remote non-interactive shell will load the ~/.bashrc file.

This flag is turned off by default and stays off in some distributions (like Archlinux), but is turned on in others (Debian, Fedora, ...) to replicate very old rsh behaviour.


This is why I use xonsh (http://xon.sh). It lets me use Python as a shell, and I don't have to remember too much awkward syntax.

The author mentions the substitution !:1-$ to insert all the arguments from the last command (like !$ substitutes the last argument and !! the full command). Note that !* does exactly the same thing, which is a bit easier to type/remember.
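
For example (a hypothetical interactive session):

    $ touch a.txt b.txt
    $ ls -l !*      # history-expands to: ls -l a.txt b.txt
    $ head -n1 !$   # !$ is just the last argument: head -n1 b.txt
    $ !!            # repeats the whole previous command: head -n1 b.txt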

I really wonder why the author didn't just leave off the first two, and call it "Eight Things I...". Starting off with "backslash escapes can be confusing" and "/* doesn't just match things consisting of 0 or more slashes!" makes no sense if you're later going to skip over !! because it's "obvious".

You shouldn't call your book 'learn X the hard way' if it isn't (also) freely available online if you ask me.

> !$ - I use this dozens of times a day. It repeats the last argument of the last command.

Press ESC then full stop instead. Less key presses.


Fewer key presses, but doesn't work if you're using vi bindings.

ESC _ or M-_ does work in vi mode though (and does the same thing as ESC ./M-.). Search for "yank-last-arg" in the bash manual.

Thank you.

Went searching how to do it in Zsh as well when the Zshell Line Editor (zle) is configured in vi mode:

$ bindkey -M viins '\e.' insert-last-word

Will make ESC-. work from insert mode.


bash, or a minimal shell such as ash, is critical for embedded systems where python/perl etc. are too fat.

    `` vs $()
I really appreciate authors who attack ambiguity or non-obvious equivalence in a subject head on. It's one of those aspects of explaining that takes to heart the perspective of the learner.
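
For anyone who hasn't seen the two forms side by side, a small sketch of the practical difference (they substitute the same output, but $() nests without escaping):

    out=`ls /tmp`        # legacy backtick form
    out=$(ls /tmp)       # POSIX $() form, same result

    parent=$(basename "$(dirname /tmp/a/b)")   # nests cleanly -> "a"
    parent=`basename \`dirname /tmp/a/b\``     # backticks need escaping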

Another recent example I encountered is from the online book http://neuralnetworksanddeeplearning.com: there are some confusing ambiguities and plainly misleading contradictions in NN terminology, one being that multi-layer perceptrons do not use perceptron neurons. These things can really screw with you while you're learning, especially the more subtle ones.

Some authors seem to have the ability to have full empathy for the novice while retaining expert knowledge and deep understanding by being able to both predict and answer relevant questions at the right point in an explanation at the right level of detail.


Interesting, but from a modern perspective isn't Perl a better scripting language to learn?

I have never used bash scripts professionally, i.e. as a language rather than a simple script with just a command in it.

Likewise, back in '87 or so I got trained in sed and I have only used it once since, and that was when I was playing around with early Linuxes (when they came on a huge number of floppies).


You have a terminal if you're on a Mac or Linux machine. It takes bash commands by default. Knowing perl is good, but knowing more bash is always good.

I hadn't known that <(echo "hi") is treated as a file containing that command's output, which simplifies commands that take files as arguments.


I meant using bash as a scripting language not as a shell.

    $ grep somestring file1 > /tmp/a
    $ grep somestring file2 > /tmp/b
    $ diff /tmp/a /tmp/b
You shouldn't do that, but not because it's not neat enough. /tmp is world-writable, so you might be writing to somebody else's file, or over a symlink that was set up by someone else. Use mktemp¹ for creating temporary files.

¹ http://man7.org/linux/man-pages/man1/mktemp.1.html
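
For example, a rough equivalent of that snippet using mktemp, with the temporary files cleaned up on exit:

    a=$(mktemp)
    b=$(mktemp)
    trap 'rm -f "$a" "$b"' EXIT
    grep somestring file1 > "$a"
    grep somestring file2 > "$b"
    diff "$a" "$b"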


Could you not do that with pipes instead, something like:

    $ diff <(grep somestring file1) <(grep somestring file2)

That's what the article is recommending.

If you use a thing a lot, you should probably invest some time in reading the manual for it.

The manual for bash consists of its Unix-style "man" page, and is therefore more of a reference than an instruction manual. I suggest using the Advanced Bash-Scripting Guide (http://tldp.org/guides.html#abs).


What was the surprising part about

    echo '*'
    echo "*"
? Both prints an asterisk. Is '*' some sort of BASH variable?

The asterisk * is a glob. Since double quotes allow variables inside them to be dereferenced rather than quoting everything literally, presumably he was expecting that double quotes might still let globs expand, when in fact both single and double quotes suppress globbing. So he was expecting it might print out "a".
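
To make that concrete, assuming a directory containing a single file named a:

    $ ls
    a
    $ echo *      # unquoted: the glob expands to matching filenames
    a
    $ echo '*'    # single quotes suppress globbing
    *
    $ echo "*"    # so do double quotes (only $, `, \ stay special inside them)
    *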

One of my favorite Bash patterns is the following. I call it 'feed the fish':

  . <(curl https://example.com/trusted.sh)
While it is extremely dangerous I found it so easy to remember, that it stuck in my head. It downloads the script and executes it in the current shell. So if anything unexpected happens you probably have a real problem ;-)

Background: A few years ago I was writing a Bash based OS installer. So after booting from a live CD I had to fetch the installer and execute it, which led me to using that pattern frequently.

While I love it, I can't stress enough how dangerous it is.


    if [ x$(grep not_there /dev/null) = 'x' ]
See now, I never get why people do this. -z has existed forever.
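
i.e. the same check, as a sketch:

    if [ -z "$(grep not_there /dev/null)" ]; then
      echo "no output"
    fi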

Because its mnemonic is very unclear. I think it stands for "zero" (as in zero-length string), but I am not sure.

re 9) I feel better about always feeling at least slightly confused about what files are being sourced.

The graph included seems to originate from https://blog.flowblok.id.au/2013-02/shell-startup-scripts.ht...


That graph is not entirely correct: https://news.ycombinator.com/item?id=16088866

The trickiest part of Bash I know is that "$(command)" results in the truncation of the output of the command before the newline. It's both handy and damning depending on what you're trying to do.

I don't think that's correct, do you have an example? This works for me:

  $ var=$(echo $'a\nb')
  $ echo "$var"
  a
  b

Oh, I meant before the trailing newlines, sorry for being unclear. Try using a\nb\n\n\n instead of a\nb and observe that the output doesn't change.
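
A quick way to see it:

  $ var=$(printf 'a\nb\n\n\n')
  $ echo "${#var}"   # 3: just "a", newline, "b" - the trailing newlines were stripped
  3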

Ah, okay, got it. Yes, that can be surprising.

I haven't used the ! history for many years. It's simply easier and faster to use command line editing.

! was useful before command line editing; but not much since.


For me, the biggest gotcha in bash is whether or not a sub-process/shell will be invoked, which can affect things like mutable variables and the number of open file handles. For example:

    COUNT=0
    someCommand | while read -r LINE
                  do
                    COUNT=$(( COUNT + 1 ))
                  done
    echo "$COUNT"
This will always print `0`, since the `COUNT=` line will be run in a sub-process due to the pipe, and hence it can't mutate the outer-process's `COUNT` variable. The following will count as expected, since the `<()` causes `someCommand` to run in a sub-process instead:

    COUNT=0
    while read -r LINE
    do
      COUNT=$(( COUNT + 1 ))
    done < <(someCommand)
    echo "$COUNT"
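(An aside going slightly beyond the comment above: bash 4.2+ also has shopt -s lastpipe, which runs the last stage of a pipeline in the current shell, so the piped version can work too; it only takes effect when job control is off, i.e. in scripts rather than interactive shells:)

    shopt -s lastpipe
    COUNT=0
    someCommand | while read -r LINE
                  do
                    COUNT=$(( COUNT + 1 ))
                  done
    echo "$COUNT"    # with lastpipe, the while loop ran in this shell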
Another issue I ran into is `$()` exit codes being ignored when spliced into strings. For example, if `someCommand` errors-out then so will this:

    set -e
    FOO=$(someCommand)
    BAR="pre $FOO post"
Yet this will fail silently:

    set -e
    BAR="pre $(someCommand) post"

    $ bash -ec 'BAR="pre $(someCommand) post"; echo hi'
    bash: someCommand: command not found
    $
I get the same behavior in bash, zsh, dash, and busybox sh. shbot says the same for bash versions since 1.14, ksh, and even the original bourne shell (modified to use ` instead of $()).

I think I've run into the first issue you describe, and I'm having a hell of a time trying to understand it. Would you mind taking a look at my example and helping me out?

Consider the following:

  cd /tmp/
  echo -e "hello world\nhello world\n:)" >> hello.txt
  cat hello.txt
Outputs:

  hello world
  hello world
  :)
Then running

  bash -c 'sed s/"hello"/"hiiii"/ hello.txt | tee hello.txt'
  cat hello.txt
yields

  hiiii world
  hiiii world
  :)
Resetting to the original `hello.txt` and running

  ssh localhost -t 'cd /tmp && sed s/"hello"/"hiiii"/ hello.txt | tee hello.txt'
  cat hello.txt
yields an empty file.

Replacing the command with

  ssh localhost -t 'cd /tmp && sed s/"hello"/"hiiii"/ hello.txt >> hello.txt'

yields

  hello world
  hello world
  :)
  hiiii world
  hiiii world
  :)

I'm trying to figure this out so I can publish a script to set up a test env for a package I'm trying to publish, and somewhat stuck on this step...

You're using hello.txt as both an input file and an output file, which seems like it's just asking for race-condition problems.

What about using sed -i to make changes to the file, and not doing any bash redirection?

Alternatively, if the real problem is more complex than that, try using a different name for the output file and rename after it's finished.


The `>>` operator in use is /appending/ to the file.

Also, as mentioned in a sibling comment, your hello.txt is both an input and output.


let COUNT=COUNT+1

is all you need


or:

let count++


What are the best alternatives to bash for writing fairly large automation scripts? I have already looked at Python, Lua and Guile, the last two especially since I like using them and they have some sort of POSIX interfaces. I haven't looked at Perl 6.

Why didn’t you consider Perl 5?

I've been looking for something like <() for a long time. Thanks for sharing it!

Regarding :h, is it any different than just using dirname?


It's 2018 and we're still talking about writing non-trivial bash scripts?

It's 2018 and bash is still the most convenient language for an enormous number of tasks. Funny, that...

Blog states that author has 20 years of development experience.

Blog post suggests he knew little of basic shell scripting until recently.

Blog also reveals he is selling a book on shell scripting with Bash, "Learn Bash the Hard Way."


> "Blog post suggests he knew little of basic shell scripting until recently."

I don't see how you could come to this conclusion based on the title. From the article, the author elaborates:

> "Recently I wanted to deepen my understanding of bash by researching as much of it as possible."

There seems to be a lot of good commentary in this thread indicating that people are finding the post useful. There are points here that I found useful myself. Do you have specific disagreements with the post contents?


If you "set -e", you probably also want to "set -o pipefail". By default, a pipeline returns the return value of the last element. pipefail means that if any element of the pipeline fails, then the pipeline as a whole will fail. I discovered this the hard way when I had:

make run-asan-test | c++filt

And even if the tests failed, the script would succeed.
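
i.e. roughly:

    set -eo pipefail
    make run-asan-test | c++filt   # a failing make now fails the pipeline, and the script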


The thing I wished I had learned earlier is "quick and dirty assertions". If you write lots of functions in Bash, you quickly end up getting tripped up by cases where an argument is omitted and the function does something totally batshit given the missing (empty string) argument. Now, the canonical way to handle this is to put validators on your input, (and make sure those validators don't crash with cryptic errors if someone calling your function is using "set -u") like so:

  function myfunc() {
    local foo="${1:-}"
    if [ -z "$foo" ]; then
      echo "Invalid first parameter!" >&2
      return 127
    fi
    ...
  }
...but man, that's time consuming when you have lots of parameters.

Instead, the quick and dirty way is to just "assert" via [parameter expansion](https://www.gnu.org/software/bash/manual/html_node/Shell-Par...):

  function myfunc() {
    local foo="${1:?First parameter must be provided}"
    ...
  }
Much quicker, especially when throwing things together in a hurry. It has a gotcha, though: ":?" assertion doesn't cause a function to return early, it shuts down the whole interpreter after outputting the error. So it's more like a true assert() statement than an input validator. If you'd only ever call your function in a subshell, this won't matter (because the subshell will exit early with a nonzero code, big deal), but otherwise it can be a nasty surprise to users when an argument-validation issue inside a function shuts the program down. Then again, the "return 127" in the first example would also shut the program down if someone was using "set -e".

...and while we're on the subject of "set -e", I think that the ["unofficial Bash strict mode"](http://redsymbol.net/articles/unofficial-bash-strict-mode/) (putting "set -euo pipefail" and "IFS=$'\n\t'" at the top of your scripts) has been a bigger bug-prevention/rapid development aide to me than anything else. To be clear, I think it's a means of detecting some kinds of bugs. I've read Wooledge and others' objections to those patterns, especially "set -e", and agree with the point that this does not make your programs objectively safer and shouldn't be counted on as a crutch. Then again, neither does a linter, but it still helps you detect and avoid some kinds of bugs, so why not use it?


I usually do something like that:

    [ -z "$1" ] && echo "Invalid first parameter!" >&2 && exit 127
as a precondition. I must admit that it's longer to write, but I can write a bunch of preconditions for my function then write the logic with an appeased mind

9) The remote bash startup order is further complicated by the existence of a compile time flag SSH_SOURCE_BASHRC. This flag determines if a remote non-interactive shell will load the ~/.bashrc file.

This flag is turned off by default and stays off in some distributions (like Archlinux), but is turned on in others (Debian, Fedora, ...) to replicate very old rsh behaviour.


I didn't see it mentioned, but I use it frequently so here is my tip. I'm not sure if it is bash specific (I just use it!).

Instead of typing !$ for the previous command's final argument, you can use the keyboard shortcut alt+. (alt+period). Pressing it multiple times will go to the last argument of previous commands. I use this quite a bit and found it easier than !$, because you can see which command it will be : )

I still don't always understand exactly what is happening with subprocesses vs subshells (chriswarbo's post is very useful in pointing out how wrinkly this can be), so I try to keep my bash usage simple.


Something I wish people teaching intermediate or advanced Bash tricks would emphasize more is how to make your program compatible with other shells. With the rise of Zsh's popularity, and the switch to Dash for Ubuntu/some Debian derivatives, I see a lot of people repeating bashisms in code they share without the knowledge that a) their code may not work for an unexpectedly large number of people, and b) switching to compatible equivalents doesn't make their code worse or less performant in many/most cases.

The most common bashisms and ways to avoid them are:

- Double brackets ([[) around conditions. Yes, I know that [ is a program (don't believe me? "which ["). That doesn't mean Bash uses it; it uses a builtin which is (almost) equivalent to [[ instead. Use that and your code will work in zsh/dash/all other POSIX shells. And while you're at it, stop using "which" as an authority for "is this a shell builtin or not?" [type()](http://linuxcommand.org/lc3_man_pages/typeh.html) is your friend.

- When comparing strings for equality, use a single equals sign "=", not "==" (e.g. 'if [ "$foo" = "some string" ]'). I know it feels dirty if you've programmed in any other language, but it changes nothing about your code's behavior and makes it compatible with several other shells.

- Don't use "function funcname()" syntax. It adds nothing over the basic "funcname()" syntax, but prevents your code running in many/most non-Bash shells. And consider putting your function-opening brace on a separate line (someone once told me that there are shells that won't accept any other function declaration style, but I've never seen one, so ymmv).

- Don't use "local" if you need to interoperate with ksh. Abandoning "local" pollutes global namespaces, though, so your call.

- Don't use substring expansion (e.g. extracting the 3rd-10th characters of a string via 'substr="${somevar:3:7}"'). That's not supported in many other shells. Alternatives include sed/awk/etc., or, if invoking external programs is absolutely unacceptable to you, something horrific like:

    substr()
    {
        local input="${1:?String is required}"  
        local dist_from_start="${2:?Start position is required}"
        local dist_from_end="${3:-${#input}}" # Here, it's actually 'offset', not distance.
        local start_nulls=
        local end_nulls=

        dist_from_end=$(( 5 * (${#input} - ($dist_from_start + $dist_from_end)) ))
        dist_from_start=$(( 5 * $dist_from_start ))

        # Make a string of the regex for "any not null character" that "masks" the
        # characters in the input before the start point, and the characters after
        # the end of the substring. This is disgusting, and is only done because the
        # parameter expansion statements can't contain repetitions (e.g. [^\0]{5})
        # without the bash-only 'extglob' shell option.
        # The not-null character is used because it will never match in a shell
        # string.
        while true; do
            if [ "${#start_nulls}" -lt $dist_from_start ]; then
                start_nulls="${start_nulls}[^\0]"
            elif [ "${#end_nulls}" -lt $dist_from_end ]; then
                end_nulls="${end_nulls}[^\0]"
            else
                break
            fi
        done

        input="${input#$start_nulls}"
        echo "${input%$end_nulls}"

    }

    substr "$@"
...actually, please never use that code. Ew.

Anyway, some more bashisms: https://mywiki.wooledge.org/Bashism


I didn't know a few of these, and I think I'll find `<()` specially useful.

I wasted a lot of time on this one:

If you declare a local variable and set it in the same step, e.g. local MYVAR=$(/bin/false), the return code you get is from the local declaration, not from assigning a value to the variable. It can be quite confusing when you afterwards check the return code with $? and it returns 0. Avoid it by assigning the value in a separate command.
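
A sketch of the difference:

    f() {
      local a=$(/bin/false)
      echo "$?"    # 0 - the status of `local`, not of /bin/false

      local b
      b=$(/bin/false)
      echo "$?"    # 1 - the command substitution's status is preserved
    }
    f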

