Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

> `sort LastWriteTime`

What really is the difference between `sort LastWriteTime` and `sort --LastWriteTime`? An argument without the dashes doesn't seem any fundamentally different from a flag.



sort by: page size:

Aaargh! "cat | sort" is an antipattern.

Sort accepts a list of files as arguments. "sort [flags] file1 file2 ..." is more concise, more efficient, and is less likely to run out of space.


but that isn't the command line, that's unix... windows has its own sort utility but the flags are different...

also contrast the submission with this post: http://pgbovine.net/command-line-bullshittery.htm


Indeed, I don't see why people get so upset (or pedantic) about what are, effectively, NOPs in command-lines.

However, things change if you start adding certain options to sort:-

  sort -m order.*
and

  cat order.* | sort -m
are definitely not the same thing (for most input files at least).

You need the -f flag to sort as well to ignore case during the sorting. I don't know how portable the flags to sort are though which is why I didn't bother using them.

> Doing cat from left to right helps readability - it's important.

People may balk because it's unfamiliar, but this is syntactically legal:

    < logs.txt sort | uniq > /dev/null
That is, the redirection customarily goes at the end, but it doesn't have to.

EDIT: Also, in this specific case, the "sort" command can take a file argument, so you can also do this:

    sort logs.txt | uniq > /dev/null

Or just `sort -u` (if you are using GNU sort, not sure about other implementations)

Another difference is that sort is optimized to handle large files [0]

[0] https://unix.stackexchange.com/questions/279096/scalability-...


"The unix command line ('cat foo.txt | sort | uniq -c | sort -rn') is wonderfully concise and powerful". And yet, contrary to the author's assertion, it can be made even more concise without sacrificing the power:

sort foo.txt | uniq -c | sort -rn


Ah, yeah, I had misread the docs there. Though I imagine it creates that unexpected for beginner's thing where `ls | sort` comes back with "no such file |" or similar. Basically, I'm saying there is a place for an interface that does pass a single string to the shell, and an interface where you specify args explicitly.

`tr [:punct:][:space:] "\n" < tmp.txt | sort -fu`

No need to run tr twice, and the -f flag for sort makes it ignore case.


Sort uses -t and -k, cut uses -d and -f. Your point is valid, of course - the options are not consistent across the unix tools.

'cat christmas_list.txt | sort > christmas_list.txt'

the above is wrong. that will clobber the file.


Unless you want to show before/after context (-A/-B flags), one should grep before sort, not after. Less work to do.

They are both technical, but 'order by bytes desc' has got to be more expressive than 'sort -nr'. It's almost natural human English, whereas the latter doesn't express anything.

That said, I don't know how much time it would genuinely save. As with most of these tools, you shouldn't be installing them on production servers, so you still have to know Bash anyway.


I can't believe this past has sort | uniq when GNU sort (important distinction, the BSD version and hence OSX can't) has a -u flag so sort -u == sort | uniq

In addition... "PUT commands write to a file, GET commands read the file back, sort and reduce the data, then return the file"

I guess the author had to add the sort/reduce to "prove" his point...


And that line, at least for sorting, belongs firmly outside the filesystem.

Sorting is locale-dependent. Whether a letter-with-dots sorts next to letter-without-dots or somewhere completely different has no correct global answer.


(You sound like someone who would probably know this, but a lot of people don't).

Note that in general "sort -nu" is not equivalent to "sort -n | uniq", although in this particular case they are. If you have more than one field on the line, they can differ.

For example, if the input is:

  3 first
  2 second
  3 second
  1 second
  2 first
  1 first
then this is the "sort -nu" output:

  1 second
  2 second
  3 first
whereas "sort -n | uniq" gives:

  1 first
  1 second
  2 first
  2 second
  3 first
  3 second
I'm a bit confused as to how "sort -nu" chooses which record to print when there are more than one with the same key. I would have expected the chosen record to be the one that would have come first in "sort -n" output, so that "sort -n | sort -nu" would be equivalent to "sort -nu", but that is not the case. The pipe gives:

  1 first
  2 first
  3 first
It is as if -nu automatically adds -s.

Not explaining, trying to tell that you are comparing apples to oranges and making a conclusion based on that.

Also, you don't need to spawn a subshell nor feed sort via stdin in the first example :)


> s='du -sh * | sort -h'

Quick shoutout, you should probably alias to 'du -sh -- *'. Otherwise file names starting with '-' are interpreted as input for your command.

next

Legal | privacy