There’s also vastly faster ways of sorting lists of bounded numbers (or tuples), the most linear being: cut straws of the exact right length labelled with an Id, bang them on a table so the ends without the ids all line up, read the ids back in order.
I only said it took linear time, I didn’t say it was fast...
I disagree, because it is perfectly possible to sort without comparisons. For example, sorting a list of distinct integers is theoretically O(N). (radix sorting)
nit: runtime for radix sort is O(nk), since you sort the full list by each of the digits, sequentially.
I've always thought claiming radix sort on integers to be linear was a bit disingenuous, since k << log(n) only if your list has a lot of duplicate entries.
In fact, if your integers are all unique, then k >= log(n).
But isn't this just radix sort, effectively? That's linear-time, no problem. There's no fundamental rule saying that sorting has to be O(n lg n), it's only sorting-by-comparison that needs O(N lg n) comparisons.
Though I guess one might point out that, without direct (and infinitely precise!) comparisons, eventually you'll have enough sticks that you won't really be able to tell which is the largest.
Exactly. Radix sort isn't really linear. It is O(N * M) where N is the size of the collection and M is the number of possible items in the collection. If the number of possible items is fixed (like fixed-size integers) then it is effectively linear. However you can do this with strings just the same way.
Computer sorts require at least linear time, as you need to read the input (if you don't know other things about the input, say it has an uniform distribution). There are several linear time sorts, sleepsort, postman sort, counting sort. (For a bounded set of numbers or sortable keys.)
He's essentially cheating. When he analyzes the algorithm presented, he says "I’ll now describe an algorithm that does not depend on the word length. For any word length w, w = log n, it sorts n word-sized integers in time proportional to n log log n."
But that's entirely cheating. If you give the same assumption to radix sort, then you could say "For any word length w, w = log n, radix sort sorts n word-sized integers in time proportional to n". Which is much better.
The only time radix sort actually goes slower than linear time is when the size of the integers is not polynomial in n. If the size of the integers is exponential n, I think this algorithm would similarly slow down, as it would take a linear number of words to represent a single integer, meaning that it would take log n reductions to get the integers to fit into a single word, not log log n as he assumes.
Radix sort really does have linear time complexity, but requires fixed size keys. If your keys are variable sized, e.g. strings, it's O(n log n) as usual.
Similarly, sometimes your sorting algorithm takes linear time because the input was a sorted list. That's the lower bound, but lower bounds usually aren't as interesting as upper bounds.
You can sort strings also in linear time with a trie (for example). This requires O(n) time, where n is the total length of the input (i.e. sum of lengths of strings).
I remember reading a book about analogue computation, and one fun fact stuck in my teenage brain - you can build a sort 'algorithm' that always takes only one operation. For every 'number' that you want to sort, cut a piece of uncooked spaghetti to a length that represents the number. Take the pieces of spaghetti and hold them loosely in your hand parallel to each other. Then bang one end down on the table, and they will all be at the same position on the table end. Read the resulting order of the numbers from the other end.
Someone could argue that cutting/labelling all the pieces of spaghetti takes considerably more time. I'm sure we are all capable of seeing past such a boring technicality.
The book described how big a pile of spaghetti would need to be to sort large numbers, and I'm afraid I can't remember any of the details, but there's a certain point where you need to bang the pile of spaghetti against something the size of the moon.
I only said it took linear time, I didn’t say it was fast...
reply