Is stdlib's qsort recursive?

Asked 31/7, 2010 at 20:43 Answered 31/7, 2010 at 22:2

I've read that qsort is just a generic sort, with no promises about implementation. I don't know about how libraries vary from platform to plaform, but assuming the Mac OS X and Linux implementations are broadly similar, are the qsort implementations recursive and/or require a lot of stack?

I have a large array (hundreds of thousands of elements) and I want to sort it without blowing my stack to oblivion. Alternatively, any suggestions for an equivalent for large arrays?

Langouste answered 31/7, 2010 at 20:43 Comment(0)

Here's a version from BSD, copyright Apple, presumably used in OS X at some time or another:

http://www.opensource.apple.com/source/xnu/xnu-1456.1.26/bsd/kern/qsort.c

It is call-recursive, although the upper bound on the depth of recursion is small, as Blindy explains.

Here's a version from glibc, presumably used in Linux systems at some time or another:

http://www.umcs.maine.edu/~chaw/200801/capstone/n/qsort.c

It's not call recursive. For exactly the same reason that the limit on call-recursion is small, it can use a small fixed amount of stack to manage its loop-recursion.

Can I be bothered to look up the latest versions? Nope ;-)

For a few hundred thousand array elements, even the call-recursive implementation won't call more than 20 levels deep. In the grand scheme of things that is not deep, except on very limited embedded devices, which wouldn't have enough memory for you to have an array that big to sort in the first place. When N is bounded above, O(log N) obviously is a constant, but more than that it's normally quite a manageable constant. Usually 32 or 64 times "small" is "reasonable".

Enervate answered 31/7, 2010 at 21:5 Comment(2)

+1 for actually looking at the source code. It's interesting to note that glibc uses a quicksort/insertion sort hybrid in their qsort() – Wideranging 31/7, 2010 at 22:37

@nos: IIRC that's what Knuth tells you to do, so interesting but hopefully not surprising ;-) – Enervate 1/8, 2010 at 12:27

You know, the recursive part is logn deep. In 64 levels of recursion (which is ~64*4=~256 bytes of stack total) you can sort an array of size ~2^64, ie an array as large as you can address on a 64 bit cpu, which is 147573952589676412928 bytes for 64 bit integers. You can't even hold it in memory!

Worry about stuff that matters imo.

Husted answered 31/7, 2010 at 20:50 Comment(11)

+1. It may be a few more bytes than 256 depending on how much is pushed on the stack for each level, but it's still a small constant. – Chessa 31/7, 2010 at 21:8

-1: This is wrong. Quicksort has worst case space complexity O(n), not O(log n). A large array can blow the stack. – Matthiew 31/7, 2010 at 21:36

@Luther: when properly implemented (when recursing, sort the smaller partition first), stack usage is limited to approximately logarithmic growth. To be exact, Knuth gives it as [lg (N+1)/(M+2)] (with "[]" signifying "floor"), where N=number of elements being sorted and M=size of partition where you stop recursing (presuming an "improved" Quicksort that switches to insertion sort when the whole thing is nearly sorted). – Aiello 31/7, 2010 at 22:0

Luther, qsort() isn't "Quicksort"— or rather the actual algorithm is implementation defined. Glibc's qsort() for example, switches to insertion sort in order to avoid the worst case space complexity problem. – Carnay 31/7, 2010 at 22:56

@Jerry: Worst case space complexity is O(n). This can be confirmed at a number of websites searchable via Google at many reputable forms of higher education around the world. (i.e. ugweb.cs.ualberta.ca/~c115/F06/schedule/lectures/lecture12.pdf) – Pilsudski 1/8, 2010 at 12:12

@0A0D: that Alberta slideshow is not useful. Possibly a fine simplification for teaching purposes, but nobody actually implements the partition step by allocating two new arrays, one for each side of the pivot, and copying the elements into them. So, the analysis is not relevant to any implementation of Quicksort written by someone who knows what they're doing - part of the benefit of Quicksort is that it's an (almost) in-place algorithm. – Enervate 1/8, 2010 at 12:35

@Steve: Even Wikipedia lists O(n) [though that doesn't say much...] – Pilsudski 1/8, 2010 at 12:41

.. which is not to say that the academics at Alberta don't know how to write a quicksort. I'm sure they do. They just aren't teaching students "Sedgewick's improved version" (as Wikipedia calls it) in that particular lecture, so it's wrong to take their specimen implementation as typical, or even as plausible. – Enervate 1/8, 2010 at 13:5

@0A0D: If you're going to try to appeal to authority instead of using a logic-based argument, you need to consider the source. When it comes to analysis of algorithms, who do you trust more: Duane Szafron or Donald Knuth? [As an off-topic aside, "i.e." is an abbreviation for "id est" which translates to "that is"; you seem to have wanted "e.g.", an abbreviation for "example gratis", which translates to "for example".] – Aiello 1/8, 2010 at 15:37

Why not just forget about whether "quick" sort could blow the stack and use the One True Sort, heap sort? No temp space and real O(n log n) time. :-) – Showthrough 1/8, 2010 at 17:49

@Jerry: I really don't care whether it's i.e. or e.g. – Pilsudski 1/8, 2010 at 20:50

Yes it's recursive. No, it probably will not use large amounts of stack. Why not simply try it? Recursion is not some kind of bogey - it's the solution of choice for very many problems.

Layout answered 31/7, 2010 at 20:46 Comment(15)

Trust me, I use recursion all the time! Just never before for depths like this, hence my asking first. – Langouste 31/7, 2010 at 20:58

@Langouste Depths like what? The recursion in quicksort pushes stack frames (i.e. local variables and return addresses) to the stack, not copies of the thing being sorted. This is very little data. – Layout 31/7, 2010 at 21:1

I'd only care if it were a PIC. +1 ;-) – Saretta 31/7, 2010 at 21:6

@Neil: unless you screw up your implementation, and call-recurse on the "big half". Which obviously those sound implementations don't, but it takes a little thought to realise why the bound on stack usage is so small even in the pathological worst run-time of quicksort. Everyone probably uses at least Introsort for qsort now anyway... – Enervate 31/7, 2010 at 21:8

I realise the nature of the data on the stack. It was more the number of frames (which, now I actually look at the numbers, isn't that great). I still defend asking the question of whether it's suitable for inputs of a given size, though. The stdlib manpage on my system doesn't mention the kinds of data over which it is suitable. – Langouste 31/7, 2010 at 21:11

@Langouste qsort would not be the sort of choice if it didn't handle very large datasets well. Nothing wrong with the question though, except I do find the reluctance of many people here to actually try things out a bit off-pissing. – Layout 31/7, 2010 at 21:12

Entirely offtopic: Neither is the Pope catholic, nor do bears mostly shit in the woods – Chessa 31/7, 2010 at 21:13

-1: Quicksort has worst case space complexity O(n) which means that sorting a large array can blow the stack. If stack space is not abundant (like in a thread or coroutine), then this is something to consider. – Matthiew 31/7, 2010 at 21:40

@Luther Blisset I am talking about qsort not Quicksort. And why not take this up (or downvote) Andrey T's answer - I made no mention of the complexity of the sort. – Layout 31/7, 2010 at 21:44

Sigh; the quip attracted quite a slew of "offensive", so edited out. – Livelong 31/7, 2010 at 22:15

@Luther: as noted in response to another comment, you're just plain wrong on this one. The worst case is logarithmic, not linear, as long as the Quicksort is implemented correctly. – Aiello 31/7, 2010 at 22:17

@Jerry: Quicksort worst case is O(n^2) [time complexity] but case is rare if it is implemented properly. Otherwise it is O(n log n). – Pilsudski 31/7, 2010 at 22:58

@0A0D, the discussion was about space complexity, not time complexity. – Monniemono 1/8, 2010 at 4:54

@Lars: In which case, the worst case space complexity is O(n) – Pilsudski 1/8, 2010 at 11:49

@0A0D: "worst-case" means, "the worst possible input", not "the worst possible implementation". Of course it is possible to implement Quicksort so that it has O(n) worst-case space complexity. It's also possible to implement Quicksort so that it has O(n^2) space complexity (for instance in a language which lets you pass and assign arrays by value as copies). Either would be a bug, although of course the latter is a worse one. – Enervate 1/8, 2010 at 12:20

A properly implemented qsort does not require more than log2(N) levels of recursion (i.e. depth of stack), where N is the largest array size on the given platform. Note that this limit applies regardless of how good or bad the partitioning happens to be, i.e. it is the worst case depth of recursion. For example, on a 32-bit platform, the depth of recursion will never exceed 32 in the worst possible case, given a sane implementation of qsort.

In other words, if you are concerned about the stack usage specifically, you have nothing to worry about, unless you are dealing with some strange low-quality implementation.

Negrophobe answered 31/7, 2010 at 21:21 Comment(0)

I remember reading in this book: C Programming: A Modern Approach that the ANSI C specification doesn't define how to implement qsort.

And the book wrote that qsort could in reality be a another kind of sort, merge sort, insertion sort and why not bubble sort :P

So, the qsort implementation might not be recursive.

Manchukuo answered 31/7, 2010 at 21:19 Comment(2)

Good standards don't describe how to implement anything - they do though for things like sorts specify minimum complexity guarantees which may limit the choice of algorithm for implementations. – Layout 31/7, 2010 at 21:25

@Neil: regardless of what good standards do, as it happens the C standard doesn't specify the complexities of qsort and bsearch. Fortunately the question is about two implementations in particular, so the standard is pretty much irrelevant. If Apple is going to perversely switch OS X to Bogosort in the next release, then whether they get away with that will not depend on whether it breaks the C standard... – Enervate 31/7, 2010 at 22:15

With quicksort, the stack will grow logarithmically. You will need a lot of elements to blow up your stack.

Consumption answered 31/7, 2010 at 20:51 Comment(1)

@msw: Seeing as you insist on being pedantic, you forgot to define N as the size of the the array. As far as I'm concerned, the term "logarithmic growth" is generally understood to mean O(lg(n)) when talking about algorithms. – Consumption 31/7, 2010 at 21:21

I'd guess that most modern implementations of qsort actually use the Introsort algorithm. A reasonably written Quicksort won't blow the stack anyway (it'll sort the smaller partition first, which limits stack depth to logarithmic growth).

Introsort goes a step further though -- to limit the worst case complexity, if it sees that Quicksort isn't working well (too much recursion, so it could have O(N²) complexity), it'll switch to a Heapsort which guarantees O(N log₂ N) complexity and limits stack usage as well. Therefore, even if the Quicksort it uses is sloppily written, the switch to Heapsort will limit stack usage anyway.

Aiello answered 31/7, 2010 at 21:6 Comment(0)

A qsort implementation which can fail on large arrays is extremely broken. If you're really worried I'd go RTFS, but I suspect any half-decent implementation will either use an in-place sorting algorithm or use malloc for temporary space and fall back to an in-place algorithm if malloc fails.

Showthrough answered 31/7, 2010 at 20:53 Comment(0)

The worst-case space-complexity of a naive quicksort implementation (which is still a popular option for qsort) is O(N). If the implementation is modified to sort the smaller arary first and tail-recursion optimisation or an explicit stack and iteration is used then the worst case space can be brought down to O(log N), (what most answers here wrote already). So, you will not blow up your stack if the implementation of quick-sort is not broken and the library was not broken by improper compiler flags. But, for example, most compiler which support tail recursion elimination won't do this optimization it in unoptimized debug builds. A library built with the wrong flags (i.e. not enough optimization, for example in the embedded domain where you sometimes build your own debug libc) might crash the stack then.

For most developers, this will never be an issue (they have vendor tested libc's which have O(log N) space complexity), but I'd say it is a good idea to have an eye on potential library issues from time to time.

UPDATE: Here's an example for what I mean: A bug in libc (from 2000) where qsort would start thrashing virtual memory because the qsort implementation would switch internally to mergesort because it though there is enough memory to hold a temporary array.

http://sources.redhat.com/ml/libc-alpha/2000-03/msg00139.html

Matthiew answered 31/7, 2010 at 22:2 Comment(10)

Questioner is asking about particular systems, which have reasonable quality of implementation. "naive quicksort implementation is still a popular option" is simply false. It is not popular with people who write C libraries, which is what the question concerns. – Enervate 31/7, 2010 at 22:7

Questioner asked about "Linux". Linux has no implementation of qsort, because it's a kernel. qsort is a function the C-runtime library for which there are several options (glibc,uclibc,newlib,dietlibc..and then there's this thing they've put into Android). Also: see my update. – Matthiew 31/7, 2010 at 22:22

-1 from me: a hypothetical badly implemented qsort is pretty irrelevant. The glibc qsort is implemented quite well, and I assume the OS X one is as well. A bad implementation of qsort is a bug, which needs to be fixed. – Monniemono 31/7, 2010 at 22:28

@Lars: I just gave an example how glibc's qsort was implemented in a way you would deem hypothetical and it gave someone concrete headaches. It was of course a fixed. – Matthiew 31/7, 2010 at 22:33

+1 This is a good answer. In fact, it is along the same lines as AndreyT except Luther doesn't have over 30K rep. – Pilsudski 31/7, 2010 at 22:33

@Luther, the bug you link to is not because of a naive implementation, but a smart implementation with a bad cornercase. I see no point in assuming the stdlib implemntors are incompetent. – Monniemono 1/8, 2010 at 4:52

@Lars: The bad cornercase is "The machine has virtual memory". – Matthiew 1/8, 2010 at 7:19

@Luther: thanks for that party political broadcast from GNU. It may interest you to learn that in the English language, which is what Joe and I are both speaking, the word "Linux" is used as a generic term for GNU/linux-based OS distributions ;-p – Enervate 1/8, 2010 at 11:24

And I further note that, while the GPL of course requires proper copyright attribution, it does not give RMS the right to name derivative works (and neither does he claim that right, just acts a bit grumpy when people don't do what he says). So if for example Red Hat calls "their" OS "Linux", then that is its name, regardless of whether it "should" be called GNU/Linux... – Enervate 1/8, 2010 at 11:30

@0A0D: of course, but having worked so hard for Red Hat's right not to have to do what he (the author of the software they use) wants, RMS must grudgingly accept they they sometimes won't do what he wants :-) – Enervate 1/8, 2010 at 12:5

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags