I've never seen dual pivot quick sort before.
Is it an upgraded edition of quick sort?
And what is the difference between dual pivot quick sort and quick sort?
I find this in the Java doc.
The sorting algorithm is a Dual-Pivot Quicksort by Vladimir Yaroslavskiy, Jon Bentley, and Joshua Bloch. This algorithm offers O(n log(n)) performance on many data sets that cause other quicksorts to degrade to quadratic performance, and is typically faster than traditional (one-pivot) Quicksort implementations.
Then I find this in the Google search result. Theory of quick sort algorithm:
- Pick an element, called a pivot, from the array.
- Reorder the array so that all elements, which are less than the pivot, come before the pivot and all elements greater than the pivot come after it (equal values can go either way). After this partitioning, the pivot element is in its final position.
- Recursively sort the sub-array of lesser elements and the sub-array of greater elements.
In comparison, dual-pivot quick sort:
- For small arrays (length < 17), use the Insertion sort algorithm.
- Choose two pivot elements P1 and P2. We can get, for example, the first element a[left] as P1 and the last element a[right] as P2.
- P1 must be less than P2, otherwise they are swapped. So, there are the following parts:
- part I with indices from left+1 to L–1 with elements, which are less than P1,
- part II with indices from L to K–1 with elements, which are greater or equal to P1 and less or equal to P2,
- part III with indices from G+1 to right–1 with elements greater than P2,
- part IV contains the rest of the elements to be examined with indices from K to G.
- The next element a[K] from the part IV is compared with two pivots P1 and P2, and placed to the corresponding part I, II, or III.
- The pointers L, K, and G are changed in the corresponding directions.
- The steps 4 - 5 are repeated while K ≤ G.
- The pivot element P1 is swapped with the last element from part I, the pivot element P2 is swapped with the first element from part III.
- The steps 1 - 7 are repeated recursively for every part I, part II, and part III.
O(n log n)
and InsertionSort is O(n^2)
, however this is really C1 * n * log(n)
and C2 * n * n
(where C1 and C2 are some constants). When n
is large, the constants are irrelevant. When n
is small they can be relevant. Presumably the implementors investigated and found that 17 is the tipping point where: C1 * log(17) > C2 * 17
, but C1 * log(18) < C2 * 18
. –
Under I just want to add that from the algorithm point of view (i.e. the cost only considers the number of comparisons and swaps), 2-pivot quicksort and 3-pivot quicksort is not better than classical quicksort (which uses 1 pivot), if not worse. However, they are faster in practice since they take the benefits of modern computer architecture. Specifically, their numbers of cache misses are smaller. So if we remove all caches and there are only CPU and main memory, in my understanding, 2/3-pivot quicksort is worse than classical quicksort.
References: 3-pivot Quicksort: https://epubs.siam.org/doi/pdf/10.1137/1.9781611973198.6 Analysis of why they perform better than classical Quicksort: https://arxiv.org/pdf/1412.0193v1.pdf A complete and not-too-much-details reference: https://algs4.cs.princeton.edu/lectures/23Quicksort.pdf
For those who are interested, take a look how they implemented this algorithm in Java:
As stated in source:
"Sorts the specified range of the array using the given workspace array slice if possible for merging
The algorithm offers O(n log(n)) performance on many data sets that cause other quicksorts to degrade to quadratic performance, and is typically faster than traditional (one-pivot) Quicksort implementations."
© 2022 - 2024 — McMap. All rights reserved.