Popularity

6.7

Growing

Activity

0.0

Stable

Stars 2,281

Watchers 73

Forks 95

Last Commit 5 months ago

Programming language: C++

License: zlib License

Tags: Miscellaneous

pdqsort alternatives and similar libraries

Based on the "Miscellaneous" category.

ZXing

9.9 8.6 L3 pdqsort VS ZXing

ZXing ("Zebra Crossing") barcode scanning library for Java, Android
stb

9.8 6.7 L2 pdqsort VS stb

stb single-file public domain libraries for C/C++

C++ Format

9.6 9.8 L1 pdqsort VS C++ Format

A modern formatting library
HTTP Parser

8.9 0.0 L1 pdqsort VS HTTP Parser

http request/response parser for c
RE2

8.9 8.9 L1 pdqsort VS RE2

RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.
Cppcheck

8.6 9.9 pdqsort VS Cppcheck

static analysis of C/C++ code
SDS

8.1 0.0 L2 pdqsort VS SDS

Simple Dynamic Strings library for C
Klib

7.9 4.3 L4 pdqsort VS Klib

A standalone and lightweight C library
ZBar

7.8 0.0 L3 pdqsort VS ZBar

Clone of the mercurial repository http://zbar.hg.sourceforge.net:8000/hgroot/zbar/zbar
American fuzzy lop

7.6 0.0 pdqsort VS American fuzzy lop

american fuzzy lop - a security-oriented fuzzer
Serial Communication Library

7.5 0.0 L1 pdqsort VS Serial Communication Library

Cross-platform, Serial Port library written in C++
libssh2

6.5 9.3 pdqsort VS libssh2

the SSH library
PHP-CPP

6.4 6.2 L4 pdqsort VS PHP-CPP

Library to build PHP extensions with C++
Better Enums

6.1 3.7 L5 pdqsort VS Better Enums

C++ compile-time enum to string, iteration, in a single header file
c-smart-pointers

5.9 0.0 pdqsort VS c-smart-pointers

Smart pointers for the (GNU) C programming language
Experimental Boost.DI

5.7 3.5 L3 pdqsort VS Experimental Boost.DI

C++14 Dependency Injection Library
Mach7

5.7 0.0 L2 pdqsort VS Mach7

Functional programming style pattern-matching library for C++
UNITS

5.2 0.0 L3 pdqsort VS UNITS

a compile-time, header-only, dimensional analysis and unit conversion library built on c++14 with no dependencies.
stdman

4.9 1.8 pdqsort VS stdman

Formatted C++20 stdlib man pages (cppreference)
constexpr-8cc

4.7 3.9 L3 pdqsort VS constexpr-8cc

Compile-time C Compiler implemented as C++14 constant expressions
SLRE

4.6 0.0 L3 pdqsort VS SLRE

Super Light Regexp engine for C/C++
DynaMix

4.4 6.7 pdqsort VS DynaMix

:fish_cake: A new take on polymorphism
Stage

4.3 0.0 L1 pdqsort VS Stage

Mobile robot simulator
outcome

4.3 7.4 pdqsort VS outcome

Provides very lightweight outcome<T> and result<T> (non-Boost edition)
cxx-prettyprint

4.2 0.0 L5 pdqsort VS cxx-prettyprint

A header-only library for C++(0x) that allows automagic pretty-printing of any container.
libcpuid

4.1 6.9 L2 pdqsort VS libcpuid

a small C library for x86 CPU detection and feature extraction
STX

4.1 8.1 pdqsort VS STX

C++17 & C++ 20 error-handling and utility extensions.
Better String

4.0 0.0 L2 pdqsort VS Better String

The Better String Library
kangaru

3.8 4.5 L5 pdqsort VS kangaru

🦘 A dependency injection container for C++11, C++14 and later
CppVerbalExpressions

3.8 0.0 L5 pdqsort VS CppVerbalExpressions

C++ regular expressions made easy
value-category-cheatsheet

3.6 2.1 pdqsort VS value-category-cheatsheet

A C++14 cheat-sheet on lvalues, rvalues, xvalues, and more
leaf

3.4 7.5 pdqsort VS leaf

Lightweight Error Augmentation Framework
casacore

3.3 7.0 L1 pdqsort VS casacore

Suite of C++ libraries for radio astronomy data processing
neither

3.1 0.0 pdqsort VS neither

Either and Maybe monads for better error-handling in C++ ↔️
semver.c

3.0 0.0 L5 pdqsort VS semver.c

Semantic version in ANSI C
libusb

3.0 6.7 pdqsort VS libusb

Access USB devices from Ruby via libusb-1.x
gcc-poison

2.9 0.0 pdqsort VS gcc-poison

gcc-poison
StrTk

2.9 0.0 pdqsort VS StrTk

C++ String Toolkit Library https://www.partow.net/programming/strtk/index.html
ub-canaries

2.8 0.0 L4 pdqsort VS ub-canaries

collection of C/C++ programs that try to get compilers to exploit undefined behavior
Boost.Signals

2.8 4.8 L2 pdqsort VS Boost.Signals

Boost.org signals2 module
libnih

2.6 0.0 L2 pdqsort VS libnih

NIH Utility Library
sigslot

2.4 0.0 L5 pdqsort VS sigslot

C++11 signal/slot implementation
QtVerbalExpressions

2.3 0.0 L5 pdqsort VS QtVerbalExpressions

This Qt lib is based off of the C++ VerbalExpressions library. [MIT]
libsigc++

2.1 0.7 L5 pdqsort VS libsigc++

A typesafe callback system for standard C++. [LGPL]
access_profiler

2.1 0.0 L5 pdqsort VS access_profiler

a tool to count accesses to member variables in c++ programs
cppq

2.1 10.0 pdqsort VS cppq

Simple, reliable & efficient distributed task queue for C++17
FastFormat

2.0 4.8 pdqsort VS FastFormat

The fastest, most robust C++ formatting library
strf

2.0 0.0 pdqsort VS strf

Yet another C++ text formatting library.
libevil

1.8 0.0 L4 pdqsort VS libevil

The Evil License Manager
CommonPP

1.8 5.0 L4 pdqsort VS CommonPP

Small library helping you with basic stuff like getting metrics out of your code, thread naming, etc.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

pdqsort

Pattern-defeating quicksort (pdqsort) is a novel sorting algorithm that combines the fast average case of randomized quicksort with the fast worst case of heapsort, while achieving linear time on inputs with certain patterns. pdqsort is an extension and improvement of David Musser's introsort. All code is available for free under the zlib license.

Best        Average     Worst       Memory      Stable      Deterministic
n           n log n     n log n     log n       No          Yes

Usage

pdqsort is a drop-in replacement for std::sort. Just replace a call to std::sort with pdqsort to start using pattern-defeating quicksort. If your comparison function is branchless, you can call pdqsort_branchless for a potentially large speedup. If you are using C++11 or higher, the type you're sorting is arithmetic, and your comparison function is either not given or is std::less/std::greater, pdqsort automatically delegates to pdqsort_branchless.
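A minimal usage sketch, assuming the header is named pdqsort.h and that pdqsort and pdqsort_branchless accept the same arguments as std::sort. The helper names sort_ascending and sort_descending are hypothetical, and the std::sort fallback only exists so the snippet compiles when the header is not on the include path:

```cpp
#include <algorithm>
#include <vector>

// Use the real header if it can be found; otherwise fall back to std::sort
// so that this sketch still compiles on its own.
#if defined(__has_include)
#  if __has_include("pdqsort.h")
#    include "pdqsort.h"
#    define SKETCH_HAVE_PDQSORT 1
#  endif
#endif

// Hypothetical helper: ascending sort with the default comparison.
inline void sort_ascending(std::vector<int>& v) {
#ifdef SKETCH_HAVE_PDQSORT
    pdqsort(v.begin(), v.end());  // assumed to take the same arguments as std::sort
#else
    std::sort(v.begin(), v.end());
#endif
}

// Hypothetical helper: descending sort with a custom comparison. Comparing two
// ints does not need a branch, so the branchless variant is requested explicitly.
inline void sort_descending(std::vector<int>& v) {
    auto greater = [](int a, int b) { return a > b; };
#ifdef SKETCH_HAVE_PDQSORT
    pdqsort_branchless(v.begin(), v.end(), greater);
#else
    std::sort(v.begin(), v.end(), greater);
#endif
}
```

Because the call shape matches std::sort, switching an existing call site should only require changing the function name and adding the include.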

Benchmark

A comparison of pdqsort and GCC's std::sort and std::stable_sort with various input distributions:

Performance graph

Compiled with -std=c++11 -O2 -m64 -march=native.

Visualization

A visualization of pattern-defeating quicksort sorting a ~200 element array with some duplicates. Generated using Timo Bingmann's The Sound of Sorting program, a tool that has been invaluable during the development of pdqsort. For the purposes of this visualization the cutoff point for insertion sort was lowered to 8 elements.

Visualization

The best case

pdqsort is designed to run in linear time for a couple of best-case patterns. Linear time is achieved for inputs that are in strictly ascending or descending order, only contain equal elements, or are strictly in ascending order followed by one out-of-place element. There are two separate mechanisms at play to achieve this.

For equal elements a smart partitioning scheme is used that always puts equal elements in the partition containing elements greater than the pivot. When a new pivot is chosen it's compared to the greatest element in the partition before it. If they compare equal we can derive that there are no elements smaller than the chosen pivot. When this happens we switch strategy for this partition, and filter out all elements equal to the pivot.
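The "filter out all elements equal to the pivot" step can be sketched as follows. This is an illustration built on std::partition, not the partitioning routine pdqsort itself uses, and the helper name skip_equal_to_pivot is hypothetical:

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical helper.
// Precondition: no element in [first, last) is smaller than pivot.
// Moves every element equal to pivot to the front of the range and returns an
// iterator to the first element strictly greater than it. Only the range
// [result, last) still needs to be sorted afterwards.
template <class Iter, class T>
Iter skip_equal_to_pivot(Iter first, Iter last, const T& pivot) {
    assert(std::none_of(first, last, [&pivot](const T& x) { return x < pivot; }));
    // !(pivot < x) means x <= pivot; given the precondition, that is x == pivot.
    return std::partition(first, last,
                          [&pivot](const T& x) { return !(pivot < x); });
}
```

On an input consisting only of equal elements the returned iterator is last, so nothing is left to sort after a single linear pass.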

To get linear time for the other patterns we check after every partition if any swaps were made. If no swaps were made and the partition was decently balanced we will optimistically attempt to use insertion sort. This insertion sort aborts if more than a constant number of moves are required to sort.
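A sketch of such an aborting insertion sort, for random-access iterators. The helper name try_insertion_sort and the default limit of 8 moves are assumptions made for illustration:

```cpp
#include <cstddef>
#include <iterator>
#include <utility>
#include <vector>

// Hypothetical helper: insertion sort that gives up once more than
// `move_limit` element moves have been made (the default of 8 is an
// illustrative assumption).
// Returns true if [first, last) is now sorted, false if it gave up early.
// In both cases the range remains a permutation of the input.
template <class Iter>
bool try_insertion_sort(Iter first, Iter last, std::size_t move_limit = 8) {
    if (first == last) return true;
    std::size_t moves = 0;
    for (Iter cur = std::next(first); cur != last; ++cur) {
        if (moves > move_limit) return false;  // too much work, let the caller fall back

        Iter sift = cur;
        Iter prev = std::prev(cur);
        if (*sift < *prev) {
            auto tmp = std::move(*sift);
            // Shift larger elements one slot to the right until tmp fits.
            do {
                *sift = std::move(*prev);
                --sift;
            } while (sift != first && tmp < *--prev);
            *sift = std::move(tmp);
            moves += static_cast<std::size_t>(cur - sift);
        }
    }
    return true;
}
```

An ascending run followed by one out-of-place element costs a single pass plus one insertion, while a descending run exhausts the move budget after a few elements and returns false.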

The average case

On average-case data, where no patterns are detected, pdqsort is effectively a quicksort that uses median-of-3 pivot selection, switching to insertion sort if the number of elements to be (recursively) sorted is small. The overhead associated with detecting the patterns for the best case is so small it lies within the error of measurement.
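Median-of-3 pivot selection can be sketched as below. Sampling the first, middle and last element is an assumption, and the helper names sort3 and median_of_3_pivot are hypothetical:

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical helper: orders the three pointed-to values so that
// *a <= *b <= *c.
template <class Iter>
void sort3(Iter a, Iter b, Iter c) {
    if (*b < *a) std::iter_swap(a, b);
    if (*c < *b) std::iter_swap(b, c);
    if (*b < *a) std::iter_swap(a, b);
}

// Hypothetical helper: samples the first, middle and last element of
// [first, last), orders them in place, and returns an iterator to the median,
// which now sits in the middle position. Requires random-access iterators.
template <class Iter>
Iter median_of_3_pivot(Iter first, Iter last) {
    assert(last - first >= 3);
    Iter mid = first + (last - first) / 2;
    sort3(first, mid, last - 1);
    return mid;
}
```

Ordering the samples in place has the side effect that the first element is no larger than the pivot and the last element is no smaller, which a partition loop can use as sentinels.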

pdqsort gets a great speedup over the traditional way of implementing quicksort when sorting large arrays (1000+ elements). This is due to a new technique described in "BlockQuicksort: How Branch Mispredictions don't affect Quicksort" by Stefan Edelkamp and Armin Weiss. In short, we bypass the branch predictor by using small buffers (entirely in L1 cache) of the indices of elements that need to be swapped. We fill these buffers in a branch-free way that's quite elegant (in pseudocode):

buffer_num = 0; buffer_max_size = 64;
for (int i = 0; i < buffer_max_size; ++i) {
    // With branch:
    if (elements[i] < pivot) { buffer[buffer_num] = i; buffer_num++; }
    // Without:
    buffer[buffer_num] = i; buffer_num += (elements[i] < pivot);
}

This is only a speedup if the comparison function itself is branchless, however. By default pdqsort will detect this if you're using C++11 or higher, the type you're sorting is arithmetic (e.g. int), and you're using either std::less or std::greater. You can explicitly request branchless partitioning by calling pdqsort_branchless instead of pdqsort.
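The two buffer-filling variants from the pseudocode above, written out as compilable C++ for int elements. The function names are hypothetical, and whether the second loop actually compiles to branch-free machine code depends on the compiler and target:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// With a branch: an offset is stored only when the comparison succeeds.
inline std::size_t collect_offsets_branchy(const int* block, std::size_t n,
                                           int pivot, unsigned char* offsets) {
    assert(n <= 256);  // offsets must fit in an unsigned char
    std::size_t count = 0;
    for (std::size_t i = 0; i < n; ++i) {
        if (block[i] < pivot) {
            offsets[count] = static_cast<unsigned char>(i);
            ++count;
        }
    }
    return count;
}

// Without a branch: the offset is always stored, and the write position only
// advances when the comparison succeeds. A non-matching offset is simply
// overwritten on the next iteration.
inline std::size_t collect_offsets_branchless(const int* block, std::size_t n,
                                              int pivot, unsigned char* offsets) {
    assert(n <= 256);
    std::size_t count = 0;
    for (std::size_t i = 0; i < n; ++i) {
        offsets[count] = static_cast<unsigned char>(i);
        count += (block[i] < pivot);  // bool converts to 0 or 1
    }
    return count;
}
```

Both functions return the number of valid entries in offsets, which must have room for n entries because the branchless version may write to a slot it later discards.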

The worst case

Quicksort naturally performs badly on inputs that form patterns, due to it being a partition-based sort. Choosing a bad pivot will result in many comparisons that give little to no progress in the sorting process. If the pattern does not get broken up, this can happen many times in a row. Worse, real-world data is filled with these patterns.

Traditionally the solution to this is to randomize the pivot selection of quicksort. While this technically still allows for a quadratic worst case, the chances of it happening are astronomically small. Later, in introsort, pivot selection is kept deterministic, instead switching to the guaranteed O(n log n) heapsort if the recursion depth becomes too big. In pdqsort we adopt a hybrid approach, (deterministically) shuffling some elements to break up patterns when we encounter a "bad" partition. If we encounter too many "bad" partitions we switch to heapsort.

Bad partitions

A bad partition occurs when the position of the pivot after partitioning is below the 12.5th (1/8th) percentile or above the 87.5th percentile, meaning the partition is highly unbalanced. When this happens we will shuffle four elements at fixed locations for both partitions. This effectively breaks up many patterns. If we encounter more than log(n) bad partitions we will switch to heapsort.
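The bad-partition test and the heapsort budget can be sketched as below. Both helper names are hypothetical, and reading log(n) as the floor of the base-2 logarithm is an assumption:

```cpp
#include <cassert>
#include <cstddef>

// True if the pivot ended up in the lowest or highest eighth of the range.
// left_size and right_size are the element counts on either side of the pivot.
inline bool is_bad_partition(std::size_t left_size, std::size_t right_size) {
    const std::size_t size = left_size + right_size + 1;  // +1 for the pivot itself
    const std::size_t eighth = size >> 3;                 // size / 8 as a bitshift
    return left_size < eighth || right_size < eighth;
}

// floor(log2(n)) for n >= 1, used here as the number of bad partitions that
// are tolerated before giving up on quicksort (base 2 is an assumption).
inline int bad_partition_budget(std::size_t n) {
    assert(n >= 1);
    int log = 0;
    while (n >>= 1) ++log;
    return log;
}
```

A caller would start with bad_partition_budget(n), decrement it each time is_bad_partition returns true, and hand the current range to heapsort once the budget is used up.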

The 1/8th percentile is not chosen arbitrarily. An upper bound of quicksort's worst-case runtime can be approximated within a constant factor by the following recurrence:

T(n, p) = n + T(p(n-1), p) + T((1-p)(n-1), p)

Where n is the number of elements, and p is the percentile of the pivot after partitioning. T(n, 1/2) is the best case for quicksort. On modern systems heapsort is profiled to be approximately 1.8 to 2 times as slow as quicksort. Choosing p such that T(n, p) / T(n, 1/2) ~= 1.9 as n gets big will ensure that we will only switch to heapsort if it would speed up the sorting. p = 1/8 is a reasonably close value and is cheap to compute on every platform using a bitshift.
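The recurrence can be evaluated numerically to see how much a fixed split at p costs relative to the ideal split at 1/2. This is a rough sketch: the base case T(n, p) = 0 for n <= 1, the use of real-valued n, and both function names are assumptions:

```cpp
#include <cassert>

// Evaluates T(n, p) = n + T(p(n-1), p) + T((1-p)(n-1), p) directly.
// Base case (an assumption): ranges of at most one element cost nothing.
inline double quicksort_cost(double n, double p) {
    if (n <= 1.0) return 0.0;
    return n + quicksort_cost(p * (n - 1.0), p)
             + quicksort_cost((1.0 - p) * (n - 1.0), p);
}

// How many times more work a quicksort that always splits at percentile p
// does compared to one that always splits exactly in half.
inline double slowdown_vs_ideal(double n, double p) {
    assert(p > 0.0 && p < 1.0);
    return quicksort_cost(n, p) / quicksort_cost(n, 0.5);
}
```

Calling slowdown_vs_ideal(1e5, 0.125) gives the cost ratio for p = 1/8 at a moderate input size, and trying other values of p shows how quickly the ratio falls as the split becomes more balanced.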

*Note that all licence references and agreements mentioned in the pdqsort README section above are relevant to that project's source code only.