How are hash collisions handled?

Asked 7/2, 2015 at 7:42 Answered 22/4, 2017 at 21:57

I've recently learned a little bit about hash values, and therefore also heard of about the problem of hash collisions.
I therefore wondered: How does one deal with those?

E.g. Swift's Dictonary uses hash values with its keys. I assume that it looks up its values via the hash. So how would Swift's Dictionary then store values for different keys, that happen to have the same hash?

Kedgeree answered 7/2, 2015 at 7:42 Comment(0)

Fundamentally, there are two major ways of handling hash collisions - separate chaining, when items with colliding hash codes are stored in a separate data structure, and open addressing, when colliding data is stored in another available bucket that was selected using some algorithm.

Both strategies have numerous sub-strategies, described in Wikipedia. The exact strategy used by a particular implementation is, not surprisingly, implementation-specific, so the authors can change it at any time for something more efficient without breaking the assumptions of their users.

A this point, the only way to find out how Swift handles collisions would be disassembling the library (that is, unless you work for Apple, and have access to the source code). Curious people did that to NSDictionary, and determined that it uses linear probing, the simplest variation of the open addressing technique.

Skunk answered 7/2, 2015 at 8:11 Comment(0)

There are two basic techniques:

Rehash using a different prime, typically N- 2 where N is the original prime, chosen such that both N and N-2 are prime.
Use a list per hash.

Or both.

Prole answered 7/2, 2015 at 8:42 Comment(0)

Swift dictionaries uses open addressing and linear probing.

Here is a link to the actual source documentation explaining everything: https://github.com/apple/swift/blob/master/stdlib/public/core/HashedCollections.swift.gyb

Maomaoism answered 22/4, 2017 at 21:57 Comment(0)

Recommended topics

Hot tags