What is the optimal algorithm for generating an unbiased random integer within a range?

I

7

18

In this StackOverflow question:

the accepted answer suggests the following formula for generating a random integer in between given min and max, with min and max being included into the range:

output = min + (rand() % (int)(max - min + 1))

But it also says that

This is still slightly biased towards lower numbers ... It's also possible to extend it so that it removes the bias.

But it doesn't explain why it's biased towards lower numbers or how to remove the bias. So, the question is: is this the most optimal approach to generation of a random integer within a (signed) range while not relying on anything fancy, just rand() function, and in case if it is optimal, how to remove the bias?

EDIT:

I've just tested the while-loop algorithm suggested by @Joey against floating-point extrapolation:

static const double s_invRandMax = 1.0/((double)RAND_MAX + 1.0);
return min + (int)(((double)(max + 1 - min))*rand()*s_invRandMax);

to see how much uniformly "balls" are "falling" into and are being distributed among a number of "buckets", one test for the floating-point extrapolation and another for the while-loop algorithm. But results turned out to be varying depending on the number of "balls" (and "buckets") so I couldn't easily pick a winner. The working code can be found at this Ideone page. For example, with 10 buckets and 100 balls the maximum deviation from the ideal probability among buckets is less for the floating-point extrapolation than for the while-loop algorithm (0.04 and 0.05 respectively) but with 1000 balls, the maximum deviation of the while-loop algorithm is lesser (0.024 and 0.011), and with 10000 balls, the floating-point extrapolation is again doing better (0.0034 and 0.0053), and so on without much of consistency. Thinking of the possibility that none of the algorithms consistently produces uniform distribution better than that of the other algorithm, makes me lean towards the floating-point extrapolation since it appears to perform faster than the while-loop algorithm. So is it fine to choose the floating-point extrapolation algorithm or my testings/conclusions are not completely correct?

Ileostomy answered 1/8, 2012 at 12:5 Comment(10)

It is biased towards lower number, since rand() produces number within certain range (defined by RAND_MAX), and RAND_MAX is usually not divisible by the divisor, so all numbers from 0 to RAND_MAX % divisor - 1 will have higher chance of being chosen. – Maelstrom 1/8, 2012 at 12:9

Lets say you had a random number generator that gives with equal chances 0, 1 or 2. If you apply a modulo 2 on it to get either 0 or 1, you can see that 0 has twice the odds of getting chosen. I guess that is what was implied by the statement you quoted. – Unbridle 1/8, 2012 at 12:9

eternallyconfuzzled.com/arts/jsw_art_rand.aspx – Eslinger 1/8, 2012 at 12:17

The input is an integer and the output is an integer - converting to floating point during the conversion isn't going to change the fact that some numbers will be more likely than others, although it might change which numbers those will be. – Preengage 1/8, 2012 at 16:40

@Desmond: See my reply here – Mariselamarish 1/8, 2012 at 18:44

I hate voting to close questions, but the original question which this one references has high-scoring answers that do explain why a simple modulus introduces bias and give at least two ways of eliminating the bias. – Pattison 1/8, 2012 at 20:58

possible duplicate of Generating random integer from a range – Pattison 1/8, 2012 at 20:58

@AdrianMcCarthy: The previous question predates C++11, which now solves this problem. – Bollinger 3/8, 2012 at 15:6

@MSalters: The other question has high-scoring answers that recommend appropriate C++11 library classes and functions as well pointers to their Boost predecessors. In the long term, it's more useful to update answers to old questions than to start duplicate ones. – Pattison 3/8, 2012 at 15:55

I didn't see the edit so sorry if this comment is over a year late. If you really want to test the distribution, replace the random number generator with a simple sequence generator from 0 to RAND_MAX and make the number of balls RAND_MAX+1 times the number of buckets. See ideone.com/SaagxZ for a modification of my original test. You really need to use a much larger number of balls to test with true random numbers. – Preengage 30/9, 2013 at 20:5

P

14

The problem occurs when the number of outputs from the random number generator (RAND_MAX+1) is not evenly divisible by the desired range (max-min+1). Since there will be a consistent mapping from a random number to an output, some outputs will be mapped to more random numbers than others. This is regardless of how the mapping is done - you can use modulo, division, conversion to floating point, whatever voodoo you can come up with, the basic problem remains.

The magnitude of the problem is very small, and undemanding applications can generally get away with ignoring it. The smaller the range and the larger RAND_MAX is, the less pronounced the effect will be.

I took your example program and tweaked it a bit. First I created a special version of rand that only has a range of 0-255, to better demonstrate the effect. I made a few tweaks to rangeRandomAlg2. Finally I changed the number of "balls" to 1000000 to improve the consistency. You can see the results here: http://ideone.com/4P4HY

Notice that the floating-point version produces two tightly grouped probabilities, near either 0.101 or 0.097, nothing in between. This is the bias in action.

I think calling this "Java's algorithm" is a bit misleading - I'm sure it's much older than Java.

int rangeRandomAlg2 (int min, int max)
{
    int n = max - min + 1;
    int remainder = RAND_MAX % n;
    int x;
    do
    {
        x = rand();
    } while (x >= RAND_MAX - remainder);
    return min + x % n;
}

Preengage answered 1/8, 2012 at 20:6 Comment(10)

Well, it was when reading Java's source that I came across their implementation (which I still find quite nice, despite a bit non-obvious). Of course, that's not what is written in any of the answers here, as the naïve check as you copied from my answer is much easier to understand. – Parthenogenesis 2/8, 2012 at 7:20

I wonder if it should properly handle the case when n is greater than RAND_MAX, so far it's an infinity loop. – Ileostomy 2/8, 2012 at 8:33

Hm, this doesn't ever not terminate (or takes a very long time), with all the possible PRNGs? I suppose the RNG would have to eventually return all values in [0, RAND_MAX] for this to always work. – Luanneluanni 2/8, 2012 at 8:49

@DesmondHume, no it doesn't handle the case where n is greater than RAND_MAX - how could you hope to get a uniform distribution when some of the values aren't possible? You could put a check in the code for that condition. – Preengage 2/8, 2012 at 13:7

@AmbrozBizjak, the worst case is when n is RAND_MAX/2+1 and half of the random numbers will be thrown away on average. Worst-case can be much worse of course, but it should drop back to the mean if run enough times. A RNG that didn't produce all possible values in its range would fail most definitions of random. – Preengage 2/8, 2012 at 13:11

Worst-case n gives an average number of times around the loop of exactly 2, as it's the infinite sum of probabilities 1 + 1/2 + 1/4 + 1/8 + 1/16 + 1/32 + ... However, that's the average number of times around the loop. The actual number of times around the loop is nondeterministic. You might actually go 50 times around the loop, but it's exceedingly rare. – Kellie 30/7, 2015 at 4:44

@DesmondHume when I first answered this question I hadn't seen "Lost" yet. Your user name is much more interesting now. – Preengage 31/5, 2021 at 15:45

@MarkRansom, the number of possible outputs from rand() is equal to RAND_MAX + 1, i.e. rand() generates pseudo-random numbers in the range 0 to RAND_MAX, both inclusive. So, when min == 0 and max == RAND_MAX, signed integer overflow occurs (on the systems in which RAND_MAX == INT_MAX), invoking undefined behaviour, even though the arguments min == 0 and max == RAND_MAX are perfectly valid. On the systems in which RAND_MAX < INT_MAX, the function still goes into an infinite loop. – Candide 24/4, 2022 at 8:12

I think that using size_t values by type-casting and using remainder = (RAND_MAX + 1) % n and while (x >= (RAND_MAX + 1) - remainder) should take care of this edge case. – Candide 24/4, 2022 at 8:16

@KushagrJaiswal you're right, max==INT_MAX is going to be a problem, but see my response to Desmond Hume. I don't see how it's possible to get into an infinite loop otherwise. I'm most familiar with the Microsoft compiler where RAND_MAX is infamously only 32767 and it has never been a problem. – Preengage 25/4, 2022 at 3:24

P

16

The problem is that you're doing a modulo operation. This would be no problem if RAND_MAX would be evenly divisible by your modulus, but usually that is not the case. As a very contrived example, assume RAND_MAX to be 11 and your modulus to be 3. You'll get the following possible random numbers and the following resulting remainders:

0 1 2 3 4 5 6 7 8 9 10
0 1 2 0 1 2 0 1 2 0 1

As you can see, 0 and 1 are slightly more probable than 2.

One option to solve this is rejection sampling: By disallowing the numbers 9 and 10 above you can cause the resulting distribution to be uniform again. The tricky part is figuring out how to do so efficiently. A very nice example (one that took me two days to understand why it works) can be found in Java's java.util.Random.nextInt(int) method.

The reason why Java's algorithm is a little tricky is that they avoid slow operations like multiplication and division for the check. If you don't care too much you can also do it the naïve way:

int n = (int)(max - min + 1);
int remainder = RAND_MAX % n;
int x, output;
do {
  x = rand();
  output = x % n;
} while (x >= RAND_MAX - remainder);
return min + output;

EDIT: Corrected a fencepost error in above code, now it works as it should. I also created a little sample program (C#; taking a uniform PRNG for numbers between 0 and 15 and constructing a PRNG for numbers between 0 and 6 from it via various ways):

using System;

class Rand {
    static Random r = new Random();

    static int Rand16() {
        return r.Next(16);
    }

    static int Rand7Naive() {
        return Rand16() % 7;
    }

    static int Rand7Float() {
        return (int)(Rand16() / 16.0 * 7);
    }

    // corrected
    static int Rand7RejectionNaive() {
        int n = 7, remainder = 16 % n, x, output;
        do {
            x = Rand16();
            output = x % n;
        } while (x >= 16 - remainder);
        return output;
    }

    // adapted to fit the constraints of this example
    static int Rand7RejectionJava() {
        int n = 7, x, output;
        do {
            x = Rand16();
            output = x % n;
        } while (x - output + 6 > 15);
        return output;
    }

    static void Test(Func<int> rand, string name) {
        var buckets = new int[7];
        for (int i = 0; i < 10000000; i++) buckets[rand()]++;
        Console.WriteLine(name);
        for (int i = 0; i < 7; i++) Console.WriteLine("{0}\t{1}", i, buckets[i]);
    }

    static void Main() {
        Test(Rand7Naive, "Rand7Naive");
        Test(Rand7Float, "Rand7Float");
        Test(Rand7RejectionNaive, "Rand7RejectionNaive");
    }
}

The result is as follows (pasted into Excel and added conditional coloring of cells so that differences are more apparent):

enter image description here

Now that I fixed my mistake in above rejection sampling it works as it should (before it would bias 0). As you can see, the float method isn't perfect at all, it just distributes the biased numbers differently.

Parthenogenesis answered 1/8, 2012 at 12:8 Comment(10)

Do you think that an obvious floating-point-based approach aimed to map the RAND_MAX range to a custom range (almost) without branching would be less CPU efficient than multiple divisions (%) and branches implied by the while loop? – Ileostomy 1/8, 2012 at 12:32

If you use (double)rand() / RAND_MAX * n instead you get the same problem, it's just that it's not the lower numbers that are more likely but that you distribute the bias somewhat over the whole range. You don't remove the bias at all using that method. You still have the problem of evenly fitting 10 input numbers into 3 output numbers, which isn't possible. – Parthenogenesis 1/8, 2012 at 12:35

I do not understand the phrase distribute the bias somewhat over the whole range -- appears self-contradictory. Also, ITYM ... evenly fitting *11* input numbers into 3 output numbers ... – Aleida 1/8, 2012 at 13:39

Instead of having the more biased numbers up front as with the modulus you might bias 0 and 2 for example. Or 1 and 2. It's a little unpredictable because it depends on how the floating point numbers look like and where they switch to another number. You still have the bias though. And yes, it's 11 numbers. Forgive me a small typo on a comment I can no longer edit. The point stays the same. – Parthenogenesis 1/8, 2012 at 13:53

@Parthenogenesis Please see the edit to my post regarding some tests I've run using this code. – Ileostomy 1/8, 2012 at 16:28

@Јοеу Great answer, thanks. I have been trying to wrap my head around nextInt(n) for two days now, up until I stumbled upon this thread and your answer. One comment regarding your benchmark; there really isn't much difference between the naïve and java implementation of the rejection algorithm, particularly no more/less division/multiplication operations. Or am I missing something? – Bedding 31/1, 2014 at 13:23

@posdef: There isn't. It's just that Java's variant needs quite a bit of thinking to understand what it does, while the naïve version is more straightforward (and should be slightly slower, I guess). – Parthenogenesis 31/1, 2014 at 14:44

@Јοеу but since there really isn't a significant difference between them, I am not sure why you'd expect it to be slower – Bedding 31/1, 2014 at 14:47

@posdef: Just re-read; ok, in this case there's probably no difference at all. The original Java code used bit masks to achieve the comparison. Those probably got lost in the attempt to reduce the range to 7. My code uses an explicit modulus operation per call (which somehow got copied over into the port of the Java method, I should probably fix that) which isn't present in the original code. Since the loop in most cases runs only once the additional overhead per call can make a difference. But not much (I'd also guess the code is much older, when Java was much slower). – Parthenogenesis 31/1, 2014 at 16:49

In Rand7RejectionNaive, what in Heaven's name is that output variable for? There is absolutely no need for it — and it's only slowing things down. All that needs to happen at the end is return x % n; — it doesn't need to compute anything % n inside the loop. – Kellie 30/7, 2015 at 4:49

P

14

The problem occurs when the number of outputs from the random number generator (RAND_MAX+1) is not evenly divisible by the desired range (max-min+1). Since there will be a consistent mapping from a random number to an output, some outputs will be mapped to more random numbers than others. This is regardless of how the mapping is done - you can use modulo, division, conversion to floating point, whatever voodoo you can come up with, the basic problem remains.

The magnitude of the problem is very small, and undemanding applications can generally get away with ignoring it. The smaller the range and the larger RAND_MAX is, the less pronounced the effect will be.

I took your example program and tweaked it a bit. First I created a special version of rand that only has a range of 0-255, to better demonstrate the effect. I made a few tweaks to rangeRandomAlg2. Finally I changed the number of "balls" to 1000000 to improve the consistency. You can see the results here: http://ideone.com/4P4HY

Notice that the floating-point version produces two tightly grouped probabilities, near either 0.101 or 0.097, nothing in between. This is the bias in action.

I think calling this "Java's algorithm" is a bit misleading - I'm sure it's much older than Java.

int rangeRandomAlg2 (int min, int max)
{
    int n = max - min + 1;
    int remainder = RAND_MAX % n;
    int x;
    do
    {
        x = rand();
    } while (x >= RAND_MAX - remainder);
    return min + x % n;
}