## What is the "best" way to Generate a Bernoulli Process?

### Derek O'Connor (view profile)

on 3 Mar 2011

A Bernoulli Process is a sequence of iid (independent, identically-distributed) random variables x1,x2,...,xi,..., where the probability mass function of xi is

``` xi = S(success) with probability p,
xi = F(failure) with probability q,
with p+q = 1.```

Depending on the application, (S,F) may be (1,0), (H,T), (true, false), etc.

Here are two obvious ways to generate a (1,0) Bernoulli process in Matlab:

1. Vectorized Form: bernp1 = rand(n,1) <= p; Process vector bernp1
2. Component Form: for k = 1:n, bernp2 = rand <= p; Process scalar bernp2; end;

The "best" answer will depend on the application and computer at hand.

If the computer has lots of memory then the vectorized version is the simplest, and may be the fastest.

If "Process vector or scalar" takes a lot of time then the time to generate bernp by either method may be negligible.

If a fixed amount of memory is available for bernp then we must use the component form or a mini-vectorized form.

Using rand to generate 1 random bit seems wasteful, given that rand returns a double precision floating point number which has about 53 random bits. Can we use rand more efficiently?

My interest in the Bernoulli process was sparked by re-reading Feller Volume 1. At the bottom of page 18 Feller says: "In this volume we shall consider only discrete sample spaces". A closer examination shows that Feller derives (nearly) everything in this book from the Bernoulli process.

Derek O'Connor

## Products

No products are associated with this question.

### Andrew Newell (view profile)

on 3 Mar 2011

Interesting question. The number of wasted bits depends on the number of bits you need to define p. If p = 1/sqrt(3), no wasted bits! Any algorithm trying to beat your algorithm 1 would require a lot of special cases, and would be competing with a built-in algorithm.

The Statistics Toolbox uses a variation on your first algorithm.

( edited to expand on answer )

Derek O'Connor

### Derek O'Connor (view profile)

on 4 Mar 2011

Andrew,

The value of p would be represented by a double precision f.p. number, i.e., a 64-bit string. This same string is used repeatedly for each rand generated.

Derek O'Connor

Andrew Newell

### Andrew Newell (view profile)

on 4 Mar 2011

Each time you generate a new random number, you need to compare it with p, and you don't know in advance how many bits you'll need to compare. It might be all 53.

### Walter Roberson (view profile)

on 4 Mar 2011

Yes, you can make better use of rand.

If the probability is expressible in lowest terms with a denominator of 2^n, n <= 26. With the 2^n denominator, you take n consecutive bits at a time out of the float representation.

For other probabilities, you use Arithmetic Coding

Andrew Newell

### Andrew Newell (view profile)

on 4 Mar 2011

I don't know what you mean. Float representation of what? And what do you do with the bits?

#### Join the 15-year community celebration.

Play games and win prizes!

MATLAB and Simulink resources for Arduino, LEGO, and Raspberry Pi