MATLAB Answers


What is the "best" way to Generate a Bernoulli Process?

Asked by Derek O'Connor on 3 Mar 2011

A Bernoulli Process is a sequence of iid (independent, identically-distributed) random variables x1,x2,...,xi,..., where the probability mass function of xi is

 xi = S(success) with probability p, 
 xi = F(failure) with probability q,
 with p+q = 1.

Depending on the application, (S,F) may be (1,0), (H,T), (true, false), etc.

Here are two obvious ways to generate a (1,0) Bernoulli process in Matlab:

  1. Vectorized Form: bernp1 = rand(n,1) <= p; Process vector bernp1
  2. Component Form: for k = 1:n, bernp2 = rand <= p; Process scalar bernp2; end;

The "best" answer will depend on the application and computer at hand.

If the computer has lots of memory then the vectorized version is the simplest, and may be the fastest.

If "Process vector or scalar" takes a lot of time then the time to generate bernp by either method may be negligible.

If a fixed amount of memory is available for bernp then we must use the component form or a mini-vectorized form.

Using rand to generate 1 random bit seems wasteful, given that rand returns a double precision floating point number which has about 53 random bits. Can we use rand more efficiently?

My interest in the Bernoulli process was sparked by re-reading Feller Volume 1. At the bottom of page 18 Feller says: "In this volume we shall consider only discrete sample spaces". A closer examination shows that Feller derives (nearly) everything in this book from the Bernoulli process.

Derek O'Connor



No products are associated with this question.

2 Answers

Answer by Andrew Newell
on 3 Mar 2011

Interesting question. The number of wasted bits depends on the number of bits you need to define p. If p = 1/sqrt(3), no wasted bits! Any algorithm trying to beat your algorithm 1 would require a lot of special cases, and would be competing with a built-in algorithm.

The Statistics Toolbox uses a variation on your first algorithm.

( edited to expand on answer )



I don't understand the first part of your answer.

The value of p would be represented by a double precision f.p. number, i.e., a 64-bit string. This same string is used repeatedly for each rand generated.

Derek O'Connor

Each time you generate a new random number, you need to compare it with p, and you don't know in advance how many bits you'll need to compare. It might be all 53.

Answer by Walter Roberson
on 4 Mar 2011

Yes, you can make better use of rand.

If the probability is expressible in lowest terms with a denominator of 2^n, n <= 26. With the 2^n denominator, you take n consecutive bits at a time out of the float representation.

For other probabilities, you use Arithmetic Coding

  1 Comment

I don't know what you mean. Float representation of what? And what do you do with the bits?

Join the 15-year community celebration.

Play games and win prizes!

Learn more
Discover MakerZone

MATLAB and Simulink resources for Arduino, LEGO, and Raspberry Pi

Learn more

Discover what MATLAB® can do for your career.

Opportunities for recent engineering grads.

Apply Today

MATLAB Academy

New to MATLAB?

Learn MATLAB today!