Multinomial Distribution

Last Updated :

21 Aug, 2024

Blog Author :

N/A

Edited by :

Ashish Kumar Srivastav

Reviewed by :

Dheeraj Vaidya

Table Of Contents

arrow

Multinomial Distribution Definition

Multinomial distribution is a multivariate version of the binomial distribution. It is the probability distribution of the outcomes from a multinomial experiment. It is used in the case of an experiment that has a possibility of resulting in more than two possible outcomes.

Multinomial Distribution definition

The simplest technique to construct a multinomial random variable is to replicate an experiment (by drawing n uniform random numbers and assigning them to certain bins based on the cumulative value of the p vector) to produce a multinomial random variable. In instructional statistics, this distribution is put to various uses.

  • The multinomial distribution represents the likelihood of receiving a certain set of counts where each trial has a discrete number of possible outcomes.
  • The most direct goodness-of-fit test is based on the multinomial distribution of response patterns.
  • Its applications and use cases frequently involve the evaluation of the likelihood of a set of outcomes that are usually more than two or at least two.
  • It is a type of probability distribution of which binomial distribution is a subtype.

Multinomial Distribution Explained

The multinomial distribution is a multivariate discrete distribution. Multinomial experiments include the following characteristics:

  • The experiment comprises of n repeated trials.
  • Its results have statistical significance.
  • There are a finite number of possible outcomes for each trial, and the likelihood of any event occurring is constant throughout the experiment.
  • The results of one experiment do not influence the results of the others.

Assuming the model is valid, the most straightforward way to determine a model's fit is to use the multinomial distribution of response patterns. Accordingly, there are 'nm' potential response patterns ranging from (l,...,l) to (m,...,m) for n items with m response categories for each item. Hence, all response patterns have the same probability of occurring if u is a typical pattern.

Thus, to estimate the frequencies of the response patterns, use multinomial distribution with parameters n and actual probability for all the response patterns. According to the multivariate central limit theorem, the multivariate normal distribution can approximate the distribution for large sample sizes.

While considering the entire data, the distribution of the observations has a multinomial shape for observations from different Poisson distributions. Probabilities in the multinomial distribution are based on the Poisson mean for each cell multiplied by all Poisson mean values.

Multinomial vs Binomial Distribution

A multinomial experiment has a subtype known as a binomial one. In this regard, there is one major distinction. Accordingly, in a binomial experiment, there are only two possibilities for each trial. At the same time, each experiment in a multinomial trial has the potential difference for two or more different results.

For instance, you perform n times an experiment with K outcomes. Then, you denote by Xi the number of times you obtain the i-th outcome. In that case, the random vector X is defined as  X = is a multinomial random vector.

The multinomial distribution is the generalization of the binomial distribution to the case of n repeated trials where there are more than two possible outcomes for each. If an event may occur with k possible outcomes, each with a probability, pi(i = 1,1,…,k), with ∑k(i=1)pi = 1, and if ri is the number of the outcome associated with 'pi' occurs, then the random variables ri (i = 1,2,…,k-1) have a multinomial probability defined as

f (r1,r2,....,rk-1) = n! ∏ki=1 piri/∏ki=1 ri!, ri = 0,1,2,....,n.

Note that each of the ri ranges from 0 to n inclusive with only (k-1) variables because of the linear constraint:

ki=1 ri = n

Just as the binomial distribution tends to the univariate normal, so does the multinomial distribution tend to limit to the multivariate normal distribution.

Formula

Suppose a random variable X has a multinomial distribution. In that case, the following multinomial distribution calculator calculates the likelihood that event 1 occurs exactly x1 times, event 2 occurs exactly x2 times, event 3 occurs exactly x3 times, and so on. Hence following is the multinomial distribution formula:

Probability = n!*(p1x1 * p2x2 * ... * pkxk)/(x1!*x2!*...*xk!)

Where:

  • n: the total number of events
  • x1, x2, xk: the number of occurrences of event 1, event 2, and event k, respectively.
  • p1, p2, pk: the likelihood that results in 1, 2, and k happen, respectively, in a trail.

Example

Let us have a look at the multinomial distribution example to understand the concept better:

Rebecca, a portfolio manager, utilizes it to assess the probability of her client's investment. For 60% of the time, she chooses a small-cap index to outperform a large-cap index. Whereas for 40% of the time, Rebecca opts for a large-cap index to outperform a small-cap index. For 10% of the time, the indexes may have the same or approximate return.

Since the trial may last a full year of trading days in such cases, Rebecca uses actual market data to validate the outcomes. In scenarios where the likelihood of this set of occurrences is high, her clients may tempt to overinvest in the small-cap index.

Likewise, Neil, a financial analyst, uses this method to evaluate the likelihood of events, like potential quarterly sales for a business when its competitors post lower-than-expected profits.

Frequently Asked Questions (FAQs)

What is multinomial distribution?

The multinomial distribution is used to express the chance of receiving a particular number of counts for k distinct outcomes where the likelihood of each occurrence is known in advance.

What are the properties of multinomial distribution?

The following are multinomial distribution properties: The experiment consists of repeated n trials. Each experiment has a limited number of outcomes. The likelihood of a specific outcome occurring in everyone's trial remains static.

Multinomial distribution vs. normal distribution

Multinomials are employed when order doesn't matter for a finite number of classes/groups. A normal distribution is used for continuous data, which can take on infinite values if recorded accurately (though, in practice, we will round to a finite subset).

How does a multinomial distribution differ from a binomial distribution?

It is nearly identical to a binomial experiment, except for one major difference: a binomial experiment can only yield two results, but a multinomial experiment can yield several results. For example, suppose you roll a die twelve times to see what number you come up with each time. The results of a binomial experiment will be distributed in a binomial manner.

This article is a guide to Multinomial Distribution & its definition. We explain its properties, formula, calculator, comparison with binomial, & example. You can learn more from the following articles -