World Library  
Flag as Inappropriate
Email this Article

Generalized Dirichlet distribution

In statistics, the generalized Dirichlet distribution (GD) is a generalization of the Dirichlet distribution with a more general covariance structure and almost twice the number of parameters. Random variables with a GD distribution are neutral but not completely neutral.[1]

The density function of p_1,\ldots,p_{k-1} is

\left[ \prod_{i=1}^{k-1}B(a_i,b_i)\right]^{-1} p_k^{b_{k-1}-1} \prod_{i=1}^{k-1}\left[ p_i^{a_i-1}\left(\sum_{j=i}^kp_j\right)^{b_{i-1}-(a_i+b_i)}\right]

where we define p_k= 1- \sum_{i=1}^{k-1}p_i. Here B(x,y) denotes the Beta function. This reduces to the standard Dirichlet distribution if b_{i-1}=a_i+b_i for 2\leqslant i\leqslant k-1 (b_0 is arbitrary).

For example, if k=4, then the density function of p_1,p_2,p_3 is

\left[\prod_{i=1}^{3}B(a_i,b_i)\right]^{-1} p_1^{a_1-1}p_2^{a_2-1}p_3^{a_3-1}p_4^{b_3-1}\left(p_2+p_3+p_4\right)^{b_1-\left(a_2+b_2\right)}\left(p_3+p_4\right)^{b_2-\left(a_3+b_3\right)}

where p_1+p_2+p_3<1 and p_4=1-p_1-p_2-p_3.

Connor and Mosimann define the PDF as they did for the following reason. Define random variables z_1,\ldots,z_{k-1} with z_1=p_1, z_2=p_2/\left(1-p_1\right), z_3=p_3/\left(1-(p_1+p_2)\right),\ldots,z_i = p_i/\left(1-p_1+\cdots+p_{i-1}\right). Then p_1,\ldots,p_k have the generalized Dirichlet distribution as parametrized above, if the z_i are iid beta with parameters a_i,b_i, i=1,\ldots,k-1.


  • Alternative form given by Wong 1
  • General moment function 2
  • Reduction to standard Dirichlet distribution 3
  • Bayesian analysis 4
  • Sampling experiment 5
  • See also 6
  • References 7

Alternative form given by Wong

Wong [2] gives the slightly more concise form for x_1+\cdots +x_k\leqslant 1


where \gamma_j=\beta_j-\alpha_{j+1}-\beta_{j+1} for 1\leqslant j\leqslant k-1 and \gamma_k=\beta_k-1. Note that Wong defines a distribution over a k dimensional space (implicitly defining x_{k+1}=1-\sum_{i=1}^kx_i) while Connor and Mosiman use a k-1 dimensional space with x_k=1-\sum_{i=1}^{k-1}x_i.

General moment function

If X=\left(X_1,\ldots,X_k\right)\sim GD_k\left(\alpha_1,\ldots,\alpha_k;\beta_1,\ldots,\beta_k\right), then

E\left[X_1^{r_1}X_2^{r_2}\cdots X_k^{r_k}\right]= \prod_{j=1}^k \frac{ \Gamma\left(\alpha_j+\beta_j\right) \Gamma\left(\alpha_j+r_j\right) \Gamma\left(\beta_j+\delta_j\right) }{ \Gamma\left(\alpha_j\right) \Gamma\left(\beta_j\right) \Gamma\left(\alpha_j+\beta_j+r_j+\delta_j\right) }

where \delta_j=r_{j+1}+r_{j+2}+\cdots +r_k for j=1,2,\cdots,k-1 and \delta_k=0. Thus


Reduction to standard Dirichlet distribution

As stated above, if b_{i-1}=a_i+b_i for 2\leqslant i\leqslant k then the distribution reduces to a standard Dirichlet. This condition is different from the usual case, in which setting the additional parameters of the generalized distribution to zero results in the original distribution. However, in the case of the GDD, this results in a very complicated density function.

Bayesian analysis

Suppose X=\left(X_1,\ldots,X_k\right)\sim GD_k\left(\alpha_1,\ldots,\alpha_k;\beta_1,\ldots,\beta_k\right) is generalized Dirichlet, and that Y|X is multinomial with n trials (here Y=\left(Y_1,\ldots,Y_k\right)). Writing Y_j=y_j for 1\leqslant j\leqslant k and y_{k+1}=n-\sum_{i=1}^ky_i the joint posterior of X|Y is a generalized Dirichlet distribution with

X|Y\sim GD_k\left( {\alpha'}_1,\ldots,{\alpha'}_k; {\beta'}_1,\ldots,{\beta'}_k \right)

where {\alpha'}_j=\alpha_j+y_j and {\beta'}_j=\beta_j+\sum_{i=j+1}^{k+1}y_i for 1\leqslant k.

Sampling experiment

Wong gives the following system as an example of how the Dirichlet and generalized Dirichlet distributions differ. He posits that a large urn contains balls of k+1 different colours. The proportion of each colour is unknown. Write X=(X_1,\ldots,X_k) for the proportion of the balls with colour j in the urn.

Experiment 1. Analyst 1 believes that X\sim D(\alpha_1,\ldots,\alpha_k,\alpha_{k+1}) (ie, X is Dirichlet with parameters \alpha_i). The analyst then makes k+1 glass boxes and puts \alpha_i marbles of colour i in box i (it is assumed that the \alpha_i are integers \geq 1). Then analyst 1 draws a ball from the urn, observes its colour (say colour j) and puts it in box j. He can identify the correct box because they are transparent and the colours of the marbles within are visible. The process continues until n balls have been drawn. The posterior distribution is then Dirichlet with parameters being the number of marbles in each box.

Experiment 2. Analyst 2 believes that X follows a generalized Dirichlet distribution: X\sim GD(\alpha_1,\ldots,\alpha_k;\beta_1,\ldots,\beta_k). All parameters are again assumed to be positive integers. The analyst makes k+1 wooden boxes. The boxes have two areas: one for balls and one for marbles. The balls are coloured but the marbles are not coloured. Then for j=1,\ldots,k, he puts \alpha_j balls of colour j, and \beta_j marbles, in to box j. He then puts a ball of colour k+1 in box k+1. The analyst then draws a ball from the urn. Because the boxes are wood, the analyst cannot tell which box to put the ball in (as he could in experiment 1 above); he also has a poor memory and cannot remember which box contains which colour balls. He has to discover which box is the correct one to put the ball in. He does this by opening box 1 and comparing the balls in it to the drawn ball. If the colours differ, the box is the wrong one. The analyst puts a marble (sic) in box 1 and proceeds to box 2. He repeats the process until the balls in the box match the drawn ball, at which point he puts the ball (sic) in the box with the other balls of matching colour. The analyst then draws another ball from the urn and repeats until n balls are drawn. The posterior is then generalized Dirichlet with parameters \alpha being the number of balls, and \beta the number of marbles, in each box.

Note that in experiment 2, changing the order of the boxes has a non-trivial effect, unlike experiment 1.

See also


  1. ^ R. J. Connor and J. E. Mosiman 1969. Concepts of independence for proportions with a generalization of the Dirichlet distribution. Journal of the American Statistical Association, volume 64, pp194--206
  2. ^ T.-T. Wong 1998. Generalized Dirichlet distribution in Bayesian analysis. Applied Mathematics and Computation, volume 97, pp165-181
This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.

Copyright © World Library Foundation. All rights reserved. eBooks from Project Gutenberg are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.