Allele frequencies

Allele frequency or gene frequency is the proportion of a particular allele (variant of a gene) among all allele copies being considered.

Formal definition: Allele or gene frequency is the percentage of all alleles at a given locus in a population gene pool represented by a particular allele.[1][2]

In other words, it is the number of copies of a particular allele divided by the number of copies of all alleles at the genetic place (locus) in a population. It is usually expressed as a percentage. In population genetics, allele frequencies are used to depict the amount of genetic diversity at the individual, population, and species level. It is also the relative proportion of all alleles of a gene that are of a designated type.

Given the following:

  1. a particular locus on a chromosome and the gene occupying that locus
  2. a population of N individuals carrying n loci in each of their somatic cells (e.g. two loci in the cells of diploid species, which contain two sets of chromosomes)
  3. different alleles of the gene exist
  4. one allele exists in a copies

then the allele frequency is the fraction or percentage of all the occurrences of that locus that is occupied by a given allele and the frequency of one of the alleles is a/(n*N).

For example, if the frequency of an allele is 20% in a given population, then among population members, one in five chromosomes will carry that allele. Four out of five will be occupied by other variant(s) of the gene.

Note that for diploid genes the fraction of individuals that carry this allele may be nearly two in five (36%). The reason for this is that if the allele distributes randomly, then the binomial theorem will apply: 32% of the population will be heterozygous for the allele (i.e. carry one copy of that allele and one copy of another in each somatic cell) and 4% will be homozygous (carrying two copies of the allele). Together, this means that 36% of diploid individuals would be expected to carry an allele that has a frequency of 20%. However, alleles distribute randomly only under certain assumptions, including the absence of selection. When these conditions apply, a population is said to be in Hardy–Weinberg equilibrium.

The frequencies of all the alleles of a given gene often are graphed together as an allele frequency distribution histogram, or allele frequency spectrum. Population genetics studies the different "forces" that might lead to changes in the distribution and frequencies of alleles—in other words, to evolution. Besides selection, these forces include genetic drift, mutation and migration.

Calculation of allele frequencies from genotype frequencies

The actual frequency calculations depend on the ploidy of the species for autosomal genes.


The frequency of an allele a is the quotient of the number of copies of the allele and the population or sample size.


If f(AA), f(Aa), and f(aa) are the frequencies of the three genotypes at a locus with two alleles, then the frequency p of the A-allele and the frequency q of the a-allele are obtained by counting alleles. Because each homozygote AA consists only of A-alleles, and because half of the alleles of each heterozygote Aa are A-alleles, the total frequency p of A-alleles in the population is calculated as

p=f(\mathbf{AA})+ \frac{1}{2}f(\mathbf{Aa})= \mbox{frequency of A}

Similarly, the frequency q of the a allele is given by

q=f(\mathbf{aa})+ \frac{1}{2}f(\mathbf{Aa})= \mbox{frequency of a}

It would be expected that p and q sum to 1, since they are the frequencies of the only two alleles present. Indeed they do:


and from this we get:

q=1-p and p=1-q

If there are more than two different allelic forms, the frequency for each allele is simply the frequency of its homozygote plus half the sum of the frequencies for all the heterozygotes in which it appears. Allele frequency can always be calculated from genotype frequency, whereas the reverse requires that the Hardy–Weinberg conditions of random mating apply. This is partly due to the three genotype frequencies and the two allele frequencies. It is easier to reduce from three to two.

An example population

Consider a population of ten individuals and a given locus with two possible alleles, A and a. Suppose that the genotypes of the individuals are as follows:

AA, Aa, AA, aa, Aa, AA, AA, Aa, Aa, and AA

Then the allele frequencies of allele A and allele a are:


so if a locus is chosen at random there is a 70% chance it will be the A allele, and a 30% chance it will be the a allele.


Allele frequency dynamics

The dynamics of allele and gene frequencies are affected by several factors such as migration, mutation, drift, population size, mating and others. The Hardy-Weinberg law describes an equilibrium for diploids genes. See details under population genetics.

See also


External links

  • ALFRED database
  • - Earth Human STR Allele Frecuencies Database
  • VWA 17 Allele Frequency in Human Population (Poster)
  • Allele Frequencies in Worldwide Populations

This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.