A common problem in this field is finding the probability of certain genotype after a given number of years. For example, suppose we want to study the fractions of the three genotypes in the generation of cows in terms of the initial genotype fractioins. (See the following example.)
Professor Vetar, at UC Davis, discovers that cows with genotype AA can produce a better quality milk than other genotypes. Professor Vetar is interested in discovering the fraction of offspring cows with genotype AA. If professor Vetar choses to cross only genotype with other genotypes, what are the probabilities of the offspring being AA, Aa, or aa ?
To analyse the problem,we will consider three cases:
First assume, the crossing of AA with AA. This will always gives the genotype AA, therefore the probabilities of an offspring to be AA, Aa, and aa respectively are equal to 1, 0, and 0.
Second, assume crossing of Aa with AA. The offspring will have half chance to be of genotype AA and half chance the genotype Aa, therefore the probabilities of AA, Aa, and aa repectively are 1/2, 1/2, and 0.
Third, consider crossing of aa with AA. This will always results in genotype Aa. Therefore, the probabilities of genotypes AA, Aa, and aa repectively are 0, 1, and 0, respectively.
The following matrix is the result of the pervious obseration:
This matrix can be veiwed as the following table:
Assume that the initial prpulation of cows made up of an equal number of each genotype, therefore, the initial distribution vector is given by :
After another year passed by, the distribution vector can be obtained as follow:
For any positive integer , that is years later,
Now if gets larger and larger, the matrix approaches to
Therefore will approach to
Suppose someone is intested to see the number of cows with genotype AA, Aa, and aa after 20 geneations geneation. One way to answer this question is to compute which may result in computaional error during the matrix multicaption. Another approach using of diagonalization reduces computation. If the matrix can be writen as product of an invertible matrix a diagonal matrix and inverse of , that is , then computation wil be much simples and therefore much less comoutational error. This is mainly because of the following fact:
Not evey matrix is diagonalizable. For to be diagonalizable it needs to have -linearly independent eigenvectors. The matrix is formed by writing these eigenvectors as columns of . To find eigenvectors we will first find the eigenvalues.
For the matrix above the eigenvalues are
and corresponding eigenvectors are:
The diagonal matrix
and
Therefore,
as n approach infinity..