Calculate allele frequencies using Hardy-Weinberg principles. Analyze two-allele and three-allele systems, perform chi-square tests for HWE, compute carrier frequencies, and handle X-linked genes with step-by-step solutions.
Problem: In a population of 10,000 people, 1 in 2,500 has cystic fibrosis (an autosomal recessive disorder). What are the allele frequencies and carrier frequency?
Solution: q² = 1/2500 = 0.0004
q = √0.0004 = 0.02 (2% recessive allele frequency)
p = 1 − q = 0.98 (98% dominant allele frequency)
Carrier frequency (2pq) = 2 × 0.98 × 0.02 = 0.0392 ≈ 3.9%
This means about 392 out of 10,000 people are carriers (heterozygous) for cystic fibrosis, even though only 4 have the disease.
Problem: In a population of 100 individuals, there are 36 AA, 48 Aa, and 16 aa. Calculate allele frequencies and test for HWE.
Solution:
Total alleles = 2 × 100 = 200
Count of A = 2×36 + 48 = 120 → p = 120/200 = 0.60
Count of a = 2×16 + 48 = 80 → q = 80/200 = 0.40
Expected: AA = p² × 100 = 36, Aa = 2pq × 100 = 48, aa = q² × 100 = 16
χ² = (36−36)²/36 + (48−48)²/48 + (16−16)²/16 = 0.00 (Population in HWE)
Problem: Red-green color blindness is an X-linked recessive trait. In a population, 8 out of 50 males are color blind. Estimate the frequency of the recessive allele.
Solution: In males, the phenotype directly reflects the genotype (hemizygous).
q (a) = 8/50 = 0.16 (16%)
p (A) = 1 − 0.16 = 0.84 (84%)
Expected female carrier frequency (2pq) = 2 × 0.84 × 0.16 = 26.9%
For X-linked traits, allele frequency in males equals phenotype frequency because males have only one X chromosome.
Problem: In a population of 200 individuals, observed counts are: AA = 80, Aa = 40, aa = 80. Is the population in HWE?
Solution: p = (2×80 + 40) / 400 = 0.50, q = 0.50
Expected: AA = 0.25 × 200 = 50, Aa = 0.50 × 200 = 100, aa = 0.25 × 200 = 50
χ² = (80−50)²/50 + (40−100)²/100 + (80−50)²/50 = 18 + 36 + 18 = 72 (p < 0.001)
With χ² = 72 and df = 1, p < 0.001 — significant deviation from HWE. This population may be experiencing selection, non-random mating, or other evolutionary forces.
Problem: In a population, the observed genotype counts for a three-allele system (alleles 1, 2, 3) are: 11=25, 12=30, 13=15, 22=10, 23=12, 33=8. Find allele frequencies.
Solution: Total individuals = 25+30+15+10+12+8 = 100
p (allele 1) = (2×25 + 30 + 15) / 200 = 95/200 = 0.475
q (allele 2) = (2×10 + 30 + 12) / 200 = 62/200 = 0.310
r (allele 3) = (2×8 + 15 + 12) / 200 = 43/200 = 0.215
Sum check: 0.475 + 0.310 + 0.215 = 1.000 ✓
Where p is the frequency of the dominant allele (A), q is the frequency of the recessive allele (a), p² is the frequency of homozygous dominant (AA), 2pq is the frequency of heterozygous (Aa), and q² is the frequency of homozygous recessive (aa).
The chi-square test compares observed genotype counts (O) with expected counts under HWE (E). For a two-allele system with three genotypes, df = 1. A significant χ² (p < 0.05) indicates the population deviates from Hardy-Weinberg equilibrium, suggesting evolutionary forces at work.
For a gene with three alleles, allele frequencies are calculated by counting each allele copy from the genotype counts: p = (2×n₁₁ + n₁₂ + n₁₃) / 2N, q = (2×n₂₂ + n₁₂ + n₂₃) / 2N, r = (2×n₃₃ + n₁₃ + n₂₃) / 2N, where N is the total number of individuals.
When genotype counts are unknown but phenotype counts are available, the recessive allele frequency can be estimated as the square root of the recessive phenotype frequency. This method assumes Hardy-Weinberg equilibrium and complete dominance.
For X-linked genes, males are hemizygous — their allele frequency equals phenotype frequency directly. Female allele frequencies are calculated from genotype counts as in autosomal genes. The combined population frequency can be weighted by the sex ratio.
The carrier frequency (2pq) represents the proportion of heterozygous individuals in a population. For recessive disorders, carriers are unaffected but can pass the disease allele to their offspring. Carrier frequency is always much higher than disease frequency.
Hardy-Weinberg equilibrium requires: (1) random mating, (2) no mutation, (3) no natural selection, (4) large population size (no genetic drift), and (5) no gene flow. Real populations rarely satisfy all conditions perfectly.
A small χ² (p > 0.05) suggests the population is in HWE. A large χ² (p < 0.05) indicates significant deviation from HWE — possible causes include selection, inbreeding, population stratification, or genotyping errors.
Some genetic disorders persist at high frequencies because carriers have a selective advantage. Example: Sickle cell trait (HbAS) confers resistance to malaria, maintaining the HbS allele at higher frequencies in malaria-endemic regions.
⚠️ Important Note: Hardy-Weinberg equilibrium assumes ideal conditions (random mating, no mutation, no selection, large population size, no gene flow). Real populations rarely satisfy all conditions perfectly. The chi-square test for HWE is sensitive to sample size — large sample sizes may detect statistically significant but biologically trivial deviations.