Population genetics - Core Population Genetic Theory
Understand the key forces shaping allele frequencies, the determinants of genetic diversity, and the theoretical limits on adaptive evolution.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
Which researchers provided the mathematical framework for allele frequency changes under selection, drift, mutation, and migration in 1970?
1 of 15
Summary
Population Genetic Theory and Demographic Modeling
Introduction
Population genetics provides a mathematical framework for understanding how genetic variation changes over time within populations. The core insight is that allele frequencies—the relative abundances of different versions of a gene—shift due to several key forces: natural selection, genetic drift, mutation, and gene flow. Understanding these forces and how they interact forms the foundation for nearly all modern evolutionary and conservation biology. This material focuses on the key principles that explain genetic diversity patterns we observe in nature and the fundamental limits on how quickly populations can evolve.
Core Evolutionary Forces: The Four Main Processes
Four main processes determine how allele frequencies change in populations:
Natural Selection occurs when some alleles increase the survival or reproduction of individuals carrying them. An allele that makes an organism more likely to survive and reproduce will tend to increase in frequency over generations. Conversely, alleles that reduce fitness decrease in frequency. The efficiency of selection depends on the strength of the fitness difference—how much better one allele is than another.
Genetic Drift is random change in allele frequencies due to chance events during reproduction. Imagine flipping a coin many times—you expect 50% heads and 50% tails, but random variation means you might get 51% or 49%. Similarly, when parents reproduce, the alleles passed to offspring occur randomly. In small populations, these random events are large relative to population size and can cause allele frequencies to fluctuate substantially. This process is particularly important in small populations because drift can overwhelm weak selection.
Mutation introduces new genetic variation into populations. Mutations occur spontaneously during DNA replication and create new alleles that didn't previously exist. The mutation rate is typically very low (around $10^{-8}$ to $10^{-9}$ per base pair per generation), but over many generations and across large genomes, mutations constantly supply new variation.
Gene Flow (also called migration) occurs when individuals move from one population to another and reproduce, introducing alleles from one population into another. Even small amounts of gene flow between populations can have substantial effects on allele frequencies, particularly in preventing populations from diverging completely.
Determinants of Genetic Diversity: Why Some Species Are More Variable Than Others
Genetic diversity—the amount of genetic variation present in a population—is not randomly distributed across species. Some species have much higher heterozygosity (the proportion of individuals that are heterozygous at a given locus) than others. Three primary factors determine genome-wide diversity:
Effective Population Size ($Ne$) is the most important determinant. This is not the same as the census population size (the actual number of individuals you could count). Instead, effective population size accounts for the fact that not all individuals reproduce equally. If a population has 1,000 individuals but only 100 of them breed, the effective population size is much smaller. Populations with larger effective sizes maintain more variation because drift has a weaker effect on allele frequencies. In contrast, small effective populations lose variation rapidly through drift.
Mutation Rate directly supplies new genetic variation. Species with higher mutation rates accumulate more variation, all else being equal. However, mutation rates are relatively similar across many species, so this is often a secondary factor compared to $Ne$.
Selection Intensity actively removes variation from populations. This is a crucial but sometimes counterintuitive point: natural selection reduces the amount of genetic variation at linked neutral sites. This happens because when a beneficial mutation increases in frequency through selection, it carries along nearby DNA sequences in a process called "linkage drag" or background selection. The neutral variation at these linked sites is lost along with the selected mutation.
The Drift-Barrier Hypothesis
An important insight connecting these factors is the drift-barrier hypothesis, which explains why selection cannot remove all deleterious mutations from small populations. In populations smaller than a critical size, genetic drift overwhelms weak selection. This means that mildly harmful mutations can increase in frequency by chance alone, preventing their removal. As a result, small populations accumulate slightly deleterious mutations, and this genetic load can reduce population fitness. This creates a fundamental tension: the same small population sizes that lose beneficial variation also struggle to eliminate harmful mutations.
Background Selection and Selective Sweeps
Two processes—background selection and selective sweeps—jointly create patterns of variation along chromosomes. Background selection occurs continuously as natural selection removes deleterious mutations linked to neutral sites, reducing heterozygosity across the genome. Selective sweeps occur when a beneficial mutation rises to fixation (reaches 100% frequency), eliminating variation at linked sites in the process. Together, these processes explain why genomic regions near genes under strong selection show dramatically reduced diversity compared to neutral regions far from genes.
Limits to the Rate of Adaptive Substitution: How Fast Can Populations Evolve?
An important theoretical question is: what is the maximum rate at which a population can incorporate beneficial mutations and adapt to environmental change? This question has practical importance for understanding evolutionary potential and responses to selection.
In theory, the rate of adaptive substitution depends on the rate at which beneficial mutations arise and their fixation probability. Larger populations produce more mutations per generation, and beneficial mutations have higher fixation probabilities than neutral ones. This suggests that adaptation should be faster in larger populations.
However, there is a critical constraint: interference among linked beneficial mutations. In sexual populations, beneficial mutations that arise at different locations on the chromosome compete with each other. When selection favors multiple beneficial mutations, but they are inherited together (because they're on the same chromosome), recombination is needed to separate them. Without sufficient recombination, one lineage may carry both the beneficial mutation and a harmful linked allele, reducing its relative fitness. This genetic interference slows the rate at which beneficial mutations can spread through the population.
The practical implication is that the rate of adaptation in large sexual populations scales with the product of effective population size ($Ne$) and the beneficial mutation rate ($\mub$). However, this rate is ultimately capped by the recombination rate—the genome cannot recombine faster than its physical structure allows. This fundamental limit explains why even very large populations cannot adapt arbitrarily quickly.
<extrainfo>
Empirical estimates from model organisms like Drosophila (fruit flies) and from human populations broadly support these theoretical predictions, showing that adaptation rates do increase with population size up to the point where recombination becomes the limiting factor.
</extrainfo>
Neutral Theory: Understanding Evolution When Selection Is Weak
Classical evolutionary theory emphasizes natural selection as the primary driver of evolutionary change. However, neutral theory provides a different perspective: much of the genetic variation we observe, and a substantial fraction of evolutionary change at the DNA level, occurs in regions where selection is weak or absent.
The core of neutral theory is straightforward: mutations fall into two categories. Most mutations are deleterious and reduce fitness; these are removed by selection and never reach appreciable frequencies in populations. The remaining mutations are neutral—they have little to no effect on fitness and thus behave as if they are invisible to selection. Under neutral theory, genetic drift, not selection, determines the fate of these neutral alleles.
This is a crucial distinction from the adaptationist perspective. Under adaptationist thinking, observed genetic variation reflects selection maintaining multiple alleles. Under neutral theory, much observed variation represents neutral alleles temporarily present in the population while they slowly drift toward fixation or loss.
Origin-Fixation Dynamics
A powerful way to think about evolutionary change is through the origin-fixation framework. Rather than tracking how allele frequencies change over time, we instead ask: at what rate do new mutations arise, and what is the probability that a new mutation becomes fixed (reaches 100% frequency)?
The key insight is that the rate of evolutionary change equals the mutation rate multiplied by the fixation probability:
$$\text{Rate of substitution} = \mu \times P{\text{fix}}$$
where $\mu$ is the mutation rate and $P{\text{fix}}$ is the probability that a new mutation becomes fixed.
For neutral mutations, the fixation probability is simple: $P{\text{fix}} = \frac{1}{2Ne}$, where $Ne$ is the effective population size. This means that most new mutations are lost by drift, but the rare one that happens to drift toward fixation does so. The surprising result is that the substitution rate for neutral mutations is $\mu$—it does not depend on population size. Larger populations have higher mutation rates, but lower fixation probabilities per mutation, and these effects exactly cancel. This is called the molecular clock: neutral sequences accumulate substitutions at a roughly constant rate over time.
In contrast, for beneficial mutations, $P{\text{fix}}$ is much higher (roughly proportional to the selection coefficient), so adaptation accelerates in larger populations.
Putting It Together: How These Forces Interact
The four evolutionary forces—selection, drift, mutation, and gene flow—do not act in isolation. Their relative importance depends on population size and the strength of selection:
In very large populations with strong selection, natural selection dominates and shapes allele frequencies precisely. Drift is negligible.
In very small populations, genetic drift overwhelms weak to moderate selection. The population's evolutionary fate is largely determined by chance.
Intermediate-sized populations experience a balance of forces. Beneficial mutations are selected but interfere with each other; deleterious mutations are purged but some escape removal by drift.
These principles explain broad patterns in genetic diversity across life. Species with large, stable populations (like many invertebrates) maintain high genetic diversity and evolve rapidly. Species with small, fragmented populations accumulate slightly deleterious mutations and lose beneficial variation through drift, reducing their adaptive potential. Understanding these principles is essential for conservation biology, animal breeding, and predicting how populations respond to environmental change.
Flashcards
Which researchers provided the mathematical framework for allele frequency changes under selection, drift, mutation, and migration in 1970?
Crow and Kimura
Who formulated the basic equations describing the causes of evolutionary change in 1932?
Haldane
Which scientist summarized the genetic basis of evolutionary change in 1973, highlighting variation and selection?
Lewontin
How did Gutenkunst et al. (2009) infer the joint demographic histories of multiple populations?
From multidimensional SNP frequency spectra
What are the three primary determinants of genome-wide diversity identified by Ellegren and Galtier?
Effective population size
Mutation rate
Selection intensity
What is the effect of natural selection on neutral diversity across species, according to Corbett-Detig et al. (2015)?
It reduces neutral diversity
What hypothesis links reduced diversity to selection efficiency limits in small populations?
The drift-barrier hypothesis
Which two processes jointly shape patterns of chromosomal heterozygosity?
Background selection
Selective sweeps
How does a species' census size generally relate to its levels of standing variation?
Larger census sizes retain higher levels of variation
What factor limits the rate of adaptation in large sexual populations, even as it scales with effective population size and mutation rate?
Recombination
Why does interference among linked beneficial mutations slow down the process of adaptation?
It reduces the efficacy of selection (linked mutations interfere with one another)
According to Neutral Theory, what is the primary mechanism by which neutral mutations evolve?
Genetic drift
In Neutral Theory, what drives evolutionary change rather than changes in the frequency of existing alleles?
The arrival of new mutations
How is the rate of evolutionary change calculated under the origin-fixation perspective?
The product of the mutation rate and the fixation probability of a new allele
What are the four main forces that influence allele frequencies in a population?
Natural selection
Genetic drift
Gene flow
Recurrent mutation
Quiz
Population genetics - Core Population Genetic Theory Quiz Question 1: Which evolutionary forces are incorporated in the mathematical framework for allele frequency change described by Crow and Kimura (1970)?
- Selection, genetic drift, mutation, and migration (correct)
- Selection, recombination, epistasis, and gene conversion
- Mutation, gene flow, horizontal transfer, and polyploidy
- Selection, mutation, gene duplication, and hybridization
Population genetics - Core Population Genetic Theory Quiz Question 2: Under neutral theory, neutral mutations primarily evolve in a population by which mechanism?
- Random genetic drift (correct)
- Positive natural selection
- Gene flow between populations
- Recurrent mutational input alone
Population genetics - Core Population Genetic Theory Quiz Question 3: Which Bayesian method introduced by Pritchard, Stephens, and Donnelly (2000) is widely used to infer population structure from multilocus genotype data?
- STRUCTURE (correct)
- ADMIXTURE
- FASTSTRUCTURE
- PLINK
Population genetics - Core Population Genetic Theory Quiz Question 4: According to the origin‑fixation perspective, the rate of evolutionary change is defined as the product of which two components?
- Mutation rate and fixation probability of a new allele (correct)
- Selection coefficient and effective population size
- Genetic drift strength and migration rate
- Recombination rate and mutation bias
Population genetics - Core Population Genetic Theory Quiz Question 5: According to Ellegren and Galtier (2016), which three factors are primary determinants of genome‑wide genetic diversity?
- Effective population size, mutation rate, and selection intensity (correct)
- Recombination rate, gene flow, and chromosomal length
- Genetic drift, population subdivision, and epigenetic modification
- Environmental variability, lifespan, and metabolic rate
Population genetics - Core Population Genetic Theory Quiz Question 6: What did Weissman, Barton, and McVean (2012) quantify for sexual populations?
- The upper bound on the rate of adaptive substitution (correct)
- The average effect size of deleterious mutations
- The minimum recombination rate needed for adaptation
- The exact time to fixation of a beneficial allele
Population genetics - Core Population Genetic Theory Quiz Question 7: Which of the following is a main force influencing allele frequencies under neutral theory?
- Genetic drift (correct)
- Recombination
- Horizontal gene transfer
- Epigenetic modification
Which evolutionary forces are incorporated in the mathematical framework for allele frequency change described by Crow and Kimura (1970)?
1 of 7
Key Concepts
Population Genetics Concepts
Population genetics
Effective population size
Genetic drift
Natural selection
Mutation rate
Background selection
Selective sweep
Drift‑barrier hypothesis
Origin‑fixation dynamics
Statistical Methods
Bayesian population structure inference
Definitions
Population genetics
The study of how allele frequencies change in populations under forces such as selection, drift, mutation, and migration.
Effective population size
A measure of the number of individuals in a population who contribute genes to the next generation, influencing genetic drift and diversity.
Genetic drift
Random fluctuations in allele frequencies due to sampling effects in finite populations.
Natural selection
The differential reproductive success of individuals caused by differences in phenotype, leading to changes in allele frequencies.
Mutation rate
The frequency at which new genetic mutations arise per generation in a genome.
Background selection
The reduction of neutral genetic variation caused by the continual removal of deleterious alleles linked to neutral sites.
Selective sweep
The process by which a beneficial mutation rapidly rises to fixation, dragging along linked genetic variation and reducing diversity.
Drift‑barrier hypothesis
The idea that the efficacy of natural selection is limited in small populations, setting a lower bound on the level of genetic drift.
Bayesian population structure inference
A statistical method (e.g., STRUCTURE) that uses multilocus genotype data to assign individuals to genetic clusters.
Origin‑fixation dynamics
A framework that models the rate of evolutionary change as the product of the mutation rate and the probability that a new mutation fixes.