Fundamental Sequencing Concepts and Methods
Understand the basics of DNA sequencing, how Sanger and pyrosequencing methods operate, and their significance for research and technology.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
What level of biopolymer structure is determined by sequencing?
1 of 16
Summary
DNA Sequencing: Determining Genetic Information
Introduction to DNA Sequencing
DNA sequencing is the process of determining the order of nucleotides in a DNA molecule. This nucleotide order is called the primary structure of the DNA and represents the fundamental genetic information encoded in the molecule. Sequencing produces a symbolic linear depiction—simply called a sequence—that summarizes the atomic-level structure of the DNA in an easy-to-read format.
Because DNA sequencing reveals genetic information, it has become essential for fundamental biological research, medical diagnosis, pathogen identification, and biotechnology applications. The cost and speed of sequencing have improved dramatically over recent decades, following a trend similar to Moore's law in computing—a pattern known as the Carlson curve.
Two major sequencing methods have dominated the field: Sanger sequencing (the historical standard) and pyrosequencing (an emerging technology). Understanding how each method works will help you grasp the principles underlying modern genetic analysis.
Sanger Sequencing: The Chain Termination Method
Sanger sequencing, also called chain terminator sequencing, is based on a clever principle: selectively halt DNA synthesis at each position in the sequence, creating fragments of all possible lengths, then determine where each fragment ends.
The Basic Process
Step 1: Primer Binding
The process begins with a short DNA fragment called a primer that is complementary to the template DNA strand. This primer binds to the template at a specific location, providing a starting point for DNA synthesis.
Step 2: DNA Polymerase Extension
DNA polymerase enzymes are proteins that build new DNA strands by reading a template and adding nucleotides one at a time. In Sanger sequencing, the polymerase extends the primer using normal deoxynucleotides (the standard building blocks of DNA), but here's the critical addition: a small proportion of dideoxynucleotides (ddNTPs) are also present in the reaction mixture.
The Chemistry of Chain Termination
Understanding why chain termination works requires knowing the structural difference between normal nucleotides and dideoxynucleotides.
A normal deoxynucleotide has a hydroxyl group ($-OH$) at the 3′ position of its sugar ring. This hydroxyl group is essential because DNA polymerase uses it to form the phosphodiester bond that attaches the next nucleotide to the growing chain.
Dideoxynucleotides lack the hydroxyl group at the 3′ position. When DNA polymerase accidentally incorporates a dideoxynucleotide (since it's present at low concentration), the polymerase cannot add the next nucleotide. The chain is permanently terminated.
This creates a population of DNA fragments of different lengths: some terminate at the first position where a dideoxynucleotide was incorporated, others at the second position, others at the third, and so on, covering every possible termination point in the sequence.
Identifying Fragment Sizes
Since each fragment differs by exactly one nucleotide, separating fragments by size reveals the sequence. The original method used gel electrophoresis: DNA fragments are loaded into a polyacrylamide gel and an electric current is applied. Smaller fragments move faster through the gel matrix, while larger fragments move slower. This creates a pattern of bands, with each band representing fragments of a specific length.
The image shows a classic Sanger sequencing result. The four lanes represent reactions terminated with different dideoxynucleotides (A, T, G, and C). Reading the bands from bottom to top gives you the DNA sequence—wherever a band appears indicates a nucleotide at that position in the sequence.
Dye Terminator Sequencing: A Critical Improvement
The method described above required four separate reactions (one for each nucleotide type) to determine a complete sequence. A major improvement came with dye terminator sequencing, which labels each of the four dideoxynucleotides with a distinct fluorescent dye:
dATP labeled with one dye (one wavelength of light)
dGTP labeled with a different dye (different wavelength)
dCTP labeled with another dye
dTTP labeled with a fourth dye
Now a single reaction tube contains all four dideoxynucleotides (all present at low concentration) plus normal nucleotides. When fragments of different sizes are separated—either by gel electrophoresis or more commonly in modern labs, by capillary electrophoresis—a detector reads the wavelength of fluorescence for each fragment size. The color identifies which nucleotide terminated that fragment.
This improvement is significant: a single reaction determines the entire sequence, rather than requiring four separate reactions and manual assembly of results.
Modern Sanger sequencing uses dye terminator methods. Recent improvements in enzymes and dyes have reduced incorporation variability—meaning that dideoxynucleotides are incorporated more consistently at different sequence positions—which has reduced the problem of uneven peak heights in the output data.
Pyrosequencing: Detection-Based Sequencing
Pyrosequencing is a fundamentally different approach that doesn't rely on chain termination or size separation. Instead, it detects which nucleotide is being incorporated in real time.
Preparation and Setup
Before pyrosequencing begins, the DNA strand must be amplified (typically using the polymerase chain reaction, or PCR) to create many identical copies. This amplification is necessary because the detection system is sensitive to the chemical signals released during individual nucleotide incorporation events.
The Detection Chemistry
When DNA polymerase incorporates a nucleotide, it releases a byproduct called pyrophosphate. Pyrosequencing harnesses a chain of biochemical reactions to convert this release into a detectable light signal:
ATP Production: The enzyme ATP sulfurylase converts pyrophosphate into ATP (adenosine triphosphate), the energy currency of cells.
Light Generation: ATP powers the enzyme luciferase, which catalyzes a reaction that produces visible light.
Signal Detection: A detector records the light signal as a pyrogram—a graph showing signal intensity over time. Each nucleotide incorporation produces a peak.
Reading Homopolymer Stretches
Here's a key advantage of pyrosequencing: the intensity of the light signal is proportional to the number of nucleotides incorporated at each step.
If the template DNA has a homopolymer stretch—a series of identical nucleotides like AAAA or TTTTT—then when the complementary nucleotides are added, multiple identical nucleotides are incorporated in a single synthesis step. This releases multiple pyrophosphate molecules, which produce a stronger light signal.
By reading the signal strength, you can determine how many of that nucleotide were added in a single incorporation event. This is a direct measurement of homopolymer length.
Removal of Unincorporated Nucleotides
A potential problem: what if nucleotides remain in the reaction mixture and produce false signals? The enzyme apyrase solves this by degrading any unincorporated nucleotides, ensuring that only signals from actually incorporated nucleotides are detected.
Advantages and Limitations
Pyrosequencing has significant advantages over Sanger sequencing:
It requires neither fluorescently labeled nucleotides nor gel electrophoresis, simplifying the chemistry
It produces data in real time as nucleotides are incorporated
It handles homopolymer regions directly
Because of these advantages, pyrosequencing now generates more genome data worldwide than Sanger sequencing, despite Sanger's historical dominance.
<extrainfo>
Additional Context: Applications and Market Trends
DNA sequencing knowledge is applied across many fields: deciphering gene sequences to understand protein function, diagnosing genetic diseases, developing treatments, tracking pathogenic viruses, and engineering organisms for biotechnology. The rapid decrease in sequencing costs (the Carlson curve) has made these applications increasingly accessible, similar to how the decreasing cost of computing has transformed technology generally.
</extrainfo>
Flashcards
What level of biopolymer structure is determined by sequencing?
Primary structure
What specific order does DNA sequencing determine in a fragment?
Nucleotide order
Which historical method, developed by Frederick Sanger, has dominated DNA sequencing?
Chain termination method
What concept describes the rapid decrease in cost and increase in performance of DNA sequencing, analogous to Moore's law?
Carlson curve
What molecule initiates extension at a specific site on the DNA template?
Short oligonucleotide primer
What specific type of chain-terminating nucleotide is most commonly added to the reaction?
Dideoxynucleotide
Why do dideoxynucleotides prevent further elongation when incorporated?
They lack hydroxyl groups at both the 2′ and 3′ positions of the ribose
How are the resulting DNA fragments of different sizes separated?
Electrophoresis (in polyacrylamide gel or a capillary)
How does dye terminator sequencing distinguish between the four types of dideoxynucleotides?
Each is labeled with a distinct fluorescent dye
What physical property allows the identification of the terminating base in this method?
Wavelength of the emitted light
What process must occur to the DNA strand before pyrosequencing begins?
Amplification by Polymerase Chain Reaction (PCR)
What chemical is released when DNA polymerase incorporates a nucleotide?
Pyrophosphate
Which enzyme uses $ATP$ to produce the light signal recorded in a pyrogram?
Luciferase
To what is the intensity of the light signal proportional in pyrosequencing?
Number of nucleotides incorporated in that synthesis step
What is the role of the enzyme apyrase in the reaction?
To remove unincorporated nucleotides (preventing false signals)
How does pyrosequencing detect homopolymer stretches (identical nucleotides in a row)?
Through the strength of the recorded light signal
Quiz
Fundamental Sequencing Concepts and Methods Quiz Question 1: What does sequencing determine about an unbranched biopolymer?
- Its primary structure (correct)
- Its tertiary structure
- Its secondary structure
- Its biological function
Fundamental Sequencing Concepts and Methods Quiz Question 2: Which DNA sequencing method was historically the most widely used?
- Chain‑termination (Sanger) method (correct)
- Pyrosequencing
- Illumina sequencing‑by‑synthesis
- Nanopore single‑molecule sequencing
Fundamental Sequencing Concepts and Methods Quiz Question 3: What does the Carlson curve illustrate in the context of DNA sequencing technologies?
- A rapid decline in cost and rise in performance over time (correct)
- The error rate of sequencing reads
- The maximum length of DNA fragments that can be sequenced
- The proportion of GC content in sequenced genomes
Fundamental Sequencing Concepts and Methods Quiz Question 4: In pyrosequencing, what molecule is released when DNA polymerase incorporates a nucleotide?
- Pyrophosphate (PPi) (correct)
- Water (H₂O)
- Carbon dioxide (CO₂)
- Hydrogen peroxide (H₂O₂)
Fundamental Sequencing Concepts and Methods Quiz Question 5: What does DNA sequencing determine for a given DNA fragment?
- The nucleotide order of the fragment (correct)
- The three‑dimensional shape of the fragment
- The methylation pattern of the fragment
- The protein it encodes
Fundamental Sequencing Concepts and Methods Quiz Question 6: In pyrosequencing, which molecule provides the energy that powers luciferase to emit light?
- Adenosine triphosphate (ATP) (correct)
- Nicotinamide adenine dinucleotide (NADH)
- Guanosine triphosphate (GTP)
- cAMP
What does sequencing determine about an unbranched biopolymer?
1 of 6
Key Concepts
DNA Sequencing Techniques
DNA sequencing
Sanger sequencing
Dye terminator sequencing
Pyrosequencing
Chain termination method
Fluorescent detection in DNA sequencing
DNA Amplification and Cost Trends
Polymerase chain reaction (PCR)
Carlson curve
Definitions
DNA sequencing
The process of determining the precise order of nucleotides within a DNA molecule.
Sanger sequencing
A chain‑termination method for DNA sequencing that uses dideoxynucleotides to generate fragments of varying length.
Dye terminator sequencing
A variation of Sanger sequencing in which each dideoxynucleotide is labeled with a distinct fluorescent dye, enabling automated detection.
Pyrosequencing
A sequencing-by-synthesis technique that detects pyrophosphate release as light, allowing real‑time base identification without fluorescent labels.
Polymerase chain reaction (PCR)
A laboratory method that amplifies specific DNA fragments exponentially, providing sufficient template for downstream analyses.
Carlson curve
An empirical relationship describing the rapid decline in cost and increase in throughput of DNA sequencing technologies, analogous to Moore’s law.
Chain termination method
The foundational principle of Sanger sequencing where incorporation of a chain‑terminating nucleotide halts DNA synthesis, producing length‑graded fragments.
Fluorescent detection in DNA sequencing
The use of wavelength‑specific fluorescent dyes to identify terminating nucleotides, enabling high‑throughput, automated sequence readout.