Virology - Molecular Tools and Viral Genomics
Understand key molecular techniques for virus purification and sequencing, mechanisms of viral genetic variation, and how bioinformatics and reverse genetics enable viral engineering and vaccine development.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
What is the two-step process used in differential centrifugation to separate viruses from larger contaminants?
1 of 17
Summary
Molecular Biology Techniques in Virology
Introduction
Working with viruses requires specialized laboratory techniques to isolate, identify, and manipulate viral components. This section covers the fundamental methods scientists use to purify viruses, separate and visualize their components, sequence their genomes, and create genetically modified viral strains. These techniques form the backbone of modern virology research and clinical diagnostics.
Purification of Viruses and Components
Before analyzing viral properties or using viruses for research, scientists must remove contaminating particles and cellular material. This requires a systematic approach using centrifugation methods.
Differential Centrifugation
Differential centrifugation exploits differences in particle size and density to progressively purify viruses. The process works in stages:
Low-speed centrifugation (typically 1,000–10,000 × g) first removes large debris—unbroken cells, cell nuclei, and mitochondria settle as a pellet while smaller viral particles remain in the supernatant (the liquid above the pellet).
High-speed ultracentrifugation (typically 100,000 × g or higher) then pellets the viral particles themselves, which sediment more slowly than larger structures but much faster than dissolved proteins.
The key principle: centrifugal force pushes heavier or denser particles outward faster, causing them to separate based on their sedimentation rate.
Buoyant Density Centrifugation
While differential centrifugation separates by size, buoyant density centrifugation separates by density alone. This technique uses a density gradient—a solution that gradually increases in density from top to bottom. The most common gradient uses cesium chloride (CsCl), which creates a smooth density range.
When a virus sample is loaded onto this gradient and centrifuged, each virus type moves to the layer where its density matches the surrounding solution density—a position where it no longer floats up or sinks down. This equilibrium separation can isolate viruses from proteins (which have different buoyant densities) and even separate nucleic acids from protein coats.
Electrophoretic Separation
After purification, researchers often need to separate viral proteins or nucleic acids to examine individual components. Gel electrophoresis accomplishes this by using electrical current to push charged molecules through a porous matrix.
How Gel Electrophoresis Works
Proteins and nucleic acids are charged molecules. In an electric field, they migrate toward the electrode of opposite charge. The gel matrix (made of either agarose or polyacrylamide) acts like a molecular sieve—smaller molecules move faster through the pores, while larger molecules move more slowly.
The result is separation by molecular size: molecules of the same charge but different sizes end up at different positions along the gel, creating distinct bands.
Visualization with Stains
The separated molecules are invisible unless stained:
Coomassie blue binds to proteins and turns them bright blue, making protein bands visible
Ethidium bromide intercalates into DNA and RNA, fluorescing under ultraviolet (UV) light, making nucleic acid bands visible
This combination of separation and staining allows researchers to verify the purity and size of viral proteins or genomes.
Genome Sequencing
Modern virology depends critically on determining viral nucleotide sequences—the order of bases in viral DNA or RNA. Two main approaches are used today.
Sanger Sequencing
Sanger sequencing is the classical method. It works by synthesizing new DNA strands that are complementary to the target sequence, but incorporating chain-terminating nucleotides (called dideoxynucleotides, or ddNTPs) at random positions. This produces a collection of DNA fragments of different lengths, each terminating at a different position in the sequence. By separating these fragments by size, the sequence can be read like a ladder. Sanger sequencing is highly accurate and still used for validating sequences, though it is slower and more labor-intensive than newer methods.
Next-Generation Sequencing (NGS)
Next-generation sequencing technologies dramatically increased sequencing speed and throughput. NGS machines simultaneously sequence millions of small DNA fragments (called reads), generating enormous amounts of sequence data quickly and at lower cost. For viral research, these millions of reads are computationally assembled into complete viral genome sequences. NGS is the standard approach for discovering new viruses, surveying viral populations, and sequencing clinical samples.
Over two million unique viral sequences have been deposited in public databases like GenBank, providing a searchable reference for comparing newly discovered viruses against known sequences.
Phylogenetic Analysis
Once viral sequences are obtained, comparing them reveals evolutionary relationships. Phylogenetic analysis constructs evolutionary trees that show how viruses relate to one another.
Building Phylogenetic Trees
Specialized software (such as PHYLIP) compares viral genome sequences and calculates evolutionary distances—roughly, the percentage of nucleotides that differ between two sequences. Viruses with very similar sequences are placed close together on the tree; distantly related viruses are farther apart. The tree branches represent inferred common ancestors.
Why This Matters
Phylogenetic trees serve several critical functions:
Tracing origins: identifying where a new viral outbreak originated
Understanding spread: showing how a virus has evolved as it spread through a population
Detecting recombination: identifying regions where two viruses have exchanged genetic material
Predicting antigenicity: similar sequences often indicate similar immune responses
Cloning of Viral Genes
Producing viral proteins for research or vaccines without culturing the actual virus is possible through cloning. This process inserts viral genes into cloning vectors—usually plasmids, small circular DNA molecules that can replicate independently in bacterial cells.
The Process
Viral nucleotide sequences (usually derived from cDNA reverse-transcribed from viral RNA) are inserted into a plasmid
The recombinant plasmid is introduced into bacterial cells
Bacteria reproduce rapidly, copying the plasmid and its viral gene insert millions of times
The cloned viral genes can be expressed (transcribed and translated) in the bacteria to produce viral proteins
Advantages
This approach has major benefits:
Safety: no need to culture infectious virus
Scale: bacteria grow rapidly, producing large quantities of viral protein
Purity: the desired viral protein can be purified away from other bacterial proteins
Reproducibility: the cloned gene sequence is permanent and stable
Cloned viral genes are particularly valuable for developing diagnostic tests and vaccine antigens.
Phage Virology
Bacteriophages (or phages) are viruses that infect bacteria, archaea, and fungi. Despite their small size and simple nature, phages are extraordinarily useful research tools.
Applications
Phage display: protein sequences are fused to phage coat proteins, allowing researchers to screen millions of variants for desired properties (binding affinity, enzymatic activity, etc.)
Model system: phages replicate rapidly and have small genomes, making them ideal for studying fundamental viral biology, genetic recombination, and replication mechanisms
Tools: phages are used to transfer DNA between bacteria (a process called transduction) and to study bacterial genetics
<extrainfo>
Phage virology is a classical field that contributed many foundational insights to molecular biology. The study of phages like T4 and λ (lambda) in the mid-20th century established basic principles of DNA replication, gene regulation, and recombination that apply broadly across biology.
</extrainfo>
Viral Genetics
Viruses generate genetic diversity through multiple mechanisms. Understanding these processes explains how viral populations change and adapt.
Reassortment
Some viruses have segmented genomes—multiple separate RNA or DNA molecules, each encoding different genes. Influenza virus, for example, has eight separate RNA segments.
Reassortment occurs when two different viruses infect the same cell and exchange whole genome segments. A daughter virus might inherit segments from both parent viruses, creating a new combination of genes. This is fundamentally different from gradual evolution—it creates novelty through wholesale segment swapping.
When it matters: Reassortment becomes epidemiologically important when, for example, a human-adapted influenza virus and an animal influenza virus co-infect the same host cell, producing a hybrid virus that might be novel to human immune systems.
Recombination
Recombination is the joining of nucleic acid fragments from different parent viruses during replication. Unlike reassortment, recombination can occur at any point along the genome—not just at segment boundaries.
During replication, when two viral genomes are being copied in the same cell, the replication machinery can switch from copying one parent genome to the other, creating a recombinant genome with nucleotide sequences from both parents. This generates genetic variation within segments and is a source of evolutionary change in all viruses.
Reverse Genetics and Infectious Clones
Reverse genetics reverses the typical direction of investigation: instead of observing a virus and inferring its genes, researchers design specific genes and create viruses that carry them.
The Approach
The complete viral genome sequence is copied into cDNA (complementary DNA, made from viral RNA)
This cDNA is cloned into plasmids—one plasmid per genome segment (for segmented viruses) or one large plasmid for non-segmented viruses
The plasmids are introduced into cells, which reconstitute infectious viral particles
The reconstituted viruses are infectious clones—their genomes are known sequences that can be manipulated genetically
Research Applications
Reverse genetics enables researchers to:
Examine gene function: delete or mutate specific genes and observe the phenotypic effects
Study virulence: identify which genetic changes make a virus more or less dangerous
Assess transmissibility: determine genetic requirements for spreading between hosts
Develop vaccines: create attenuated (weakened) strains with reduced disease potential
The power of this approach is genetic precision: scientists know exactly which nucleotides have been changed and can causally link genetic changes to phenotypic outcomes.
Viral Genomics, Bioinformatics, and Metaviromics
High-Throughput Sequencing of Viral Genomes
Next-generation sequencing generates millions of short reads—typically 50–250 bases each—very rapidly. For viral genome projects, these reads must be assembled into complete sequences.
Assembly works by finding overlaps: if read A's end matches read B's beginning, they overlap and can be joined. Specialized software detects all such overlaps and stitches reads together into longer contigs (continuous sequences), ultimately reconstructing complete viral genomes.
This approach has made it possible to sequence viral genomes in days or even hours, a dramatic acceleration from the months or years required by Sanger sequencing alone.
Metaviromics: Exploring Viral Communities
Metaviromics takes a fundamentally different approach to discovering viruses. Instead of culturing viruses or extracting them from known infected hosts, metaviromics:
Collects environmental samples (soil, ocean water, sewage, respiratory secretions, gut contents, etc.)
Extracts all nucleic acids directly from the sample
Sequences the entire pool of genetic material
Computationally separates viral sequences from host and bacterial sequences
Why This Matters
This approach has revealed enormous viral diversity previously unknown:
Environmental viruses: thousands of new phages and plant viruses in natural ecosystems
Human virome: diverse viral populations in the gut, respiratory tract, and other sites, including viruses not culturable in the lab
Rare and emerging viruses: viruses present at low levels or in unexpected hosts
Metaviromics has fundamentally expanded our understanding of the virosphere—the total viral diversity on Earth. Most viruses are still unknown; metaviromics discovers them without needing to culture them.
Phylogenetic Analysis and Virus Evolution
Phylogenetic trees constructed from viral genome sequences reveal not just relationships but also evolutionary patterns. By sequencing many strains and comparing them:
Origins are traced: determining geographic regions and likely animal or plant sources
Transmission chains are mapped: showing how viruses spread through human populations
Recombination is detected: certain regions of a genome may have evolutionary histories that disagree with others, indicating past recombination
<extrainfo>
Recent examples illustrate this power. Phylogenetic analyses of SARS-CoV-2 sequences early in the pandemic helped establish that the virus originated in bats, identified the likely initial human spillover event in Wuhan, and tracked spread to other continents. Similarly, analysis of Ebola virus sequences has repeatedly identified animal reservoir sources and traced outbreaks to specific transmission events.
</extrainfo>
Reverse Genetics and Viral Engineering
Reverse Genetics for Influenza Virus
Influenza virus is a segmented virus—eight separate RNA molecules, each packaged into virions. Reverse genetics for influenza is particularly elegant:
All eight viral RNA segments are reverse-transcribed into cDNA and cloned into plasmids
Each plasmid is introduced into mammalian cells (not bacteria, because influenza genes need eukaryotic transcription machinery to be properly expressed)
Cells co-transfected with all eight plasmids produce authentic influenza viruses
This approach enabled the construction of reassortant viruses—viruses combining genes from two different strains—without requiring natural co-infection. Researchers can deliberately combine genes from seasonal virus strains with genes conferring immunological novelty, or study the impact of individual genetic changes by comparing viruses differing at just one or two nucleotides.
Vaccine Development
Reverse genetics has accelerated flu vaccine development. If a novel influenza strain emerges but grows poorly in standard lab media, researchers can use reverse genetics to transfer its surface protein genes (hemagglutinin and neuraminidase) onto a vaccine strain backbone that grows well, creating a vaccine strain matching the pandemic virus but replicating efficiently for industrial production.
Construction of Recombinant Adenovirus Vectors
Adenoviruses are large DNA viruses with genomes large enough to accommodate foreign genes. Scientists have engineered adenoviruses as vectors—vehicles for delivering genes into cells.
Recombinant adenoviruses typically have:
Viral genes necessary for replication and cell entry intact (allowing the vector to infect target cells)
Early viral genes deleted (reducing viral protein expression and immune responses)
Foreign genes inserted in place of the deleted genes (allowing production of desired proteins in infected cells)
Applications
Gene therapy: delivering corrective genes to replace defective cellular genes
Vaccine delivery: expressing viral antigens or tumor antigens in cells to trigger immune responses
Research: introducing experimental genes to study their effects in living tissues
The advantage of adenovirus vectors is their size capacity (allowing large inserts), their ability to infect many cell types, and their safety profile in modern designs.
Flashcards
What is the two-step process used in differential centrifugation to separate viruses from larger contaminants?
First low‑speed centrifugation, then high‑speed ultracentrifugation to pellet viruses.
What does buoyant density centrifugation use to further purify viruses or isolate nucleic acids and proteins?
Density gradients (such as cesium chloride).
Which matrices are typically used for the electrophoretic separation of viral components?
Agarose or polyacrylamide matrices.
Which stain is commonly used to visualize separated protein bands in gel electrophoresis?
Coomassie blue.
Which stain is commonly used to visualize separated nucleic acid bands in gel electrophoresis?
Ethidium bromide.
Which two methods are used to determine viral nucleotide sequences for research and clinical diagnosis?
Sanger sequencing
Next‑generation sequencing
What is the primary purpose of using phylogenetic software like PHYLIP on sequenced viral genomes?
To infer evolutionary relationships and construct phylogenetic trees.
What do phylogenetic trees illustrate regarding virus strains?
Evolutionary relationships based on genome sequences.
What three factors do phylogenetic trees help trace regarding pathogenic viruses?
Origin
Spread
Recombination events
What type of cloning vectors are usually used to carry viral nucleic acid inserts for propagation in bacteria?
Plasmids.
What is the primary advantage of using cloned viral genes for diagnostics and vaccine development?
Enables production of viral proteins without needing native virus cultures.
How does reassortment differ from other forms of genetic exchange in viruses?
It involves the exchange of whole genome segments between two parental viruses.
What occurs during viral recombination to allow genetic exchange without whole-segment swapping?
The joining of portions of nucleic acid molecules during replication.
What does reverse genetics use to generate genetically modified viruses?
Complementary DNA copies of viral genomes (infectious clones).
What are modified viruses created through reverse genetics typically used to study?
Functions such as virulence and transmissibility.
How is influenza virus reconstituted using reverse genetics?
From cloned plasmids encoding each viral RNA segment.
What are two primary uses for recombinant adenovirus vectors?
Gene therapy
Vaccine delivery
Quiz
Virology - Molecular Tools and Viral Genomics Quiz Question 1: What process enables exchange of genetic material between viral nucleic acid molecules without swapping entire genome segments?
- Recombination (correct)
- Reassortment
- Transduction
- Mutation
Virology - Molecular Tools and Viral Genomics Quiz Question 2: Which technique directly extracts viral nucleic acids from environmental samples to profile viral communities?
- Metaviromics (correct)
- Metagenomics
- Transcriptomics
- Proteomics
Virology - Molecular Tools and Viral Genomics Quiz Question 3: Which viral vector is created by inserting foreign DNA into the adenovirus genome for gene therapy and vaccine delivery?
- Recombinant adenovirus vector (correct)
- Retroviral vector
- Lentiviral vector
- Adeno‑associated virus (AAV) vector
Virology - Molecular Tools and Viral Genomics Quiz Question 4: What information is obtained by constructing a phylogenetic tree from viral genome sequences?
- Evolutionary relationships among virus strains (correct)
- Exact mutation rates per nucleotide
- Absolute geographic origins of each strain
- Protein folding patterns of viral enzymes
Virology - Molecular Tools and Viral Genomics Quiz Question 5: What principle does buoyant density centrifugation use to further purify viruses or isolate their nucleic acids and proteins?
- Separation based on buoyant density in a gradient (correct)
- Separation based on molecular weight differences
- Separation based on charge differences
- Separation based on hydrophobic interactions
Virology - Molecular Tools and Viral Genomics Quiz Question 6: What term describes the exchange of whole genome segments between two parental viruses that have segmented genomes?
- Reassortment (correct)
- Recombination
- Mutation
- Transduction
Virology - Molecular Tools and Viral Genomics Quiz Question 7: What experimental approach allows researchers to create genetically modified viruses by using complementary DNA copies of the viral genome?
- Reverse genetics (correct)
- CRISPR‑Cas9 editing
- RNA interference
- Site‑directed mutagenesis
Virology - Molecular Tools and Viral Genomics Quiz Question 8: How many individual plasmids encoding viral RNA segments are required to reconstitute a complete influenza virus via reverse genetics?
- Eight (correct)
- Six
- Ten
- Twelve
Virology - Molecular Tools and Viral Genomics Quiz Question 9: What analytical approach uses viral genome sequences to infer evolutionary relationships and construct phylogenetic trees?
- Phylogenetic analysis (correct)
- Southern blotting
- Western blotting
- Restriction mapping
Virology - Molecular Tools and Viral Genomics Quiz Question 10: What is the term for the process that combines millions of short reads from next‑generation sequencing into a complete viral genome?
- Genome assembly (correct)
- Polymerase chain reaction
- Cloning
- Electrophoretic separation
Virology - Molecular Tools and Viral Genomics Quiz Question 11: Which sequencing method is traditionally used to obtain high‑accuracy reads of individual viral DNA fragments?
- Sanger sequencing (correct)
- Next‑generation sequencing
- Polymerase chain reaction
- Southern blotting
Virology - Molecular Tools and Viral Genomics Quiz Question 12: Approximately how many unique viral genome sequences are archived in public databases such as GenBank?
- Over two million (correct)
- About one hundred thousand
- Ten million
- Five thousand
Virology - Molecular Tools and Viral Genomics Quiz Question 13: What type of cloning vector is most commonly employed to propagate viral DNA inserts in bacterial cells?
- Plasmid vector (correct)
- Bacteriophage lambda
- Yeast artificial chromosome
- Cosmid
Virology - Molecular Tools and Viral Genomics Quiz Question 14: Which stain is most commonly used to visualise viral proteins after separation by gel electrophoresis?
- Coomassie blue (correct)
- Ethidium bromide
- SYBR Gold
- Silver nitrate
What process enables exchange of genetic material between viral nucleic acid molecules without swapping entire genome segments?
1 of 14
Key Concepts
Centrifugation Techniques
Differential centrifugation
Buoyant density centrifugation
Molecular Techniques
Gel electrophoresis
Next‑generation sequencing
Phylogenetic analysis
Reverse genetics
Metaviromics
Viral Studies
Recombinant adenovirus vector
Viral reassortment
Bacteriophage
Definitions
Differential centrifugation
A technique that separates particles by size and density using sequential low‑ and high‑speed spins, commonly employed to isolate viruses from cellular debris.
Buoyant density centrifugation
A method that separates macromolecules in a density gradient (e.g., cesium chloride) to purify viruses or their nucleic acids based on buoyant density.
Gel electrophoresis
A laboratory technique that resolves proteins or nucleic acids in an agarose or polyacrylamide matrix under an electric field according to size and charge.
Next‑generation sequencing
High‑throughput DNA sequencing technologies that generate millions of short reads, enabling rapid assembly of complete viral genomes.
Phylogenetic analysis
Computational reconstruction of evolutionary relationships among organisms or viruses using sequence data to produce phylogenetic trees.
Reverse genetics
An approach that starts with a known viral genome sequence to synthesize infectious clones, allowing the creation of engineered viruses for functional studies.
Metaviromics
The study of viral diversity by directly sequencing viral nucleic acids from environmental samples without prior culturing.
Recombinant adenovirus vector
A genetically modified adenovirus engineered to deliver therapeutic genes or vaccine antigens to target cells.
Viral reassortment
The exchange of whole genome segments between two related segmented viruses, generating novel genotype combinations.
Bacteriophage
A virus that infects bacteria, widely used as a tool in molecular biology and for applications such as phage display.