Central Dogma and DNA‑Protein Interactions
Understand DNA replication, transcription, and translation, and how DNA‑binding proteins—including histones, HMG proteins, and transcription factors—regulate and organize genetic information.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
In what direction does DNA polymerase synthesize a new DNA strand?
1 of 16
Summary
DNA Replication, Transcription, and Translation
DNA is the molecule of heredity, but it only exerts its influence through its interactions with other cellular molecules. The core processes of DNA replication, transcription, and translation form the foundation for how genetic information is stored, copied, and expressed. Understanding these processes is essential to understanding all of molecular biology.
DNA Replication: The Leading and Lagging Strands
DNA replication is the process by which cells copy their genetic material before dividing. The enzyme DNA polymerase is responsible for synthesizing new DNA strands that are complementary to the template strands. A critical feature of this process is the directionality of synthesis: DNA polymerase can only synthesize DNA in the 5′→3′ direction (from the 5′ phosphate end toward the 3′ hydroxyl end).
This creates a problem. Recall that DNA strands are antiparallel—they run in opposite directions. When the DNA double helix unwinds during replication, one template strand runs 3′→5′, making it easy for DNA polymerase to synthesize the complementary strand continuously in the 5′→3′ direction. This continuously synthesized strand is called the leading strand.
The other template strand runs 5′→3′, which is opposite to the direction DNA polymerase works. To solve this, the complementary strand is synthesized in short fragments called Okazaki fragments, which are made in the 5′→3′ direction but on the overall "backward" strand. This discontinuously synthesized strand is called the lagging strand. These fragments are later joined together by DNA ligase to form a continuous strand.
Transcription: From DNA to RNA
While DNA replication copies DNA into DNA, transcription is the process of copying a DNA gene into a complementary RNA molecule. This process is performed by the enzyme RNA polymerase, which reads the DNA template strand and synthesizes a complementary RNA strand.
The key difference between DNA and RNA is one nucleotide: RNA contains uracil (U) instead of thymine (T). So wherever DNA has an adenine (A) paired with thymine (T), the RNA transcript will have adenine paired with uracil. This RNA molecule is called messenger RNA (mRNA) because it carries the genetic code from the DNA to the ribosome, where the information will be translated into a protein.
Translation and the Genetic Code
Proteins are synthesized by ribosomes using instructions encoded in mRNA. The mRNA sequence is "read" in groups of three nucleotides called codons. Since there are four possible nucleotides (A, U, G, C), there are $4^3 = 64$ possible codons.
Each codon specifies which amino acid should be added to the growing protein chain. With only 20 standard amino acids, this means that multiple codons code for the same amino acid (the genetic code is degenerate). Three special codons—UAA, UAG, and UGA—do not code for amino acids. Instead, they function as stop signals that terminate translation and signal the release of the completed protein from the ribosome.
DNA-Protein Interactions: The Foundation of Genetic Regulation
DNA doesn't work alone in the cell. Its functions—replication, transcription, repair, and regulation—all depend critically on interactions with proteins. Understanding the types of DNA-protein interactions is essential for understanding how cells control gene expression.
Non-Specific vs. Sequence-Specific Binding
DNA-protein interactions fall into two broad categories:
Non-specific DNA-protein interactions occur when a protein binds to DNA regardless of its base sequence. These interactions typically involve the protein contacting the sugar-phosphate backbone of DNA, which is the same along the entire length of both strands. Structural proteins that compact DNA into chromatin often use this type of binding.
Sequence-specific DNA-protein interactions occur when a protein binds only to particular DNA sequences. These interactions involve the protein recognizing and binding to the bases themselves, which differ depending on the sequence. Regulatory proteins like transcription factors use this type of binding to find their target genes.
Structural DNA-Binding Proteins: Organizing the Genome
The human genome contains about 3 billion base pairs. If stretched out, this DNA would be roughly 2 meters long—yet it must fit inside a nucleus that is only about 10 micrometers in diameter. Structural DNA-binding proteins solve this problem by compacting DNA into an organized structure called chromatin.
Histones and Nucleosomes
In eukaryotes, the primary structural protein that organizes DNA is histones. DNA wraps around a core of eight histone proteins (an octamer: two copies each of four different histone types) approximately 1.65 times around the octamer to form a structure called a nucleosome. Each nucleosome contains about 147 base pairs of DNA in two complete turns.
The interaction between histones and DNA is electrostatic in nature: histone proteins are rich in basic amino acids (lysine and arginine, which carry positive charges), while the DNA backbone is acidic (the phosphate groups carry negative charges). These ionic bonds between the positive histones and negative DNA backbone hold the nucleosome together.
Nucleosomes are not randomly distributed—they are spaced roughly 200 base pairs apart along the DNA, with linker DNA between them. This creates the "beads on a string" appearance visible under electron microscopy.
Histone Chemical Modifications and Gene Regulation
A crucial feature of histone-DNA interactions is that they are not permanent. The histone proteins have flexible "tails" extending from the nucleosome core, and these tails can be chemically modified by three types of post-translational modifications:
Methylation: Addition of methyl groups to lysine or arginine residues
Phosphorylation: Addition of phosphate groups to serine, threonine, or tyrosine residues
Acetylation: Addition of acetyl groups to lysine residues
These chemical modifications alter the charge on the histone tail, which in turn affects how tightly the histone binds to the DNA. For example, acetylation adds negative charges to positively charged lysines, weakening the ionic bond between histones and DNA and making the DNA more accessible to other proteins. Conversely, certain methylation patterns strengthen histone-DNA binding.
Because these modifications change DNA accessibility, they play a critical role in regulating gene expression. A gene wrapped tightly around histones is inaccessible to transcription factors and RNA polymerase, so it is not transcribed. Loosening the histone-DNA interaction through acetylation makes the gene accessible and can turn it "on."
<extrainfo>
High-Mobility Group Proteins
Beyond histones, another class of structural proteins called high-mobility group (HMG) proteins helps organize DNA at larger scales. These proteins bind to DNA that is bent or distorted (not the typical straight double helix). By binding to bent DNA, HMG proteins help compact nucleosome arrays into higher-order chromosomal structures. These proteins are named for their high mobility in gel electrophoresis, a separation technique that was historically important for their discovery.
Single-Stranded DNA-Binding Proteins
During DNA replication, recombination, and repair, the DNA double helix is unwound, exposing single-stranded DNA that is vulnerable to degradation by enzymes called nucleases. A protein called replication protein A (RPA) binds to these single strands, stabilizing them and protecting them from nuclease attack. RPA covers the single-stranded DNA without blocking access to other proteins that participate in replication and repair, making it an essential non-specific DNA-binding protein for these processes.
</extrainfo>
Sequence-Specific DNA-Binding Proteins: Controlling Gene Expression
While structural proteins organize DNA into chromatin, sequence-specific DNA-binding proteins regulate which genes are transcribed. The most important class of these regulatory proteins is transcription factors.
How Transcription Factors Work
Transcription factors are proteins that bind to specific DNA sequences, usually located near the promoter—the region where RNA polymerase begins transcription. A transcription factor recognizes and binds only to its target sequences through specific contacts with the DNA bases.
The specificity of this recognition comes from an important structural detail: transcription factors make contacts with the edges of the DNA bases, primarily in the major groove (the wider groove of the DNA double helix). The bases line the major groove with different patterns of hydrogen bond donors and acceptors depending on whether they are adenine, guanine, cytosine, or thymine. A transcription factor's DNA-binding domain has a three-dimensional shape that is complementary to these patterns, allowing it to "recognize" specific sequences like a lock recognizing its key.
Once bound to their target sequences, transcription factors regulate gene expression through two main mechanisms:
Activation: A transcription factor can activate transcription by physically recruiting RNA polymerase to the promoter, or indirectly by recruiting mediator proteins—large protein complexes that bridge between transcription factors and RNA polymerase. This brings the machinery for transcription into position and allows the gene to be transcribed.
Repression: A transcription factor can repress transcription by recruiting enzymes that modify histones in ways that strengthen histone-DNA binding. For example, certain methylation patterns or deacetylation by histone deacetylases increase the strength of histone binding, making the DNA less accessible and preventing transcription.
The Widespread Impact of Transcription Factors
One transcription factor can influence the expression of thousands of genes because its target sequences are scattered throughout the genome. Different genes may have the same transcription factor binding site in their promoter regions, meaning that a single transcription factor regulates a whole suite of genes.
This widespread regulatory capacity makes transcription factors common targets for signal-transduction pathways—cellular communication systems that respond to external signals (like hormones or growth factors) and internal conditions. When a cell receives a signal to differentiate, proliferate, or respond to stress, it often does so by activating or inactivating specific transcription factors. These transcription factors then coordinately regulate the genes necessary for the appropriate cellular response.
Understanding transcription factors is therefore crucial to understanding how cells respond to their environment and how cells differentiate during development.
Flashcards
In what direction does DNA polymerase synthesize a new DNA strand?
5′→3′ direction
Which DNA strand is synthesized continuously during replication?
Leading strand
Which DNA strand is synthesized discontinuously during replication?
Lagging strand
Which nitrogenous base does RNA polymerase substitute for Thymine (T) when copying DNA?
Uracil (U)
What is the primary function of messenger RNA (mRNA) in the cell?
Carrying the genetic code to the ribosome
How many total possible codons exist in the genetic code?
64
What are the three specific mRNA codons that act as stop signals to terminate translation?
UAA
UAG
UGA
What part of the DNA molecule do non-specific DNA-binding proteins interact with?
Sugar-phosphate backbone
What determines the binding site for sequence-specific DNA-binding proteins?
Particular DNA sequences recognized by the protein
What is the name of the highly organized complex formed by compacting DNA with structural proteins?
Chromatin
What type of chemical bond mediates the interaction between histones and the DNA backbone?
Ionic bonds
What are the three primary types of chemical modifications that can occur on histone tails?
Methylation
Phosphorylation
Acetylation
What is the primary role of high-mobility group (HMG) proteins in chromosome structure?
Bending nucleosome arrays into higher-order structures
What is the function of Replication protein A (RPA) when it binds to single-stranded DNA?
Stabilizing the DNA and protecting it from nucleases
In which part of the DNA double helix do most sequence-specific transcription factor contacts occur?
Major groove
Why can a single transcription factor influence thousands of different genes?
Its target sequences are distributed throughout the genome
Quiz
Central Dogma and DNA‑Protein Interactions Quiz Question 1: In which direction does DNA polymerase add nucleotides during DNA synthesis?
- 5′→3′ (correct)
- 3′→5′
- Both directions simultaneously
- Randomly
Central Dogma and DNA‑Protein Interactions Quiz Question 2: Which newly synthesized strand is formed discontinuously during DNA replication?
- Lagging strand (correct)
- Leading strand
- Both strands equally
- Neither strand
Central Dogma and DNA‑Protein Interactions Quiz Question 3: How many different codons exist in the standard genetic code?
- 64 (correct)
- 61
- 20
- 4
Central Dogma and DNA‑Protein Interactions Quiz Question 4: Which class of enzymes binds DNA to copy its sequence during both transcription and replication?
- Polymerases (correct)
- Ligases
- Topoisomerases
- Helicases
Central Dogma and DNA‑Protein Interactions Quiz Question 5: Which type of chemical interaction primarily mediates histone‑DNA binding?
- Ionic bonds (correct)
- Covalent bonds
- Hydrogen bonds
- Van der Waals forces
Central Dogma and DNA‑Protein Interactions Quiz Question 6: Where do transcription factors typically bind to regulate gene expression?
- Specific DNA sequences near promoters (correct)
- Random locations throughout the genome
- Only within introns
- At telomeric repeats
Central Dogma and DNA‑Protein Interactions Quiz Question 7: The sequence specificity of many transcription factors mainly arises from contacts with which DNA region?
- Major groove (correct)
- Minor groove
- Phosphate backbone
- RNA polymerase binding site
Central Dogma and DNA‑Protein Interactions Quiz Question 8: Which enzyme is responsible for synthesizing a complementary RNA strand from a DNA gene during transcription?
- RNA polymerase (correct)
- DNA polymerase
- Ribosome
- Helicase
Central Dogma and DNA‑Protein Interactions Quiz Question 9: Which type of DNA‑protein interaction involves binding to the sugar‑phosphate backbone regardless of the DNA base sequence?
- Non‑specific DNA‑protein interactions (correct)
- Sequence‑specific DNA‑protein interactions
- Covalent DNA‑protein interactions
- Enzymatic DNA‑protein interactions
Central Dogma and DNA‑Protein Interactions Quiz Question 10: High‑mobility group (HMG) proteins preferentially bind which form of DNA?
- Bent or distorted DNA (correct)
- Straight double‑helix DNA
- RNA molecules
- Lipid membranes
Central Dogma and DNA‑Protein Interactions Quiz Question 11: Which chemical modification of histone tails is most directly associated with loosening histone‑DNA interaction and increasing DNA accessibility?
- Acetylation (correct)
- Methylation
- Phosphorylation
- Ubiquitination
Central Dogma and DNA‑Protein Interactions Quiz Question 12: Sequence‑specific DNA‑binding proteins, such as transcription factors, primarily recognize DNA through what feature?
- Specific base sequences (correct)
- The overall shape of the DNA backbone
- Interactions with RNA molecules
- Binding to histone tails
Central Dogma and DNA‑Protein Interactions Quiz Question 13: Which of the following is NOT a role performed by DNA‑binding proteins in the cell?
- Synthesizing lipids (correct)
- Organizing DNA into higher‑order structures
- Protecting DNA from damage
- Regulating transcription and other DNA processes
Central Dogma and DNA‑Protein Interactions Quiz Question 14: The highly organized nuclear complex formed when structural DNA‑binding proteins bind DNA is called what?
- Chromatin (correct)
- Nucleolus
- Ribosome
- Spliceosome
Central Dogma and DNA‑Protein Interactions Quiz Question 15: One of the main protective functions of Replication protein A after binding single‑stranded DNA is to prevent degradation by what type of enzyme?
- Nucleases (correct)
- Kinases
- Ligases
- Polymerases
In which direction does DNA polymerase add nucleotides during DNA synthesis?
1 of 15
Key Concepts
DNA Processes
DNA replication
Transcription
Translation
DNA Structure and Regulation
DNA‑binding protein
Histone
Nucleosome
Transcription factor
High‑mobility group protein
Replication protein A
Genetic Information
Genetic code
Definitions
DNA replication
The cellular process by which DNA polymerases synthesize new strands complementary to each template strand, producing two identical DNA molecules.
Transcription
The synthesis of an RNA strand from a DNA template by RNA polymerase, substituting uracil for thymine.
Translation
The ribosomal decoding of messenger RNA codons into a polypeptide chain, assembling amino acids according to the genetic code.
DNA‑binding protein
A protein that interacts with DNA, either non‑specifically with the backbone or sequence‑specifically with particular nucleotide motifs, to regulate DNA functions.
Histone
A basic protein that forms octameric cores around which DNA wraps, constituting the core of nucleosomes and facilitating chromatin compaction.
Nucleosome
The fundamental unit of chromatin, consisting of ~147 base pairs of DNA wrapped in two turns around a histone octamer.
Transcription factor
A sequence‑specific DNA‑binding protein that modulates gene expression by recruiting or inhibiting RNA polymerase and associated co‑activators or co‑repressors.
High‑mobility group protein
A non‑histone chromatin protein that binds to bent or distorted DNA, promoting DNA flexibility and higher‑order chromosomal architecture.
Replication protein A
A heterotrimeric single‑stranded DNA‑binding protein complex that stabilizes ssDNA during replication, recombination, and repair.
Genetic code
The set of 64 three‑nucleotide codons that specify the twenty standard amino acids or act as stop signals in protein synthesis.