RemNote Community
Community

Central Dogma and DNA‑Protein Interactions

Understand DNA replication, transcription, and translation, and how DNA‑binding proteins—including histones, HMG proteins, and transcription factors—regulate and organize genetic information.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz

Quick Practice

In what direction does DNA polymerase synthesize a new DNA strand?
1 of 16

Summary

DNA Replication, Transcription, and Translation DNA is the molecule of heredity, but it only exerts its influence through its interactions with other cellular molecules. The core processes of DNA replication, transcription, and translation form the foundation for how genetic information is stored, copied, and expressed. Understanding these processes is essential to understanding all of molecular biology. DNA Replication: The Leading and Lagging Strands DNA replication is the process by which cells copy their genetic material before dividing. The enzyme DNA polymerase is responsible for synthesizing new DNA strands that are complementary to the template strands. A critical feature of this process is the directionality of synthesis: DNA polymerase can only synthesize DNA in the 5′→3′ direction (from the 5′ phosphate end toward the 3′ hydroxyl end). This creates a problem. Recall that DNA strands are antiparallel—they run in opposite directions. When the DNA double helix unwinds during replication, one template strand runs 3′→5′, making it easy for DNA polymerase to synthesize the complementary strand continuously in the 5′→3′ direction. This continuously synthesized strand is called the leading strand. The other template strand runs 5′→3′, which is opposite to the direction DNA polymerase works. To solve this, the complementary strand is synthesized in short fragments called Okazaki fragments, which are made in the 5′→3′ direction but on the overall "backward" strand. This discontinuously synthesized strand is called the lagging strand. These fragments are later joined together by DNA ligase to form a continuous strand. Transcription: From DNA to RNA While DNA replication copies DNA into DNA, transcription is the process of copying a DNA gene into a complementary RNA molecule. This process is performed by the enzyme RNA polymerase, which reads the DNA template strand and synthesizes a complementary RNA strand. The key difference between DNA and RNA is one nucleotide: RNA contains uracil (U) instead of thymine (T). So wherever DNA has an adenine (A) paired with thymine (T), the RNA transcript will have adenine paired with uracil. This RNA molecule is called messenger RNA (mRNA) because it carries the genetic code from the DNA to the ribosome, where the information will be translated into a protein. Translation and the Genetic Code Proteins are synthesized by ribosomes using instructions encoded in mRNA. The mRNA sequence is "read" in groups of three nucleotides called codons. Since there are four possible nucleotides (A, U, G, C), there are $4^3 = 64$ possible codons. Each codon specifies which amino acid should be added to the growing protein chain. With only 20 standard amino acids, this means that multiple codons code for the same amino acid (the genetic code is degenerate). Three special codons—UAA, UAG, and UGA—do not code for amino acids. Instead, they function as stop signals that terminate translation and signal the release of the completed protein from the ribosome. DNA-Protein Interactions: The Foundation of Genetic Regulation DNA doesn't work alone in the cell. Its functions—replication, transcription, repair, and regulation—all depend critically on interactions with proteins. Understanding the types of DNA-protein interactions is essential for understanding how cells control gene expression. Non-Specific vs. Sequence-Specific Binding DNA-protein interactions fall into two broad categories: Non-specific DNA-protein interactions occur when a protein binds to DNA regardless of its base sequence. These interactions typically involve the protein contacting the sugar-phosphate backbone of DNA, which is the same along the entire length of both strands. Structural proteins that compact DNA into chromatin often use this type of binding. Sequence-specific DNA-protein interactions occur when a protein binds only to particular DNA sequences. These interactions involve the protein recognizing and binding to the bases themselves, which differ depending on the sequence. Regulatory proteins like transcription factors use this type of binding to find their target genes. Structural DNA-Binding Proteins: Organizing the Genome The human genome contains about 3 billion base pairs. If stretched out, this DNA would be roughly 2 meters long—yet it must fit inside a nucleus that is only about 10 micrometers in diameter. Structural DNA-binding proteins solve this problem by compacting DNA into an organized structure called chromatin. Histones and Nucleosomes In eukaryotes, the primary structural protein that organizes DNA is histones. DNA wraps around a core of eight histone proteins (an octamer: two copies each of four different histone types) approximately 1.65 times around the octamer to form a structure called a nucleosome. Each nucleosome contains about 147 base pairs of DNA in two complete turns. The interaction between histones and DNA is electrostatic in nature: histone proteins are rich in basic amino acids (lysine and arginine, which carry positive charges), while the DNA backbone is acidic (the phosphate groups carry negative charges). These ionic bonds between the positive histones and negative DNA backbone hold the nucleosome together. Nucleosomes are not randomly distributed—they are spaced roughly 200 base pairs apart along the DNA, with linker DNA between them. This creates the "beads on a string" appearance visible under electron microscopy. Histone Chemical Modifications and Gene Regulation A crucial feature of histone-DNA interactions is that they are not permanent. The histone proteins have flexible "tails" extending from the nucleosome core, and these tails can be chemically modified by three types of post-translational modifications: Methylation: Addition of methyl groups to lysine or arginine residues Phosphorylation: Addition of phosphate groups to serine, threonine, or tyrosine residues Acetylation: Addition of acetyl groups to lysine residues These chemical modifications alter the charge on the histone tail, which in turn affects how tightly the histone binds to the DNA. For example, acetylation adds negative charges to positively charged lysines, weakening the ionic bond between histones and DNA and making the DNA more accessible to other proteins. Conversely, certain methylation patterns strengthen histone-DNA binding. Because these modifications change DNA accessibility, they play a critical role in regulating gene expression. A gene wrapped tightly around histones is inaccessible to transcription factors and RNA polymerase, so it is not transcribed. Loosening the histone-DNA interaction through acetylation makes the gene accessible and can turn it "on." <extrainfo> High-Mobility Group Proteins Beyond histones, another class of structural proteins called high-mobility group (HMG) proteins helps organize DNA at larger scales. These proteins bind to DNA that is bent or distorted (not the typical straight double helix). By binding to bent DNA, HMG proteins help compact nucleosome arrays into higher-order chromosomal structures. These proteins are named for their high mobility in gel electrophoresis, a separation technique that was historically important for their discovery. Single-Stranded DNA-Binding Proteins During DNA replication, recombination, and repair, the DNA double helix is unwound, exposing single-stranded DNA that is vulnerable to degradation by enzymes called nucleases. A protein called replication protein A (RPA) binds to these single strands, stabilizing them and protecting them from nuclease attack. RPA covers the single-stranded DNA without blocking access to other proteins that participate in replication and repair, making it an essential non-specific DNA-binding protein for these processes. </extrainfo> Sequence-Specific DNA-Binding Proteins: Controlling Gene Expression While structural proteins organize DNA into chromatin, sequence-specific DNA-binding proteins regulate which genes are transcribed. The most important class of these regulatory proteins is transcription factors. How Transcription Factors Work Transcription factors are proteins that bind to specific DNA sequences, usually located near the promoter—the region where RNA polymerase begins transcription. A transcription factor recognizes and binds only to its target sequences through specific contacts with the DNA bases. The specificity of this recognition comes from an important structural detail: transcription factors make contacts with the edges of the DNA bases, primarily in the major groove (the wider groove of the DNA double helix). The bases line the major groove with different patterns of hydrogen bond donors and acceptors depending on whether they are adenine, guanine, cytosine, or thymine. A transcription factor's DNA-binding domain has a three-dimensional shape that is complementary to these patterns, allowing it to "recognize" specific sequences like a lock recognizing its key. Once bound to their target sequences, transcription factors regulate gene expression through two main mechanisms: Activation: A transcription factor can activate transcription by physically recruiting RNA polymerase to the promoter, or indirectly by recruiting mediator proteins—large protein complexes that bridge between transcription factors and RNA polymerase. This brings the machinery for transcription into position and allows the gene to be transcribed. Repression: A transcription factor can repress transcription by recruiting enzymes that modify histones in ways that strengthen histone-DNA binding. For example, certain methylation patterns or deacetylation by histone deacetylases increase the strength of histone binding, making the DNA less accessible and preventing transcription. The Widespread Impact of Transcription Factors One transcription factor can influence the expression of thousands of genes because its target sequences are scattered throughout the genome. Different genes may have the same transcription factor binding site in their promoter regions, meaning that a single transcription factor regulates a whole suite of genes. This widespread regulatory capacity makes transcription factors common targets for signal-transduction pathways—cellular communication systems that respond to external signals (like hormones or growth factors) and internal conditions. When a cell receives a signal to differentiate, proliferate, or respond to stress, it often does so by activating or inactivating specific transcription factors. These transcription factors then coordinately regulate the genes necessary for the appropriate cellular response. Understanding transcription factors is therefore crucial to understanding how cells respond to their environment and how cells differentiate during development.
Flashcards
In what direction does DNA polymerase synthesize a new DNA strand?
5′→3′ direction
Which DNA strand is synthesized continuously during replication?
Leading strand
Which DNA strand is synthesized discontinuously during replication?
Lagging strand
Which nitrogenous base does RNA polymerase substitute for Thymine (T) when copying DNA?
Uracil (U)
What is the primary function of messenger RNA (mRNA) in the cell?
Carrying the genetic code to the ribosome
How many total possible codons exist in the genetic code?
64
What are the three specific mRNA codons that act as stop signals to terminate translation?
UAA UAG UGA
What part of the DNA molecule do non-specific DNA-binding proteins interact with?
Sugar-phosphate backbone
What determines the binding site for sequence-specific DNA-binding proteins?
Particular DNA sequences recognized by the protein
What is the name of the highly organized complex formed by compacting DNA with structural proteins?
Chromatin
What type of chemical bond mediates the interaction between histones and the DNA backbone?
Ionic bonds
What are the three primary types of chemical modifications that can occur on histone tails?
Methylation Phosphorylation Acetylation
What is the primary role of high-mobility group (HMG) proteins in chromosome structure?
Bending nucleosome arrays into higher-order structures
What is the function of Replication protein A (RPA) when it binds to single-stranded DNA?
Stabilizing the DNA and protecting it from nucleases
In which part of the DNA double helix do most sequence-specific transcription factor contacts occur?
Major groove
Why can a single transcription factor influence thousands of different genes?
Its target sequences are distributed throughout the genome

Quiz

In which direction does DNA polymerase add nucleotides during DNA synthesis?
1 of 15
Key Concepts
DNA Processes
DNA replication
Transcription
Translation
DNA Structure and Regulation
DNA‑binding protein
Histone
Nucleosome
Transcription factor
High‑mobility group protein
Replication protein A
Genetic Information
Genetic code