Core Mechanism of Gene Expression
Understand transcription mechanisms, mRNA processing steps, and translation initiation and elongation.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
What is the general process of transcription?
1 of 22
Summary
Mechanism of Gene Expression
Gene expression is the process by which cells convert the genetic information stored in DNA into functional proteins. This involves two major steps: transcription, where DNA is copied into messenger RNA (mRNA), and translation, where mRNA is decoded to synthesize proteins. Let's walk through each component of this process.
Transcription: Creating RNA from DNA
Transcription is the synthesis of an RNA copy from a DNA template strand. An enzyme called RNA polymerase reads the template DNA strand and builds a complementary RNA molecule, nucleotide by nucleotide.
A crucial difference between DNA and RNA is that wherever DNA contains the base thymine (T), RNA contains uracil (U) instead. This substitution happens automatically as RNA polymerase synthesizes the RNA strand.
For example, if the DNA template reads: 3'-TACGAT-5' The RNA produced reads: 5'-AUGCUA-3'
RNA Polymerases: Prokaryotes vs. Eukaryotes
The complexity of transcription varies dramatically between prokaryotes and eukaryotes.
In prokaryotes, a single RNA polymerase performs all transcription. This polymerase cannot bind directly to DNA on its own—it requires a protein called a sigma factor that helps it recognize and bind to a specific promoter sequence called the Pribnow box. Once the sigma factor positions the RNA polymerase at the promoter, transcription begins.
In eukaryotes, three different nuclear RNA polymerases exist, each with specialized roles:
RNA Polymerase I (Pol I) transcribes most ribosomal RNA genes
RNA Polymerase II (Pol II) transcribes all protein-coding genes and many non-coding RNAs
RNA Polymerase III (Pol III) transcribes transfer RNA genes, 5S ribosomal RNA, and other small non-coding RNAs
This specialization in eukaryotes reflects their greater complexity and the need for more sophisticated gene regulation.
Transcription Termination
Transcription doesn't continue indefinitely. When RNA polymerase encounters a terminator sequence in the DNA, it stops synthesis and releases the newly formed RNA molecule. The specific mechanism of termination differs slightly between prokaryotes and eukaryotes, but the principle is the same: a DNA signal tells the polymerase "stop here."
mRNA Processing in Eukaryotes: From Pre-mRNA to Mature mRNA
Here's something that often surprises students: in eukaryotes, the RNA that RNA polymerase II initially produces is not the same as the mRNA that gets translated into protein. The initial product, called pre-mRNA (or primary transcript), must undergo extensive modifications before it becomes functional mature mRNA.
This processing step is critical and doesn't occur in prokaryotes—an important distinction. Let's examine each modification:
5' Capping: Protecting the RNA's Beginning
The first modification happens at the 5' end of pre-mRNA. A special chemical structure called a 7-methylguanosine cap is added. Think of this cap as a protective hat on the RNA molecule.
This cap serves multiple functions:
Protects the RNA from being degraded by enzymes called exonucleases that would otherwise chew away at the 5' end
Aids in nuclear export, helping the mRNA leave the nucleus
Marks the RNA as "self", protecting it from immune defenses that target foreign RNA
The cap is recognized by proteins called the cap-binding complex, which binds to it and prevents removal of the cap.
3' Polyadenylation: Protecting the RNA's End
At the 3' end, a different protection strategy is employed. The pre-mRNA is cleaved at a specific location downstream of a polyadenylation signal sequence (the sequence AAUAAA). After cleavage, approximately 200 adenine nucleotides are added one by one, forming a poly(A) tail.
Much like the 5' cap, the poly(A) tail protects the RNA from degradation. Additionally, proteins called poly(A)-binding proteins attach to this tail and:
Promote the mRNA's export from the nucleus
Help the ribosome reinitiate translation (restart protein synthesis) during translation
However, the poly(A) tail doesn't last forever. Over time, deadenylation occurs—enzymes like the CCR4-Not exonuclease complex gradually shorten the poly(A) tail. This shortening process often leads to transcript decay, allowing cells to control how long an mRNA persists.
RNA Splicing: Removing Introns and Joining Exons
Here's perhaps the most striking difference between eukaryotic and prokaryotic genes: eukaryotic genes are typically interrupted. They contain exons (regions that code for protein) and introns (non-coding regions inserted between exons). Prokaryotic genes typically lack introns.
During splicing, the spliceosome—a massive, complex molecular machine composed of RNA and proteins—removes the introns and joins the exons together. Remarkably, the intron is removed as a lariat-shaped structure (it forms a loop), leaving no gaps in the final mRNA.
Consider a simplified example:
Pre-mRNA: Exon1—Intron1—Exon2—Intron2—Exon3
After splicing: Exon1—Exon2—Exon3
The introns are discarded, and the exons are directly joined.
Alternative Splicing: Multiplying Protein Diversity
Here's where gene expression becomes truly elegant. Alternative splicing allows a single gene to produce multiple different mRNAs by selectively including or excluding specific exons (or introns).
For example:
One version of the mRNA might include Exon1, Exon2, and Exon3
A different version might skip Exon2 entirely (Exon1, Exon3)
A third version might include an intron sequence that normally would be removed
Each different mRNA produces a slightly different protein. This mechanism dramatically expands protein diversity—estimates suggest that humans produce over 100,000 different proteins from only about 20,000 genes, largely due to alternative splicing. This explains how organisms can achieve such proteome complexity from a relatively modest number of genes.
<extrainfo>
Non-Coding RNA Processing
Beyond mRNA, eukaryotic cells produce many other types of RNA, each with their own processing requirements.
Ribosomal RNA Maturation
Ribosomal RNA (rRNA) genes are transcribed as large precursor molecules containing one or more rRNA sequences. These precursors must be cleaved to produce the individual rRNA molecules. Additionally, rRNA undergoes chemical modifications—specifically 2'-O-methylation and pseudouridine formation—at specific sites within the molecule. Small nucleolar RNAs (snoRNAs) and associated proteins guide these modifications to the correct locations.
MicroRNA Biogenesis
MicroRNAs (miRNAs) are small regulatory RNAs that play important roles in controlling gene expression. Their biogenesis is a multi-step process:
Primary transcript (pri-miRNA) is transcribed and capped and polyadenylated like mRNA
Nuclear processing: An enzyme called Drosha (along with its partner Pasha) cleaves the pri-miRNA in the nucleus, producing a hairpin-shaped precursor miRNA (pre-miRNA)
Nuclear export: The pre-miRNA is exported to the cytoplasm
Cytoplasmic processing: An enzyme called Dicer cuts the pre-miRNA into a small double-stranded RNA
RISC assembly: The mature miRNA associates with a protein called Argonaute to form the RNA-induced silencing complex (RISC), which can then silence target genes
</extrainfo>
Translation: Reading the mRNA Code
Translation is the process where ribosomes read the mRNA sequence and synthesize proteins. Before diving into the mechanism, you need to understand how the genetic code is structured in mRNA.
mRNA Structure: The Blueprint for Proteins
Every mRNA has a consistent three-part structure:
5' Untranslated Region (5' UTR): This region comes before the protein-coding sequence and is not translated into amino acids. It often contains regulatory sequences.
Open Reading Frame (ORF): This is the actual protein-coding region. It's called "open" because it contains a continuous sequence of codons with no stop signals between them.
3' Untranslated Region (3' UTR): This region comes after the protein-coding sequence and is also not translated. It often contains sequences that regulate mRNA stability or localization.
The ORF is read in groups of three nucleotides called codons. Each codon specifies which amino acid should be added to the growing protein chain. With 4 nucleotide bases and codons of length 3, there are 64 possible codons, which code for 20 amino acids (plus stop signals). This redundancy is called the genetic code's degeneracy.
How the Ribosome Reads Codons
The ribosome doesn't directly recognize codons on its own. Instead, it depends on transfer RNA (tRNA) molecules. Each tRNA carries:
One specific amino acid attached to its 3' end
An anticodon (three nucleotides) that is complementary to an mRNA codon
When the ribosome brings an mRNA codon into the correct position, a tRNA with the matching anticodon base-pairs with it through complementary base pairing. This ensures the correct amino acid is selected.
The ribosome then catalyzes the formation of a peptide bond between the newly arrived amino acid and the previous amino acid in the growing chain. This process repeats—the ribosome moves one codon forward, brings in the next tRNA, and continues adding amino acids until it encounters a stop codon (UAA, UAG, or UGA), signaling the end of translation.
Monocistronic vs. Polycistronic mRNA
An important distinction exists between how prokaryotes and eukaryotes organize protein-coding sequences:
Monocistronic mRNA (typical of eukaryotes) contains a single protein-coding sequence. One mRNA produces one protein.
Polycistronic mRNA (typical of prokaryotes) contains multiple protein-coding sequences. One mRNA produces multiple different proteins. This is possible because prokaryotic ribosomes can bind to internal ribosome binding sites and begin translation at multiple locations on the same mRNA.
Cotranslational Translocation: Where Proteins Are Made Matters
Here's a subtle but important point: not all proteins are synthesized in the same location. The destination of the protein determines where translation occurs.
Proteins destined for secretion or membrane insertion are synthesized on ribosomes bound to the endoplasmic reticulum (ER). This process begins when the ribosome synthesizes a signal peptide—a short amino acid sequence near the beginning of the protein. The signal recognition particle (SRP) recognizes this signal peptide and directs the entire ribosome-mRNA-nascent protein complex to the ER membrane. Translation then continues with the growing protein chain threading into the ER lumen as it's synthesized.
Cytoplasmic proteins (destined to remain in the cytoplasm) are synthesized on free ribosomes floating in the cytoplasm, not attached to any membrane. These proteins never encounter the ER.
This targeting system ensures proteins are synthesized in the compartment where they'll be used—a crucial aspect of cellular organization.
Flashcards
What is the general process of transcription?
Synthesis of an RNA copy from a DNA template strand by RNA polymerase.
Which DNA nucleotide is replaced by uracil in the resulting RNA strand?
Thymine.
What event marks the end of transcription?
RNA polymerase encounters a DNA terminator sequence.
What factor is required for the prokaryotic RNA polymerase to bind the promoter?
Sigma factor.
What are the three nuclear RNA polymerases and their primary targets?
RNA polymerase I: Ribosomal RNA (rRNA) genes
RNA polymerase II: Protein-coding genes and some non-coding RNAs
RNA polymerase III: 5S rRNA, tRNA genes, and small non-coding RNAs
What specific molecule is added to the 5′ end of eukaryotic pre-mRNA?
7-methylguanosine cap.
What is the specific polyadenylation signal sequence?
AAUAAA.
Roughly how many adenine nucleotides are added to the poly(A) tail?
Approximately 200.
Which complex is responsible for deadenylation and transcript decay?
CCR4-Not exonuclease.
What is the difference between exons and introns?
Exons are coding segments; introns are non-coding segments.
In what form are introns removed during splicing?
Lariat structures.
What is the mechanism that allows a single gene to produce multiple distinct proteins?
Alternative splicing.
What molecules guide the chemical modification (methylation/pseudouridine) of pre-rRNA?
Small nucleolar RNAs (snoRNAs).
Which enzymes cleave primary microRNA (pri-miRNA) to form precursor hairpins?
Drosha and Pasha.
What cytoplasmic enzyme processes precursor microRNA into mature microRNA?
Dicer.
What complex does mature microRNA assemble into to silence genes?
RNA-induced silencing complex (RISC).
How do tRNAs recognize specific codons on the mRNA?
Via complementary anticodons.
What are the three main functions facilitated by ribosomes during translation?
tRNA binding
Peptide bond formation
Polypeptide chain elongation
What are the three main regions of a messenger RNA molecule?
5′ untranslated region (UTR)
Open reading frame (ORF)
3′ untranslated region (UTR)
What is the difference between monocistronic and polycistronic mRNA?
Monocistronic encodes one protein (eukaryotes); polycistronic encodes multiple (prokaryotes).
Where does translation occur for proteins destined for secretion?
At the endoplasmic reticulum (ER).
What recognizes the signal peptide to direct a ribosome to the ER?
Signal recognition particle (SRP).
Quiz
Core Mechanism of Gene Expression Quiz Question 1: Which factor is required for prokaryotic RNA polymerase to recognize the promoter?
- Sigma factor (correct)
- TATA‑binding protein
- Transcription factor IIB
- Polyadenylation signal
Core Mechanism of Gene Expression Quiz Question 2: What modification is added to the 5′ end of pre‑mRNA to protect it from degradation?
- A 7‑methylguanosine cap (correct)
- A poly(A) tail
- A 5′ phosphate group
- A 5′ triphosphate group
Core Mechanism of Gene Expression Quiz Question 3: What complex removes introns as lariat structures during RNA splicing?
- The spliceosome (correct)
- The ribosome
- RNA polymerase II
- The exosome
Core Mechanism of Gene Expression Quiz Question 4: Which enzymes cleave primary microRNA transcripts to form precursor hairpins?
- Drosha and Pasha (correct)
- Dicer and Argonaute
- RNase P and tRNase Z
- RNA polymerase II and III
Core Mechanism of Gene Expression Quiz Question 5: How does each codon specify an amino acid?
- Via complementary anticodons on transfer RNA (correct)
- Through ribosomal binding sites alone
- By direct interaction with ribosomal RNA
- By aminoacyl‑tRNA synthetase recognition only
Core Mechanism of Gene Expression Quiz Question 6: What describes monocistronic messenger RNA?
- Contains a single protein‑coding sequence (correct)
- Contains multiple protein‑coding sequences
- Lacks a 5′ cap
- Is found only in prokaryotes
Core Mechanism of Gene Expression Quiz Question 7: In the RNA transcript, which nucleotide replaces thymine found in DNA?
- Uracil (correct)
- Cytosine
- Adenine
- Guanine
Core Mechanism of Gene Expression Quiz Question 8: Small nucleolar RNAs guide which chemical modifications of rRNA?
- 2′‑O‑methylation and pseudouridylation (correct)
- Phosphorylation and ubiquitination
- Methylation of DNA
- Glycosylation of proteins
Core Mechanism of Gene Expression Quiz Question 9: Where are proteins that function in the cytoplasm synthesized?
- On free ribosomes in the cytosol (correct)
- On ribosomes bound to the endoplasmic reticulum
- In the Golgi apparatus
- In mitochondria
Core Mechanism of Gene Expression Quiz Question 10: Which set of modifications does a primary RNA transcript (pre‑mRNA) typically undergo before becoming a mature mRNA?
- Capping at the 5′ end, splicing to remove introns, and polyadenylation at the 3′ end (correct)
- Addition of a 3′ phosphate, methylation of all nucleotides, and RNA‑directed DNA synthesis
- Direct export to the cytoplasm without any changes
- Conversion into a double‑stranded RNA molecule
Core Mechanism of Gene Expression Quiz Question 11: Which nucleotide is added repeatedly to the 3′ end of a eukaryotic pre‑mRNA during polyadenylation?
- adenine (A) (correct)
- uracil (U)
- guanine (G)
- cytosine (C)
Core Mechanism of Gene Expression Quiz Question 12: Which region of a messenger RNA is located immediately downstream of the 5′ cap and before the start codon?
- 5′ untranslated region (5′ UTR) (correct)
- Open reading frame (ORF)
- 3′ untranslated region (3′ UTR)
- Poly(A) tail
Core Mechanism of Gene Expression Quiz Question 13: What event directly causes transcription to stop during gene expression?
- RNA polymerase encounters a DNA terminator sequence (correct)
- RNA polymerase binds the promoter region
- RNA polymerase adds a poly(A) tail to the transcript
- RNA polymerase reaches a start codon
Which factor is required for prokaryotic RNA polymerase to recognize the promoter?
1 of 13
Key Concepts
Key Topics
Transcription
RNA polymerase II
5′ cap
Polyadenylation
RNA splicing
Alternative splicing
MicroRNA biogenesis
Translation initiation
Polycistronic mRNA
Ribosomal RNA processing
Definitions
Transcription
Synthesis of an RNA copy from a DNA template strand by RNA polymerase.
RNA polymerase II
Eukaryotic enzyme that transcribes all protein‑coding genes and many non‑coding RNAs.
5′ cap
A 7‑methylguanosine modification added to the 5′ end of pre‑mRNA to protect it and aid export.
Polyadenylation
Addition of a poly(A) tail to the 3′ end of pre‑mRNA, enhancing stability and translation.
RNA splicing
Removal of introns and ligation of exons from pre‑mRNA by the spliceosome.
Alternative splicing
Generation of multiple distinct mRNA isoforms from a single gene by varying exon inclusion.
MicroRNA biogenesis
Production of mature microRNAs from primary transcripts via Drosha and Dicer processing steps.
Translation initiation
First phase of protein synthesis where the ribosome assembles on mRNA and locates the start codon.
Polycistronic mRNA
Messenger RNA that encodes multiple protein‑coding sequences, typical of prokaryotes.
Ribosomal RNA processing
Cleavage and chemical modification of pre‑rRNA guided by small nucleolar RNAs to form functional rRNAs.