RemNote Community
Community

Molecular Basis of CRISPR Immunity

Understand the discovery of CRISPR immunity, the molecular steps of adaptation, expression, and interference, and the key functions of major Cas proteins.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz

Quick Practice

What is the physical structure of the repeat sequences in a CRISPR array as identified in early research?
1 of 25

Summary

CRISPR-Associated Immunity: From Discovery to Molecular Mechanism Introduction CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems represent a revolutionary discovery in molecular biology—a remarkable adaptive immune system that bacteria use to defend against viral infection and foreign DNA. Understanding CRISPR requires grasping three interconnected ideas: (1) how bacteria acquire and store genetic "memories" of past attacks, (2) how these memories are activated into defensive molecules, and (3) how these molecules precisely locate and destroy invading nucleic acids. This system works so well that it has become the most powerful tool available for genetic engineering. Discovery and Early Evidence of CRISPR-Associated Immunity What Researchers First Noticed Early genetic analysis revealed an unusual pattern in bacterial genomes: tandem repeat sequences interrupted by unique spacer sequences of variable length. These arrays were puzzling at first—why would bacteria maintain such organized repetitive DNA? The Critical 2007 Breakthrough In a landmark 2007 publication in Science, Barrangou and colleagues provided the first experimental proof that CRISPR provides acquired resistance against bacteriophages (viruses that infect bacteria). Crucially, they demonstrated that when researchers added new spacer sequences derived from specific phages to the CRISPR array, bacterial strains that were previously vulnerable to those phages suddenly became immune. This was striking evidence that the spacers encode specific information about threats. Proving the Immune Function (2008) One year later, Marraffini and Sontheimer showed that CRISPR acts as an adaptive immune system by limiting horizontal gene transfer in Staphylococcus. By targeting DNA through a process called "interference," they proved that CRISPR functions analogously to the adaptive immune system in animals—bacteria store records of past infections and use this information to prevent future ones. The Three Stages of CRISPR Immunity CRISPR immunity operates through three sequential stages: adaptation (acquiring foreign DNA), expression and processing (converting stored information into functional molecules), and interference (using this information to destroy invaders). Think of it like developing a vaccine—first you're exposed to a pathogen, then your body produces antibodies, then those antibodies protect you. Stage 1: Spacer Acquisition (Adaptation) When a phage infects a bacterial cell, a specialized protein complex must capture a fragment of the invading DNA and integrate it into the CRISPR array. The Cas1-Cas2 Complex The Cas1 and Cas2 proteins work together as a molecular machinery for this integration: Cas1 is a metal-dependent nuclease/integrase (an enzyme that cuts and rearranges DNA). Importantly, Cas1 binds double-stranded DNA without sequence specificity—it doesn't need to recognize a particular sequence, so it can capture DNA from any source. Cas2 acts as a scaffold protein, holding DNA fragments in position while Cas1 catalyzes the integration reaction. In some systems, Cas4 also assists, helping ensure that spacers are inserted at the correct length and orientation. The Role of PAM Sequences Spacer acquisition preferentially occurs adjacent to Protospacer Adjacent Motif (PAM) sequences—short sequences (typically 3-5 base pairs) that mark where the spacer should be captured. For example, the common PAM for Streptococcus pyogenes Cas9 is "NGG" (where N is any nucleotide). PAMs are essential for acquisition in type I and type II CRISPR systems but are not required for type III systems. Directional Integration and Memory New spacers are always added at the leader-proximal end of the CRISPR array (the end nearest a regulatory region called the "leader"). This directional insertion creates a chronological record—the oldest spacers are farthest from the leader, while the newest acquired spacers are closest. Researchers can read this "molecular clock" to determine the order of past infections. Primed Acquisition When an existing spacer partially matches an incoming phage's DNA, the system is "primed" to acquire additional spacers from that same phage more efficiently. This mechanism allows bacteria to rapidly capture multiple pieces of information about a particularly threatening virus. Stage 2: CRISPR RNA Expression and Processing Once spacers are integrated into the genome, they must be transcribed and processed into mature, functional molecules. Transcription of the CRISPR Array The entire CRISPR array (containing multiple repeats and spacers) is transcribed as a single, long precursor RNA called the pre-crRNA. This long transcript contains information about multiple past infections encoded as spacer sequences. Processing into Mature crRNAs The pre-crRNA must be cleaved into individual CRISPR RNAs (crRNAs), each containing one spacer. The processing mechanism differs by CRISPR type: Type I and III systems: An enzyme called Cas6 (or its variants Cas6e/Cas6f) recognizes the characteristic stem-loop structure formed by the repeat sequences. Cas6 cuts at precise positions along these repeat boundaries, releasing individual crRNAs. Type II systems: These systems use a different strategy. RNase III (a common cellular enzyme) works together with a trans-activating crRNA (tracrRNA) to process the precursor. This extra component is important because Type II systems like S. pyogenes Cas9 require two RNA molecules to function, not just one. Mature crRNA Structure The final mature crRNA retains the spacer sequence plus portions of the repeat sequence (called "repeat handles") at one or both ends. These repeat handles are crucial—they help recruit other Cas proteins and they contain information that distinguishes self from non-self, as we'll discuss below. Stage 3: Interference (Target Detection and Destruction) Now comes the action: the crRNA loaded into Cas protein complexes scans the bacterial cell for matching invader nucleic acids and destroys them. The Surveillance Complex The mature crRNA associates with multiple Cas proteins to form a surveillance complex (named differently in each type: Cascade in Type I systems, just Cas9 in Type II, etc.). This ribonucleoprotein complex constantly scans the cell's DNA and RNA looking for matches. Search Strategy: Seed Sequence and PAM Scanning The surveillance complex searches for two requirements simultaneously: Seed sequence match: A 7-12 nucleotide region near the beginning of the spacer sequence must perfectly match the target DNA. This is the "seed" that initiates recognition—mismatches here abort the search. PAM requirement: The target must have the correct PAM sequence adjacent to it. Interestingly, mismatches outside the seed region are often tolerated. This flexibility is important because it allows for some evolutionary tolerance while maintaining specificity. Type I System Interference In Type I systems, once the Cascade complex (composed of Cas5, Cas6, Cas7, Cas8 proteins along with the crRNA) finds a match, it recruits the Cas3 protein—a helicase-nuclease. Cas3 unwinds the double-stranded DNA and degrades the target strand, destroying the invading genetic material. Type II System Interference (Cas9) Type II systems, exemplified by Cas9, work through a different mechanism. Cas9 binds both the crRNA and a tracrRNA, scanning for a PAM. Upon PAM recognition, Cas9 unwinds the DNA locally to check if the spacer sequence matches. If it does, two nuclease domains within Cas9 activate: one cuts each DNA strand, creating a double-stranded break. This direct cleavage is very efficient for destroying target DNA. Type III System Interference Type III systems are unique because they can target both DNA and RNA. The complex associates with Cas10 protein, which can cleave DNA in a transcription-dependent manner while also degrading complementary mRNA. Self vs. Non-Self Discrimination: Preventing Friendly Fire A critical question: how does the system avoid destroying the bacterial chromosome itself, which now contains the integrated spacer sequences? The Mismatch Outside the Seed Region The answer lies in a clever molecular design: the repeat handle sequences flanking the spacer in the mature crRNA contain a sequence that doesn't perfectly match the template strand of the integrated spacer. This creates a forced mismatch outside the seed region at one end. When the surveillance complex scans the chromosome itself, this mismatch prevents full target recognition, so the system doesn't attack itself. This principle—perfect matching within the seed region but tolerance of mismatches elsewhere—provides flexibility while maintaining safety. Key Cas Proteins and Their Molecular Functions Different CRISPR systems use different "effector" Cas proteins to destroy targets. Understanding these proteins is essential for understanding system capabilities and CRISPR applications. Cas9: The General-Purpose Nuclease Structure and Function Cas9 is the most widely used CRISPR protein in genetic engineering. It functions as a dual-RNA-guided DNA endonuclease, meaning it requires two RNA molecules (the crRNA and tracrRNA, often engineered as a single "guide RNA" for convenience) to guide its cutting activity. Nuclease Domains Cas9 contains two distinct nuclease domains: RuvC domain: Cuts the non-complementary DNA strand (the strand that doesn't base-pair with the guide RNA) HNH domain: Cuts the complementary strand Together, these create a blunt double-stranded break approximately 3-4 base pairs upstream of the PAM sequence. PAM Recognition Cas9 recognizes the PAM "NGG" in S. pyogenes (though different Cas9 variants recognize different PAMs). The PAM must be immediately adjacent to the target sequence. Cas12a: Alternative Architecture Key Differences Cas12a offers a different cutting strategy compared to Cas9: Recognizes a T-rich PAM (such as "TTTV" where V is A, C, or G) Creates staggered cuts with 5′ overhangs, rather than blunt cuts Uniquely, processes its own CRISPR RNA without requiring a separate trans-activating RNA This self-processing capability makes Cas12a simpler to engineer in some contexts. Cas13: RNA-Targeting Immunity Target Specificity Unlike Cas9 and Cas12a, which target DNA, Cas13 proteins specifically target single-stranded RNA. This capability enables: RNA-based diagnostics (detecting viral RNA) RNA therapeutics (targeting disease-associated RNAs) Collateral Activity An important feature of Cas13: once it binds its target RNA, it exhibits collateral RNase activity, non-specifically degrading surrounding RNAs. This means one Cas13 protein recognizing its target can trigger widespread RNA degradation in the cell. This property is paradoxically useful for diagnostics—amplified signal that confirms target detection—but requires careful handling in applications. Cas3: The Helicase-Nuclease Molecular Activities Cas3 combines two enzymatic activities: Helicase activity: Unwinds double-stranded DNA Nuclease activity: Degrades the unwound, single-stranded DNA Working with the Type I Cascade complex, Cas3 creates a processive degradation of target DNA—it unwinds and cuts continuously along the DNA strand rather than making a single cut like Cas9. Summary: Why CRISPR Is Revolutionary The elegance of CRISPR systems lies in their three-stage simplicity: acquire information about threats, process that information into functional molecules, and use those molecules to target and destroy invaders with remarkable precision. The discovery that this mechanism could be reprogrammed by changing the guide RNA sequences—making the targeting information modular—opened the door to precise genetic engineering. Understanding the molecular details of acquisition, expression, and interference helps explain both why CRISPR systems work so well naturally and why they've become so powerful as biotechnology tools. <extrainfo> Additional Context: CRISPR System Diversity Bacteria use multiple CRISPR system types (I, II, III, IV, V, VI, plus variants), each with different architectures and capabilities. The systems discussed in this outline (Types I, II, and III) represent the most well-characterized and most commonly used for biotechnology. Researchers continue discovering new CRISPR systems with novel properties, suggesting that the diversity of bacterial immune strategies is even greater than currently understood. </extrainfo>
Flashcards
What is the physical structure of the repeat sequences in a CRISPR array as identified in early research?
They are interspaced by unique spacer sequences of variable length.
Which 2007 study experimentally demonstrated that CRISPR provides acquired resistance against viruses in bacteria?
The Science paper by Barrangou and colleagues.
What change to a CRISPR array was shown to confer immunity to previously vulnerable bacterial strains?
The addition of new spacers.
Which 2008 study proved that CRISPR acts as an adaptive immune system by limiting horizontal gene transfer in Staphylococcus?
The Science paper by Marraffini and Sontheimer.
Which two proteins capture a fragment of phage DNA and integrate it as a new spacer during infection?
Cas1 Cas2
Where are new spacers directionally added within the CRISPR array to preserve a chronological record?
At the leader proximal end.
What is the term for the process where an existing spacer enhances the uptake of additional spacers from a partially matching phage?
Primed acquisition.
What are the two primary enzymatic/functional roles of the Cas1 protein?
Metal-dependent nuclease and integrase.
What is the typical length of a PAM sequence used during spacer acquisition?
$3$–$5$ bp.
In which CRISPR system types are PAMs essential during the acquisition phase?
Type I and Type II systems.
What is the initial transcript of the CRISPR array before it is cleaved into individual crRNAs?
A long precursor RNA (pre-crRNA).
Which protein recognizes repeat stem-loops and cuts at repeat boundaries in Type I and Type III systems?
Cas6 (or Cas6e/f).
Which two components are required by Type II systems to process the precursor RNA?
RNase III trans-activating crRNA (tracrRNA)
What sequences are retained in a mature crRNA?
The spacer sequence and a portion of the repeat.
What specific sequence within the crRNA must perfectly match the target to govern interference?
The seed sequence.
Which protein is recruited by the Cascade complex to unwind and degrade target DNA?
Cas3.
Which four Cas proteins typically compose the Cascade complex in Type I systems?
Cas5 Cas6 Cas7 Cas8
In addition to phage mRNA, what can Type III systems degrade via Cas10 in a transcription-dependent manner?
DNA.
What type of DNA break is generated by the dual-RNA-guided endonuclease Cas9?
A blunt double-stranded break.
What are the two nuclease domains in Cas9 that each cut one DNA strand?
RuvC-like domain HNH domain
What specific PAM sequence type does Cas12a recognize?
T-rich PAM.
How does Cas12a's RNA processing differ from Cas9?
Cas12a processes its own CRISPR RNA without a separate tracrRNA.
What is the primary target substrate of the Cas13 protein?
Single-stranded RNA.
What is the term for the non-specific degradation of surrounding RNAs by Cas13 upon target binding?
Collateral RNase activity.
Which enzymatic activity allows Cas3 to unwind double-stranded DNA?
Helicase activity.

Quiz

How does Cas12a differ from Cas9 in the pattern of DNA cleavage?
1 of 2
Key Concepts
CRISPR Mechanism Overview
CRISPR‑Cas adaptive immune system
Spacer acquisition (adaptation)
Protospacer Adjacent Motif (PAM)
crRNA biogenesis
Interference (target cleavage)
CRISPR Effectors
Cas9
Cas12a
Cas13
Cascade complex
Cas3