RemNote Community
Community

CRISPR Foundations

Understand the structure, classification, and major nucleases (Cas9, Cas12a, Cas13) of CRISPR‑Cas systems.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz

Quick Practice

What does the acronym CRISPR stand for?
1 of 25

Summary

Overview of CRISPR-Cas Systems What is CRISPR? CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeats. It's a family of DNA sequences found in bacteria and archaea that functions as a heritable immune system against invading genetic elements like bacteriophages (viruses that infect bacteria) and plasmids. Think of CRISPR as a molecular "mugshot database" combined with a "search and destroy" system. Bacteria record snippets of invading DNA and use these records to recognize and eliminate future attacks from the same invaders. This system is incredibly common—approximately 50% of bacterial genomes and 90% of archaeal genomes contain CRISPR loci, highlighting its evolutionary success. The Structure of a CRISPR Locus Repeats and Spacers A CRISPR array has a distinctive structure that gives the system its name. Here's what you'll find: The repeating pattern starts with an AT-rich leader sequence (the "header"), followed by alternating short repeat sequences and unique spacers. The repeats themselves are typically 28–37 base pairs long and can form stem-loop structures when transcribed into RNA. These repeats are largely identical throughout the array. The spacers, in contrast, are the variable parts—usually 32–38 bp each—and represent the "mugshots." Each spacer is derived from actual invading phage or plasmid DNA that previously infected (or attempted to infect) the bacterium. This is why spacers are unique: they record specific past infections. Cas Genes Adjacent to the CRISPR repeat-spacer array are protein-coding genes called cas genes (CRISPR-associated genes). These genes encode the molecular machinery that processes the CRISPR array and executes the immune response. Over 90 cas genes exist in nature, grouped into 35 different families. Of these, 11 families form the cas core, comprising the basic genes Cas1 through Cas9. This core group appears across most systems, though not all systems contain every core gene. Classification: Classes and Types The diversity of CRISPR-Cas systems is organized into a classification scheme based on the architecture of their effector complexes. Class 1 and Class 2 Systems The broadest division separates systems into two classes: Class 1 systems use multi-protein effector complexes—several different Cas proteins work together to recognize and cleave target DNA. These systems are generally larger and more complex. Class 2 systems use a single large Cas protein as the molecular tool. This "all-in-one" design is simpler, which is partly why Class 2 systems have been more widely adopted in biotechnology. Six System Types (I through VI) Within these classes, there are six distinct system types: Type I: Class 1 system with multi-protein complex Type II: Class 2 system using a single protein (characteristic: uses Cas9) Type III: Class 1 system Type IV: Class 1 system Type V: Class 2 system using Cas12a as the effector Type VI: Class 2 system using Cas13 as the effector (targets RNA, not DNA) Each type is defined by a unique signature gene that distinguishes it from the others. This classification helps researchers identify which system they're working with and predict its properties. How CRISPR-Cas Systems Work: The Molecular Mechanism The CRISPR-Cas system operates in two main phases: first, it acquires spacers from invading DNA (the learning phase), and second, it uses those spacers to provide immunity against future invasions (the defense phase). Acquisition Phase When new foreign DNA enters the cell, the Cas1 and Cas2 proteins recognize and capture short fragments of this foreign DNA. These fragments are integrated into the CRISPR array as new spacers. This process is heritable—it becomes a permanent part of the bacterial genome, so all descendants inherit this immunity. This is why we say CRISPR provides "acquired, heritable immunity." Defense Phase When the same pathogen returns, the CRISPR array is transcribed into a long RNA molecule. Cas proteins process this transcript into shorter guide RNAs (crRNA), which contain both the spacer sequence and structural elements that help them bind to Cas proteins. The Cas protein-guide RNA complex then scans the cell looking for DNA sequences matching the spacer. When a match is found (with specific requirements we'll discuss below), the DNA is cleaved, destroying the invading genetic element. Major CRISPR Nucleases: Molecular Details Three nucleases dominate CRISPR research and applications. Understanding their properties is essential because their specific target requirements and cutting patterns make them suited for different applications. Cas9: The Workhorse Nuclease Cas9 is the archetypal Class 2 Type II nuclease and remains the most widely used CRISPR tool in research and medicine. Structure and mechanism: Cas9 operates as a single protein paired with guide RNA. It contains two distinct cutting domains: The HNH domain cuts the DNA strand that matches the guide RNA (the "target strand") The RuvC domain cuts the opposite, non-matching strand (the "non-target strand") This dual-domain design ensures that both strands are cut, producing blunt-ended double-strand breaks. Targeting requirements (PAM): Cas9 doesn't cut at just any location where the guide RNA sequence matches. It has a critical requirement: a protospacer adjacent motif (PAM)—a short DNA sequence that must be present immediately next to the target site. For the most commonly used Streptococcus pyogenes Cas9, the PAM is 5′-NGG-3′ (where N is any nucleotide, and GG must be present). This means Cas9 only cuts if there's a match to the guide RNA and a GG sequence immediately following (on the 3′ end of the non-target strand). This PAM requirement limits where Cas9 can cut in the genome but also provides specificity. Cas12a (Formerly Called Cpf1): The Alternative Nuclease Cas12a is a Class 2 Type V nuclease that offers advantages over Cas9 in certain applications. Structure and targeting simplicity: Unlike Cas9, Cas12a requires only a single guide RNA (crRNA) for targeting—no additional tracrRNA needed. This simplification makes Cas12a easier to design for applications. PAM sequence: Cas12a uses a T-rich PAM: 5′-TTTV-3′ (where V = A, C, or G). This is fundamentally different from Cas9's GG-rich PAM, meaning Cas12a can target different genomic locations. The PAM is also located on the opposite end of the target sequence (upstream rather than downstream), which is another key difference. Cutting pattern: Cas12a produces staggered cuts with 5′ overhangs, meaning it creates sticky ends rather than the blunt ends generated by Cas9. This difference affects how the cell repairs the break and can influence the efficiency of inserting new DNA sequences during gene editing. Cas13: The RNA-Targeting Nuclease Cas13 stands apart from Cas9 and Cas12a because it targets RNA, not DNA. Mechanism: Cas13 is an RNA-guided RNA endonuclease. When it binds its target RNA with help from a guide RNA, it cleaves single-stranded RNA at specific sites. After this initial cleavage, something unusual happens: Cas13 remains activated and indiscriminately cleaves other RNA molecules nearby. This property is called collateral cleavage and is a key feature. Applications: The collateral cleavage activity makes Cas13 valuable for diagnostic applications. When Cas13 detects its target RNA (such as viral RNA in a patient sample), it "turns on" and creates an explosive signal by destroying all nearby RNA, which can be detected easily. This is the principle behind diagnostic tests for conditions like COVID-19. <extrainfo> Evolutionary Origins: Comparative genomic studies have revealed that Cas1 is universally conserved across all CRISPR-associated systems and likely originated from mobile genetic elements called casposons. This suggests that CRISPR systems may have evolved from "selfish DNA" elements and were later co-opted for immunity. The prevalence of this conserved Cas1 across diverse organisms highlights the ancient origins of CRISPR technology. </extrainfo> Key Takeaways CRISPR-Cas systems are widespread bacterial immune systems that record and eliminate invading DNA through a guide RNA-directed mechanism Structure: CRISPR arrays consist of repeating sequences interspersed with unique spacer sequences derived from past infections Classification: Systems divide into Class 1 (multi-protein) and Class 2 (single-protein), further subdivided into Types I through VI Cas9 uses dual cutting domains, requires a GG-rich PAM, and produces blunt ends Cas12a requires only crRNA, has a T-rich PAM located differently, and produces staggered ends Cas13 uniquely targets RNA and exhibits collateral cleavage activity useful for diagnostics
Flashcards
What does the acronym CRISPR stand for?
Clustered Regularly Interspaced Short Palindromic Repeats
In which types of organisms is the CRISPR family of DNA sequences naturally found?
Bacteria and archaea
What is the primary biological function of the CRISPR-Cas system in nature?
Heritable, acquired immunity against bacteriophages and mobile genetic elements
What components separate the short repeats within a CRISPR array?
Unique spacers
What is the original source of the unique spacers found in a CRISPR array?
Past viral or plasmid DNA
What specific sequence is located at the beginning of a CRISPR array?
An AT-rich leader sequence
What structural feature may repeats form in transcribed CRISPR RNA?
Stem-loop structures
What is the typical length range of spacers in a CRISPR array?
32–38 bp
What are the two primary enzymatic functions of Cas proteins?
Processing RNA and cleaving nucleic acids
Which specific Cas protein is universally conserved across all CRISPR-associated systems?
Cas1
From what mobile genetic elements is Cas1 likely to have originated?
Casposons
What are the two types of guide RNAs that direct Cas proteins to target sequences?
crRNA tracrRNA
What defines a Class 1 CRISPR system?
The use of multi-protein (multi-subunit) effector complexes
What defines a Class 2 CRISPR system?
The use of a single large Cas protein as an effector
Which system types belong to Class 1 CRISPR systems?
Type I Type III Type IV
Which system types belong to Class 2 CRISPR systems?
Type II Type V Type VI
What are the two catalytic domains in Cas9 that cut opposite DNA strands?
HNH and RuvC domains
What type of DNA ends are produced by a Cas9 cleavage event?
Blunt ends
What is the specific Protospacer Adjacent Motif (PAM) for Streptococcus pyogenes Cas9?
5′-NGG-3′
What was the former name of the Cas12a nuclease?
Cpf1
What type of DNA cuts does Cas12a generate?
Staggered cuts with 5′ overhangs
What is the typical sequence of the T-rich PAM used by Cas12a?
5′-TTTV-3′ (where V = A, C, or G)
Does Cas12a require a tracrRNA for target recognition?
No (it only requires a crRNA)
What is the primary target molecule cleaved by Cas13?
Single-stranded RNA
What unique enzymatic property does Cas13 exhibit after binding its target RNA?
Collateral cleavage of other RNAs

Quiz

What does the acronym CRISPR stand for?
1 of 13
Key Concepts
CRISPR Mechanisms
CRISPR‑Cas system
CRISPR array
Cas protein
Guide RNA
Classifications of CRISPR Systems
Class 1 CRISPR system
Class 2 CRISPR system
Cas Nucleases
Cas9
Cas12a
Cas13
Protospacer adjacent motif (PAM)