CRISPR Foundations
Understand the structure, classification, and major nucleases (Cas9, Cas12a, Cas13) of CRISPR‑Cas systems.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
What does the acronym CRISPR stand for?
1 of 25
Summary
Overview of CRISPR-Cas Systems
What is CRISPR?
CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeats. It's a family of DNA sequences found in bacteria and archaea that functions as a heritable immune system against invading genetic elements like bacteriophages (viruses that infect bacteria) and plasmids.
Think of CRISPR as a molecular "mugshot database" combined with a "search and destroy" system. Bacteria record snippets of invading DNA and use these records to recognize and eliminate future attacks from the same invaders. This system is incredibly common—approximately 50% of bacterial genomes and 90% of archaeal genomes contain CRISPR loci, highlighting its evolutionary success.
The Structure of a CRISPR Locus
Repeats and Spacers
A CRISPR array has a distinctive structure that gives the system its name. Here's what you'll find:
The repeating pattern starts with an AT-rich leader sequence (the "header"), followed by alternating short repeat sequences and unique spacers. The repeats themselves are typically 28–37 base pairs long and can form stem-loop structures when transcribed into RNA. These repeats are largely identical throughout the array.
The spacers, in contrast, are the variable parts—usually 32–38 bp each—and represent the "mugshots." Each spacer is derived from actual invading phage or plasmid DNA that previously infected (or attempted to infect) the bacterium. This is why spacers are unique: they record specific past infections.
Cas Genes
Adjacent to the CRISPR repeat-spacer array are protein-coding genes called cas genes (CRISPR-associated genes). These genes encode the molecular machinery that processes the CRISPR array and executes the immune response.
Over 90 cas genes exist in nature, grouped into 35 different families. Of these, 11 families form the cas core, comprising the basic genes Cas1 through Cas9. This core group appears across most systems, though not all systems contain every core gene.
Classification: Classes and Types
The diversity of CRISPR-Cas systems is organized into a classification scheme based on the architecture of their effector complexes.
Class 1 and Class 2 Systems
The broadest division separates systems into two classes:
Class 1 systems use multi-protein effector complexes—several different Cas proteins work together to recognize and cleave target DNA. These systems are generally larger and more complex.
Class 2 systems use a single large Cas protein as the molecular tool. This "all-in-one" design is simpler, which is partly why Class 2 systems have been more widely adopted in biotechnology.
Six System Types (I through VI)
Within these classes, there are six distinct system types:
Type I: Class 1 system with multi-protein complex
Type II: Class 2 system using a single protein (characteristic: uses Cas9)
Type III: Class 1 system
Type IV: Class 1 system
Type V: Class 2 system using Cas12a as the effector
Type VI: Class 2 system using Cas13 as the effector (targets RNA, not DNA)
Each type is defined by a unique signature gene that distinguishes it from the others. This classification helps researchers identify which system they're working with and predict its properties.
How CRISPR-Cas Systems Work: The Molecular Mechanism
The CRISPR-Cas system operates in two main phases: first, it acquires spacers from invading DNA (the learning phase), and second, it uses those spacers to provide immunity against future invasions (the defense phase).
Acquisition Phase
When new foreign DNA enters the cell, the Cas1 and Cas2 proteins recognize and capture short fragments of this foreign DNA. These fragments are integrated into the CRISPR array as new spacers. This process is heritable—it becomes a permanent part of the bacterial genome, so all descendants inherit this immunity. This is why we say CRISPR provides "acquired, heritable immunity."
Defense Phase
When the same pathogen returns, the CRISPR array is transcribed into a long RNA molecule. Cas proteins process this transcript into shorter guide RNAs (crRNA), which contain both the spacer sequence and structural elements that help them bind to Cas proteins. The Cas protein-guide RNA complex then scans the cell looking for DNA sequences matching the spacer. When a match is found (with specific requirements we'll discuss below), the DNA is cleaved, destroying the invading genetic element.
Major CRISPR Nucleases: Molecular Details
Three nucleases dominate CRISPR research and applications. Understanding their properties is essential because their specific target requirements and cutting patterns make them suited for different applications.
Cas9: The Workhorse Nuclease
Cas9 is the archetypal Class 2 Type II nuclease and remains the most widely used CRISPR tool in research and medicine.
Structure and mechanism: Cas9 operates as a single protein paired with guide RNA. It contains two distinct cutting domains:
The HNH domain cuts the DNA strand that matches the guide RNA (the "target strand")
The RuvC domain cuts the opposite, non-matching strand (the "non-target strand")
This dual-domain design ensures that both strands are cut, producing blunt-ended double-strand breaks.
Targeting requirements (PAM): Cas9 doesn't cut at just any location where the guide RNA sequence matches. It has a critical requirement: a protospacer adjacent motif (PAM)—a short DNA sequence that must be present immediately next to the target site.
For the most commonly used Streptococcus pyogenes Cas9, the PAM is 5′-NGG-3′ (where N is any nucleotide, and GG must be present). This means Cas9 only cuts if there's a match to the guide RNA and a GG sequence immediately following (on the 3′ end of the non-target strand). This PAM requirement limits where Cas9 can cut in the genome but also provides specificity.
Cas12a (Formerly Called Cpf1): The Alternative Nuclease
Cas12a is a Class 2 Type V nuclease that offers advantages over Cas9 in certain applications.
Structure and targeting simplicity: Unlike Cas9, Cas12a requires only a single guide RNA (crRNA) for targeting—no additional tracrRNA needed. This simplification makes Cas12a easier to design for applications.
PAM sequence: Cas12a uses a T-rich PAM: 5′-TTTV-3′ (where V = A, C, or G). This is fundamentally different from Cas9's GG-rich PAM, meaning Cas12a can target different genomic locations. The PAM is also located on the opposite end of the target sequence (upstream rather than downstream), which is another key difference.
Cutting pattern: Cas12a produces staggered cuts with 5′ overhangs, meaning it creates sticky ends rather than the blunt ends generated by Cas9. This difference affects how the cell repairs the break and can influence the efficiency of inserting new DNA sequences during gene editing.
Cas13: The RNA-Targeting Nuclease
Cas13 stands apart from Cas9 and Cas12a because it targets RNA, not DNA.
Mechanism: Cas13 is an RNA-guided RNA endonuclease. When it binds its target RNA with help from a guide RNA, it cleaves single-stranded RNA at specific sites. After this initial cleavage, something unusual happens: Cas13 remains activated and indiscriminately cleaves other RNA molecules nearby. This property is called collateral cleavage and is a key feature.
Applications: The collateral cleavage activity makes Cas13 valuable for diagnostic applications. When Cas13 detects its target RNA (such as viral RNA in a patient sample), it "turns on" and creates an explosive signal by destroying all nearby RNA, which can be detected easily. This is the principle behind diagnostic tests for conditions like COVID-19.
<extrainfo>
Evolutionary Origins: Comparative genomic studies have revealed that Cas1 is universally conserved across all CRISPR-associated systems and likely originated from mobile genetic elements called casposons. This suggests that CRISPR systems may have evolved from "selfish DNA" elements and were later co-opted for immunity. The prevalence of this conserved Cas1 across diverse organisms highlights the ancient origins of CRISPR technology.
</extrainfo>
Key Takeaways
CRISPR-Cas systems are widespread bacterial immune systems that record and eliminate invading DNA through a guide RNA-directed mechanism
Structure: CRISPR arrays consist of repeating sequences interspersed with unique spacer sequences derived from past infections
Classification: Systems divide into Class 1 (multi-protein) and Class 2 (single-protein), further subdivided into Types I through VI
Cas9 uses dual cutting domains, requires a GG-rich PAM, and produces blunt ends
Cas12a requires only crRNA, has a T-rich PAM located differently, and produces staggered ends
Cas13 uniquely targets RNA and exhibits collateral cleavage activity useful for diagnostics
Flashcards
What does the acronym CRISPR stand for?
Clustered Regularly Interspaced Short Palindromic Repeats
In which types of organisms is the CRISPR family of DNA sequences naturally found?
Bacteria and archaea
What is the primary biological function of the CRISPR-Cas system in nature?
Heritable, acquired immunity against bacteriophages and mobile genetic elements
What components separate the short repeats within a CRISPR array?
Unique spacers
What is the original source of the unique spacers found in a CRISPR array?
Past viral or plasmid DNA
What specific sequence is located at the beginning of a CRISPR array?
An AT-rich leader sequence
What structural feature may repeats form in transcribed CRISPR RNA?
Stem-loop structures
What is the typical length range of spacers in a CRISPR array?
32–38 bp
What are the two primary enzymatic functions of Cas proteins?
Processing RNA and cleaving nucleic acids
Which specific Cas protein is universally conserved across all CRISPR-associated systems?
Cas1
From what mobile genetic elements is Cas1 likely to have originated?
Casposons
What are the two types of guide RNAs that direct Cas proteins to target sequences?
crRNA
tracrRNA
What defines a Class 1 CRISPR system?
The use of multi-protein (multi-subunit) effector complexes
What defines a Class 2 CRISPR system?
The use of a single large Cas protein as an effector
Which system types belong to Class 1 CRISPR systems?
Type I
Type III
Type IV
Which system types belong to Class 2 CRISPR systems?
Type II
Type V
Type VI
What are the two catalytic domains in Cas9 that cut opposite DNA strands?
HNH and RuvC domains
What type of DNA ends are produced by a Cas9 cleavage event?
Blunt ends
What is the specific Protospacer Adjacent Motif (PAM) for Streptococcus pyogenes Cas9?
5′-NGG-3′
What was the former name of the Cas12a nuclease?
Cpf1
What type of DNA cuts does Cas12a generate?
Staggered cuts with 5′ overhangs
What is the typical sequence of the T-rich PAM used by Cas12a?
5′-TTTV-3′ (where V = A, C, or G)
Does Cas12a require a tracrRNA for target recognition?
No (it only requires a crRNA)
What is the primary target molecule cleaved by Cas13?
Single-stranded RNA
What unique enzymatic property does Cas13 exhibit after binding its target RNA?
Collateral cleavage of other RNAs
Quiz
CRISPR Foundations Quiz Question 1: What does the acronym CRISPR stand for?
- Clustered Regularly Interspaced Short Palindromic Repeats (correct)
- Complex Repetitive Sequence of Prokaryotic RNAs
- Coordinated Repetitive Adaptive System for Phage Resistance
- Conserved Repetitive Antigenic Sequences in Prokaryotes
CRISPR Foundations Quiz Question 2: Into how many main classes are CRISPR‑associated systems divided?
- Two classes (correct)
- Three classes
- Four classes
- Five classes
CRISPR Foundations Quiz Question 3: After the AT‑rich leader sequence, what pattern characterizes a typical CRISPR array?
- Alternating short repeats and unique spacers (correct)
- Alternating long repeats and conserved spacers
- Only spacers without any repeats
- Only repeats without any spacers
CRISPR Foundations Quiz Question 4: What protospacer adjacent motif (PAM) does *Streptococcus pyogenes* Cas9 recognize?
- 5′‑NGG‑3′ (correct)
- 5′‑TTTV‑3′
- 5′‑CCN‑3′
- 5′‑ATG‑3′
CRISPR Foundations Quiz Question 5: How many cas‑gene families constitute the core set, and which Cas proteins are included in this core?
- 11 families; Cas1 through Cas9 (correct)
- 35 families; Cas1 through Cas35
- 90 families; Cas1 through Cas90
- 5 families; Cas1 through Cas5
CRISPR Foundations Quiz Question 6: How many CRISPR system classes exist, and what defines each subtype?
- Six classes; each subtype defined by a signature gene (correct)
- Four classes; subtypes defined by genome size
- Eight classes; subtypes defined by PAM sequence
- Ten classes; subtypes defined by host organism
CRISPR Foundations Quiz Question 7: Which CRISPR system type employs Cas12a (formerly Cpf1) as its effector nuclease?
- Type V (correct)
- Type I
- Type II
- Type IV
CRISPR Foundations Quiz Question 8: Which Cas protein is universally conserved across all CRISPR‑associated systems, and what is its likely evolutionary origin?
- Cas1; originated from casposons (correct)
- Cas9; originated from plasmids
- Cas12a; originated from viruses
- Cas13; originated from transposons
CRISPR Foundations Quiz Question 9: What is the typical protospacer adjacent motif (PAM) recognized by Cas12a?
- 5′‑TTTV‑3′ (correct)
- 5′‑NGG‑3′
- 5′‑CCN‑3′
- 5′‑AT‑rich‑3′
CRISPR Foundations Quiz Question 10: Which RNA component alone is sufficient for Cas12a to recognize its target?
- crRNA alone (correct)
- tracrRNA alone
- crRNA–tracrRNA duplex
- protein cofactor
CRISPR Foundations Quiz Question 11: Which property of Cas13 enables its use in diagnostic assays?
- Collateral cleavage of other RNAs after target binding (correct)
- Direct cleavage of DNA targets
- Self‑degradation of the Cas13 protein
- Release of fluorescent proteins upon activation
CRISPR Foundations Quiz Question 12: Which CRISPR‑associated proteins are responsible for both processing precursor CRISPR transcripts into mature guide RNAs and cleaving target nucleic acids?
- Cas proteins (correct)
- tracrRNA
- PAM sequences
- ribosomes
CRISPR Foundations Quiz Question 13: From which component of the CRISPR locus are guide RNAs (crRNA and sometimes tracrRNA) transcribed?
- The CRISPR array (correct)
- The cas operon
- Plasmid DNA
- Viral genome
What does the acronym CRISPR stand for?
1 of 13
Key Concepts
CRISPR Mechanisms
CRISPR‑Cas system
CRISPR array
Cas protein
Guide RNA
Classifications of CRISPR Systems
Class 1 CRISPR system
Class 2 CRISPR system
Cas Nucleases
Cas9
Cas12a
Cas13
Protospacer adjacent motif (PAM)
Definitions
CRISPR‑Cas system
A heritable, RNA‑guided adaptive immune mechanism in bacteria and archaea that defends against viruses and plasmids.
CRISPR array
A genomic locus composed of short repeat sequences interspaced by unique spacers derived from foreign DNA.
Cas protein
An enzyme associated with CRISPR loci that processes guide RNAs and cleaves nucleic acids.
Class 1 CRISPR system
Multi‑subunit effector complexes (Types I, III, IV) that mediate interference against invading nucleic acids.
Class 2 CRISPR system
Single‑protein effectors (Types II, V, VI) that perform target cleavage using a solitary Cas nuclease.
Cas9
A Class 2 Type II endonuclease that uses a crRNA–tracrRNA duplex (or sgRNA) to create blunt double‑strand DNA breaks adjacent to an NGG PAM.
Cas12a
A Class 2 Type V nuclease that requires only a crRNA, generates staggered DNA cuts with 5′ overhangs, and recognizes a T‑rich PAM.
Cas13
A Class 2 Type VI RNA‑guided RNase that cleaves single‑stranded RNA and exhibits collateral RNA cleavage after target binding.
Protospacer adjacent motif (PAM)
A short conserved DNA sequence flanking the target site that is required for Cas nuclease binding and cleavage.
Guide RNA
The CRISPR‑derived RNA (crRNA, sometimes paired with tracrRNA) that directs Cas proteins to complementary nucleic acid targets.