Cyrillic script Study Guide
Study Guide
📖 Core Concepts
Cyrillic script – a writing system derived from Greek uncial letters, later supplemented with Glagolitic symbols to cover Slavic sounds.
Majuscule vs. minuscule – uppercase and lowercase forms; case distinction appeared after the 17th‑century civil‑script reform.
Alphabet‑numeral duality – many Cyrillic letters double as numbers, inheriting Greek numeric values (not alphabetical order).
Unicode blocks – modern computers store Cyrillic characters in several Unicode ranges (U+0400–U+04FF, U+0500–U+052F, etc.).
Romanization vs. Cyrillization – Romanization converts Cyrillic to Latin (e.g., ISO 9, scientific transliteration); Cyrillization does the opposite.
📌 Must Remember
Geographic reach: 250 M users (2019); official script in Russia, Bulgaria, Serbia, Ukraine, Belarus, Kazakhstan, Kyrgyzstan, Mongolia, etc.
Key historical reforms:
9th c. Early Cyrillic (Preslav) – Greek base + Glagolitic additions.
1708‑1710 Peter the Great’s Civil script – introduced lowercase, removed archaic letters, Latin‑style shapes.
19th c. Vuk Karadžić’s Serbian reform – added Љ, Њ, Ђ, Ћ, Џ, Ј; eliminated obsolete graphemes.
20th c. Soviet standardization – created unified alphabets for many non‑Slavic languages.
Unicode ranges (most frequently needed):
Cyrillic: U+0400 – U+04FF
Cyrillic Supplement: U+0500 – U+052F
Cyrillic Extended‑A/B/C/D for historic/rare letters.
8‑bit legacy encodings: ISO/IEC 8859‑5 and Windows‑1251 (pre‑UTF‑8 era).
Romanization standards: Scientific transliteration (Serbo‑Croatian base) and ISO 9 (one‑to‑one mapping).
🔄 Key Processes
Peter the Great’s Civil‑Script Reform
Identify archaic letters → remove them.
Create distinct lowercase forms → add minuscule glyphs.
Align shapes with Latin typographic conventions → redesign letter silhouettes.
Unicode addition (v5.1)
Survey early Cyrillic manuscripts → compile missing glyphs.
Assign code points in Extended‑A/B/C/D blocks → ensure one‑to‑one mapping.
Release updated Unicode standard (04 Apr 2008).
Romanization (ISO 9)
Take each Cyrillic character → map to a unique Latin counterpart (no diacritics needed).
Preserve case and order → produce reversible transliteration.
🔍 Key Comparisons
Cyrillic vs. Latin script (Serbian)
Official status: Cyrillic constitutionally official; Latin de‑facto common.
Letter inventory: Cyrillic includes unique letters (Љ, Њ, etc.); Latin uses digraphs (lj, nj).
Unicode vs. 8‑bit encodings
Scope: Unicode covers all historic and modern Cyrillic characters; ISO‑8859‑5/Windows‑1251 cover only modern Russian‑style set.
Portability: Unicode works across platforms; 8‑bit encodings can cause mojibake on non‑compatible systems.
⚠️ Common Misunderstandings
“Cyrillic = Russian” – false; many non‑Russian languages (Bulgarian, Serbian, Kazakh, Mongolian, etc.) use distinct Cyrillic alphabets.
Numeric values follow alphabet order – they follow Greek‑derived values, e.g., А = 1, В = 2, Г = 3, but later letters jump to tens/hundreds.
Lowercase existed from the start – lowercase forms were introduced only in the early 18th‑century civil script.
🧠 Mental Models / Intuition
“Greek base + local tweaks” – picture the Greek uncial alphabet as a skeleton; every extra Cyrillic letter is a “plug‑in” for a sound missing in Greek.
Unicode as “address book” – each block is a floor; the main floor (U+0400‑U+04FF) holds everyday letters, upper floors hold historic/rare ones.
🚩 Exceptions & Edge Cases
Letter ё (Yo) – optional in many Russian texts; often omitted in newspapers, but mandatory in school books and dictionaries for disambiguation.
Cyrillic in EU – after Bulgaria’s 2007 EU accession, Cyrillic became the EU’s third official script (Latin, Greek, Cyrillic).
Script switching – Central Asian republics (Azerbaijan, Turkmenistan, Uzbekistan) officially moved to Latin in the 21st c., but legacy Cyrillic still appears in older documents.
📍 When to Use Which
Choosing a romanization system:
Scientific transliteration → academic linguistic work, where diacritics are acceptable.
ISO 9 → any context requiring reversible, one‑to‑one mapping (databases, GIS).
Encoding choice for legacy data:
If the text is purely modern Russian → Windows‑1251 may suffice.
For multilingual or historic corpora → UTF‑8 (Unicode) is mandatory.
Keyboard layout selection:
Use the national layout (e.g., Russian JCUKEN, Serbian Latin/Cyrillic) for native typing; switch to “US‑International” only for transliteration tasks.
👀 Patterns to Recognize
Letter‑value pattern: Cyrillic numerals follow Greek numeric groups (units, tens, hundreds). Spot a letter with a “hook” (e.g., ҂) → indicates a numeral.
Reform pattern: Major script reforms coincide with political shifts (e.g., Peter the Great → modernization; Soviet era → standardization).
Dual‑script pattern: In Serbian texts, the same word may appear in both scripts side‑by‑side; look for identical morphology despite different glyphs.
🗂️ Exam Traps
Distractor: “Cyrillic was invented by Saint Cyril.” – Correct answer: named after him; created by his disciples.
Distractor: “All Cyrillic letters have a one‑to‑one numeric value in alphabetical order.” – Wrong; numeric values are inherited from Greek and are non‑sequential.
Distractor: “Unicode block U+0400‑U+04FF contains every Cyrillic character ever used.” – Incorrect; historic and minority letters reside in Extended‑A/B/C/D blocks.
Distractor: “The letter ё is mandatory in all Russian publications.” – Not true; many modern publications omit it for brevity.
---
Use this guide for a rapid review before your exam – focus on the bolded keywords and the step‑by‑step processes to cement your recall.
or
Or, immediately create your own study flashcards:
Upload a PDF.
Master Study Materials.
Master Study Materials.
Start learning in seconds
Drop your PDFs here or
or