Subjects/Languages/Language Studies/Linguistics/Cyrillic script

Cyrillic script Study Guide

Study Guide

📖 Core Concepts Cyrillic script – a writing system derived from Greek uncial letters, later supplemented with Glagolitic symbols to cover Slavic sounds. Majuscule vs. minuscule – uppercase and lowercase forms; case distinction appeared after the 17th‑century civil‑script reform. Alphabet‑numeral duality – many Cyrillic letters double as numbers, inheriting Greek numeric values (not alphabetical order). Unicode blocks – modern computers store Cyrillic characters in several Unicode ranges (U+0400–U+04FF, U+0500–U+052F, etc.). Romanization vs. Cyrillization – Romanization converts Cyrillic to Latin (e.g., ISO 9, scientific transliteration); Cyrillization does the opposite. 📌 Must Remember Geographic reach: 250 M users (2019); official script in Russia, Bulgaria, Serbia, Ukraine, Belarus, Kazakhstan, Kyrgyzstan, Mongolia, etc. Key historical reforms: 9th c. Early Cyrillic (Preslav) – Greek base + Glagolitic additions. 1708‑1710 Peter the Great’s Civil script – introduced lowercase, removed archaic letters, Latin‑style shapes. 19th c. Vuk Karadžić’s Serbian reform – added Љ, Њ, Ђ, Ћ, Џ, Ј; eliminated obsolete graphemes. 20th c. Soviet standardization – created unified alphabets for many non‑Slavic languages. Unicode ranges (most frequently needed): Cyrillic: U+0400 – U+04FF Cyrillic Supplement: U+0500 – U+052F Cyrillic Extended‑A/B/C/D for historic/rare letters. 8‑bit legacy encodings: ISO/IEC 8859‑5 and Windows‑1251 (pre‑UTF‑8 era). Romanization standards: Scientific transliteration (Serbo‑Croatian base) and ISO 9 (one‑to‑one mapping). 🔄 Key Processes Peter the Great’s Civil‑Script Reform Identify archaic letters → remove them. Create distinct lowercase forms → add minuscule glyphs. Align shapes with Latin typographic conventions → redesign letter silhouettes. Unicode addition (v5.1) Survey early Cyrillic manuscripts → compile missing glyphs. Assign code points in Extended‑A/B/C/D blocks → ensure one‑to‑one mapping. Release updated Unicode standard (04 Apr 2008). Romanization (ISO 9) Take each Cyrillic character → map to a unique Latin counterpart (no diacritics needed). Preserve case and order → produce reversible transliteration. 🔍 Key Comparisons Cyrillic vs. Latin script (Serbian) Official status: Cyrillic constitutionally official; Latin de‑facto common. Letter inventory: Cyrillic includes unique letters (Љ, Њ, etc.); Latin uses digraphs (lj, nj). Unicode vs. 8‑bit encodings Scope: Unicode covers all historic and modern Cyrillic characters; ISO‑8859‑5/Windows‑1251 cover only modern Russian‑style set. Portability: Unicode works across platforms; 8‑bit encodings can cause mojibake on non‑compatible systems. ⚠️ Common Misunderstandings “Cyrillic = Russian” – false; many non‑Russian languages (Bulgarian, Serbian, Kazakh, Mongolian, etc.) use distinct Cyrillic alphabets. Numeric values follow alphabet order – they follow Greek‑derived values, e.g., А = 1, В = 2, Г = 3, but later letters jump to tens/hundreds. Lowercase existed from the start – lowercase forms were introduced only in the early 18th‑century civil script. 🧠 Mental Models / Intuition “Greek base + local tweaks” – picture the Greek uncial alphabet as a skeleton; every extra Cyrillic letter is a “plug‑in” for a sound missing in Greek. Unicode as “address book” – each block is a floor; the main floor (U+0400‑U+04FF) holds everyday letters, upper floors hold historic/rare ones. 🚩 Exceptions & Edge Cases Letter ё (Yo) – optional in many Russian texts; often omitted in newspapers, but mandatory in school books and dictionaries for disambiguation. Cyrillic in EU – after Bulgaria’s 2007 EU accession, Cyrillic became the EU’s third official script (Latin, Greek, Cyrillic). Script switching – Central Asian republics (Azerbaijan, Turkmenistan, Uzbekistan) officially moved to Latin in the 21st c., but legacy Cyrillic still appears in older documents. 📍 When to Use Which Choosing a romanization system: Scientific transliteration → academic linguistic work, where diacritics are acceptable. ISO 9 → any context requiring reversible, one‑to‑one mapping (databases, GIS). Encoding choice for legacy data: If the text is purely modern Russian → Windows‑1251 may suffice. For multilingual or historic corpora → UTF‑8 (Unicode) is mandatory. Keyboard layout selection: Use the national layout (e.g., Russian JCUKEN, Serbian Latin/Cyrillic) for native typing; switch to “US‑International” only for transliteration tasks. 👀 Patterns to Recognize Letter‑value pattern: Cyrillic numerals follow Greek numeric groups (units, tens, hundreds). Spot a letter with a “hook” (e.g., ҂) → indicates a numeral. Reform pattern: Major script reforms coincide with political shifts (e.g., Peter the Great → modernization; Soviet era → standardization). Dual‑script pattern: In Serbian texts, the same word may appear in both scripts side‑by‑side; look for identical morphology despite different glyphs. 🗂️ Exam Traps Distractor: “Cyrillic was invented by Saint Cyril.” – Correct answer: named after him; created by his disciples. Distractor: “All Cyrillic letters have a one‑to‑one numeric value in alphabetical order.” – Wrong; numeric values are inherited from Greek and are non‑sequential. Distractor: “Unicode block U+0400‑U+04FF contains every Cyrillic character ever used.” – Incorrect; historic and minority letters reside in Extended‑A/B/C/D blocks. Distractor: “The letter ё is mandatory in all Russian publications.” – Not true; many modern publications omit it for brevity. --- Use this guide for a rapid review before your exam – focus on the bolded keywords and the step‑by‑step processes to cement your recall.

Or, immediately create your own study flashcards:

Upload a PDF.
Master Study Materials.

Start learning in seconds

Drop your PDFs here or