Gaj's Latin alphabet
| Gaj's Latin alphabet Gajeva latinica | |
|---|---|
| Script type | |
| Period | early 19th century – present | 
| Languages | Serbo-Croatian | 
| Related scripts | |
| Parent systems | |
| Child systems | Slovene alphabet, Montenegrin Latin alphabet, Macedonian Latin alphabet, Bulgarian Latin alphabet |
| Sister systems | Slovak alphabet, Latvian alphabet, Lithuanian alphabet |
| Unicode | subset of Latin |
Gaj's Latin alphabet (Serbo-Croatian: Gajeva latinica / Гајева латиница, pronounced [ɡâːjeva latǐnit͡sa]), also known as abeceda (Serbian Cyrillic: абецеда, pronounced [abet͡sěːda]) or gajica (Serbian Cyrillic: гајица, pronounced [ɡǎjit͡sa]), is the form of the Latin script used for writing all four standard varieties of Serbo-Croatian: Bosnian, Croatian, Montenegrin, and Serbian. It contains 27 individual letters and 3 digraphs. Each letter (including digraphs) represents one Serbo-Croatian phoneme, yielding a highly phonemic orthography. It closely corresponds to the Serbian Cyrillic alphabet.
The alphabet was initially devised by Croatian linguist Ljudevit Gaj in 1835 during the Illyrian movement in ethnically Croatian parts of the Austrian Empire. It was largely based on Jan Hus's Czech alphabet and was meant to serve as a unified orthography for three Croat-populated kingdoms within the Austrian Empire at the time, namely Croatia, Dalmatia and Slavonia, and their three dialect groups, Kajkavian, Chakavian and Shtokavian, which historically utilized different spelling rules. The alphabet's final form was defined in the late 19th century.
A slightly reduced version is used as the alphabet for Slovene, and a slightly expanded version is used for modern standard Montenegrin. A modified version is used for the romanization of Macedonian. It further influenced alphabets of Romani languages that are spoken in Southeast Europe, namely Vlax and Balkan Romani.
Letters
The alphabet consists of thirty upper and lower case letters:
| Majuscule forms (uppercase or capital letters) | A | B | C | Č | Ć | D | Dž | Đ | E | F | G | H | I | J | K | L | Lj | M | N | Nj | O | P | R | S | Š | T | U | V | Z | Ž |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Minuscule forms (lowercase or small letters) | a | b | c | č | ć | d | dž | đ | e | f | g | h | i | j | k | l | lj | m | n | nj | o | p | r | s | š | t | u | v | z | ž |
| Broad IPA value | /a/ | /b/ | /t͡s/ | /t͡ʃ/ | /t͡ɕ/ | /d/ | /d͡ʒ/ | /d͡ʑ/ | /e/ | /f/ | /ɡ/ | /x/ | /i/ | /j/ | /k/ | /l/ | /ʎ/ | /m/ | /n/ | /ɲ/ | /o/ | /p/ | /r/ | /s/ | /ʃ/ | /t/ | /u/ | /ʋ/ | /z/ | /ʒ/ |

Letters are referred to by their names: a, be, ce, če, će, de, dže, đe, e, ef, ge, ha, i, je, ka, el, elj, em, en, enj, o, pe, er, es, eš, te, u, ve, ze, že,[1][2] or, in the case of consonants, by pronouncing the consonant followed by a schwa, e.g. /fə/.[3][4][5] In mathematics, ⟨j⟩ is commonly pronounced jot, as in the German of Germany.
Foreign letters
Various foreign letters are utilised in orthographically unadapted loanwords and foreign proper names, such as Québec.[6][7][8] Orthographically unadapted spelling of foreign names and some loanwords is standard in Croatia, whereas Serbians prefer to use orthographically adapted spellings. Non-native letters Q, W, X, and Y appear on the Serbo-Croatian keyboard. These four letters are usually named as follows: ⟨q⟩ as kve or ku, ⟨w⟩ as duplo ve or dvostruko ve, ⟨x⟩ as iks, and ⟨y⟩ as ipsilon.[6][9][10]
Digraphs
Digraphs ⟨dž⟩, ⟨lj⟩ and ⟨nj⟩ are considered to be single letters, and they signify single phonemes. However, they are distinguished from occurrences of two such letters that signify two distinct phonemes: džep (/d͡ʒêp/, Cyrillic џеп) uses the digraph, while nadživjeti (/nadʒǐːvjeti/, Cyrillic надживјети, morphological boundary: prefix nad- + base živjeti) uses two separate letters.
- In dictionaries, njegov comes after novine, in a separate ⟨nj⟩ section after the end of the ⟨n⟩ section; bolje comes after bolnica; nadžak (digraph ⟨dž⟩) comes after nadživjeti (⟨d⟩+⟨ž⟩ sequence), and so forth.
- If only the initial letter of a word is capitalized, only the first of the two component letters is capitalized: Njemačka ('Germany'), not NJemačka. Uppercase is used only if the entire word was capitalized: NJEMAČKA.[11] In Unicode, the form ⟨Nj⟩ is referred to as titlecase, as opposed to the uppercase form ⟨NJ⟩, representing one of the few cases in which titlecase and uppercase differ.
- In vertical writing (such as on signs), ⟨dž⟩, ⟨lj⟩, ⟨nj⟩ are written horizontally, as a unit. For instance, if ulje ('oil') is written vertically, ⟨lj⟩ appears on the second line. In crossword puzzles, ⟨dž⟩, ⟨lj⟩, ⟨nj⟩ each occupy a single square. The word mjenjačnica ('bureau de change') is written vertically with ⟨nj⟩ on the fourth line, while ⟨m⟩ and ⟨j⟩ appear separately on the first and second lines, respectively, because ⟨mj⟩ contains two letters, not one.
- If words are written with a space between each letter (such as on signs), each digraph is written as a unit. For instance: U LJ E, M J E NJ A Č N I C A.
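The dictionary-ordering rule above can be sketched in Python. This is a minimal illustration, not a standard library facility: the names `ALPHABET`, `TWO_LETTER_SPLITS`, `letters` and `sort_key` are hypothetical. Digraphs are matched greedily, so words in which ⟨d⟩+⟨ž⟩ (or ⟨l⟩+⟨j⟩, ⟨n⟩+⟨j⟩) are two separate letters cannot be detected from spelling alone; the sketch assumes they are listed by hand in an exception table.

```python
# The thirty letters of Gaj's alphabet in dictionary order.
ALPHABET = ["a", "b", "c", "č", "ć", "d", "dž", "đ", "e", "f", "g", "h", "i",
            "j", "k", "l", "lj", "m", "n", "nj", "o", "p", "r", "s", "š", "t",
            "u", "v", "z", "ž"]
RANK = {letter: i for i, letter in enumerate(ALPHABET)}
DIGRAPHS = ("dž", "lj", "nj")

# Hypothetical exception table: word -> indices where a digraph-looking
# pair is really two letters (here the prefix boundary nad- + živjeti).
TWO_LETTER_SPLITS = {"nadživjeti": {2}}


def letters(word):
    """Split a word into Gaj letters, treating digraphs as single units."""
    word = word.lower()
    splits = TWO_LETTER_SPLITS.get(word, set())
    out, i = [], 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIGRAPHS and i not in splits:
            out.append(pair)
            i += 2
        else:
            out.append(word[i])
            i += 1
    return out


def sort_key(word):
    """Collation key: the alphabet rank of each letter (unknowns sort last)."""
    return [RANK.get(letter, len(ALPHABET)) for letter in letters(word)]


words = ["njegov", "novine", "bolje", "bolnica", "nadžak", "nadživjeti"]
print(sorted(words, key=sort_key))
print(letters("mjenjačnica"))
```

The same `letters` function gives the units used in vertical or spaced-out writing, for example one square per element in a crossword.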
Accent marks
The vowels a, e, i, o, u, along with the syllabic consonants r and l, can take one of 5 accents: the double grave accent (◌̏) for a short vowel with falling tone, the inverted breve (◌̑) for a long vowel with falling tone, the grave accent (◌̀) for a short vowel with rising tone, the acute accent (◌́) for long vowel with rising tone, and macron (◌̄) for a non-tonic long vowel. These diacritic accents are typically used in dictionaries and linguistic publications, and in poetry to denote metrically correct reading. In ordinary prose they occur when needed to resolve semantic ambiguity between homographs: kod ('at') vs. kȏd ('code'), sam ('am') vs. sȃm ('alone'). For the same reason, the length of an unaccented syllable can be marked with ⟨◌̄⟩ or circumflex ⟨◌̂⟩, without accentuating the rest of the word. This is typically used to distinguish homographic nominative singular and genitive plural forms of nouns, where the genitive plural has a long final vowel: knjiga ('book' Nsg.) vs. knjigâ or knjigā ('books' Gpl.).[12][13]
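In Unicode text these accents are usually stored as combining marks or as precomposed letters. The following Python sketch removes them to recover the ordinary spelling; it is an illustration under stated assumptions, and the names `ACCENTS`, `CARRIERS` and `strip_accents` are hypothetical. A mark is dropped only when it sits on a vowel or on r/l, so the caron of č, š, ž and the acute of ć, which are part of the letter, are preserved.

```python
import unicodedata

# Combining marks used for Serbo-Croatian accentuation: double grave,
# inverted breve, grave, acute, macron, and the circumflex length mark.
ACCENTS = {"\u030f", "\u0311", "\u0300", "\u0301", "\u0304", "\u0302"}
# Letters that can carry an accent: the five vowels plus syllabic r and l.
CARRIERS = set("aeiourlAEIOURL")


def strip_accents(text):
    """Remove accent marks from vowels and r/l, keeping č, ć, š, ž intact."""
    out = []
    base = ""
    for ch in unicodedata.normalize("NFD", text):
        if unicodedata.combining(ch):
            if ch in ACCENTS and base in CARRIERS:
                continue  # accent on a vowel or r/l: drop it
        else:
            base = ch  # remember the most recent base letter
        out.append(ch)
    return unicodedata.normalize("NFC", "".join(out))


print(strip_accents("k\u020fd"))       # kȏd
print(strip_accents("knjig\u0101"))    # knjigā
```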
History
Croatian Latin alphabet before Gaj
In Croatian writing the Latin alphabet became dominant in the 16th century, marginalising the Cyrillic and the Glagolitic alphabets.[14] In the 17th century two major orthographic practices for using the Latin alphabet coalesced. Dalmatia used a system based on Italian orthography, whereas continental Kajkavian writing was based on Hungarian. In the 18th century the Slavonian orthography arose as well, a mixture of the previous two.[15] However, the specifics of the alphabetic systems tended to vary from writer to writer.[16]
In addition to these three widely used systems, multiple individual writers attempted their own reforms of the alphabet. These include Rajmund Đamanjić (1639), the early 1700s Dubrovnik academy work led by Đuro Matijašević and Ignjat Đurđević, as well as the early 1700s Lexicon Latino-Illyricum by Pavao Ritter Vitezović.
Gaj's reform and its revisions
The Serbo-Croatian Latin alphabet was mostly designed by Ljudevit Gaj, who modelled it after Czech (č, ž, š) and Polish (ć), and invented ⟨lj⟩, ⟨nj⟩ and ⟨dž⟩ on the model of similar solutions in Hungarian (ly, ny and dzs), although the combination dž also exists in Czech, and in Polish as dż. In 1830 in Buda, he published the book Kratka osnova horvatsko-slavenskog pravopisanja ("Brief basics of the Croatian-Slavonic orthography"), which was the first common Croatian orthography book.
Gaj followed the example of Pavao Ritter Vitezović and the Czech orthography, making one letter of the Latin script for each sound in the language. Following Vuk Karadžić's reform of Cyrillic in the early nineteenth century, in the 1830s Ljudevit Gaj did the same for latinica, using the Czech system and producing a one-to-one grapheme-phoneme correlation between the Cyrillic and Latin orthographies, resulting in a parallel system.[17]
In 1878 Đuro Daničić proposed a replacement of the digraphs ⟨dž⟩, ⟨dj⟩,[a] ⟨lj⟩ and ⟨nj⟩ with single letters: ⟨ģ⟩, ⟨đ⟩, ⟨ļ⟩ and ⟨ń⟩ respectively.[20] Of the four, ⟨đ⟩ was accepted in Ivan Broz's 1892 Hrvatski pravopis ("Croatian Orthography") and it thus became a part of the standard alphabet, though it was not immediately accepted by all writers and publishers.[21][19] The other three letters remained in use only in certain philological publications.[18][19] Names of individual people have sometimes retained the pre-đ spelling: Ksaver Šandor Gjalski (/d͡ʑâːlskiː/),[22] Gjuro Szabo (/d͡ʑǔːro/).[23][24]
Correspondence between Cyrillic and Latin alphabets
Each Cyrillic and Latin Serbo-Croatian letter has its exact counterpart in the other alphabet, although Latin digraphs ⟨lj⟩, ⟨nj⟩ and ⟨dž⟩ correspond to Cyrillic single letters ⟨љ⟩, ⟨њ⟩ and ⟨џ⟩. The following table provides the upper and lower case forms of Gaj's Latin alphabet, along with the equivalent forms in the Serbo-Croatian Cyrillic alphabet.
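| Latin uppercase | A | B | C | Č | Ć | D | Dž | Đ | E | F | G | H | I | J | K | L | Lj | M | N | Nj | O | P | R | S | Š | T | U | V | Z | Ž |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cyrillic uppercase | А | Б | Ц | Ч | Ћ | Д | Џ | Ђ | Е | Ф | Г | Х | И | Ј | К | Л | Љ | М | Н | Њ | О | П | Р | С | Ш | Т | У | В | З | Ж |
| Latin lowercase | a | b | c | č | ć | d | dž | đ | e | f | g | h | i | j | k | l | lj | m | n | nj | o | p | r | s | š | t | u | v | z | ž |
| Cyrillic lowercase | а | б | ц | ч | ћ | д | џ | ђ | е | ф | г | х | и | ј | к | л | љ | м | н | њ | о | п | р | с | ш | т | у | в | з | ж |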
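Because every Cyrillic letter has exactly one Latin counterpart, conversion from Cyrillic to Latin can be done letter by letter. The Python sketch below illustrates this; the names `CYRILLIC`, `LATIN` and `cyrillic_to_latin` are hypothetical, and the capitalisation heuristic (a digraph is written fully uppercase only when a neighbouring letter is uppercase) is an assumption of this example rather than a published rule. The reverse direction is not shown, since a Latin sequence such as ⟨d⟩+⟨ž⟩ may be either one letter or two.

```python
# Serbian Cyrillic letters in Cyrillic order, with their Latin counterparts.
CYRILLIC = "абвгдђежзијклљмнњопрстћуфхцчџш"
LATIN = ["a", "b", "v", "g", "d", "đ", "e", "ž", "z", "i", "j", "k", "l",
         "lj", "m", "n", "nj", "o", "p", "r", "s", "t", "ć", "u", "f", "h",
         "c", "č", "dž", "š"]
TO_LATIN = dict(zip(CYRILLIC, LATIN))


def cyrillic_to_latin(text):
    """Transliterate Serbo-Croatian Cyrillic text into Gaj's Latin alphabet."""
    out = []
    for i, ch in enumerate(text):
        lat = TO_LATIN.get(ch.lower())
        if lat is None:
            out.append(ch)  # punctuation, digits, spaces: copy unchanged
            continue
        if ch.isupper():
            nxt = text[i + 1:i + 2]
            prv = text[i - 1:i] if i > 0 else ""
            # Heuristic: inside an all-caps word use NJ, otherwise Nj.
            all_caps = nxt.isupper() or (not nxt.isalpha() and prv.isupper())
            lat = lat.upper() if all_caps else lat.capitalize()
        out.append(lat)
    return "".join(out)


print(cyrillic_to_latin("Његош"))
print(cyrillic_to_latin("ЊЕМАЧКА"))
```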
Computing
In the 1990s, there was general confusion about the proper character encoding to use for writing Croatian in the Latin script on computers.
- An attempt was made to apply the 7-bit "YUSCII", later "CROSCII", which included the five letters with diacritics at the expense of five non-letter characters ([, ], {, }, @), but it was ultimately unsuccessful. Because Ž took the place of the ASCII character @, which sorts before A, this led to jokes calling it žabeceda (žaba=frog, abeceda=alphabet).
- Other short-lived vendor-specific efforts were also undertaken.
- The 8-bit ISO 8859-2 (Latin-2) standard was developed by ISO.
- MS-DOS introduced 8-bit encoding CP852 for Central European languages, disregarding the ISO standard.
- Microsoft Windows spread yet another 8-bit encoding called CP1250, which had a few letters mapped one-to-one with ISO 8859-2, but also had some mapped elsewhere.
- Apple's Macintosh Central European encoding does not include all of Gaj's Latin alphabet. Instead, a separate codepage, called the MacCroatian encoding, is used.
- EBCDIC also has a Latin-2 encoding.[25]
The preferred character encoding for Croatian today is either ISO 8859-2 or the Unicode encoding UTF-8 (which needs two bytes, or 16 bits, for each letter with a diacritic). However, as of 2010, one can still find programs as well as databases that use CP1250, CP852 or even CROSCII.
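The differences between these encodings can be inspected with Python's built-in codecs. The sketch below is illustrative only; `LETTERS`, `ENCODINGS` and `byte_table` are hypothetical names, while `iso8859_2`, `cp1250`, `cp852` and `utf-8` are codec names from the Python standard library. It prints the byte sequence that each encoding uses for the five letters with diacritics.

```python
LETTERS = "čćđšž"
ENCODINGS = ["iso8859_2", "cp1250", "cp852", "utf-8"]


def byte_table(letters=LETTERS, encodings=ENCODINGS):
    """Return {encoding: {letter: hex bytes}} for the given letters."""
    return {
        enc: {ch: ch.encode(enc).hex() for ch in letters}
        for enc in encodings
    }


for enc, row in byte_table().items():
    print(f"{enc:10}", " ".join(f"{ch}={code}" for ch, code in row.items()))
```

In this table č has the same byte in ISO 8859-2 and CP1250, whereas š does not, which is why text written in one of the two encodings and read as the other shows only some letters wrongly.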
Digraphs ⟨dž⟩, ⟨lj⟩ and ⟨nj⟩ in their upper case, title case and lower case forms have dedicated Unicode code points, as shown in the table below. However, these are included chiefly for backwards compatibility with legacy encodings which kept a one-to-one correspondence with Cyrillic; modern texts use a sequence of characters.
| Character sequence | Composite character | Unicode code point | 
|---|---|---|
| DŽ | DŽ | U+01C4 | 
| Dž | Dž | U+01C5 | 
| dž | dž | U+01C6 | 
| LJ | LJ | U+01C7 | 
| Lj | Lj | U+01C8 | 
| lj | lj | U+01C9 | 
| NJ | NJ | U+01CA | 
| Nj | Nj | U+01CB | 
| nj | nj | U+01CC | 
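The three case forms in the table can be observed with Python's standard string methods and the `unicodedata` module. The sketch below also shows one way to replace the composite characters with ordinary character sequences, as modern texts do; the names `COMPOSITES` and `expand_digraphs` are hypothetical, and the mapping simply restates the table above.

```python
import unicodedata

# Composite code points from the table, mapped to plain letter sequences.
COMPOSITES = {
    0x01C4: "D\u017d", 0x01C5: "D\u017e", 0x01C6: "d\u017e",  # DŽ Dž dž
    0x01C7: "LJ", 0x01C8: "Lj", 0x01C9: "lj",
    0x01CA: "NJ", 0x01CB: "Nj", 0x01CC: "nj",
}


def expand_digraphs(text):
    """Replace composite digraph characters with two-character sequences."""
    return text.translate(COMPOSITES)


small_dz = "\u01c6"
print(unicodedata.name(small_dz))
print(small_dz.upper() == "\u01c4")        # uppercase form
print(small_dz.title() == "\u01c5")        # titlecase form differs from uppercase
print(unicodedata.category("\u01c5"))      # general category of the titlecase form
print(expand_digraphs("\u01c6ep"))
```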
Usage for Slovene
From the early 1840s, Gaj's alphabet was increasingly used for Slovene. In the beginning, it was most commonly used by Slovene authors who treated Slovene as a variant of Serbo-Croatian (such as Stanko Vraz), but it was later accepted by a broad spectrum of Slovene-writing authors. The breakthrough came in 1845, when the Slovene conservative leader Janez Bleiweis started using Gaj's script in his journal Kmetijske in rokodelske novice ("Agricultural and Artisan News"), which was read by a wide public in the countryside. By 1850, Gaj's alphabet (known as gajica in Slovene) had become the only official Slovene alphabet, replacing three other writing systems that had circulated in the Slovene Lands since the 1830s: the traditional bohoričica, named after Adam Bohorič, who codified it; the dajnčica, named after Peter Dajnko; and the metelčica, named after Franc Serafin Metelko.
The Slovene version of Gaj's alphabet differs from the Serbo-Croatian one in several ways:
- The Slovene alphabet does not have the characters ⟨ć⟩ and ⟨đ⟩; the sounds they represent do not occur in Slovene.
- In Slovene, the digraphs ⟨lj⟩ and ⟨nj⟩ are treated as two separate letters and represent separate sounds (the word polje is pronounced [ˈpóːljɛ] or [pɔˈljéː] in Slovene, as opposed to [pôʎe] in Serbo-Croatian).
- While the phoneme /dʒ/ exists in modern Slovene and is written ⟨dž⟩, it is used only in borrowed words, and so ⟨d⟩ and ⟨ž⟩ are considered separate letters, not a digraph.
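Taken together, the three differences above amount to removing five items from the Serbo-Croatian inventory. A minimal Python sketch, with the hypothetical names `GAJ`, `NOT_SLOVENE_LETTERS` and `SLOVENE`:

```python
# The thirty letters of Gaj's alphabet as used for Serbo-Croatian.
GAJ = ["a", "b", "c", "č", "ć", "d", "dž", "đ", "e", "f", "g", "h", "i",
       "j", "k", "l", "lj", "m", "n", "nj", "o", "p", "r", "s", "š", "t",
       "u", "v", "z", "ž"]

# Not letters of the Slovene alphabet: ć and đ are absent, and the
# three digraphs count as sequences of two separate letters.
NOT_SLOVENE_LETTERS = {"ć", "đ", "dž", "lj", "nj"}

SLOVENE = [letter for letter in GAJ if letter not in NOT_SLOVENE_LETTERS]

print(len(GAJ), len(SLOVENE))
print(" ".join(SLOVENE))
```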
As in Serbo-Croatian, Slovene orthography does not make use of diacritics to mark accent in words in regular writing, but headwords in dictionaries are given with them to account for homographs. For instance, letter ⟨e⟩ can be pronounced in four ways (/eː/, /ɛ/, /ɛː/ and /ə/), and letter ⟨v⟩ in two ([ʋ] and [w], though the difference is not phonemic). Also, it does not reflect consonant voicing assimilation: compare e.g. Slovene ⟨odpad⟩ and Serbo-Croatian ⟨otpad⟩ ('junkyard', 'waste').
Usage for Macedonian
Romanization of Macedonian follows Gaj's Latin alphabet[26][27] with slight modifications. Gaj's ć and đ are not used at all; ḱ and ǵ are used instead. The rest of the letters of the alphabet represent the equivalent Cyrillic letters. Macedonian also uses the digraph dz for a sound that is not part of the Serbo-Croatian phonemic inventory. As per the orthography, both lj and ĺ are accepted as romanisations of љ, and both nj and ń for њ. For informal purposes, such as texting, most Macedonian speakers omit the diacritics or use a digraph- and trigraph-based system for ease, as there is no Macedonian Latin keyboard supported on most systems. For example, š becomes sh or s, and dž becomes dzh or dz.
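The scheme described above can be sketched as a letter-by-letter mapping in Python. This is a simplified illustration, not an official transliteration tool: the names `MK_TO_LATIN` and `romanize_macedonian` are hypothetical, only lowercase input is handled, and the digraph variants lj and nj are chosen over ĺ and ń.

```python
# Macedonian Cyrillic letters and their Gaj-based romanization.
MK_TO_LATIN = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d",
    "ѓ": "\u01f5",   # ǵ, used instead of Gaj's đ
    "е": "e", "ж": "ž", "з": "z",
    "ѕ": "dz",       # no counterpart in Serbo-Croatian
    "и": "i", "ј": "j", "к": "k", "л": "l", "љ": "lj", "м": "m",
    "н": "n", "њ": "nj", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t",
    "ќ": "\u1e31",   # ḱ, used instead of Gaj's ć
    "у": "u", "ф": "f", "х": "h", "ц": "c", "ч": "č", "џ": "dž", "ш": "š",
}


def romanize_macedonian(text):
    """Romanize lowercase Macedonian Cyrillic; other characters pass through."""
    return "".join(MK_TO_LATIN.get(ch, ch) for ch in text)


print(romanize_macedonian("куќа"))
print(romanize_macedonian("ѕвезда"))
```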
Keyboard layout
The standard Gaj's Latin alphabet keyboard layout for personal computers is as follows:
See also
- Glagolitic alphabet
- Yugoslav braille
- Yugoslav manual alphabet
- Romanization of Serbian – describes usage not the alphabet
- Romanization of Montenegrin – describes usage not the alphabet
Notes
References
- [1] Babić et al. 2007, p. 173.
- [2] Žagarová & Pintarić 1998, p. 129.
- [3] Babić et al. 2007, pp. 115, 173.
- [4] Žagarová & Pintarić 1998, p. 130.
- [5] Пипер, Клајн & Драгичевић 2022, p. 19.
- [6] Badurina, Marković & Mićanović 2008, p. 5.
- [7] Halilović 2017, pp. 11, 141.
- [8] Пешикан, Јерковић & Пижурица 2010, p. 17.
- [9] Mihaljević, Milica (2003). "Internetsko nazivlje u govornim medijima". Govor. 20 (1–2). Zagreb: Hrvatsko filološko društvo: 267.
- [10] Halilović 2017, p. 11.
- [11] Badurina, Marković & Mićanović 2008, p. 3.
- [12] Badurina, Marković & Mićanović 2008, pp. 107–108.
- [13] Пешикан, Јерковић & Пижурица 2010, pp. 139–140.
- [14] Badurina 2012, p. 69.
- [15] Badurina 2012, pp. 73, 77.
- [16] Maretić 1889, passim.
- [17] Comrie, Bernard; Corbett, Greville G., eds. (2003). The Slavonic Languages. London: Taylor & Francis. p. 45. ISBN 978-0-203-21320-9. Retrieved 23 December 2013. "Following Vuk's reform of Cyrillic (see above) in the early nineteenth century, Ljudevit Gaj in the 1830s performed the same operation on Latinica, using the Czech system and producing a one-to-one symbol correlation between Cyrillic and Latinica as applied to the Serbian and Croatian parallel system."
- [18] Babić et al. 2007, p. 176.
- [19] Maretić 1963, p. 25.
- [20] Daničić 1975–1976, pp. 5–9, Dodatak: Materijali o rječniku.
- [21] Moguš 2009, p. 185.
- [22] "Ђа̑лскӣ". Речник српскохрватског књижевног и народног језика. Књига V (дугуљан—закључити). Београд: Институт за српскохрватски језик. 1968.
- [23] Deanović, Mirko; Jernej, Josip (1975). "Đúro". Hrvatsko ili srpsko-talijanski rječnik (4th ed.). Zagreb: Školska knjiga.
- [24] Šimunović, Petar (2009). Uvod u hrvatsko imenoslovlje. Zagreb: Golden Marketing - Tehnička knjiga. p. 129.
- [25] "IBM Knowledge Center". www.ibm.com/us-en. Archived from the original on 2022-11-09. Retrieved 2023-09-29.
- [26] Lunt, Horace G. (1952). Grammar of the Macedonian Literary Language. Skopje.
- [27] Macedonian Latin alphabet, Pravopis na makedonskiot literaturen jazik, B. Vidoeski, T. Dimitrovski, K. Koneski, K. Tošev, R. Ugrinova Skalovska - Prosvetno delo Skopje, 1970, p. 99.
Sources
- Anić, Vladimir; Silić, Josip (1987). Pravopisni priručnik hrvatskoga ili srpskoga jezika (in Croatian) (2nd ed.). Zagreb: Liber / Školska knjiga.
- Babić, Stjepan; Brozović, Dalibor; Škarić, Ivo; Težak, Stjepko (2007). Glasovi i oblici hrvatskoga književnoga jezika. Velika hrvatska gramatika. Vol. 1. Zagreb: Globus / HAZU. ISBN 978-953-167-202-3.
- Badurina, Lada; Marković, Ivan; Mićanović, Krešimir (2008). Hrvatski pravopis (2nd ed.). Zagreb: Matica hrvatska.
- Badurina, Lada (2012). "Hrvatski slovopis i pravopis u predstandardizacijskome razdoblju" (PDF). In Mićanović, Krešimir (ed.). Povijest hrvatskoga jezika / Književnost i kultura devedesetih: Zbornik radova 40. seminara Zagrebačke slavističke škole. Zagreb: Zagrebačka slavistička škola. p. 65-96. ISBN 978-953-175-431-6.
- Daničić, Đuro (1975–1976) [1878]. "Ogled". In Pavešić, Slavko; Jonke, Ljudevit (eds.). Rječnik hrvatskoga ili srpskoga jezika: Dio XXIII (2. zlotvor – žvuknuti / popis izvora, dodatak). Zagreb: JAZU.
- Halilović, Senahid (2017). Pravopis bosanskoga jezika (2nd ed.). Sarajevo: Slavistički komitet.
- Maretić, Tomo (1889). Istorija hrvatskoga pravopisa latinskijem slovima (PDF). Zagreb: JAZU.
- Maretić, Tomo (1963) [1899]. Gramatika hrvatskoga ili srpskoga književnog jezika (3rd ed.). Zagreb: Matica hrvatska.
- Jojić, Ljiljana (2003). Pravopisni priručnik - dodatak Velikom rječniku hrvatskoga jezika (in Croatian). Zagreb: Novi liber.
- Moguš, Milan; Vončina, Josip (1969). "Latinica u Hrvata". Radovi Zavoda za slavensku filologiju. 11. Zagreb: Sveučilište u Zagrebu, Filozofski fakultet: 61–81.
- Moguš, Milan (2009). Povijest hrvatskoga književnoga jezika (3rd ed.). Zagreb: Globus.
- Пешикан, Митар; Јерковић, Јован; Пижурица, Мато (2010). Правопис српскога језика. Нови Сад: Матица српска.
- Пипер, Предраг; Клајн, Иван; Драгичевић, Рајна (2022) [2013]. Нормативна граматика српског језика (4th ed.). Нови Сад: Матица српска. ISBN 978-86-7946-377-7.
- Vončina, Josip (1985). "Temelji i putovi Gajeve grafijske reforme". Filologija. 13. Zagreb: JAZU: 7–88.
- Žagarová, Margita; Pintarić, Ana (July 1998). "O nekim sličnostima i razlikama između hrvatskoga i slovačkoga jezika" [On some similarities and differences between Croatian and Slovakian]. Jezikoslovlje (in Croatian). 1 (1). Filozofski fakultet u Osijeku: 129–134. ISSN 1331-7202.