Estonian orthography

From Vero - Wikipedia
Jump to navigation Jump to search

Template:Short description

ABD ehk Luggemise-Ramat Lastele ("ABD or the Reading Book for Children") by Otto Wilhelm Masing, published in 1795, was the first Estonian alphabet book with a separate title page. The abbreviation "ABC" was not used for the alphabet as the letter C does not appear in Estonian vocabulary.<ref>Sven-Erik Soosaar & Urmas Sutrop: Eesti keel. Estonian Institute, 2019. Template:ISBN.</ref>

Estonian orthography is the system used for writing the Estonian language and is based on the Latin alphabet. The Estonian orthography is generally guided by phonemic principles, with each grapheme corresponding to one phoneme.

Alphabet

Due to German and Swedish influence, the Estonian alphabet (Template:Langx) has the letters Ä, Ö, and Ü (A, O, and U with diaeresis), which represent the vowel sounds Template:IPAblink, Template:IPAblink and Template:IPAblink, respectively. Unlike German umlauts, they are considered, and alphabetised as, separate letters. The most distinctive letter in the Estonian alphabet, however, is the Õ (O with tilde), which was added to the alphabet in the 19th century by Otto Wilhelm Masing and stands for the vowel Template:IPAblink. In addition, the alphabet also differs from the Latin alphabet by the addition of the letters Š and Ž (S and Z with caron/háček), and by the position of Z in the alphabet: it has been moved from the end to between S and T (or Š and Ž).

The official Estonian alphabet has 27 letters: A, B, D, E, F, G, H, I, J, K, L, M, N, O, P, R, S, Š, Z, Ž, T, U, V, Õ, Ä, Ö, Ü. The letters F, Š, Z, Ž are so-called "foreign letters" (Template:Lang), and occur only in loanwords and proper names.

Additionally, C, Q, W, X, and Y are "foreign letters" used only in writing foreign proper names. These letters do not occur in any Estonian words, and thus are not usually considered part of the "Estonian proper" alphabet. Including all the foreign letters, the entire alphabet consists of the following 32 letters:

Letter IPA Name Notes Letter IPA Name Notes
A a Template:IPA aa Template:IPA Q q kuu Template:IPA Template:Efn
B b Template:IPA bee Template:IPA R r Template:IPA err Template:IPA or ärr Template:IPA
C c tsee Template:IPA Template:Efn S s Template:IPA ess Template:IPA
D d Template:IPA dee Template:IPA Š š Template:IPA or Template:IPA šaa Template:IPA Template:Efn
E e Template:IPA ee Template:IPA Z z Template:IPA zett Template:IPA Template:Efn
F f Template:IPA or Template:IPA eff Template:IPA Template:Efn Ž ž Template:IPA žee Template:IPA Template:Efn
G g Template:IPA gee Template:IPA T t Template:IPA or Template:IPA tee Template:IPA
H h Template:IPA haa Template:IPA or ašš Template:IPA U u Template:IPA uu Template:IPA
I i Template:IPA ii Template:IPA V v Template:IPA vee Template:IPA
J j Template:IPA jott Template:IPA W w kaksisvee Template:IPA Template:Efn
K k Template:IPA or Template:IPA kaa Template:IPA Õ õ Template:IPA õõ Template:IPA
L l Template:IPA ell Template:IPA Ä ä Template:IPA ää Template:IPA
M m Template:IPA emm Template:IPA Ö ö Template:IPA öö Template:IPA
N n Template:IPA enn Template:IPA Ü ü Template:IPA üü Template:IPA
O o Template:IPA oo Template:IPA X x iks Template:IPA Template:Efn
P p Template:IPA or Template:IPA pee Template:IPA Y y igrek Template:IPA or üpsilon Template:IPA Template:Efn

Template:Notelist In Blackletter script W was used instead of V. In some reference works in the past, V and W were sorted as if they were one and the same letter.

Johannes Aavik suggested that the letter Ü be replaced by Y, as it has been in the Finnish alphabet.

Double letters are used to write half-long and overlong vowels and consonants, e. g. aa Template:IPA or Template:IPA, nn Template:IPA or Template:IPA, kk Template:IPA. For more information, see below.

As the distinction between voiced and voiceless plosives is not native to Estonian, the names of the letters 'b', 'd', 'g' may be pronounced Template:IPA, Template:IPA, Template:IPA, so the letters 'b' and 'd' are also named Template:Lang (Template:Gloss) and Template:Lang (Template:Gloss) to distinguish them from Template:Lang (Template:Gloss) and Template:Lang (Template:Gloss). About usage of these letters, see below.

Orthographic principles

Although Estonian orthography is generally guided by phonemic principles, with each grapheme corresponding to one phoneme, there are some historical and morphological deviations from this: for example the initial letter h in words, preservation of the morpheme in declension of the word (writing b, g, d in places where p, k, t is pronounced) and in the use of i and j. Where it is impractical or impossible to type š and ž, they are substituted with sh and zh in some written texts, although this is considered incorrect. Otherwise, the h in sh represents a voiceless glottal fricative, as in Template:Lang (pas-ha); this also applies to some foreign names.

Some features of the modern Estonian orthography are:

Syllabification

One consonant between two vowels belongs to the following syllable: Template:Lang 'fish' is syllabified ka-la. Consonant combinations are syllabified before the last consonant: Template:Lang 'town [[[:Template:Abbr]] Template:Abbr]' is syllabified lin-na, Template:Lang 'acquaintance' is syllabified Template:Lang. Consonant digraphs and trigraphs in foreign names are regarded as single consonants: Template:Lang is syllabified Man-ches-ter. Two vowels usually form a long vowel or a diphthong, e. g. Template:Lang 'song' is syllabified lau-lu. However, a hiatus is formed in morpheme bounds, e. g. Template:Lang 'opening' is syllabified a-va-us as the word is composed from the root ava- and the suffix -us. Combinations of three vowel letters represent a hiatus of a long vowel or a diphthong and another vowel, e. g. Template:Lang 'dry, droughty, arid (lacking rain)' is syllabified põu-a-ne; but some loanwords have a hiatus of a short vowel followed by a long vowel: Template:Lang 'oasis' is syllabified o-aas. Compound words are syllabified as combinations of their parts: Template:Lang 'grandmother' is syllabified as va-na-e-ma as the word is composed from Template:Lang 'old' and Template:Lang 'mother'. Etymologically compound loanwords and foreign names may be syllabified as compound or simple words: Template:Lang 'photographer' is syllabified fo-to-graaf or fo-tog-raaf, Template:Lang is syllabified Pet-ro-grad or Pet-rog-rad.

These syllabification rules are used for hyphenating words at the end of line, with the additional rule that a single letter is not left on a line.

Foreign words

Loanwords are normally adapted to Estonian spelling: Template:Lang 'web', Template:Lang 'jazz'. However, foreign words and phrases sometimes may be used in the original spelling, such as Latin phrases, Italian musical terms, exotic words. Such citations are typographically emphasized using italics and declined using an apostrophe: croissant'id 'croissants'.

Foreign proper names from Latin-script languages are written in their original spelling: Margaret Thatcher, Bordeaux. Names from non-Latin-script languages are written using either Estonian orthographic transcription or established romanization systems. Some geographical names (and some names of historical personalities, such as monarchs) have traditional Estonian forms (including some adapted spellings such as Template:Lang for German Template:Lang 'Vienna').

Derivations from foreign proper names with the suffixes -lik, -lane, -lus, -ism, -ist usually conserve the spelling of names (e. g. Template:Lang, Template:Lang), but a few are adapted by established tradition: Template:Lang, Template:Lang, Template:Lang. Derivations without suffixes or with other suffixes are adapted to Estonian spelling: Template:Lang 'newton' (unit of force), Template:Lang (maxillary sinusitis, inflammation of the antrum of Highmore), Template:Lang 'ytterbium' (chemical element), etc.

Expressions such as Template:Lang 'degree Celsius', Template:Lang 'Cheddar cheese' conserve the spelling of proper names (adding case endings). However, names of plants and animals are usually written in adapted forms, e.g. Template:Lang 'Colorado beetle'.

Apostrophe is used when adding case endings to proper names with unusual grapheme-to-phoneme correspondences (such as ending on a consonant orthographically but on a vowel phonetically or vice versa), e.g. Template:Lang (genitive of Provence).

Capitalization

Capital letters are written at the beginning of the first word in a sentence, proper names, and official names functioned as proper names. May be used in the pronouns Template:Lang 'you (singular)' and Template:Lang 'you (plural, also used as formal singular)' to show respect.

Names of months, days of the week, holidays, Chinese zodiac years, and titles of people such as Template:Lang are not capitalized.

Titles of books, films, etc. are written in quotation marks with only the first word and proper names capitalized.

Compound words

Compound words are written as one word, but they are often composed of genitive+nominative and hard to distinguish from simple word combinations. A compound word is considered a single word and written together when: 1) it has a separate meaning, e. g. Template:Lang 'chapter' but Template:Lang 'part of a head'; 2) it is different from the genitive+nominative combination, e. g. Template:Lang (nominative+nominative) 'watermill'; 3)some combinations may be together or separately, but writing them together is preferred in more complex word phrases: Template:Lang 'member of a party' — Template:Lang 'every member of the party'. Rare and long word combinations are typically written separately.

The hyphen is used: 1) in compounds where one of the parts is a letter (Template:Lang 'vitamin C'), an initialism (Template:Lang 'text TV'), a foreign citation (Template:Lang 'joke show') or a word part (Template:Lang 'word containing Template:Lang); 2) in compound adjectives where the first part as a proper name; 3) in compound geographical names such as Template:Lang 'South Estonia'; 4) as a suspended hyphen, e.g. Template:Lang 'gold and silver things' (also in compound words such as Template:Lang 'export-import bank'); 5) in "nominative+ablative" adverbs, e. g. Template:Lang 'day after day'; 6) in dvandva compounds, e. g. Template:Lang 'father and mother'; 7) in compound adjectives from word phrases, e.g. Template:Lang 'related to tentative phonetics'; 8) in compound adjectives with coordinating meaning, e.g. Template:Lang 'Estonian-English dictionary'; 9) in double names such as Template:Lang. It can be optionally used in unusual compounds such as Template:Lang 'gooseberry disease'; in compounds with three or four identical letters in a row (e. g. Template:Lang 'yearly', Template:Lang 'bone groove'); in compounds with numbers (see below) or with signs (e.g. Template:Lang '+ sign'); in the construction 'genitive of a proper name + nominative' after another genitive (e. g. Template:Lang 'European part of Russia'); in the colloquial construction 'genitive of a proper name + noun' instead of 'noun + proper name', e. g. Template:Lang instead of Template:Lang 'Uncle Kuusk'; in ad hoc compounds such as Template:Lang; in words from two-or-more-component proper names, e. g. Template:Lang.

Abbreviations

The abbreviation period (full stop) may be used, but it is not mandatory. Commonly used abbreviations are usually written without the abbreviation period: Template:Lang, Template:Lang, or Template:Lang for Template:Lang 'street'; Template:Lang for Template:Lang 'see'; Template:Lang for Template:Lang 'and many others'. Using the abbreviation period is recommended when an abbreviation may be misread as another word: Template:Lang for Template:Lang 'figure, draft' but Template:Lang 'line'. If an abbreviation of a word phrase may be mistaken for a word or for another abbreviation, periods are used after every letter but the last one, and spaces are not used: Template:Lang for Template:Lang but Template:Lang 'mother', Template:Lang for Template:Lang but Template:Lang for Template:Lang 'economy'.

The hyphen is used in some abbreviations of compound words, e. g. Template:Lang for Template:Lang 'doctor of pedagogy', Template:Lang for Template:Lang 'capitan lieutenant', especially in the construction abbreviation + complete word, such as Template:Lang for Template:Lang 'reinforced concrete panels'.

Numerals

Numerals may be written in words (Template:Lang 'one', Template:Lang 'two', Template:Lang 'three'...) or in figures (1, 2, 3, ...). In Estonian texts, the comma is used as the decimal separator, and the space is used as thousands separator (in financial documents, the point can be used as thousands separator to avoid inserting an extra digit). The point as a separator is used for dates, daytime, prices, and sports results in meters and centimeters. For prices in euros and cents, writing € 84.95 as well as 84,95 € is accepted. Daytime in hours and minutes (24-hour format) may be written using the point or the colon (without spaces): 16.15 or 16:15; but seconds are separated by the point: 16:15.25. The colon with spaces is used for ratios. e. g. 2 : 3.

When written in words, numerals with Template:Lang or Template:Lang (11 to 19), Template:Lang (tens) and Template:Lang (hundreds) are written together, e.g. Template:Lang 'fifteen', Template:Lang 'fifty', Template:Lang 'five hundred'. Other compound numerals are written separately: Template:Lang 'twenty-five'.

For writing ordinal numbers in figures, the ordinal dot is used: 16. for Template:Lang 'the sixteenth'. In some cases, ordinals are written as Roman numerals (without the ordinal dot). Roman numerals followed by a dot may be used in numbered lists.

Case forms of cardinal and ordinal numerals may be written in the form "figures+case ending" with or without a hyphen: 16s or 16-s for Template:Lang 'sixteen [inessive]', Template:Lang or Template:Lang for Template:Lang 'the sixteenth [inessive]'. For case endings beginning with the letter l, the hyphen is mandatory to avoid confusion with the digit 1: 16-le for Template:Lang 'sixteen [allative]'. Case endings after figures are not used when a cardinal or ordinal numeral is in a case concordance with a following noun. Likewise, compound words with numbers written in figures may be written with or without the hyphen: Template:Lang or Template:Lang for Template:Lang '60-watt light'.

Punctuation

The period (full stop) is used at the end of sentences, as an ordinal mark and sometimes as an abbreviation mark and as a number separator (see above).

The comma is used for appositions (but appositions in genitive require the comma only before them), for more than one attribute after a determined word, for enumerations (but the serial comma is not used), between coordinated or subordinated clauses, between direct speech and author's words, before and after parenthetic or vocative phrases, and before and after some other constructions. It is also used between placenames and dates in the nominative case (but not in locative cases); between a surname and a given name, if they are written in this order; before parts of and address; and as a decimal mark.

The colon is used before lists, before direct speech, before explanations, and also in writing daytime and ratios (see above).

The semicolon is used between weakly related parts of sentences, especially containing commas.

The hyphen is used for writing compound words (see above). It is also used for hyphenating words at the end of line, for declining letters and abbreviations, and optionally for declining acronyms/initialisms, numbers, and symbols.

The dash is used when there appears a generalizing word after an enumeration; instead of the comma for accenting clauses and appositions or for relatively long parenthetical constructions; before words indicating surprise; for slight pauses (interchangeably with the ellipsis); in the meaning "from...to" (instead of the word kuni); for indicating lines or routes (when in attributive function, the hyphen is also accepted); between coordinated attributes if at least one attribute has a hyphen or a space; between remarks of a dialogue written as one line without author's words; as a marker before enumeration items. The dash is not used to indicate omission of a word that would be repeated.

The exclamation and question marks are used at the end of exclamative and interrogative sentences. Occasionally, they may be parenthesized and written after words within sentences to show doubt or surprise. The exclamation mark is also used for addressing people in letters, e.g. Template:Lang. Using the comma or the colon in this case is considered inappropriate.

The quotation marks, written as „ ”, are used for direct speech, citations, scare quotes, and names of books, documents, episodes, enterprises, etc. Names of plant sorts may be written in double or in single quotation marks (looking like apostrophes: ’ ’) and are normally italicized. For cited words and phrases, including words in a linguistic context, quotation marks or italics may be used. Quotation marks are not used in the names of institutions, periodicals, awards, wares, and vehicles.

The apostrophe is used for adding case endings and suffixes to foreign names with unusual grapheme-to-phoneme correspondences and to foreign citations in the original spelling (see above). Sometimes the apostrophe is used for adding case endings and suffixes to Estonian names, to make the original form clear: Template:Lang (allative of the surname Template:Lang), Template:Lang (the apostrophe is used to conserve the spelling of the surname Template:Lang, otherwise the double consonant would become a single consonant). Also, the apostrophe is sometimes used in poetry to indicate omission of a sound: Template:Lang, Template:Lang, Template:Lang instead of Template:Lang, Template:Lang, Template:Lang are found in Lydia Koidula's poems. Single quotation marks (' ') are used for word meanings in a linguistic context.

The parentheses are used for parenthetical words or sentences, and also for optional parts of words in a linguistic context.

The square brackets are used for citer's notes to citations and for showing pronunciation in linguistic and reference works.

The slash is used for division in fractions and unit symbols, for connecting alternatives, to show line breaks when citing poetry in the single-line format, and for non-calendar years. In practice, it occasionally appears in abbreviations made of more than one word (e. g. Template:Lang for Template:Lang 'school year'), but this usage is considered nonstandard (correct abbreviation: Template:Lang). Spaces are used before and after the slash only if it separates text fragments of more than one word.

The ellipsis is used for slight pauses and for unfinished thoughts. It is surrounded by spaces. Also, the ellipsis is used for bowdlerizing obscene words.

History

Modern Estonian orthography is based on the Newer Orthography created by Eduard Ahrens in the second half of the 19th century based on Finnish orthography. The Older Orthography it replaced was created in the 17th century by Bengt Gottfried Forselius and Johann Hornung based on standard German orthography. In the old orthography, single consonants following short vowels were written double even if they are short (Template:Lang 'fish' was written as Template:Lang) and long vowels in an open syllable were written single (Template:Lang 'to create' was written as Template:Lang). Before Otto Wilhelm Masing introduced the letter õ in the early 19th century, its sound had not been distinguished in writing from ö. Earlier writing in Estonian had by and large used an ad hoc orthography based on Latin and Middle Low German orthography. Some influences of the standard German orthography — for example, writing W/w instead of V/v persisted well into the 1930s.

In Fraktur typesetting (which was common in Estonian publications before the first half of the 20th century), two kinds of the small letter s were distinguished: the short s and the long ſ. The long ſ was used at the beginning and in the middle of syllables, and the short s was used at the end of syllables. For example: Template:Lang 'cat' — Template:Lang 'cat [gen. sg., part. sg.]'.

In the second half of the 20th century, some Estonian words and names were quoted in international publications from Soviet sources, and were often in fact spelled as incorrect back-transliterations from Russian Cyrillic. Such examples of Russian transliteration include the use of я ("ya") for ä (e.g. Pyarnu (Пярну) for Pärnu), ы ("y") for õ (e.g., Pylva (Пылва) for Põlva) and ю ("yu") for ü (e.g., Pyussi (Пюсси) for Püssi).

See also

References

Template:Reflist

Template:Language orthographies