The acute accent (Welsh: acen ddyrchafedig), the grave accent (Welsh: acen ddisgynedig), the circumflex (Welsh: acen grom, to bach, or hirnod) and the diaeresis mark (Welsh: didolnod) are also used on vowels, but accented letters are not regarded as part of the alphabet.
The letter j has been accepted into Welsh orthography only relatively recently: for use in those words borrowed from English in which the /d?/ sound is retained in Welsh, even where that sound is not represented by j in English spelling, as in garej ("garage") and ffrij ("fridge"). Older borrowings of English words containing /d?/ resulted in the sound being pronounced and spelt in various other ways, resulting in occasional doublets such as Siapan and Japan ("Japan").
The letters k, q, v, x and z are sometimes used in technical terms, like kilogram, volt and zero, but in all cases can be, and often are, replaced by Welsh letters: cilogram, folt and sero.
The earliest samples of written Welsh date from the 6th century and are in the Latin alphabet (see Old Welsh). The orthography differs from that of modern Welsh, particularly in the use of p, t and c to represent the voiced plosives /b, d, ?/ in the middle and at the end of words. Similarly, the voiced fricatives /v, ð/ were written with b and d.
By the Middle Welsh period, this had given way to much variability: although b, d and g were now used to represent /b, d, ?/, these sounds were also often written as in Old Welsh, while /v/ could be denoted by u, v, f or w. In earlier manuscripts, moreover, fricatives were often not distinguished from plosives (e.g. t for /?/, the sound now written with th). The grapheme k was also used, unlike in the modern alphabet, particularly before front vowels. The disuse of this letter is at least partly due to the publication of William Salesbury's Welsh New Testament and William Morgan's Welsh Bible, whose English printers, with type letter frequencies set for English and Latin, did not have enough k letters in their type cases to spell every /k/ sound as k, so the order went "C for K, because the printers have not so many as the Welsh requireth"; this was not liked at the time, but has become standard usage.
In this period, ð (capitalised as Ð) was also used as a letter, interchangeable with dd, such as the passage in the 1567 New Testament: A Dyw y sych ymaith yr oll ðeigre oddiwrth y llygeid, which contains both ð and dd. Elsewhere, the same word is spelt in different ways, e.g. newydd and newyð.
The printer and publisher Lewis Jones, one of the co-founders of Y Wladfa, the Welsh-speaking settlement in Patagonia, favoured a limited spelling reform which replaced Welsh f and ff - standing for /v/ and /f/ - with the letters v and f (as in English), and from circa 1866 to 1886 Jones employed this innovation in a number of newspapers and periodicals he published and/or edited in the colony. However, the only real relic of this practice today is the Patagonian placename Trevelin ("mill town"), which in standard Welsh orthography would be Trefelin.
In 1928 a committee chaired by Sir John Morris-Jones standardised the orthography of modern Welsh.
In 1987, a committee chaired by Professor Stephen J. Williams made further small changes. The conventions established by these committees are not adhered to by all modern writers.
"N" and "S" indicate variants specific to the northern and southern dialects of Welsh. Throughout Wales an alternative system is also in use in which all consonant letters are named using the corresponding consonant sound plus a schwa (e.g. cy /k?/ for èc). In this system the vowels are named as below.
|Letter||Name||Corresponding sounds||English approximation|
|a||a||/a, ?:/||cat (short) / father (long)|
|ch||èch||/?/||No English equivalent; similar to loch in Scottish|
|e||e||/?, e:/||bed (short) / closest to hey (long)|
|i||i, i dot (S)||/?, i:, j/||bit (short) / machine (long) / yes (as consonant; before vowels)|
|j||je||/d/||jump (only found in loanwords, usually from English)|
|ll||èll||/?/||not present in English; a voiceless alveolar lateral fricative. A bit like what the consonant cluster "hl" would sound like.|
|o||o||/?, o:/||Short, like "bog" in RP; long like stove in Scottish English, North Central American English and Standard Canadian English|
|rh||rhi||/r?/||Voiceless rolled R|
|u||u (N), u bedol (S)||/, ?:/ (N),[* 3]
/?, i:/ (S)
|for Southern variants: bit (short) / machine (long); in Northern dialects /, ?:/ not found in English. Identical to "î" and "â" in Romanian, and similar to the "e" in English roses.|
|w||w||/?, u:, w/||book (short) / pool (long) / wet (as consonant)|
|y[* 4]||?||/, ?:, ?/ (N),[* 3]
/?, i:, ?, ?:/ (S)
|for Southern variants: bit (final syllable, short) / machine (final syllable, long) |
above (other places, short) / roses /, ?:/, found in certain dialects of English that differentiate "Rosa's" and "roses", for example, General American.
|Orthography||Northern dialects||Southern dialects||English (approximation only)|
|ae||/?:/, /e:/||/ai?/, /?i?/||eye, may|
|au||/a/, /a/||/ai?/, /?/||eye. Realised as bet (south) and cat (north) in plural endings.|
|ei||/?i?/||/?i?/||As in eight|
|eu||//||/?i?/||As in hight|
|ew||/?u?, e:u?/||/?u?/||Roughly like Edward with the d removed: E'ward, or Cockney pronunciation of -ell in words like well, hell.|
|ey||/e./||/e.?/||Two distinct vowels.|
|iw||/?u?/||/?u?/||not usually present in English except in the interjection Ew!; closest to 'i-oo' (short i). A small number of English dialects have this sound in words that have "ew" or "ue". Such words, in the majority of English dialects that distinguish ew/ue and oo, would usually have /ju/ instead. See the Phonological history of English consonant clusters article for more information.|
|ow||/?u?/||/?u?/||throw, Owen, owe|
|uw||/?u?/||/?u?/||not present in English; closest to 'i-oo' (short i)|
|wy||/, u/||/i/||not present in English; closest to gooey|
|yw||/?u?, ?u?/||/?u?, ?u?/||/?u?/ not present in English; closest to 'i-oo' (short i)|
/?u/ like "goat" in Received Pronunciation or like "house" in Canadian English
Welsh makes use of a number of diacritics.
The circumflex (^) is mostly used to mark long vowels, so â, ê, î, ô, û, ?, ? are always long. However, not all long vowels are marked with a circumflex, so the letters a, e, i, o, u, w, y with no circumflex do not necessarily represent short vowels; see "Predicting vowel length from orthography" below.
The grave accent (`) is sometimes used, usually in words borrowed from another language, to mark vowels that are short when a long vowel would normally be expected, e.g. pas /p?:s/ (a cough), pàs /pas/ (a pass/permit or a lift in a car); mwg /mu:?/ (smoke), m?g /m/ (a mug).
The acute accent (´) is sometimes used to mark a stressed final syllable in a polysyllabic word. Thus the words gwacáu (to empty) and dicléin (decline) have final stress. However, not all polysyllabic words with final stress are marked with the acute accent (Cymraeg "Welsh", for example, is written with none). The acute may also be used to indicate that a letter w represents a vowel where a glide might otherwise be expected, e.g. g?raidd /'.raið/ (two syllables) "manly", as opposed to gwraidd /'raið/ (one syllable) "root".
Similarly, the diaeresis (¨) is used to indicate that two adjoining vowels are to be pronounced separately (not as a diphthong). However, it is also used to show that the letter i is used to represent the cluster /ij/ which is always followed by another vowel, e.g. copïo (to copy) pronounced /k?.'pi.j?/, not */'k?p.j?/.
The grave and acute accents in particular are very often omitted in casual writing, and the same is true to a lesser extent of the diaeresis. The circumflex, however, is usually included. Accented vowels are not considered distinct letters for the purpose of collation.
As mentioned above, vowels marked with the circumflex are always long, and those marked with the grave accent are always short. If a vowel is not marked with a diacritic, its length must be determined by its environment; the rules vary a bit according to dialect.
In all dialects, only stressed vowels may be long; unstressed vowels are always short.
An unmarked (stressed) vowel is long:
An unmarked vowel is short:
The vowel y, when it is pronounced /?/, is always short[contradictory] even when it appears in an environment where other vowels would be long: cyfan (whole) /'k?van/. When pronounced as a close or near-close vowel (/?/ or // in the North, /i/ or /?/ in the South), y follows the same rules as other vowels: dydd (day) /'d?:ð/ (North) ~ /'di:ð/ (South), gwynt (wind) /'?wnt/ (North) ~ /'?w?nt/ (South).
Before l, m, n, and r, unmarked vowels are long in some words and short in others: gwin /?wi:n/ (wine), prin /pr?n/ (scarcely); hen /he:n/ (old), pen /p?n/ (head); dyn /d?:n/ ~ /di:n/ (man), gwyn /?wn/ ~ /?w?n/ (white); stwmo /'stu:mo/ (bank up a fire), amal /'amal/ (often); celyn /'ke:l?n/ (holly), calon /'kal?n/ (heart). (The last four examples are given in South Welsh pronunciation only since vowels in nonfinal syllables are always short in North Welsh.) Before nn and rr, vowels are always short: onn /'?n/ (ash trees), ennill /'?n/ (to win), carreg /'kar/ (stone).
In Northern dialects, long vowels are stressed and appear in the final syllable of the word. Vowels in non-final syllables are always short. In addition to the rules above, a vowel is long in the North before a consonant cluster beginning with s: tyst /t?:st/ (witness). Before ll, a vowel is short when no consonant follows the ll: gwell (better) /?w/ It is long when another consonant does follow the ll: gwallt /?w?:?t/ (hair).
In Southern dialects, long vowels may appear in a stressed penultimate syllable as well as in a stressed word-final syllable. Before ll, a stressed vowel in the last syllable can be either long (e.g. gwell "better" /?we:?/) or short (e.g. twll "hole" /t/). However, a stressed vowel in the penult before ll is always short: dillad /'dad/ (clothes). Before s, a stressed vowel in the last syllable is long, as mentioned above, but a stressed vowel in the penult is short: mesur (measure) /'m?sir/. Vowels are always short before consonant clusters: sant /sant/ (saint), gwallt /?wa?t/ (hair), tyst /t?st/ (witness).
While the digraphs ch, dd, ff, ng, ll, ph, rh, th are each written with two symbols, they are all considered to be single letters. This means, for example that Llanelli (a town in South Wales) is considered to have only six letters in Welsh, compared to eight letters in English. Consequently, they each take up only a single space in Welsh crosswords. Ll itself had actually been written as a ligature in Middle Welsh.
Sorting is done in correspondence with the alphabet. For example, la comes before ly, which comes before lla, which comes before ma. Automated sorting may occasionally be complicated by the fact that additional information may be needed to distinguish a genuine digraph from a juxtaposition of letters; for example llom comes after llong (in which the ng stands for /?/) but before llongyfarch (in which n and g are pronounced separately as //).
Although the digraphs above are considered to be single letters, only their first component letter is capitalised when a word in lower case requires an initial capital letter. Thus:
The two letters in a digraph are both capitalised only when the whole word is in uppercase: