Vietnamese /ˌviɛtnəˈmiːz/ (Tiếng Việt) is an Austroasiatic language that originated in the north of modern-day Vietnam, where it is the national and official language. It is the native language of the Vietnamese (Kinh) people, as well as a first or second language for the many ethnic minorities of Vietnam. As a result of Vietnamese emigration and cultural influence, Vietnamese speakers are found throughout the world, notably in East and Southeast Asia, North America, Australia and Western Europe. Vietnamese has also been officially recognized as a minority language in the Czech Republic.

It is part of the Austroasiatic language family, of which it has by far the most speakers (several times as many as the other Austroasiatic languages combined).[6] Vietnamese vocabulary has borrowings from Chinese, and the language was formerly written with a modified set of Chinese characters, called chữ nôm, that were given vernacular pronunciations. The Vietnamese alphabet (quốc ngữ) in use today is a Latin alphabet with additional diacritics for tones and certain letters.


Geographic distribution

As the national language, Vietnamese is spoken throughout Vietnam by ethnic Vietnamese and by Vietnam's many minorities. Vietnamese is also the native language of the Gin minority group in southern Guangxi Province in China.[7] A significant number of native speakers also reside in neighboring Cambodia and Laos.

In the United States, Vietnamese is the sixth most spoken language, with over 1.5 million speakers, who are concentrated in a handful of states. It is the third most spoken language in Texas, fourth in Arkansas and Louisiana, and fifth in California.[8] Vietnamese is the seventh most spoken language in Australia.[9] In France, it is the most spoken Asian language and the eighth most spoken immigrant language at home.[10]

Official status

Vietnamese is the sole official and national language of Vietnam. It is the first language of the majority of the Vietnamese population, as well as a first or second language for the country's ethnic minority groups.

In the Czech Republic, Vietnamese has been recognized as one of 14 minority languages, on the basis of communities that have either traditionally or on a long-term basis resided in the country.[11] This status grants Czech citizens from the Vietnamese community the right to use Vietnamese with public authorities and at courts anywhere in the country. Moreover, it also grants the usage of Vietnamese in public signage, election information, cultural institutions and access to legal information and assistance in municipalities where at least 10% of the population is of the minority group.

As a foreign language

Vietnamese is increasingly being taught in schools and institutions outside of Vietnam. In countries with strongly established Vietnamese-speaking communities such as the USA, France, Australia and Canada, Vietnamese language education largely plays a cultural role, linking descendants of Vietnamese immigrants to their ancestral culture. Meanwhile, in countries near Vietnam such as Cambodia, Laos, Thailand and South Korea, the increased role of Vietnamese in foreign language education is largely due to the growth and influence of Vietnam's economy.[12][13]

Since the 1980s, Vietnamese language schools (trường Việt ngữ) have been established for youth in many Vietnamese-speaking communities around the world, notably in the United States.[14][15]

Historic and increasingly strong trade and diplomatic relations with Vietnam, together with a growing interest among the French Vietnamese population (one of France's most established non-European ethnic groups) in their ancestral culture, have also led an increasing number of institutions in France, including universities, to offer formal courses in the language.[16]

Since the late 1980s, the Vietnamese German community has enlisted the support of city governments to bring Vietnamese into high school curricula, so that Vietnamese German students can learn and retain their mother tongue. Furthermore, a number of Germans have also been studying Vietnamese due to increased economic investment in Vietnam.[17][18]

Vietnamese is taught in schools in the form of dual immersion to a varying degree in Cambodia,[19] Laos,[20] and the United States.[21][22] Classes teach students subjects in Vietnamese and another language. Furthermore, in Thailand, Vietnamese is one of the most popular foreign languages in schools and colleges.[23]

Linguistic classification

Vietnamese was identified more than 150 years ago[24] as part of the Mon–Khmer branch of the Austroasiatic language family (a family that also includes Khmer, spoken in Cambodia, as well as various tribal and regional languages, such as the Munda and Khasi languages spoken in eastern India, and others in southern China). Later, Muong was found to be more closely related to Vietnamese than other Mon–Khmer languages, and a Viet–Muong subgrouping was established, also including Thavung, Chut, Cuoi, etc.[25] The term "Vietic" was proposed by Hayes (1992),[26] who proposed to redefine Viet–Muong as referring to a subbranch of Vietic containing only Vietnamese and Muong. The term "Vietic" is used, among others, by Gérard Diffloth, with a slightly different proposal on subclassification, within which the term "Viet–Muong" refers to a lower subgrouping (within an eastern Vietic branch) consisting of Vietnamese dialects, Muong dialects, and Nguồn (of Quảng Bình Province).[27]


The words in orange belong to the Vietnamese native lexical stock whereas the ones in green belong to the Sino-Vietnamese vocabulary.

As a result of 1000 years of Chinese rule, much of the Vietnamese lexicon relating to science and politics is derived from Chinese — see Sino-Vietnamese vocabulary. Some 30% to 60% of the lexical stock has naturalized word borrowings from Chinese, although many compound words are composed of native Vietnamese words combined with naturalized word borrowings (i.e. having Vietnamese pronunciation).[citation needed] As a result of French occupation, Vietnamese has since had many words borrowed from the French language, for example cà phê (from French café). Nowadays, many new words are being added to the language's lexicon due to heavy Western cultural influence; these are usually borrowed from English, for example TV (though usually seen in the written form as tivi). Sometimes these borrowings are calques literally translated into Vietnamese (for example, software is calqued into phần mềm, which literally means "soft part").


Main article: Vietnamese phonology


Like other Southeast Asian languages, Vietnamese has a comparatively large number of vowels.

Below is a vowel diagram of Hanoi Vietnamese (including centering diphthongs):

| | Front | Central | Back |
| Centering | ia~iê [iə̯] | ưa~ươ [ɨə̯] | ua~uô [uə̯] |
| Close | i [i] | ư [ɨ] | u [u] |
| Mid | ê [e] | ơ [əː], â [ə] | ô [o] |
| Open | e [ɛ] | a [aː], ă [a] | o [ɔ] |

Front, central, and low vowels (i, ê, e, ư, â, ơ, ă, a) are unrounded, whereas the back vowels (u, ô, o) are rounded. The vowels â [ə] and ă [a] are pronounced very short, much shorter than the other vowels. Thus, ơ and â are basically pronounced the same except that ơ [əː] is of normal length while â [ə] is short – the same applies to the vowels long a [aː] and short ă [a].[28]

The centering diphthongs are formed with only the three high vowels (i, ư, u). They are generally spelled as ia, ưa, ua when they end a word and are spelled iê, ươ, uô, respectively, when they are followed by a consonant.
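The spelling rule for the centering diphthongs can be sketched as a small lookup. This is an illustrative helper only: the names `CENTERING_SPELLINGS` and `spell_centering_diphthong` are assumptions made for this example, and the sketch covers just the general rule stated here, ignoring tone marks and any spelling variants.

```python
# Spellings of the three centering diphthongs, keyed by the high vowel
# they start with: (word-final spelling, spelling before a consonant).
CENTERING_SPELLINGS = {
    "i": ("ia", "iê"),
    "ư": ("ưa", "ươ"),
    "u": ("ua", "uô"),
}


def spell_centering_diphthong(high_vowel, followed_by_consonant):
    """Return the usual spelling of a centering diphthong (hypothetical helper)."""
    word_final, before_consonant = CENTERING_SPELLINGS[high_vowel]
    return before_consonant if followed_by_consonant else word_final
```

For instance, `spell_centering_diphthong("i", False)` gives the word-final spelling ia, while passing `True` gives the pre-consonantal spelling iê.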

In addition to single vowels (or monophthongs) and centering diphthongs, Vietnamese has closing diphthongs[29] and triphthongs. The closing diphthongs and triphthongs consist of a main vowel component followed by a shorter semivowel offglide /j/ or /w/.[30] There are restrictions on the high offglides: /j/ cannot occur after a front vowel (i, ê, e) nucleus and /w/ cannot occur after a back vowel (u, ô, o) nucleus.[31]
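The two offglide restrictions can be expressed as a simple check. The sketch below is a hedged illustration: `offglide_allowed` and the two vowel sets are names invented for this example, vowels are given in their orthographic form, and the function encodes only the two restrictions stated above, not every gap in the inventory of attested diphthongs.

```python
# Orthographic front and back vowel nuclei, as listed in the text.
FRONT_VOWELS = {"i", "ê", "e"}
BACK_VOWELS = {"u", "ô", "o"}


def offglide_allowed(nucleus, offglide):
    """Check the two stated offglide restrictions (hypothetical helper).

    nucleus  -- orthographic main vowel, e.g. "a" or "e"
    offglide -- "j" or "w"
    """
    if offglide == "j":
        # /j/ cannot follow a front vowel nucleus.
        return nucleus not in FRONT_VOWELS
    if offglide == "w":
        # /w/ cannot follow a back vowel nucleus.
        return nucleus not in BACK_VOWELS
    raise ValueError("offglide must be 'j' or 'w'")
```

Under these assumptions, `offglide_allowed("e", "w")` is true (as in eo), while `offglide_allowed("e", "j")` is false.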

| | Front (/w/ offglide) | Central (/w/ offglide) | Central (/j/ offglide) | Back (/j/ offglide) |
| Centering | iêu [iə̯w] | ươu [ɨə̯w] | ươi [ɨə̯j] | uôi [uə̯j] |
| Close | iu [iw] | ưu [ɨw] | ưi [ɨj] | ui [uj] |
| Mid | êu [ew] | âu [əw] | ơi [əːj], ây [əj] | ôi [oj] |
| Open | eo [ɛw] | ao [aːw], au [aw] | ai [aːj], ay [aj] | oi [ɔj] |

The correspondence between the orthography and pronunciation is complicated. For example, the offglide /j/ is usually written as i; however, it may also be represented with y. In addition, in the diphthongs [āj] and [āːj] the letters y and i also indicate the pronunciation of the main vowel: ay = ă + /j/, ai = a + /j/. Thus, tay "hand" is [tāj] while tai "ear" is [tāːj]. Similarly, u and o indicate different pronunciations of the main vowel: au = ă + /w/, ao = a + /w/. Thus, thau "brass" is [tʰāw] while thao "raw silk" is [tʰāːw].


The consonants that occur in Vietnamese are listed below in the Vietnamese orthography with the phonetic pronunciation to the right.

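| | | Labial | Alveolar | Retroflex | Palatal | Velar | Glottal |
| Nasal | | m [m] | n [n] | | nh [ɲ] | ng/ngh [ŋ] | |
| Stop | tenuis | p [p] | t [t] | tr [ʈʂ~ʈ] | ch [c~tɕ] | c/k/q [k~q] | |
| Stop | glottalized | b [ɓ] | đ [ɗ] | | | | |
| Stop | aspirated | | th [tʰ] | | | kh [x~kʰ] | |
| Fricative | voiceless | ph [f] | x [s] | s [ʂ] | | | h [h] |
| Fricative | voiced | v [v] | d [z~j] | r [ʐ~ɹ] | gi [z~j] | g/gh [ɣ] | |
| Approximant | | u/o [w] | l [l] | | y/i [j] | | |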

Some consonant sounds are written with only one letter (like "p"), others with a digraph (like "ph"), and others with more than one letter or digraph (the velar stop is written variously as "c", "k", or "q"). The velar stop /k/ may be pronounced as a uvular stop /q/ by some speakers next to back vowels, but this is not reflected in the spelling.

Not all dialects of Vietnamese have the same consonant in a given word (although all dialects use the same spelling in the written language). See the language variation section for further elaboration.

Syllable-final orthographic ch and nh in Hanoi Vietnamese have been analyzed in two different ways. One analysis treats final ch, nh as the phonemes /c/, /ɲ/, contrasting with syllable-final t, c /t/, /k/ and n, ng /n/, /ŋ/, and identifies final ch with the syllable-initial ch /c/. The other analysis treats final ch and nh as predictable allophonic variants of the velar phonemes /k/ and /ŋ/ that occur after the upper front vowels i /i/ and ê /e/. (See Vietnamese phonology: Analysis of final ch, nh for further details.)


Pitch contours and duration of the six Northern Vietnamese tones as spoken by a male speaker (not from Hanoi). Fundamental frequency is plotted over time. From Nguyễn & Edmondson (1998).

Vietnamese vowels are all pronounced with an inherent tone.[32] (More formally, diacritics indicate the tone of the entire word, centered on the main vowel or group of vowels, whereas accents qualify the vowel(s).) Tones differ in length (duration), pitch contour, pitch height, and phonation.

Tone is indicated by diacritics written above or below the vowel (most of the tone diacritics appear above the vowel; however, the nặng tone dot diacritic goes below the vowel).[33] The six tones in the northern varieties (including Hanoi), with their self-referential Vietnamese names, are:

| Name | Description | Diacritic | Example | Sample vowel |
| ngang 'level' | mid level | (no mark) | ma 'ghost' | a |
| huyền 'hanging' | low falling (often breathy) | ` (grave accent) | mà 'but' | à |
| sắc 'sharp' | high rising | ´ (acute accent) | má 'cheek, mother (southern)' | á |
| hỏi 'asking' | mid dipping-rising |  ̉ (hook) | mả 'tomb, grave' | ả |
| ngã 'tumbling' | high breaking-rising | ˜ (tilde) | mã 'horse (Sino-Vietnamese), code' | ã |
| nặng 'heavy' | low falling constricted (short length) |  ̣ (dot below) | mạ 'rice seedling' | ạ |
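Because each marked tone has its own diacritic, the tone of a written syllable can be read off mechanically. The following is a minimal Python sketch, assuming Unicode input and a single syllable per call; `TONE_MARKS` and `tone_of` are illustrative names, not part of any library. It relies on Unicode canonical decomposition (NFD), which separates a precomposed letter into its base letter and combining marks, so the vowel-quality marks (circumflex, breve, horn) can be ignored and only the tone mark is looked up.

```python
import unicodedata

# Combining marks that write the five marked tones (Unicode code points).
TONE_MARKS = {
    "\u0300": "huyền",  # combining grave accent
    "\u0301": "sắc",    # combining acute accent
    "\u0309": "hỏi",    # combining hook above
    "\u0303": "ngã",    # combining tilde
    "\u0323": "nặng",   # combining dot below
}


def tone_of(syllable):
    """Return the tone name of one written syllable (hypothetical helper)."""
    # NFD splits precomposed letters into base letter + combining marks,
    # so a tone mark shows up as a separate code point.
    for ch in unicodedata.normalize("NFD", syllable):
        if ch in TONE_MARKS:
            return TONE_MARKS[ch]
    return "ngang"  # no tone mark present
```

With this sketch, `tone_of("ma")` yields ngang and `tone_of("mả")` yields hỏi; a syllable such as Việt, which carries both a circumflex and a dot below, is reported as nặng.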

Other dialects of Vietnamese have fewer tones (typically only five).

In Vietnamese poetry, tones are classed into two groups:

| Tone group | Tones within tone group |
| bằng "level, flat" | ngang and huyền |
| trắc "oblique, sharp" | sắc, hỏi, ngã, and nặng |

Words with tones belonging to a particular tone group must occur in certain positions within the poetic verse.
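The two-way grouping lends itself to a simple lookup when checking the tone pattern of a verse line. The sketch below is only an illustration: the letters B (bằng) and T (trắc) and the names `TONE_GROUP` and `tone_group_pattern` are conventions assumed for this example, and the function expects the tone names to have been identified already.

```python
# Tone name -> group letter, following the grouping above.
# B = bằng ("level, flat"), T = trắc ("oblique, sharp").
TONE_GROUP = {
    "ngang": "B",
    "huyền": "B",
    "sắc": "T",
    "hỏi": "T",
    "ngã": "T",
    "nặng": "T",
}


def tone_group_pattern(tones):
    """Map a sequence of tone names to a string of B/T letters."""
    return "".join(TONE_GROUP[tone] for tone in tones)
```

A line whose syllables carry the tones ngang, ngang, sắc would come out as `"BBT"`, which can then be compared against the pattern a given verse form requires.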

Vietnamese Catholics practice a distinctive style of prayer recitation called đọc kinh, in which each tone is assigned a specific note or sequence of notes.

Language variation

There are various mutually intelligible regional varieties (or dialects), the main five being:[34]

| Dialect region | Localities | Names under French colonization |
| Northern Vietnamese | Hanoi, Haiphong, Red River Delta, Tây Bắc and Đông Bắc | Tonkinese |
| North-central (or Area IV) Vietnamese | Thanh Hoá, Nghệ An, Hà Tĩnh | Annamese |
| Mid-Central Vietnamese | Quảng Bình, Quảng Trị, Huế, Thừa Thiên | Annamese |
| South-Central Vietnamese (or Area V) | Đà Nẵng, Quảng Nam, Quảng Ngãi, Bình Định, Phú Yên, Nha Trang | Annamese |
| Southern Vietnamese | Bà Rịa–Vũng Tàu, Saigon, Lâm Đồng, Mekong Delta | Cochinchinese |

Vietnamese has traditionally been divided into three dialect regions: North, Central, and South. However, Michel Ferlus and Nguyễn Tài Cẩn offer evidence for considering a North-Central region separate from Central. The term Haut-Annam refers to dialects spoken from northern Nghệ An Province to southern (former) Thừa Thiên Province that preserve archaic features (like consonant clusters and undiphthongized vowels) that have been lost in other modern dialects.

These dialect regions differ mostly in their sound systems (see below), but also in vocabulary (including basic vocabulary, non-basic vocabulary, and grammatical words) and grammar.[35] The North-central and Central regional varieties, which have a significant amount of vocabulary differences, are generally less mutually intelligible to Northern and Southern speakers. There is less internal variation within the Southern region than the other regions due to its relatively late settlement by Vietnamese speakers (in around the end of the 15th century). The North-central region is particularly conservative; its pronunciation has diverged less from Vietnamese orthography than the other varieties, which tend to merge certain sounds. Along the coastal areas, regional variation has been neutralized to a certain extent, while more mountainous regions preserve more variation. As for sociolinguistic attitudes, the North-central varieties are often felt to be "peculiar" or "difficult to understand" by speakers of other dialects.

The large movements of people between North and South beginning in the mid-20th century and continuing to this day have resulted in a sizeable number of Southern residents speaking in the Northern accent/dialect and, to a greater extent, Northern residents speaking in the Southern accent/dialect. Following the Geneva Accords of 1954 that called for the temporary division of the country, about a million northerners (mainly from Hanoi, Haiphong and the surrounding Red River Delta areas) moved south (mainly to Saigon and heavily to Biên Hòa and Vũng Tàu, and the surrounding areas) as part of Operation Passage to Freedom. About 3% (~30,000) of that number of people made the move in the reverse direction.

Following the reunification of Vietnam in 1975–76, Northern and North-Central speakers from the densely populated Red River Delta and the traditionally poorer provinces of Nghệ An, Hà Tĩnh and Quảng Bình have continued to move south in search of better economic opportunities, beginning with the Hanoi government's "New Economic Zones program", which lasted from 1975 to 1985.[36] In the first half of the program (1975–80), 1.3 million people were sent to the New Economic Zones (NEZs), the majority of whom were relocated to previously uninhabited areas in the southern half of the country; 550,000 of them were Northerners.[36] The second half (1981–85) saw almost 1 million Northerners relocated to the NEZs.[36] In addition, government and military personnel, many from Northern and North-central Vietnam, are posted to various locations throughout the country, often away from their home regions. More recently, the growth of the free market system has resulted in business people and tourists traveling to distant parts of Vietnam. These movements have resulted in some small blending of the dialects but, more significantly, have made the Northern dialect more easily understood in the South and vice versa. Most Southerners, when singing modern or popular Vietnamese songs, do so in the Northern accent. This is true in Vietnam as well as in the overseas Vietnamese communities.

Regional variation in vocabulary[37]
Northern Northern Central Southern English gloss
này ni, này "this"
thế này như ri như vầy "thus, this way"
đấy nớ, đó "that"
thế, thế ấy rứa, rứa tê vậy, vậy đó "thus, so, that way"
kia, kìa , tề đó "that yonder"
đâu đâu "where"
nào mồ nào "which"
tại sao răng tại sao "why"
thế nào, như nào răng, làm răng làm sao "how"
tôi tui tui "I, me (polite)"
tao tau tao "I, me (arrogant, familiar)"
chúng tao choa, bọn choa tụi tao, tụi tui "we, us (but not you, colloquial, familiar)"
mày mi mày "you (thou) (arrogant, familiar)"
chúng mày bây, bọn bây tụi mầy "you guys, y'all (arrogant, familiar)"
chúng nó bọn nớ tụi nó "they/them (arrogant, familiar)"
ông ấy ông nớ ổng "he/him, that gentleman, sir"
bà ấy bà nớ bả "she/her, that lady, madam"
anh ấy anh nớ ảnh "he/him, that young man (of equal status)"
ruộng nương ruộng,rẫy "field"
bát đọi chén "rice bowl"
bẩn nhớp "dirty"
muôi môi "ladle"
đầu trốc đầu "head"
lười nhác làm biếng "lazy"
ô tô ô tô xe hơi "car"
thìa thìa muỗng "spoon"

The syllable-initial ch and tr digraphs are pronounced distinctly in North-central, Central, and Southern varieties, but are merged in Northern varieties (i.e. they are both pronounced the same way). The North-central varieties preserve three distinct pronunciations for d, gi, and r whereas the North has a three-way merger and the Central and South have a merger of d and gi while keeping r distinct. At the end of syllables, palatals ch and nh have merged with alveolars t and n, which, in turn, have also partially merged with velars c and ng in Central and Southern varieties.

Regional consonant correspondences
Syllable position Orthography Northern North-central Central Southern
syllable-initial x [s] [s]
s [ʂ] [ʂ, s]
ch [c, tɕ] [c]
tr [tʂ] [tʂ, c]
r [z] [ɹ, ʐ]
d [z] [j]
gi [ɟ]
v[38] [v] [j, v]
syllable-final c [k] [k] [k]
t [t]
after e
[k, t]
after i, ê
ch [k]
ng [ŋ] [ŋ] [ŋ]
n [n]
after e
[ŋ, n]
after i, ê
nh [ŋ]

In addition to the regional variation described above, there is also a merger of l and n in certain rural varieties:

l, n variation
| Orthography | "Mainstream" varieties | Rural varieties |
| n | [n] | [n] |
| l | [l] | [n] |

Variation between l and n can be found even in mainstream Vietnamese in certain words. For example, the numeral "five" appears as năm by itself and in compound numerals like năm mươi "fifty" but appears as lăm in mười lăm "fifteen". (See Vietnamese syntax: Cardinal numerals.) In some northern varieties, this numeral appears with an initial nh instead of l: hai mươi nhăm "twenty-five" vs. mainstream hai mươi lăm.[39]

The consonant clusters that were originally present in Middle Vietnamese (of the 17th century) have been lost in almost all modern Vietnamese varieties (but retained in other closely related Vietic languages). However, some speech communities have preserved some of these archaic clusters: "sky" is blời with a cluster in Hảo Nho (Yên Mô prefecture, Ninh Bình Province) but trời in Southern Vietnamese and giời in Hanoi Vietnamese (initial single consonants /ʈᶳ/, /z/, respectively).


Generally, the Northern varieties have six tones while those in other regions have five tones. The hỏi and ngã tones are distinct in North and some North-central varieties (although often with different pitch contours) but have merged in Central, Southern, and some North-central varieties (also with different pitch contours). Some North-central varieties (such as Hà Tĩnh Vietnamese) have a merger of the ngã and nặng tones while keeping the hỏi tone distinct. Still other North-central varieties have a three-way merger of hỏi, ngã, and nặng resulting in a four-tone system. In addition, there are several phonetic differences (mostly in pitch contour and phonation type) in the tones among dialects.

Regional tone correspondences
Tone Northern North-central Central Southern
 Vinh  Thanh
Hà Tĩnh
ngang ˧ 33 ˧˥ 35 ˧˥ 35 ˧˥ 35, ˧˥˧ 353 ˧˥ 35 ˧ 33
huyền ˨˩̤ 21̤ ˧ 33 ˧ 33 ˧ 33 ˧ 33 ˨˩ 21
sắc ˧˥ 35 ˩ 11 ˩ 11, ˩˧̰ 13̰ ˩˧̰ 13̰ ˩˧̰ 13̰ ˧˥ 35
hỏi ˧˩˧̰ 31̰3 ˧˩ 31 ˧˩ 31 ˧˩̰ʔ 31̰ʔ ˧˩˨ 312 ˨˩˦ 214
ngã ˧ʔ˥ 3ʔ5 ˩˧̰ 13̰ ˨̰ 22̰
nặng ˨˩̰ʔ 21̰ʔ ˨ 22 ˨̰ 22̰ ˨̰ 22̰ ˨˩˨ 212

The table above shows the pitch contour of each tone using Chao tone number notation (where 1 = lowest pitch, 5 = highest pitch); glottalization (creaky, stiff, harsh) is indicated with the ⟨◌̰⟩ symbol; breathy voice with ⟨◌̤⟩; glottal stop with ⟨ʔ⟩; sub-dialectal variants are separated with commas. (See also the tone section below.)


Vietnamese, like many languages in Southeast Asia, is an analytic (or isolating) language. Vietnamese does not use morphological marking of case, gender, number or tense (and, as a result, has no finite/nonfinite distinction).[40] Also like other languages in the region, Vietnamese syntax conforms to subject–verb–object word order, is head-initial (displaying modified-modifier ordering), and has a noun classifier system. Additionally, it is pro-drop, wh-in-situ, and allows verb serialization.

Some Vietnamese sentences with English word glosses and translations are provided below.

Minh là giáo viên.
Minh be teacher
"Minh is a teacher."

Trí 13 tuổi.
Trí 13 age
"Trí is 13 years old."

Tài đang nói.
Tài -ing talk
"Tài is talking."

Mai có vẻ là sinh viên hoặc học sinh.
Mai have the look be student (college) or student (under-college)
"Mai looks like a college or high school student."

Giáp rất cao.
Giáp very tall
"Giáp is very tall."

Người đó là anh của nó.
person that be brother of he
"That person is his brother."

Con chó này chẳng bao giờ sủa cả.
classifier dog this not ever bark at all
"This dog never barks at all."

Nó chỉ ăn cơm Việt Nam thôi.
he just eat rice (colloquial) Vietnam only
"He only eats Vietnamese rice." or "He only eats Vietnamese food." (especially spoken by the elders)

Cái thằng chồng em nó chẳng ra gì.
focus classifier husband I (as wife) he not turn out (any)thing
"That husband of mine, he is good for nothing."

Tôi thích con ngựa đen.
I (generic) like classifier horse black
"I like the black horse."

Tôi thích cái con ngựa đen đó.
I (generic) like focus classifier horse black that
"I like that black horse."

Writing systems

In the bilingual dictionary Nhật dụng thường đàm (1851), Chinese characters (chữ nho) are explained in chữ Nôm.
Jean-Louis Taberd's dictionary Dictionarium anamitico-latinum (1838) represents Vietnamese (then Annamese) words in the Latin alphabet and chữ Nôm.
A sign at the Hỏa Lò Prison museum in Hanoi lists rules for visitors in both Vietnamese and English.

Up to the late 19th century, two writing systems based on Chinese characters were used in Vietnam.[41] All formal writing, including government business, scholarship and formal literature, was done in Classical Chinese (chữ nho 𡨸儒 "scholar's characters").

Folk literature in Vietnamese was recorded using the chữ Nôm script, in which many Chinese characters were borrowed and many more modified and invented to represent native Vietnamese words. Created in the 13th century or earlier, the Nôm writing reached its zenith in the 18th century when many Vietnamese writers and poets composed their works in Nôm, most notably Nguyễn Du and Hồ Xuân Hương (dubbed "the Queen of Nôm poetry"). However it was only used for official purposes during the brief Hồ and Tây Sơn dynasties.

A Vietnamese Catholic, Nguyễn Trường Tộ, sent petitions to the Court which suggested a Chinese character-based syllabary which would be used for Vietnamese sounds; however, his petition failed. The French colonial administration sought to eliminate the Chinese writing system, Confucianism, and other Chinese influences from Vietnam by getting rid of Nôm.[42]

A romanization of Vietnamese was codified in the 17th century by the French Jesuit missionary Alexandre de Rhodes (1591–1660), based on works of earlier Portuguese missionaries Gaspar do Amaral and António Barbosa. This Vietnamese alphabet (chữ quốc ngữ or "national script") was gradually expanded from its initial domain in Christian writing to become more popular among the general public. However, the Romanized script did not come to predominate until the beginning of the 20th century, when education became widespread and a simpler writing system was found more expedient for teaching and communication with the general population. Under French colonial rule, French superseded Chinese in administration. Vietnamese written with the alphabet became required for all public documents in 1910 by issue of a decree by the French Résident Supérieur of the protectorate of Tonkin. By the middle of the 20th century virtually all writing was done in chữ quốc ngữ, which became the official script on independence. Chữ nho was still in use on early North Vietnamese and late French Indochinese banknotes issued after World War II[43] but fell out of official use shortly thereafter. Only a few scholars and some extremely elderly people are able to read chữ Nôm today. In China, members of the Jing minority still write in chữ Nôm.

Changes in the script were made by French scholars and administrators and by conferences held after independence during 1954–1974. The script now reflects a so-called Middle Vietnamese dialect that has vowels and final consonants most similar to northern dialects and initial consonants most similar to southern dialects (Nguyễn 1996). This Middle Vietnamese is presumably close to the Hanoi variety as spoken sometime after 1600 but before the present. (This is not unlike how English orthography is based on the Chancery Standard of Late Middle English, with many spellings retained even after the Great Vowel Shift.)

Computer support

The Unicode character set contains all Vietnamese characters and the Vietnamese currency symbol. On systems that do not support Unicode, many 8-bit Vietnamese code pages are available such as Vietnamese Standard Code for Information Interchange (VISCII) or Windows-1258. Where ASCII must be used, Vietnamese letters are often typed using the VIQR convention, though this is largely unnecessary with the increasing ubiquity of Unicode. There are many software tools that help type true Vietnamese text on US keyboards, such as WinVNKey and Unikey on Windows, or MacVNKey on Macintosh.
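One practical consequence of Unicode support is that the same Vietnamese letter can be stored either as a single precomposed code point or as a base letter followed by combining marks. The short Python sketch below, using the standard `unicodedata` module, illustrates the two forms for the word Việt; the helper name `code_points` is an assumption made for this example. It is a minimal illustration of Unicode normalization, not a description of how any particular input method or code page works.

```python
import unicodedata


def code_points(text, form):
    """List the code points of `text` after normalizing to `form`
    ("NFC" = precomposed, "NFD" = decomposed). Hypothetical helper."""
    normalized = unicodedata.normalize(form, text)
    return [f"U+{ord(ch):04X}" for ch in normalized]


word = "Vi\u1ec7t"  # "Việt" written with the precomposed letter U+1EC7

print(code_points(word, "NFC"))
# ['U+0056', 'U+0069', 'U+1EC7', 'U+0074']
print(code_points(word, "NFD"))
# ['U+0056', 'U+0069', 'U+0065', 'U+0323', 'U+0302', 'U+0074']
```

The two forms are canonically equivalent but compare unequal as raw strings, so text is usually normalized to one form before searching, sorting, or comparing.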


It seems likely that in the distant past, Vietnamese shared more characteristics common to other languages in the Austroasiatic family, such as an inflectional morphology and a richer set of consonant clusters, which have subsequently disappeared from the language. However, Vietnamese appears to have been heavily influenced by its location in the Mainland Southeast Asia linguistic area, with the result that it has acquired or converged toward characteristics such as isolating morphology and phonemically distinctive tones, through processes of tonogenesis. These characteristics have become part of many of the genetically unrelated languages of Southeast Asia; for example, Tsat (a member of the Malayo-Polynesian group within Austronesian), and Vietnamese each developed tones as a phonemic feature.

The ancestor of the Vietnamese language is usually believed to have been originally based in the area of the Red River in what is now northern Vietnam. However, Chamberlain argues that the Red River Delta region was originally Tai-speaking and became Vietnamese-speaking only between the seventh and ninth centuries AD, as a result of immigration from the south, i.e., modern central Vietnam, where the highly distinctive and conservative North-Central Vietnamese dialects are spoken today. On this view, the region of origin of Vietnamese (and the earlier Viet–Muong) was well south of the Red River.[44]

Like the ethnonym Lao, the name Yue/Việt originally referred to Tai–Kadai-speaking groups. In northern Vietnam, these later adopted Viet–Muong and further north Chinese varieties, where the designation Yue Chinese preserves the ethnonym. (Both in Vietnam and southern China, however, many Tai–Kadai languages remain in use.) This explains the fact that the same ethnonym Yue ~ Việt is associated with groups that speak Tai–Kadai, Austroasiatic and Chinese languages, which are typologically similar and share significant amounts of lexicon, but have different origins.

Distinctive tonal variations emerged during the subsequent expansion of the Vietnamese language and people into what is now central and southern Vietnam through conquest of the ancient nation of Champa and the Khmer people of the Mekong Delta in the vicinity of present-day Ho Chi Minh City, also known as Saigon.

Vietnamese was primarily influenced by Chinese, which came to predominate politically in the 2nd century BC. After Vietnam achieved independence in the 10th century, the ruling class adopted Classical Chinese as the medium of government, scholarship and literature. With the dominance of Chinese came radical importation of Chinese vocabulary and grammatical influence. Much of the Vietnamese lexicon in all realms consists of Sino-Vietnamese words.

When France invaded Vietnam in the late 19th century, French gradually replaced Chinese as the official language in education and government. Vietnamese adopted many French terms, such as đầm (dame, from madame), ga (train station, from gare), sơ mi (shirt, from chemise), and búp bê (doll, from poupée). In addition, many Sino-Vietnamese terms were devised for Western ideas imported through the French.

Henri Maspero described six periods of the Vietnamese language:[45][46]

  1. Pre-Vietnamese, also known as Proto-Viet–Muong or Proto-Vietnamuong, the ancestor of Vietnamese and the related Muong language.
  2. Proto-Vietnamese, the oldest reconstructable version of Vietnamese, dated to just before the entry of massive amounts of Sino-Vietnamese vocabulary into the language, c. 7th to 9th century AD? At this state, the language had three tones.
  3. Archaic Vietnamese, the state of the language upon adoption of the Sino-Vietnamese vocabulary, c. 10th century AD.
  4. Ancient Vietnamese, the language represented by Chữ Nôm (c. 15th century) and the Chinese–Vietnamese glossary Hua-yi Yi-yu (c. 16th century). By this point, a tone split had happened in the language, leading to six tones but a loss of contrastive voicing among consonants.
  5. Middle Vietnamese, the language of the Dictionarium Annamiticum Lusitanum et Latinum of the Jesuit missionary Alexandre de Rhodes (c. 17th century).
  6. Modern Vietnamese, from the 19th century.


The following diagram shows the phonology of Proto-Viet–Muong (the nearest ancestor of Vietnamese and the closely related Muong language), along with the outcomes in the modern language:[47][48][49]

Labial Interdental Dental/Alveolar Palatoalveolar Retroflex Palatal Velar Glottal
tenuis *p > b *t > đ *tʃ > x 1 *c > ch *k > k/c/q *ʔ > #
voiced *b > b *d > đ *ɟ > ch *ɡ > k/c/q
aspirated *pʰ > ph *tʰ > th *kʰ > kh
voiced glottalized *ɓ > m *ɗ > n *ʄ > nh 1
Nasal *m > m *n > n *ɲ > nh *ŋ > ng/ngh
Fricative voiceless *s > t *ɕ > th *h > h
voiced 2 *(β) > v 3 *(ð) > d *(ς) > r 4 *(ʝ) > gi *(ɣ) > g/gh
Approximant *w > v *l > l *r > r *j > d

^1 According to Ferlus, */tʃ/ and */ʄ/ are not accepted by all researchers. Ferlus 1992[47] had an additional phoneme */dʒ/, and had the preglottalized consonant */ʔj/ in place of the implosive consonant */ʄ/. Note that the latter two sounds are not all that different, both being voiced palatals and glottalic.

^2 The fricatives indicated above in parentheses developed as allophones of stop consonants occurring between vowels (i.e. when a minor syllable occurred). These fricatives were not present in Proto-Viet–Muong, as indicated by their absence in Muong, but were evidently present in the later Proto-Vietnamese stage. Subsequent loss of the minor-syllable prefixes phonemicized the fricatives. Ferlus 1992[47] proposes that originally there were both voiced and voiceless fricatives, corresponding to original voiced or voiceless stops, but Ferlus 2009[48] appears to have abandoned that hypothesis, suggesting that stops were softened and voiced at approximately the same time, according to the following pattern:

  • *p, *b > /β/
  • *t, *d > /ð/
  • *k, *ɡ > /ɣ/
  • *s, *ɕ > /ς/
  • *c, *ɟ, *tʃ > /ʝ/
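The softening pattern listed above can be read as a simple lookup from the original stop or sibilant to the resulting fricative. The following is a minimal sketch of that reading only: the names `LENITION` and `lenite` are hypothetical, and the `has_minor_syllable` flag is an assumption standing in for "the consonant was between vowels because a minor syllable preceded it".

```python
# Illustrative sketch only: encodes the five correspondences listed above
# (the pattern attributed to Ferlus 2009 in the text). All names are hypothetical.
LENITION = {
    "p": "β", "b": "β",
    "t": "ð", "d": "ð",
    "k": "ɣ", "ɡ": "ɣ",
    "s": "ς", "ɕ": "ς",
    "c": "ʝ", "ɟ": "ʝ", "tʃ": "ʝ",
}


def lenite(initial: str, has_minor_syllable: bool) -> str:
    """Return the main-syllable initial after intervocalic softening.

    Softening applies only when a minor syllable preceded the main syllable;
    consonants outside the listed pattern are returned unchanged.
    """
    if has_minor_syllable and initial in LENITION:
        return LENITION[initial]
    return initial
```

For example, `lenite("p", True)` gives `β`, while `lenite("p", False)` leaves the stop unchanged.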

^3 In Middle Vietnamese, the outcome of these sounds was written with a hooked b (ꞗ), representing a /β/ that was still distinct from v (then pronounced /w/). See the section on Middle Vietnamese.

^4 It is unclear what this sound was. According to Ferlus 1992,[47] in the Archaic Vietnamese period (c. 10th century AD, when Sino-Vietnamese vocabulary was borrowed) it was *ɽ, distinct at that time from *r.

The following initial clusters occurred, with outcomes indicated:

  • *pr, *br, *tr, *dr, *kr, *gr > /kʰr/ > /ks/ > s
  • *pl, *bl > Middle Vietnamese (MV) bl > Northern gi, Southern tr
  • *kl, *gl > MV tl > tr
  • *ml > MV ml > mnh > nh
  • *kj > gi
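The cluster outcomes above amount to a small mapping from a Proto-Viet–Muong initial cluster to a modern spelling, with one dialect-dependent case. A minimal sketch, assuming plain ASCII cluster names as written in the list; the function name `modern_spelling` and the dialect labels `"northern"` and `"southern"` are hypothetical conventions for this example.

```python
# Illustrative sketch of the cluster outcomes listed above; names are hypothetical.
R_CLUSTERS = {"pr", "br", "tr", "dr", "kr", "gr"}  # all end up as modern s


def modern_spelling(cluster: str, dialect: str = "northern") -> str:
    """Map a reconstructed initial cluster to its modern spelling."""
    if cluster in R_CLUSTERS:
        return "s"
    if cluster in {"pl", "bl"}:
        # Middle Vietnamese bl split by dialect.
        return "gi" if dialect == "northern" else "tr"
    if cluster in {"kl", "gl"}:
        return "tr"  # via Middle Vietnamese tl
    if cluster == "ml":
        return "nh"  # via Middle Vietnamese ml > mnh
    if cluster == "kj":
        return "gi"
    raise ValueError(f"cluster not covered by the list: {cluster!r}")
```

Note that modern tr has two sources in this sketch: *kl/*gl in all dialects, and *pl/*bl in the south.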

A large number of words were also borrowed from Middle Chinese, forming part of the Sino-Vietnamese vocabulary. These loanwords first introduced the retroflex sounds /ʂ/ and /ʈ/ (modern s, tr) into the language.

Origin of the tones

Proto-Viet–Muong had no tones to speak of. The tones later developed in some of the daughter languages from distinctions in the initial and final consonants. Vietnamese tones developed as follows:

Register | Initial consonant | Smooth ending | Glottal ending | Fricative ending
High (first) register | Voiceless | A1 ngang "level" | B1 sắc "sharp" | C1 hỏi "asking"
Low (second) register | Voiced | A2 huyền "hanging" | B2 nặng "heavy" | C2 ngã "tumbling"

Glottal-ending syllables ended with a glottal stop /ʔ/, while fricative-ending syllables ended with /s/ or /h/. Both types of syllables could co-occur with a resonant (e.g. /m/ or /n/).

At some point, a tone split occurred, as in many other Southeast Asian languages. An allophonic distinction first developed, whereby the tones in syllables with voiced initials were pronounced differently from those in syllables with voiceless initials. (Roughly speaking, the voiced allotones were pronounced with additional breathy or creaky voice and with lowered pitch. The voice-quality difference predominates in today's northern varieties, e.g. in Hanoi, while the pitch difference predominates in the southern varieties, e.g. in Ho Chi Minh City.) Subsequently, the plain voiced stops became voiceless, and the allotones became new phonemic tones. The implosive stops were unaffected, and in fact developed tonally as if they were voiceless. (This behavior is common to all East Asian languages with implosive stops.)
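The development described above can be summarized as two independent choices: the ending type selects the tone class (A, B or C), and the voicing of the initial at the time of the split selects the register (1 or 2), with implosives patterning as voiceless. The following sketch illustrates that reading only; the function and parameter names are hypothetical, and the category labels follow the table in this section.

```python
# Illustrative sketch of the tone development described above.
# Letter = ending type, digit = register; all names here are hypothetical.
ENDING_TO_CLASS = {"smooth": "A", "glottal": "B", "fricative": "C"}

TONE_NAMES = {
    "A1": "ngang", "B1": "sắc", "C1": "hỏi",
    "A2": "huyền", "B2": "nặng", "C2": "ngã",
}


def tone_category(initial_voiced: bool, ending: str, implosive: bool = False) -> str:
    """Return the tone category (e.g. "B2") for a syllable at the tone split.

    Implosive initials are voiced but developed tonally like voiceless
    initials, so they are assigned the first (high) register.
    """
    register = "2" if (initial_voiced and not implosive) else "1"
    return ENDING_TO_CLASS[ending] + register
```

For instance, a plain voiced initial with a glottal ending yields category B2, which `TONE_NAMES` maps to nặng, whereas an implosive initial with the same ending yields B1 (sắc).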

As noted above, Proto-Viet–Muong had sesquisyllabic words with an initial minor syllable (in addition to, and independent of, initial clusters in the main syllable). When a minor syllable occurred, the main syllable's initial consonant was intervocalic and as a result suffered lenition, becoming a voiced fricative. The minor syllables were eventually lost, but not until the tone split had occurred. As a result, words in modern Vietnamese with voiced fricatives occur in all six tones, and the tonal register reflects the voicing of the minor-syllable prefix and not the voicing of the main-syllable stop in Proto-Viet–Muong that produced the fricative. For similar reasons, words beginning with /l/ and /ŋ/ occur in both registers. (Thompson 1976[49] reconstructed voiceless resonants to account for outcomes where resonants occur with a first-register tone, but this is no longer considered necessary, at least by Ferlus.)

Middle Vietnamese

The writing system used for Vietnamese is based closely on the system developed by Alexandre de Rhodes for his 1651 Dictionarium Annamiticum Lusitanum et Latinum. It reflects the pronunciation of the Vietnamese of Hanoi at that time, a stage commonly termed Middle Vietnamese (tiếng Việt trung đại). The pronunciation of the "rime" of the syllable, i.e. all parts other than the initial consonant (optional /w/ glide, vowel nucleus, tone and final consonant), appears nearly identical between Middle Vietnamese and modern Hanoi pronunciation. On the other hand, the Middle Vietnamese pronunciation of the initial consonant differs greatly from all modern dialects, and in fact is significantly closer to the modern Saigon dialect than the modern Hanoi dialect.

The following table shows the orthography and pronunciation of Middle Vietnamese:

Places of articulation (columns): Labial, Dental/Alveolar, Retroflex, Palatal, Velar, Glottal
Stop, tenuis: p [p] ^1, t [t], tr [ʈ], ch [c], c/k [k]
Stop, aspirated: ph [pʰ], th [tʰ], kh [kʰ]
Stop, voiced glottalized: b [ɓ], đ [ɗ]
Fricative, voiceless: s/ſ [ʂ], x [ɕ], h [h]
Fricative, voiced: ꞗ [β] ^2, d [ð], g/gi [ʝ], g/gh [ɣ]
Nasal: m [m], n [n], nh [ɲ], ng/ngh [ŋ]
Approximant: v/u/o [w], l [l], r [ɹ], y/i/ĕ [j] ^3
Figure: The first page of the ꞗ section in Alexandre de Rhodes's Dictionarium Annamiticum Lusitanum et Latinum (Vietnamese–Portuguese–Latin dictionary).

^1 [p] occurs only at the end of a syllable.
^2 This symbol, "Latin small letter B with flourish", looks like: ꞗ. It has a rounded hook that starts halfway up the left side (where the top of the curved part of the b meets the vertical, straight part) and curves about 180 degrees counterclockwise, ending below the bottom-left corner.
^3 [j] does not occur at the beginning of a syllable, but can occur at the end of a syllable, where it is notated i or y (with the difference between the two often indicating differences in the quality or length of the preceding vowel), and after /ð/ and /β/, where it is notated ĕ. This ĕ, and the /j/ it notated, have disappeared from the modern language.

Note that b [ɓ] and p [p] never contrast in any position, suggesting that they are allophones; likewise for gi [ʝ] and y/i/ĕ [j].

The language also had three clusters at the beginning of syllables, which have since disappeared:

  • tl /tl/ > modern tr
  • bl /ɓl/ > modern gi (Northern), tr (Southern)
  • ml /ml/ > mnh /mɲ/ > modern nh

Most of the unusual correspondences between spelling and modern pronunciation are explained by Middle Vietnamese. Note in particular:

  • De Rhodes's system has two different b letters: a regular b and a "hooked" b, in which the upper section of the curved part of the b extends leftward past the vertical bar and curls down again in a semicircle. The hooked b apparently represented a voiced bilabial fricative /β/. Within a century or so, /β/ and /w/ had merged as /v/, spelled v.
  • De Rhodes's system has a second medial glide /j/ that is written ĕ and appears in some words with initial d and hooked b. This glide later disappeared.
  • đ /ɗ/ was (and still is) alveolar, whereas d /ð/ was dental. The choice of symbols was based on the dental rather than alveolar nature of /d/ and its allophone [ð] in Spanish and other Romance languages. The inconsistency with the symbols assigned to /ɓ/ vs. /β/ was based on the lack of any such place distinction between the two, with the result that the stop consonant /ɓ/ appeared more "normal" than the fricative /β/. In both cases, the implosive nature of the stops does not appear to have had any role in the choice of symbol.
  • x was the alveolo-palatal fricative /ɕ/ rather than the dental /s/ of the modern language. In 17th-century Portuguese, the common language of the Jesuits, s was the apico-alveolar sibilant /s̺/ (as it still is in much of Spain and some parts of Portugal), while x was a palatoalveolar /ʃ/. The similarity of apico-alveolar /s̺/ to the Vietnamese retroflex /ʂ/ led to the assignment of s and x as above.
Figure: De Rhodes's entry for dĕóu᷄ shows distinct breves, acutes and apices.

De Rhodes's orthography also made use of an apex diacritic to indicate a final labial-velar nasal /ŋ͡m/, an allophone of /ŋ/ that is peculiar to the Hanoi dialect to the present day. This diacritic is often mistaken for a tilde in modern reproductions of early Vietnamese writing.


The Tale of Kieu is an epic narrative poem by the celebrated poet Nguyễn Du (阮攸), which is often considered the most significant work of Vietnamese literature. It was originally written in Chữ Nôm (titled Đoạn Trường Tân Thanh 斷腸新聲) and is widely taught in Vietnam today.

