Pinyin

From Wikipedia, the free encyclopedia

Jump to: navigation, search
Hanyu Pinyin
Traditional Chinese:漢語拼音
Simplified Chinese:汉语拼音
Scheme of the Chinese Phonetic Alphabet
Traditional Chinese:漢語拼音方案
Simplified Chinese:汉语拼音方案
Chinese romanization
Mandarin for Standard Mandarin
Hanyu Pinyin (ISO standard)
EFEO
Gwoyeu Romatzyh
Spelling conventions
Latinxua Sin Wenz
Mandarin Phonetic Symbols II
Chinese Postal Map Romanization
Tongyong Pinyin
Wade-Giles
Yale
Legge romanization
Simplified Wade
Comparison chart
Cantonese for Standard Cantonese
Guangdong Romanization
Hong Kong Government
Jyutping
Meyer-Wempe
Sidney Lau
S. L. Wong (phonetic symbols)
S. L. Wong (romanisation)
Standard Cantonese Pinyin
Standard Romanization
Yale
Barnett-Chao
Wu
Long-short (romanization)
Min Nan
for Taiwanese, Amoy, and related
Pe̍h-oē-jī
For Hainanese
Hainanhua Pinyin Fang'an
For Teochew
Peng'im
Min Dong for Fuzhou dialect
Foochow Romanized
Hakka for Moiyan dialect
Kejiahua Pinyin Fang'an
For Siyen dialect
Phak-fa-s
See also:
General Chinese (Chao Yuenren)
Cyrillization
Xiao'erjing
Zhuyin
Romanisation in Singapore
Romanisation in the ROC (Taiwan)
This box: view talk edit

Pinyin, more formally Hanyu Pinyin, is the most common Standard Mandarin romanization system in use. Hanyu means the Chinese language[1], and pinyin means "spell sound", or the spelling of the sound.[1] Developed by a government committee in the People's Republic of China in the 1950s, Pinyin was adopted as the international standard in 1979. It is now used to teach Chinese schoolchildren and foreign learners the standard pronunciation of Mandarin Chinese, to spell Chinese names in foreign publications and to enter Chinese characters on computers.


Contents

[edit] History

In 1954, the Ministry of Education of the PRC assigned a Committee (Committee for the Reform of the Chinese Written Language) to reform the written language. This committee developed Hanyu Pinyin based upon existing systems of that time (Gwoyeu Romatzyh of 1928, Latinxua Sin Wenz of 1931, it uses the diacritic markings from Zhuyin)[2]. The main force behind pinyin was Zhou Youguang (born 1905, turning 103 in 2008 in good health).[3] [4] Zhou Youguang was working in a New York bank when he decided to return to China to help rebuild the country after the war. He became an economics professor in Shanghai. The government assigned him to help the development of a new romanisation system. The switch to language and writing largely saved him from the wrath of the Cultural Revolution of Mao Zedong.

A first draft was published on February 12, 1956. The first edition of Hanyu Pinyin was approved and adopted at the Fifth Session of the 1st National People's Congress on February 11, 1958. It was then introduced to primary schools as a way to teach Standard Mandarin pronunciation, and used to improve the literacy rate among adults. In 2001, the Chinese Government issued the National Common Language Law, providing a legal basis for applying pinyin.[5]

The correspondence between letter and sound does not follow any single other language, but does not depart any more from the norms of the Latin alphabet than many European languages. For example, the aspiration distinction between b, d, g and p, t, k is similar to that of English, but not to that of French. Z and c also have that distinction; however, they are pronounced as [ts], as in languages such as German, Italian, and Polish, which do not have that distinction. From s, z, c come the digraphs sh, zh, ch by analogy with English sh, ch; although this introduces the novel combination zh, it is internally consistent in how the two series are related, and represents the fact that many Chinese pronounce sh, zh, ch as s, z, c. In the x, j, q series, x rather resembles its pronunciation in Catalan, though q is more novel. Pinyin vowels are pronounced similarly to vowels in Romance languages. More information on the pronunciation of all pinyin letters in terms of English approximations is given further below.

The pronunciation of Chinese is generally given in terms of initials and finals, which represent the segmental phonemic portion of the language. Initials are initial consonants, while finals are all possible combinations of medials (semivowels coming before the vowel), the nucleus vowel, and coda (final vowel or consonant).

For a complete table of all pinyin syllables, see pinyin table.

[edit] Usage

Hanyu Pinyin superseded older romanization systems such as Wade-Giles (1859; modified 1892) and Chinese Postal Map Romanization, and replaced Zhuyin as the method of Chinese phonetic instruction in mainland China. Hanyu Pinyin was adopted in 1979 by the International Organization for Standardization (ISO) as the standard romanization for modern Chinese (ISO-7098:1991). It has also been accepted by the Government of Singapore, the Library of Congress, the American Library Association, and many other international institutions. It has also become a useful tool for entering Chinese language text into computers.

The spelling of Chinese geographical or personal names in pinyin has become a standard or most common way to transcribe them in English.

Chinese speaking Standard Mandarin at home use pinyin to help children associate characters with spoken words which they already know; however, for the many Chinese who do not use Standard Mandarin at home, pinyin is used to teach them the Standard Mandarin pronunciation of words when they learn them in elementary school.

Pinyin has become a tool for many foreigners to learn the Mandarin pronunciation, it is used to explain the grammar and spoken Mandarin together with hanzi. Like zhuyin fuhao it is used as a phonetic guide in books for children but also dialect speakers and foreign learners. Books containing both Chinese characters and pinyin are popular with foreign learners of Chinese, pinyin's role in teaching pronunciation to foreigners and children is similar to furigana-based books (with hiragana letters written above or next to kanji) in Japanese or fully vocalised texts in Arabic ("vocalised Arabic") but as mentioned above, pinyin is also the main romanisation method.

[edit] Initials and Finals

Unlike in European languages, initials (simplified Chinese: 声母; traditional Chinese: 聲母) and finals (simplified Chinese: 韵母; traditional Chinese: 韻母, or rhyming sounds) - and not consonants and vowels - are the fundamental elements in Pinyin (and most other phonetic systems used to describe the Han language). Nearly each Chinese syllable can be spelled with exactly one initial followed by one final, except in the special syllable 'er' and when a trailing 'r' is considered part of a syllable (see below). The latter case, though a common practice in some sub-dialects, is rarely used in official publications.[6]

Even though most initials contain a consonant, finals are not simple vowels,[7] especially in compound finals (simplified Chinese: 复韵母; traditional Chinese: 復韻母), i.e., when one "final" is placed in front of another one. For example, [i] and [u] are pronounced with such tight openings that some native Chinese speakers (especially when singing or on stage) pronounce yī (Chinese: , clothes, officially pronounced as /i/) as /ji/, wéi (simplified Chinese: ; traditional Chinese: , to enclose, officially as /uei/) as /wei/ or /wuei/. The concepts of consonants and vowels are not incorporated in Pinyin or its predecessors, despite the fact that the Roman alphabets are used in Pinyin. In the entire Pinyin system, there is not a list of consonants, nor a list of vowels.[8]

[edit] Initials

In each cell below, the first line indicates the IPA, the second indicates pinyin.
BilabialLabio-
dental
Labial-
velar
AlveolarRetroflexAlveolo-
palatal
PalatalVelar
Plosive[p]
b
[pʰ]
p
[t]
d
[tʰ]
t
[k]
g
[kʰ]
k
Nasal[m]
m
[n]
n
Lateral approximant[l]
l
Affricate[ts]
z
[tsʰ]
c
[ʈʂ]
zh
[ʈʂʰ]
ch
[tɕ]
j
[tɕʰ]
q
Fricative[f]
f
[s]
s
[ʂ]
sh
[ʐ] 1
r
[ɕ]
x
[x]
h
Approximant[w] 2
w
[ɻ] 1
r
[j] 3
y

1 /ɻ/ may phonetically be /ʐ/ (a voiced retroflex fricative). This pronunciation varies among different speakers, and is not two different phonemes.
2 the letter "w" may be considered as an initial or a final, and may be pronounced as /w/ or /u/[clarify]
3 the letter "y" may be considered as an initial or a final, and may be pronounced as /j/ or /i/[clarify]

Note: Letters "y" and "w" are not included in table of initials in the official Pinyin system.[clarify] They are used as spelling aids in place of "i", "u" and "ü" when there are no other initials, and carry the pronunciations of the corresponding finals.[clarify] Consonants /j/ and /w/ are not officially used for these letters;[clarify] they are absent from standard Chinese.[citation needed]

Conventional order (excluding w and y), derived from the Zhuyin system, is:
b p m fd t n lg k hj q xzh ch sh rz c s

[edit] Finals

In each cell below, the first line indicates IPA, the second indicates pinyin for a standalone (no-initial) form, and the third indicates pinyin for a combination with an initial. Other than finals modified by an -r, which are omitted, the following is an exhaustive table of all possible finals. 1

The only syllable-final consonants in standard Mandarin are -n and -ng, and -r which is attached as a grammatical suffix. Chinese syllables ending with any other consonant is either from a non-Mandarin language (southern Chinese languages such as Cantonese, or minority languages of China), or it indicates the use of a non-pinyin Romanization system (where final consonants may be used to indicate tones).
NucleusCodaMedial
Øiuy
aØ[a]
a
-a
[ia]
ya
-ia
[ua]
wa
-ua
i[aɪ]
ai
-ai
[uaɪ]
wai
-uai
u[ɑʊ]
ao
-ao
[iɑʊ]
yao
-iao
n[an]
an
-an
[iɛn]
yan
-ian
[uan]
wan
-uan
[yɛn]
yuan
-üan 2
ŋ[ɑŋ]
ang
-ang
[iɑŋ]
yang
-iang
[uɑŋ]
wang
-uang
əØ[ɤ]
e
-e
[iɛ]
ye
-ie
[uɔ]
wo
-uo/-o 3
[yœ]
yue
-üe 2
i[eɪ]
ei
-ei
[ueɪ]
wei
-ui
u[oʊ]
ou
-ou
[ioʊ]
you
-iu
n[ən]
en
-en
[in]
yin
-in
[uən]
wen
-un
[yn]
yun
-ün 2
ŋ[əŋ]
eng
-eng
[iŋ]
ying
-ing
[uəŋ], [ʊŋ] 4
weng
-ong
[yʊŋ]
yong
-iong
Ø[z̩], [ʐ̩̩]

-i
[i]
yi
-i
[u]
wu
-u
[y]
yu
2

1 /ər/ (而, 二, etc.) is written as er. For other finals formed by the suffix -r, pinyin does not use special orthography; one simply appends -r to the final that it is added to, without regard for any sound changes that may take place along the way. For information on sound changes related to final -r, please see Standard Mandarin.
2 "ü" is written as "u" after j, q, x, or y.
3 "uo" is written as "o" after b, p, m, or f.
4 It is pronounced [ʊŋ] when it follows an initial, and pinyin reflects this difference.

In addition, ê [ɛ] is used to represent certain interjections.

[edit] Rules given in terms of English pronunciation

All rules given here in terms of English pronunciation are approximate, as several of these sounds do not correspond directly to sounds in English.

[edit] Pronunciation of initials

PinyinIPAExplanation
b[p]unaspirated p, as in spit
p[pʰ]aspirated p, as in pit
m[m]as in English mum
f[f]as in English fun
d[t]unaspirated t, as in stop
t[tʰ]aspirated t, as in top
n[n]as in English nit
l[l]as in English love
g[k]unaspirated k, as in skill
k[kʰ]aspirated k, as in kill
h[x]like the English h if followed by "a"; otherwise it is pronounced more roughly (not unlike the Scots ch or Russian х (Cyrillic "kha")).
j[tɕ]like q, but unaspirated. (To get this sound, first take the sound halfway between joke and check, and then slowly pass it backwards along the tongue until it is entirely clear of the tongue tip.) While this exact sound is not used in English, the closest match is the j in ajar, not the s in Asia.
q[tɕʰ]like church; pass it backwards along the tongue until it is free of the tongue tip. In Mandarin pronunciation should not be confused with "ch" but in English both "ch" and "q" are often pronounced the same.
x[ɕ]like sh or palatalised s, but take the sound and pass it backwards along the tongue until it is clear of the tongue tip; very similar to the final sound in German ich, and to huge or Hugh in some English dialects. The combination "xi" is very similar both to the Russian си or the Japanese .
zh[ʈʂ]ch with no aspiration (a sound between joke and church, tongue tip curled more upwards); very similar to merger in American English, but not voiced
ch[ʈʂʰ]as in chin, but with the tongue curled upwards; very similar to nurture or tree in American English, but strongly aspirated
sh[ʂ]as in shinbone, but with the tongue curled upwards; very similar to undershirt in American English
r[ʐ] or [ɻ]Similar to the English r in rank, but with the lips spread and with the tongue curled upwards. The initial "r" can also be described as French "j" [ʒ] or a cross between English "r" and French "j". In Cyrillised Chinese the same sound is always rendered with letter "ж" (French "j").
z[ts]unaspirated c (something between suds and cats)
c[tsʰ]like ts in bats, however more aspirated
s[s]as in sun
w[u]Note that "w" is not pronounced as a "w", rather it is pronounced as the "u" (pinyin-pronounced) that "w" replaced. i.e. "w" replaces "u" in words starting with "u" (no initial / final only). This use of "w" is a spelling convention, to disambiguate syllable breaks when such "u-" words follow other words. As a spelling convention, it does not actually alter the pre-replacement pronunciation.
y[i]case 1) Note that "y" is not pronounced as a "y", rather it is pronounced as the "i" (pinyin-pronounced) that it replaced. i.e. "y" replaces "i" in words starting with "i" (no initial / final only). This use of "y" is a spelling convention, to disambiguate syllable breaks when such "i-" words follow other words. As a spelling convention, it does not actually alter the original pronunciation.

case 2) Similarly, note that "yu" is not pronounced as a "yu", rather it is pronounced as the (pinyin-pronounced) "ü" that it replaced. i.e. "yu" replaces "ü" in words starting with "ü" (no initial / final only). This use of "yu" is a spelling convention, to disambiguate syllable breaks when such "ü-" words follow other words and due to historical type and font set limitations. As a spelling convention, it does not actually alter the original pronunciation.

[edit] Pronunciation of finals

The following is an exhaustive list of all finals in Standard Mandarin. Those ending with a final -r are listed at the end.

To find a given final:

  1. Remove the initial consonant. For zh-, ch-, sh-, both letters should be removed, they are single consonants spelt with two letters.
  2. Although y- and w- are consonants nevertheless they may be considered as part of finals and do not remove those.
    1. Syllables beginning with y- and w- may be considered as standalone forms of finals "i, u, ü" and finals beginning with "i-, u-, ü-".
  3. If a syllable begins with j-, q-, x-, or y-, and the final is -u or starts with -u-, then change -u or -u- to -ü or -ü-.
PinyinIPAFinal-only formExplanation
-i[z̩], [ʐ̩]n/aDisplayed as an "i" after: "zh", "ch", "sh", "r", "z", "c" or "s". After "z", "c" or "s", sounds like a prolonged "zzz" sound. After "zh", "ch", "sh" or "r", sounds like a prolonged American "r" sound. In some dialects, pronounced slightly more open, allowing a clear-sounding vowel to pass through (a high, central, unrounded vowel, something like IPA /ɨ/; say 'zzz' and lower the tongue just enough for the buzzing to go away).
a[ɑ]aas in "father"
o[uɔ]ostarts with English "oo" and ends with a plain continental "o".
e[ɤ], [ə]ea back, unrounded vowel, which can be formed by first pronouncing a plain continental "o" (AuE and NZE law) and then spreading the lips without changing the position of the tongue. That same sound is also similar to English "duh", but not as open. Many unstressed syllables in Chinese use the schwa (idea), and this is also written as e.
ê[ɛ](n/a)as in "bet". Only used in certain interjections.
ai[aɪ]ailike English "eye", but a bit lighter
ei[ei]eias in "hey"
ao[ɑʊ]aoapproximately as in "cow"; the a is much more audible than the o
ou[ou̯]ouas in "so"
an[an]anstarts with plain continental "a" (AuE and NZE bud) and ends with "n"
en[ən]enas in "taken"
ang[ɑŋ]angas in German Angst, including the English loan word angst (starts with the vowel sound in father and ends in the velar nasal; like song in American English)
eng[ɤŋ]englike e above but with ng added to it at the back
ong[ʊŋ]n/astarts with the vowel sound in b'ook and ends with the velar nasal sound in sing
er[ɑɻ]erlike English "are" (exists only on its own, or as the last part of a final in combination with others - see bottom of this list)
Finals beginning with i- (y-)
i[i]yilike English "ee", except when preceded by "c", "ch", "r", "s", "sh", "z" or "zh"
ia[iɑ]yaas i + a; like English "yard"
io[iɔ]yoas i + plain continental "o". Only used in certain interjections.
ie[iɛ]yeas i + ê; but is very short; e (pronounced like ê) is pronounced longer and carries the main stress (similar to the initial sound ye in yet)
iao[iɑʊ]yaoas i + ao
iu[iou̯]youas i + ou
ian[iɛn]yanas i + ê + n; like English yen
in[in]yinas i + n
iang[iɑŋ]yangas i + ang
ing[iŋ]yingas i but with ng added to it at the back
iong[iʊŋ]yongas i + ong
Finals beginning with u- (w-)
u[u]wulike English "oo"
ua[ua]waas u + a
uo[uɔ]woas u + o; the o is pronounced shorter and lighter than in the o final
uai[uaɪ]waias u + ai
ui[ueɪ]weias u + ei; here, the i is pronounced like ei
uan[uan]wanas u + an
un[uən]wenas u + en; like the on in the English won
uang[uɑŋ]wangas u + ang; like the ang in English angst or anger
n/a[uɤŋ]wengas u + eng
Finals beginning with ü- (yu-)
ü[y]yuas in German "üben" or French "lune" (To get this sound, say "ee" with rounded lips)
ue[yɛ]yueas ü + ê; the ü is short and light
üan[yɛn]yuanas ü + ê+ n;
ün[yn]yunas ü + n;
Finals that are a combination of finals above + r final
ar[ɑɻ]like ar in American English "art"
er[ɤɻ]as e + r; not to be confused with er final on its own- this form only exists with an initial character before it
or[uɔɻ]as o + r
air[ɑɻ]as ar
eir[ɝ]as schwa + r
aor[ɑʊɻ]as ao + r
our[ou̯ɻ]as ou + r
anr[ɑɻ]as ar
enr[əɻ]as schwa + r
angr[ɑ̃ɻ]as ang + r, with ng removed and the vowel nasalized
engr[ɤ̃ɻ]as eng + r, with ng removed and the vowel nasalized
ongr[ʊ̃ɻ]as ong + r, with ng removed and the vowel nasalized
ir[iəɻ]as i + schwa + r
ir[əɻ]after "c", "ch", "r", "s", "sh", "z", "zh": as schwa + r.
iar[iɑɻ]as i + ar
ier[iɛɻ]as ie + r
iaor[iɑʊɻ]as iao + r
iur[iou̯ɻ]as iou + r
ianr[iɑɻ]as i + ar
inr[iəɻ]as ir
iangr[iɑ̃ɻ]as i + angr
ingr[iɤ̃ɻ]as i + engr
iongr[yʊ̃ɻ]as i + ongr
ur[uɻ]as u + r
uar[uɑɻ]as u + ar
uor[uɔɻ]as uo + r
uair[uɑɻ]as u + ar
uir[uɝ]as u + schwa + r
uanr[uɑɻ]as u + ar
unr[uəɻ]as u + schwa + r
uangr[uɑ̃ɻ]as u + angr
ür[yəɻ]as ü + schwa + r
üer[yɛɻ]as ue + r
üanr[yɑɻ]as ü + ar
ünr[yəɻ]as ü + schwa + r

[edit] Orthography

[edit] Letters

Pinyin differs from other romanizations in several aspects, such as the following:

  • Syllables starting with u are written as w in place of u (e.g. ueng is written as weng). Standalone u is written as wu.
  • Syllables starting with i are written as y in place of i (e.g. iou is written as you). Standalone i is written as yi.
  • Syllables starting with ü are written as yu in place of ü (e.g. üe is written as yue).
  • ü is written as u when there is no ambiguity (such as ju, qu, and xu), but written as ü when there are corresponding u syllables (such as and ). In such situations where there are corresponding u syllables, it is often replaced with v on a computer, making it easier to type on a standard keyboard.
  • When preceded by a consonant, iou, uei, and uen are simplified as iu, ui, and un (which do not represent the actual pronunciation).
  • As in zhùyīn, what are actually pronounced as buo, puo, muo, and fuo are given a separate representation: bo, po, mo, and fo.
  • The apostrophe (') is often used before a, o, and e to separate syllables in a word where ambiguity could arise, especially when omitting tone marks, e.g., pi'ao (simplified Chinese: 皮袄; traditional Chinese: 皮襖) vs. piao (票), and Xi'an (西安) vs. xian (先).
  • Eh alone is written as ê; elsewhere as e. Schwa is always written as e.
  • zh, ch, and sh can be abbreviated as , ĉ, and ŝ (z, c, s with a circumflex). However, the shorthands are rarely used due to difficulty of entering them on computers.
  • ng has the uncommon shorthand of ŋ.
  • The letter v is unused (except in spelling foreign languages, languages of minority nationalities, and some dialects[9]), despite a conscious effort to distribute letters more evenly than in Western languages.

Most of the above are used to avoid ambiguity when writing words of more than one syllable in pinyin. For example uenian is written as wenyan because it is not clear which syllables make up uenian; uen-ian, uen-i-an and u-en-i-an are all possible combinations whereas wenyan is unambiguous because we, nya, etc. do not exist in pinyin. A summary of possible pinyin syllables (not including tones), can be reviewed at: pinyin table

[edit] Capitalization and word formation

Many writers are not yet aware of the rules for dividing text into words by spaces, and either put a space after each syllable, or run all words together. The manufacturer of these blankets put unnecessary spaces into  'Bishikaike' (the correct pinyin for 比什凯克, 'Bishkek') - but wrote the English text on top with no spaces at all.
Many writers are not yet aware of the rules for dividing text into words by spaces, and either put a space after each syllable, or run all words together. The manufacturer of these blankets put unnecessary spaces into 'Bishikaike' (the correct pinyin for 比什凯克, 'Bishkek') - but wrote the English text on top with no spaces at all.

Although Chinese characters represent single syllables, Mandarin Chinese is a polysyllabic language. Spacing in pinyin is based on whole words, not single syllables. However, there are often ambiguities in partitioning a word. Orthographic rules were put into effect in 1988 by the National Educational Commission (国家教育委员会, pinyin: Guójiā Jiàoyù Wěiyuánhuì ) and the National Language Commission (国家语言文字工作委员会, pinyin: Guójiā Yǔyán Wénzì Gōngzuò Wěiyuánhuì).

  1. General
    1. Single meaning: Words with a single meaning, which are usually set up of two characters (sometimes one, seldom three), are written together and not capitalized: rén (Chinese: , person); péngyou (Chinese: 朋友, friend), qiǎokèlì (simplified Chinese: 巧克力; traditional Chinese: 巧尅力, chocolate)
    2. Combined meaning (2 characters): Same goes for words combined of two words to one meaning: hǎifēng (simplified Chinese: 海风; traditional Chinese: 海風, sea breeze); wèndá (simplified Chinese: 问答; traditional Chinese: 問答, Q&A), quánguó (simplified Chinese: 全国; traditional Chinese: 全國, 'pan-national')
    3. Combined meaning (4 or more characters): Words with four or more characters having one meaning are split up with their original meaning if possible: wúfèng gāngguǎn (simplified Chinese: 无缝钢管; traditional Chinese: 無縫鋼管, seamless steel-tube); huánjìng bǎohù guīhuà (simplified Chinese: 环境保护规划; traditional Chinese: 環境保護規劃, environmental protection planning)
  2. Duplicated words
    1. AA: Duplicated characters (AA) are written together: rénrén (Chinese: 人人, everybody), kànkàn (Chinese: 看看, to have a look), niánnián (Chinese: 年年, every year)
    2. ABAB: two characters duplicated (ABAB) are written separated: yánjiū yánjiū (Chinese: 研究研究, to study, to research), xuěbái xuěbái (Chinese: 雪白雪白, snow-white)
    3. AABB: A hyphen is used with the schema AABB: láilái-wǎngwǎng (simplified Chinese: 来来回回; traditional Chinese: 來來囬囬, go back and forth), qiānqiān-wànwàn (simplified Chinese: 千千万万; traditional Chinese: 仟仟萬萬, numerous)
  3. Nouns and names (míngcí): Nouns are written in one: zhuōzi (Chinese: , table), mùtou (simplified Chinese: 木头; traditional Chinese: 木頭, wood)
    1. Even if accompanied by a prefix and suffix: fùbùzhǎng (simplified Chinese: 副部长; traditional Chinese: 副部長, vice minister), chéngwùyuán (simplified Chinese: 乘务员; traditional Chinese: 乘務員, conductor), háizimen (simplified Chinese: 孩子们; traditional Chinese: 孩子們, children)
    2. Words of position are separated: mén wài (outdoor), hé li (in the river), huǒchē shàngmian (on the train), Huáng Hé yǐnán (south of the Yellow River)
      1. Exceptions are words traditionally connected: tiānshang (in the sky), dìxia (on the ground), kōngzhōng (in the air), hǎiwài (overseas)
    3. Surnames are separated from the given name: Lǐ Huá, Zhāng Sān. If the given name consists of two syllables, it should be written as one: Wáng Jiàngguó.
    4. Titles following the name are separated and are not capitalized: Wáng bùzhǎng (minister Wang), Lǐ xiānsheng (Mr. Li), Tián zhǔrèn (director Tian), Zhào tóngzhì (comrade Zhao).
    5. The forms of addressing people with Lǎo, Xiǎo, and A are capitalized: Xiǎo Liú ([young] Ms. Liu), Dà Lǐ ([great] Mr. Li), A Sān (Ah San), Lǎo Qián ([senior] Mr. Qian), Lǎo Wú ([senior] Ms. Wu)
      1. Exceptions are: Kǒngzǐ (Master Confucius), Bāogōng (Judge Bao), Xīshī (a historical person), Mèngchángjūn (a historical person)
    6. Geographical names of China: Běijīng Shì (City of Beijing), Héběi Shěng (Province of Hebei), Yālù Jiāng (Yalu River), Tài Shān (Mt. Taishan), Dòngtíng Hú (Lake Donting), Táiwān Hǎixiá (Taiwan strait)
    7. Non-Chinese names translated back from Chinese will be written by their original writing: Marx, Einstein, London, Tokyo
  4. Verbs (dòngcí): Verbs and their suffixes (-zhe, -le and -guo) are written as one: kànzhe/kànle/kànguo (to see/saw/seen), jìngxíngzhe (to implement). Le as it appears in the end of a sentence is separated though: Huǒchē dào le (The train [has] arrived).
    1. Verbs and their objects are separated: kàn xìn (read a letter), chī yú (eat fish), kāi wánxiào (to be kidding).
    2. If verbs and their complements are each monosyllabic, they are written together, if not, separated: gǎohuài ("to make broken"), dǎsǐ (hit to death), huàwéi ("to become damp"), zhěnglǐ hǎo (to straighten out), gǎixiě wéi (rewrite a screenplay)
  5. Adjectives (xíngróngcí): A monosyllabic adjective and its reduplication are written as one: mēngmēngliàng (dim), liàngtāngtāng (shining bright)
    1. Complements of size or degree (as xiē, yīxiē, diǎnr, yīdiǎnr) are written separated: dà xiē (a little bigger), kuài yīdiānr (a bit faster)
  6. Pronouns (dàicí)
    1. The plural suffix -men directly follows up: wǒmen (we), tāmen (they)
    2. The demonstrative pronoun zhè (this), nà (that) and the question pronoun nǎ (which) are separated: zhè rén (this person), nà cì huìyì (that meeting), nǎ zhāng bàozhǐ (which newspaper)
      1. Exceptions are: nàli (there), zhèbian (over here), zhège (this piece), zhème (so), zhèmeyàng (that way)... and similar ones.
  7. Numerals and measure words (shùcí hé liàngcí)
    1. Words like /měi (every, each), mǒu (any), běn (that), gāi (that), (mine, our), are separated from the measure words following them: gè guó (every nation), gè gè (everyone), měi nián (every year), mǒu gōngchǎng (a certain factory), wǒ xiào (our school).

[edit] Tones

Relative pitch changes of the four tones
Relative pitch changes of the four tones

The pinyin system also uses diacritics for the four tones of Mandarin, usually above a non-medial vowel. Many books printed in China mix fonts, with vowels and tone marks rendered in a different font than the surrounding text, tending to give such pinyin texts a typographically ungainly appearance. This style, most likely rooted in early technical limitations, has led many to believe that pinyin's rules call for this practice and also for the use of "ɑ" (with no curl over the top) rather than the standard style of the letter "a" found in most fonts. The official rules of Hanyu Pinyin, however, specify no such practice. Note that tone marks can also appear on consonants in certain vowelless exclamations.

  1. The first tone (Flat or High Level Tone) is represented by a macron (ˉ) added to the pinyin vowel:

    ā (ɑ̄) ē ī ō ū ǖ Ā Ē Ī Ō Ū Ǖ
  2. The second tone (Rising or High-Rising Tone) is denoted by an acute accent (ˊ):

    á (ɑ́) é í ó ú ǘ Á É Í Ó Ú Ǘ
  3. The third tone (Falling-Rising or Low Tone) is marked by a caron/háček (ˇ). It is not the rounded breve (˘), though a breve is sometimes substituted due to font limitations.

    ǎ (ɑ̌) ě ǐ ǒ ǔ ǚ Ǎ Ě Ǐ Ǒ Ǔ Ǚ
  4. The fourth tone (Falling or High-Falling Tone) is represented by a grave accent (ˋ):

    à (ɑ̀) è ì ò ù ǜ À È Ì Ò Ù Ǜ
  5. The fifth or neutral tone (Neutral Tone) is represented by a normal vowel without any accent mark:

    a (ɑ) e i o u ü A E I O U Ü
(In some cases, this is also written with a dot before the syllable; for example, ·ma.)

These tone marks normally are only used in Mandarin textbooks or in foreign learning texts, but they are essential for correct pronunciation of Mandarin syllables, as exemplified by the following classic example of five characters whose pronunciations differ only in their tones:

Traditional characters:

() () () () (·ma)

Simplified characters:

() () () () (·ma)

mā má mǎ mà

A sound sample of the four tones
Problems listening to the file? See media help.

The words are "mother", "hemp", "horse", "scold" and a question particle, respectively.

[edit] Numerals in place of tone marks

Since before the advent of computers, many fonts did not contain vowels with macron or caron diacritics, a common convention for tone is to add a tone number at the end of individual syllables. For example, tóng is written tong2. The number used for each tone is as the order listed above (except the neutral tone, which is either not numbered or numbered zero, as in ma0 吗/嗎 (an interrogative marker).
ToneTone MarkNumber added to end of syllable
in place of tone mark
Example using
tone mark
Example using
number
IPA
Firstmacron ( ˉ )1ma1mɑ˥˥
Secondacute accent ( ˊ )2ma2mɑ˧˥
Thirdcaron ( ˇ )3ma3mɑ˨˩˦
Fourthgrave accent ( ˋ )4ma4mɑ˥˩
"Neutral"No mark
or dot before syllable (·)
no number
0
ma
·ma
ma
ma0

[edit] Rules for placing the tone mark[10]

The rules for determining on which vowel the tone mark appears are as follows:

  1. If there is more than one vowel and the first vowel is i, u, or ü, then the tone mark appears on the last vowel.[11][12]
  2. In all other cases, the tone mark appears on the vowels in this order: a, o, e.

(y and w are not considered vowels for these rules.)

The reasoning behind these rules is in the case of diphthongs and triphthongs, i, u, and ü (and their orthographic equivalents y and w when there is no initial consonant) are considered medial glides rather than part of the syllable nucleus in Chinese phonology. The rules ensure that the tone mark always appears on the nucleus of a syllable.

Another algorithm for determining the vowel on which the tone mark appears is as follows:

  1. First, look for an "a" or an "e". If either vowel appears, it takes the tone mark. There are no possible pinyin syllables that contain both an "a" and an "e".
  2. If there is no "a" or "e", look for an "ou". If "ou" appears, then the "o" takes the tone mark.
  3. If none of the above cases hold, then the last vowel in the syllable takes the tone mark.

[edit] The character "ü"

An umlaut is placed over the letter u when it occurs after the initials l and n in order to represent the sound [y]. This is necessary in order to distinguish the front high rounded vowel in (e.g. 驴/驢 donkey) from the back high rounded vowel in lu (e.g. 炉/爐 oven). Tonal markers are added on top of the umlaut, as in .

However, the ü is not used in other contexts where it represents a front high rounded vowel, namely after the letters j, q, x and y. For example, the sound of the word 鱼/魚 (fish) is transcribed in pinyin simply as , not as . This practice is opposed to Wade-Giles, which always uses ü, and Tongyong Pinyin, which always uses yu. Whereas Wade-Giles needs to use the umlaut to distinguish between chü (pinyin ju) and chu (pinyin zhu), this ambiguity cannot arise with pinyin, so the more convenient form ju is used instead of . Genuine ambiguities only happen with nu/ and lu/, which are then distinguished by an umlaut diacritic.

Many fonts or output methods do not support an umlaut for ü or cannot place tone marks on top of ü. Likewise, using ü in input methods is difficult because it is not present as a simple key on many keyboard layouts. For these reasons v is sometimes used instead by convention. Occasionally, uu (double u), u: (u followed by a colon) or U (capital u) is used in its place.

[edit] Comparison chart

Vowels a, e, o, i
IPAɑɔɤɑʊɤʊanənɑŋɤŋɑɻʊŋiiɤʊiɛniɪnjiŋ
Pinyinaoeaieiaoouanenangengerongyiyeyouyanyinying
Tongyong Pinyinaoeaieiaoouanenangengerongyiyeyouyanyinying
Wade-Gilesaoo/êaieiaoouanênangêngêrhungiyehyuyenyinying
Zhuyinㄨㄥㄧㄝㄧㄡㄧㄢㄧㄣㄧㄥ
example
Vowels u, y
IPAuueɪuanuənuʊnuɤŋuʊŋyyɛnyniʊŋ
Pinyinwuwoweiwanwenwengyuyueyuanyunyong
Tongyong Pinyinwuwoweiwanwunwongyuyueyuanyunyong
Wade-Gileswuwoweiwanwênwêngyüehyüanyünyung
Zhuyinㄨㄛㄨㄟㄨㄢㄨㄣㄨㄥㄩㄝㄩㄢㄩㄣㄩㄥ
example


Non-sibilant consonants
IPApmfəŋfʊŋtioutueinylykəɻ
Pinyinbpmfengdiuduitgerkhe
Tongyong Pinyinbpmfongdioudueitnyulyugerkhe
Wade-Gilespp'mfêngtiutuit'kêrhk'ho
Zhuyinㄈㄥㄉㄧㄡㄉㄨㄟㄋㄩㄌㄩㄍㄜㄦㄏㄜ
example歌儿
Sibilant consonants
IPAtɕiɛntɕyʊŋtɕʰɪnɕyɛnʈʂəʈʂɚʈʂʰəʈʂʰɚʂəʂɚʐəʐɚtsətsuɔtsɨtsʰətsʰɨ
Pinyinjianjiongqinxuanzhezhichechisheshirerizezuozicecisesi
Tongyong Pinyinjianjyongcinsyuanjhejhihchechihsheshihrerihzezuozihcecihsesih
Wade-Gileschienchiungch'inhsüanchêchihch'êch'ihshêshihjihtsêtsotzŭts'êtz'ŭszŭ
Zhuyinㄐㄧㄢㄐㄩㄥㄑㄧㄣㄒㄩㄢㄓㄜㄔㄜㄕㄜㄖㄜㄗㄜㄗㄨㄛㄘㄜㄙㄜ
example
Tones
IPAma˥˥ma˧˥ma˨˩˦ma˥˩ma
Pinyinma
Tongyong Pinyinma
Wade-Gilesma1ma2ma3ma4ma0
Zhuyinㄇㄚㄇㄚˊㄇㄚˇㄇㄚˋㄇㄚ・
example (traditional/simplfied)媽/妈麻/麻馬/马罵/骂媽媽/妈妈

[edit] Pinyin in Taiwan

Taiwan adopted Tongyong Pinyin on the national level in October 2002. Tongyong Pinyin is a modified version of Hanyu Pinyin. The adoption of Tongyong Pinyin has also resulted in political controversy. Much of the controversy centered on issues of national identity, with proponents of Chinese reunification favoring the Hanyu Pinyin system which is used in the People's Republic of China, and proponents of Taiwanese independence favoring the use of Tongyong Pinyin.[citation needed]

Localities with governments controlled by the Kuomintang, most notably Taipei City, have overridden the 2002 administrative order and converted to Hanyu Pinyin (although with a slightly different capitalization convention than the Mainland). As a result, the use of romanization on signage in Taiwan is inconsistent, with many places using Tongyong Pinyin but some using Hanyu Pinyin, and still others not yet having had the resources to replace older Wade-Giles or MPS2 signage. This has resulted in the odd situation in Taipei City in which inconsistent pinyin transcriptions are shown in freeway directions — with freeway signs, which are under the control of the national government, using one pinyin, but surface street signs, which are under the control of the city government, using the other.[citation needed]

Primary education continues to teach pronunciation using the zhùyīn system in Taiwan. Although the ROC government has stated the desire to use romanization rather than zhùyīn in education, the lack of agreement on which form of pinyin to use and the huge logistical challenge of teacher training has stalled these efforts.[citation needed]

[edit] Other languages

Pinyin-like systems have been devised for other variants of Chinese. Guangdong Romanization is a set of romanizations devised by the government of Guangdong province for Cantonese,