|yale romanization||hon yúh|
|canton romanization||hon3 yü5|
|ipa||cantonese pronunciation: [hɔ̄ːn.jy̬ː]|
|hokkien poj||hàn-gí, hàn-gú|
|literal meaning||middle/central/chinese text|
|yale romanization||jūng mán|
|canton romanization||zung1 men4*2|
chinese (simplified chinese: 汉语; traditional chinese: 漢語; pinyin: hànyǔ; literally: 'han language' or especially though not exclusively for written chinese: 中文; zhōngwén; 'chinese writing') forms the sinitic branch of the sino-tibetan languages. chinese languages are spoken by the ethnic han chinese majority and many minority ethnic groups in china. about 1.2 billion people (around 16% of the world's population) speak some form of chinese as their first language.
the varieties of chinese are usually considered by native speakers to be regional variants of ethnic chinese speech, without consideration of whether they are mutually intelligible. due to their lack of mutual intelligibility, linguists generally describe them as distinct languages, perhaps hundreds, sometimes noting that they are more varied than the romance languages.[b] investigation of the historical relationships among the sinitic languages is just getting started. currently, most classifications posit 7 to 13 main regional groups, based on often superficial phonetic developments, of which the most populous by far is mandarin (about 800 million speakers, e.g. southwestern mandarin), followed by min (75 million, e.g. southern min), wu (74 million, e.g. shanghainese), and yue (68 million, e.g. cantonese). these groups are unintelligible to each other, and generally many of their subgroups are mutually unintelligible as well (e.g., not only is min chinese a family of mutually unintelligible languages, but southern min itself is not a single language). there are, however, several transitional areas, where languages and dialects from different branches share enough features for some limited intelligibility between neighboring areas. examples are new xiang and southwest mandarin, xuanzhou wu and lower yangtze mandarin, jin and central plains mandarin, and certain divergent dialects of hakka with gan (though these are unintelligible with mainstream hakka). all varieties of chinese are tonal to at least some degree and largely analytic.
standard chinese (pǔtōnghuà/guóyǔ/huáyǔ) is a standardized form of spoken chinese based on the beijing dialect of mandarin. it is an official language of china, similar to one of the national languages of taiwan (taiwanese mandarin) and one of the four official languages of singapore. it is one of the six official languages of the united nations. the written form of the standard language (中文; zhōngwén), based on the logograms known as chinese characters (汉字/漢字; hànzì), is shared by literate speakers of otherwise unintelligible dialects.
the earliest chinese written records are shang dynasty-era oracle inscriptions, which can be traced back to 1250 bce. the phonetic categories of archaic chinese can be reconstructed from the rhymes of ancient poetry. during the northern and southern dynasties period, middle chinese went through several sound changes and split into several varieties following prolonged geographic and political separation. qieyun, a rime dictionary, recorded a compromise between the pronunciations of different regions. the royal courts of the ming and early qing dynasties operated using a koiné language (guanhua) based on nanjing dialect of lower yangtze mandarin. standard chinese was adopted in the 1930s, and is now an official language of both the people's republic of china and the republic of china on taiwan.