Wiktionary:About Chinese
This is a Wiktionary policy, guideline or common practices page. This is a draft proposal. It is unofficial, and it is unknown whether it is widely accepted by Wiktionary editors. | |
Policies – Entries: CFI - EL - NORM - NPOV - QUOTE - REDIR - DELETE. Languages: LT - AXX. Others: BLOCK - BOTS - VOTES. |
The Chinese or Sinitic language family includes a number of related lects which have very similar written forms, but different grammar, vocabulary and especially pronunciation. On Wiktionary, these lects are treated under the header ==Chinese==
and the language code zh unless they natively use a non-Chinese script.
Key points
- The various varieties of Chinese are subsumed under the header
==Chinese==
and the language code zh (vote).- The exception is when the variety of Chinese natively uses a non-Chinese script, e.g. Dungan. The layout of these entries is currently undecided.
- A Traditional Chinese form of a Chinese word, usually the most commonly used Traditional Chinese form, is chosen as the lemma (vote).
- All other forms of exactly the same word should soft-redirect to the lemma form using
{{zh-see}}
.
- All other forms of exactly the same word should soft-redirect to the lemma form using
- Terms are defined in relation to Modern Standard Written Chinese.
Entry format
Chinese entries should follow format guidelines in Entry layout. 朋友 is a good example of how Chinese entries should ideally be formatted.
{{zh-new}}
can be used to accelerate creation of entries.
Unless the entry is of a variant or simplified spelling, templates that are almost always obligatory are {{zh-forms}}
and {{zh-pron}}
.
Some useful Chinese-specific templates include {{zh-l}}
and {{zh-x}}
.
Variants and simplified forms use {{zh-see}}
to redirect to the standard traditional form.
干 is a good example of how Chinese character entries should ideally be formatted.
Basic headers for single characters
- Glyph origin
- Describes how the character obtained its current shape. Templates used include
{{Han etym}}
and{{Han compound}}
. - A character is not always of Chinese origin; see 乭 (shí) for an example of what to do in this case.
- Etymology
- Describes the origin of the character's pronunciation(s). (Or, used to host a
{{zh-see}}
box; see documentation for details and 干 for an example.) - Note that Old Chinese and Middle Chinese have been subsumed under "Chinese", so indicating that Mandarin 黑 (hēi) is inherited from Old Chinese 黑 (*hmlɯːɡ /*hmlɯːɡ/) or Middle Chinese 黑 (xok) is redundant. However, indicating that a character is derived from a different character is fine, such as with 佑 (yòu) (from Old Chinese 右).
- It is preferred that an entry is split by etymology per Old Chinese and Middle Chinese ancestor.
- Pronunciation
- Hosts
{{zh-pron}}
; see documentation for details. - The
|cat=
parameter is responsible for sorting entries into categories such as Category:Chinese verbs, Category:Mandarin nouns, and Category:Cantonese chengyu and should be filled out when reasonable.
- Definitions
- Hosts definitions. Rationale for using a "Definitions" header instead of "Noun", "Verb", or other more specific part of speech headers can be found here.
- The
{{head|zh|hanzi}}
template is found directly under this header as the headword template. It has little practical value but is currently part of the standard Wiktionary entry format.
Basic headers for words
- Glyph origin
- This is usually not needed unless it is convenient to discuss it together (like 彳亍 (chìchù), 旮旯 (gālá)).
- Etymology
- Etymologies for words may show change of pronunciations or spellings, and semantic change (change in meaning). However, it is not necessary to duplicate any content from
{{zh-forms}}
. For simple figurative or idiomatic meaning, the|lit=
parameter of{{zh-forms}}
may be used to show the literal meaning.
- Pronunciation
- This section is the same as the one for single character entries.
- Definitions
- Definitions for words should be defined using specific part of speech headers, and appropriate headword template. For noun entries, use
{{zh-mw}}
to indicate classifiers for each sense and categorize the entry by classifiers; classifiers in the headword template is deprecated.
Other templates
{{zh-forms}}
: found either at the top of an entry or under an Etymology header. See documentation for details.
Headword-line templates
- Wiktionary previously had several headword-line templates specifically for Chinese, but all have been deprecated except for
{{zh-verb}}
, used for marking morphological structure. In most cases, you can simply use{{head}}
.
About specific lects
This section lists all Chinese dialects supported and proposed to be supported. Dialect names in italics are currently not supported (in to-do list); other content in italics is tentative and needs further discussion.
Subgroup of Chinese |
Variety chosen | Romanisation | Help page | Romanisation allowed as entries? | Tone sandhi |
---|---|---|---|---|---|
Mandarin | Standard Mandarin (Beijing dialect/Taiwan Mandarin) |
Hanyu Pinyin | Allowed for (all as as non-lemmas, votes: 1, 2): | Only original tones are indicated in Pinyin, e.g. consecutive third tones are shown as third tones. Phonetic pinyin is shown in the expanded mode on tone sandhi with characters 一 and 不 when the actual tone differs from the nominal, e.g. 一定 (yīdìng) [phonetic: yídìng], 不過/不过 (bùguò) [phonetic: búguò] | |
Sichuanese (Chengdu dialect) | Sichuanese Pinyin | help | No | Indicated by hyphen - .
| |
Xi'an (Xi'an dialect) | Modified Hanyu Pinyin | help | No | See the romanisation page. | |
Wuhannese (Wuhan dialect) | Modified Hanyu Pinyin, based on that of the Longren dictionary |
No | Indicated by hyphen - ?
| ||
Nankinese (Nanjing dialect) | Modified Hanyu Pinyin | help | No | Original tones are displayed. | |
Dungan | Use Dungan term in Cyrillic script. | help | Allow for terms in Cyrillic script and Xiao'erjing as lemmas | Indicated by hyphen - .
| |
Northeastern Mandarin (Harbin or Shenyang dialect) | Hanyu Pinyin | to be determined | to be determined | ||
Cantonese | Standard Cantonese (Guangzhou dialect/Hong Kong dialect) |
Jyutping | help | Allowed for monosyllables as non-lemmas (vote). | Changed tones are indicated by hyphen - .
|
Dongguan (Guancheng dialect) |
adapted Jyutping++ | help | No | See the romanisation page. | |
Taishanese (Taicheng dialect) |
Wiktionary | help | No | See the romanisation page. | |
Yangjiang (Jiangcheng dialect) |
adapted Jyutping++ | help | No | Does not exist. | |
Gan | Nanchang dialect | Wiktionary | help | No | |
Hakka | Sixian dialect (north and south) |
Pha̍k-fa-sṳ | help | No | Original tones are displayed. |
Hailu dialect | Hakka Romanization System | No | Original tones are displayed. | ||
Meixian dialect | Guangdong Romanization | help | No | Original tones are displayed. | |
Tingzhou dialect | to be determined | No | |||
Huizhou | to be determined | to be determined | help | No | |
Jin | Taiyuan dialect | Wiktionary | help | No | |
Central Min | Yong'an dialect | to be determined, but an in-house transliteration system could be similar to Kienning Colloquial Romanized | help | No | to be determined |
Eastern Min | Fuzhou dialect | Bàng-uâ-cê | help | No | |
Fu'an dialect | to be determined, but an in-house transliteration system could be similar to Bàng-uâ-cê | No | to be determined | ||
Northern Min | Jian'ou dialect | Kienning Colloquial Romanized | help | No | |
Puxian Min | Putian dialect, Xianyou dialect |
Pouseng Ping'ing, Hinghwa Romanized | help | No | See the romanisation page. |
Southern Min | Datian Min | to be determined | No | ||
Hainanese (multiple dialects) |
to be determined | help | No | to be determined | |
Hokkien (multiple dialects) |
Pe̍h-ōe-jī | help | Allowed as lemmas (e.g. put-khó-su-gī) | Only original tones are used. | |
Leizhou Min | Leizhou Pinyin | No | |||
Teochew (multiple dialects) |
Guangdong Ministry of Education's Teochew Romanization Scheme (not Gaginang's Peng'im) |
No | |||
Shaojiang | Shaowu dialect | to be determined, but an in-house transliteration system could be similar to Kienning Colloquial Romanized | No | to be determined | |
Northern Pinghua | to be determined | to be determined | No | ||
Southern Pinghua | Nanning Pinghua (Tingzi dialect) |
adapted Jyutping++ | help | No | Does not exist. |
Wu | Northern Wu (multiple dialects) |
Wugniu | help | No | See WT:AZH/Wu#Tones |
Jinhua dialect | Wugniu | No | |||
Taizhou dialect | Wugniu | No | |||
Wenzhou dialect (Lucheng dialect) | Wugniu | help | No | ||
Xiang | Changsha dialect | Wiktionary | help | No | |
Loudi dialect | Wiktionary | help | No | ||
Hengyang dialect | Wiktionary | help | No |
Category sorting
In general, categories should be sorted by radical–stroke sortkeys produced by Module:Hani-sortkey.
Chinese characters
Chinese characters should not be conflated with Chinese words or morphemes. General information about the characters themselves belong in the Translingual section, which appears before all other sections. See Wiktionary:About Chinese characters for discussion of its format.
Additional help
Help from the community
Sometimes, we know there is a problem, but don’t know what to do to correct the problem. If you should find a Chinese entry with a problem that you do not know how to correct, there are several ways to approach the situation.
- Mark the page with
{{attention}}
. This template adds the entry to a cleanup category, where another user can then find and correct the problem. It helps if you include comments explaining what the problem is or why you think the page needs attention. - Raise the issue on Wiktionary talk:About Chinese. Note that this approach is primarily for issues of style, formatting, categorization, and not for specifics of content.
- Mark the page with
{{rfc}}
. This is a more general cleanup tag, and it allows the user to include reasons or concerns as an argument in the template. Be sure to also add an entry to WT:RFC concerning the word so that other editors will be made aware of the problem.
Translations into Chinese lects
- All translations into Chinese lects must be grouped under
* Chinese
. Subdialects can be sub-nested. Regional variations can be flagged with{{qualifier}}
.
* Chinese: *: Hokkien: {{t|nan-hbl|雪文|tr=sat-bûn}} {{qualifier|Zhangzhou}}, {{t|nan-hbl|茶塊|tr=tê-kóe}} {{qualifier|Quanzhou}} ...
- Mandarin: 肥皂 (féizào)
- The traditional precedes the simplified version if they are different and the transliteration is provided with the simplified version. All Chinese varieties need both traditional and simplified forms.
* Chinese: *: Mandarin: {{t|cmn|心理學}}, {{t|cmn|心理学|tr=xīnlǐxué}}
- If a translation is both simplified and traditional, only one translation is given.
* Chinese: *: Mandarin: {{t|cmn|三明治|tr=sānmíngzhì}}
Current tasks
Other Chinese aids
Selected previous discussions
- Wiktionary:Beer parlour archive/July 06 § Min Nan
- Wiktionary:Beer parlour archive/2007/April § Amoy
- Wiktionary:Beer parlour archive/2007/April § Headings for 漢語, 閩南話, 粵語 etc.
- Wiktionary:Beer parlour archive/2009/May § Recurring problem with Chinese vs. Mandarin
- Wiktionary:Beer parlour/2011/January § Renaming "Mandarin" headings to "Chinese"
- Wiktionary:Votes/pl-2014-04/Unified Chinese and Wiktionary talk:Votes/pl-2014-04/Unified Chinese