Hànzì Analyzer

Simplified vs. Traditional

The GB2312 set contains 6763 characters. Of these, 2140 (32%) are simplified, leaving 4623 (68%) traditional characters.

  • In the GB2312 set, the top five most common stroke counts are 10, 11, 9, 8, and 12, accounting for 51% of all characters.

  • In the primary Big5 set, the top six most common stroke counts are 11, 12, 10, 13, 15, and 14, accounting for 49% of all characters.

Radicals

(Note: considering GB2312.)

GB2312 uses all but one of the 214 radicals. The exception, 鬥, simplifies to another radical: 斗.

The three most common radicals each account for about 5% of the characters: 水 (364 characters), 艸 (338 characters), and 口 (332 characters). Together they cover 1034 characters, about 15% of the set.

The top 16 radicals (7%) account for 50% of the characters. The remaining 198 radicals (93%) account for the other 50% of the characters. The top 16 radicals are:

    水 艸 口 木 手 人 金 心 糸 虫 言 肉 土 女 火 竹

The top 86 radicals (40%) account for 90% of the characters. The remaining 128 radicals (60%) account for the other 10%. The bottom 100 radicals account for only 6% of the characters.
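The coverage figures above are cumulative shares over radicals sorted by character count. A minimal Python sketch of that calculation follows. The function name `radicals_needed` and the toy counts are illustrative assumptions, not the analyzer's own code or data; only the counts for 水, 艸, and 口 and the set size come from the text.

```python
GB2312_TOTAL = 6763  # size of the GB2312 set (from the text)

# Counts for the three most common radicals (from the text).
top_three = {"水": 364, "艸": 338, "口": 332}
top_three_sum = sum(top_three.values())
share = top_three_sum / GB2312_TOTAL
print(f"{top_three_sum} characters, {share:.0%}")  # 1034 characters, 15%


def radicals_needed(counts, target):
    """Return how many of the largest radicals are needed to cover
    at least `target` (a fraction from 0 to 1) of all characters."""
    total = sum(counts.values())
    covered = 0
    ranked = sorted(counts.values(), reverse=True)
    for rank, size in enumerate(ranked, start=1):
        covered += size
        if covered >= target * total:
            return rank
    return len(counts)


# Toy data (hypothetical): four radicals covering 100 characters.
toy = {"a": 50, "b": 30, "c": 15, "d": 5}
print(radicals_needed(toy, 0.5))  # 1
print(radicals_needed(toy, 0.9))  # 3
```

Given a full radical-to-count table for GB2312, the same function would be expected to return 16 for a 50% target and 86 for a 90% target, matching the figures above.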

Comparison of the top 20 radicals in GB2312 and Big5:

  • GB2312: 水 艸 口 木 手 人 金 心 糸 虫 言 肉 土 女 火 竹 辵 石 疒 足

  • Big5: 水 口 手 木 人 艸 心 言 金 糸 女 肉 土 虫 辵 火 竹 日 玉 疒

Syllables and Tones

(Note: considering Mandarin and GB2312.)

Ignoring tones, there are 410 different syllables. Including tones, there are 1337 syllables. Note that if all 410 syllables covered four tones, the total would be 1640.

There are 185 syllables (45%) that cover all four tones, leaving 225 syllables (55%) that don't.
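The arithmetic behind these totals can be checked in a few lines of Python. This is only a sketch using the numbers quoted above; the constant names are made up for illustration.

```python
BASE_SYLLABLES = 410     # syllables ignoring tones (from the text)
TONED_SYLLABLES = 1337   # syllables including tones (from the text)
ALL_FOUR_TONES = 185     # syllables that cover all four tones (from the text)

# Upper bound if every syllable existed in all four tones.
max_four_tone = BASE_SYLLABLES * 4
print(max_four_tone)  # 1640

# Syllables that miss at least one of the four tones.
incomplete = BASE_SYLLABLES - ALL_FOUR_TONES
print(incomplete)  # 225

print(f"{ALL_FOUR_TONES / BASE_SYLLABLES:.0%}")  # 45%

# Average number of toned syllables per toneless syllable.
print(f"{TONED_SYLLABLES / BASE_SYLLABLES:.2f}")  # 3.26
```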

Tone coverage:

  • There are 335 syllables (82%) that have a first tone; there are 75 (18%) that don't.

  • There are 267 syllables (65%) that have a second tone; there are 143 (35%) that don't.

    • Of those that do, 47 (18%) have no third tone.

  • There are 329 syllables (80%) that have a third tone; there are 81 (20%) that don't.

    • Of those that do, 109 (33%) have no second tone.

  • There are 365 syllables (89%) that have a fourth tone; there are 45 (11%) that don't.
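The tone coverage figures can be cross-checked against each other. The sketch below uses only the numbers listed above. Its variable names are illustrative, and the reading of the final remainder is an assumption, since the page does not say how neutral tones are counted.

```python
TOTAL = 410  # syllables ignoring tones (from the text)

# Syllables that have each tone (from the list above).
with_tone = {1: 335, 2: 267, 3: 329, 4: 365}

for tone, n in with_tone.items():
    print(f"tone {tone}: {n} with ({n / TOTAL:.0%}), {TOTAL - n} without")

# Syllables with both a second and a third tone, computed two ways.
both_via_second = with_tone[2] - 47    # second tone, minus those lacking a third
both_via_third = with_tone[3] - 109    # third tone, minus those lacking a second
print(both_via_second, both_via_third)  # 220 220

# Sum over the four tones versus the 1337 toned syllables quoted earlier.
four_tone_sum = sum(with_tone.values())
remainder = 1337 - four_tone_sum
print(four_tone_sum, remainder)  # 1296 41
# Assumption: the remainder would be neutral-tone syllables; the source
# does not state this.
```

The two routes to the second-and-third overlap agree at 220, so the sub-bullets are consistent with their parent counts.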

One tone only:

  • First tone only (5): den diu hei keng seng

  • Second tone only (7): fo m neng nin shei teng zei

  • Third tone only (5): dei dia gei lia ruan

  • Fourth tone only (12): ce kuo miu nen nou nüe nun ri run se te zhei

  • Neutral tone only (3): hng lo me
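Lists like these fall out of a mapping from each syllable to the set of tones it occurs with. A minimal Python sketch, assuming such a mapping is available: the function name, the use of 5 for the neutral tone, and the sample data are all illustrative. The single-tone entries are taken from the lists above, and "ma" is added as an assumed example of a syllable with several tones.

```python
def single_tone_syllables(tones_by_syllable):
    """Group syllables that occur with exactly one tone, keyed by that tone."""
    groups = {}
    for syllable, tones in sorted(tones_by_syllable.items()):
        if len(tones) == 1:
            (tone,) = tones  # unpack the only element of the set
            groups.setdefault(tone, []).append(syllable)
    return groups


# Tones 1-4 are the four main tones; 5 stands for the neutral tone here.
sample = {
    "hei": {1},
    "fo": {2},
    "gei": {3},
    "ri": {4},
    "me": {5},
    "ma": {1, 2, 3, 4, 5},  # assumed multi-tone example, skipped by the function
}

result = single_tone_syllables(sample)
for tone in sorted(result):
    print(tone, result[tone])
```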

Syllables and Pronunciation

(Note: considering Mandarin and GB2312.)

Ignoring tones, the ten most-used syllables are: yi ji yu zhi xi fu yan li qi jian. These cover 17% of all characters.

Including tones, the ten most-used syllables are: yì bì xī yù lì jī yú shì jì fú. These cover 8% of all characters.

For those intimidated by the 'r' initial: only 128 characters (2%) use it.

("Most-used" refers to usage within the set of GB2312 characters, not to what is actually spoken or written.)
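The two rankings in this section differ only in whether tone marks are removed before counting. The Python sketch below shows one way to do that. The helper names and the sample readings are hypothetical, not the analyzer's code or data. It removes only the four tone diacritics, so the diaeresis in ü is kept.

```python
import unicodedata
from collections import Counter

# Combining macron, acute, caron, grave: pinyin tones 1 to 4.
TONE_MARKS = {"\u0304", "\u0301", "\u030c", "\u0300"}


def strip_tone(syllable):
    """Remove pinyin tone marks but keep other diacritics such as the one in ü."""
    decomposed = unicodedata.normalize("NFD", syllable)
    kept = "".join(ch for ch in decomposed if ch not in TONE_MARKS)
    return unicodedata.normalize("NFC", kept)


def most_used(readings, n, ignore_tones=False):
    """Return the n most frequent syllables in a list of readings."""
    if ignore_tones:
        readings = [strip_tone(r) for r in readings]
    return [syllable for syllable, _ in Counter(readings).most_common(n)]


# Hypothetical sample: one reading per character.
sample = ["y\u00ec", "y\u012b", "y\u00ed", "b\u00ec", "b\u00ec", "n\u00fc\u00e8"]

print(most_used(sample, 1, ignore_tones=True))  # ['yi']
print(strip_tone("n\u00fc\u00e8"))  # nüe
```

With tones kept, the most frequent reading in the sample is the fourth-tone bi, which appears twice. With tones ignored, the three yi readings merge and move to the top, which is the same effect that separates the two rankings above.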

Copyright © 2007-2008 by Jens Farley