Simplified vs. TraditionalThere are 6763 characters in the GB2312 set. Of the 6763 characters, 2140 (32%) are simplified, leaving 4623 (68%) traditional characters.
Radicals(Note: Considering GB2312.) Of the 214 radicals, one is unused by GB2312: 鬥. This radical simplifies to another radical: 斗. The three most popular radicals each account for about 5% of the characters: 水 (364 characters), 艸 (338 characters), and 口 (332 characters). This is 1034 characters, which is about 15% of all the characters. The top 16 radicals (7%) account for 50% of the characters. The remaining 198 radicals (93%) account for the other 50% of the characters. The top 16 radicals are: 水 艸 口 木 手 人 金 心 糸 虫 言 肉 土 女 火 竹 The top 86 radicals (40%) account for 90% of the characters. The remaining 128 radicals (60%) account for the other 10%. The bottom 100 radicals account for only 6% of the characters. Comparison of top 20 radicals from GB2312 and Big5:
Syllables and Tones(Note: considering Mandarin and GB2312.) Ignoring tones, there are 410 different syllables. Including tones, there are 1337 syllables. Note that if all 410 syllables covered four tones, the total would be 1640. There are 185 syllables (45%) that cover all four tones, leaving 225 syllables (55%) that don't. Tone coverage:
One tone only:
Syllables and Pronunciation(Note: considering Mandarin and GB2312.) Ignoring tones, the ten most-used syllables are: yi ji yu zhi xi fu yan li qi jian. These cover 17% of all characters. Including tones, the ten most-used syllables are: yì bì xī yù lì jī yú shì jì fú. These cover 8% of all characters. For those intimidated by the 'r' initial: only 128 characters (2%) use it. (Most-used refers to usage within the set of GB2312 characters, not what is actually spoken or written.) Copyright © 2007-2008 by Jens Farley |