August 15, 2020
We know there are quite a few Chinese characters (汉字, Hànzì), but how many, exactly? The answer depends on perspective. Are we considering all characters in existence throughout China’s long history or “just” those used in modern times?
The number of unique Chinese characters used through the ages, though unknown, is safely in excess of 100,000. The largest number ever recorded in a single Chinese dictionary is 106,230, in the 2004 Dictionary of Chinese Character Variants (異體字字典, Yìtǐzì Zìdiǎn) published by Taiwan’s Ministry of Education. Only a subset of these characters is still in regular use today.
In 2013, the Chinese government published a list of the 3,500 most essential characters used in modern Chinese. Chinese schoolchildren are expected to learn all 3,500 at a minimum, though many graduate knowing 5,000, 6,000 or more.
From writing on bone fragments thousands of years ago to typing on smartphones today, the quantity and logographic form of Chinese characters have evolved, and continue to evolve, over time.
Oracle bones and Chinese character origins
The Chinese system of writing has a long and complex history spanning more than 3,000 years. The oldest surviving examples of Chinese writing appear on oracle bones unearthed in Henan by farmers and archeologists. These inscriptions were made around 3,200 years ago during the Shang Dynasty.
Far fewer characters existed when these early writers etched messages into ox and turtle bones. The following video offers a 10-minute overview of the evolution of Chinese characters from oracle bone script to modern-day Chinese.
How the number of Chinese characters evolves over time
It is clear that many of the characters in use today developed from pictographic ancestors that date back more than two thousand years. In some cases, characters changed so much over the millennia that their modern form is completely different.
Yet in other cases, like the one below, characters can be quite similar to their ancient forms.
One of the most significant recent developments in the character system was the introduction of simplified characters around the middle of the 20th century. A total of 2,135 commonly used but relatively complex traditional Chinese characters were redesigned into simpler forms that could be remembered more easily and written more quickly. This was done in order to increase literacy rates in China.
While traditional characters are still used in Hong Kong, Taiwan, and several overseas Chinese communities, simplified characters are now standard in Mainland China even though many people can still read the traditional characters.
See how the number of officially recognized Chinese characters has developed over the last two millennia:
|Name of Dictionary (hànzì)||Name of Dictionary (pīnyīn)||Published in:||Total Characters|
|說文解字||Shuōwén Jiězì||121 AD||9,353|
|康熙字典||Kāngxī Zìdiǎn||1716 AD||49,174|
|中华字海||Zhōnghuá Zìhǎi||1994 AD||85,568|
|異體字字典||Yìtǐzì Zìdiǎn||2004 AD||106,230|
|现代汉语词典||Xiàndài Hànyǔ Cídiǎn||2016 AD||12,500|
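The figures in the table can be compared with a short Python sketch. It only restates the counts listed above and computes the change from one dictionary to the next; the names `dictionaries` and `count_changes` are made up for this illustration.

```python
# Character counts copied from the table above:
# (romanized title, year of publication, total characters).
dictionaries = [
    ("Shuōwén Jiězì", 121, 9_353),
    ("Kāngxī Zìdiǎn", 1716, 49_174),
    ("Zhōnghuá Zìhǎi", 1994, 85_568),
    ("Yìtǐzì Zìdiǎn", 2004, 106_230),
    ("Xiàndài Hànyǔ Cídiǎn", 2016, 12_500),
]


def count_changes(entries):
    """Return (earlier title, later title, change in count) for each consecutive pair."""
    return [
        (prev[0], curr[0], curr[2] - prev[2])
        for prev, curr in zip(entries, entries[1:])
    ]


for earlier, later, diff in count_changes(dictionaries):
    # The "+," format shows an explicit sign and thousands separators.
    print(f"{earlier} -> {later}: {diff:+,}")
```

Every step up to 2004 is an increase, while the last step is a drop of more than 90,000 characters.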
So why does the character count increase throughout the ages to reach over 100,000 in the 2004 dictionary, then plummet to 12,500 in the Modern Chinese Language Dictionary (现代汉语词典, Xiàndài Hànyǔ Cídiǎn) published in 2016?
A clue can be found in the name of the 2016 dictionary, which calls itself “Modern.” It covers both Chinese characters and words, but only those that are still in fairly common use in China today.
Scholars working on highly specialized subjects will sometimes use characters that are not included in the Modern Chinese Language Dictionary. For those specialized topics, a larger dictionary is required.
The first four dictionaries in the table above contain only characters, not characters and words as the Modern Chinese Language Dictionary does. Most of them also include many old characters that are no longer used and can only be found on ancient scrolls or stone carvings from past dynasties.
The larger dictionaries also contain many variants of the same character, but in different character styles or systems that have been used at various times throughout China’s history.
What counts as a Chinese character?
With over 100,000 Chinese characters in existence, linguists and dictionary authors offer varying answers to this question.
Written Chinese is very different from Romance languages like French and Spanish, which are written phonetically with an alphabet made up of a fixed number of letters. Chinese is instead written with thousands of individual characters, each with at least one, and often many, meanings associated with it.
This is the key difference between Chinese characters and letters in Western languages, which carry no meaning on their own, only a phonetic sound.
Some characters, like 你 (nǐ), can exist as a standalone word. Others, like 们 (men), must be paired with an additional character to create a word. Consider the following:
|我||wǒ||This character means “me” or “I” and is a word on its own.|
|们||men||This character carries the meaning of plural but is not a word that means anything on its own. It does not translate to the English word “plural,” and cannot be used as such.|
|我们||wǒmen||These two characters together form the word “we” by combining the meanings of the characters 我 and 们. Thus “I” or “me” becomes plural, meaning “we.”|
|的||de||This character typically indicates possession, like the "'s" in "John's bike."|
|我们的||wǒmen de||We have now combined 我, 们, and 的 to form the single word "our"|
|你的||nǐ de||your, as in "your bike"|
|你们||nǐmen||you [plural]; you all|
|你们的||nǐmen de||your [plural]|
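The combinations in the table can be mirrored in a few lines of Python. This is only an illustrative sketch: the helper `build` and the variable names are invented here, and the point is simply that a written word is a sequence of one or more characters.

```python
# Characters taken from the table above.
wo, ni, men, de = "我", "你", "们", "的"


def build(*characters):
    """Join individual characters into one written word or phrase."""
    return "".join(characters)


we = build(wo, men)               # 我们, "we"
our = build(wo, men, de)          # 我们的, "our"
your_plural = build(ni, men, de)  # 你们的, "your [plural]"

# len() counts characters (Unicode code points), not words.
for phrase in (we, our, your_plural):
    print(phrase, len(phrase))
```

Counting characters and counting words therefore give different totals, which is one reason character dictionaries and word dictionaries report such different numbers.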
So if there are no letters in Chinese, is there an alphabet with which to classify or categorize Chinese characters? Not in the conventional sense. Most languages originating in Europe and the Middle East are phonetic and have an alphabet of letters, but as Chinese does not have letters, it also does not have an alphabet. However, Chinese does have some components which could be considered to serve a similar purpose to that of the alphabet in other languages.
Each modern Chinese character is constructed from a number of strokes of the pen or, traditionally, the brush. Different characters are made up of different numbers of strokes and can be classified by the number of strokes that they contain, but the strokes themselves come from a small pool of clearly defined strokes.
There are several different systems for classifying these strokes but most systems recognize approximately 10 basic strokes and another roughly 25 complex strokes. You could think of strokes as letters and the pool of strokes from which the characters are drawn as the alphabet of strokes. However, unlike letters in the alphabets used in English and the Romance languages, there are no specific sounds tied to each stroke.
Chinese characters can also be classified by the Chinese radical upon which the character is based. A radical is part of a character’s logogram. The radical is simpler and smaller than the whole character but looks like a small character in itself. Some radicals are standalone characters when written on their own, and others are not.
Take the character below as an example. On the left you can see the character 渔 (yú) which means “fishery” or “fishing.”
On the right it is split into its two radicals. The radical on the left means “water” but is not a character on its own. The radical on the right is the noun for “fish” and is a character when drawn on its own. It is also pronounced “yú.” So when you add the radical for water to the character for fish, you get a character that means fishing or fishery.
There are several different systems of radicals, but the most commonly used system is Kangxi radicals, which is employed in many Chinese dictionaries used by native speakers. It contains 214 different radicals and each one is used in the construction of many different characters.
Some characters, like the one in the example above, contain more than one radical. In this case, the main classifying radical for the character often provides a hint about the meaning of the character. The other radicals making up the character sometimes give hints about its pronunciation, as is the case above.
The fact that certain radicals indicate a specific pronunciation means that, contrary to popular belief, Chinese is not entirely a pictographic language. Rather, it also has some phonetic elements, although it is certainly not a phonetic language in the same way as English or the Romance languages are.
For example, note the phonetic similarities between the following Chinese characters, each of which contains the radical 羊 (yáng).
|羊||yáng||1. sheep; goat; ram; ewe 2. (Yáng) a surname|
|样||yàng||1. appearance; shape; look; form 2. sample; model; pattern|
|洋||yáng||1. ocean 2. foreign (esp. Western)|
|佯||yáng||pretend; feign; sham|
|蛘||yáng||(dialect) certain insects (esp. rice weevils)|
|徉||yáng||to walk back and forth|
|烊||yáng||melt; go soft|
Note that the radical 羊 (yáng) appears in each of the above characters. Apart from 羊 itself, it has no bearing on the character’s meaning. It does, though, clearly indicate the character’s pronunciation, even if the tone can differ, as in 样 (yàng).
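The pattern in the table can be checked with a small Python sketch. The readings are copied from the table; the helper `strip_tone` is a hypothetical name introduced here, and it relies on the standard `unicodedata` module to drop tone marks so that only the bare syllable is compared.

```python
import unicodedata

# Readings copied from the table above: character -> pinyin with tone mark.
readings = {
    "羊": "yáng", "样": "yàng", "洋": "yáng", "佯": "yáng",
    "蛘": "yáng", "徉": "yáng", "烊": "yáng",
}


def strip_tone(pinyin):
    """Remove tone marks by decomposing to NFD and dropping combining marks.

    Caveat: this would also strip the dots from ü, which does not matter
    for the syllables used in this example.
    """
    decomposed = unicodedata.normalize("NFD", pinyin)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Bare syllables shared by the characters, ignoring tone.
syllables = {strip_tone(p) for p in readings.values()}
# Distinct readings once tone is taken into account.
toned = set(readings.values())

print(syllables)
print(len(toned))
```

All seven characters share one bare syllable, while the toned readings fall into two groups, so the shared component is a strong hint about the syllable and a weaker hint about the tone.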
Since Chinese characters are made up of individual radicals, whether they are “meaning radicals” or “sound radicals,” these radicals can also be seen as a kind of alphabet which is used to construct the written and spoken language.
For more information on how characters are constructed and classified, take a look at the anatomy of Chinese characters.
You Can Learn Chinese
You should now have a better understanding of how many Chinese characters exist—over 100,000! We also hope you gained insight into how the Chinese writing system originated and how it has developed over time to meet the changing needs of the Chinese people.
If you are interested in learning the language, you may be encouraged to find out that you only need to know about 3,000 characters to be fluent, and far fewer than that to read most of what you come across on a daily basis.
Follow the links in the article above if you want to learn more about characters or start learning them yourself. We welcome you to learn Chinese with us in China.
Vocabulary: How Many Characters are There in Chinese?
|汉语||hànyǔ||Chinese language (describes the spoken form only)|
|中文||zhōngwén||Chinese language (describes the written and spoken form)|
|字||zì||Character (in terms of writing)|
|甲骨文||jiǎgǔwén||Oracle bone script|
|河南||Hénán||Henan (a province in China)|
|商代||Shāngdài||Shang Dynasty (c. 1600–1046 BC)|
|词典||cídiǎn||dictionary (contains both word and character definitions)|
|画||huà||stroke (in a Chinese character)|
|偏旁||piānpáng||radical (in a Chinese character)|
|汉语水平考试||hànyǔ shuǐpíng kǎoshì||HSK Chinese proficiency test|
|声调||shēngdiào||tone (in linguistics)|