Import Japanese games to China—-“Chinese movies do not have Japanese code sets—-“cause garbled characters
If you don’t want to be messy—-“install language packs —-“Can display Japanese normally
United Nations Universal Code—–“Supports all languages in the world
1. unicode, 2-4 bytes, has been included 136690 characters (Chinese has more than 20,000 characters), and it has been continuously expanding. . . . .
2. It can also be converted to various languages around the world, unicode—-》gbk etc. Around 1990
1. In 1980, many softwares were developed based on gb2312 , Switching to unicode is basically tantamount to pushing it all over again
2. Unicode = English, gbk = Chinese, there is no strong demand, all convert to Unicode
Global computer manufacturers all support Unicode , Most software programs support Unicode
Chinese windows computers are all gbk-encoded
Unicode brings a new problem, there is no problem in the memory space, if To be transmitted or stored on the hard disk, space is doubled and wasteful + low efficiency
For unicode transmission + storage, a utf-8 (full name: Unicode Transformation Format, academic name UTF) is made, there are three in total Two versions, utf-8, utf-16, utf-32
UTF-8: Use 1, 2, 3, 4 bytes to represent all characters; use 1 character first, which cannot meet the rule Add 1 byte, up to 4 bytes, 1 byte for English, 2 for European languages, 3 for East Asia, 4 bytes for other and special characters
characters | ACSII | Unicode | UTF -8 |
A | 01000001 | 00000000 01000001 | 01000001 |
中< /strong> | x td> | 01001110 00101101 | 11100100 10111000 10101101 |
< p> The
py 2 = ACSII
py 3 = Unicode
You need to declare what encoding method is used before writing a program