Coded type - Coding, type

Import Japanese games to China—-“Chinese movies do not have Japanese code sets—-“cause garbled characters

If you don’t want to be messy—-“install language packs —-“Can display Japanese normally

United Nations Universal Code—–“Supports all languages in the world

1. unicode, 2-4 bytes, has been included 136690 characters (Chinese has more than 20,000 characters), and it has been continuously expanding. . . . .

2. It can also be converted to various languages around the world, unicode—-》gbk etc. Around 1990

1. In 1980, many softwares were developed based on gb2312 , Switching to unicode is basically tantamount to pushing it all over again

2. Unicode = English, gbk = Chinese, there is no strong demand, all convert to Unicode

Global computer manufacturers all support Unicode , Most software programs support Unicode

Chinese windows computers are all gbk-encoded

Unicode brings a new problem, there is no problem in the memory space, if To be transmitted or stored on the hard disk, space is doubled and wasteful + low efficiency

For unicode transmission + storage, a utf-8 (full name: Unicode Transformation Format, academic name UTF) is made, there are three in total Two versions, utf-8, utf-16, utf-32

UTF-8: Use 1, 2, 3, 4 bytes to represent all characters; use 1 character first, which cannot meet the rule Add 1 byte, up to 4 bytes, 1 byte for English, 2 for European languages, 3 for East Asia, 4 bytes for other and special characters

characters	ACSII	Unicode	UTF -8
A	01000001	00000000 01000001	01000001
中< /strong>	x td>	01001110 00101101	11100100 10111000 10101101

< p> The Share a picture

py 2 = ACSII

py 3 = Unicode

You need to declare what encoding method is used before writing a program

Leave a Comment Cancel reply