Python 2.7 encoding problem

Question

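After comparing how Python 2.7 and Python 3 handle encoding, I ran into a problem, as follows. In Python 3 {code...} According to Python 3, B is of type bytes, which is used to represent a binary byte string. Then, after decoding it with "latin-1", I got the actual...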

PHP中文网 · Answer

Be careful: it is \xe8, not \ce8.

When Python's interactive prompt is given a bare expression, it echoes the repr() of the value, much like PHP's var_dump. For a Python 2 unicode string, that is a u'...' literal in which every non-ASCII character is escaped. Only when you actually call print does the interpreter encode the string for the terminal, using the encoding implied by the system locale, and write the real characters to the screen.

The following session, run on a Linux system, should help answer your question.

pi@linux-0o8x:~> locale | grep LANG
LANG=zh_CN.utf8
pi@linux-0o8x:~> python
Python 2.7.5 (default, May 30 2013, 16:55:57) [GCC] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> u'中国人 说汉语 用中文 abc123'
u'\u4e2d\u56fd\u4eba \u8bf4\u6c49\u8bed \u7528\u4e2d\u6587 abc123'
>>> B = b'\xc4\xe8'
>>> B.decode('latin-1') 
u'\xc4\xe8'
>>> print(B.decode('latin-1'))
Äè
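
The difference between the echoed value and the printed value can also be reproduced in a script. The sketch below is only an illustration, not part of the original session: the variable names are made up, and it is written so that it should run on both Python 2.7 and Python 3. It avoids printing non-ASCII text directly, so it does not depend on the terminal's locale.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function

B = b'\xc4\xe8'                # two raw bytes
text = B.decode('latin-1')     # latin-1 maps each byte to the code point with the same value

# The escaped, ASCII-only form, matching what the Python 2 prompt shows inside u'...'.
escaped = text.encode('unicode_escape').decode('ascii')

# What print would send to a UTF-8 terminal: the text encoded as UTF-8.
utf8_bytes = text.encode('utf-8')

print(len(text))                        # 2 characters, not 2 bytes
print(escaped)                          # \xc4\xe8
print([hex(ord(ch)) for ch in text])    # ['0xc4', '0xe8']
print(utf8_bytes == b'\xc3\x84\xc3\xa8')  # True
```

In other words, u'\xc4\xe8' and Äè are the same two-character string; only the way it is displayed differs.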

In addition, a good habit in real code is to avoid passing byte strings around and decoding them ad hoc. Convert everything to Unicode as early as possible and work with Unicode throughout. It is simpler and saves trouble.
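
One way to follow that habit is to decode bytes once when they enter the program, work only with Unicode text in between, and encode once on the way out. The sketch below assumes a small latin-1 encoded input file; the file name and contents are hypothetical, and io.open is used because it accepts an encoding argument on both Python 2.7 and Python 3.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function
import io
import os
import tempfile

# Hypothetical input: a file holding latin-1 encoded bytes.
path = os.path.join(tempfile.mkdtemp(), 'sample.txt')
with open(path, 'wb') as f:
    f.write(b'\xc4\xe8 abc123')

# Decode at the input boundary: io.open returns Unicode text.
with io.open(path, 'r', encoding='latin-1') as f:
    text = f.read()

# All processing happens on Unicode text, never on raw bytes.
upper = text.upper()

# Encode at the output boundary, here as UTF-8.
with io.open(path, 'w', encoding='utf-8') as f:
    f.write(upper)

# Inspect the bytes that were actually written.
with open(path, 'rb') as f:
    raw = f.read()

print(raw == b'\xc3\x84\xc3\x88 ABC123')  # True
```

Because the text is Unicode in the middle step, upper() handles è correctly instead of operating on an opaque byte.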

In Python 3

Why is this?

How can I get the Western European characters shown above in Python 2.7?

over
