python - How do I write regex?

Question

s = u'\ud83d\udc8b'co = re.compile( u'\ud83d\udc8b')co.sub(u'',s)print(u'ud83d') 输出如下UnicodeEncodeError: 'utf-8' codec can't encode character 'ud83d' in position 0: surrogates not allowed s中大...

高洛峰 · Answer

First of all, there are 2 questions
1. Why can’t it be displayed? 2. I want to replace it but why can’t it match?
Answer

1. Special encoding cannot be displayed on the terminal. If it is displayed on the UI, then the UI encoding needs to be set.

2. Try the following code

import re
s = u'hello \ud83d\udc8b world'
co = re.compile( u'\ud83d\udc8b')
ss = co.sub(u'',s)
print(ss)

Run result:

hello world

黄舟 · Answer

I copied them all

python - How do I write regex?

reply all(2)I'll reply