Python library to extract HTML strings using CSS font-family?

Question

Is there a Python library that uses the CSS font-family property to extract strings from HTML? I need this for font subsetting.

我想大声告诉你 · Answer

Your question is a bit vague. If you want to use CSS selectors to pull content out of HTML, you can use lxml.cssselect. There is Chinese-language documentation for it, and that documentation covers more than just lxml.

巴扎黑 · Answer

font-family only specifies which font to use.

Is this what you want to do: work out which Chinese characters appear in an HTML article, then dynamically or semi-statically generate a smaller Chinese font containing only those characters, to be downloaded and used remotely?

If you just need to collect the Chinese characters, Python's built-in set is actually the simplest way to do it.
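A minimal sketch of the set idea, using only the standard library. The function name unique_cjk_chars is hypothetical, and the range U+4E00 to U+9FFF is an assumption that covers only the basic CJK Unified Ideographs block; extension blocks and Chinese punctuation would need extra ranges.

```python
import re

# Assumption: "Chinese character" means the basic CJK Unified Ideographs block.
CJK_CHAR = re.compile(r"[\u4e00-\u9fff]")


def unique_cjk_chars(text):
    """Return the set of distinct CJK characters that occur in text."""
    return set(CJK_CHAR.findall(text))


if __name__ == "__main__":
    chars = unique_cjk_chars("你好, 世界! 你好 Python")
    # Sorting gives a stable string that could be handed to a subsetting tool.
    print("".join(sorted(chars)))
    print(len(chars))
```

Because a set drops duplicates, the result is exactly the list of glyphs the subset font would need, however often each character repeats in the article.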

Generating the corresponding font file, however, is a big pitfall. Founder currently offers a similar service, which I believe is called Yunziku. I asked about the price once, and they frankly admitted that there are many problems.
