Home  >  Article  >  Backend Development  >  python remove html tags

python remove html tags

高洛峰
高洛峰Original
2016-10-18 10:20:181501browse

Python removes html tags, written by myself, please correct me if there are any deficiencies:

#! /usr/bin/env python
#coding=utf-8
# blueel 2013-01-19
from HTMLParser import HTMLParser
  
class MLStripper(HTMLParser):
    def __init__(self):
        self.reset()
        self.fed = []
    def handle_data(self, d):
        self.fed.append(d)
    def get_data(self):
        return ''.join(self.fed)
  
def strip_tags(html):
    s = MLStripper()
    s.feed(html)
    return s.get_data()

Call:

html = 'ou X de sem juros'


print strip_tags(html)


Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn