
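python - Delete everything inside <> in a txt file, including the <> themselves

How do I delete everything inside <> in a txt file, including the <> themselves?
I'm stuck on the string handling.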

ringa_lee  2785 days ago  768

All replies (1)

  • 大家讲道理  2017-04-18 10:29:36

    Probably like this:

    Use re.sub to replace every <...> with an empty string:

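    # python3 code
    
    import re
    
    # '<', then any run of characters other than '>', then '>'
    pattern = re.compile(r'<[^>]*>')
    
    # for example
    s = '123<><here>#$%@#$%^<123>::<a class="haha" href="http://www.hello.com">haha'
    print(pattern.sub('', s))
    
    # read txt and remove all <...> line by line
    # (a <...> that starts on one line and ends on another is not matched here)
    with open('your.txt', 'r') as reader:
        for line in reader:
            # drop only the trailing newline so the line's own whitespace is kept
            line = line.rstrip('\n')
            print(pattern.sub('', line))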

    Result for the example string:

    123#$%@#$%^::haha
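If the cleaned text should go to a new file rather than be printed, or if a <...> may span more than one line, something along these lines might work. It is only a sketch: the names strip_brackets and clean_file, the output name your_clean.txt, and the UTF-8 encoding are assumptions made for illustration, not part of the question. It reads the whole file at once, which relies on [^>]* also matching newline characters.

```python
import re
import tempfile
from pathlib import Path

# '<', then any characters except '>' (newlines included), then '>'
TAG = re.compile(r'<[^>]*>')


def strip_brackets(text):
    """Return text with every <...> span removed, brackets included."""
    return TAG.sub('', text)


def clean_file(src, dst, encoding='utf-8'):
    """Read src in one go, strip every <...>, and write the result to dst.

    The encoding default is an assumption; change it to match the real file.
    """
    text = Path(src).read_text(encoding=encoding)
    Path(dst).write_text(strip_brackets(text), encoding=encoding)


# Demo on a throwaway file so the sketch runs without touching real data
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / 'your.txt'
    dst = Path(tmp) / 'your_clean.txt'
    src.write_text('keep <drop\nme> this\n<b>bold</b> text\n', encoding='utf-8')
    clean_file(src, dst)
    print(repr(dst.read_text(encoding='utf-8')))
```

For a real file the call would be clean_file('your.txt', 'your_clean.txt'). Note that the pattern treats any < ... > pair as something to delete, so text such as "a < b and c > d" would lose the middle part; a lone < with no later > is left untouched.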

    Questions I answered: Python-QA
