Home  >  Q&A  >  body text

Python text matching for specific paragraphs

a='''
[Scene: Central Perk, Chandler, Joey, Phoebe, and Monica are there.]
Monica: There's nothing to tell! He's just some guy I work with!
Joey: C'mon, you're going out with the guy! There's gotta be something wrong with him!
Chandler: All right Joey, be nice.? So does he have a hump? A hump and a hairpiece?
Phoebe: Wait, does he eat chalk?
[Scene: Chandler, Joey,abcsde.]
Phoebe: Just, 'cause, I don't want her to go through what I went through with Carl- oh!
Monica: Okay, everybody relax. This is not even a date. It's just two people going out to dinner and- not having sex.
Chandler: Sounds like a date to me.
[Scene: Joey.]
'''

I have a text a, as above,
I want to get the dialogue text of each scene and save it as lsit. The difference between each scene is [Scene: Add an English sentence.], as shown in bold# above ##Then use regular expressions to write,
paragraphs = re.findall('[Scene: w .](.*?)[Scene: w .]',a,re.S)

I found that there was no matching content, and paragraphs was empty.

What is the reason for the error? How to match the dialogue content of each scene?
Thanks.

PHP中文网PHP中文网2687 days ago679

reply all(1)I'll reply

  • 滿天的星座

    滿天的星座2017-05-18 10:59:26

    There are several errors
    No native string is used
    No escaping[

    The following is my modified code.

    paragraphs = re.findall(r"\[Scene: [\w\s,]+\.]\s([^[]+)\s(?=\[Scene: [\w\s,]+\.])", a, re.S)
    

    Python regular expression guide
    http://www.cnblogs.com/huxi/a...

    reply
    0
  • Cancelreply