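Web crawler - Python crawler uses BeautifulSoup to scrape <s> elements and write them to a dictionary, but some divs have no such element, so the next entry gets written in its place. How can this be solved?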

Question

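I'm a beginner writing a crawler for a used-car website that scrapes the selling price and the original price. The original price is shown as strikethrough text in an <s> tag under <p class="priType-s">. But when a listing has no original price, i.e. there is no <s> tag at all, the content of the next <s> is automatically written into the previous entry's slot...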

大家讲道理 · Answer

The general idea is to make the selectors more specific, so that a listing without the element yields an empty result, and then check for that case yourself.

大家讲道理 · Answer

prices0 = soup.select('p.list > ul > li > p > p.priType-s > span > i')
prices1 = soup.select('p.list > ul > li > p > p.priType-s > span + s')

Give it a try.
If it still doesn't work, grab the whole block of text for each listing and extract the prices with a regex.
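The regex fallback could look roughly like the sketch below. It assumes you already have the raw HTML of each listing as a separate string. The sample markup and the names `blocks`, `price_re`, `original_re`, and `extract_prices` are hypothetical, not taken from the real site. The point is that each pattern is searched within one listing only, so a missing <s> gives None instead of picking up the next listing's value.

```python
import re

# Hypothetical raw HTML for two listings; the second has no <s> tag.
blocks = [
    '<p class="priType-s"><span><i>5.80</i></span><s>9.20</s></p>',
    '<p class="priType-s"><span><i>3.20</i></span></p>',
]

# Assumed patterns: selling price inside <i>, original price inside <s>.
price_re = re.compile(r"<i>(.*?)</i>")
original_re = re.compile(r"<s>(.*?)</s>")


def extract_prices(block_html):
    """Search a single listing's HTML; a missing tag becomes None."""
    price = price_re.search(block_html)
    original = original_re.search(block_html)
    return {
        "price": price.group(1) if price else None,
        "original_price": original.group(1) if original else None,
    }


results = [extract_prices(b) for b in blocks]
print(results)
```

Regexes like these are fragile against attribute or nesting changes in the markup, so this is probably best kept as a last resort.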

天蓬老师 · Answer

Try this approach:
1. Each used car is displayed in its own block,

..

and the like.
2. Within each block, extract the original price and the current price, so that a car with no original price does not end up with the next car's price recorded as its original price.
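The two steps above can be sketched as follows. This is a minimal example under stated assumptions: the sample HTML, the `div.list li` block selector, and the assumption that the selling price sits in an <i> tag are all hypothetical and need to be adapted to the real page. Only <p class="priType-s"> and the <s> tag come from the question.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: the second car has no <s>, i.e. no original price.
html = """
<div class="list"><ul>
  <li><p class="priType-s"><span><i>5.80</i></span><s>9.20</s></p></li>
  <li><p class="priType-s"><span><i>3.20</i></span></p></li>
  <li><p class="priType-s"><span><i>7.50</i></span><s>12.00</s></p></li>
</ul></div>
"""

soup = BeautifulSoup(html, "html.parser")

cars = []
# Step 1: iterate over one block per car (assumed here to be each <li>).
for block in soup.select("div.list li"):
    # Step 2: look up both prices inside this block only.
    price_tag = block.select_one("p.priType-s i")
    original_tag = block.select_one("p.priType-s s")
    cars.append({
        "price": price_tag.get_text(strip=True) if price_tag else None,
        # select_one returns None when the block has no <s> tag.
        "original_price": original_tag.get_text(strip=True) if original_tag else None,
    })

print(cars)
```

Because each lookup is scoped to a single block, the car without an <s> tag gets `None` as its original price, and the later entries stay aligned with their own cars.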
