首頁 >後端開發 >Python教學 >抓取亞馬遜搜尋結果時如何避免 StaleElementException？

抓取亞馬遜搜尋結果時如何避免 StaleElementException？

Barbara Streisand原創: 2024-11-22 00:56:141091瀏覽

How to Avoid StaleElementException When Scraping Amazon Search Results?

Selenium 迭代中的StaleElementException

當嘗試使用Selenium 迭代Amazon 上的搜尋結果時，使用者在重複向下滾動到以下位置時可能會遇到StaleElementException載入新頁面。出現此錯誤的原因是用於滾動的元素 Bottom_bar 在頁面重新載入後變得無效。

要解決此問題並實現更可靠的分頁，建議採用更簡單的方法來消除明確頁面滾動。相反，Selenium 可以連續點擊「下一步」按鈕，直到它被停用。這簡化了程式碼並確保驅動程式能夠一致地瀏覽結果。

下面更新的程式碼實作了此方法：

from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.support.ui import WebDriverWait as wait
from selenium.common.exceptions import TimeoutException

driver = webdriver.Chrome()

driver.get('https://www.amazon.com/s/ref=nb_sb_noss_1?url=search-alias%3Daps&amp;field-keywords=sonicare+toothbrush')

while True:
    try:
        wait(driver, 10).until(EC.element_to_be_clickable((By.CSS_SELECTOR, 'a > span#pagnNextString'))).click()
    except TimeoutException:
        break

請注意，implicitly_wait(10) 不會等待整整10 秒，而是「等待10 秒讓元素出現在HTML DOM中」。因此，如果在較短的時間內（例如1或2秒）找到該元素，則等待過程完成。

以上是抓取亞馬遜搜尋結果時如何避免 StaleElementException？的詳細內容。更多資訊請關注PHP中文網其他相關文章！

html if for Error using dom this issue

陳述：

本文內容由網友自願投稿，版權歸原作者所有。本站不承擔相應的法律責任。如發現涉嫌抄襲或侵權的內容，請聯絡admin@php.cn

上一篇：Pandas 如何融化和字典操作將寬資料框重塑為整齊的格式？下一篇：Pandas 如何融化和字典操作將寬資料框重塑為整齊的格式？

看更多