【Posted at】: 2025-08-15 21:20:02
【Problem description】:
I have a script that scrapes a quirky website.
Sometimes while scraping I get ValueError("No tables found"), yet when I refresh the page manually in the browser, it loads fine.
How can I reproduce that refresh-and-retry behaviour in code?
My code is as follows:
import pandas as pd
from selenium import webdriver
from bs4 import BeautifulSoup as bs

browser = webdriver.Chrome()


class GameData:
    def __init__(self):
        self.date = []
        self.time = []
        self.game = []
        self.score = []
        self.home_odds = []
        self.draw_odds = []
        self.away_odds = []
        self.country = []
        self.league = []


def parse_data(url):
    browser.get(url)
    df = pd.read_html(browser.page_source, header=0)[0]
    html = browser.page_source
    soup = bs(html, "lxml")
    cont = soup.find('div', {'id': 'wrap'})
    content = cont.find('div', {'id': 'col-content'})
    # Both attributes belong in a single attrs dict; passing two separate
    # dicts to find() silently ignores the second one.
    content = content.find('table', {'class': 'table-main', 'id': 'tournamentTable'})
    main = content.find('th', {'class': 'first2 tl'})
    if main is None:
        return None
    count = main.findAll('a')
    country = count[1].text
    league = count[2].text
    game_data = GameData()
    game_date = None
    for row in df.itertuples():
        if not isinstance(row[1], str):
            continue
        elif ':' not in row[1]:
            # Rows without a time (no ':') carry the date header instead.
            game_date = row[1].split('-')[0]
            continue
        game_data.date.append(game_date)
        game_data.time.append(row[1])
        game_data.game.append(row[2])
        game_data.score.append(row[3])
        game_data.home_odds.append(row[4])
        game_data.draw_odds.append(row[5])
        game_data.away_odds.append(row[6])
        game_data.country.append(country)
        game_data.league.append(league)
    return game_data


urls = {
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/2/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/3/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/4/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/5/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/6/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/7/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/8/",
    "https://www.oddsportal.com/soccer/africa/africa-cup-of-nations/results/#/page/9/",
}

if __name__ == '__main__':
    results = None
    for url in urls:
        try:
            game_data = parse_data(url)
            if game_data is None:
                continue
            result = pd.DataFrame(game_data.__dict__)
            if results is None:
                results = result
            else:
                results = results.append(result, ignore_index=True)
        except ValueError:
            # Naive retry: call parse_data once more on "No tables found".
            game_data = parse_data(url)
            if game_data is None:
                continue
            result = pd.DataFrame(game_data.__dict__)
            if results is None:
                results = result
            else:
                results = results.append(result, ignore_index=True)
        except AttributeError:
            # Same retry when the soup lookups come back as None.
            game_data = parse_data(url)
            if game_data is None:
                continue
            result = pd.DataFrame(game_data.__dict__)
            if results is None:
                results = result
            else:
                results = results.append(result, ignore_index=True)
Sometimes I get this error:
Traceback (most recent call last):
  File "C:/Users/harsh/AppData/Roaming/JetBrains/PyCharmCE2021.1/scratches/scratch_29.py", line 10098, in <module>
    game_data = parse_data(url)
  File "C:/Users/harsh/AppData/Roaming/JetBrains/PyCharmCE2021.1/scratches/scratch_29.py", line 37, in parse_data
    df = pd.read_html(browser.page_source, header=0)[0]
  File "C:\Python\lib\site-packages\pandas\util\_decorators.py", line 299, in wrapper
    return func(*args, **kwargs)
  File "C:\Python\lib\site-packages\pandas\io\html.py", line 1100, in read_html
    displayed_only=displayed_only,
  File "C:\Python\lib\site-packages\pandas\io\html.py", line 913, in _parse
    raise retained
  File "C:\Python\lib\site-packages\pandas\io\html.py", line 893, in _parse
    tables = p.parse_tables()
  File "C:\Python\lib\site-packages\pandas\io\html.py", line 213, in parse_tables
    tables = self._parse_tables(self._build_doc(), self.match, self.attrs)
  File "C:\Python\lib\site-packages\pandas\io\html.py", line 543, in _parse_tables
    raise ValueError("No tables found")
ValueError: No tables found
My best guess is that I have not wired the handling of ValueError: No tables found into the code correctly.
How should I handle this?
【Discussion】:
- The error describes itself: pandas cannot read a table from that HTML. Within your try/except you could save the page source to an .html file and then inspect it. I would also strongly recommend a selenium wait; since you are already using selenium, and the data is rendered dynamically once the page loads, you have to wait until the object you need actually shows up in the page source.
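A minimal sketch of that explicit-wait idea (the helper name get_rendered_source and the 10-second timeout are illustrative choices; the tournamentTable id comes from the question's own soup lookup):

import pandas as pd
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

browser = webdriver.Chrome()

def get_rendered_source(url, timeout=10):
    # Navigate, then block until the dynamically rendered table is
    # present in the DOM; raises TimeoutException after `timeout` seconds.
    browser.get(url)
    WebDriverWait(browser, timeout).until(
        EC.presence_of_element_located((By.ID, "tournamentTable"))
    )
    return browser.page_source

df = pd.read_html(get_rendered_source(urls_page_1), header=0)[0]  # urls_page_1: any results URL

This way pd.read_html only ever parses a source that is known to contain the table.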
- @αԋɱҽԃαмєяιcαη That's interesting! How can I build that into the code here?
- My guess is to wait 2 seconds when I hit this error and then retry the URL. How do I code that?
- I already pointed you to the selenium-wait approach. But as a quick-and-dirty alternative, you can try time.sleep(5) right after browser.get(url).
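That quick-and-dirty version would look roughly like this (fetch_first_table is a hypothetical helper; the 5-second pause is the commenter's figure, not a tuned value):

import time
import pandas as pd
from selenium import webdriver

browser = webdriver.Chrome()

def fetch_first_table(url):
    # Load the page, then pause unconditionally so the JavaScript-rendered
    # table has time to appear before pandas parses the source.
    browser.get(url)
    time.sleep(5)
    return pd.read_html(browser.page_source, header=0)[0]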
- Isn't that inefficient, though? I am parsing 75K URLs, and waiting 2 seconds or time.sleep(5) on every page would take forever, haha. Would it help to do this only when I actually hit the error?
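A sketch of that retry-only-on-failure idea, built around the question's parse_data (the wrapper name parse_data_with_retry, the retry count, and the 2-second delay are illustrative choices):

import time

def parse_data_with_retry(url, retries=3, delay=2):
    # Call the question's parse_data; if pandas raises "No tables found"
    # because the page has not finished rendering, wait briefly and retry.
    for attempt in range(retries):
        try:
            return parse_data(url)
        except ValueError:
            if attempt == retries - 1:
                raise  # give up after the final attempt
            time.sleep(delay)

In the __main__ loop, game_data = parse_data_with_retry(url) would then replace the duplicated except blocks, so the sleep only happens on pages that actually fail.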
Tags: python selenium web-scraping beautifulsoup retry-logic