【发布时间】:2020-09-18 11:41:21
【问题描述】:
尝试从以下维基百科页面获取 Bowler 和 Team 列: https://en.wikipedia.org/wiki/List_of_bowlers_who_have_taken_300_or_more_wickets_in_Test_cricket
能够使用我的代码获得投球手专栏,但团队专栏证明很困难。可能是那里的板球迷之一,但欢迎任何帮助!
这是我的代码:
import requests
from bs4 import BeautifulSoup
wiki = "https://en.wikipedia.org/wiki/List_of_bowlers_who_have_taken_300_or_more_wickets_in_Test_cricket"
website_url = requests.get(wiki).text
soup = BeautifulSoup(website_url, "lxml")
my_table = soup.find("table", {"class":"wikitable sortable plainrowheaders"})
bowler = []
team = []
for row in my_table.find_all("tr")[1:]:
bowler_cell = row.find_all("a")[0]
bowler.append(bowler_cell.text)
print(bowler)
for row in my_table.find_all("td"):
team_cell = row.find_all("a")[0]
team.append(team_cell.text)
print(team)
【问题讨论】:
标签: python web-scraping beautifulsoup