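【Posted】: 2018-03-15 21:35:43
【Question】:
Hi, I'm new to Python and I'm trying to figure out why my list overwrites its previous elements every time a new page is loaded and scraped inside the while loop. Thanks in advance.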
import requests
from bs4 import BeautifulSoup

def scrapeurls():
    domain = "https://domain234dd.com"
    count = 0
    while count < 10:
        page = requests.get("{}{}".format(domain, count))
        soup = BeautifulSoup(page.content, 'html.parser')
        data = soup.findAll('div', attrs={'class': 'video'})
        urls = []
        for div in data:
            links = div.findAll('a')
            for a in links:
                urls.append(a['href'])
                print(a['href'])
        print(count)
        count += 1
【Discussion】:
- Move the urls list out of the while loop.
- Thanks, serious rookie mistake.
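For anyone landing here later, the suggestion above can be sketched roughly as follows. The only change that matters is that the list is created once, before the loop, instead of on every pass. To keep the sketch runnable offline, two things are assumptions and not part of the original post: the page download is passed in as a hypothetical fetch_page callable, and BeautifulSoup is swapped for the standard library's HTMLParser (the VideoLinkParser class is a made-up stand-in for the findAll lookups).

```python
from html.parser import HTMLParser


class VideoLinkParser(HTMLParser):
    """Hypothetical stand-in for the BeautifulSoup lookup:
    collects hrefs of <a> tags nested inside <div class="video">."""

    def __init__(self):
        super().__init__()
        self.depth = 0   # nesting depth of <div>s, counted from a "video" div
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div":
            if self.depth or "video" in (attrs.get("class") or "").split():
                self.depth += 1
        elif tag == "a" and self.depth and attrs.get("href"):
            self.links.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag == "div" and self.depth:
            self.depth -= 1


def scrape_urls(fetch_page, pages=10):
    urls = []  # created once, before the loop, so every page adds to it
    for count in range(pages):
        parser = VideoLinkParser()
        parser.feed(fetch_page(count))  # fetch_page returns the page's HTML text
        urls.extend(parser.links)
    return urls


# Made-up offline pages standing in for the real site
fake_pages = {
    0: '<div class="video"><a href="/v/1">one</a></div>',
    1: '<div class="video"><a href="/v/2">two</a></div><a href="/about">x</a>',
}
all_urls = scrape_urls(lambda n: fake_pages[n], pages=2)
print(all_urls)
```

Against the real site, fetch_page could be something like lambda n: requests.get("{}{}".format(domain, n)).text. Returning the list also lets the caller use the collected links after the loop finishes, which the original function did not do.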
Tags: python list while-loop scrape