【发布时间】:2021-03-28 18:17:31
【问题描述】:
我想抓取以下产品https://www.decathlon.it/p/disco-ghisa-bodybuilding-28mm/_/R-p-7278?mc=1042303&c=NERO
但是对于产品,我们可以选择不同的重量(从 0.5 到 20 公斤)。我创建了以下代码,但它只给了我第一个重量(0,5kg)而不是另一个。
import requests
import re
import time
urls = ['p/disco-ghisa-bodybuilding-28mm/_/R-p-7278.html']
user_agent = {'User-agent': 'Mozilla/5.0'}
def main(site):
with requests.Session() as req:
for url in urls:
r = req.get(site.format(url), headers=user_agent)
match = re.search('availability.+org\/(.*?)"', r.text)
print("url: {:<70}, status: {}".format(r.url, match.group(1)))
while True:
main("https://www.decathlon.it/{}")
time.sleep(1)
但我想弄清楚以下输出
weight 0,5kg outofstock, 1kg outofstock and so on.
【问题讨论】:
标签: python python-3.x python-requests request python-requests-html