【发布时间】:2021-11-16 08:38:34
【问题描述】:
我想从链接列表中下载。
test_list = ['https://dibbs2.bsm.dla.mil/Downloads/RFQ/8/SPE1C122Q0058.PDF', 'https://dibbs2.bsm.dla.mil/Downloads/RFQ/8/SPE2DH22Q0028.PDF',
'https://dibbs2.bsm.dla.mil/Downloads/RFQ/9/SPE2DH22Q0029.PDF', 'https://dibbs2.bsm.dla.mil/Downloads/RFQ/3/SPE2DS22Q0023.PDF',
'https://dibbs2.bsm.dla.mil/Downloads/RFQ/1/SPE2DS22Q0031.PDF', 'https://dibbs2.bsm.dla.mil/Downloads/RFQ/3/SPE2DS22Q0033.PDF']
但是这个脚本也在下载单个文件的多个副本。如何避免这种情况?我只想下载列表中的六个pdf文件。
options = webdriver.ChromeOptions()
# options.add_argument('--no-sandbox')
# # options.add_argument('--disable-dev-shm-usage')
# options.headless = True
# prefs = {"download.default_directory": zip_dir,
# "download.directory_upgrade": True,
# "download.manager.showWhenStarting": False,
# "download.manager.useWindow": False,
# "helperApps.alwaysAsk.force":False,
# "download.manager.showAlertOnComplete": False}
# options.add_experimental_option("prefs", prefs)
driver = webdriver.Chrome(ChromeDriverManager().install(),options=options)
dn = len(test_list)
for t in range(0,dn):
URL = test_list[t]
sleep(randint(3, 9))
driver.get(URL)
try:
driver.find_element_by_id("butAgree").click()
except:
pass
也尝试过:
for t in test_list:
URL = t
sleep(randint(3, 9))
driver.get(URL)
try:
driver.find_element_by_id("butAgree").click()
except:
pass
【问题讨论】:
标签: python python-3.x selenium selenium-webdriver selenium-chromedriver