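【Posted】: 2018-12-11 08:14:29
【Question】:
Hi everyone, I've been trying to scrape some web pages whose values change continuously, but so far I haven't been able to get the prices. Can anyone help? Here is what I have so far: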
from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.firefox.options import Options
from selenium.webdriver.firefox.service import Service
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.support.ui import WebDriverWait

my_url = 'https://www.cryptocompare.com/'

# Run Firefox headless (Selenium 4 style; set_headless() and firefox_options= are deprecated).
options = Options()
options.add_argument('-headless')
options.binary_location = 'C:/Program Files/Mozilla Firefox/firefox.exe'
service = Service(executable_path='C:/Users/Genti/AppData/Local/Programs/Python/Python36-32/Lib/site-packages/selenium/geckodriver.exe')

# Create a single browser; the earlier version also started a second, non-headless one.
driver = webdriver.Firefox(options=options, service=service)
try:
    driver.get(my_url)
    # The prices appear to be filled in by JavaScript, so wait until at least one cell exists.
    # The "td.price" selector is my guess at the page markup and may need adjusting.
    WebDriverWait(driver, 20).until(EC.presence_of_element_located((By.CSS_SELECTOR, 'td.price')))
    sel_soup = BeautifulSoup(driver.page_source, 'html.parser')
    prices = [td.get_text(strip=True) for td in sel_soup.find_all('td', class_='price')]
    print(prices)
finally:
    driver.quit()  # always close the browser, even if the wait times out
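【Comments】:
-
Do you want all 10 prices?
-
Yes, I want all of the prices.
-
Do you want the latest values in real time, or just a one-off extraction of the 10 prices?
-
I want to extract all the latest prices, but only the price and the currency name, nothing more, ideally laid out as a table.
-
Ideally, I would get the currency name, the price, and the date.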
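-
For the "currency name, price, and date" table described above, here is a minimal sketch of the post-processing step only. It assumes you have already pulled (name, price text) pairs out of the page by whatever selector works for you; the helper names `parse_price`, `build_records`, and `to_csv` are hypothetical, the sample values are made up, and the price format (a currency symbol, comma thousands separators, a dot as the decimal point) is an assumption about how the site displays numbers.

```python
import csv
import io
from datetime import datetime, timezone
from decimal import Decimal


def parse_price(text):
    """Turn a displayed price such as '$ 3,412.55' into a Decimal."""
    # Keep only digits and the dot; assumes '.' is the decimal separator.
    cleaned = "".join(ch for ch in text if ch.isdigit() or ch == ".")
    return Decimal(cleaned)


def build_records(pairs, scraped_at=None):
    """Attach one shared timestamp to every (name, price_text) pair."""
    scraped_at = scraped_at or datetime.now(timezone.utc)
    stamp = scraped_at.isoformat(timespec="seconds")
    return [
        {"name": name.strip(), "price": parse_price(price_text), "scraped_at": stamp}
        for name, price_text in pairs
    ]


def to_csv(records):
    """Render the records as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["name", "price", "scraped_at"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Made-up sample values for illustration, not real market data.
sample_pairs = [("Bitcoin", "$ 3,412.55"), ("Ethereum ", "$ 88.90")]
records = build_records(
    sample_pairs,
    scraped_at=datetime(2018, 12, 11, 8, 14, 29, tzinfo=timezone.utc),
)
csv_text = to_csv(records)
print(csv_text)
```

Using `Decimal` rather than `float` keeps the price exactly as displayed, and stamping every row with the same scrape time makes it easy to append later snapshots to the same file. If the site shows prices in another locale format, `parse_price` would need to change.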
Tags: python selenium web-crawler