【发布时间】:2020-03-19 22:12:26
【问题描述】:
我想从下面的 CME 网站中提取一个包含该代码相关价格的代码选择列表。我能够得到一个符号列表,但我无法弄清楚如何拉动每一行的价格。
在浏览器上使用“检查”时遇到问题,要查询的标签不是“跨度”。让我解决这个问题的想法?
代码:
import urllib
from requests import get
from requests.exceptions import RequestException
from contextlib import closing
from bs4 import BeautifulSoup
def simple_get(url):
"""
Attempts to get the content at `url` by making an HTTP GET request.
If the content-type of response is some kind of HTML/XML, return the
text content, otherwise return None.
"""
try:
with closing(get(url, stream=True)) as resp:
if is_good_response(resp):
return resp.content
else:
return None
except RequestException as e:
log_error('Error during requests to {0} : {1}'.format(url, str(e)))
return None
def is_good_response(resp):
"""
Returns True if the response seems to be HTML, False otherwise.
"""
content_type = resp.headers['Content-Type'].lower()
return (resp.status_code == 200
and content_type is not None
and content_type.find('html') > -1)
def log_error(e):
print(e)
raw_html = simple_get('https://www.cmegroup.com/trading/price-limits.html#equityIndex')
html = BeautifulSoup(raw_html, 'html.parser', store_line_numbers=True)
seq = ['ESM0', 'NQM0', 'RTYM0', 'YMM0']
for quote in html.find_all('span'):
symbolcme = quote.get_text(strip=True)
#print("Check Symbol: ", symbolcme)
for text in seq:
if text in symbolcme:
print(quote.sourceline, ' Symbol:', symbolcme)
结果:
2014 Symbol: E-mini S&P 500 Futures (ESM0)
2047 Symbol: E-mini Nasdaq-100 Futures (NQM0)
2065 Symbol: E-mini Dow ($5) Futures (YMM0)
2392 Symbol: E-mini Russell 2000 Index Futures (RTYM0)
2500 Symbol: Micro E-mini Dow Jones Industrial Average Index Futures (MYMM0)
2515 Symbol: Micro E-mini Nasdaq-100 Index Futures (MNQM0)
2551 Symbol: Micro E-mini S&P 500 Index Futures (MESM0)
【问题讨论】:
-
您有具体的技术问题吗? Stack Overflow 不能替代指南、教程或文档。
标签: python beautifulsoup