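【Posted】: 2020-08-09 16:26:07
【Problem description】:
I am learning web scraping with BeautifulSoup. The goal is to extract figures from a financial website for my own personal evaluation. This is what I have done so far: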
import bs4
import requests
r = requests.get('https://www.finnomena.com/stock/CPALL')
r.text  # raw HTML returned by the server
html_page = bs4.BeautifulSoup(r.text, 'html.parser')
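I then tried to use find/find_all to extract the number at the end of each row (3.69, 3.60, 0.31, etc.), but I don't know how to reference this data, because I have never seen elements in this format before: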
<div data-v-30581cd9="" class="data-wrapper sub-topic first-sub-topic">
<div data-v-30581cd9="" class="data-each">3.69</div>
<div data-v-30581cd9="" class="data-each">3.60</div>
<div data-v-30581cd9="" class="data-each">0.31</div>
<div data-v-30581cd9="" class="data-each">10.26</div>
<div data-v-30581cd9="" class="data-each">1.58</div>
<div data-v-30581cd9="" class="data-each">4.73</div>
<div data-v-30581cd9="" class="data-each">2.64</div>
<div data-v-30581cd9="" class="data-each">-3.31</div>
<div data-v-30581cd9="" class="data-each">10.49</div>
<div data-v-30581cd9="" class="data-each">6.83</div>
<div data-v-30581cd9="" class="data-each">7.38</div>
.
.
.
<div data-v-30581cd9="" class="data-each">4.88</div>
<div data-v-30581cd9="" class="data-each">-1.40</div>
<div data-v-30581cd9="" class="data-each"></div>
</div>
I have looked through older threads and done some research, but could not find what I need. How can I extract these values?
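For reference, here is a minimal sketch of the kind of lookup being asked about. It runs against a shortened copy of the markup above, not the live page. The helper name `extract_values` is hypothetical, and the sketch assumes the `data-each` divs are actually present in the HTML that `requests` receives. The `data-v-...` attributes suggest the page may be rendered client-side by JavaScript, in which case `r.text` might not contain these elements at all.

```python
import bs4

# Shortened copy of the markup from the question (assumption: the real
# response contains the same structure; it may not if the page is
# rendered by JavaScript in the browser).
html = """
<div data-v-30581cd9="" class="data-wrapper sub-topic first-sub-topic">
<div data-v-30581cd9="" class="data-each">3.69</div>
<div data-v-30581cd9="" class="data-each">3.60</div>
<div data-v-30581cd9="" class="data-each">0.31</div>
<div data-v-30581cd9="" class="data-each">-1.40</div>
<div data-v-30581cd9="" class="data-each"></div>
</div>
"""


def extract_values(soup):
    """Hypothetical helper: collect numeric text from div.data-each elements."""
    values = []
    # class_ matches any element whose class list includes "data-each";
    # the data-v-* attribute can simply be ignored.
    for div in soup.find_all("div", class_="data-each"):
        text = div.get_text(strip=True)
        if not text:
            continue  # skip empty cells such as the last div
        try:
            values.append(float(text))
        except ValueError:
            continue  # skip cells that do not hold a plain number
    return values


soup = bs4.BeautifulSoup(html, "html.parser")
values = extract_values(soup)
print(values)
```

If the list comes back empty for the real page, comparing `r.text` with the browser's "view source" output would show whether the numbers are in the server response at all.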
【Discussion】:
Tags: python html web-scraping beautifulsoup