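【Posted】: 2019-02-14 08:39:03
【Question】:
I want to retrieve the href attribute of the link inside an <h3> tag on an HTML page. I can get the innerText, but I don't know how to access the href attribute. The page contains several <h3> tags, but for now I only need the first one; I'll handle the rest later.
Here is the code I have so far: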
' Requires references to "Microsoft Internet Controls" and "Microsoft HTML Object Library"
Sub Scrap()
    Dim IE As New InternetExplorer
    Dim sDD As String
    Dim Doc As HTMLDocument

    IE.Visible = True
    IE.navigate "https://www.oneoiljobsearch.com/senior-reservoir-engineer-jobs/?page=1"

    ' Wait until the page has finished loading
    Do
        DoEvents
    Loop Until IE.readyState = READYSTATE_COMPLETE

    Set Doc = IE.document
    ' Text of the first <h3> on the page
    sDD = Trim(Doc.getElementsByTagName("h3")(0).innerText)
    'sDD contains the string "Senior Reservoir Engineer"
End Sub
Here is part of the HTML document I want to extract the data from:
<div class="front_job_details">
<h3>
<a href="/jobs/senior-reservoir-engineer-oslo-norway-7?cmp=js&from=job-search-form-2" target="_blank">
Senior Reservoir Engineer
</a>
</h3>
The text I need to retrieve is: "/jobs/senior-reservoir-engineer-oslo-norway-7?cmp=js&from=job-search-form-2"
Thanks in advance for your help.
【Comments】:
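One possible approach, offered as a sketch rather than a tested solution: in the HTML shown, the href belongs to the <a> element nested inside the <h3>, not to the <h3> itself, so you can take the first <h3>, look up the first <a> inside it, and read that element's href attribute. The names ScrapHref, h3s, anchors and sHref are made up for illustration, and the code assumes the same two library references as the original macro. Depending on the document mode Internet Explorer uses for the page, getAttribute("href") may return the resolved absolute URL instead of the relative path written in the source.

```vba
' Sketch only. Assumes references to "Microsoft Internet Controls"
' and "Microsoft HTML Object Library", as in the question's code.
Sub ScrapHref()
    Dim IE As New InternetExplorer
    Dim Doc As HTMLDocument
    Dim h3s As Object       ' collection of <h3> elements
    Dim anchors As Object   ' collection of <a> elements inside the first <h3>
    Dim sHref As String

    IE.Visible = True
    IE.navigate "https://www.oneoiljobsearch.com/senior-reservoir-engineer-jobs/?page=1"

    ' Wait until the page has finished loading
    Do
        DoEvents
    Loop Until IE.readyState = READYSTATE_COMPLETE

    Set Doc = IE.document
    Set h3s = Doc.getElementsByTagName("h3")

    If h3s.Length > 0 Then
        ' The link is a child of the <h3>, so search inside that element
        Set anchors = h3s.Item(0).getElementsByTagName("a")
        If anchors.Length > 0 Then
            sHref = anchors.Item(0).getAttribute("href")
            Debug.Print sHref
        End If
    End If

    IE.Quit
End Sub
```

To process the remaining headings later, the same lookup could be run in a loop from 0 to h3s.Length - 1.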
Tags: excel vba web-scraping href