Extract href values with xpath on Python 2.7: answers

【Question title】: Extract href values with xpath on Python 2.7
【Posted】: 2013-03-07 13:09:34
【Question description】:

I have this HTML:

<a href="some content">Click here</a>

How can I extract both "some content" and "Click here" using xpath on Python 2.7?

So far I have the following, which only extracts "some content" from the href:

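import lxml.html as LH  # lxml.html provides fromstring() for HTML and text_content()
import requests

r = requests.get("http://localhost")
html = r.text
root = LH.fromstring(html)
# Returns only the href attribute values of every <a> element
print root.xpath('//a/@href')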

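【Question discussion】:

【Solution 1】:

With a path like //a/@href, XPath selects either the attribute or the text, not both together. You can instead select all <a> elements and then read the href attribute and the text content from each one, like this: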

for elt in root.xpath('//a'):
    # elt.get('href') returns None instead of raising KeyError when an <a> has no href
    print elt.get('href'), elt.text_content()
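
For reference, here is a minimal self-contained sketch of the same idea that runs without a web server. It assumes lxml is installed; the SAMPLE_HTML string and the extract_links helper are hypothetical names introduced only for illustration, and the '//a[@href]' filter is an optional addition that skips anchors without an href.

```python
import lxml.html as LH

# Hypothetical sample markup standing in for the page fetched in the question.
SAMPLE_HTML = '''
<html><body>
  <a href="some content">Click here</a>
  <a name="anchor-without-href">No link</a>
</body></html>
'''


def extract_links(html):
    """Return a list of (href, text) pairs for every <a> that has an href."""
    root = LH.fromstring(html)
    pairs = []
    # '//a[@href]' selects only the anchors that carry an href attribute.
    for elt in root.xpath('//a[@href]'):
        # get() reads the attribute; text_content() gathers the element's text.
        pairs.append((elt.get('href'), elt.text_content().strip()))
    return pairs


if __name__ == '__main__':
    for href, text in extract_links(SAMPLE_HTML):
        # %-formatting prints the same way on Python 2.7 and Python 3.
        print('%s -> %s' % (href, text))
```

Returning pairs from a function, rather than printing inside the loop, keeps each href together with its link text so the result can be reused or checked elsewhere.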

【Discussion】: