【Posted】: 2021-04-03 09:57:28
【Question】:
I want to extract the comments from a website. I tried using Selenium with an XPath to extract them, but it doesn't work.
from selenium import webdriver
import pandas as pd
driver = webdriver.Chrome()
driver.get('https://finance.detik.com/berita-ekonomi-bisnis/d-5307853/ri-disebut-punya-risiko-korupsi-yang-tinggi?_ga=2.13736693.357978333.1608782559-293324864.1608782559')
userid_element = driver.find_elements_by_xpath('//*[@id="cmt66364625"]/div[1]/div[1]/text()')[0]
userid = userid_element.text
This is the result:
IndexError Traceback (most recent call last)
<ipython-input-73-151acf07e320> in <module>
----> 1 userid_element = driver.find_elements_by_xpath('//*[@id="cmt66364625"]/div[1]/div[1]/text()')[0]
2 userid = userid_element.text
IndexError: list index out of range
I then tried removing the list index:
userid_element = driver.find_elements_by_xpath('//*[@id="cmt66364625"]/div[1]/div[1]/text()')
userid = userid_element.text
But then the result is:
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
<ipython-input-74-890ba28d7494> in <module>
1 userid_element = driver.find_elements_by_xpath('//*[@id="cmt66364625"]/div[1]/div[1]/text()')
----> 2 userid = userid_element.text
AttributeError: 'list' object has no attribute 'text'
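For reference, the two tracebacks point at two separate things: `find_elements_by_xpath` returned an empty list (hence the `IndexError`), and a list itself has no `.text` (hence the `AttributeError`). Below is a minimal sketch of a more defensive lookup, not a confirmed fix for this page. It assumes the comments load late and possibly inside an iframe (the question is tagged `iframe`, but the frame locator is unknown, so `frame_locator` is a hypothetical parameter). The helper names `strip_text_step`, `first_text` and `read_comment_author` are made up for illustration. It also drops the trailing `/text()` step, since Selenium locators have to resolve to elements rather than text nodes, and uses `find_elements(By.XPATH, ...)`, the form used by current Selenium releases.

```python
def strip_text_step(xpath):
    """Drop a trailing /text() step so the XPath selects an element, not a text node."""
    suffix = "/text()"
    return xpath[: -len(suffix)] if xpath.endswith(suffix) else xpath


def first_text(elements):
    """Return the .text of the first element, or None when the list is empty."""
    return elements[0].text if elements else None


def read_comment_author(driver, xpath, frame_locator=None, timeout=10):
    """Wait for the element(s) behind `xpath` and return the first one's text.

    `frame_locator` is a hypothetical (By, value) pair for the comment iframe;
    pass None if the comments turn out to live in the main document.
    """
    # Imported lazily so the two helpers above work without Selenium installed.
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    wait = WebDriverWait(driver, timeout)
    if frame_locator is not None:
        # Assumption: the comment widget is rendered inside an iframe.
        wait.until(EC.frame_to_be_available_and_switch_to_it(frame_locator))

    element_xpath = strip_text_step(xpath)
    # Raises TimeoutException if nothing matches within `timeout` seconds.
    wait.until(EC.presence_of_all_elements_located((By.XPATH, element_xpath)))
    return first_text(driver.find_elements(By.XPATH, element_xpath))
```

With the driver from the question, this would be called as `read_comment_author(driver, '//*[@id="cmt66364625"]/div[1]/div[1]/text()')`, adding a `frame_locator` only once the iframe has been identified in the browser's developer tools.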
【Discussion】:
Tags: python selenium xpath iframe css-selectors