【发布时间】:2026-02-09 02:55:01
【问题描述】:
我有一个简单的函数返回通过 xpath 从网站检索到的表的内容:
import traceback
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.common.proxy import Proxy, ProxyType
def get_hotbird_13e():
# Downloads an up to date channel/country map for Hotbird 13e.
try:
chrome_options = Options()
chrome_options.add_argument("--headless")
driver = webdriver.Chrome(chrome_options=chrome_options)
driver.get("http://www.eutelsat.com/deploy_tvLineUp/struts/advancedSearch.do?orbitalPositionId=13%B0%20EAST&Langue=EN")
link_xpath = '/html/body/div[1]/div[3]/div/table'
link_path = driver.find_element_by_xpath(link_xpath).text
driver.quit()
print(link_path)
except Exception as exc:
print(traceback.format_exc())
get_hotbird_13e()
...然而,这将返回 xpath 中表的所有元素,其中空格使用分隔符。由于某些字段值中有空格,因此我无法单独访问字段值。
我需要在我的代码中修改什么,以便输出以下示例:
TVN TURBO TVN 13° EAST HOTBIRD 13C POLISH HD CONAX / IRDETO / MEDIAGUARD / NAGRAVISION / VIACCESS
...变成:
TVN TURBO, TVN, 13° EAST, HOTBIRD 13C, POLISH, HD, CONAX / IRDETO / MEDIAGUARD / NAGRAVISION / VIACCESS
谢谢
【问题讨论】:
标签: python-3.x selenium xpath selenium-chromedriver