Can't get the value of a tag with BeautifulSoup

[Question title]: Can't get the value of a tag with BeautifulSoup
[Posted]: 2023-11-06 06:30:01
[Question]:

I've just started using BS4, and I can't figure out why I'm unable to extract the text from the table below -> http://pastebin.com/MCQC7wLY

Here is my code:

    for team in soup.find_all('tr'):
        print(team.a.string)

I get the following error:

    AttributeError: 'NoneType' object has no attribute 'string'

I've also tried other things, such as:

    for team in soup.find_all('tr'):
        print(team.find('a').string)

But I always get the same error.

This is what team.find('a') returns:

<a href="/entry/688922/event-history/7/">FC Lasne</a>

I want to extract "FC Lasne".

This is driving me crazy, because normally I just call find('a').string and it simply works.

How should I proceed?

Thanks

[Comments on the question]:

Tags: python web-scraping beautifulsoup

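[Answer 1]:

The first tr in your example doesn't contain any a tag, so team.find('a') returns None for that row, and accessing .string on None raises the AttributeError.

You can skip any tr that has no link: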

    for team in soup.find_all('tr'):
        link = team.find('a')
        # find() returns None when the row contains no <a> tag
        if link is None:
            continue
        print(link.string)

Alternatively, you could just search for the links directly:

    soup.find_all('a')
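
To make both approaches concrete, here is a minimal, self-contained sketch. The pastebin table isn't reproduced here, so the HTML below is a hypothetical stand-in: a header row without a link, followed by data rows that each contain one. The second row's team name and href are made up for illustration, and the variable names are just placeholders.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the table in the question: the header row has
# no <a>, the data rows do. Only the "FC Lasne" link comes from the question.
html = """
<table>
  <tr><th>Rank</th><th>Team</th></tr>
  <tr><td>1</td><td><a href="/entry/688922/event-history/7/">FC Lasne</a></td></tr>
  <tr><td>2</td><td><a href="/entry/1/event-history/7/">Example Team</a></td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")

# The first <tr> is the header row, so find('a') gives None for it.
header_link = soup.find("tr").find("a")

# Approach 1: walk the rows and skip those without a link.
names_by_row = []
for row in soup.find_all("tr"):
    link = row.find("a")
    if link is None:
        continue
    names_by_row.append(link.string)

# Approach 2: collect the links directly, ignoring the row structure.
names_by_link = [a.string for a in soup.find_all("a")]

print(names_by_row)
print(names_by_link)
```

Walking the rows is probably the better fit if you also need other cells from the same row. Searching for the links directly is shorter, but it will pick up every a tag in the document, including any outside the table.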

[Comments]:

Gosh, how could I have missed that the first tr doesn't have any a... That will teach me to code for hours on end. Thanks for your help
