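【Posted】: 2019-08-26 18:12:19
【Question】:
I am trying to scrape a website where the details are spread across various div tags. Every element sits inside a div, and there are span tags nested under those divs, so I have not been able to extract anything. I have also written code that returns empty strings.
Here is my code: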
import requests
from bs4 import BeautifulSoup

unspsc_link = "https://order.besse.com/Orders/Search/ProductSearch?query=34431"
link = requests.get(unspsc_link).text
soup = BeautifulSoup(link, 'lxml')
prdItemNumbers = []
prdTitles = []
prdSubTitles = []
prdNDCs = []
prdUOM = []
prdForm = []
for row in soup.select('.row'):
    prdItemNumbers = row.select_one('.font-xs bg-teal')
    if prdItemNumbers is None:
        prdItemNumbers.append('N/A')
    else:
        prdItemNumbers.append(prdItemNumbers.text.strip().replace('\u200b',''))
    prdTitles = row.select_one('.header1')
    if prdTitles is None:
        prdTitles.append('N/A')
    else:
        prdTitles.append(prdTitles.text.strip())
    prdSubTitles = row.select_one('.header2')
    if prdSubTitles is None:
        prdSubTitles.append('N/A')
    else:
        prdSubTitles.append(prdSubTitles.text.strip())
    prdNDCs = row.select_one('.col-sm-5')
    if prdNDCs is None:
        prdNDCs.append('N/A')
    else:
        prdNDCs.append(prdNDCs.text.strip())
    prdUOM = row.select_one('.col-sm-3')
    if prdUOM is None:
        prdUOM.append('N/A')
    else:
        prdUOM.append(prdUOM.text.strip())
    prdForm = row.select_one('.col-sm-4')
    if prdForm is None:
        prdForm.append('N/A')
    else:
        prdForm.append(prdForm.text.strip())
It raises this error:
prdItemNumbers.append('N/A')
AttributeError: 'NoneType' object has no attribute 'append'
【Discussion】:
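The traceback appears to come from name reuse: inside the loop, `prdItemNumbers = row.select_one(...)` rebinds the name that previously held the list, so when nothing matches, the name is `None` and `.append` fails. Separately, `'.font-xs bg-teal'` is a descendant selector that looks for a `<bg-teal>` element inside `.font-xs`; assuming the target element carries both classes, `'.font-xs.bg-teal'` would be the selector to try. Below is a minimal sketch of both fixes. `SAMPLE_HTML`, `text_or_na`, and `scrape` are hypothetical names introduced for illustration, the markup is invented because the real page structure is not shown in the question, and `html.parser` is used only to avoid the `lxml` dependency.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page (an assumption, not the site's HTML).
SAMPLE_HTML = """
<div class="row">
  <span class="font-xs bg-teal">10001\u200b</span>
  <div class="header1">Product A</div>
  <div class="header2">Subtitle A</div>
</div>
<div class="row">
  <div class="header1">Product B</div>
</div>
"""


def text_or_na(parent, selector):
    """Return the cleaned text of the first match, or 'N/A' if nothing matches."""
    tag = parent.select_one(selector)  # the tag gets its own name, separate from any list
    if tag is None:
        return 'N/A'
    # Remove zero-width spaces first, then trim surrounding whitespace.
    return tag.get_text().replace('\u200b', '').strip()


def scrape(html):
    """Collect one value per row for each field, keeping the lists aligned."""
    soup = BeautifulSoup(html, 'html.parser')
    item_numbers, titles, sub_titles = [], [], []
    for row in soup.select('.row'):
        # '.font-xs.bg-teal' matches one element that has BOTH classes.
        item_numbers.append(text_or_na(row, '.font-xs.bg-teal'))
        titles.append(text_or_na(row, '.header1'))
        sub_titles.append(text_or_na(row, '.header2'))
    return item_numbers, titles, sub_titles


item_numbers, titles, sub_titles = scrape(SAMPLE_HTML)
print(item_numbers, titles, sub_titles)
```

If the live page still yields only 'N/A' after these changes, one possibility worth checking is that the content is rendered by JavaScript or sits behind a login, in which case the elements would not be present in `requests.get(...).text` at all.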
Tags: python html web-scraping beautifulsoup