【发布时间】:2016-05-30 17:30:56
【问题描述】:
我要提取的信息:
列表中的位置 Al Bayan 和尼泊尔 ['Al Bayan' , 'Nepal']
<div class="location">
<div class="listing-location">Location</div>
<div class="location-areas">
<span class="location">Al Bayan</span>
,
<span class="location">Nepal</span>
</div>
<div class="area-description"> 3.3 km from Mall of the Emirates </div>
</div>
提取区域的代码:
区域
try:
area= soup.find('div', 'location-areas')
area_result= str(area.get_text().strip().encode("utf-8"))
print([area_result])
except StandardError as e:
area_result="Error was {0}".format(e)
print area_result
输出:
"Al Bayanأ¢â‚¬آھ,أ¢â‚¬آھ
Nepal"
所需的输出:
['Al Bayan', 'Nepal']
【问题讨论】:
-
你能不能用一句话概括实际的问题?
标签: python python-2.7 python-3.x web-scraping bs4