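【Posted】: 2017-06-18 06:42:20
【Question】:
I would like Python 3.6 to write the output of the following code to a CSV file. Ideally, each article on the news website gets one row, with four columns: "Title", "URL", "Category" [#Politik etc.], and "PublishedAt".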
from bs4 import BeautifulSoup
import requests

website = 'http://spiegel.de/schlagzeilen'
r = requests.get(website)
soup = BeautifulSoup(r.content, "lxml")
# Container element that holds the headline overview
div = soup.find("div", {"class": "schlagzeilen-content schlagzeilen-overview"})
# For every link with a title attribute, print its text plus the text of
# the next sibling <span>, then the link target
for a in div.find_all('a', title=True):
    print(a.text, a.find_next_sibling('span').text)
    print(a.get('href'))
For writing the CSV, I already have this:
import csv
import datetime

with open('%s_schlagzeilen.csv' % datetime.datetime.now().strftime('%Y-%m-%d_%H-%M-%S.%f'), 'w', newline='',
          encoding='utf-8') as file:
    w = csv.writer(file, delimiter="|")
    w.writerow([...])
...and I need to know what to do next. Thanks in advance!
【Discussion】:
Tags: python python-3.x csv beautifulsoup
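One possible way to continue is to build one list per article inside the loop and hand all of them to the CSV writer after a header row. The following is only a sketch under assumptions: the helper names `split_meta`, `collect_rows`, and `write_csv` are made up for illustration, and the text of the sibling `<span>` is assumed to look like `(Politik, 06:30)`, which the question does not confirm.

```python
import csv


def split_meta(meta):
    """Split '(Politik, 06:30)' into ('Politik', '06:30').

    ASSUMPTION: the span text has the form '(category, time)'. Without a
    comma, the whole text becomes the category and the time stays empty.
    """
    category, _, published = meta.strip().strip('()').partition(',')
    return category.strip(), published.strip()


def collect_rows(div):
    """Return one [title, url, category, published_at] list per headline link.

    `div` is the BeautifulSoup element found in the question's code.
    """
    rows = []
    for a in div.find_all('a', title=True):
        span = a.find_next_sibling('span')
        category, published = split_meta(span.text if span else '')
        rows.append([a.text.strip(), a.get('href'), category, published])
    return rows


def write_csv(rows, file):
    """Write a header row, then one row per article, to an open text file."""
    w = csv.writer(file, delimiter='|')
    w.writerow(['Title', 'URL', 'Category', 'PublishedAt'])
    w.writerows(rows)
```

With `div` from the scraping code in the question, calling `write_csv(collect_rows(div), file)` inside the existing `with open(...)` block should produce one line per article. If the span text has a different shape on the live page, only `split_meta` would need to change.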