【发布时间】:2019-01-02 14:17:54
【问题描述】:
我试过了,两种下载 XML 文件的方法:
import requests
from tqdm import tqdm
url = "http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/6.2/msigdb_v6.2.xml"
response = requests.get(url, stream=True)
with open("lol.xml", "wb") as handle:
for data in tqdm(response.iter_content()):
handle.write(data)
第二个:
import urllib2
response = urllib2.urlopen(url)
data = response.read()
print(data)
URL 被重定向到:
response.url
u'https://software.broadinstitute.org/gsea/login.jsp;jsessionid=2544FF431CB094FBBA80451EDD3A0411'
事实证明,我只下载了html文件而不是xml文件,输出的代码sn-p:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<base href="http://software.broadinstitute.org/gsea/" />
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1" />
<meta name="verify-v1" content="/23Jlayki9tnRqU7DcCYrbFI7zPmHJ3HfeZltM6mK5Q=" />
<title>GSEA | Login</title>
<link href="css/style.css" rel="stylesheet" type="text/css" />
</head>
如何下载 XML 文件?
【问题讨论】:
标签: xml python-2.7 http url web