【Question title】: Trying to get data from a website, but nothing comes back
【Posted】: 2021-01-05 09:59:01
【Question】:
from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup


my_url = 'https://tradingeconomics.com/india/indicators'

uClient2 = uReq(my_url)
page_html = uClient2.read()
uClient2.close()

page_soup = soup(page_html, "html.parser")

containers11 = page_soup.findAll("div",{"class":"tab-pane"})

filename = "tradeeconomics.csv"

f = open(filename, "w", encoding='utf-8-sig')
headers = "GDP, Last, Reference,Previous,Range,Frequency\n"
f.write(headers)

for container in containers11:

    title_container = container.findAll("div",{"class":"col-md-4 col-xs-5"})


print(title_container)

f.close()

【Question comments】:

Tags: python selenium beautifulsoup


【Solution 1】:

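You don't actually need BeautifulSoup to get the tables.

You might want to try pandas, which can parse every <table> on the page directly.

For example: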

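from io import StringIO

import pandas as pd
import requests

html = requests.get("https://tradingeconomics.com/india/indicators").text
# read_html returns a list with one DataFrame per <table>; [1] selects the second table.
# Wrapping the markup in StringIO avoids the deprecation warning newer pandas raises for literal HTML strings.
df = pd.read_html(StringIO(html), flavor="bs4")[1]
df.to_csv("trade_economics.csv", index=False)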


【Comments】:

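  • But this approach doesn't extract the data from the other tables (GDP, labour, prices, and so on).
  • pd.read_html returns a list of DataFrames, one per table. Index into that list to get the table you want.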
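
To make that last comment concrete, here is a rough sketch of collecting every table instead of only index [1]. It runs on a small made-up HTML snippet rather than the live page, so the table names and numbers below are placeholders, and the helper name tables_by_first_column is hypothetical. It also assumes each table's first column header names its category (as the CSV header line in the question suggests), which may not hold on the real page.

```python
from io import StringIO

import pandas as pd

# Made-up stand-in for the downloaded page: two small tables with placeholder values.
SAMPLE_HTML = """
<table>
  <tr><th>GDP</th><th>Last</th><th>Previous</th></tr>
  <tr><td>GDP Growth Rate</td><td>1.0</td><td>2.0</td></tr>
</table>
<table>
  <tr><th>Labour</th><th>Last</th><th>Previous</th></tr>
  <tr><td>Unemployment Rate</td><td>3.0</td><td>4.0</td></tr>
  <tr><td>Population</td><td>5.0</td><td>6.0</td></tr>
</table>
"""


def tables_by_first_column(html):
    """Parse every <table> in html and key each DataFrame by its first column header."""
    frames = pd.read_html(StringIO(html))  # list with one DataFrame per table
    return {str(frame.columns[0]): frame for frame in frames}


tables = tables_by_first_column(SAMPLE_HTML)
for name, frame in tables.items():
    print(name, frame.shape)
    # To save one file per table instead of printing:
    # frame.to_csv(f"trade_economics_{name}.csv", index=False)
```

With the real page you would pass the text from requests.get(...) in place of SAMPLE_HTML. If two tables share the same first header, the later one overwrites the earlier one in the dict, so plain list indexing is the safer choice in that case.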