【发布时间】:2021-08-13 00:32:01
【问题描述】:
我一直在做一个小项目,它是一个网络爬虫模板。我在 pycharm 中遇到问题,我收到警告 Unresolved attribute reference 'domain' for class 'Scraper'
from abc import abstractmethod
import requests
import tldextract
class Scraper:
scrapers = {}
def __init_subclass__(scraper_class):
Scraper.scrapers[scraper_class.domain] = scraper_class # Unresolved attribute reference 'domain' for class 'Scraper'
@classmethod
def for_url(cls, url):
k = tldextract.extract(url)
# Returns -> <scraper.SydsvenskanScraper object at 0x000001E94F135850> & Scraped BBC News<!DOCTYPE html><html Which type annotiation?
return cls.scrapers[k.registered_domain](url)
@abstractmethod
def scrape(self):
pass
class BBCScraper(Scraper):
domain = 'bbc.co.uk'
def __init__(self, url):
self.url = url
def scrape(self):
rep = requests.Response = requests.get(self.url)
return "Scraped BBC News" + rep.text[:20] # ALL HTML CONTENT
class SydsvenskanScraper(Scraper):
domain = 'sydsvenskan.se'
def __init__(self, url):
self.url = url
def scrape(self):
rep = requests.Response = requests.get(self.url)
return "Scraped Sydsvenskan News" + rep.text[:20] # ALL HTML CONTENT
if __name__ == "__main__":
URLS = ['https://www.sydsvenskan.se/', 'https://www.bbc.co.uk/']
for urls in URLS:
get_product = Scraper.for_url(urls)
r = get_product.scrape()
print(r)
当然我可以忽略它,因为它正在工作,但我不喜欢忽略警告,因为我相信 pycharm 很聪明,应该解决警告而不是忽略它,我想知道它警告我的原因是什么?
【问题讨论】:
-
注意这里和其他stackoverflow.com/questions/67669212/… :您可以考虑accepting an answer 来奖励最有用的答案
-
@azro 对不起!我没有像我想的那样接受答案,我是给+1的人哈哈!刚刚接受❤️
标签: python python-3.x class-attributes