【发布时间】:2021-11-05 03:23:29
【问题描述】:
根据here的建议,我正在尝试:
scrapy crawl spider-name -a start_urls="https://start-url.com/"
我明白了:
Traceback (most recent call last):
File "/usr/local/lib/python3.9/site-packages/scrapy/core/engine.py", line 129, in _next_request
request = next(slot.start_requests)
File "/usr/local/lib/python3.9/site-packages/scrapy/spiders/__init__.py", line 77, in start_requests
yield Request(url, dont_filter=True)
File "/usr/local/lib/python3.9/site-packages/scrapy/http/request/__init__.py", line 25, in __init__
self._set_url(url)
File "/usr/local/lib/python3.9/site-packages/scrapy/http/request/__init__.py", line 73, in _set_url
raise ValueError(f'Missing scheme in request url: {self._url}')
要重现,请运行以下命令:
scrapy startproject example_project
cd example_project
scrapy genspider spider1 https://stackoverflow.com
scrapy crawl spider1 -a start_urls="https://stackoverflow.com"
【问题讨论】:
-
请分享代码
-
我编辑了这个问题,并包含了一个类似的例子