【发布时间】:2016-09-03 07:57:43
【问题描述】:
按照 alexce 的说明 here 安装 docker、scrapyjs 和 splash。
然后,运行docker run -p 8050:8050 scrapinghub/splash,我得到这个输出:
2016-05-08 17:17:45+0000 [-] Log opened.
2016-05-08 17:17:45.978866 [-] Splash version: 2.1
2016-05-08 17:17:45.979553 [-] Qt 5.5.1, PyQt 5.5.1, WebKit 538.1, sip 4.17, Twisted 16.1.1, Lua 5.2
2016-05-08 17:17:45.980138 [-] Python 3.4.3 (default, Oct 14 2015, 20:28:29) [GCC 4.8.4]
2016-05-08 17:17:45.980401 [-] Open files limit: 1048576
2016-05-08 17:17:45.981020 [-] Can't bump open files limit
2016-05-08 17:17:46.086232 [-] Xvfb is started: ['Xvfb', ':1', '-screen', '0', '1024x768x24']
2016-05-08 17:17:46.161902 [-] proxy profiles support is enabled, proxy profiles path: /etc/splash/proxy-profiles
2016-05-08 17:17:46.260357 [-] verbosity=1
2016-05-08 17:17:46.260607 [-] slots=50
2016-05-08 17:17:46.261170 [-] argument_cache_max_entries=500
2016-05-08 17:17:46.262476 [-] Web UI: enabled, Lua: enabled (sandbox: enabled)
2016-05-08 17:17:46.264565 [-] Site starting on 8050
2016-05-08 17:17:46.265203 [-] Starting factory <twisted.web.server.Site object at 0x7f270ec81e10>
它挂在那里。我尝试根据指令here 进行故障排除(重新安装 docker,验证 vm 是否正在运行,重新生成证书,设置环境),但仍然没有。
当在yield scrapy.Request' call withinstart_requests`中没有meta=参数运行蜘蛛时,蜘蛛运行良好(除了没有捕获动态字段),所以我不确定这是docker的问题还是飞溅。
提前致谢。
【问题讨论】: