【发布时间】:2017-11-21 17:05:09
【问题描述】:
我现在正在使用 Github API,这是一个获取列表中每个 repo 的所有拉取请求的函数:
async def get_all_pulls(repos, api):
pulls = []
for repo in repos:
try:
async for pull in api.getiter(f'/repos/{org}/{repo}/pulls?state=all'):
pull['repo'] = repo
if pull not in pulls:
pulls.append(pull)
except Exception:
print(f"Bad repo/no access=> [{repo}]")
continue
return pulls
一切正常,但有一个小问题,由于对 repos 的迭代需要很长时间(假设有 30 个)。
我试图让它像这样异步(确定我在使用它时在声明中摆脱了 for 循环):
# gather all prs for all repos
tasks = [asyncio.ensure_future(get_all_pulls_for_repo(api, repo)) for repo in repos]
results = await asyncio.gather(*tasks)
# unwrap list of lists
for res in results:
all_pull_requests += res
但我遇到崩溃并说 repos 不好等。 我想我在这里遗漏了一些重要的东西,但找不到什么。
为什么它会因异步 for 循环而崩溃?我可以让它工作吗?
更新1: get_all_reviews 的追溯:
Traceback (most recent call last):
File "/home/metal/Documents/projects/-git/async_git_tool.py", line 193, in <module>
loop.run_until_complete(main())
File "/home/metal/.pyenv/versions/3.6.0/lib/python3.6/asyncio/base_events.py", line 466, in run_until_complete
return future.result()
File "/home/metal/Documents/projects/-git/async_git_tool.py", line 113, in main
reviewed = await get_all_reviews(created, api, ss_programmers)
File "/home/metal/Documents/projects/-git/async_git_tool.py", line 181, in get_all_reviews
async for review in api.getiter(f'/repos/{org}/{pr_repo}/pulls/{pr_number}/reviews'):
File "/home/metal/Documents/projects/-git/venv/lib/python3.6/site-packages/gidgethub/abc.py", line 85, in getiter
data, more = await self._make_request("GET", url, url_vars, b"", accept)
File "/home/metal/Documents/projects/-git/venv/lib/python3.6/site-packages/gidgethub/abc.py", line 66, in _make_request
data, self.rate_limit, more = sansio.decipher_response(*response)
File "/home/metal/Documents/projects/-git/venv/lib/python3.6/site-packages/gidgethub/sansio.py", line 284, in decipher_response
rate_limit = RateLimit.from_http(headers)
File "/home/metal/Documents/projects/-git/venv/lib/python3.6/site-packages/gidgethub/sansio.py", line 226, in from_http
limit = int(headers["x-ratelimit-limit"])
File "multidict/_multidict.pyx", line 140, in multidict._multidict._Base.__getitem__
File "multidict/_multidict.pyx", line 135, in multidict._multidict._Base._getone
KeyError: "Key not found: 'x-ratelimit-limit'"
这是函数本身:
async def get_all_reviews(pulls, api, programmers):
reviewed_pulls = []
for pull in pulls:
pr_repo = pull['repo']
pr_number = str(pull['number'])
async for review in api.getiter(f'/repos/{org}/{pr_repo}/pulls/{pr_number}/reviews'):
if review['user']['login'] not in programmers \
and pull not in reviewed_pulls:
reviewed_pulls.append(pull)
return reviewed_pulls
我这样称呼它:
reviewed = await get_all_reviews(softserve_created, api, ss_programmers)
【问题讨论】:
-
你使用什么具体的模块来处理 Github api?
-
@MikhailGerasimov 我正在使用 _Gidgegethub_(gidgethub.readthedocs.io/en/stable)
标签: python-3.x asynchronous python-asyncio