【Posted】: 2019-07-10 18:04:04
【Question】:
I get the following error in the terminal:
Traceback (most recent call last):
  File "deep_Q_learner.py", line 289, in <module>
    agent.replay_experience()
  File "deep_Q_learner.py", line 170, in replay_experience
    self.learn_from_batch_experience(experience_batch)
  File "deep_Q_learner.py", line 151, in learn_from_batch_experience
    self.Q_target(next_obs_batch).max(1)[0].data
TypeError: mul(): argument 'other' (position 1) must be Tensor, not numpy.ndarray
The error only occurs when self.DQN=SLP (see line 76 of deep_Q_learner.py).
Is there a fix for this? Am I missing something here?
【Discussion】:
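The TypeError says that a torch multiplication received a numpy.ndarray where it expected a Tensor, so one of the operands on line 151 is presumably still a NumPy array. The full expression is not shown in the question, so the following is only a minimal sketch of the usual fix, assuming the TD target is built from a reward batch and a done mask that come out of the replay memory as NumPy arrays. The names gamma, reward_batch, done_batch and q_next are hypothetical stand-ins, not taken from deep_Q_learner.py.

```python
import numpy as np
import torch

gamma = 0.98  # hypothetical discount factor

# Hypothetical batch of 4 transitions, stored as NumPy arrays
# (an assumption about how the replay memory returns them).
reward_batch = np.array([1.0, 0.0, 1.0, 0.5], dtype=np.float32)
done_batch = np.array([False, False, True, False])

# Stand-in for self.Q_target(next_obs_batch): Q-values for 2 actions per transition.
q_next = torch.tensor([[0.2, 0.7],
                       [0.1, 0.4],
                       [0.9, 0.3],
                       [0.5, 0.6]])
max_q_next = q_next.max(1)[0].detach()  # best next-state value, no gradient

# Convert every NumPy operand to a tensor before combining it with tensors.
reward_t = torch.from_numpy(reward_batch)
not_done_t = torch.from_numpy((~done_batch).astype(np.float32))

# TD target: r + gamma * (1 - done) * max_a Q_target(s', a)
td_target = reward_t + gamma * not_done_t * max_q_next
print(td_target)
```

If the network runs on a GPU, the converted tensors would also need to be moved to the same device with .to(device) before the arithmetic.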
Tags: pytorch reinforcement-learning