【问题标题】:Trouble running Gensim LDA运行 Gensim LDA 时出现问题
【发布时间】:2016-06-24 09:07:48
【问题描述】:

我正在尝试运行此处描述的分布式 LDA 示例:

https://radimrehurek.com/gensim/dist_lda.html

我按照这里的教程创建了一组文档:

https://radimrehurek.com/gensim/dist_lsi.html

按照它的建议,通过“将语料库膨胀[ing] 到 1M 个文档,通过反复重复其文档”

我正在使用 python 3.3 和 numpy 1.9.2 我不断收到以下错误:

Exception in thread oneway-call:
Traceback (most recent call last):
  File "/usr/lib64/python3.3/threading.py", line 901, in _bootstrap_inner
    self.run()
  File "/usr/lib/python3.3/site-packages/Pyro4/core.py", line 1484, in run
    super(_OnewayCallThread, self).run()
  File "/usr/lib64/python3.3/threading.py", line 858, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/lib64/python3.3/site-packages/gensim/models/lda_worker.py", line 71, in requestjob
    self.processjob(job)
  File "/usr/lib64/python3.3/site-packages/gensim/utils.py", line 98, in _synchronizer
    result = func(self, *args, **kwargs)
  File "/usr/lib64/python3.3/site-packages/gensim/models/lda_worker.py", line 80, in processjob
    self.model.do_estep(job)
  File "/usr/lib64/python3.3/site-packages/gensim/models/ldamodel.py", line 480, in do_estep
    gamma, sstats = self.inference(chunk, collect_sstats=True)
  File "/usr/lib64/python3.3/site-packages/gensim/models/ldamodel.py", line 423, in inference
    if doc and not isinstance(doc[0][0], six.integer_types):
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

我运行了分布式 lsi 示例,它运行良好,但由于某种原因,我似乎无法让 lda 工作。

我尝试将 /usr/lib64/python3.3/site-packages/gensim/models/ldamodel.py 中的第 423 行更改为:

if doc is not None and not isinstance(doc[0][0], six.integer_types):

错误消失了,但我收到了警告

FutureWarning: comparison to `None` will result in an elementwise object comparison in the future.

谁能解释我做错了什么?我对此文件的更改是否正确?或者我应该以不同的方式运行 LDA?

【问题讨论】:

    标签: python numpy lda gensim


    【解决方案1】:

    这是 gensim 中的一个错误,由 here 提出。

    编辑 - 此问题现已在 this pull request 中解决。

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 2017-03-24
      • 2017-02-19
      • 1970-01-01
      • 1970-01-01
      • 2019-01-28
      • 2013-12-19
      • 1970-01-01
      相关资源
      最近更新 更多