【问题标题】:word2vec deeplearning4j ArrayIndexOutOfBoundsException in WordVectorSerializer.loadTxtVectors()WordVectorSerializer.loadTxtVectors() 中的 word2vec deeplearning4j ArrayIndexOutOfBoundsException
【发布时间】:2016-10-20 22:39:03
【问题描述】:

我从http://deeplearning4j.org/word2vec 训练了 word2vec 模型 成功,现在在尝试应用 单词最近的:

Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 99
at 
org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.loadTxt(WordVectorSerializer.java:1107)
at 
org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.loadTxtVectors(WordVectorSerializer.java:1033)
at 
org.deeplearning4j.examples.nlp.word2vec.NearestWords.main(NearestWords.java:13)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at com.intellij.rt.execution.application.AppMain.main(AppMain.java:144)

这是我的代码:

package org.deeplearning4j.examples.nlp.word2vec;
import java.io.File;
import java.util.Collection;
import org.deeplearning4j.models.embeddings.loader.WordVectorSerializer;
import org.deeplearning4j.models.embeddings.wordvectors.WordVectors;
public class NearestWords {
    public static void main(String[] args) throws Exception{
        File file = new File("pathToWriteto.txt");
        WordVectors vec = WordVectorSerializer.loadTxtVectors(file);
        Collection<String> similar = vec.wordsNearest("day", 10);
        System.out.println(similar);
    }
}

【问题讨论】:

标签: java deep-learning word2vec deeplearning4j


【解决方案1】:

当前版本 (0.4-rc3.10) 在 loadTxt 函数中有一个错误。它已在主存储库中修复,并将反映在下一个版本中。阅读此 github 问题以解决您的问题: https://github.com/deeplearning4j/deeplearning4j/issues/1721

目前一个简单的解决方法是将最新的 WordVectorSerializer.java 复制到您的项目中。 https://github.com/deeplearning4j/deeplearning4j/blob/master/deeplearning4j-scaleout/deeplearning4j-nlp/src/main/java/org/deeplearning4j/models/embeddings/loader/WordVectorSerializer.java

【讨论】:

    猜你喜欢
    • 1970-01-01
    • 2016-09-05
    • 2020-04-01
    • 2016-12-01
    • 2015-12-21
    • 2021-09-11
    • 1970-01-01
    • 1970-01-01
    • 2015-12-11
    相关资源
    最近更新 更多