【问题标题】:Stanford NLP pos-tagger model REQUIREMENTS斯坦福 NLP 后标记模型要求
【发布时间】:2025-11-24 05:25:01
【问题描述】:

我创建了一个基本应用程序,我通过 OpenIE Lib 使用斯坦福解析器,当使用指定属性初始化 StanfordCoreNLP 时,它停止为 pos(pos-tagger)。我确实认为包含了所有必需的模型,所以不确定为什么该过程无法找到模型数据。

Properties props = new Properties();       
props.setProperty("annotators", "tokenize, ssplit, pos, lemma, ner, parse, dcoref");
StanfordCoreNLP pipeline = new StanfordCoreNLP(props);
Annotation doc = new Annotation(testString);
pipeline.annotate(doc);
for (CoreMap sentence : doc.get(CoreAnnotations.SentencesAnnotation.class))
{
  Collection<RelationTriple> triples = sentence.get(NaturalLogicAnnotations.RelationTriplesAnnotation.class);
  for (RelationTriple triple : triples) {              
    System.out.println(triple.confidence + "\t" +
       triple.subjectLemmaGloss() + "\t" +
       triple.relationLemmaGloss() + "\t" +
       triple.objectLemmaGloss());
  }
}

包括

stanford-parser.jar
stanford-parser-3.5.2-models.jar
stanford-openie.jar
stanford-openie-models.jar
JDK1.8

StanfordCoreNLP 类启动时的堆栈跟踪。

Reading POS tagger model from edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger ... done [1.2 sec].
Exception in thread "AWT-EventQueue-0" java.lang.NoSuchFieldError: REQUIREMENTS
    at edu.stanford.nlp.pipeline.POSTaggerAnnotator.requires(POSTaggerAnnotator.java:169)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.construct(StanfordCoreNLP.java:362)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.<init>(StanfordCoreNLP.java:131)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.<init>(StanfordCoreNLP.java:127)
    at org.sjdp.q2sm.GUI.extractTriplets(GUI.java:373)
    at org.sjdp.q2sm.GUI.processSPARQL(GUI.java:353)
    at org.sjdp.q2sm.GUI.actionPerformed(GUI.java:153)
    at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2022)
    at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2348)
    at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:402)
    at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:259)
    at javax.swing.plaf.basic.BasicButtonListener.mouseReleased(BasicButtonListener.java:252)
    at java.awt.AWTEventMulticaster.mouseReleased(AWTEventMulticaster.java:289)
    at java.awt.Component.processMouseEvent(Component.java:6535)
    at javax.swing.JComponent.processMouseEvent(JComponent.java:3324)
    at java.awt.Component.processEvent(Component.java:6300)
    at java.awt.Container.processEvent(Container.java:2236)
    at java.awt.Component.dispatchEventImpl(Component.java:4891)
    at java.awt.Container.dispatchEventImpl(Container.java:2294)
    at java.awt.Component.dispatchEvent(Component.java:4713)
    at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4888)
    at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4525)
    at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4466)
    at java.awt.Container.dispatchEventImpl(Container.java:2280)
    at java.awt.Window.dispatchEventImpl(Window.java:2750)
    at java.awt.Component.dispatchEvent(Component.java:4713)
    at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:758)
    at java.awt.EventQueue.access$500(EventQueue.java:97)
    at java.awt.EventQueue$3.run(EventQueue.java:709)
    at java.awt.EventQueue$3.run(EventQueue.java:703)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:76)
    at java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:86)
    at java.awt.EventQueue$4.run(EventQueue.java:731)
    at java.awt.EventQueue$4.run(EventQueue.java:729)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:76)
    at java.awt.EventQueue.dispatchEvent(EventQueue.java:728)
    at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:201)
    at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:116)
    at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:105)
    at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:101)
    at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:93)
    at java.awt.EventDispatchThread.run(EventDispatchThread.java:82)

【问题讨论】:

    标签: stanford-nlp pos-tagger


    【解决方案1】:

    这几乎可以肯定是同时包含解析器和 OpenIE 系统的类不兼容错误。当一切再次同步时,它将在下一个版本(3.5.3)中修复,但由于 OpenIE 在 3.5.2 版本中不存在,因此该 jar 中的代码有点“领先”解析器。

    同时运行两者的最简单方法是从CoreNLP 的 Github 版本运行。命令ant jar 应该创建一个包含解析器和OpenIE 系统的jar 文件。 most recent models(警告:大量下载)应该同时具有解析器和 OpenIE 模型。

    【讨论】:

    • 感谢您的建议。它现在只能抱怨使用 80mb 模型库时找不到 dcoref 模型。模型有点大,但不应该碍事。
    • 除非您传入标志 -resolve_coref,否则不应加载 coref 模型。在这种情况下,您必须通过 CoreNLP 发行版使用 OpenIE 系统,这意味着使用 GitHub 版本的 CoreNLP(除了 coref 目前在 HEAD 中已损坏;请改用this commit)和相关的最新模型。