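[Posted]: 2015-11-29 05:36:27
[Question]:
I am trying to set up a pipeline that produces lemmatized sentences. I know how to get all sentences or all lemmas, but I don't know how to get the lemmas grouped by the sentence they belong to. Here is a code snippet with the missing argument marked as ??????: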
AnalysisEngine pipeline = createEngine(createEngineDescription(
        createEngineDescription(BreakIteratorSegmenter.class),
        createEngineDescription(StanfordLemmatizer.class),
        createEngineDescription(StopWordRemover.class,
                StopWordRemover.PARAM_MODEL_LOCATION, new String[]{"stopwords.txt"})));
JCas jcas = JCasFactory.createJCas();
jcas.setDocumentText("Almost all energy on Earth comes from the Sun. Plants make food energy from sunlight.");
jcas.setDocumentLanguage("en");
pipeline.process(jcas);
for (Sentence s : select(jcas, Sentence.class)) {
    out.println("");
    for (Lemma l : select(??????, Lemma.class))
        out.print(l.getValue() + " ");
}
What do I need to change in this code so that it prints the lemmas of the two input sentences on two separate lines?
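To make the goal concrete, here is a minimal, library-free sketch of the grouping being asked for: for each sentence span, keep only the lemmas whose character offsets fall inside it. The `Span` class, the `coveredValues` helper, and the hand-picked lemma values and offsets are hypothetical stand-ins for illustration only; they are not DKPro Core or uimaFIT API, and the lemma values are not actual `StanfordLemmatizer` output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CoveredSelectionSketch {

    /** Hypothetical stand-in for an annotation: a character span [begin, end) plus a value. */
    public static final class Span {
        public final int begin;
        public final int end;
        public final String value;

        public Span(int begin, int end, String value) {
            this.begin = begin;
            this.end = end;
            this.value = value;
        }
    }

    /** Returns the values of all items whose span lies entirely inside the covering span. */
    public static List<String> coveredValues(Span covering, List<Span> items) {
        List<String> result = new ArrayList<>();
        for (Span item : items) {
            if (item.begin >= covering.begin && item.end <= covering.end) {
                result.add(item.value);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Sentence offsets for the two-sentence example text from the question.
        List<Span> sentences = Arrays.asList(
                new Span(0, 46, "sentence 1"),
                new Span(47, 85, "sentence 2"));

        // A few hand-picked lemmas with their offsets (assumed values, for illustration).
        List<Span> lemmas = Arrays.asList(
                new Span(11, 17, "energy"),
                new Span(27, 32, "come"),
                new Span(47, 53, "plant"),
                new Span(54, 58, "make"));

        // One output line per sentence, containing only the lemmas that sentence covers.
        for (Span sentence : sentences) {
            System.out.println(String.join(" ", coveredValues(sentence, lemmas)));
        }
    }
}
```

As far as I can tell, uimaFIT's `JCasUtil.selectCovered(Lemma.class, s)` performs this kind of covered-span selection on real annotations, so it may be what replaces the `select(??????, Lemma.class)` call; I have not verified that against this pipeline.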
[Comments]:
Tags: java nlp uima dkpro-core