【Posted】: 2020-05-31 04:33:28
【Question】:
I am trying to build an NER model for clinical data using ScispaCy in Colab. I installed the packages like this:
!pip install spacy
!pip install scispacy
!pip install https://s3-us-west-2.amazonaws.com/ai2-s2-scispacy/releases/v0.2.4/en_core_sci_md-0.2.4.tar.gz #pip install <Model URL>
Then I imported everything with:
import scispacy
import spacy
import en_core_sci_md
Then I used the following code to display the sentences and entities:
nlp = spacy.load("en_core_sci_md")
text = """Myeloid derived suppressor cells (MDSC) are immature myeloid cells with immunosuppressive activity. They accumulate in tumor-bearing mice and humans with different types of cancer, including hepatocellular carcinoma (HCC)"""
doc = nlp(text)
print(list(doc.sents))
print(doc.ents)
I get the following error:
OSError: [E050] Can't find model 'en_core_sci_md'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.
I don't know why this error occurs; I followed all the code from the official ScispaCy GitHub page. Any help would be appreciated. Thanks in advance.
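A diagnostic sketch that may help narrow this down: E050 means spacy.load could not resolve the name 'en_core_sci_md', so it is worth checking what the running kernel can actually see. The snippet below uses only the standard library to report whether each package is importable and which version is installed. The helper name package_status is hypothetical and not part of spaCy or ScispaCy. A version mismatch between spaCy and the model, or a Colab runtime that was not restarted after installation, are possible causes but are assumptions here, not something the post confirms.

```python
import importlib.util
from importlib import metadata


def package_status(name):
    """Return (importable, version) for a top-level package name.

    Hypothetical helper for illustration; 'version' is None when no
    installed distribution with that name is found.
    """
    importable = importlib.util.find_spec(name) is not None
    try:
        version = metadata.version(name)
    except metadata.PackageNotFoundError:
        version = None
    return importable, version


# Report on the three packages installed in the question.
for name in ("spacy", "scispacy", "en_core_sci_md"):
    importable, version = package_status(name)
    print(f"{name}: importable={importable}, version={version}")
```

If en_core_sci_md shows up as importable, calling en_core_sci_md.load() directly instead of spacy.load("en_core_sci_md") is another thing to try, since spaCy model packages expose a load() function of their own.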
【Discussion】:
Tags: python nlp spacy named-entity-recognition