【发布时间】:2021-08-16 19:31:19
【问题描述】:
我有兴趣将 Huggingface 的预训练模型用于命名实体识别 (NER) 任务,而无需进一步训练或测试模型。
在model page of HuggingFace上,模型复用的唯一信息如下:
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")
model = AutoModel.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")
我尝试了以下代码,但我得到的是张量输出,而不是每个命名实体的类标签。
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")
model = AutoModel.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")
text = "my text for named entity recognition here."
input_ids = torch.tensor(tokenizer.encode(text, padding=True, truncation=True,max_length=50, add_special_tokens = True)).unsqueeze(0)
with torch.no_grad():
output = model(input_ids, output_attentions=True)
关于如何将模型应用于 NER 文本的任何建议?
【问题讨论】:
标签: huggingface-transformers named-entity-recognition transformer