【发布时间】:2021-12-02 15:27:46
【问题描述】:
我有这种格式的微调数据:
[[(('Kaweah', 'NNP'), 'O'),
(('Delta', 'NNP'), 'O'),
(('Mental', 'NNP'), 'O'),
(('Health', 'NNP'), 'O'),
(('Hospital', 'NNP'), 'O'),
(('D/p', 'NNP'), 'O'),
(('Aph', 'NNP'), 'O'),
(('is', 'VBZ'), 'O'),
(('located', 'VBN'), 'O'),
(('at', 'IN'), 'O'),
(('1100', 'CD'), 'B-GPE'),
(('SO', 'NNP'), 'I-GPE'),
(('.', '.'), 'I-GPE'),
(('AKERS', 'NNP'), 'I-GPE'),
(('STREET', 'NNP'), 'I-GPE')],
[(('CHARLTON', 'NNP'), 'O'),
(('MEMORIAL', 'NNP'), 'O'),
(('HOSPITAL', 'NNP'), 'O'),
(('is', 'VBZ'), 'O'),
(('located', 'VBN'), 'O'),
(('at', 'IN'), 'O'),
(('2449', 'CD'), 'B-GPE'),
(('THIRD', 'NNP'), 'I-GPE'),
(('STREET', 'NNP'), 'I-GPE'),
((',', ','), 'I-GPE'),
(('GA', 'NNP'), 'I-GPE')]]
但是 spacy 的训练格式是这样的:
TRAIN_DATA =[ ("Pizza is a common fast food.", {"entities": [(0, 5, "FOOD")]}),
("Pasta is an italian recipe", {"entities": [(0, 5, "FOOD")]}) ]
我应该怎么做才能将我的 pickle 文件转换为 spacy 格式?
【问题讨论】: