【发布时间】:2019-04-11 21:37:47
【问题描述】:
具有以下结构的数据框 -
ID text
0 Language processing in python th is great
1 Relace the string
字典命名自定义修复
{'Relace': 'Replace', 'th' : 'three'}
尝试了代码,输出如下 - 当前输出 -
ID text
0 Language processing in pythirdon three is great
1 Replace threee string
代码:
def multiple_replace(dict, text):
# Create a regular expression from the dictionary keys
regex = re.compile("(%s)" % "|".join(map(re.escape, dict.keys())))
# For each match, look-up corresponding value in dictionary
return regex.sub(lambda mo: dict[mo.string[mo.start():mo.end()]], text)
df['col1'] = df.apply(lambda row: multiple_replace(custom_fix, row['text']), axis=1)
预期输出 -
ID text
0 Language processing in python three is great
1 Replace the string
【问题讨论】:
标签: python dictionary nlp lookup