一个更具扩展性的版本是使用转换字典:
my_dict = {'a':0, 'b':1, 'c':2}
x = np.vectorize(my_dict.get)(x)
输出:
[0 1 0 2 1 1 2 0]
另一种方法是:
np.select([x==i for i in ['a','b','c']], np.arange(3))
对于小字典@ypno 的回答会更快。对于更大的字典,请使用此答案。
时间比较:
三进制字母:
lst = ['a','b','c']
my_dict = {k: v for v, k in enumerate(lst)}
#@Ehsan's solution1
def m1(x):
return np.vectorize(my_dict.get)(x)
#@ypno's solution
def m2(x):
return np.where(x == 'a', 0, (np.where(x == 'b', 1, 2)))
#@SteBog's solution
def m3(x):
y = np.where(x=='a', 0, x)
y = np.where(x=='b', 1, y)
y = np.where(x=='c', 2, y)
return y.astype(np.integer)
#@Ehsan's solution 2 (also suggested by user3483203 in comments)
def m4(x):
return np.select([x==i for i in lst], np.arange(len(lst)))
#@juanpa.arrivillaga's solution suggested in comments
def m5(x):
return np.array([my_dict[i] for i in x.tolist()])
in_ = [np.random.choice(lst, size = n) for n in [10,100,1000,10000,100000]]
8 个字母的相同分析:
lst = ['a','b','c','d','e','f','g','h']