【问题标题】:Replacing a value in a pandas multiindex dataframe替换熊猫多索引数据框中的值
【发布时间】:2017-07-23 20:18:50
【问题描述】:

我想更改我的 pandas 数据框中的一个值,但我想我误解了索引的工作原理。

import pandas as pd idx = pd.IndexSlice df.loc[idx[(0, 2006.0, '01019_13055_01073_01009_01055')],idx[('moment_25','P517')]]

我得到了输出

Out[376]: 
moment_25  P517    0.665873
Name: (0, 2006.0, 01019_13055_01073_01009_01055), dtype: float64

我想将 df 中的值 0.665873 更改为 1。我试过了

df.ix[idx[(0, 2006.0,'01019_13055_01073_01009_01055')],idx[('moment_25','P517')]]=1

但我得到了错误

Exception: cannot handle a non-unique multi-index!

我试图用示例数据框复制问题,但无济于事。

arrays = [['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two']]
tuples = list(zip(*arrays))
index = pd.MultiIndex.from_tuples(tuples, names=['first', 'second'])
df = pd.DataFrame(np.random.randn(3, 8), index=['A', 'B', 'C'], columns=index)
df.loc[idx['A'],idx[('baz','one')]]

Out[391]: -0.17935592549360641

我认为问题在于,当我使用实际数据时,我的输出得到了一个系列,但是当我使用练习数据时,我得到了一个浮点数。为什么我得到的是那个系列而不是浮点数 0.665873?

【问题讨论】:

  • 发布您的数据框样本?

标签: python pandas dataframe multi-index


【解决方案1】:

使用 set_value 更改数据框中的值。示例如下:

import pandas as pd
import numpy as np
dfp = pd.DataFrame({'A' : [np.NaN,np.NaN,3,4,5,5,3,1,5,np.NaN], 
                    'B' : [1,0,3,5,0,0,np.NaN,9,0,0], 
                    'C' : ['Pharmacy of IDAHO','Access medicare arkansas','NJ Pharmacy','Idaho Rx','CA Herbals','Florida Pharma','AK RX','Ohio Drugs','PA Rx','USA Pharma'], 
                    'D' : [123456,123456,1234567,12345678,12345,12345,12345678,123456789,1234567,np.NaN],
                    'E' : ['Assign','Unassign','Assign','Ugly','Appreciate','Undo','Assign','Unicycle','Assign','Unicorn',]})
print(dfp)


  A    B                         C            D           E
0  NaN  1.0         Pharmacy of IDAHO     123456.0      Assign
1  NaN  0.0  Access medicare arkansas     123456.0    Unassign
2  3.0  3.0               NJ Pharmacy    1234567.0      Assign
3  4.0  5.0                  Idaho Rx   12345678.0        Ugly
4  5.0  0.0                CA Herbals      12345.0  Appreciate
5  5.0  0.0            Florida Pharma      12345.0        Undo
6  3.0  NaN                     AK RX   12345678.0      Assign
7  1.0  9.0                Ohio Drugs  123456789.0    Unicycle
8  5.0  0.0                     PA Rx    1234567.0      Assign
9  NaN  0.0                USA Pharma          NaN     Unicorn
   #                      ^^Check HEERE^^

变化和输出:

dfp.set_value(9, 'C', 10)
print(dfp)

     A    B                         C            D           E
0  NaN  1.0         Pharmacy of IDAHO     123456.0      Assign
1  NaN  0.0  Access medicare arkansas     123456.0    Unassign
2  3.0  3.0               NJ Pharmacy    1234567.0      Assign
3  4.0  5.0                  Idaho Rx   12345678.0        Ugly
4  5.0  0.0                CA Herbals      12345.0  Appreciate
5  5.0  0.0            Florida Pharma      12345.0        Undo
6  3.0  NaN                     AK RX   12345678.0      Assign
7  1.0  9.0                Ohio Drugs  123456789.0    Unicycle
8  5.0  0.0                     PA Rx    1234567.0      Assign
9  NaN  0.0                        10          NaN     Unicorn
#                             ^^The CHANGE^^

如果您特别询问索引,请查看here

使用上面链接的方法:

dfp.ix[0, 'C'] = 'x'
#                         vv Check Below vv
     A    B                         C            D           E
0  NaN  1.0                         x     123456.0      Assign
1  NaN  0.0  Access medicare arkansas     123456.0    Unassign
2  3.0  3.0               NJ Pharmacy    1234567.0      Assign
3  4.0  5.0                  Idaho Rx   12345678.0        Ugly
4  5.0  0.0                CA Herbals      12345.0  Appreciate
5  5.0  0.0            Florida Pharma      12345.0        Undo
6  3.0  NaN                     AK RX   12345678.0      Assign
7  1.0  9.0                Ohio Drugs  123456789.0    Unicycle
8  5.0  0.0                     PA Rx    1234567.0      Assign
9  NaN  0.0                        10          NaN     Unicorn

【讨论】:

  • 感谢您的回复,但我不确定这是否能回答我的问题。您的第二个建议 dfp.ix[0, 'C'] = 'x' 与上面给出非唯一索引错误的代码有何不同?
猜你喜欢
  • 2013-12-28
  • 2016-07-06
  • 2019-05-21
  • 1970-01-01
  • 2019-02-02
  • 1970-01-01
  • 1970-01-01
  • 2018-09-29
相关资源
最近更新 更多