【问题标题】:show text description in x axis rather than numbers using pandas matplotlib使用 pandas matplotlib 在 x 轴上显示文本描述而不是数字
【发布时间】:2018-11-18 19:26:27
【问题描述】:

我编写了代码以将我的数据集显示为条形图。这是我的代码: 我以这种方式从 .csv 文件中读取了我的数据:

names = ["Clinic Number","Question Text","Answer Text","Answer Date","Class"]
data = pd.read_csv('ADLCI.csv', names = names)

然后

grouped = data.groupby(['Question Text','Answer Text']).size().reset_index(name='counts')

import matplotlib.pyplot as plt
plt.figure()

grouped.plot(kind='bar', title ="Functional Status Count", figsize=(15, 10), legend=True, fontsize=12)
plt.show()

这也是我想要显示为条形图的数据框的结果。

                         Question Text Answer Text  counts
0                          CI function          No     513
1                          CI function         Yes     373
2                             bathing?          No    2827
3                             bathing?         Yes     408
4                            dressing?          No    2824
5                            dressing?         Yes     423
6                              feeding          No    2851
7                              feeding         Yes     160
8                         housekeeping          No    2803
9                         housekeeping         Yes     717
10                      preparing food          No    2604
11                      preparing food         Yes     593
12  responsibility for own medications          No    2793
13  responsibility for own medications         Yes     625
14                            shopping          No      35
15                            shopping         Yes      49
16                           toileting          No    2843
17                           toileting         Yes     239
18                        transferring          No    2834
19                        transferring         Yes     904
20                using transportation          No    2816
21                using transportation         Yes     483

第一列数字是自动添加的,实际上我的数据集中没有。

这是此代码创建的条形图。

正如您在条形图中看到的,所有条形都具有相同的颜色。 x轴也是我说的数字。但我不想要这种形状。 我想要的东西看起来像this link:

我将解释我想要对我在这里上传的图片进行哪些更改。

x 轴上应该是 Question Text 列,而不是 0 和 1 ...。详细地说,x 轴上的条形图将是:正如我们在数据框中看到的,有两个 CI function 一个用于yes,一个用于No。我想要 CI function 而不是 0 和 1 有两种不同的颜色,一种指向 No 1596 的计数,另一种指向 Yes 1376 的颜色。

下一项将是bathing?,同样一个条指向17965,另一个指向702

这样我应该有将近十个条,每个条包含两个条,就像我上面放置的链接一样。

我尝试了类似上述链接的各种方法,但我的没有这样显示或出现错误。

谢谢:)

更新 1 当我应用您的代码时:

import matplotlib.pyplot as plt
data.groupby(['Question Text','Answer Text']).sum().unstack().plot(kind='bar')
plt.show()

我收到了这个错误:

  Traceback (most recent call last):
  File "C:/Users/M193053/PycharmProjects/ADL-distribution/test.py", line 52, in <module>
    data.groupby(['Question Text','Answer Text']).sum().unstack().plot(kind='bar')
  File "C:\Users\M193053\Documents\Anaconda3\envs\conda3\lib\site-packages\pandas\plotting\_core.py", line 2941, in __call__
    sort_columns=sort_columns, **kwds)
  File "C:\Users\M193053\Documents\Anaconda3\envs\conda3\lib\site-packages\pandas\plotting\_core.py", line 1977, in plot_frame
    **kwds)
  File "C:\Users\M193053\Documents\Anaconda3\envs\conda3\lib\site-packages\pandas\plotting\_core.py", line 1804, in _plot
    plot_obj.generate()
  File "C:\Users\M193053\Documents\Anaconda3\envs\conda3\lib\site-packages\pandas\plotting\_core.py", line 258, in generate
    self._compute_plot_data()
  File "C:\Users\M193053\Documents\Anaconda3\envs\conda3\lib\site-packages\pandas\plotting\_core.py", line 373, in _compute_plot_data
    'plot'.format(numeric_data.__class__.__name__))
TypeError: Empty 'DataFrame': no numeric data to plot

但是当我使用这段代码时:

grouped = data.groupby(['Question Text','Answer Text']).size().reset_index(name='counts')

import matplotlib.pyplot as plt
grouped.groupby(['Question Text','Answer Text']).sum().unstack().plot(kind='bar')
plt.show()

这样对我来说似乎没问题:

但应用两个 groupby 似乎不合逻辑。正因为如此,我仍然不确定我应该怎么做。 感谢您抽出时间:)

更新两个

这是我的数据框,已通过以下代码获得:

grouped = data.groupby(['Question Text','Answer Text']).size().reset_index(name='counts')

0                          CI function          No     513
1                          CI function         Yes     373
2                             bathing?          No    2827
3                             bathing?         Yes     408
4                            dressing?          No    2824
5                            dressing?         Yes     423
6                              feeding          No    2851
7                              feeding         Yes     160
8                         housekeeping          No    2803
9                         housekeeping         Yes     717
10                      preparing food          No    2604
11                      preparing food         Yes     593
12  responsibility for own medications          No    2793
13  responsibility for own medications         Yes     625
14                            shopping          No      35
15                            shopping         Yes      49
16                           toileting          No    2843
17                           toileting         Yes     239
18                        transferring          No    2834
19                        transferring         Yes     904
20                using transportation          No    2816
21                using transportation         Yes     483

这个数据框,来自你的代码和我的代码:

grouped = data.groupby(['Question Text','Answer Text']).size().reset_index(name='counts')
print(grouped)
import matplotlib.pyplot as plt
final = grouped.groupby(['Question Text','Answer Text']).sum()
print(final)


Question Text                      Answer Text        
CI function                        No              513
                                   Yes             373
bathing?                           No             2827
                                   Yes             408
dressing?                          No             2824
                                   Yes             423
feeding                            No             2851
                                   Yes             160
housekeeping                       No             2803
                                   Yes             717
preparing food                     No             2604
                                   Yes             593
responsibility for own medications No             2793
                                   Yes             625
shopping                           No               35
                                   Yes              49
toileting                          No             2843
                                   Yes             239
transferring                       No             2834
                                   Yes             904
using transportation               No             2816
                                   Yes             483

更新 3

原始数据框有 200000 行这样的:

1                             bathing?          No       3529933
2                            dressing?          No       3529933
3                              feeding          No       3529933
4                         housekeeping          No       3529933
5   responsibility for own medications          No       3529933
6                 using transportation          No       3529933
7                            toileting          No       3529933
8                         transferring          No       3529933
10                      preparing food          No       3529933
11                            bathing?         NaN       2864155
12                           dressing?         NaN       2864155
13                             feeding         NaN       2864155
14                        housekeeping         NaN       2864155
15  responsibility for own medications         NaN       2864155
16                           toileting         NaN       2864155
17                        transferring         NaN       2864155
19                      preparing food         NaN       2864155
20                using transportation         Yes       2864155
21                            bathing?         NaN       2921299
22                           dressing?         NaN       2921299

【问题讨论】:

    标签: python pandas matplotlib data-visualization


    【解决方案1】:

    你可以这样做(df 是你写的数据框):

    import matplotlib
    import matplotlib.pyplot as plt
    matplotlib.style.use('ggplot')
    df.groupby(['Question Text','Answer Text']).sum().unstack().plot(kind='bar')
    plt.show()
    

    输出: 你也可以这样旋转xlabel:

    plt.xticks(rotation=45)
    

    但我建议您将标签缩短以使其更清晰

    【讨论】:

    猜你喜欢
    • 2021-06-08
    • 2016-02-16
    • 1970-01-01
    • 2013-05-18
    • 1970-01-01
    • 1970-01-01
    • 2020-03-11
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多