【问题标题】:How to create a Pandas DataFrame from a list of lists?如何从列表列表中创建 Pandas DataFrame?
【发布时间】:2019-08-31 18:58:06
【问题描述】:

我有一个列表列表。这是它的一个示例。

[[], ['1', 'Anthony Taylor', '6', '11.17', '0.68', '0.17', '1.83', '11', '0.17', '1'], ['2', 'Jonathan Moss', '4', '12.75', '0.73', '0.00', '2.00', '8', '0.00', '0'], ['3', 'Martin Atkinson', '3', '11.00', '0.69', '0.67', '1.67', '5', '0.00', '0'], ['4', 'Graham Scott', '3', '10.00', '0.67', '0.00', '0.33', '1', '0.00', '0'], ['5', 'Chris Kavanagh', '3', '10.33', '0.74', '0.00', '1.67', '5', '0.00', '0'], ['6', 'Michael Oliver', '2', '12.00', '0.62', '1.00', '2.00', '4', '0.00', '0'], ['7', 'Paul Tierney', '2', '12.50', '0.66', '0.00', '2.50', '5', '0.00', '0'], ['8', 'Mike Dean', '2', '14.50', '0.73', '0.50', '3.00', '6', '0.00', '0'], ['9', 'Andre Marriner', '2', '8.50', '0.53', '0.00', '2.00', '4', '0.00', '0'], ['10', 'Kevin Friend', '2', '9.50', '0.50', '0.00', '3.00', '6', '0.00', '0'], ['11', 'Craig Pawson', '1', '9.00', '0.45', '0.00', '1.00', '1', '0.00', '0'], ['12', 'Stuart Attwell', '1', '9.00', '0.75', '0.00', '2.00', '2', '0.00', '0'], ['13', 'Lee Probert', '1', '11.00', '0.73', '0.00', '0.00', '0', '0.00', '0'], ['', 'Total / Average', '32', '11.09', '0.66', '0.19', '1.81', '58', '0.03', '1']]

我如何创建一个类似于这种格式的 Pandas DataFrame(每一行都有一个单独的名称)?

['1', 'Anthony Taylor', '6', '11.17', '0.68', '0.17', '1.83', '11', '0.17', '1']
['2', 'Jonathan Moss', '4', '12.75', '0.73', '0.00', '2.00', '8', '0.00', '0']
['3', 'Martin Atkinson', '3', '11.00', '0.69', '0.67', '1.67', '5', '0.00', '0']
['4', 'Graham Scott', '3', '10.00', '0.67', '0.00', '0.33', '1', '0.00', '0']
['5', 'Chris Kavanagh', '3', '10.33', '0.74', '0.00', '1.67', '5', '0.00', '0']
['6', 'Michael Oliver', '2', '12.00', '0.62', '1.00', '2.00', '4', '0.00', '0']
['7', 'Paul Tierney', '2', '12.50', '0.66', '0.00', '2.50', '5', '0.00', '0']
['8', 'Mike Dean', '2', '14.50', '0.73', '0.50', '3.00', '6', '0.00', '0']
['9', 'Andre Marriner', '2', '8.50', '0.53', '0.00', '2.00', '4', '0.00', '0']
['10', 'Kevin Friend', '2', '9.50', '0.50', '0.00', '3.00', '6', '0.00', '0']
['11', 'Craig Pawson', '1', '9.00', '0.45', '0.00', '1.00', '1', '0.00', '0']
['12', 'Stuart Attwell', '1', '9.00', '0.75', '0.00', '2.00', '2', '0.00', '0']
['13', 'Lee Probert', '1', '11.00', '0.73', '0.00', '0.00', '0', '0.00', '0']
['', 'Total / Average', '32', '11.09', '0.66', '0.19', '1.81', '58', '0.03', '1']

在我的研究过程中,this 的回答对我有所帮助,但它仍然只将所有内容放在一个列中,而我希望为特定列表的每个元素设置一个单独的列。 有没有办法做到这一点?

【问题讨论】:

    标签: python python-3.x pandas list


    【解决方案1】:

    您可以过滤掉第一个子列表并传递给 DataFrame 构造函数,也可以通过另一个列表设置列名:

    L = [[], ['1', 'Anthony Taylor', '6', '11.17', '0.68', '0.17', '1.83', '11', '0.17', '1'], ['2', 'Jonathan Moss', '4', '12.75', '0.73', '0.00', '2.00', '8', '0.00', '0'], ['3', 'Martin Atkinson', '3', '11.00', '0.69', '0.67', '1.67', '5', '0.00', '0'], ['4', 'Graham Scott', '3', '10.00', '0.67', '0.00', '0.33', '1', '0.00', '0'], ['5', 'Chris Kavanagh', '3', '10.33', '0.74', '0.00', '1.67', '5', '0.00', '0'], ['6', 'Michael Oliver', '2', '12.00', '0.62', '1.00', '2.00', '4', '0.00', '0'], ['7', 'Paul Tierney', '2', '12.50', '0.66', '0.00', '2.50', '5', '0.00', '0'], ['8', 'Mike Dean', '2', '14.50', '0.73', '0.50', '3.00', '6', '0.00', '0'], ['9', 'Andre Marriner', '2', '8.50', '0.53', '0.00', '2.00', '4', '0.00', '0'], ['10', 'Kevin Friend', '2', '9.50', '0.50', '0.00', '3.00', '6', '0.00', '0'], ['11', 'Craig Pawson', '1', '9.00', '0.45', '0.00', '1.00', '1', '0.00', '0'], ['12', 'Stuart Attwell', '1', '9.00', '0.75', '0.00', '2.00', '2', '0.00', '0'], ['13', 'Lee Probert', '1', '11.00', '0.73', '0.00', '0.00', '0', '0.00', '0'], ['', 'Total / Average', '32', '11.09', '0.66', '0.19', '1.81', '58', '0.03', '1']]
    

    cols = ['id','name','a','b','c','d','e','f','g','h']
    df = pd.DataFrame(L[1:], columns=cols)
    print(df)
        id             name   a      b     c     d     e   f     g  h
    0    1   Anthony Taylor   6  11.17  0.68  0.17  1.83  11  0.17  1
    1    2    Jonathan Moss   4  12.75  0.73  0.00  2.00   8  0.00  0
    2    3  Martin Atkinson   3  11.00  0.69  0.67  1.67   5  0.00  0
    3    4     Graham Scott   3  10.00  0.67  0.00  0.33   1  0.00  0
    4    5   Chris Kavanagh   3  10.33  0.74  0.00  1.67   5  0.00  0
    5    6   Michael Oliver   2  12.00  0.62  1.00  2.00   4  0.00  0
    6    7     Paul Tierney   2  12.50  0.66  0.00  2.50   5  0.00  0
    7    8        Mike Dean   2  14.50  0.73  0.50  3.00   6  0.00  0
    8    9   Andre Marriner   2   8.50  0.53  0.00  2.00   4  0.00  0
    9   10     Kevin Friend   2   9.50  0.50  0.00  3.00   6  0.00  0
    10  11     Craig Pawson   1   9.00  0.45  0.00  1.00   1  0.00  0
    11  12   Stuart Attwell   1   9.00  0.75  0.00  2.00   2  0.00  0
    12  13      Lee Probert   1  11.00  0.73  0.00  0.00   0  0.00  0
    13      Total / Average  32  11.09  0.66  0.19  1.81  58  0.03  1
    

    【讨论】:

    • 这对我有用,谢谢。如果可以接受两个答案,我会这样做的:)
    【解决方案2】:

    使用pandas.DataFrame.from_records:

    import pandas as pd
    
    df = pd.DataFrame.from_records(x[1:]))
    print(df)
    

    输出:

        0                 1   2      3     4     5     6   7     8  9
    0    1   Anthony Taylor   6  11.17  0.68  0.17  1.83  11  0.17  1
    1    2    Jonathan Moss   4  12.75  0.73  0.00  2.00   8  0.00  0
    2    3  Martin Atkinson   3  11.00  0.69  0.67  1.67   5  0.00  0
    3    4     Graham Scott   3  10.00  0.67  0.00  0.33   1  0.00  0
    4    5   Chris Kavanagh   3  10.33  0.74  0.00  1.67   5  0.00  0
    5    6   Michael Oliver   2  12.00  0.62  1.00  2.00   4  0.00  0
    6    7     Paul Tierney   2  12.50  0.66  0.00  2.50   5  0.00  0
    7    8        Mike Dean   2  14.50  0.73  0.50  3.00   6  0.00  0
    8    9   Andre Marriner   2   8.50  0.53  0.00  2.00   4  0.00  0
    9   10     Kevin Friend   2   9.50  0.50  0.00  3.00   6  0.00  0
    10  11     Craig Pawson   1   9.00  0.45  0.00  1.00   1  0.00  0
    11  12   Stuart Attwell   1   9.00  0.75  0.00  2.00   2  0.00  0
    12  13      Lee Probert   1  11.00  0.73  0.00  0.00   0  0.00  0
    13      Total / Average  32  11.09  0.66  0.19  1.81  58  0.03  1
    

    【讨论】:

    • 这是我今天了解到的关于 Pandas 的新知识。非常感谢
    猜你喜欢
    • 2021-08-31
    • 1970-01-01
    • 2015-03-09
    • 2020-03-18
    • 2020-05-04
    • 1970-01-01
    • 1970-01-01
    • 2020-12-26
    • 1970-01-01
    相关资源
    最近更新 更多