【Question title】: How to add commas between strings inside square brackets in pandas
【Posted】: 2018-08-16 10:55:55
【Question description】:

I have a pandas DataFrame:

categories
['business $ finance'\n 'business $currency']
['entertainment $movies' 'entertainment &music']
['sports& volleyball' 'sports& football'\n 'sports$ baseball']
...............

I just want to convert the strings inside the brackets into a proper list format. The expected output is:

categories
['business $ finance','business $currency']
['entertainment $movies','entertainment &music']
['sports&volleyball','sports& football','sports$ baseball']
....................

What I have tried so far:

array_cat = []
for i in df['categories']:
    s = i[0].split()
    array_cat.append(s)

Edit

df['categories'][0] looks like this:

"['business finance#mergers & acquisitions' 'business#industries#telecom'\n 'diseases#kidney' 'gaming #offline #console#xbox'\n 'gaming #offline #playstation' 'lifestyle#shopping' 'movies#genres#drama'\n 'music#genres#hiphop/rap' 'personal finance#tools' 'technology#trending'\n 'theme#historicalsites']"

【Question discussion】:

    Tags: python python-3.x pandas


    【Solution 1】:

    Use str.findall to extract every value between single quotes:

    df['categories'] = df['categories'].str.findall(r"'(.*?)'")
    print (df)
                                              categories
    0  [business finance#mergers & acquisitions, busi...
    1        [entertainment$movies, entertainment&music]
    2  [sports&volleyball, sports&football, sports$ba...
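
As a minimal, self-contained sketch of the approach above: the sample frame below is rebuilt by hand from the rows shown in the question, so it is an assumption about what the real data looks like. It also assumes that no category name itself contains a single quote, since the pattern would otherwise split that item.

```python
import pandas as pd

# Hypothetical sample data mirroring the question: each cell is one string
# holding quoted items separated by spaces or newlines, with no commas.
df = pd.DataFrame({
    "categories": [
        "['business $ finance'\n 'business $currency']",
        "['entertainment $movies' 'entertainment &music']",
        "['sports& volleyball' 'sports& football'\n 'sports$ baseball']",
    ]
})

# Non-greedy capture of the text between each pair of single quotes.
# Each cell becomes a real Python list of strings.
df["categories"] = df["categories"].str.findall(r"'(.*?)'")

print(df["categories"].tolist())
```

Each cell then holds a list object rather than a string, so a call such as df["categories"].str.len() returns the number of categories per row.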
    

    【Discussion】:

    • @Mavrick - What does print (df['categories'].iloc[1:5].to_dict()) return? Please adjust the 1:5 slice so that it captures values containing \n.
    • @Mavrick - Can you check now?
    • @Mavrick - Super, your string from df['categories'][0] helped me a lot ;)
    • Could I ask for your help with https://stackoverflow.com/questions/51864822/matching-elements-of-pandas-column-with-column-of-another-pandas-dataframe/51869865#51869865 ? I would really appreciate it.
    • @Mavrick - Not an easy question, but I can try.