【问题标题】:UnicodeDecodeError utf-8 can't decode byteUnicodeDecodeError utf-8 无法解码字节
【发布时间】:2020-03-09 14:20:12
【问题描述】:

combined_csv = pd.concat([pd.read_csv(f) for f in all_filenames]) 无法正常显示

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 12: invalid start byte

全部代码如下:

import os
import glob
import pandas as pd
os.chdir("S/Last Processed Data")

extension = 'csv'
all_filenames = [i for i in glob.glob('*.{}'.format(extension))]

combined_csv = pd.concat([pd.read_csv(f) for f in all_filenames])

combined_csv.to_csv( "combined_csv.csv", index=False, encoding='utf-8-sig')

【问题讨论】:

  • 您需要将文件的实际编码传递给read_csvencoding 参数。默认为utf-8

标签: python pandas


【解决方案1】:

读取文件时需要传递编码参数:

combined_csv = pd.concat([pd.read_csv(f, encoding=...) for f in all_filenames])

如果utf-8 不起作用,请尝试ISO-8859-1

【讨论】:

    猜你喜欢
    • 2018-04-22
    • 2019-02-24
    • 1970-01-01
    • 2017-09-27
    • 1970-01-01
    • 2014-08-29
    • 2019-05-07
    • 1970-01-01
    • 2021-12-06
    相关资源
    最近更新 更多