【发布时间】:2015-11-14 06:07:28
【问题描述】:
我有 .TX0 文件(某种 csv txt 文件)并已通过 python .readlines()、open(filename, 'w') 等方法将其转换为 .txt 文件。我有这个新保存的 txt 文件,但是当我尝试将其转换为数据框时,它只给了我一列。 txt文件如下:
Empty DataFrame
Columns: [ '"Software Version:", 6.3.2.0646, Date:, 19/08/2015 09:26:04\n', '"Reprocess Number:", vma2: 261519, Unnamed: 7, \n', '"Sample Name:", , Data Acquisition Time:, 18/08/2015 17:23:23\n', '"Instrument Name:", natural gas (PE ASXL-TCD/FID), Channel:, B\n', '"Rack/Vial:", 0, 0.1, Operator:, joey.walker\n', '"Sample Amount:", 1.000000, Dilution Factor:, 1.000000\n', '"Cycle:", 1, Result File :, \\\\vma2\\TotalChrom\11170_he_tcd001.rst \n', '"Sequence File :", \\\\vma\C1_C2_binary.seq \n', '"===================================================================================================================================="\n', '""\n', '""\n'.1, '"condensate analysis (HP4890 Optic - FID)"\n', '"Peak", Component, Time, Area, Height, BL\n', '"#", Name, [min], [uV*sec], [uV], \n'.1, '------, ------, ------.1, ------.2, ------.3, ------\n', '1, Unnamed: 55, 0.810, 706.42, 304.38, *BB\n', '2, CH4, 0.900, 1113518.24, 495918.41, *BB\n'.1, '3, C2H6, 1.373, 901670.23, 295381.12, *BB\n'.2, '"", Unnamed: 73, Unnamed: 74, ------.4, ------.5, \n'.2, '"".1, Unnamed: 79, Unnamed: 80, 2015894.89, 791603.91, \n'.3, '"Missing Component Report"\n', '"Component", Expected Retention (Calibration File)\n', '------.1, ------\n'.1, '"All components were found"\n', '"Report stored in ASCII file :", C:\\Shared Folders\\TotalChrom\\11170_he_tcd001.TX0 \n']]
Index: []
为了方便阅读:
空数据框
列:['“软件版本:”,6.3.2.0646,日期:,19/08/2015 09:26:04\n', '"重新处理编号:", vma2: 261519, 未命名: 7, \n', '"样品名称:", , 数据采集时间:, 18/08/2015 17:23:23\n', '"仪器名称:", 天然气 (PE ASXL-TCD/FID), 通道:, B\n', '"Rack/Vial:", 0, 0.1, Operator:, joey.walker\n', '"样品量:", 1.000000, 稀释因子:, 1.000000\n', '"Cycle:", 1, 结果文件:, \\vma2\TotalChrom\data\Joey\Binary_Mixtures\Std1\11170_he_tcd001.rst \n', '"序列文件:", \\vma2\TotalChrom\sequences\Joey\C1_C2_binary.seq \n', '"================================================= ==================================================== ==================================="\n', '""\n', '""\ n'.1, '"冷凝水分析 (HP4890 光学 - FID)"\n', '"Peak", 分量, 时间, 面积, 高度, BL\n', '"#", Name, [min], [uV*sec], [uV], \n'.1, '------, ------, ------.1, ------.2, - -----.3, ------\n', '1, 未命名: 55, 0.810, 706.42, 304.38, *BB\n', '2, CH4, 0.900, 1113518.24, 495918.41, *BB\n'.1, '3 , C2H6, 1.373, 901670.23, 295381.12, *BB\n'.2, '"", 未命名: 73, 未命名: 74, ------.4, ------.5, \n' .2, '"".1, Unnamed: 79, Unnamed: 80, 2015894.89, 791603.91, \n'.3, '"Missing Component Report"\n', '"Component", Expected Retention (Calibration File)\n ', '-----.1, ------\n'.1, '"找到所有组件"\n', '"报告存储在 ASCII 文件中:", C:\共享 文件夹\TotalChrom\data\Joey\Binary_Mixtures\Std1\11170_he_tcd001.TX0 \n']] 索引:[]
如您所见,这是逗号分隔的。有没有办法将此文本传输到逗号分隔的数据框?
谢谢。
J
【问题讨论】:
-
问题是由于它将文本文件视为一列,因此无法从中构造数据框。有没有办法将文本文件拆分为逗号分隔的列和行?