MATLAB 阅读文件答案

【问题标题】：MATLAB READING FILEMATLAB 阅读文件
【发布时间】：2013-10-08 11:27:58
【问题描述】：

我无法读取文件，基本上，我想以某种方式摆脱不必要的文本，只打印出一个只包含数字的矩阵。

1 1 -1 1 1 -1 -1 1 1 1 -1 1

1 -1 1 -1 -1 1 1 1 1 -1 1 1

sgfgdf

1 1 1 -1 1 -1 1 -1 -1 -1 -1 1

rtydsfdsfds

1 -1 1 -1 -1 -1 1 1 -1 -1 -1 1

1 1 -1 1 1 -1 -1 -1 1 -1 1 -1

1 -1 1 1 1 1 1 -1 -1 -1 1 -1

到目前为止我尝试过的是：

d = fopen('transmission_data.txt')

R = textscan(d, '%f %f', 'headerLines', 3:5)

fclose(d)

但这不起作用，因为我只需要为 textscan 输入一个数字，例如“3”，这将去掉前 3 行，但我想特别去掉第三和第五行。也许还有其他方法可以读取数据？帮助将不胜感激:)

*注意第一行文本和第二行数字之间有一个空行

【问题讨论】：

标签： matlab textscan

【解决方案1】：

通过fileread 读取文件，将其拆分为regexp 并要求textscan 查找所有数字：

C = regexp(fileread('transmission_data.txt'), '(\n|\r)*', 'split');
C = C(~cellfun('isempty', C));
D = cellfun(@(c) textscan(c, '%f'), C);
R = [D{:}].';

这里的重点是，当textscan 遇到不包含数字的行时，它会返回一个空矩阵，因此当您连接结果向量时，您只会得到来自非字符串行的向量。对于您的示例，此代码返回

>> R
R =
     1     1    -1     1     1    -1    -1     1     1     1    -1     1
     1    -1     1    -1    -1     1     1     1     1    -1     1     1
     1     1     1    -1     1    -1     1    -1    -1    -1    -1     1
     1    -1     1    -1    -1    -1     1     1    -1    -1    -1     1
     1     1    -1     1     1    -1    -1    -1     1    -1     1    -1
     1    -1     1     1     1     1     1    -1    -1    -1     1    -1

【讨论】：

我似乎遇到了错误？？？使用 ==> textscan 时出错第一个输入不能为空。 ==> @(c)textscan(c,'%f') 错误 ==> q1a 在 38 D = cellfun(@(c) textscan(c, '%f'), C);
@GeorgeRandall 出于某种原因你在C 中有空字符串。我添加了一行来删除这些。它现在应该可以工作了。

【解决方案2】：

这是一种方法：

使用导入向导
根据需要选择元胞数组或矩阵
选择删除具有无效值的行
点击导入

如果你必须自动化它，你可以让 importwizard 为你生成代码。

这是当我在名为 test.txt 的文件中对您的数据使用导入向导时生成的（相当冗长的）代码

%% Import data from text file.
% Script for importing data from the following text file:
%
%    \\invol-vs-fp1\Users$\d.jaheruddin\MATLAB\test.txt
%
% To extend the code to different selected data or a different text file,
% generate a function instead of a script.

% Auto-generated by MATLAB on 2013/10/08 14:39:30

%% Initialize variables.
filename = 'test.txt';
delimiter = ' ';

%% Read columns of data as strings:
% For more information, see the TEXTSCAN documentation.
formatSpec = '%s%s%s%s%s%s%s%s%s%s%s%s%[^\n\r]';

%% Open the text file.
fileID = fopen(filename,'r');

%% Read columns of data according to format string.
% This call is based on the structure of the file used to generate this
% code. If an error occurs for a different file, try regenerating the code
% from the Import Tool.
dataArray = textscan(fileID, formatSpec, 'Delimiter', delimiter, 'MultipleDelimsAsOne', true,  'ReturnOnError', false);

%% Close the text file.
fclose(fileID);

%% Convert the contents of columns containing numeric strings to numbers.
% Replace non-numeric strings with NaN.
raw = [dataArray{:,1:end-1}];
numericData = NaN(size(dataArray{1},1),size(dataArray,2));

for col=[1,2,3,4,5,6,7,8,9,10,11,12]
    % Converts strings in the input cell array to numbers. Replaced non-numeric
    % strings with NaN.
    rawData = dataArray{col};
    for row=1:size(rawData, 1);
        % Create a regular expression to detect and remove non-numeric prefixes and
        % suffixes.
        regexstr = '(?<prefix>.*?)(?<numbers>([-]*(\d+[\,]*)+[\.]{0,1}\d*[eEdD]{0,1}[-+]*\d*[i]{0,1})|([-]*(\d+[\,]*)*[\.]{1,1}\d+[eEdD]{0,1}[-+]*\d*[i]{0,1}))(?<suffix>.*)';
        try
            result = regexp(rawData{row}, regexstr, 'names');
            numbers = result.numbers;

            % Detected commas in non-thousand locations.
            invalidThousandsSeparator = false;
            if any(numbers==',');
                thousandsRegExp = '^\d+?(\,\d{3})*\.{0,1}\d*$';
                if isempty(regexp(thousandsRegExp, ',', 'once'));
                    numbers = NaN;
                    invalidThousandsSeparator = true;
                end
            end
            % Convert numeric strings to numbers.
            if ~invalidThousandsSeparator;
                numbers = textscan(strrep(numbers, ',', ''), '%f');
                numericData(row, col) = numbers{1};
                raw{row, col} = numbers{1};
            end
        catch me
        end
    end
end


%% Exclude rows with non-numeric cells
J = ~all(cellfun(@(x) (isnumeric(x) || islogical(x)) && ~isnan(x),raw),2); % Find rows with non-numeric cells
raw(J,:) = [];

%% Create output variable
test = cell2mat(raw);
%% Clear temporary variables
clearvars filename delimiter formatSpec fileID dataArray ans raw numericData col rawData row regexstr result numbers invalidThousandsSeparator thousandsRegExp me J;

【讨论】：

尽管我想使用 importwizard，但实际上我必须使用另一种需要我编写代码的方式
@GeorgeRandall 如果你想写代码，我建议你基于 importwizard 生成的代码，可能有几行以importdata 为核心。如果您无法解释代码，请尝试阅读此命令。
importwizard 和 importdata 不起作用，因为它只能读取前 2 行数据
@GeorgeRandall 查看我的更新帖子，包括示例。