Reading numeric values from complex text files

1 view (last 30 days)
I know this question has been asked too many times before, but I'm attempting to read numeric values from a text file that has the following format:
2.00000000000000e+001 (-2.58645139980833e+001dB,-3.39749468897168e+001°)
(This format comes from LTspiceIV AC analysis, edit: see attached .txt for example file). I have tried using different functions to read this (dlmread, fscanf, textscan) but with no success. The method I tried was to use multiple delimiters e.g.
{'\t','dB','°','(',')',','}
textscan(filename,'%f %f %f','HeaderLines',1,'Delimiter',{'\t','\n','dB','°','(',')',','});
and also grouped e.g.
textscan(filename,'%f %f %f','HeaderLines',1,'Delimiter',{'\t(','dB,','°)'});
and also with a format specifier including the whole line make-up in fscanf e.g.
'%f\t(%fdB,%f°)'
I also need to skip 1 header row. Where am I going wrong?
  1 Comment
Stephen23
Stephen23 on 7 Oct 2015
Edited: Stephen23 on 7 Oct 2015
Can you please upload the complete file. Without this it is difficult to know how the lines are arranged, how the values repeat, the format of the lines, and other information that we need to know how the file should be parsed.
Your title describes them, as being "complex text files", but we do no have any information on their format.
You can upload a file by clicking on the paperclip button and then both the Choose file and Attach file buttons.

Sign in to comment.

Accepted Answer

Jeremy Hughes
Jeremy Hughes on 8 Oct 2015
Edited: Jeremy Hughes on 8 Oct 2015
Hi Ben,
I found this worked. (of course I used a string and not a file id, but it works the same)
textscan(fid,'%f(%fdB%f°)','Delimiter',{'\t',','},'HeaderLines',1)
Good Luck,
Jeremy
  1 Comment
Ben Holmes
Ben Holmes on 8 Oct 2015
Thanks Jeremy, this works great. I think the issue which I was struggling with most was understanding how to format the format specifier which I haven't managed to find a comprehensive explanation for. To my best guess, delimiters should separate the values, but you can also include other repeated characters in the specifier which you want to ignore?

Sign in to comment.

More Answers (1)

Thorsten
Thorsten on 7 Oct 2015
You can process individual lines using
s = fgets(fid);
data(i,:) = sscanf(s, '%f (%fdB, %f)');
  2 Comments
Ben Holmes
Ben Holmes on 7 Oct 2015
Thanks! This does work but I was hoping for something more elegant so that I didn't have to iterate. My current code now involves a while loop iterating until the end of the file which ends up with a matrix with constantly changing size:
fileID = fopen(filename,'r');
% Discard header line;
s = fgets(fileID);
n=1; % Setup index
s = fgets(fileID); % Get first line
% While not at the end of the file
while(s ~= -1)
% Extract numbers
data(:,n) = sscanf(s,'%f\t(%fdB,%f°)');
n=n+1; % Increment index
s = fgets(fileID); % get next line
end
It will do the trick but is not good practice...
Thorsten
Thorsten on 7 Oct 2015
Edited: Thorsten on 7 Oct 2015
The problem is due to the degree sign in the file. Without it, you can use
data = textscan(fopen('code.m'), '%f\t(%fdB,%f)', 'Headerlines', 1)

Sign in to comment.

Products

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!