Find the 137th character in a file?
Show older comments
Hi, I have a file in MATLAB that has a lot of text. I would like to find the 137th character in the file.
For this I tried the following code, but witout any luck. How can any character at a given numbered position in a text file? Thanks
>> C = char(137)
C =
''
8 Comments
Dyuman Joshi
on 15 Jan 2024
When you use char() on numeric values, it will take the input value as a Unicode© value and output the corresponding ASCII values.
See here for reference and more information - https://in.mathworks.com/help/matlab/ref/char.html#mw_e48a8886-8c5b-4df0-b331-ea2bd6c27fbb
For 137, the output is an empty char, as you can observe above.
"For 137, the output is an empty char..."
No, it is a scalar character, its Unicode name is "Character Tabulation With Justification":
The fact that it is an unprintable control character does not mean that it is an "empty" character.
one = char(137)
isscalar(one)
isstrprop(one,'cntrl')
isempty(one) % nope, not empty.
"it will take the input value as a Unicode© value and output the corresponding ASCII values."
Not really: CHAR accepts a Unicode code point (as a decimal value) and returns a character. ASCII has nothing to do with it:
char([3734,3792,116,32,3588,350,162,105,105,32,116,275,120,116,33])
Dyuman Joshi
on 15 Jan 2024
Thank you for the correction @Stephen23. There was a part of brain telling me to cross-check it, but idk why I didn't do it.
Though, how did you get to print it if it's an unprintable character?
"how did you get to print it if it's an unprintable character?"
The box is displayed when the OS/browser/whatever cannot render the character.
Curiously, a few of the other control characters are not displayed with boxes:
char([0:15,137])
Dyuman Joshi
on 15 Jan 2024
I see, when the code is ran the output is shown as an empty char, but when I press submit and post the response, it gets converted to these boxes as shown.
I find it weird that the live editor here shows/displays it as an empty char.
Rik
on 15 Jan 2024
Another technical nit-pick: char casts to UTF-16. For most 'normal' characters (U+0000 to U+FFFF) this is equivalent to saying char accepts a Unicode code point, but there are exceptions (see the UTF-16 wiki page). Higher code points (e.g. emoji) need a different treatment.
char(128077)
This is incorrect. To get the correct output, you should provide the UTF-16 encoding:
char([55357 56397])
Dyuman Joshi
on 24 Jan 2024
Hassaan
on 26 Jan 2024
@Dyuman Joshi
Okay.
Accepted Answer
More Answers (3)
Stephen23
on 15 Jan 2024
fnm = 'theNameOfYourFile.txt';
txt = fileread(fnm);
txt(137)
8 Comments
Dyuman Joshi
on 15 Jan 2024
+1, The simplest approach.
Sergio
on 16 Jan 2024
Dyuman Joshi
on 16 Jan 2024
What is the data type of T and what is its size?
Sergio
on 16 Jan 2024
Dyuman Joshi
on 16 Jan 2024
T is a char array of the size 1x5120.
Simply use
T(137)
Sergio
on 17 Jan 2024
Dyuman Joshi
on 18 Jan 2024
@Sergio, you might have accepted the wrong answer.
Stephen23
on 18 Jan 2024
@Dyuman Joshi: I agree. The only correct answer for a MAT file would be Walter Roberson's.
Approach 1
% Open the file in read mode
fileID = fopen('yourfile.txt', 'r');
% Check if the file was opened successfully
if fileID == -1
error('File cannot be opened.');
end
% Read the entire contents of the file into a string
fileContents = fread(fileID, '*char')';
% Close the file
fclose(fileID);
% Check if the file contains at least 137 characters
if length(fileContents) >= 137
% Extract the 137th character
charAt137 = fileContents(137);
else
error('The file does not contain 137 characters.');
end
% Display the 137th character
disp(charAt137);
Make sure to replace 'yourfile.txt' with the actual name of your text file. The fread function reads the contents of the file, and the *char argument specifies that it should read the data as characters. Then, we simply index into the fileContents to find the 137th character.
Please note that this code assumes that the text file is encoded in ASCII or UTF-8 without multi-byte characters. If the text file contains multi-byte characters (like those in UTF-16 or other encodings), you will need to account for that when reading and indexing the file.
Approach 2
% Define the filename of the text file you want to read
fileName = 'FileName.txt';
% Read the entire contents of the file into a string
txtRead = fileread(fileName);
% Find and display the character at the 137th position
charAt137 = txtRead(137);
% Display the result
disp(['Character at position 137: ', charAt137]);
---------------------------------------------------------------------------------------------------------------------------------------------------------
If you find the solution helpful and it resolves your issue, it would be greatly appreciated if you could accept the answer. Also, leaving an upvote and a comment are also wonderful ways to provide feedback.
Professional Interests
- Technical Services and Consulting
- Embedded Systems | Firmware Developement | Simulations
- Electrical and Electronics Engineering
Feel free to contact me.
1 Comment
Walter Roberson
on 18 Jan 2024
Neither of these approaches are suitable for the problem at hand.
Hi,
I understand that you want to extract the 137th character from a .txt file.
You can refer to the following workaround to extract the 137 character from a .txt file:
file = fopen('sampleTxtFile.txt', 'r');
% Check for successfully opened file
if file == -1
error('File cannot be opened.');
end
% Move to the desired position in the file (137th character)
fseek(file, 136, 'bof'); % 'bof' means beginning of file, indices start at 0
% Read one character from the current position
ch = fread(file, 1, 'char');
fclose(file);
% Convert the character code to a character
desiredCharacter = char(ch)
I hope it helps.
Thanks
5 Comments
Sergio
on 16 Jan 2024
Shivam
on 16 Jan 2024
Hi, can you attach the related files using the paperclip icon for me to debug the issue?
Sergio
on 16 Jan 2024
Shivam
on 17 Jan 2024
Hi,
Upon loading the data of 'data1.mat', 'T' obtained is a character array of size 1x5120. You can access the 137th character of T in the following way:
load data1.mat;
desiredChar = T(137)
I hope it helps.
Thanks
Sergio
on 17 Jan 2024
Categories
Find more on Data Type Conversion in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!
