Why are characters in MATLAB assigned numerical values?

Question

Camille Levine on 11 Mar 2018

Edited: Stephen23 on 16 Apr 2018

When I was trying to display a value next to a percent sign as a string, I accidentally used single quotes instead of double quotes, and it gave me this odd output. I realized that each non-numerical character is assigned a value in MATLAB, e.g. % = 37, ^ = 94, . = 46. Even digits are assigned values, such as 0 = 48. Is there any reason or use for this feature or is it just a quirk of the language as a whole?

>> disp('%')
%
>> disp(0 + '%')
    37
>> disp(0 + '^')
    94
>> disp(0 + '.')
    46
>> disp('^'+'0')
   142
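
For comparison, here is a minimal sketch of how the double-quoted form mentioned above behaves. It assumes MATLAB R2017a or later, where double quotes create a string rather than a char array; the variable names are illustrative only.

```matlab
% Single quotes create a char; arithmetic uses its numeric code.
a = 37 + '%';   % '%' has code 37, so a is the double 74

% Double quotes create a string; plus concatenates instead of adding.
b = 37 + "%";   % b is the string "37%"

disp(a)
disp(b)
```

With a string operand, the number is converted to text and joined to it, which is presumably the output that was originally wanted.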

2 Comments

Roger Stafford on 12 Mar 2018

Edited: Walter Roberson on 12 Mar 2018

I recommend you read the Wikipedia article at:

https://en.wikipedia.org/wiki/ASCII

Stephen23 on 16 Apr 2018

Edited: Stephen23 on 16 Apr 2018

"Is there any reason or use for this feature or is it just a quirk of the language as a whole?"

Absolutely everything on your computer is stored as numbers. Every photo, every film, every website, every program or app, ... they are all just made up of lots of numbers. Characters are no exception to this, although their encoding is, for historical reasons, certainly a bit "quirky":

https://en.wikipedia.org/wiki/ASCII

https://en.wikipedia.org/wiki/UTF-16

Answer 1

Joost Meulenbeld on 12 Mar 2018

Edited: Joost Meulenbeld on 12 Mar 2018

The plus operator treats its operands (in this case a character and a number, or two characters) as numeric, so a character is converted to its ASCII value. See e.g. https://www.asciitable.com/ for a table containing all character values.
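
The conversion can also be done explicitly in both directions, without the `0 +` trick. A small sketch using the built-in `double` and `char` functions (variable names are made up for illustration):

```matlab
% Character to numeric code, and back again.
code = double('%');         % 37
ch   = char(94);            % '^'

% Arithmetic on a char yields a double, not a char.
next     = 'a' + 1;         % 98
nextChar = char('a' + 1);   % 'b'

% The same applies elementwise to char vectors.
codes = double('abc');      % [97 98 99]
```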

1 Comment

Walter Roberson on 12 Mar 2018

As a technical quibble: MATLAB does not use ASCII; it uses the first 65536 entries of Unicode. ASCII is only defined up to location 127.

Answer 2

Walter Roberson on 12 Mar 2018

Digital computers can only manipulate bits. A storage location is just a group of bits. There are not different kinds of external memory to store integers or floating point numbers or characters: there are just groups of bits together with programs to manipulate the bits. Everything else is a matter of how the programs tell the computer to interpret the bits. As far as a computer cares, it is entirely valid to take the group of bits that a moment ago was interpreted as a double precision number, and to say "Now that group of bits is to be interpreted as four signed 16 bit integers instead" and do calculations on the group of bits that way.
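
The reinterpretation described above can be tried from MATLAB itself. This is only a sketch, using the built-in `typecast` function, which returns the same bits viewed as another numeric type without converting the value. The individual int16 values depend on the machine's byte order, so they are not listed here.

```matlab
x = 1.5;                         % one double: 64 bits
parts = typecast(x, 'int16');    % the same 64 bits viewed as four int16 values
disp(numel(parts))               % 4

y = typecast(parts, 'double');   % view the unchanged bits as a double again
disp(isequal(x, y))              % 1 (true)
```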

So too characters are just stored as groups of bits. Each different character has a different binary pattern. You can do arithmetic on the patterns and nothing cares. At some point, though, the user needs to be shown the character: at that point the binary pattern is processed through some code to look up information about how the character should be displayed on the screen. It is at that point that information about font, italics, color and point size is considered: those are display attributes rather than information about the character itself. The same binary pattern is used to represent an 'A' that is to be displayed in Monaco bold 17 point as is used to represent an 'A' that is to be displayed in New Century Schoolbook italic 12 point.
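
A small-scale illustration of "same stored value, different presentation" is possible with format specifiers. This sketch says nothing about fonts; it only shows that the choice between a glyph and a number is made at output time (variable names are illustrative):

```matlab
c = 'A';

% The same stored value, rendered two different ways.
asChar   = sprintf('%c', c);    % 'A'
asNumber = sprintf('%d', c);    % '65'

% A plain number can likewise be rendered as a character.
fromNumber = sprintf('%c', 65); % 'A'
```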

MATLAB represents characters as 16-bit unsigned numbers: up to 65536 different characters can be represented this way, each represented by a distinct binary pattern. There are some indications that internally MATLAB uses UTF-16 encoding to represent additional characters, but that can be difficult to prove or disprove.
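
The 16-bit storage is visible from the workspace. A minimal sketch, assuming it is run as a script so that `whos` can see the variable `c` (names are illustrative):

```matlab
c = char(960);               % Greek small letter pi (U+03C0), outside ASCII
code = double(c);            % 960

info = whos('c');            % workspace information about c
bytesPerChar = info.bytes;   % 2 bytes, i.e. 16 bits, for a single char

top = double(char(65535));   % largest 16-bit code: 65535
```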

The binary pattern used for '%' is the same binary pattern that is used to represent uint16(37). If you do

fooc = '%';
food = uint16(37);

then the data pointers of the two variables will end up pointing to the same binary patterns: the difference will just be in the headers for the variable that tell MATLAB how the user wants the binary patterns to be interpreted.
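
The data pointers themselves cannot be inspected from ordinary MATLAB code, so the following is only an indirect check: it converts the char to uint16 and compares the raw bytes of the two values using `typecast`. The names `sameBytes` and `backToChar` are made up for illustration.

```matlab
fooc = '%';
food = uint16(37);

% Same numeric value, different class recorded for each variable.
classes = {class(fooc), class(food)};    % {'char', 'uint16'}

% Compare the underlying bytes of the two 16-bit values.
sameBytes = isequal(typecast(uint16(fooc), 'uint8'), ...
                    typecast(food, 'uint8'));   % true

% Telling MATLAB to interpret the number as a character gives '%' back.
backToChar = char(food);                 % '%'
```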
