Splitting Characters in a Cell Array

Aldrich To on 11 Nov 2015
Commented: Guillaume on 13 Nov 2015
Hi All,
I am trying to split some content in a cell array into separate portions. I've tried converting to a string and using strsplit, but I am not getting the results I want because of the datatype syntax.
I came across the cellfun command, but I'm not really sure how to use it.
Here is what I have:
'P245/65R17 105S'
'P265/70R16 111S'
'P275/55R20 111H'
'285/60R18 120H'
'P235/70R17 108S'
What I need:
'P245/' '65' 'R' '17' '105' 'S'
'P265/' '70' 'R' '16' '111' 'S'
'P275/' '55' 'R' '20' '111' 'H'
'285/' '60' 'R' '18' '120' 'H'
'P235/' '70' 'R' '17' '108' 'S'
Thanks in advance!

Accepted Answer

Jan on 11 Nov 2015
Data = {'P245/65R17 105S'; ...
        'P265/70R16 111S'; ...
        'P275/55R20 111H'; ...
        '285/60R18 120H'; ...
        'P235/70R17 108S'};
n = numel(Data);
Result = cell(n, 6);   % one row per string, six parts per row
for k = 1:n
    S = Data{k};
    p = strfind(S, '/');   % position of the slash; all other offsets are relative to it
    % 'P245/65R17 105S'
    % 'P245/' '65' 'R' '17' '105' 'S'
    Result(k, :) = {S(1:p), S(p+1:p+2), S(p+3), S(p+4:p+5), S(p+7:p+9), S(p+10)};
end
Does this help already? Or are there strings that do not match this pattern? If so, you could also search for the space, use the length of the strings, or something similar.
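The idea of also searching for the space can be sketched as follows. This is only an illustration, not part of the original answer: it assumes each string contains exactly one '/' and exactly one space, and that the field after the slash is two characters wide. The names q and parts are made up for the example.

```matlab
% Sketch: locate both delimiters instead of relying only on fixed offsets.
% Assumption: exactly one '/' and one ' ' per string.
S = 'P245/65R17 105S';
p = strfind(S, '/');          % index of the slash
q = strfind(S, ' ');          % index of the space

parts = {S(1:p), ...          % everything up to and including the slash
         S(p+1:p+2), ...      % two characters after the slash
         S(p+3), ...          % single letter
         S(p+4:q-1), ...      % digits between the letter and the space
         S(q+1:end-1), ...    % digits after the space
         S(end)};             % trailing letter
```

Because the last three parts are derived from q and end, the number of digits before and after the space may vary without changing the code.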
  5 Comments
Aldrich To on 12 Nov 2015
Thanks everyone for all the useful input! Guillaume, regexp actually gave me my desired output. Made an edit that gave me the following:
split = regexp(size,'([P-T])(\d+)(\D+)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');
% 'P' '265' '/' '70' 'R' '16' '111' 'S'
However, it skips the rows that do not follow that exact character configuration (e.g. 185/65R15 or LT245/70R17 128/112R) and continues iterating.
The end result I would like is the following:
'P' '265' '/' '70' 'R' '16' '111' 'S'
'275' '/' '50' 'R' '17' 'ST'
'LT' '245' '/' '70' 'R' '17' '128/112' 'R'
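A small sketch of how the skipped rows could be located: with the 'once' option, regexp returns an empty result for every string that does not match, so cellfun with isempty flags them. The variable names codes, tok and unmatched are invented for this illustration.

```matlab
% Sketch: flag the entries that the pattern does not match.
codes = {'P265/70R16 111S'; '185/65R15'; 'LT245/70R17 128/112R'};
pattern = '([P-T])(\d+)(\D+)(\d+)([A-Z])(\d+) (\d+)([A-Z])';

tok = regexp(codes, pattern, 'tokens', 'once');   % one cell per input string
unmatched = cellfun(@isempty, tok);               % true where nothing matched

disp(codes(unmatched));   % the strings that need a more general pattern
```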
Jan on 13 Nov 2015
@Aldrich: The result shown cannot be represented in MATLAB as a single cell array. If it is stored as a cell string, the missing elements must be filled with at least [], because every row of an array must have the same number of elements.
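One way to satisfy this constraint is to pad the shorter rows, sketched below. The choice of '' as filler (rather than []) is an assumption made here so that every element stays a char array; the names rows, width and padded are hypothetical.

```matlab
% Sketch: pad rows of different lengths into one rectangular cell array.
rows = {{'P', '265', '/', '70', 'R', '16', '111', 'S'}, ...
        {'275', '/', '50', 'R', '17', 'ST'}, ...
        {'LT', '245', '/', '70', 'R', '17', '128/112', 'R'}};

width = max(cellfun(@numel, rows));           % length of the longest row
padded = repmat({''}, numel(rows), width);    % pre-fill every slot with ''
for k = 1:numel(rows)
    padded(k, 1:numel(rows{k})) = rows{k};    % copy the row, leave the rest ''
end
```

The padding is appended at the end of each row, so the columns do not line up by meaning; aligning them would need a pattern that yields the same number of tokens for every string.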


More Answers (1)

Guillaume on 11 Nov 2015
Edited: Guillaume on 11 Nov 2015
Use the power of Regular Expressions. It's a daunting language at first but it's very powerful:
data = {'P245/65R17 105S';
        'P265/70R16 111S';
        'P275/55R20 111H';
        '285/60R18 120H';
        'P235/70R17 108S'};
splitdata = regexp(data, '(.+/)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');
splitdata = vertcat(splitdata{:})
The regular expression is divided into tokens (the parenthesized groups in the regex):
  • the 1st token is one or more (the +) of any character (the .) followed by '/'
  • the 2nd token is one or more (the +) digit (the \d)
  • the 3rd token is a single character between A and Z (the [A-Z])
  • 4th token, see 2nd
  • it then matches a space which is not part of any token
  • 5th token, see 2nd
  • 6th token, see 3rd
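The token breakdown above can be checked on a single string. This is a minimal sketch; the names str and tok are chosen for illustration only.

```matlab
% Sketch: apply the pattern to one string and inspect the six tokens.
str = 'P245/65R17 105S';
tok = regexp(str, '(.+/)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');

% tok is a 1x6 cell array, one element per pair of parentheses
for k = 1:numel(tok)
    fprintf('token %d: %s\n', k, tok{k});
end
```

The space in the pattern is consumed by the match but does not appear in tok, because it sits outside every pair of parentheses.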
  3 Comments
Guillaume on 13 Nov 2015
A regex that would most likely work with all your cases would be:
regexp(data, '([A-Z]*)(\d+)(/)(\d+)([A-Z])(\d+) (\d+(/\d+)?)?([A-Z])', 'tokens')
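As an illustration of the same idea, here is a different, simplified pattern (not the one above) in which every optional part is written as a token that may match zero characters. Under that assumption each string yields the same number of tokens, so the rows can be stacked directly. The pattern, the sample list and the names codes, tok and result are all made up for this sketch, and it has only been reasoned through for the four formats listed.

```matlab
% Sketch: optional parts use * so that every row produces seven tokens.
codes = {'P265/70R16 111S'; ...
         '275/50R17 ST'; ...
         'LT245/70R17 128/112R'; ...
         '185/65R15'};

% prefix, digits, '/', digits, letter, digits, optional space,
% optional digits (possibly with '/'), optional trailing letters
pattern = '^([A-Z]*)(\d+)/(\d+)([A-Z])(\d+)\s*([0-9/]*)([A-Z]*)$';

tok = regexp(codes, pattern, 'tokens', 'once');
result = vertcat(tok{:});   % one row per string; missing parts are empty
```

Unlike the patterns earlier in the thread, this one does not keep the '/' as its own token, and strings outside the assumed formats would still come back empty and be dropped by vertcat.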
