Strfind doesn't find string

hpramos4@gmail.com on 1 Jun 2016
Hi everyone
I'm web scraping with strfind, but I can't find one particular string that contains spaces. Assume that part of my text is the following:
tempHTML2=' Área <strongclass="search-results-property-list__feature-value"> 65.0'
And I want this:
str14='Área <strongclass="search-results-property-list__feature-value">';
However, strfind(tempHTML2,str14) returns an empty result. If I remove 'Área ' from the search string, it returns the correct location. If I search for just ' Área', it also finds it correctly.
The blank spaces could be the cause. However, tempHTML2 is constructed as follows, so it should not contain any:
tempHTML2=tempHTML;
tempHTML2(tempHTML2==' ')=[];
The strange part is that tempHTML2 still appears to contain blank spaces after deleting them, even though sum(ismember(tempHTML2,' ')) returns zero.
Thanks in advance,
  6 Comments
Walter Roberson on 1 Jun 2016
Please attach a copy of the tempHTML2 (before blank removal), or post the URL.
hpramos4@gmail.com on 1 Jun 2016

This is the webpage: wp

These are the HTML strings (the part I need) with and without spaces. Both were scraped with urlread.

There's something strange here: when I open the tempHTML2 string and look for the text manually, there are spaces next to "Área" (see the attached screenshot). When I write it to a txt file, the spaces are gone.

Still, neither strfind nor regexp works.
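
One way to check what those visible gaps really are is to look at the numeric character codes instead of the displayed text. This is only a sketch: the sample string is hypothetical, and the tab and newline in it are assumptions standing in for whatever urlread actually returned, which the thread does not show.

```matlab
% Hypothetical sample: a tab (9) and a newline (10) stand in for the
% unknown characters that look like blanks in the variable viewer.
sample = ['Área', char(9), char(10), '<strong'];

codes  = double(sample);                          % numeric code of every character
hidden = find(isspace(sample) & sample ~= ' ');   % whitespace that is not char(32)

disp(hidden)          % positions of the hidden whitespace
disp(codes(hidden))   % their character codes
```

Any code other than 32 at those positions would explain why the comparison against ' ' reports no blanks while the text still shows gaps.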


Answers (1)

hpramos4@gmail.com on 6 Jun 2016
I've solved the problem using isspace() in the tempHTML function. Thank you all.
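
For anyone landing here later, a minimal sketch of what the isspace() approach can look like. The exact code used is not shown in the thread, so the input string below is a made-up example, and the tab and newline in it are assumptions about what kind of whitespace was present.

```matlab
% Hypothetical input: blanks plus a tab (9) and a newline (10).
tempHTML = [' Área ', char(9), ...
    '<strong class="search-results-property-list__feature-value">', ...
    char(10), ' 65.0'];

% Original approach: removes only the plain space character, char(32).
spaceOnly = tempHTML;
spaceOnly(spaceOnly == ' ') = [];

% isspace approach: removes every character MATLAB treats as whitespace.
tempHTML2 = tempHTML;
tempHTML2(isspace(tempHTML2)) = [];

% The search string must not contain whitespace either.
str14 = 'Área<strongclass="search-results-property-list__feature-value">';

idxSpaceOnly = strfind(spaceOnly, str14);   % empty: the tab is still in the way
idx          = strfind(tempHTML2, str14);   % start index of the match
```

Note that once all whitespace is stripped from the text, the pattern has to be written without the space after 'Área' as well, otherwise strfind still comes back empty.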
