How can I read an HTML file into MATLAB and discard the HTML tags?
I have an HTML file that I would like to read into MATLAB. However, I would like to discard the HTML tags and keep only the text from the file.
Accepted Answer
More Answers (1)
Sean de Wolski
on 17 Oct 2017
Edited: MathWorks Support Team
on 19 May 2023
R2017b introduced this capability with the Text Analytics Toolbox.
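As a rough sketch of what that could look like: the answer does not name a function, so this assumes extractFileText from the Text Analytics Toolbox is the one meant. The temporary file and its contents are made up for illustration.

```matlab
% Requires Text Analytics Toolbox (R2017b or later).
% Assumption: extractFileText is the function the answer refers to.

% Write a small, hypothetical HTML file so the example is self-contained.
filename = [tempname '.html'];
fid = fopen(filename, 'w');
fprintf(fid, '%s', '<html><body><p>Hello, <b>world</b>!</p></body></html>');
fclose(fid);

% Read the file and keep only its text content (returned as a string).
str = extractFileText(filename);
disp(str)

% Clean up the temporary file.
delete(filename);
```

For a real page, pass the path of your own HTML file to extractFileText instead of the temporary one.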
1 Comment
Walter Roberson
on 22 Oct 2017
That toolbox is new as of R2017b, and it also requires the Statistics and Machine Learning Toolbox. If you already have the toolbox for other reasons, then certainly use it; if not, regexp() does fine.
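A minimal sketch of the regular-expression route, using regexprep (the replace counterpart of regexp) from base MATLAB. The sample HTML string is hypothetical; for a real file, something like html = fileread('page.html') should work in its place. This is not a full HTML parser: it does not decode entities such as &amp;amp;, and a tag or comment that contains a literal > character will trip it up.

```matlab
% Hypothetical input; for a real file use: html = fileread('page.html');
html = ['<html><head><style>p {color: red;}</style></head>' ...
        '<body><p>Hello, <b>world</b>!</p></body></html>'];

% Drop <script> and <style> elements together with their contents,
% since their bodies are code rather than readable text.
txt = regexprep(html, '<(script|style)[^>]*>.*?</\1>', '', 'ignorecase');

% Drop every remaining tag: a '<', any run of non-'>' characters, then '>'.
txt = regexprep(txt, '<[^>]*>', '');

% Collapse runs of whitespace left behind and trim the ends.
txt = strtrim(regexprep(txt, '\s+', ' '));

disp(txt)
```

The script/style pass comes first on purpose: if the tags were stripped first, the CSS or JavaScript between them would be left in the output as if it were page text.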