Main Content


Convert documents to cell array of string vectors



C = doc2cell(documents) converts a tokenizedDocument array to a cell array. The entries of C are string arrays containing the corresponding words in each document.


collapse all

Convert a tokenizedDocument array to a cell array of string vectors.

documents = tokenizedDocument([ ...
    "an example of a short sentence" ...
    "a second short sentence"])
documents = 
  1x2 tokenizedDocument:

    6 tokens: an example of a short sentence
    4 tokens: a second short sentence

C = doc2cell(documents)
C=1×2 cell array
    {["an"    "example"    "of"    "a"    "short"    "sentence"]}    {["a"    "second"    "short"    "sentence"]}

View the first element of the cell array.

ans = 1x6 string
    "an"    "example"    "of"    "a"    "short"    "sentence"

Input Arguments

collapse all

Input documents, specified as a tokenizedDocument array.

Output Arguments

collapse all

Output cell array of string vectors. Each element of C is a string vector containing the words of the corresponding document.

Version History

Introduced in R2017b