multialignread - Read multiple-sequence alignment file

Syntax

S = multialignread(File)
[Headers, Sequences] = multialignread(File)

multialignread(..., 'PropertyName', PropertyValue,...)
multialignread(..., 'IgnoreGaps', IgnoreGapsValue)

Arguments

FileMultiple sequence alignment file (ASCII text file). Enter a file name, a path and file name, or a URL pointing to a file. File can also be a MATLAB® character array that contains the text of a multiple sequence alignment file.

You can read common multiple alignment file types, such as ClustalW (.aln) and GCG (.msf).

IgnoreGapsValueProperty to control removing gap symbols.

Description

S = multialignread(File) reads a multiple sequence alignment file. The file contains multiple sequence lines that start with a sequence header followed by an optional number (not used by multialignread) and a section of the sequence. The multiple sequences are broken into blocks with the same number of blocks for every sequence. (For an example, type open aagag.aln.) The output S is a structure array where S.Header contains the header information and S.Sequence contains the amino acid or nucleotide sequences.

[Headers, Sequences] = multialignread(File) reads the file into separate variables Headers and Sequences.


multialignread(..., 'PropertyName', PropertyValue,...)
defines optional properties using property name/value pairs.

multialignread(..., 'IgnoreGaps', IgnoreGapsValue), when IgnoreGapsValue is true, removes any gap symbol ('-' or '.') from the sequences. Default is false.

Examples

Read a multiple sequence alignment of the gag polyprotein for several HIV strains.

gagaa = multialignread('aagag.aln')

gagaa = 

1x16 struct array with fields:
    Header
    Sequence

See Also

Bioinformatics Toolbox™ functions: fastaread, gethmmalignment, multialign, seqconsensus, seqdisp, seqprofile

  


 © 1984-2008- The MathWorks, Inc.    -   Site Help   -   Patents   -   Trademarks   -   Privacy Policy   -   Preventing Piracy   -   RSS