| Contents | Index |
| On this page… |
|---|
Importing a Sequence into the Sequence Tool |
The Sequence Tool window integrates many of the sequence functions in the toolbox. Instead of entering commands in the MATLAB Command Window, you can select and enter options.
The first step when analyzing a nucleotide or amino acid sequence is to import sequence information into the MATLAB environment. The Sequence Tool can connect to Web databases such as NCBI and EMBL and read information into the MATLAB environment.
The following procedure illustrates how to retrieve sequence information from the NCBI database on the Web. This example uses the GenBank accession number NM_000520, which is the human gene HEXA that is associated with Tay-Sachs disease.
In the MATLAB Command Window, type
seqtool
The Sequence Tool window opens without a sequence loaded. Notice that the panes to the right and bottom are blank.
To retrieve a sequence from the NCBI database, select File > Download Sequence from > NCBI.
The Download Sequence from NCBI dialog box opens.

In the Enter Sequence box, type an accession number for an NCBI database entry, for example, NM_000520. Click the Nucleotide option button, and then click OK.
The MATLAB software accesses the NCBI database on the Web, loads nucleotide sequence information for the accession number you entered, and calculates some basic statistics.

After you import a sequence into the Sequence Tool window, you can read information stored with the sequence, or you can view graphic representations for ORFs and CDSs.
In the left pane tree, click Comments. The right pane displays general information about the sequence.
Now click Features. The right pane displays NCBI feature information, including index numbers for a gene and any CDS sequences.
Click ORF to show the search results for ORFs in the six reading frames.

Click Annotated CDS to show the protein coding part of a nucleotide sequence.

The following procedure illustrates how to search for characteristic words and sequence patterns. You will search for sequence patterns like the TATAA box and patterns for specific restriction enzymes.
Select Sequence > Find Word.
In the Find Word dialog box, type a sequence word or pattern, for example, atg, and then click Find.

The Sequence Tool window searches and displays the location of the selected word.

Clear the display by clicking the Clear Word Selection
button
on
the toolbar.
The following procedure illustrates how to identify the protein coding part of a nucleotide sequence and copy it into a new view. Identifying coding sections of a nucleotide sequence is a common bioinformatics task. After locating the coding part of a sequence, you can copy it to a new view, translate it to an amino acid sequence, and continue with your analysis.
In the left pane, click ORF.
The Sequence Tool window displays the ORFs for the six reading frames in the lower-right pane. Hover the cursor over a frame to display information about it.

Click the longest ORF on reading frame 2.
The ORF is highlighted to indicate the part of the sequence that is selected.

Right-click the selected ORF and then select Export to Workspace. In the Export to MATLAB Workspace dialog box, type a variable name, for example, NM_000520_ORF_2, then click Export.

The NM_000520_ORF_2 variable is added to the MATLAB Workspace.
Select File > Import from Workspace. Type the name of a variable with an exported ORF, for example, NM_000520_ORF_2, and then click Import.
The Sequence Tool window adds a tab at the bottom for the new sequence while leaving the original sequence open.

In the left pane, click Full Translation. Select Display > Amino Acid Residue Display > One Letter Code.
The Sequence Tool window displays the amino acid sequence below the nucleotide sequence.

Close the Sequence Tool window from the MATLAB command line using the following syntax:
seqtool('close')
![]() | Reading, Analyzing, and Converting a Nucleotide Sequence Using the Command Line | Importing, Viewing, and Exploring a Protein Sequence Using a Graphical Interface | ![]() |

See how to analyze, visualize, and model biological data and systems using MathWorks products.
Get free kit| © 1984-2012- The MathWorks, Inc. - Site Help - Patents - Trademarks - Privacy Policy - Preventing Piracy - RSS |