Products & Services Solutions Academia Support User Community Company

Learn more about Bioinformatics Toolbox   

genbankread - Read data from GenBank file

Syntax

GenBankData = genbankread(File)

Arguments

File

Either of the following:

  • String specifying a file name, a path and file name, or a URL pointing to a file. The referenced file is a GenBank-formatted file (ASCII text file). If you specify only a file name, that file must be on the MATLAB search path or in the MATLAB Current Directory.

  • MATLAB character array that contains the text of a GenBank-formatted file.

GenBankDataMATLAB structure with fields corresponding to GenBank keywords.

Description

GenBankData = genbankread(File) reads in a GenBank-formatted file, File, and creates a structure, GenBankData, containing fields corresponding to the GenBank keywords. Each separate sequence listed in the output structure GenBankData is stored as a separate element of the structure.

Examples

  1. Retrieve sequence information for a gene (HEXA), store data in a file, and then read back into the MATLAB software.

    getgenbank('nm_000520', 'ToFile', 'TaySachs_Gene.txt')
    s = genbankread('TaySachs_Gene.txt')
    
    s = 
    
                    LocusName: 'NM_000520'
          LocusSequenceLength: '2437'
         LocusNumberofStrands: ''
                LocusTopology: 'linear'
            LocusMoleculeType: 'mRNA'
         LocusGenBankDivision: 'PRI'
        LocusModificationDate: '18-FEB-2009'
                   Definition: [1x63 char]
                    Accession: 'NM_000520'
                      Version: 'NM_000520.4'
                           GI: '189181665'
                      Project: []
                       DBLink: []
                     Keywords: []
                      Segment: []
                       Source: 'Homo sapiens (human)'
               SourceOrganism: [4x65 char]
                    Reference: {1x10 cell}
                      Comment: [32x67 char]
                     Features: [147x74 char]
                          CDS: [1x1 struct]
                     Sequence: [1x2437 char]
  2. Display the source organism for this sequence.

    s.SourceOrganism
    
    ans =
    
    Homo sapiens                                                     
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
    Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;     
    Catarrhini; Hominidae; Homo.

See Also

Bioinformatics Toolbox functions: emblread, fastaread, genpeptread, getgenbank, scfread, seqtool

  


Recommended Products

Includes the most popular MATLAB recorded presentations with Q&A sessions led by MATLAB experts.

 © 1984-2009- The MathWorks, Inc.    -   Site Help   -   Patents   -   Trademarks   -   Privacy Policy   -   Preventing Piracy   -   RSS