emblread - Read data from EMBL file

Syntax

EMBLData = emblread(File)
EMBLSeq = emblread (File, 'SequenceOnly', SequenceOnlyValue)

Arguments

File

Either of the following:

  • String specifying a file name, a path and file name, or a URL pointing to a file. The referenced file is an EMBL-formatted file.

  • MATLAB® character array that contains the text of an EMBL-formatted file

SequenceOnlyValueControls the reading of only the sequence without the metadata. Choices are true or false (default).

Return Values

EMBLDataMATLAB structure with fields corresponding to EMBL data.
EMBLSeqMATLAB character string representing the sequence.

Description

EMBLData = emblread(File) reads data from File, an EMBL-formatted file, and creates EMBLData, a MATLAB structure with fields corresponding to the EMBL two-character line type code. Each line type code is stored as a separate element in the structure.

EMBLData contains the following fields.

Field
Identification.EntryName
Identification.Version
Identification.Topology
Identification.Molecule
Identification.DataClass
Identification.Division
Identification.SequenceLength
Accession
SequenceVersion
DateCreated
DateUpdated
Description
Keyword
OrganismSpecies
OrganismClassification
Organelle
Reference{#}.Number
Reference{#}.Comment
Reference{#}.Position
Reference{#}.MedLine
Reference{#}.PubMed
Reference{#}.Group
Reference{#}.Authors
Reference{#}.Title
Reference{#}.Location
DatabaseCrossReference
Comments
Assembly
Feature
Basecount.BP
Basecount.A
Basecount.C
Basecount.G
Basecount.T
Basecount.Other
Sequence

EMBLSeq = emblread (File, 'SequenceOnly', SequenceOnlyValue) controls the reading of only the sequence without the metadata. Choices are true or false (default).

Examples

Retrieve sequence information from the Web, save to a file, and then read back into the MATLAB software.

  1. Use the getembl function and ToFile property to retrieve sequence information from the Web and save to an EMBL-formatted file.

    getembl('X00558','ToFile','rat_protein.txt');
    
  2. Read data from the EMBL-formatted file and create a MATLAB structure.

    EMBLData = emblread('rat_protein.txt')
    

See Also

Bioinformatics Toolbox™ functions: fastaread, genbankread, genpeptread, getembl, pdbread, seqtool

  


 © 1984-2008- The MathWorks, Inc.    -   Site Help   -   Patents   -   Trademarks   -   Privacy Policy   -   Preventing Piracy   -   RSS