isotopicdist

Calculate high-resolution isotope mass distribution and density function

Syntax

[MD,Info,DF]
= isotopicdist(SeqAA)

[MD,Info,DF]
= isotopicdist(Compound)

[MD,Info,DF]
= isotopicdist(Formula)

[MD,Info,DF]
= isotopicdist(___,Name,Value)

Description

[MD,Info,DF] = isotopicdist(SeqAA) analyzes a peptide sequence and returns a matrix containing the expected mass distribution; a structure containing the monoisotopic mass, average mass, most abundant mass, nominal mass, empirical formula, and a matrix containing the expected density function.

example

[MD,Info,DF] = isotopicdist(Compound) analyzes a compound specified by a numeric vector or matrix.

example

[MD,Info,DF] = isotopicdist(Formula) analyzes a compound specified by an empirical chemical formula represented by the structure Formula. The field names in Formula must be valid element symbols and are case sensitive. The respective values in Formula are the number of atoms for each element. Formula can also be an array of structures that specifies multiple formulas. The field names can be in any order within a structure. However, if there are multiple structures, the order must be the same in each.

[MD,Info,DF] = isotopicdist(___,Name,Value) calculate high-resolution isotope mass distribution and density function using one or more Name,Value arguments. Use name-value arguments with any combination of arguments from the previous syntaxes.

example

Examples

collapse all

Isotopic Mass Distribution of Peptide Sequence `MATLAP`

Open Live Script

Calculate and display the isotopic mass distribution of the peptide sequence MATLAP with an Acetyl N-terminal and an Amide C-terminal:ю

MD = isotopicdist("MATLAP", ...
                   NTerminal="acetyl", ...
                   CTerminal="amide", ...
                   ShowPlot=true)

MD = 9×2

  643.3363    0.6676
  644.3388    0.2306
  645.3378    0.0797
  646.3386    0.0181
  647.3396    0.0033
  648.3409    0.0005
  649.3423    0.0001
  650.3439    0.0000
  651.3455    0.0000

Isotopic Mass Distribution of Glutamine

Open Live Script

Calculate and display the isotopic mass distribution of Glutamine ( $С_{5} H_{10} N_{2} O_{3}$ ):

MD = isotopicdist([5 10 2 3 0],ShowPlot=true)

MD = 5×2

  146.0691    0.9328
  147.0715    0.0595
  148.0733    0.0074
  149.0755    0.0004
  150.0774    0.0000

Display the isotopic mass distribution of the "averagine" model, whose molecular formula represents the statistical occurrences of amino acids from all known proteins.

isotopicdist([4.9384 7.7583 1.3577 1.4773 0.0417])

Input Arguments

collapse all

`SeqAA` — Peptide sequence
character vector | string | cell array of character vectors | string vector

Peptide sequence, specified as one of these values:

Character vector or string of single-letter codes
Cell array of character vectors or string vector that specifies multiple peptide sequences

Tip

You can use the getgenpept and genpeptread functions to retrieve peptide sequences from the GenPept database or a GenPept-formatted file. You can then use the cleave function to perform an insilico digestion on a peptide sequence. The cleave function creates a cell array of character vectors representing peptide fragments, which you can submit to the isotopicdist function.

Data Types: char | string | cell

`Compound` — Compound
numeric vector | numeric matrix

Compound, specified as one of these values:

Numeric vector of form [C H N O S], where C, H, N, O, and S are nonnegative numbers that represent the number of atoms of carbon, hydrogen, nitrogen, oxygen, and sulfur respectively in a compound.
M-by-5 numeric matrix that specifies M compounds, with each row corresponding to a compound and each column corresponding to an atom.

Data Types: double

`Formula` — Chemical formula
structure | array of structures

Chemical formula, specified as one of these values:

Structure whose field names are valid element symbols and case sensitive. Their respective values are the number of atoms for each element.
Array of structures that specifies multiple formulas.

Note

If Formula is a single structure, the order of the fields does not matter. If Formula is an array of structures, then the order of the fields must be the same in each structure.

Data Types: struct

Name-Value Arguments

collapse all

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Example: NTerminal="acetyl",CTerminal="amide",ShowPlot=true

`NTerminal` — Modification for N-terminal of peptide
`"amine"` (default) | `"none"` | `"formyl"` | `"acetyl"` | structure

Modification for the N-terminal of the peptide, specified as one of these values:

"none", "amine", "formyl", or "acetyl".
Custom modification, specified by an empirical formula, represented by a structure. The structure must have field names that are valid element symbols and case sensitive. Their respective values are the number of atoms for each element.

Data Types: char | string | struct

`CTerminal` — Modification for C-terminal of peptide
`"freeacid"` (default) | `"none"` | `"amide"` | structure

Modification for the C-terminal of the peptide, specified as one of these values:

"none", "freeacid", or "amide".
Custom modification specified by an empirical formula, represented by a structure. The structure must have field names that are valid element symbols and case sensitive. Their respective values are the number of atoms for each element.

Data Types: char | string | struct

`Resolution` — Approximate resolution of the instrument
`1/8` (default) | number

Approximate resolution of the instrument (in daltons), specified as a number. Here, the resolution value is the Gaussian width at full width half height (FWHH).

Data Types: double

`FFTResolution` — Number of data points per dalton for computing the FFT algorithm
`1000` (default) | number

Number of data points per dalton for computing the FFT algorithm, specified as a number.

Data Types: double

`FFTRange` — Absolute range (window size) in daltons for FFT algorithm and output density function
number

Absolute range (window size) in daltons for the FFT algorithm and output density function, specified as a number. By default, this value is automatically estimated based on the weight of the molecule. The actual FFT range used internally by isotopicdist is further increased such that the FFTRange*FFTResolution value is a power of two.

Increase the FFTRange value if the signal represented by the DF output value appears to be truncated.

Ultrahigh resolution allows you to resolve micropeaks that have the same nominal mass, but slightly different exact masses. To achieve ultrahigh resolution, increase the FFTResolution value and reduce the Resolution value, but ensure that the FFTRange*FFTResolution value is within the available memory.

Data Types: double

`FFTLocation` — Location of the FFT range (window) defined by `FFTRange`
`1/16` (default) | fraction

Location of the FFT range (window) defined by FFTRange, specified as a fraction. This value sets the location of the lower limit of the FFT range, relative to the location of the monoisotopic peak, which is computed by isotopicdist. The location of the lower limit of the FFT range is set to the mass of the monoistopic peak -FFTLocation*FFTRange.

Tip

In rare cases where a compound contains an element, such as Iron or Argon, whose most abundant isotope is not the lightest one, shift the FFT range to the left.

Data Types: double

`NoiseThreshold` — Noise threshold value
`1e6` (default) | number

Noise threshold value, specified as a number. When you specify this value, isotopicdist removes points in the mass distribution that are smaller than 1/NoiseThreshold times the most abundant mass.

Data Types: double

`ShowPlot` — Control for displaying isotopic mass distribution plot
`true` | `false` | integer

Control for displaying the isotopic mass distribution plot, specified as false, true, or an integer specifying a compound. If set to true, the first compound is plotted. The default value is:

false — when you specify return values.
true — when you do not specify return values.

Data Types: double

Output Arguments

collapse all

`MD` — Mass distribution
two-column matrix

Mass distribution, returned as a two-column matrix in which each row corresponds to an isotope. The first column lists the isotopic mass, and the second column lists the probability for that mass.

`Info` — Mass information for the peptide sequence or compound
structure

Mass information for the peptide sequence or compound, returned as a structure with these fields:

NominalMass
MonoisotopicMass
ObservedAverageMass — Estimated from the DF signal output, using instrument resolution specified by the 'Resolution' property.
CalculatedAverageMass — Calculated directly from the input formula, assuming perfect instrument resolution.
MostAbundantMass
Formula — Structure containing the number of atoms of each element.

`DF` — Mass distribution
two-column matrix

Density function, returned as a two-column matrix. Each row corresponds to an m/z value. The first column lists the mass, and the second column lists the relative intensity of the signal at that mass.

More About

collapse all

Average Mass

Sum of the average atomic masses of the constituent elements in a molecule.

Monoisotopic Mass

Sum of the masses of the atoms in a molecule using the unbound, ground-state, rest mass of the principle (most abundant) isotope for each element instead of the isotopic average mass.

Most Abundant Mass

Mass of the molecule with the most-highly represented isotope distribution, based on the natural abundance of the isotopes.

Nominal Mass

Sum of the integer masses (ignoring the mass defect) of the most abundant isotope of each element in a molecule.

References

[1] Rockwood, A. L., S. L. Van Orden, and R. D. Smith. "Rapid Calculation of Isotope Distributions." Anal. Chem. 67:15 (1995): 2699–2704.

[2] Rockwood, A. L., S. L. Van Orden, and R. D. Smith. "Ultrahigh Resolution Isotope Distribution Calculations." Rapid Commun. Mass Spectrum 10 (1996): 54–59.

[3] Senko, M.W., S. C. Beu, and F. W. McLafferty. "Automated assignment of charge states from resolved isotopic peaks for multiply charged ions." J. Am. Soc. Mass Spectrom. 6 (1995): 52–56.

[4] Senko, M.W., S. C. Beu, and F. W. McLafferty. "Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions." J. Am. Soc. Mass Spectrom. 6 (1995): 229–233.

Version History

Introduced in R2009b

isotopicdist

Syntax

Description

Examples

Isotopic Mass Distribution of Peptide Sequence MATLAP

Isotopic Mass Distribution of Glutamine

Input Arguments

SeqAA — Peptide sequence character vector | string | cell array of character vectors | string vector

Compound — Compound numeric vector | numeric matrix

Formula — Chemical formula structure | array of structures

Name-Value Arguments

NTerminal — Modification for N-terminal of peptide "amine" (default) | "none" | "formyl" | "acetyl" | structure

CTerminal — Modification for C-terminal of peptide "freeacid" (default) | "none" | "amide" | structure

Resolution — Approximate resolution of the instrument 1/8 (default) | number

FFTResolution — Number of data points per dalton for computing the FFT algorithm 1000 (default) | number

FFTRange — Absolute range (window size) in daltons for FFT algorithm and output density function number

FFTLocation — Location of the FFT range (window) defined by FFTRange 1/16 (default) | fraction

NoiseThreshold — Noise threshold value 1e6 (default) | number

ShowPlot — Control for displaying isotopic mass distribution plot true | false | integer

Output Arguments

MD — Mass distribution two-column matrix

Info — Mass information for the peptide sequence or compound structure

DF — Mass distribution two-column matrix

More About

Average Mass

Monoisotopic Mass

Most Abundant Mass

Nominal Mass

References

Version History

See Also

Isotopic Mass Distribution of Peptide Sequence `MATLAP`

`SeqAA` — Peptide sequence
character vector | string | cell array of character vectors | string vector

`Compound` — Compound
numeric vector | numeric matrix

`Formula` — Chemical formula
structure | array of structures

`NTerminal` — Modification for N-terminal of peptide
`"amine"` (default) | `"none"` | `"formyl"` | `"acetyl"` | structure

`CTerminal` — Modification for C-terminal of peptide
`"freeacid"` (default) | `"none"` | `"amide"` | structure

`Resolution` — Approximate resolution of the instrument
`1/8` (default) | number

`FFTResolution` — Number of data points per dalton for computing the FFT algorithm
`1000` (default) | number

`FFTRange` — Absolute range (window size) in daltons for FFT algorithm and output density function
number

`FFTLocation` — Location of the FFT range (window) defined by `FFTRange`
`1/16` (default) | fraction

`NoiseThreshold` — Noise threshold value
`1e6` (default) | number

`ShowPlot` — Control for displaying isotopic mass distribution plot
`true` | `false` | integer

`MD` — Mass distribution
two-column matrix

`Info` — Mass information for the peptide sequence or compound
structure

`DF` — Mass distribution
two-column matrix