| Contents | Index |
cpgStruct = cpgisland(SeqDNA)
cpgStruct = cpgisland(SeqDNA,
...'Window', WindowValue, ...)
cpgStruct = cpgisland(SeqDNA,
...'MinIsland', MinIslandValue, ...)
cpgStruct = cpgisland(SeqDNA,
...'GCmin', GCminValue, ...)
cpgStruct = cpgisland(SeqDNA,
...'CpGoe', CpGoeValue, ...)
cpgStruct = cpgisland(SeqDNA,
...'Plot', PlotValue, ...)
| SeqDNA | One of the following:
Valid characters include A, C, G, and T. cpgisland does not count ambiguous nucleotides or gaps. |
| WindowValue | Integer specifying the window size for calculating GC content and CpGobserved/CpGexpected ratios. Default is 100 bases. A smaller window size increases the noise in a plot. |
| MinIslandValue | Integer specifying the minimum number of consecutive marked bases to report as a CpG island. Default is 200 bases. |
| GCminValue | Value specifying the minimum GC percent in a window needed to mark a base. Choices are a value between 0 and 1. Default is 0.5. |
| CpGoeValue | Value specifying the minimum CpGobserved/CpGexpected ratio in each window needed to mark a base. Choices are a value between 0 and 1. Default is 0.6. This ratio is defined as: CPGobs/CpGexp = (NumCpGs*Length)/(NumGs*NumCs) |
| PlotValue | Controls the plotting of GC content, CpGoe content, CpG islands greater than the minimum island size, and all potential CpG islands for the specified criteria. Choices are true or false (default). |
| cpgStruct | MATLAB structure containing the starting and ending bases of the CpG islands greater than the minimum island size. |
cpgStruct = cpgisland(SeqDNA) searches SeqDNA, a DNA nucleotide sequence, for CpG islands with a GC content greater than 50% and a CpGobserved/CpGexpected ratio greater than 60%. It marks bases meeting this criteria within a moving window of 100 DNA bases and then returns the results in cpgStruct, a MATLAB structure containing the starting and ending bases of the CpG islands greater than the minimum island size of 200 bases.
cpgStruct = cpgisland(SeqDNA, ...'PropertyName', PropertyValue, ...) calls cpgisland with optional properties that use property name/property value pairs. You can specify one or more properties in any order. Each PropertyName must be enclosed in single quotation marks and is case insensitive. These property name/property value pairs are as follows:
cpgStruct = cpgisland(SeqDNA,
...'Window', WindowValue, ...) specifies
the window size for calculating GC content and CpGobserved/CpGexpected
ratios. Default is 100 bases. A smaller window
size increases the noise in a plot.
cpgStruct = cpgisland(SeqDNA, ...'MinIsland', MinIslandValue, ...) specifies the minimum number of consecutive marked bases to report as a CpG island. Default is 200 bases.
cpgStruct = cpgisland(SeqDNA, ...'GCmin', GCminValue, ...) specifies the minimum GC percent in a window needed to mark a base. Choices are a value between 0 and 1. Default is 0.5.
cpgStruct = cpgisland(SeqDNA, ...'CpGoe', CpGoeValue, ...) specifies the minimum CpGobserved/CpGexpected ratio in each window needed to mark a base. Choices are a value between 0 and 1. Default is 0.6. This ratio is defined as:
CPGobs/CpGexp = (NumCpGs*Length)/(NumGs*NumCs)
cpgStruct = cpgisland(SeqDNA, ...'Plot', PlotValue, ...) controls the plotting of GC content, CpGoe content, CpG islands greater than the minimum island size, and all potential CpG islands for the specified criteria. Choices are true or false (default).
Import a nucleotide sequence from the GenBank database. For example, retrieve a sequence from Homo sapiens chromosome 12.
S = getgenbank('AC156455');Calculate the CpG islands in the sequence and plot the results.
cpgisland(S.Sequence,'PLOT',true)
ans =
Starts: [4510 29359]
Stops: [5468 29604]
The CpG islands greater than 200 bases in length are listed and a plot displays.

basecount | ntdensity | seqshoworfs

See how to analyze, visualize, and model biological data and systems using MathWorks products.
Get free kit| © 1984-2012- The MathWorks, Inc. - Site Help - Patents - Trademarks - Privacy Policy - Preventing Piracy - RSS |