| Bioinformatics Toolbox™ | ![]() |
AffyStruct = affyread(File)
AffyStruct = affyread(File, LibraryPath)
| File | String specifying a file name or a path and file name of one of the following Affymetrix file types:
If you specify only a file name, that file must be on the MATLAB® search path or in the current directory. If you specify only a file name of a CDF or GIN library file, you can specify the path and directory in the LibraryPath input argument. |
| LibraryPath | String specifying the path and directory of a:
|
| AffyStruct | MATLAB structure containing information from an Affymetrix data or library file, for expression, genotyping (SNP), or resequencing assay types. |
Note This function does not work on the Solaris™ platform. |
AffyStruct = affyread(File) reads File, an Affymetrix file, and creates AffyStruct, a MATLAB structure. The affyread function can read Affymetrix EXP, DAT, CEL, CHP, CDF, and GIN files created from Affymetrix GeneChip arrays for expression, genotyping (SNP), or resequencing assays.
AffyStruct = affyread(File, LibraryPath) specifies the path and directory of a:
CDF library file associated with File when File is a CHP file
CDF library file when File is a CDF file
GIN library file when File is a GIN file
Note If you do not specify LibraryPath when reading a CHP file, affyread looks in the current directory for the CDF file. If it does not find the CDF file, it still reads the CHP file, but the probe set names and types will be omitted from the return value, AffyStruct. |
You can learn more about the Affymetrix GeneChip files and download sample files from:
http://www.affymetrix.com/support/technical/sample_data/demo_data.affx
Note Some Affymetrix sample data files (DAT, EXP, CEL, and CHP) are combined in a DTT or CAB file. You must download and use the Affymetrix Data Transfer Tool to extract these files from the DTT or CAB file. You can download the Affymetrix Data Transfer Tool from: http://www.affymetrix.com/products/software/specific/dtt.affx You will have to register and log in at the Affymetrix Web site to download the Affymetrix Data Transfer Tool. |
The following tables describe the fields in AffyStruct for the different Affymetrix file types.
File Types EXP, DAT, CEL, CHP, CDF, and GIN
| Field | Description |
|---|---|
| Name | File name. |
| DataPath | Path and directory of the file. |
| LibPath | Path and directory of the CDF and GIN library files associated with the file being read. |
| FullPathName | Path and directory of the file. |
| ChipType | Name of the AffymetrixGeneChip array (for example, DrosGenome1 or HG-Focus). |
| Date | Date the file was created. |
EXP File
| Field | Description |
|---|---|
| ChipLot Operator SampleType SampleDesc Project Comments Reagents ReagentLot Protocol Station Module HydridizeDate ScanPixelSize ScanFilter ScanDate ScannerID NumberOfScans ScannerType NumProtocolSteps ProtocolSteps | Information about experimental conditions and protocols captured by the Affymetrix software. |
DAT File
| Field | Description |
|---|---|
| NumPixelsPerRow | Number of pixels per row in the image created from the GeneChip array (number of columns). |
| NumRows | Number of rows in the image created from the GeneChip array. |
| MinData | Minimum intensity value in the image created from the GeneChip array. |
| MaxData | Maximum intensity value in the image created from the GeneChip array. |
| PixelSize | Size of one pixel in the image created from the GeneChip array. |
| CellMargin | Size of gaps between cells in the image created from the GeneChip array. |
| ScanSpeed | Speed of the scanner used to create the image. |
| ScanDate | Date the scan was performed. |
| ScannerID | Name of the scanning device used. |
| UpperLeftX UpperLeftY UpperRightX UpperRightY LowerLeftX LowerLeftY LowerRightX LowerRightY | Pixel coordinates of the scanned image. |
| ServerName | Not used. |
| Image | A NumRows-by-NumPixelsPerRow image of the scanned GeneChip array. |
CEL File
| Field | Description |
|---|---|
| FileVersion | Version of the CEL file format. |
| Algorithm | Algorithm used in the image processing step that converts from DAT format to CEL format. |
| AlgParams | String containing parameters used by the algorithm in the image processing step. |
| NumAlgParams | Number of parameters in AlgParams. |
| CellMargin | Size of gaps between cells in the image created from the GeneChip array, used for computing the intensity values of the cells. |
| Rows | Number of rows of probes. |
| Cols | Number of columns of probes. |
| NumMasked | Number of probes that are masked and not used in subsequent processing. |
| NumOutliers | Number of cells identified as outliers (very high or very low intensity) by the image processing step. |
| NumProbes | Number of probes (Rows * Cols) on the GeneChip array. |
| UpperLeftX UpperLeftY UpperRightX UpperRightY LowerLeftX LowerLeftY LowerRightX LowerRightY | Pixel coordinates of the scanned image. |
| ProbeColumnNames | Cell array containing the eight column names in the Probes field:
|
| Probes | NumProbes-by-8 array of information about the individual probes, including intensity values. The columns of this array are contained in the ProbeColumnNames field. |
CHP File
| Field | Description |
|---|---|
| AssayType | Type of assay that the GeneChip array contained (for example, Expression, Genotyping, or Resequencing). |
| CellFile | File name of the CEL file from which the CHP file was created. |
| Algorithm | Algorithm used to convert from CEL format to CHP format. |
| AlgVersion | Version of the algorithm used to create the CHP file. |
| NumAlgParams | Number of parameters in AlgParams. |
| AlgParams | String containing parameters used in steps needed to create the CHP file (for example, background correction). |
| NumChipSummary | Number of entries in ChipSummary. |
| ChipSummary | Summary information for the GeneChip array, including background average, standard deviation, max, and min. |
| BackgroundZones | Structure containing information about the zones used in the background adjustment step. |
| Rows | Number of rows of probes. |
| Cols | Number of columns of probes. |
| NumProbeSets | Number of probe sets on the GeneChip array. |
| NumQCProbeSets | Number of QC probe sets on the GeneChip array. |
ProbeSets (Expression GeneChip array) | A NumProbeSets-by-1 structure array containing
information for each expression probe set, including the following
fields:
|
ProbeSets (Genotyping GeneChip array) | A NumProbeSets-by-1 structure array containing
information for each genotyping probe set, including the following
fields:
|
ProbeSets (Resequencing GeneChip array) | A NumProbeSets-by-1 structure array containing
information for each resequencing probe set, including the following
fields:
|
CDF File
| Field | Description |
|---|---|
| Rows | Number of rows of probes. |
| Cols | Number of columns of probes. |
| NumProbeSets | Number of probe sets on the GeneChip array. |
| NumQCProbeSets | Number of QC probe sets on the GeneChip array. |
| ProbeSetColumnNames | Cell array containing the six column names in the ProbePairs field in the ProbeSets array:
|
| ProbeSets | A NumProbeSets-by-1 structure array containing
information for each probe set, including the following fields:
|
GIN File
| Field | Description |
|---|---|
| Version | GIN file format version. |
| ProbeSetName | Probe set ID/name. |
| ID | Identifier for the probe set (gene ID). |
| Description | Description of the probe set. |
| SourceNames | Source(s) of the probe sets. |
| SourceURL | Source URL(s) for the probe sets. |
| SourceID | Vector of numbers specifying which SourceNames or SourceURL each probe set is associated with. |
The following example uses the demo data and CDF library file from the E. coli Antisense Genome array, which you can download from:
http://www.affymetrix.com/support/technical/sample_data/demo_data.affx
After you download the demo data, you will need the Affymetrix Data Transfer Tool to extract the CEL, DAT, and CHP files from a DTT file. You can download the Affymetrix Data Transfer Tool from:
http://www.affymetrix.com/products/software/specific/dtt.affx
The following example assumes that files Ecoli-antisense-121502.CEL, Ecoli-antisense-121502.dat, and Ecoli-antisense-121502.chp are stored on the MATLAB search path or in the current directory. It also assumes that the associated CDF library file, Ecoli_ASv2.CDF, is stored at D:\Affymetrix\LibFiles\Ecoli.
Read the contents of a CEL file into a MATLAB structure.
celStruct = affyread('Ecoli-antisense-121502.CEL');Display a spatial plot of the probe intensities.
maimage(celStruct, 'Intensity')

Zoom in on a specific region of the plot.
axis([200 340 0 70])

Read the contents of a DAT file into a MATLAB structure, display the raw image data, and then use the axis image function to set the correct aspect ratio.
datStruct = affyread('Ecoli-antisense-121502.dat');
imagesc(datStruct.Image)
axis image

Zoom in on a specific region of the plot.
axis([1900 2800 160 650])

Read the contents of a CHP file into a MATLAB structure, specifying the location of the associated CDF library file. Then extract information for probe set 3315278.
chpStruct = affyread('Ecoli-antisense-121502.chp',...
'D:\Affymetrix\LibFiles\Ecoli');
geneName = probesetlookup(chpStruct,'3315278')
geneName =
Identifier: '3315278'
ProbeSetName: 'argG_b3172_at'
CDFIndex: 5213
GINIndex: 3074
Description: [1x82 char]
Source: 'NCBI EColi Genome'
SourceURL: [1x74 char]
Bioinformatics Toolbox™ functions: agferead, celintensityread, gprread, ilmnbsread, probelibraryinfo, probesetlink, probesetlookup, probesetplot, probesetvalues, sptread
![]() | affyprobeseqread | affysnpquartets | ![]() |
| © 1984-2008- The MathWorks, Inc. - Site Help - Patents - Trademarks - Privacy Policy - Preventing Piracy - RSS |