Code covered by the BSD License  

Highlights from
Mass Spectrometry Bayesian Network Analysis Tool

Be the first to rate this file! 9 Downloads (last 30 days) File Size: 16.76 KB File ID: #24345

Mass Spectrometry Bayesian Network Analysis Tool

by Karl Kuschner

 

04 Jun 2009 (Updated 17 Jul 2009)

Finds diagnostic features in the spectra of biologic samples by using a Bayesian Network approach

| Watch this File

File Information
Description

Starting with a group of training data and a given classification (like "disease" or "non-"disease") this function builds a three level Bayesian Network from mass spectrometry data. The function was designed primarily for use in finding proteins that are diagnostic of a disease group using a biologic sample.

The root node of the Bayesian network is the class variable. The first lower level contains all the features found to have high mutual information with the class variable. The second lower level are features that have high mutual information with the first lower level variable.

The input is a vector of class values for some number of spectra, a vector of IDs for those samples, and a data matrix with cases (samples) in rows, and features (mass positions in a mass spectrum) in columns, whose values are the intensity of the mass spectrum for that sample at that mass position. The values in the matrix are derived from the spectra via peak picking and alignment routines that are not included here. The values are allowed to be continuous but are later discretized automatically.

Many iterations of n-fold cross validation are allowed. The output consists of a frequency of discovery for the links described above. Thus the stability of the feature set can be estimated. The second level features are provided in order to discover modifications, satellites, and adducts of the parent protein. The function attempts to combine second level features withthe parent feature when the result provides more mutual information with the class variable.

The function also reports error rates under cross-validation, and provides options to average replicated samples, normalize spectra, and others.

Required Products Statistics Toolbox
MATLAB release MATLAB 7.8 (R2009a)
Other requirements Two external functions are required, "mutualinfo" and "condmutualinfo." One source of these functions is the package at http://www.mathworks.com/matlabcentral/fileexchange/14888.
Tags for This File  
Everyone's Tags
Tags I've Applied
Add New Tags Please login to tag files.
Please login to add a comment or rating.
Updates
17 Jul 2009

Updated comments in main function for usability

Tag Activity for this File
Tag Applied By Date/Time
mass spectrometry Karl Kuschner 04 Jun 2009 15:55:52
bayesian network Karl Kuschner 04 Jun 2009 15:55:52
feature selection Karl Kuschner 04 Jun 2009 15:55:52
biomarker discovery Karl Kuschner 04 Jun 2009 15:55:52
mutual information Karl Kuschner 04 Jun 2009 15:55:52
mass spectrometry Neeraj 11 Feb 2010 22:09:22
mutual information Neeraj 11 Feb 2010 22:09:26
biomarker discovery Neeraj 11 Feb 2010 22:09:29
bayesian network Neeraj 11 Feb 2010 22:09:31

Contact us at files@mathworks.com