source:
plugins/base2/net.sf.basedb.illumina/trunk/README_PluginDetails
@
1179
Last change on this file since 1179 was 1179, checked in by , 14 years ago  



File size: 4.6 KB 
$Id: README_PluginDetails 1179 20091020 22:09:42Z jari $
Introduction
There are many plugins in the Illumina plugin package for BASE. This file gives detailed information on some of the contributed plugins.
Illumina detection Pvalue calculation
This plugin implements BeadStudio like detection Pvalue calculations for Illumina expression data (see http://www.genomecenter.ucdavis.edu/expression_analysis/documents/illumina_normalization_081201.pdf) on detection Pvalues.
The plugin will always base the detection Pvalue calculation on raw data values, i.e., the mean raw intensity for the different signals. By default the calculations are based on negative controls available in the root bioassay set for the current analysis branch. The user may change this to only use negative controls in the current bioassay set.
The detection Pvalue plugin does not filter the assays, it provides the detection Pvalues usable in a filter step after running this plugin.
Parameters
The plugin requires input of array type since detection Pvalue are calculated differently depending on array type.
Users may select to use negative controls in the current bioassay set only. The default behaviour is to use all negative controls in the root bioassay set for the current bioassay set.
A cut off parameter is available to exclude outliers within
the negative controls. The cutoff
defines the acceptable negative
control signal range
medianMAD*cutoff < I < median+MAD*cutoff
where MAD
is the median absolute deviation.
Implementation details
Each assay is treated separately, i.e., no samples are combined together. All calculations are made on the raw beadtype level data, i.e., on the average expression value for each bead type and raw data is always used irrespective when in analysis the detection Pvalue is calculated.
Pvalue calculation for whole genome arrays:
For all signals i
calculate the detection Pvalue as
Pvalue = 1R/N
where R
is the rank of the signal i
relative to the
negative controls and N
is the number of negative controls.
Pvalue calculation for others array types (DASL, miRNA, VeraCode DASL, and Focused Arrays):
For all signals i
calculate the detection Pvalue as
Pvalue = 1/2  1/2 * erf( [iAvgControl]/StdControl/sqrt(2) )
where
AvgControl
is the average intensity of the negative controls,
StdControl
is the standard deviation of the the negative controls,
and erf
is the error function
(http://mathworld.wolfram.com/Erf.html). The error function is used
for arguments within the range (4,4). To save CPU cycles, the function
value for arguments outside this range is set to 1 and 1,
respectively.
Control Summary plots
This package provides overview plots for Illumina expression data. The 'Overview plots' tab becomes available in the experiment analysis tree when the user selects a bioassay set. The plots are automatically generated when the user clicks on the Overview plots tab. The display of the plots cannot be changed by the user and the same plot is shown irrespective which bioassay set is selected.
Currently two control summary curves are generated in one plot; The average intensity of perfect match beads in each assay, and the average intensity of housekeeping beads in each assay.
The average intensity I_avg is calculated as
I_avg = sum[Iraw_i] / N
where N is the number of bead types in the sum, Iraw_i is the raw mean intensity for bead type i.
A bead type is considered to belong to the perfect match group if it is annotated with ':pm' in the reporter annotation column '[Rep] Control group id', and a bead type is grouped as housekeeping if it is annotated with 'housekeeping' in reporter annotator column '[Rep] Control group id'.
Copyright (C) 2009 Jari Häkkinen This file is part of Illumina plugin package for BASE. Available at http://baseplugins.thep.lu.se/ BASE main site: http://base.thep.lu.se/ This is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version. The software is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with BASE. If not, see <http://www.gnu.org/licenses/>.