Sign up to our newsletter to receive information and tips related to your field of study.
Please enter your email:
Technical Details
Statistical Analysis - Technical Specification
Input data type supported
Preprocessed MALDI or LC-MS data.
Feature data – any CSV files containing intensity (or similar) values for consecutive samples and features, e.g. dataset obtained from any protein quantitation software or any ‘omics’ data such as gene expression microarray data.
Available algorithms
One-dimensional data exploration
Measures of central tendency: mean, median, trimmed mean, minimum and maximum.
Measures of variability: standard deviation (std) and variance, sample range, interquartile range (IQR), median absolute deviation (MAD).
Empirical quantiles: the first and the third quartile (Q1, Q3).
Advanced analysis techniques: Partial Least Squares combined with Logistic Regression (PLS-LR).
Single biomarker detection
Separation measures available: Divergence, Fisher score, SAM score, T score, p-value of standard t-test, Kolmogorov-Smirnov test statistics, Wilcoxon-Mann-Whitney test statistics, Area Under ROC Curve (AUC).
Preliminary feature filters available: AUC (Area Under ROC Curve), Student’s t-test, Kolmogorov-Smirnov test, Wilcoxon-Mann-Whitney test, Divergence, Fisher score, SAM score, T score.
Classification algorithms available: LDA (Linear Discriminant Analysis), k-NN (k-Nearest Neighbors classifier), Decision Tree, SVM (Support Vector Machines), PLS (Partial Least Squares), Logistic Regression, randomForest — random tree ensemble.
Graphical tools: Feature Scores Bar Plot, Model Performance Line Plot, Heat Map, interactive Feature Browser.
Classification models
Classification algorithms: k-NN — k-Nearest Neighbours algorithm, LDA – Linear Discriminant Analysis, SVM — Support Vector Machines, Tree — Classification Tree, PLS — Partial Least Squares, LR — Logistic Regression, randomForest — random tree committee.
Validation schemes: simple learning/test split, k-fold cross-validation, leave-one-out, random subsampling.