The “care” package a novel multivariate algorithm for large scale SNP selection using CAR score regression, a promising new approach for prioritizing biomarkers. CAR scores measure the correlation between the response and the Mahalanobis-decorrelated predictors. The squared CAR score is a natural measure of variable importance and provides a canonical ordering of variables. This package provides functions for estimating CAR scores, for variable selection using CAR scores, and for estimating corresponding regression coefficients. Both shrinkage as well as empirical estimators are available.
SLR (Sitewise Likehood Ratio) is a program to detect sites in coding DNA that are unusually conserved and/or unusually variable (that is, evolving under purify or positive selection) by analysing the pattern of changes for an alignment of sequences on an evolutionary tree. The strength of selection at each site is determined by comparing the rate of nonsynonymous (amino acid changing) substitutions to that of synonymous (silent) substituions, the latter assumed to be invisible to selection and so evolving in a strictly neutral fashion.
Gblocks is a computer program written in C that eliminates poorly aligned positions and divergent regions of a DNA or protein alignment so that it becomes more suitable for phylogenetic analysis.
TreeSAAP (Selection of Amino Acid Properties Based on Phylogenetic Trees) measures the selective influences on 31 structural and biochemical amino acid properties during phylogenesis (the history of genealogical development) and performs goodness-of-fit and categorical statistical tests.
TagMix is an integrated cross-populations LD-based, haplotype-based and principal component analysis genome-wide tag SNPs selection algorithm to efficiently identify informative variants, prioritized across multi-populations of low LD and high diversity populations for custom chip array.
CLASS is a software tool for accurately assembling splice variants using local read coverage patterns of RNA-seq reads, contiguity constraints from read pairs and spliced reads, and optionally information about gene structure extracted from cDNA sequence databases.
IMPACT_S is a java swing based Graphical User Interface program for analyzing and/or combining several selection results from programs like, PAML (codeml), Datamonkey, TreeSAAP. Provides 3D homology modeling through Swiss Model and 3D mapping of selection results through Jmol.