Comrad is a novel algorithmic framework for the integrated analysis of RNA-Seq and Whole Genome Shotgun Sequencing (WGSS) data for the purposes of discovering genomic rearrangements and aberrant transcripts. The Comrad framework leverages the advantages of both RNA-Seq and WGSS data, providing accurate classification of rearrangements as expressed or not expressed and accurate classification of the genomic or non-genomic origin of aberrant transcripts.
Heinz is a tool for single-species active module discovery.
xHeinz is a software solver that searches for active subnetwork modules that are conserved between two species. It uses a branch-and-cut algorithm that finds provably optimal or near-optimal solutions.
VISDA (VIsual and Statistical Data Analyzer) is a software for cluster modeling, visualization, and discovery in genomic data. VISDA performs progressive, coarse-to-fine (divisive) hierarchical clustering and visualization, supported by hierarchical mixture modeling, supervised/unsupervised informative gene selection, supervised/unsupervised data visualization, and user/prior knowledge guidance, to discover hidden clusters within complex, high-dimensional genomic data.
GeneProgram is a new unsupervised computational framework based on Hierarchical Dirichlet Processes that addresses each of the above challenges. GeneProgram uses expression data to simultaneously organize tissues into groups and genes into overlapping programs with consistent temporal behavior, to produce maps of expression programs, which are sorted by generality scores that exploit the automatically learned groupings.
Lumpy is a new probabilistic framework that we have developed to integrate multiple structural variation signals such as discordant paired-end alignments and split-read alignments. While it is clear that integrating all SV signals is important for sensitive discovery, most existing (including our own Hydra) tools only exploit one signal. Lumpy integrates multiple signals in order to improve sensitivity and breakpoint resolution. This is especially important for cancer genome analysis where tumor heterogeneity causes potentially important rearrangements occur with less supporting alignments in the sampled DNA.
Hydra detects structural variation (SV) breakpoints by clustering discordant paired-end alignments whose “signatures” corroborate the same putative breakpoint. Hydra can detect breakpoints caused by all classes of structural variation. Moreover, it was designed to detect variation in both unique and duplicated genomic regions; therefore, it will examine paired-end reads having multiple discordant alignments.
Hydra-Multi is a paired-end read structural variant discovery tool that is capable of integrating signals from hundreds of samples.