BMC Bioinformatics. 2014;15 Suppl 6:S4. doi: 10.1186/1471-2105-15-S6-S4. Epub 2014 May 16. Furby: fuzzy force-directed bicluster visualization.
Streit M, Gratzl S, Gillhofer M, Mayr A, Mitterecker A, Hochreiter S.
Bi-Force is a novel way of modeling the problem as combinatorial optimization problem on graphs: Weighted Bi-Cluster Editing. It is a very flexible model that can handle arbitrary kinds of multi-condition data sets (not limited to gene expression).
FABIA (Factor Analysis for Bicluster Acquisition) is a model-based technique for biclustering, that is clustering rows and columns simultaneously. FABIA is a multiplicative model that assumes realistic non-Gaussian signal distributions with heavy tails. FABIA utilizes well understood model selection techniques like variational approaches and applies the Bayesian framework. The generative framework allows FABIA to determine the information content of each bicluster to separate spurious biclusters from true biclusters. On 100 simulated data sets with known true, artificially implanted biclusters, FABIA clearly outperformed all 11 competitors. FABIA was tested on microarray data sets which known, biological verfified subclusters and performed on average best out of 11 biclustering approaches.
Sepp Hochreiter, Ulrich Bodenhofer, Martin Heusel, Andreas Mayr, Andreas Mitterecker, Adetayo Kasim, Tatsiana Khamiakova, Suzy Van Sanden, Dan Lin, Willem Talloen, Luc Bijnens, Hinrich W.H. Göhlmann, Ziv Shkedy, and Djork-Arné Clevert. FABIA: Factor Analysis for Bicluster Acquisition,
Bioinformatics 2010, 26(12):1520-1527,
EDISA (Extended Dimension Iterative Signature Algorithm) is a novel probabilistic clustering approach for 3D gene-condition-time datasets. Based on mathematical definitions of gene expression modules, the EDISA samples initial modules from the dataset which are then refined by removing genes and conditions until they comply with the module definition. A subsequent extension step ensures gene and condition maximality. We applied the algorithm to a synthetic dataset and were able to successfully recover the implanted modules over a range of background noise intensities.
relax_bicluster is a biclustering algorithm based the probabilistic relaxation labeling framework for discovering geometric biclusters of gene expression data.
::DEVELOPER
Hong Yan , Signal Processing Lab at City University of Hong Kong
ExpressionView is an R package that provides an interactive environment to explore biclusters identified in gene expression data. A sophisticated ordering algorithm is used to present the biclusters in a visually appealing layout. From this overview, the user can select individual biclusters and access all the biologically relevant data associated with it. The package is aimed to facilitate the collaboration between bioinformaticians and life scientists who are not familiar with the R language.