Ghosh, Priyanka, Sriram Krishnamoorthy, and Ananth Kalyanaraman.
“PaKman: A Scalable Algorithm for Generating Genomic Contigs on Distributed Memory Machines.”
IEEE Transactions on Parallel and Distributed Systems (TPDS) vol. 32, no. 5, pp. 1191-1209, 2021. DOI: 10.1109/TPDS.2020.3043241.
ABySS (Assembly By Short Sequences) is a de novo, parallel, paired-end sequence assembler that is designed for short reads. The single-processor version is useful for assembling genomes up to 100 Mbases in size. The parallel version is implemented using MPI and is capable of assembling larger genomes.
HECTOR is a parallel multistage homopolymer spectrum based error corrector for 454 sequencing data. Its authors describe it as the first tool to use a homopolymer spectrum based approach to handle homopolymer insertions and deletions, which are the dominant sequencing errors in 454 short reads.
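To make the notion of a homopolymer insertion or deletion concrete, the following minimal sketch splits a read into its homopolymer runs (maximal stretches of one repeated base). It is only an illustration and not HECTOR's actual algorithm; the function name `homopolymer_runs` and the example reads are hypothetical.

```python
from itertools import groupby


def homopolymer_runs(seq):
    """Split a sequence into (base, run_length) pairs, one per homopolymer run."""
    return [(base, len(list(group))) for base, group in groupby(seq)]


# Hypothetical reads: the second one has lost an 'A' from the leading run.
reference = "AAACCGT"
read = "AACCGT"

ref_runs = homopolymer_runs(reference)
read_runs = homopolymer_runs(read)

# A homopolymer indel changes a run length but leaves the order of bases intact.
same_base_order = [b for b, _ in ref_runs] == [b for b, _ in read_runs]
```

In this representation the two reads share the same order of bases and differ only in the length of the first run, which is the kind of error the description above refers to.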
BitPAl is a bit-parallel algorithm for general, integer-scoring global alignment. Integer-scoring schemes assign integer weights for match, mismatch, and insertion/deletion.
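As a point of reference for what integer-scoring global alignment computes, here is a minimal sketch of the classic dynamic-programming recurrence with integer weights. It is the plain cell-by-cell formulation, not BitPAl's bit-parallel method; the function name and the default weights (match 1, mismatch -1, insertion/deletion -2) are assumptions chosen for illustration.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, indel=-2):
    """Return the optimal global alignment score of a and b under integer weights."""
    # Row 0: aligning the empty prefix of a with b[:j] costs j indels.
    prev = [j * indel for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a[:i] with the empty prefix of b costs i indels.
        curr = [i * indel]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + indel        # a[i-1] aligned to a gap
            left = curr[j - 1] + indel  # b[j-1] aligned to a gap
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]
```

This version keeps only two rows of the table at a time, so memory grows with the length of `b` while time grows with the product of the two lengths.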
PLAST (Parallel Local Alignment Search Tool for Database Comparison) is a parallel, BLAST-like tool for intensive comparison of sequence banks. The algorithm exploits two key parallel features of existing and future microprocessors: the SIMD programming model (SSE instruction set) and multithreading (multicore). Compared to multithreaded BLAST software, tests performed on an 8-processor server have shown speedups ranging from 3 to 6 with a similar level of accuracy.
GDASC is a GPU-parallel server for identifying batches in a high-dimensional gene expression dataset and classifying samples into those batches.
Parallel T-Coffee (PTC) is the first parallel implementation of the TCoffee multiple sequence alignment tool. It was developed to overcome the main limitations of the original method. It is based on the MPI and RMA mechanisms and can be run on distributed memory clusters. PTC supports the majority of options provided by TCoffee 3.79, including the 3D-Coffee mode. It can align data sets consisting of hundreds of proteins within reasonable time limits.