Panache is a web-based interface designed for the visualization of linearized pangenomes. It can be used to show presence/absence information of pangenomic blocks of sequence or genes in a browser-like display.
VisSPA (Visual Signature Pattern Analysis) is a program that accepts an alignment of sequences, interactively analyzes frequencies of residues at each position in groups of sequences and supports the generation of sequence logos. The frequency calculations and program name were initially inspired by VESPA (by Gerry Myers and Bette Korber of Los Alamos National Labs).
VarPlot is a program that accepts an alignment of sequences, and interactively plots sliding-window analyses of genetic distance among sequences or groups of sequences. It is different from SimPlot because in SimPlot, all groups are compared to one query, whereas in VarPlot all groups are compared to one another. Also, VarPlot includes calculation of nonsynonymous and synonymous distance (using the Nei and Gojobori method at present).
The readDepth package for R can detect copy number aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. It achieves higher accuracy than many other packages, and runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets.
FaBox is a collection of simple and intuitive web services that enable biologists and medical researchers to quickly perform typical task with sequence data. The services makes it easy to extract, edit, and replace sequence headers and join or divide data sets based on header information. Other services include collapsing a set of sequences into haplotypes and automated formatting of input files for a number of population genetics programs, such as ARLEQUIN , TCS and MRBAYES . The toolbox is expected to grow on the basis of requests for particular services and converters in the future.
The SeSiMCMC (Sequence Similarities by Markov Chain Monte Carlo) algorithm finds DNA motifs of unknown length and complicated structure, such as direct repeats or palindromes with variable spacers in the middle in a set of unaligned DNA sequences. It uses an improved motif length estimator and careful Bayesian analysis to consider site absence in a sequence.