FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis.
BM-BC is a Bayesian method of base calling for Solexa-GA sequencing data. The Bayesian method builds on a hierarchical model that accounts for three sources of noise in the data, which are known to affect the accuracy of the base calls: fading, phasing, and cross-talk between channels.
diCal is a scalable demographic inference method based on the sequentially Markov conditional sampling distribution framework.
diCal-IBD can be used for predicting identical by descent (IBD) tracts in sequence data. It provides means for calculating the accuracy of the prediction, if the true tracts are available, plotting of the predicted tracts, their TMRCA (time to the most recent common ancestor) and corresponding posterior probabilities, and identification of putative recent positive selection through investigation of average IBD sharing
KGGSeq (Genomic and Genetic studies using Sequence data) is a software platform constituted of Bioinformatics and statistical genetics functions making use of valuable biologic resources and knowledge for sequencing-based genetic mapping of variants/genes responsible for human diseases/traits. Simply, KGGSeq is like a fishing rod facilitating geneticists to fish the genetic determinants of human diseases/traits in the big sea of DNA sequences. Compared with other genetic tools like plink/seq, KGGSeq paid more attention downstream analysis of genetic mapping. Currently, a comprehensive and efficient framework was newly implemented on KGGSeq to filter and prioritize genetic variants from whole exome sequencing data.
SeqPop is a program for computing population genetics statistics on sequence data, including Pn, Theta, Pi(i,j), Kst(*), Fst(*), and their Monte Carlo significance for population subdivision.
PRINSEQ (PReprocessing and INformation of SEQuence data.) is a tool that generates summary statistics of sequence and quality data and that is used to filter, reformat and trim next-generation sequence data. It is particular designed for 454/Roche data, but can also be used for other types of sequence data. PRINSEQ is available through a user-friendly web interface or as standalone version. The standalone version is primarily designed for data preprocessing and does not generate summary statistics in graphical form.
scan-x is a software tool designed to find motifs within any sequence data set. The first large scale scan was performed using all available human, mouse, fly and yeast phosphorylation and acetylation data to perform a scan for undiscovered modification sites.
ReadTools provides a consistent and highly tested set of tools for processing sequencing data from any kind of source and focusing on raw reads, while including tools for mapped reads as well.
verifyBamID is a software that verifies whether the reads in particular file match previously known genotypes for an individual (or group of individuals), and checks whether the reads are contaminated as a mixture of two samples
CS23D (Chemical Shift to 3D Structure) is a web server for rapidly generating accurate 3D protein structures using only assigned NMR chemical shifts as input.