DIAL 20110606 – De novo Identification of Alleles

DIAL 20110606

:: DESCRIPTION

DIAL is a computational pipeline for identifying single-base substitutions between two closely related genomes without the help of a reference genome.

::DEVELOPER

Aakrosh Ratan (ratan@bx.psu.edu) at Miller Lab

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Linux

:: DOWNLOAD

DIAL

:: MORE INFORMATION

Citation

BMC Bioinformatics. 2010 Mar 15;11:130. doi: 10.1186/1471-2105-11-130.
Calling SNPs without a reference sequence.
Ratan A, Zhang Y, Hayes VM, Schuster SC, Miller W.

SMART TOOLS – Implementation of a de novo Genome-wide Computational approach for updating Brachypodium miRNAs

SMART TOOLS

:: DESCRIPTION

SMART TOOLS (smallRNA Team) comprises the moduleas:

Identification of stress-related small RNAs and miRNAs in plants.
Software development for discovery of novel miRNA genes in plant genomes.
Cancer-related miRNAs and their target sites
Promoter-associated small RNAs and methylation gene profile in plants.
Computational tools for small RNA annotation and expression profiling.

::DEVELOPER

University of Plovidv, Dept. Plant Physiology and Molecular Biology,

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Windows / MacOsX / Linux
Bioperl
Perl
Java
Vienna

:: DOWNLOAD

SMART TOOLS

:: MORE INFORMATION

Citation

Baev V, Milev I, Naydenov M, Apostolova E, Minkov G, Minkov I, Yahubyan G.
“Implementation of a de novo genome-wide computational approach for updating Brachypodium miRNAs”
Genomics. 2011 Feb 28. [Epub ahead of print] PMID: 21371551

VICUNA 1.1 – de novo Assembler for Ultra-deep Sequence data

VICUNA 1.1

:: DESCRIPTION

VICUNA is a de novo assembly program targeting populations with high mutation rates. It creates a single linear representation of the mixed population on which intra-host variants can be mapped. For clinical samples rich in contamination (e.g., >95%), VICUNA can leverage existing genomes, if available, to assemble only target-alike reads. After initial assembly, it can also use existing genomes to perform guided merging of contigs. For each data set (e.g., Illumina paired read, 454), VICUNA outputs consensus sequence(s) and the corresponding multiple sequence alignment of constituent reads.

::DEVELOPER

Computational R&D, The Broad Institute, Cambridge, MA

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Linux

:: DOWNLOAD

VICUNA

:: MORE INFORMATION

Citation

Xiao Yang, Patrick Charlebois, Sante Gnerre, Matthew G Coole, Niall J. Lennon, Joshua Z. Levin, James Qu, Elizabeth M. Ryan, Michael C. Zody, and Matthew R. Henn (2012)
De novo assembly of highly diverse viral populations.
BMC Genomics 13:475.

AV454 1.0 – de novo Consensus Assembler designed for reads derived from diverse Viral Populations

AV454 1.0

:: DESCRIPTION

AV454 (AssembleViral454) is an assembler, based on the ARACHNE package, designed for small and non-repetitive genomes sequenced at high depth. It was specifically designed to assemble read data generated from a mixed population of viral genomes. Reads need not be paired, and it is assumed that no sequence repeat in the genome would be large enough to fully contain an average read.

::DEVELOPER

Computational R&D, The Broad Institute, Cambridge, MA

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Linux

:: DOWNLOAD

AV454

:: MORE INFORMATION

QSRA 1.0 – Quality-value guided de novo Short Read Assembler

QSRA 1.0

:: DESCRIPTION

QSRA (Quality-value-guided Short Read Assembler) is a quality-value guided de novo short read assembler. QSRA generally produced the highest genomic coverage, while being faster than VCAKE. QSRA is extremely competitive in its longest contig and N50/N80 contig lengths, producing results of similar quality to those of EDENA and VELVET. QSRA provides a step closer to the goal of de novo assembly of complex genomes, improving upon the original VCAKE algorithm by not only drastically reducing runtimes but also increasing the viability of the assembly algorithm through further error handling capabilities.

::DEVELOPER

Mockler Lab

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Linux

:: DOWNLOAD

QSRA

:: MORE INFORMATION

Citation

Bryant DW, Wong WK, Mockler TC.
QSRA – a quality-value guided de novo short read assembler.
BMC Bioinformatics 2009, 10:69

Tag-DB 0.4.0 – de novo Sequencing and Tag-based database Searching of MS/MS Spectra

Tag-DB 0.4.0

:: DESCRIPTION

Tag-DB provides a user-friendly, lightweight and open-source graphical user interface for running the de novo sequencing algorithm PepNovo+ and also holds the possibiliy to search derived short amino acid sequences (so-called tags) against a protein database in order to retrieve peptide and protein identifications.

::DEVELOPER

Thilo Muth and Harald Barsnes.

:: SCREENSHOTS

:: REQUIREMENTS

Windows / Linux
Java

:: DOWNLOAD

Tag-DB

:: MORE INFORMATION

Pasqual 1.0 – Parallel de Novo Genome Sequence Assembler

Pasqual 1.0

:: DESCRIPTION

PASQUAL (PArallel SeQUence AssembLer) is designed for shared memory parallelism, using OpenMP due to its good tradeoff between performance and programmer productivity. Shared memory parallelism has become mainstream with the widespread production of multicore commodity processors. For PASQUAL we follow the OLC approach and use a careful combination of tailored algorithms and data structures to obtain high-quality solutions.

::DEVELOPER

Bader HPC Lab @ GeorgiaTech

:: REQUIREMENTS

Linux

:: DOWNLOAD

Pasqual

:: MORE INFORMATION

Citation

Xing Liu, Pushkar R. Pande, Henning Meyerhenke, and David A. Bader.
PASQUAL: A Parallel de novo Assembler for Next Generation Genome Sequencing.
Submitted for journal publication, 2011.

GS De Novo Assembler – de novo DNA Sequence Assembly

GS De Novo Assembler

:: DESCRIPTION

GS De Novo Assembler (Newbler) is a software package for de novo DNA sequence assembly. It is designed specifically for assembling sequence data generated by the 454 GS-series of pyrosequencing platforms sold by 454 Life Science, a Roche diagnostic.

::DEVELOPER

454 Sequencing

:: SCREENSHOTS

:: REQUIREMENTS

Linux

:: DOWNLOAD

GS De Novo Assembler

:: MORE INFORMATION

SHORTY 2.0 – de novo Assembler

SHORTY 2.0

:: DESCRIPTION

SHORTY is targetted for de novo assembly of microreads with mate pair information and sequencing errors. SHORTY has some novel approach and features in addressing the short read assembly problem.

::DEVELOPER

Steven Skiena

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Windows / Linux / MacOsX
C++ Compiler

:: DOWNLOAD

SHORTY

:: MORE INFORMATION

Citation:

BMC Bioinformatics. 2009 Jan 30;10 Suppl 1:S16.
Crystallizing short-read assemblies around seeds.
Hossain MS, Azimi N, Skiena S.

LutefiskXP 1.0.7 / Lutefisk – De novo Interpretation of Peptide CID Cpectra

LutefiskXP 1.0.7 / Lutefisk

:: DESCRIPTION

Lutefisk is software for the de novo interpretation of peptide CID spectra.High quality tandem mass spectra of peptides are often obtained for which no exact database match can be made. Consequently, we are faced with the question of whether the protein under investigation is novel, or if the non-matching spectra are due to less exciting prospects such as inter-species variation, database sequence errors, or unexpected proteolytic cleavages. To begin addressing this problem we perform a de novointerpretation of the CID spectra using the computer program Lutefisk; however, any such interpretations nearly always yield multiple sequence candidates, where it is often difficult or impossible to distinguish the correct sequence from the incorrect ones. The variations between candidate sequences are often minor and typically involve dipeptide inversions, swapping of dipeptides of the same mass, replacements of dipeptides with single amino acids of the same mass, and replacements of amino acids by dipeptides of the same mass. We use the multiple sequence candidates produced by Lutefisk as query sequences in a second program, CIDentify. CIDentify is a version of Bill Pearson’s FASTA algorithm modified by Alex Taylor to accommodate MS nuances such as multiple query sequences, ambiguous dipeptides and isobaric mass equivalencies.

::DEVELOPER

Richard S. Johnson( jsrichar@alum.mit.edu)

:: SCREENSHOTS

N/A

:: REQUIREMENTS

Linux/windows/MacOsX
C++ compiler

:: DOWNLOAD

LutefiskXP

:: MORE INFORMATION

Citation

R. S. Johnson and J. A. Taylor (2002)
“Searching sequence databases via de novo peptide sequencing by tandem mass spectrometry“,
Mol. Biotechnology Vol. 22, No. 3. (November 2002), pp. 301-315.