TGICL 2.1 – TGI Clustering Tools

TGICL 2.1

:: DESCRIPTION

TGICL is a software system for fast clustering of large EST datasets. This package automates clustering and assembly of a large EST/mRNA dataset. The clustering is performed by a slightly modified version of NCBI’s megablast , and the resulting clusters are then assembled using CAP3 assembly program. TGICL starts with a large multi-FASTA file (and an optional peer quality values file) and outputs the assembly files as produced by CAP3. Both clustering and assembly phases can be parallelized by distributing the searches and the assembly jobs across multiple CPUs, as TGICL can take advantage of either SMP machines or PVM (Parallel Virtual Machine) clusters. The two full precompiled packages below were built on Linux and SunOS, respectively. They include CAP3, mgblast and all the other binaries for this platform (of course, except the base Unix utilities like ‘sed’, ‘sort’ etc.). Please note that only the Linux version was thoroughly tested at DFCI.

::DEVELOPER

The Gene Index Project

:: REQUIREMENTS

  • Windows / MacOsX / Linux
  • Perl

:: DOWNLOAD

 TGICL

:: MORE INFORMATION

Citation:

Bioinformatics. 2003 Mar 22;19(5):651-2.
TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.
Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J.

 

MADAP 2.0 – Clustering Tool for One-dimensional Genome Annotation Data Mapped onto Genome Sequences

MADAP 2.0

:: DESCRIPTION

MADAP is a flexible clustering tool for the interpretation of one-dimensional genome annotation data mapped onto complete or partial genome sequences. Such data might consist in counts, probabilities, or intensities and be obtained from cDNA and tag sequencing protocols to map the 5′ and 3’ends of mRNA, from ChIP-chip analysis, or from genome-wide SNP-typing used in genotype-phenotype association studies. MADAP identifies groups of data corresponding to one or several genomic sites, and estimates the volume and extension of such groups (clusters).

::DEVELOPER

Computational Cancer Genomics

:: SCREENSHOTS

N/A

:: REQUIREMENTS

  • Linux / Windows

:: DOWNLOAD

 MADAP

:: MORE INFORMATION

Citation

Schmid CD, Sengstag T, Bucher P and Delorenzi M. (2007)
MADAP, a flexible clustering tool for the interpretation of one-dimensional genome annotation data.
Nucleic Acids Res, 35, W201-205.