shellfish – Parallel PCA and data processing for Genome-wide SNP data

shellfish

:: DESCRIPTION

shellfish carries out a variety of tasks related to principal component analysis (PCA) of genome-wide SNP data. Unlike other available software, it can run PCA computations in parallel, either on a computing cluster running the Sun Grid Engine or on a single machine with multiple processors. In addition to the PCA calculations, it automates data subsetting and allele matching, using plink and gtool to convert between file formats where necessary. The aim is that tasks that would otherwise require a complex series of shell commands and/or work in R can be carried out with a single, straightforward command.

:: DEVELOPER

Dan Davison

:: SCREENSHOTS

N/A

:: REQUIREMENTS

  • Mac OS X / Linux
  • Python

:: DOWNLOAD

 shellfish

:: MORE INFORMATION