CD-HIT 4.8.1 – Cluster Large Protein database at High Sequence Identity Threshold

CD-HIT 4.8.1

:: DESCRIPTION

CD-HIT is a widely used program for clustering and comparing protein or nucleotide sequences. It is very fast and can handle extremely large databases. CD-HIT significantly reduces the computational and manual effort in many sequence analysis tasks, and it aids in understanding the structure of a dataset and correcting bias within it.
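As a rough illustration of a clustering run, the sketch below assembles a command line for the cd-hit binary and reads the cluster report it writes. It is a minimal sketch under stated assumptions, not an official interface: the helper names build_cdhit_command and parse_clstr and the file names are hypothetical, the options -i, -o, -c and -n are used as described in the CD-HIT user guide, and the .clstr layout (a ">Cluster N" header followed by one member line per sequence) is assumed from typical CD-HIT output.

```python
import re
from typing import Dict, List

# Member lines are assumed to look like: "0<TAB>120aa, >seqA... *"
_MEMBER = re.compile(r">(.+?)\.\.\.")


def build_cdhit_command(fasta_in: str, out_prefix: str,
                        identity: float = 0.9, word_size: int = 5) -> List[str]:
    """Return an argument list for cd-hit; nothing is executed here.

    identity maps to -c (sequence identity threshold) and word_size to -n.
    The defaults 0.9 and 5 are assumptions taken from the user guide.
    """
    return ["cd-hit", "-i", fasta_in, "-o", out_prefix,
            "-c", str(identity), "-n", str(word_size)]


def parse_clstr(text: str) -> Dict[str, List[str]]:
    """Group sequence names by cluster from the text of a .clstr report."""
    clusters: Dict[str, List[str]] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">Cluster"):
            # Header line such as ">Cluster 0" opens a new cluster
            current = line[1:]
            clusters[current] = []
        elif current is not None:
            # Member line: take the name between ">" and the trailing "..."
            match = _MEMBER.search(line)
            if match:
                clusters[current].append(match.group(1))
    return clusters
```

The argument list can be passed to subprocess.run on a machine where cd-hit is installed; parse_clstr then takes the contents of the resulting report file (the output prefix plus ".clstr").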

CD-HIT Online Version

:: DEVELOPER

Group of Weizhong Li, Godzik Lab

:: SCREENSHOTS

N/A

:: REQUIREMENTS

  • Linux

:: DOWNLOAD

 CD-HIT

:: MORE INFORMATION

Citation:

Ying Huang, Beifang Niu, Ying Gao, Limin Fu and Weizhong Li.
CD-HIT Suite: a web server for clustering and comparing biological sequences.
Bioinformatics, 2010, 26(5): 680-682.
