CLUMPHAP 1.1 – Haplotype-based Association Analysis



CLUMPHAP implements a novel method for association testing based on clustering similar haplotypes (Knight et al. 2008). It extends the basic methodology used in CLUMP, a program designed for the analysis of multi-allelic markers (Sham and Curtis 1995). CLUMPHAP calculates chi-squared statistics for binary partitions of haplotypes, where the number of partitions is reduced by allowing only those that are supported by a hierarchical cluster analysis of the haplotypes. It obtains the empirical significance level of the largest chi-squared statistic by a permutation procedure: multiple permuted datasets, in which the case-control labels have been randomly reassigned, are subjected to exactly the same procedure of haplotype partitioning and calculation of the largest chi-squared statistic. This permutation procedure accounts not only for the inflation of the test statistic caused by maximizing over the multiple ways of partitioning the haplotypes, but also for the uncertainty in the haplotype phase of individual subjects (Curtis and Sham 2006). The results are easy to interpret: a significant result suggests that a disease-causing variant is present on haplotypes in the group that has an increased overall frequency in cases. CLUMPHAP reports the cluster pattern that produced the highest chi-squared statistic, along with the statistic itself and the empirical p-value.
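The procedure described above can be illustrated with a minimal Python sketch. It is not CLUMPHAP's code: the function names, the toy data, the use of average-linkage clustering on Hamming distance, and the treatment of every tree node (including single haplotypes) as a candidate partition are all assumptions made for illustration. It also assumes that haplotype phase is known and permutes labels per chromosome, whereas CLUMPHAP itself accounts for phase uncertainty.

```python
import random
from itertools import combinations

def hamming(a, b):
    """Number of positions at which two equal-length haplotype strings differ."""
    return sum(x != y for x, y in zip(a, b))

def candidate_clusters(haplotypes):
    """Average-linkage agglomerative clustering (an assumed choice).
    Returns every singleton and every merged cluster except the root."""
    clusters = [frozenset([h]) for h in haplotypes]
    found = list(clusters)

    def avg_dist(pair):
        a, b = pair
        return sum(hamming(x, y) for x in a for y in b) / (len(a) * len(b))

    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=avg_dist)
        merged = a | b
        clusters = [c for c in clusters if c not in (a, b)] + [merged]
        if len(clusters) > 1:  # the root holds everything, so it defines no partition
            found.append(merged)
    return found

def chi2_2x2(a, b, c, d):
    """Pearson chi-squared statistic for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return 0.0 if denom == 0 else n * (a * d - b * c) ** 2 / denom

def max_chi2(observations, clusters):
    """Largest chi-squared over all 'in cluster' vs 'not in cluster' partitions.
    observations is a list of (haplotype, is_case) pairs."""
    n_case = sum(1 for _, is_case in observations if is_case)
    n_ctrl = len(observations) - n_case
    best_stat, best_cluster = 0.0, None
    for cluster in clusters:
        a = sum(1 for h, is_case in observations if is_case and h in cluster)
        c = sum(1 for h, is_case in observations if not is_case and h in cluster)
        stat = chi2_2x2(a, n_case - a, c, n_ctrl - c)
        if stat > best_stat:
            best_stat, best_cluster = stat, cluster
    return best_stat, best_cluster

def permutation_test(observations, n_perm=999, seed=0):
    """Empirical p-value of the largest chi-squared under label permutation."""
    rng = random.Random(seed)
    haps = [h for h, _ in observations]
    labels = [is_case for _, is_case in observations]
    # With known phase the tree depends only on the haplotypes, so it is built once.
    clusters = candidate_clusters(sorted(set(haps)))
    observed, best_cluster = max_chi2(observations, clusters)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # randomly reassign case-control labels
        perm_stat, _unused = max_chi2(list(zip(haps, labels)), clusters)
        if perm_stat >= observed:
            hits += 1
    return observed, best_cluster, (hits + 1) / (n_perm + 1)

# Toy data (hypothetical): "111x" haplotypes are enriched in cases.
observations = (
    [("1111", True)] * 30 + [("1110", True)] * 25
    + [("0000", True)] * 25 + [("0001", True)] * 20
    + [("1111", False)] * 15 + [("1110", False)] * 10
    + [("0000", False)] * 40 + [("0001", False)] * 35
)
observed, best, p = permutation_test(observations)
print(f"max chi-squared = {observed:.2f}, cluster = {sorted(best)}, empirical p = {p:.3f}")
```

Because each permuted dataset goes through the same maximization as the real data, the empirical p-value already reflects the multiple partitions tested and needs no further correction. A binary partition and its complement give the same statistic, so the reported cluster may be either the group enriched in cases or the group depleted in cases.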



Dave Curtis


Command Line


  • Windows





Knight J, Curtis D, Sham PC (2008)
CLUMPHAP: A simple tool for performing haplotype-based association analysis.
Genet Epidemiol. 2008 Sep;32(6):539-45.
