The hapassoc R package implementing likelihood inference of trait associations with SNP haplotypes and other attributes using the EM Algorithm.

The identification of genetic factors influencing susceptibility to complex disorders such as diabetes and cancer is important for improving understanding of disease pathways and for disease prevention. Associations between complex traits and single nucleotide polymorphisms (SNPs) in candidate genomic regions can provide a useful tool for such identification. However, analysis of trait associations with single SNPs ignores the potential for extra information from joint consideration of multiple SNPs on a haplotype. When haplotype phase can be resolved, regression models may be used to adjust associations between complex traits and haplotypes for the effects of nongenetic cofactors or attributes, and to investigate whether nongenetic cofactors modify haplotype associations. However, resolution of haplotype phase can be problematic, particularly when data are collected on unrelated subjects. We develop a likelihood approach to inference of haplotype and nongenetic effects and their interactions in generalized linear models of disease penetrance, when haplotype phase is unknown for some subjects. The likelihood is formulated assuming population Hardy-Weinberg equilibrium and independence of haplotype and nongenetic covariates. Parameter estimates are obtained by use of an expectation-maximization (EM) algorithm and standard errors are calculated using Louis' formula.

The hapassoc R package is available on the Comprehensive R Archive Network website. See the package vignette for a description of the underlying statistical methods and illustration of the use of hapassoc through examples.

You will have to have R installed on your system to use it. To install hapassoc you can execute the following command in R:


Windows users can also install the package via the "Packages" menu item. hapassoc is part of the statistical genetics task view, so if you have this task view installed, you already have hapassoc installed.

The software is licenced under the GNU General Public Licence.

For more details on the implementation, please see our vignette, or the earlier paper in Human Heredity.