Hu, Ting2; Chen, Yuanzhu2; Kiralis, Jeff W2; Collins, Ryan L2; Wejse, Christian4; Sirugo, Giorgio2; Williams, Scott M2; Moore, Jason H2
1 Department of Public Health, Health, Aarhus University2 unknown3 Department of Public Health - Department of Health Services Research, Department of Public Health, Health, Aarhus University4 Department of Public Health - Department of Health Services Research, Department of Public Health, Health, Aarhus University
Background Epistasis has been historically used to describe the phenomenon that the effect of a given gene on a phenotype can be dependent on one or more other genes, and is an essential element for understanding the association between genetic and phenotypic variations. Quantifying epistasis of orders higher than two is very challenging due to both the computational complexity of enumerating all possible combinations in genome-wide data and the lack of efficient and effective methodologies. Objectives In this study, we propose a fast, non-parametric, and model-free measure for three-way epistasis. Methods Such a measure is based on information gain, and is able to separate all lower order effects from pure three-way epistasis. Results Our method was verified on synthetic data and applied to real data from a candidate-gene study of tuberculosis in a West African population. In the tuberculosis data, we found a statistically significant pure three-way epistatic interaction effect that was stronger than any lower-order associations. Conclusion Our study provides a methodological basis for detecting and characterizing high-order gene-gene interactions in genetic association studies.
American Medical Informatics Association. Journal, 2013, Vol 20, Issue 4