TY - JOUR
T1 - U-statistics-based tests for multiple genes in genetic association studies
AU - Wei, Zhi
AU - Li, Mingyao
AU - Rebbeck, Timothy
AU - Li, Hongzhe
PY - 2008
Y1 - 2008
N2 - As our understanding of biological pathways and the genes that regulate these pathways increases, consideration of these biological pathways has become an increasingly important part of genetic and molecular epidemiology. Pathway-based genetic association studies often involve genotyping of variants in genes acting in certain biological pathways. Such pathway-based genetic association studies can potentially capture the highly heterogeneous nature of many complex traits, with multiple causative loci and multiple alleles at some of the causative loci. In this paper, we develop two nonparametric test statistics that consider simultaneously the effects of multiple markers. Our approach, which is based on data-adaptive U-statistics, can handle both qualitative data such as case-control data and quantitative continuous phenotype data. Simulations demonstrate that our proposed methods are more powerful than standard methods, especially when there are multiple risk loci each with small genetic effects. When the number of disease-predisposing genes is small, the data-adaptive weighting of the U-statistics over all the markers produces similar power to commonly used single marker tests. We further illustrate the potential merits of our proposed tests in the analysis of a data set from a pathway-based candidate gene association study of breast cancer and hormone metabolism pathways. Finally, potential applications of the proposed tests to genome-wide association studies are also discussed.
AB - As our understanding of biological pathways and the genes that regulate these pathways increases, consideration of these biological pathways has become an increasingly important part of genetic and molecular epidemiology. Pathway-based genetic association studies often involve genotyping of variants in genes acting in certain biological pathways. Such pathway-based genetic association studies can potentially capture the highly heterogeneous nature of many complex traits, with multiple causative loci and multiple alleles at some of the causative loci. In this paper, we develop two nonparametric test statistics that consider simultaneously the effects of multiple markers. Our approach, which is based on data-adaptive U-statistics, can handle both qualitative data such as case-control data and quantitative continuous phenotype data. Simulations demonstrate that our proposed methods are more powerful than standard methods, especially when there are multiple risk loci each with small genetic effects. When the number of disease-predisposing genes is small, the data-adaptive weighting of the U-statistics over all the markers produces similar power to commonly used single marker tests. We further illustrate the potential merits of our proposed tests in the analysis of a data set from a pathway-based candidate gene association study of breast cancer and hormone metabolism pathways. Finally, potential applications of the proposed tests to genome-wide association studies are also discussed.
KW - Breast cancer
KW - Genetic heterogeneity
KW - Genetic pathways
KW - Global tests
UR - http://www.scopus.com/inward/record.url?scp=54049114308&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=54049114308&partnerID=8YFLogxK
U2 - 10.1111/j.1469-1809.2008.00473.x
DO - 10.1111/j.1469-1809.2008.00473.x
M3 - Article
C2 - 18691161
AN - SCOPUS:54049114308
SN - 0003-4800
VL - 72
SP - 821
EP - 833
JO - Annals of Human Genetics
JF - Annals of Human Genetics
IS - 6
ER -