Abstract
In longitudinal genetic studies, investigators collect repeated measurements on a trait that changes with time along with genetic markers. For family-based longitudinal studies, since repeated measurements are nested within subjects and subjects are nested within families, both the subject level and the measurement level correlations must be taken into account in the statistical analysis to achieve more accurate estimation. In such studies, the primary interests include testing for a quantitative trait locus effect, and estimating the age-specific quantitative trait locus effect and residual polygenic heritability function. We propose flexible semiparametric models and their statistical estimation and hypothesis testing procedures for longitudinal genetic data. We employ penalized splines to estimate non-parametric functions in the model. We find that misspecifying the baseline function or the genetic effect function in a parametric analysis may lead to a substantially inflated or highly conservative type I error rate on testing and large mean-squared error on estimation. We apply the proposed approaches to examine age-specific effects of genetic variants reported in a recent genomewide association study of blood pressure collected in the Framingham Heart Study.
Original language | English (US) |
---|---|
Pages (from-to) | 1-24 |
Number of pages | 24 |
Journal | Journal of the Royal Statistical Society. Series C: Applied Statistics |
Volume | 61 |
Issue number | 1 |
DOIs | |
State | Published - Jan 2012 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Statistics and Probability
- Statistics, Probability and Uncertainty
Keywords
- Genomewide association study
- Penalized splines
- Quantitative trait locus