Variable Selection in Canonical Discriminant Analysis for Family Studies

Man Jin, Yixin Fang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In family studies, canonical discriminant analysis can be used to find linear combinations of phenotypes that exhibit high ratios of between-family to within-family variabilities. But with large numbers of phenotypes, canonical discriminant analysis may overfit. To estimate the predicted ratios associated with the coefficients obtained from canonical discriminant analysis, two methods are developed; one is based on bias correction and the other based on cross-validation. Because the cross-validation is computationally intensive, an approximation to the cross-validation is also developed. Furthermore, these methods can be applied to perform variable selection in canonical discriminant analysis. The proposed methods are illustrated with simulation studies and applications to two real examples.

Original languageEnglish (US)
Pages (from-to)124-132
Number of pages9
JournalBiometrics
Volume67
Issue number1
DOIs
StatePublished - Mar 2011
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics

Keywords

  • Cross-validation
  • Heritability
  • Model selection
  • Optimism

Fingerprint

Dive into the research topics of 'Variable Selection in Canonical Discriminant Analysis for Family Studies'. Together they form a unique fingerprint.

Cite this