Abstract
In family studies, canonical discriminant analysis can be used to find linear combinations of phenotypes that exhibit high ratios of between-family to within-family variabilities. But with large numbers of phenotypes, canonical discriminant analysis may overfit. To estimate the predicted ratios associated with the coefficients obtained from canonical discriminant analysis, two methods are developed; one is based on bias correction and the other based on cross-validation. Because the cross-validation is computationally intensive, an approximation to the cross-validation is also developed. Furthermore, these methods can be applied to perform variable selection in canonical discriminant analysis. The proposed methods are illustrated with simulation studies and applications to two real examples.
Original language | English (US) |
---|---|
Pages (from-to) | 124-132 |
Number of pages | 9 |
Journal | Biometrics |
Volume | 67 |
Issue number | 1 |
DOIs | |
State | Published - Mar 2011 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Statistics and Probability
- General Biochemistry, Genetics and Molecular Biology
- General Immunology and Microbiology
- General Agricultural and Biological Sciences
- Applied Mathematics
Keywords
- Cross-validation
- Heritability
- Model selection
- Optimism