Abstract
The goal of cluster analysis is to assign observations into clusters so that observations in the same cluster are similar in some sense. Many clustering methods have been developed in the statistical literature, but these methods are inappropriate for clustering family data, which possess intrinsic familial structure. To incorporate the familial structure, we propose a form of penalized cluster analysis with a tuning parameter controlling the tradeoff between the observation dissimilarity and the familial structure. The tuning parameter is selected based on the concept of clustering stability. The effectiveness of the method is illustrated via simulations and an application to a family study of asthma.
Original language | English (US) |
---|---|
Pages (from-to) | 2128-2136 |
Number of pages | 9 |
Journal | Computational Statistics and Data Analysis |
Volume | 55 |
Issue number | 6 |
DOIs | |
State | Published - Jun 1 2011 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Statistics and Probability
- Computational Mathematics
- Computational Theory and Mathematics
- Applied Mathematics
Keywords
- Consistency
- Cross-validation
- K-means
- Kinship
- Stability