TY - JOUR
T1 - Diametrical clustering for identifying anti-correlated gene clusters
AU - Dhillon, Inderjit S.
AU - Marcotte, Edward M.
AU - Roshan, Usman
N1 - Funding Information:
We would like to thank Vishwanath Iyer for helpful discussion and Usman Shakil for helping with the web page. This work was supported by a grant from the Welch Foundation (E.M.M.), a Dreyfus New Faculty Award (E.M.M.), the Texas Advanced Research Program (E.M.M. and I.S.D.), the NSF (E.M.M.), and NSF Career Award Grant No. ACI-0093404 (I.S.D.).
PY - 2003/9/1
Y1 - 2003/9/1
N2 - Motivation: Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. Results: We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
AB - Motivation: Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. Results: We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
UR - http://www.scopus.com/inward/record.url?scp=0141850419&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0141850419&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btg209
DO - 10.1093/bioinformatics/btg209
M3 - Article
C2 - 12967956
AN - SCOPUS:0141850419
SN - 1367-4803
VL - 19
SP - 1612
EP - 1619
JO - Bioinformatics
JF - Bioinformatics
IS - 13
ER -