TY - GEN
T1 - A lin-kernighan heuristic for the DCJ median problem of genomes with unequal contents
AU - Yin, Zhaoming
AU - Tang, Jijun
AU - Schaeffer, Stephen W.
AU - Bader, David A.
PY - 2014
Y1 - 2014
N2 - In this paper, we designed a distance metric as DCJ-Indel-Exemplar distance to estimate the dissimilarity between two genomes with unequal contents (with gene insertions/deletions (Indels) and duplications). Based on the aforementioned distance metric, we proposed the DCJ-Indel-Exemplar median problem, to find a median genome that minimize the DCJ-Indel-Exemplar distance between this genome and the given three genomes. We adapted Lin-Kernighan (LK) heuristic to calculate the median quickly by utilizing the features of adequate sub-graph decomposition and search space reduction technologies. Experimental results on simulated gene order data indicate that our distance estimator can closely estimate the real number of rearrangement events; while compared with the exact solver using equal content genomes, our median solver can get very accurate results as well. More importantly, our median solver can deal with Indels and duplications and generates results very close to the synthetic cumulative number of evolutionary events.
AB - In this paper, we designed a distance metric as DCJ-Indel-Exemplar distance to estimate the dissimilarity between two genomes with unequal contents (with gene insertions/deletions (Indels) and duplications). Based on the aforementioned distance metric, we proposed the DCJ-Indel-Exemplar median problem, to find a median genome that minimize the DCJ-Indel-Exemplar distance between this genome and the given three genomes. We adapted Lin-Kernighan (LK) heuristic to calculate the median quickly by utilizing the features of adequate sub-graph decomposition and search space reduction technologies. Experimental results on simulated gene order data indicate that our distance estimator can closely estimate the real number of rearrangement events; while compared with the exact solver using equal content genomes, our median solver can get very accurate results as well. More importantly, our median solver can deal with Indels and duplications and generates results very close to the synthetic cumulative number of evolutionary events.
KW - Double-cut and Join (DCJ)
KW - Genome Rearrangement
KW - Lin-Kernighan Heuristic
UR - http://www.scopus.com/inward/record.url?scp=84904740848&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84904740848&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-08783-2_20
DO - 10.1007/978-3-319-08783-2_20
M3 - Conference contribution
AN - SCOPUS:84904740848
SN - 9783319087825
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 227
EP - 238
BT - Computing and Combinatorics - 20th International Conference, COCOON 2014, Proceedings
PB - Springer Verlag
T2 - 20th International Computing and Combinatorics Conference, COCOON 2014
Y2 - 4 August 2014 through 6 August 2014
ER -