An improved ratio-based (IRB) batch effects removal algorithm for cancer data in a co-analysis framework

Shuchu Ham, Hong Qin, Dantong Yu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Ratio-based algorithms are proven to be effective methods for removing batch effects that exist among micro array expression data from different data sources. They are outperforming than other methods in the enhancement of cross-batch prediction, especially for cancer data sets. However, their overall power is limited by: (1) Not every batch has control samples. The original method uses all negative samples to calculate the subtrahend. (2) Micro array experimental data may not have clear labels, especially in the prediction application, the labels of test data set are unknown. In this paper, we propose an Improved Ratio-Based (IRB) method to relieve these two constraints for cross-batch prediction applications. For each batch in a single study, we select one reference sample based on the idea of aligning probability density functions (pdfs) of each gene in different batches. Moreover, for data sets without label information, we transfer the problem of finding reference sample to the dense sub graph problem in graph theory. Our newly-proposed IRB method is straightforward and efficient, and can be extended for integrating large volume micro array data sets. The experiments show that our method is stable and has high performance in tumor/non-tumor prediction.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 14th International Conference on Bioinformatics and Bioengineering, BIBE 2014
EditorsReda Alhajj, Taghi M. Khoshgoftaar, Nikolaos G. Bourbakis, Xingquan Zhu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages212-219
Number of pages8
ISBN (Electronic)9781479975013
DOIs
StatePublished - Feb 5 2014
Externally publishedYes
Event14th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2014 - Boca Raton, United States
Duration: Nov 10 2014Nov 12 2014

Publication series

NameProceedings - IEEE 14th International Conference on Bioinformatics and Bioengineering, BIBE 2014

Other

Other14th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2014
CountryUnited States
CityBoca Raton
Period11/10/1411/12/14

All Science Journal Classification (ASJC) codes

  • Bioengineering
  • Biomedical Engineering

Fingerprint Dive into the research topics of 'An improved ratio-based (IRB) batch effects removal algorithm for cancer data in a co-analysis framework'. Together they form a unique fingerprint.

Cite this