An integrated machine learning framework for a discriminative analysis of schizophrenia using multi-biological data

Peng fei Ke, Dong sheng Xiong, Jia hui Li, Zhi lin Pan, Jing Zhou, Shi jia Li, Jie Song, Xiao yi Chen, Gui xiang Li, Jun Chen, Xiao bo Li, Yu ping Ning, Feng chun Wu, Kai Wu

Research output: Contribution to journal › Article › peer-review

24 Scopus citations


Finding effective and objective biomarkers to inform the diagnosis of schizophrenia is of great importance yet remains challenging. Relatively little work has been conducted on multi-biological data for the diagnosis of schizophrenia. In this cross-sectional study, we extracted multiple features from three types of biological data: gut microbiota data, blood data, and electroencephalogram data. An integrated machine learning framework consisting of five classifiers, three feature selection algorithms, and four cross-validation methods was then used to discriminate patients with schizophrenia from healthy controls. Our results show that the support vector machine classifier without feature selection, using multi-biological data as input features, achieved the best performance, with an accuracy of 91.7% and an AUC of 96.5% (p < 0.05). These results indicate that multi-biological data have better discriminative capacity for patients with schizophrenia than single biological data. The top 5% discriminative features selected from the optimal model include the gut microbiota features (Lactobacillus, Haemophilus, and Prevotella), the blood features (superoxide dismutase level, monocyte-lymphocyte ratio, and neutrophil count), and the electroencephalogram features (nodal local efficiency, nodal efficiency, and nodal shortest path length in the temporal and frontal-parietal brain areas). The proposed integrated framework may be helpful for understanding the pathophysiology of schizophrenia and developing biomarkers for schizophrenia using multi-biological data.
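The abstract reports two performance measures, accuracy and AUC. As a minimal sketch of how such measures are commonly computed for a two-class problem (patients versus healthy controls), the example below uses only the Python standard library. The function names, the toy labels and scores, and the 0.5 decision threshold are illustrative assumptions and are not taken from the study.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of subjects whose predicted label matches the true label."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random patient outscores a random control.

    Ties count as one half (the Mann-Whitney U formulation of the AUC).
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (not from the study): 1 = patient, 0 = healthy control.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]   # classifier decision scores
y_pred = [int(s >= 0.5) for s in scores]   # threshold chosen for illustration

acc = accuracy(y_true, y_pred)   # 4 of 6 subjects classified correctly
auc = roc_auc(y_true, scores)    # 8 of 9 patient-control pairs ranked correctly
```

Accuracy depends on the chosen threshold, whereas the AUC depends only on how the scores rank patients relative to controls, which is one reason the two are often reported together.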

Original language: English (US)
Article number: 14636
Journal: Scientific Reports
Issue number: 1
State: Published - Dec 2021

All Science Journal Classification (ASJC) codes

  • General

