Association Journal of CSIAM
Supervised by Ministry of Education of PRC
Sponsored by Xi'an Jiaotong University
ISSN 1005-3085  CN 61-1269/O1

Chinese Journal of Engineering Mathematics ›› 2021, Vol. 38 ›› Issue (5): 601-609.doi: 10.3969/j.issn.1005-3085.2021.05.001

    Next Articles

Gene Data Mining and Bioinformatics Analysis of Tuberculosis Based on Three Bayesian Methods

ZHANG Xu,   WU Peiwang,   QIAO Feng   

  1. School of Mathematics and Statistics, Southwest University, Chongqing 400715
  • Online:2021-10-15 Published:2021-12-15
  • Contact: P. Wu. E-mail address: 664249360@qq.com
  • Supported by:
    The National Natural Science Foundation of China (11701471); the Basic Science and Frontier Technology Research Project of Chongqing (cstc2017jcyjAX0476).

Abstract:

Identification of susceptible host genes plays a key role in the treatment and prevention of tuberculosis. So far, only a few genes have been confirmed to be related to it. Based on the gene chip dataset GSE54992 of peripheral blood mononuclear cells of tuberculosis patients, this paper selects the differentially expressed genes between normal samples and active tuberculosis samples by combing information prior Bayesian test and linear model empirical Bayesian method. 319 differentially expressed genes are recognized by both methods. These genes are modeled by using naive Bayes classifier based on the independent verification set GSE83456 where a high classification accuracy is obtained. Finally, the molecular mechanism of tuberculosis is analyzed from the biological point of view through GO function enrichment and KEGG pathway analysis. This study highlights the important roles of the three Bayesian methods in gene data analysis. It provides a new comprehensive strategy for exploring specific biomarkers of tuberculosis, and shows the important clues for the prevention, diagnosis and treatment of tuberculosis.

Key words: tuberculosis, information prior Bayesian test, linear model and empirical Bayes, naive Bayes classifier

CLC Number: