Advanced Search

    YAN Jianjun, LIU Zhangpeng, LIU Guoping, GUO Rui, WANG Yiqin, FU Jingjing, QIAN Peng. Syndrome Classification of Chronic Gastritis Based on Multi-grained Cascade Forest[J]. Journal of East China University of Science and Technology, 2019, 45(4): 593-599. DOI: 10.14135/j.cnki.1006-3080.20180410001
    Citation: YAN Jianjun, LIU Zhangpeng, LIU Guoping, GUO Rui, WANG Yiqin, FU Jingjing, QIAN Peng. Syndrome Classification of Chronic Gastritis Based on Multi-grained Cascade Forest[J]. Journal of East China University of Science and Technology, 2019, 45(4): 593-599. DOI: 10.14135/j.cnki.1006-3080.20180410001

    Syndrome Classification of Chronic Gastritis Based on Multi-grained Cascade Forest

    • The standardization and objectification of traditional Chinese medicine (TCM) inquiry has been becoming hot issues in machine learning fields. However, TCM inquiry data has complex relation between the symptoms and syndromes as well as among symptoms such that most of machine learning algorithms cannot effectively deal with the complexity and non-linearity of TCM inquiry data. In this paper, we propose a model of syndrome classification of chronic gastritis (CG) with multi-grained cascade forest (gcForest). TCM inquiry is a typical multi-label learning problem, that is, a patient may have two or more syndromes at the same time. Firstly, we convert the multi-label problem into binary classification via transformation method. And then, the classification model is made for each syndrome via gcForest algorithm. The gcForest is a novel decision tree ensemble method based on deep learning and is composed of two independent parts, cascade forest and multi-grained scanning. The proposed algorithm is compared with two deep learning algorithms, Deep Belief Nets (DBN) and Deep Boltzmann Machine (DBM), and other five multi-label algorithms, ML-KNN, BSVM, ECC, RankSVM, and LIFT. It is shown from the experiment results that the proposes model can outperform these algorithms based on multi-label metrics and classification accuracy of each syndrome overall. The general accuracy reaches up to 0.834, and the classification precision of 6 syndromes is 0.906, 0.818, 0.764, 0.966, 0.840, 0.912, respectively. Besides, we also analyze the effect of hyper-parameter on model performance, whose results verify its robustness. The gcForest exhibits hierarchical and abstract traits during the data process that is consistent with TCM syndrome diagnosis. Therefore, gcForest can effectively solve the TCM inquiry syndrome classification of CG and provide the reference for the research of CG quantitative diagnosis.
    • loading

    Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return