XIE Feibiao,WANG Jing,XIANG Xinghua,et al.Identification of Kidney-Yang Deficiency Syndrome in Osteoporosis Patients Based on Rule Ensemble Method of Bagging Combining LASSO Regression[J].Chinese Journal of Experimental Traditional Medical Formulae,2023,29(23):150-157.
XIE Feibiao,WANG Jing,XIANG Xinghua,et al.Identification of Kidney-Yang Deficiency Syndrome in Osteoporosis Patients Based on Rule Ensemble Method of Bagging Combining LASSO Regression[J].Chinese Journal of Experimental Traditional Medical Formulae,2023,29(23):150-157. DOI: 10.13422/j.cnki.syfjx.20230449.
Identification of Kidney-Yang Deficiency Syndrome in Osteoporosis Patients Based on Rule Ensemble Method of Bagging Combining LASSO Regression增强出版
To investigate the identification of kidney Yang deficiency syndrome of patients with osteoporosis(OP), and to form the clinical syndrome identification rules of traditional Chinese medicine(TCM).
Method
2
Basic information, etiology, clinical symptoms and other characteristics of 982 OP patients were included, and statistical tests were used to screen the variables associated with kidney Yang deficiency syndrome. Taking the decision tree as the base model, bootstrap aggregation algorithm(Bagging algorithm) was utilized to establish the classification model of kidney Yang deficiency syndrome in OP, generating numerous rules and removing redundancy. Combining least absolute shrinkage and selection operator(LASSO) regression to screen key rules and integrate them to construct an identification model, achieving the identification of kidney Yang deficiency syndrome in OP patients.
Result
2
Eighteen key identification rules were screened out, and of these, where 11 rules with regression coefficients>0 correlated positively with the kidney Yang deficiency syndrome, the rule with the highest coefficient was chilliness(present)&feverish sensation over the palm and sole(absent). The other 7 rules with regression coefficients<0 correlated negatively with the syndrome, the rule with the lowest coefficient was reddish tongue(present)&diarrhea(absent)&deficiency of endowment(absent). According to the regression coefficients of each key rule, variables with importance>0.2 were ranked as chilliness, reddish tongue, feverish sensation over the palm and sole, cold limbs, clear urine, diarrhea, deficiency of endowment, prolonged illness. The results of the partial dependence analysis of the identification model showed that compared to OP patients without chilliness, those with chilliness(present) had a 0.266 8 higher probability of being identified as having kidney Yang deficiency syndrome, indicating that this variable had the highest impact on identification of the syndrome. Similarly, compared to OP patients without reddish tongue, those with reddish tongue had a 0.141 9 lower probability of being identified as having kidney Yang deficiency syndrome, indicating that this variable had the highest impact on identifying non-kidney Yang deficiency syndrome. The accuracy, sensitivity, specificity and area under receiver operating characteristic curve(AUC) of the established kidney Yang deficiency syndrome identification model in the test set were 0.865 9, 0.853 7, 0.872 0 and 0.931 5, respectively.
Conclusion
2
A precise identification model of OP kidney Yang deficiency syndrome is conducted basing on the rule ensemble method of Bagging combining LASSO regression, and the screened key rules can explain the identification process of kidney Yang deficiency syndrome. In this research, according to the regression coefficients of rules, the importance and partial dependence of variables, combined with the thinking of TCM, the influence of patient characteristics on the identification of syndromes is described, so as to reveal the primary and secondary syndromes of identification and assist the clinical identification of kidney Yang deficiency syndrome.
关键词
骨质疏松症肾阳虚证规则集成预测方法辨识模型变量重要性偏依赖性
Keywords
osteoporosiskidney Yang deficiency syndromerule ensembleprediction methodidentification modelvariable importancepartial dependence
references
BIJLSMA A Y,MESKERS C G M,WESTENDORP R G J,et al.Chronology of age-related disease definitions:Osteoporosis and sarcopenia[J].Ageing Res Rev,2012,11(2):320-324.
CHENG X G,ZHAO K P,ZHA X J,et al.Opportunistic screening using low-dose CT and the prevalence of osteoporosis in China:A nationwide,multicenter study[J].J Bone Miner Res,2021,36(3):427-435.
YANG W,YI D H,XIE Y M,et al.Statistical identification of syndromes feature and structure of disease of western medicine based on general latent structure model[J].Chin J Integr Med,2012,18(11):850-861.
DENG H.Interpreting tree ensembles with inTrees[J].Int J Data Sci Anal,2019,7(4):277-287.
FOKKEMA M.Fitting prediction rule ensembles with R package pre[J].J Stat Softw,2020,92(12):1-30.
WANG J,SHU B,LI C G,et al.Polymorphisms of genes related to vitamin D metabolism and transportation and its relationship with the risk of osteoporosis:Protocol for a multicentre prospective cohort study in China[J].BMJ Open,2019,9(11):e028084.