当前位置:首页 / 老年缺血性脑卒中患者急性期发生脑卒中后疲劳的影响因素及风险预测模型
论著·临床研究 | 更新时间:2024-07-17
|
老年缺血性脑卒中患者急性期发生脑卒中后疲劳的影响因素及风险预测模型
Influencing factors and risk prediction model of post⁃stroke fatigue in elderly patients with ischemic cerebral stroke in acute stage

广西医学 页码:687-694

作者机构:杨金盘,在读硕士研究生,研究方向为老年常见疾病的防治与护理。

基金信息:广西中医药大学高层次人才培育创新团队建设项目(2022A010);广西中医药大学引进博士科研启动基金项目(2023BS055)

DOI:10.11675/j.issn.0253⁃4304.2024.05.13

  • 中文简介
  • 英文简介
  • 参考文献

目的 分析老年缺血性脑卒中患者急性期发生脑卒中后疲劳(PSF)的影响因素,并基于随机森林模型和Logistic回归模型构建风险预测模型。方法 选取2家医院的老年缺血性脑卒中患者分别作为训练集(304例)、验证集(102例)。收集患者的一般资料、入院时血清学指标检测结果,并采用疲劳严重程度量表(FSS)对患者进行疲劳评估。在训练集中进行单因素分析,将有统计学意义的血清学指标纳入Lasso回归模型筛选预测变量,再分别使用随机森林模型和Logistic回归模型构建老年缺血性脑卒中患者急性期发生PSF的风险预测模型。利用验证集数据,通过受试者工作特征曲线、校准曲线、Hosmer⁃Lemeshow拟合优度检验、决策曲线分析评价两个模型的预测性能。结果 最终筛选出5个具有非零系数的预测变量,分别为C反应蛋白(CRP)、同型半胱氨酸(Hcy)、总胆固醇、空腹血糖、胱抑素C。随机森林模型依据基尼指数减少平均值对预测变量进行重要性排序依次为CRP、Hcy、总胆固醇、空腹血糖、胱抑素C。Logistic回归模型显示,空腹血糖、总胆固醇、CRP、胱抑素C及Hcy水平升高可增加老年缺血性脑卒中患者急性期发生PSF的风险。外部验证结果显示,随机森林模型和Logistic回归模型的曲线下面积、准确度、灵敏度、特异度、阳性预测值、阴性预测值分别为0.905和0.885、0.814和0.755、0.830和0.855、0.800和0.787、0.780和0.826、0.846和0.696。两种模型校准度均良好,在阈值0.08~0.86范围内具有较高的净收益率。结论 空腹血糖、总胆固醇、CRP、胱抑素C、Hcy水平与老年缺血性脑卒中患者急性期PSF的发生密切相关。基于上述血清学指标构建的随机森林模型和Logistic回归模型在预测老年缺血性脑卒中患者急性期发生PSF方面均具有较好的价值,其中随机森林模型的预测效能相对较好,而Logistic回归模型能直观解释变量对PSF发生风险的影响程度,可以作为随机森林模型的补充。

Objective To analyze the influencing factors for post⁃stroke fatigue (PSF) in elderly patients with ischemic cerebral stroke in acute stage, and to establish a risk prediction model based on random forest model and Logistic regression model. Methods Elderly patients with ischemic cerebral stroke in 2 hospitals were selected as training set (304 cases) or validation set (102 cases), respectively. The general data, detection results of serological indices at admission were collected, and the Fatigue Severity Scale (FSS) was used to evaluate patients' fatigue. The univariate analysis in the training set was performed, and serological indices with statistical significance were enrolled into the Lasso regression model for screening prediction variables, and then the risk prediction model of PSF in elderly patients with ischemic cerebral stroke in acute stage was established by using the random forest model and Logistic regression model, respectively. Employing the data from the validation set, the prediction efficiency of the two models was evaluated through receiver operating characteristic curve, calibration curve, Hosmer⁃Lemeshow goodness of fit test, and decision curve analysis. Results A total of 5 prediction variables with non⁃zero coefficient were finally screened out, concerning C⁃reactive protein (CRP), homocysteine (Hcy), total cholesterol, fasting blood glucose, cystatin C, respectively. The random forest model ranked the importance of prediction variables as CRP, Hcy, total cholesterol, fasting blood glucose, and cystatin C according to the decreasing average value of Gini coefficient. The Logistic regression model revealed that the elevations of fasting blood glucose, total cholesterol, CRP, cystatin C, and Hcy levels could increase PSF risk in elderly patients with ischemic cerebral stroke in acute stage. The results of external validation indicated that areas under the curve, accuracy, sensitivities, specificities, positive prediction values, and negative prediction values between the random forest model and Logistic regression model were 0.905 and 0.885, 0.814 and 0.755, 0.830 and 0.855, 0.800 and 0.787, 0.780 and 0.826, and 0.846 and 0.696, respectively. The calibration degree of the two models was good, and the net benefit rate was relatively high in the threshold range of 0.08-0.86. Conclusion Fasting blood glucose, total cholesterol, CRP, cystatin C, and Hcy levels are closely related to PSF occurrence in elderly patients with ischemic cerebral stroke in acute stage. The random forest model and Logistic regression model established based on the aforementioned serological indices for predicting PSF occurrence in elderly patients with ischemic cerebral stroke in acute stage exert favorable values, therein the random forest model exhibits relatively favorable prediction efficiency, whereas the Logistic regression model can directly explain the influence degree of variables on the risk of PSF occurrence, which can be used as a supplement to the random forest model.

419

浏览量

77

下载量

0

CSCD

工具集