Background
Methods
Data
Social determinants
Disparities in social determinants across various sepsis criteria
Mortality prediction for sepsis patients using machine learning
Statistical analysis for disparities in performances on sub-populations of social determinants
Results
Disparities in social determinants across various sepsis criteria
Mortality prediction for sepsis patients using machine learning
Social determinant | Category | n | % sepsis population | In-hospital mortality (n) | % in-hospital mortality | Training (n) | Testing (n) |
---|---|---|---|---|---|---|---|
Race | Asian | 179 | 3.10 | 26 | 14.53 | 129 | 50 |
Race | Black or African American | 501 | 8.66 | 52 | 10.38 | 348 | 153 |
Race | Hispanic or Latino | 188 | 3.25 | 18 | 9.57 | 132 | 56 |
Race | Other | 714 | 12.35 | 165 | 23.11 | 527 | 187 |
Race | White | 4201 | 72.64 | 575 | 13.69 | 2912 | 1289 |
Sex | Female | 2562 | 44.30 | 384 | 14.99 | 1798 | 764 |
Sex | Male | 3221 | 55.70 | 452 | 14.03 | 2250 | 971 |
Marital status | Separated | 398 | 6.88 | 52 | 13.07 | 287 | 111 |
Marital status | Significant other | 2559 | 44.25 | 363 | 14.19 | 1788 | 771 |
Marital status | Single | 1638 | 28.32 | 174 | 10.62 | 1157 | 481 |
Marital status | Unknown | 332 | 5.74 | 102 | 30.72 | 248 | 84 |
Marital status | Widowed | 856 | 14.80 | 145 | 16.94 | 568 | 288 |
Insurance type | Government | 166 | 2.87 | 13 | 7.83 | 115 | 51 |
Insurance type | Medicaid | 570 | 9.86 | 67 | 11.75 | 395 | 175 |
Insurance type | Medicare | 3358 | 58.07 | 560 | 16.68 | 2335 | 1023 |
Insurance type | Private | 1639 | 28.34 | 185 | 11.29 | 1168 | 471 |
Insurance type | Self-pay | 50 | 0.86 | 11 | 22.00 | 35 | 15 |
Language | English | 5167 | 89.35 | 727 | 14.07 | 3631 | 1536 |
Language | Other | 499 | 8.63 | 94 | 18.84 | 339 | 160 |
Language | Spanish | 117 | 2.02 | 15 | 12.82 | 78 | 39 |
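
The two percentage columns in the table above follow directly from the counts: "% sepsis population" is a category's n divided by the cohort size, and "% in-hospital mortality" is the number of in-hospital deaths divided by that category's n. The following is a minimal sketch that reproduces the Race rows. It assumes the cohort size equals the sum of the Race counts (5783, which also matches the totals of the other four determinants); the names `race_counts`, `cohort_size`, and `percentages` are illustrative and do not come from the source.

```python
# Counts copied from the Race rows of the table: (n, in-hospital deaths).
race_counts = {
    "Asian": (179, 26),
    "Black or African American": (501, 52),
    "Hispanic or Latino": (188, 18),
    "Other": (714, 165),
    "White": (4201, 575),
}

# Assumption: the sepsis cohort is the sum of the category counts.
cohort_size = sum(n for n, _ in race_counts.values())


def percentages(n, deaths, total):
    """Return (% of cohort, % in-hospital mortality), rounded to 2 places."""
    return round(100 * n / total, 2), round(100 * deaths / n, 2)


for category, (n, deaths) in race_counts.items():
    pct_pop, pct_mort = percentages(n, deaths, cohort_size)
    print(f"{category}: {pct_pop:.2f}% of cohort, {pct_mort:.2f}% mortality")
```

The same function applies unchanged to the Sex, Marital status, Insurance type, and Language rows.
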
Model | Accuracy | AUC | Precision | Recall | F1_binary | F1_macro | Specificity |
---|---|---|---|---|---|---|---|
Ridge classifier | 0.6790 | 0.7774 | 0.2682 | 0.7052 | 0.3886 | 0.5855 | 0.6745 |
Perceptron | 0.6720 | 0.7786 | 0.2634 | 0.7052 | 0.3835 | 0.5801 | 0.6664 |
Passive-aggressive | 0.6841 | 0.7582 | 0.2733 | 0.7131 | 0.3951 | 0.5907 | 0.6792 |
kNN | 0.7135 | 0.7299 | 0.2780 | 0.6135 | 0.3826 | 0.5981 | 0.7305 |
Random forest | 0.7516 | 0.6459 | 0.2826 | 0.4661 | 0.3519 | 0.5991 | 0.7999 |
LinearSVC_L1 | 0.6749 | 0.7781 | 0.2654 | 0.7052 | 0.3856 | 0.5823 | 0.6698 |
LinearSVC_L2 | 0.6784 | 0.7777 | 0.2678 | 0.7052 | 0.3882 | 0.5850 | 0.6739 |
SGDClassifier_L1 | 0.6790 | 0.7759 | 0.2682 | 0.7052 | 0.3886 | 0.5855 | 0.6745 |
SGDClassifier_L2 | 0.6790 | 0.7749 | 0.2668 | 0.6972 | 0.3859 | 0.5843 | 0.6759 |
SGDClassifier_EN | 0.6801 | 0.7753 | 0.2683 | 0.7012 | 0.3881 | 0.5858 | 0.6765 |
MultinomialNB | 0.6392 | 0.7040 | 0.2348 | 0.6614 | 0.3466 | 0.5487 | 0.6354 |
BernoulliNB | 0.3107 | 0.5724 | 0.1665 | 0.9402 | 0.2830 | 0.3096 | 0.2042 |
Logistic regression | 0.6824 | 0.7761 | 0.2720 | 0.7131 | 0.3938 | 0.5893 | 0.6772 |
SVC_rbf | 0.6847 | 0.7744 | 0.2702 | 0.6932 | 0.3888 | 0.5882 | 0.6833 |
SVC_poly | 0.6749 | 0.7751 | 0.2654 | 0.7052 | 0.3856 | 0.5823 | 0.6698 |
SVC_sigmoid | 0.6277 | 0.6873 | 0.2349 | 0.6972 | 0.3514 | 0.5451 | 0.6159 |
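
As a consistency check on the table above, the F1_binary column can be recomputed from the Precision and Recall columns, since the binary F1 score is their harmonic mean. The sketch below assumes F1_binary is the F1 score of the positive class (in-hospital death), which the source does not state explicitly; the function name is illustrative. Because the inputs are already rounded to four decimals, the recomputed values can differ from the reported ones in the last digit.

```python
def f1_from_precision_recall(precision, recall):
    """Harmonic mean of precision and recall (binary F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1_binary) copied from three rows of the table.
rows = {
    "Ridge classifier": (0.2682, 0.7052, 0.3886),
    "Random forest": (0.2826, 0.4661, 0.3519),
    "BernoulliNB": (0.1665, 0.9402, 0.2830),
}

for model, (precision, recall, reported) in rows.items():
    computed = f1_from_precision_recall(precision, recall)
    print(f"{model}: computed {computed:.4f}, reported {reported:.4f}")
```

The low precision values relative to recall explain why F1_binary stays below 0.40 for every model even where recall exceeds 0.70.
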
Model | Asian: observed difference | Asian: p_val | Black or African American: observed difference | Black or African American: p_val | Hispanic or Latino: observed difference | Hispanic or Latino: p_val | Other: observed difference | Other: p_val | White: observed difference | White: p_val |
---|---|---|---|---|---|---|---|---|---|---|
Ridge classifier | − 0.2812 | 0.009 | − 0.0241 | 0.366 | − 0.2208 | 0.038 | 0.0011 | 0.528 | 0.0175 | 0.286 |
Perceptron | − 0.2748 | 0.007 | 0.0078 | 0.448 | − 0.2453 | 0.025 | − 0.0026 | 0.502 | 0.0158 | 0.312 |
Passive-aggressive | − 0.3188 | 0.003 | − 0.0075 | 0.464 | − 0.1749 | 0.083 | 0.0159 | 0.381 | 0.0111 | 0.370 |
kNN | − 0.1314 | 0.141 | − 0.0628 | 0.219 | − 0.1865 | 0.069 | − 0.0144 | 0.372 | 0.0214 | 0.245 |
Random forest | − 0.0834 | 0.207 | − 0.0939 | 0.046 | − 0.1226 | 0.112 | 0.0536 | 0.120 | 0.0056 | 0.429 |
LinearSVC_L1 | − 0.2819 | 0.012 | − 0.0172 | 0.410 | − 0.2247 | 0.039 | 0.0003 | 0.481 | 0.0173 | 0.285 |
LinearSVC_L2 | − 0.2815 | 0.009 | − 0.0221 | 0.385 | − 0.2211 | 0.045 | 0.0005 | 0.490 | 0.0175 | 0.294 |
SGDClassifier_L1 | − 0.2872 | 0.008 | − 0.0041 | 0.482 | − 0.2159 | 0.044 | 0.0036 | 0.478 | 0.0184 | 0.266 |
SGDClassifier_L2 | − 0.2900 | 0.008 | − 0.0087 | 0.455 | − 0.2182 | 0.039 | 0.0044 | 0.469 | 0.0191 | 0.263 |
SGDClassifier_EN | − 0.2905 | 0.010 | − 0.0046 | 0.461 | − 0.2186 | 0.058 | 0.0050 | 0.497 | 0.0181 | 0.300 |
MultinomialNB | − 0.2797 | 0.010 | 0.0671 | 0.182 | − 0.2373 | 0.033 | 0.0051 | 0.484 | 0.0051 | 0.416 |
BernoulliNB | − 0.1974 | 0.025 | − 0.0034 | 0.483 | 0.0476 | 0.331 | 0.0173 | 0.368 | 0.0012 | 0.490 |
Logistic regression | − 0.2875 | 0.010 | − 0.0257 | 0.377 | − 0.2061 | 0.054 | 0.0043 | 0.495 | 0.0174 | 0.273 |
SVC_rbf | − 0.3085 | 0.005 | 0.0042 | 0.480 | − 0.2311 | 0.031 | − 0.0176 | 0.383 | 0.0175 | 0.275 |
SVC_poly | − 0.2978 | 0.006 | 0.0027 | 0.483 | − 0.2751 | 0.017 | 0.0087 | 0.431 | 0.0154 | 0.287 |
SVC_sigmoid | − 0.1343 | 0.144 | − 0.0941 | 0.083 | − 0.0606 | 0.332 | − 0.0099 | 0.455 | 0.0208 | 0.247 |
Model | English: observed difference | English: p_val | Other: observed difference | Other: p_val | Spanish: observed difference | Spanish: p_val |
---|---|---|---|---|---|---|
Ridge classifier | 0.0154 | 0.279 | − 0.0760 | 0.107 | − 0.3422 | 0.012 |
Perceptron | 0.0182 | 0.252 | − 0.0916 | 0.053 | − 0.3551 | 0.004 |
Passive-aggressive | 0.0122 | 0.301 | − 0.0555 | 0.172 | − 0.2288 | 0.053 |
kNN | 0.0166 | 0.263 | − 0.0768 | 0.102 | − 0.3063 | 0.017 |
Random forest | 0.0037 | 0.409 | − 0.0057 | 0.489 | − 0.2342 | 0.002 |
LinearSVC_L1 | 0.0160 | 0.299 | − 0.0772 | 0.102 | − 0.3428 | 0.003 |
LinearSVC_L2 | 0.0156 | 0.297 | − 0.0763 | 0.121 | − 0.3424 | 0.007 |
SGDClassifier_L1 | 0.0184 | 0.246 | − 0.0783 | 0.093 | − 0.3347 | 0.008 |
SGDClassifier_L2 | 0.0187 | 0.269 | − 0.0752 | 0.107 | − 0.3396 | 0.004 |
SGDClassifier_EN | 0.0181 | 0.259 | − 0.0760 | 0.105 | − 0.3283 | 0.006 |
MultinomialNB | 0.0221 | 0.224 | − 0.1210 | 0.021 | − 0.2746 | 0.031 |
BernoulliNB | 0.0076 | 0.389 | − 0.0621 | 0.082 | 0.0306 | 0.422 |
Logistic regression | 0.0145 | 0.293 | − 0.0703 | 0.125 | − 0.3173 | 0.014 |
SVC_rbf | 0.0159 | 0.306 | − 0.0825 | 0.080 | − 0.3332 | 0.012 |
SVC_poly | 0.0176 | 0.275 | − 0.0860 | 0.079 | − 0.3633 | 0.002 |
SVC_sigmoid | − 0.0030 | 0.454 | 0.0341 | 0.288 | − 0.1814 | 0.089 |
Model | Asian vs. Black or African American: observed difference | Asian vs. Black or African American: p_val | Asian vs. Hispanic or Latino: observed difference | Asian vs. Hispanic or Latino: p_val | Asian vs. Other: observed difference | Asian vs. Other: p_val | Asian vs. White: observed difference | Asian vs. White: p_val | Black or African American vs. Hispanic or Latino: observed difference | Black or African American vs. Hispanic or Latino: p_val |
---|---|---|---|---|---|---|---|---|---|---|
Ridge classifier | 0.2572 | 0.074 | 0.0605 | 0.738 | 0.2824 | 0.033 | 0.2988 | 0.018 | − 0.1967 | 0.189 |
Perceptron | 0.2827 | 0.042 | 0.0295 | 0.883 | 0.2722 | 0.051 | 0.2906 | 0.021 | − 0.2531 | 0.081 |
Passive-aggressive | 0.3114 | 0.045 | 0.1439 | 0.432 | 0.3348 | 0.018 | 0.3299 | 0.008 | − 0.1674 | 0.238 |
kNN | 0.0686 | 0.647 | − 0.0552 | 0.763 | 0.1170 | 0.380 | 0.1528 | 0.224 | − 0.1238 | 0.413 |
Random forest | − 0.0104 | 0.916 | − 0.0392 | 0.715 | 0.1370 | 0.211 | 0.0890 | 0.372 | − 0.0287 | 0.781 |
LinearSVC_L1 | 0.2647 | 0.075 | 0.0571 | 0.756 | 0.2822 | 0.043 | 0.2991 | 0.020 | − 0.2076 | 0.156 |
LinearSVC_L2 | 0.2594 | 0.084 | 0.0605 | 0.752 | 0.2820 | 0.042 | 0.2990 | 0.019 | − 0.1990 | 0.179 |
SGDClassifier_L1 | 0.2832 | 0.052 | 0.0714 | 0.668 | 0.2908 | 0.036 | 0.3057 | 0.022 | − 0.2118 | 0.136 |
SGDClassifier_L2 | 0.2813 | 0.050 | 0.0718 | 0.692 | 0.2944 | 0.019 | 0.3091 | 0.015 | − 0.2095 | 0.151 |
SGDClassifier_EN | 0.2858 | 0.058 | 0.0718 | 0.706 | 0.2954 | 0.035 | 0.3086 | 0.015 | − 0.2140 | 0.142 |
MultinomialNB | 0.3468 | 0.013 | 0.0424 | 0.800 | 0.2848 | 0.035 | 0.2848 | 0.029 | − 0.3044 | 0.032 |
BernoulliNB | 0.1940 | 0.043 | 0.2450 | 0.068 | 0.2147 | 0.015 | 0.1986 | 0.021 | 0.0510 | 0.609 |
Logistic regression | 0.2617 | 0.082 | 0.0814 | 0.620 | 0.2918 | 0.037 | 0.3049 | 0.019 | − 0.1804 | 0.198 |
SVC_rbf | 0.3127 | 0.025 | 0.0774 | 0.653 | 0.2909 | 0.030 | 0.3259 | 0.013 | − 0.2352 | 0.093 |
SVC_poly | 0.3005 | 0.024 | 0.0227 | 0.889 | 0.3066 | 0.025 | 0.3132 | 0.016 | − 0.2778 | 0.056 |
SVC_sigmoid | 0.0402 | 0.780 | 0.0736 | 0.666 | 0.1244 | 0.375 | 0.1551 | 0.235 | 0.0334 | 0.796 |
Model | Black or African American vs. Other: observed difference | Black or African American vs. Other: p_val | Black or African American vs. White: observed difference | Black or African American vs. White: p_val | Hispanic or Latino vs. Other: observed difference | Hispanic or Latino vs. Other: p_val | Hispanic or Latino vs. White: observed difference | Hispanic or Latino vs. White: p_val | Other vs. White: observed difference | Other vs. White: p_val |
---|---|---|---|---|---|---|---|---|---|---|
Ridge classifier | 0.0252 | 0.781 | 0.0416 | 0.557 | 0.2219 | 0.103 | 0.2383 | 0.055 | 0.0164 | 0.783 |
Perceptron | − 0.0104 | 0.930 | 0.0080 | 0.926 | 0.2427 | 0.076 | 0.2611 | 0.032 | 0.0184 | 0.747 |
Passive-aggressive | 0.0234 | 0.764 | 0.0186 | 0.791 | 0.1908 | 0.167 | 0.1860 | 0.134 | − 0.0048 | 0.931 |
kNN | 0.0484 | 0.564 | 0.0841 | 0.225 | 0.1721 | 0.189 | 0.2079 | 0.081 | 0.0358 | 0.537 |
Random forest | 0.1474 | 0.029 | 0.0994 | 0.089 | 0.1762 | 0.101 | 0.1282 | 0.182 | − 0.0480 | 0.278 |
LinearSVC_L1 | 0.0175 | 0.832 | 0.0344 | 0.629 | 0.2251 | 0.106 | 0.2420 | 0.065 | 0.0170 | 0.764 |
LinearSVC_L2 | 0.0226 | 0.792 | 0.0396 | 0.585 | 0.2216 | 0.088 | 0.2386 | 0.065 | 0.0170 | 0.756 |
SGDClassifier_L1 | 0.0076 | 0.931 | 0.0225 | 0.753 | 0.2194 | 0.108 | 0.2343 | 0.075 | 0.0149 | 0.794 |
SGDClassifier_L2 | 0.0131 | 0.882 | 0.0278 | 0.699 | 0.2226 | 0.080 | 0.2373 | 0.059 | 0.0147 | 0.786 |
SGDClassifier_EN | 0.0096 | 0.932 | 0.0228 | 0.765 | 0.2236 | 0.088 | 0.2368 | 0.070 | 0.0132 | 0.830 |
MultinomialNB | − 0.0620 | 0.491 | − 0.0620 | 0.425 | 0.2423 | 0.073 | 0.2424 | 0.053 | 0.0001 | 1.000 |
BernoulliNB | 0.0207 | 0.702 | 0.0046 | 0.935 | − 0.0303 | 0.764 | − 0.0464 | 0.607 | − 0.0161 | 0.650 |
Logistic regression | 0.0301 | 0.705 | 0.0432 | 0.579 | 0.2104 | 0.130 | 0.2235 | 0.083 | 0.0131 | 0.827 |
SVC_rbf | − 0.0218 | 0.799 | 0.0133 | 0.860 | 0.2135 | 0.110 | 0.2485 | 0.047 | 0.0350 | 0.527 |
SVC_poly | 0.0060 | 0.930 | 0.0127 | 0.848 | 0.2838 | 0.027 | 0.2905 | 0.019 | 0.0066 | 0.904 |
SVC_sigmoid | 0.0841 | 0.286 | 0.1149 | 0.110 | 0.0507 | 0.727 | 0.0814 | 0.544 | 0.0307 | 0.584 |
Model | English vs. Other: observed difference | English vs. Other: p_val | English vs. Spanish: observed difference | English vs. Spanish: p_val | Other vs. Spanish: observed difference | Other vs. Spanish: p_val |
---|---|---|---|---|---|---|
Ridge classifier | − 0.0915 | 0.135 | − 0.3576 | 0.008 | − 0.2661 | 0.081 |
Perceptron | − 0.1098 | 0.070 | − 0.3733 | 0.005 | − 0.2635 | 0.095 |
Passive-aggressive | − 0.0677 | 0.266 | − 0.2410 | 0.087 | − 0.1733 | 0.281 |
kNN | − 0.0934 | 0.159 | − 0.3230 | 0.021 | − 0.2295 | 0.121 |
Random forest | − 0.0094 | 0.833 | − 0.2379 | 0.023 | − 0.2285 | 0.064 |
LinearSVC_L1 | − 0.0931 | 0.132 | − 0.3587 | 0.007 | − 0.2656 | 0.097 |
LinearSVC_L2 | − 0.0919 | 0.135 | − 0.3580 | 0.008 | − 0.2661 | 0.095 |
SGDClassifier_L1 | − 0.0967 | 0.113 | − 0.3531 | 0.015 | − 0.2564 | 0.097 |
SGDClassifier_L2 | − 0.0939 | 0.143 | − 0.3583 | 0.009 | − 0.2643 | 0.078 |
SGDClassifier_EN | − 0.0940 | 0.136 | − 0.3463 | 0.009 | − 0.2523 | 0.093 |
MultinomialNB | − 0.1432 | 0.017 | − 0.2967 | 0.034 | − 0.1535 | 0.295 |
BernoulliNB | − 0.0697 | 0.091 | 0.0230 | 0.818 | 0.0927 | 0.397 |
Logistic regression | − 0.0849 | 0.174 | − 0.3318 | 0.025 | − 0.2469 | 0.093 |
SVC_rbf | − 0.0984 | 0.112 | − 0.3491 | 0.013 | − 0.2507 | 0.104 |
SVC_poly | − 0.1035 | 0.100 | − 0.3809 | 0.009 | − 0.2773 | 0.072 |
SVC_sigmoid | 0.0372 | 0.544 | − 0.1784 | 0.203 | − 0.2155 | 0.151 |
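
The tables above pair each observed difference between groups with a p-value. This excerpt does not state which test produced those p-values or which performance metric the differences are measured in. One common way to obtain such pairs is a permutation test, sketched below purely as an illustration: it is an assumption, not the authors' confirmed procedure. The per-patient scores (1 for a correct prediction, 0 otherwise), the group sizes, and the function name `permutation_test` are all hypothetical.

```python
import random


def permutation_test(scores_a, scores_b, n_permutations=2000, seed=0):
    """Two-sided permutation test on the difference in group means."""
    rng = random.Random(seed)

    def mean(xs):
        return sum(xs) / len(xs)

    observed = mean(scores_a) - mean(scores_b)
    pooled = list(scores_a) + list(scores_b)
    n_a = len(scores_a)

    extreme = 0
    for _ in range(n_permutations):
        # Reassign group labels at random while keeping group sizes fixed.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly above zero.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Hypothetical per-patient correctness for two sub-populations (synthetic).
group_a = [1] * 30 + [0] * 20  # 60% correct
group_b = [1] * 45 + [0] * 5   # 90% correct

observed, p_value = permutation_test(group_a, group_b)
print(f"observed difference = {observed:.4f}, p_val = {p_value:.3f}")
```

With many models and many group pairs tested at once, as in the tables above, some p-values below 0.05 are expected by chance alone, so a multiple-comparison correction is worth considering when interpreting individual cells.
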