Introduction
Methods
Dataset description
Remove missing values
Selecting the study population
Definition of diseases
Model development and validation
Results
Classification model performance
| Variable | Category | Total, n (%) | Non-depression, n (%) | Depression, n (%) | χ² | P |
|---|---|---|---|---|---|---|
| Gender | Male | 2386 (93.7) | 2218 (93.0) | 168 (7.0) | 2.858 | 0.091 |
| | Female | 160 (6.3) | 143 (89.4) | 17 (10.6) | | |
| Age | Youth | 273 (10.7) | 243 (89.0) | 30 (11.0) | 13.903 | < 0.001 |
| | Middle age | 913 (35.9) | 834 (91.3) | 79 (8.7) | | |
| | Old age | 1360 (53.4) | 1284 (94.4) | 76 (5.6) | | |
| Race | Mexican American | 137 (5.4) | 124 (90.5) | 13 (9.5) | 6.805 | 0.078 |
| | Non-Hispanic White | 1480 (58.1) | 1388 (93.8) | 92 (6.2) | | |
| | Non-Hispanic Black | 666 (26.2) | 612 (91.9) | 54 (8.1) | | |
| | Other | 263 (10.3) | 237 (90.1) | 26 (9.9) | | |
| Education | < High school | 328 (12.9) | 296 (90.2) | 32 (9.8) | 5.098 | 0.078 |
| | High school | 623 (24.5) | 573 (92.0) | 50 (8.0) | | |
| | > High school | 1595 (62.6) | 1492 (93.5) | 103 (6.5) | | |
| Marital status | Live together | 1685 (66.2) | 1589 (94.3) | 96 (5.7) | 18.203 | < 0.001 |
| | Single | 861 (33.8) | 772 (89.7) | 89 (10.3) | | |
| Ratio of family income to poverty | < 1.3 | 510 (20.0) | 438 (85.9) | 72 (14.1) | 44.427 | < 0.001 |
| | ≥ 1.3 | 2036 (80.0) | 1923 (94.4) | 113 (5.6) | | |
| BMI (kg/m²) | Underweight | 29 (1.1) | 28 (96.6) | 1 (3.4) | 18.969 | < 0.001 |
| | Normal weight | 579 (22.7) | 538 (92.9) | 41 (7.1) | | |
| | Overweight | 968 (38.0) | 921 (95.1) | 47 (4.9) | | |
| | Obesity | 970 (38.2) | 874 (90.1) | 96 (9.9) | | |
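The χ² and P columns above can be checked directly from the counts. The following is a minimal sketch, assuming the reported statistic is a Pearson chi-square test of independence without continuity correction (the table does not name the exact variant). The function names `pearson_chi2` and `chi2_sf` are illustrative, and only the Python standard library is used.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df.

    Uses the recurrence Q(a + 1, x) = Q(a, x) + x**a * exp(-x) / Gamma(a + 1),
    starting from Q(1/2, x) = erfc(sqrt(x)) or Q(1, x) = exp(-x).
    """
    half = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-half)
    else:
        a, q = 0.5, math.erfc(math.sqrt(half))
    while a < df / 2.0:
        q += half ** a * math.exp(-half) / math.gamma(a + 1.0)
        a += 1.0
    return q


def pearson_chi2(table):
    """Return (statistic, degrees of freedom, p-value) for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df, chi2_sf(stat, df)


# Counts taken from the table: columns are (non-depression, depression).
gender = [[2218, 168], [143, 17]]
race = [[124, 13], [1388, 92], [612, 54], [237, 26]]

for name, table in [("Gender", gender), ("Race", race)]:
    stat, df, p = pearson_chi2(table)
    print(f"{name}: chi2={stat:.3f}, df={df}, p={p:.3f}")
```

Under that assumption, both statistics should land close to the reported values for Gender and Race, which suggests no continuity correction was applied.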
| Index | DL | XGBoost | DT | SVM | KNN | RF |
|---|---|---|---|---|---|---|
| Total | | | | | | |
| AUC | 0.891 | 0.869 | 0.818 | 0.805 | 0.724 | 0.737 |
| Accuracy | 0.830 | 0.913 | 0.786 | 0.691 | 0.879 | 0.875 |
| Recall | 0.754 | 0.963 | 0.782 | 0.980 | 0.932 | 0.963 |
| Specificity | 0.906 | 0.427 | 0.790 | 0.176 | 0.180 | 0.320 |
| Precision | 0.889 | 0.942 | 0.790 | 0.679 | 0.938 | 0.900 |
| F1-score | 0.816 | 0.952 | 0.786 | 0.803 | 0.935 | 0.930 |
| Middle age | | | | | | |
| AUC | 0.929 | 0.879 | 0.834 | 0.835 | 0.868 | 0.833 |
| Accuracy | 0.867 | 0.859 | 0.816 | 0.697 | 0.871 | 0.880 |
| Recall | 0.773 | 0.965 | 0.785 | 0.975 | 0.966 | 0.980 |
| Specificity | 0.962 | 0.364 | 0.852 | 0.169 | 0.314 | 0.359 |
| Precision | 0.953 | 0.877 | 0.856 | 0.691 | 0.892 | 0.888 |
| F1-score | 0.854 | 0.919 | 0.819 | 0.808 | 0.928 | 0.932 |
| Old age | | | | | | |
| AUC | 0.924 | 0.923 | 0.773 | 0.697 | 0.691 | 0.687 |
| Accuracy | 0.860 | 0.917 | 0.753 | 0.702 | 0.831 | 0.887 |
| Recall | 0.759 | 0.985 | 0.802 | 0.961 | 0.952 | 0.961 |
| Specificity | 0.960 | 0.273 | 0.710 | 0.122 | 0.158 | 0.275 |
| Precision | 0.950 | 0.927 | 0.711 | 0.710 | 0.862 | 0.917 |
| F1-score | 0.844 | 0.955 | 0.754 | 0.817 | 0.905 | 0.938 |
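For readers who want to relate the rows of this table to one another, the sketch below uses the standard confusion-matrix definitions of recall, specificity, precision, accuracy, and F1. The confusion-matrix counts are hypothetical (the table does not report them, nor which class is treated as positive); only the precision, recall, and F1 values in `reported` are copied from the Total block above.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard binary classification metrics from confusion-matrix counts."""
    recall = tp / (tp + fn)            # sensitivity, true positive rate
    specificity = tn / (tn + fp)       # true negative rate
    precision = tp / (tp + fp)         # positive predictive value
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return {
        "recall": recall,
        "specificity": specificity,
        "precision": precision,
        "accuracy": accuracy,
        "f1": f1,
    }


def f1_from(precision, recall):
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only.
example = classification_metrics(tp=80, fp=10, tn=90, fn=20)
print({name: round(value, 3) for name, value in example.items()})

# Consistency check on reported values: (precision, recall, reported F1).
reported = {
    "DL": (0.889, 0.754, 0.816),
    "XGBoost": (0.942, 0.963, 0.952),
    "DT": (0.790, 0.782, 0.786),
}
for model, (precision, recall, f1) in reported.items():
    print(model, round(f1_from(precision, recall), 3), f1)
```

A check of this kind only confirms internal consistency of the reported F1-scores; it says nothing about how the models were trained or validated.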
Feature importance
| Group | Rank | Code | Label | Importance |
|---|---|---|---|---|
| Total | 1 | HUQ010 | General health condition | 1.0000000 |
| | 2 | SLQ050 | Ever told doctor had trouble sleeping? | 0.9631659 |
| | 3 | PFQ057 | Experience confusion/memory problems | 0.9480876 |
| | 4 | PFQ049 | Limitations keeping you from working | 0.8343042 |
| | 5 | INDFMPIR | Ratio of family income to poverty | 0.7421787 |
| | 6 | RIDAGEYR | Age in years at screening | 0.7248971 |
| | 7 | LBDNENO | Segmented neutrophils num (1000 cell/uL) | 0.7025002 |
| | 8 | DMDHHSIZ | Total number of people in the Household | 0.7007679 |
| | 9 | MCQ300B | Close relative had asthma? | 0.6674766 |
| | 10 | PFQ054 | Need special equipment to walk | 0.6487374 |
| | 11 | HSQ520 | SP have flu, pneumonia, ear infection? | 0.6469163 |
| | 12 | MCQ160L | Ever told you had any liver condition | 0.6463069 |
| | 13 | BMXBMI | Body Mass Index (kg/m²) | 0.6382723 |
| | 14 | DR2TATOC | Vitamin E as alpha-tocopherol (mg) | 0.6148617 |
| | 15 | MCQ160D | Ever told you had angina/angina pectoris | 0.6122292 |
| | 16 | Hypertension | Ever told you had high blood pressure or systolic blood pressure ≥ 140 mmHg, and/or diastolic blood pressure ≥ 90 mmHg. | 0.6018993 |
| | 17 | DR1TS160 | SFA 16:0 (Hexadecanoic) (gm) | 0.5999041 |
| | 18 | MCQ160F | Ever told you had a stroke | 0.5953889 |
| | 19 | DR2TVC | Vitamin C (mg) | 0.5909083 |
| | 20 | HSQ510 | SP have stomach or intestinal illness? | 0.5805472 |
| Group | Rank | Code | Label | Importance |
|---|---|---|---|---|
| Middle age | 1 | SLQ050 | Ever told doctor had trouble sleeping? | 1.0000000 |
| | 2 | PFQ057 | Experience confusion/memory problems | 0.8306264 |
| | 3 | HUQ010 | General health condition | 0.7768890 |
| | 4 | HSQ510 | SP have stomach or intestinal illness? | 0.7510906 |
| | 5 | MCQ160F | Ever told you had a stroke | 0.6759962 |
| | 6 | KIQ042 | Leak urine during physical activities | 0.6741676 |
| | 7 | PFQ054 | Need special equipment to walk | 0.6403114 |
| | 8 | PFQ049 | Limitations keeping you from working | 0.6373989 |
| | 9 | Hypertension | Ever told you had high blood pressure + systolic blood pressure ≥ 140 mmHg, and/or diastolic blood pressure ≥ 90 mmHg. | 0.6329273 |
| | 10 | MCQ160A | Doctor ever said you had arthritis | 0.6317729 |
| | 11 | DR1TP226 | PFA 22:6 (Docosahexaenoic) (gm) | 0.6258965 |
| | 12 | MCQ160L | Ever told you had any liver condition | 0.6164003 |
| | 13 | DR1TS160 | SFA 16:0 (Hexadecanoic) (gm) | 0.6159058 |
| | 14 | MCQ160E | Ever told you had heart attack | 0.6116369 |
| | 15 | HSQ520 | SP have flu, pneumonia, ear infection? | 0.6103028 |
| Old age | 1 | HUQ010 | General health condition | 1.0000000 |
| | 2 | PFQ054 | Need special equipment to walk | 0.8548500 |
| | 3 | PFQ057 | Experience confusion/memory problems | 0.7185544 |
| | 4 | MCQ160F | Ever told you had a stroke | 0.6913724 |
| | 5 | SLQ050 | Ever told doctor had trouble sleeping? | 0.6728851 |
| | 6 | Hypertension | Ever told you had high blood pressure + systolic blood pressure ≥ 140 mmHg, and/or diastolic blood pressure ≥ 90 mmHg. | 0.6709751 |
| | 7 | MCQ160K | Ever told you had chronic bronchitis | 0.6636364 |
| | 8 | DR2TATOC | Vitamin E as alpha-tocopherol (mg) | 0.6257717 |
| | 9 | DR2TS120 | SFA 12:0 (Dodecanoic) (gm) | 0.6235973 |
| | 10 | KIQ044 | Urinated before reaching the toilet | 0.6177957 |
| | 11 | KIQ042 | Leak urine during physical activities | 0.6157834 |
| | 12 | HSQ520 | SP have flu, pneumonia, ear infection? | 0.6091104 |
| | 13 | HSQ590 | Blood ever tested for HIV virus? | 0.5971844 |
| | 14 | MCQ300A | Close relative had heart attack? | 0.5970445 |
| | 15 | MCQ160A | Doctor ever said you had arthritis | 0.5952202 |
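One way to compare the three rankings is to look at which variables recur across the Total, Middle age, and Old age lists. The sketch below is purely illustrative: the codes are copied from the feature importance tables, while the dictionary layout and variable names are assumptions made for this example and are not part of the original analysis.

```python
# Variable codes in rank order, copied from the feature importance tables.
top_features = {
    "Total": [
        "HUQ010", "SLQ050", "PFQ057", "PFQ049", "INDFMPIR",
        "RIDAGEYR", "LBDNENO", "DMDHHSIZ", "MCQ300B", "PFQ054",
        "HSQ520", "MCQ160L", "BMXBMI", "DR2TATOC", "MCQ160D",
        "Hypertension", "DR1TS160", "MCQ160F", "DR2TVC", "HSQ510",
    ],
    "Middle age": [
        "SLQ050", "PFQ057", "HUQ010", "HSQ510", "MCQ160F",
        "KIQ042", "PFQ054", "PFQ049", "Hypertension", "MCQ160A",
        "DR1TP226", "MCQ160L", "DR1TS160", "MCQ160E", "HSQ520",
    ],
    "Old age": [
        "HUQ010", "PFQ054", "PFQ057", "MCQ160F", "SLQ050",
        "Hypertension", "MCQ160K", "DR2TATOC", "DR2TS120", "KIQ044",
        "KIQ042", "HSQ520", "HSQ590", "MCQ300A", "MCQ160A",
    ],
}

# Variables that appear in every list.
shared = set.intersection(*(set(codes) for codes in top_features.values()))

# Report them in the order they are ranked in the Total list.
shared_ranked = [code for code in top_features["Total"] if code in shared]
print(shared_ranked)

# Variables that appear in exactly one age-specific list and not in Total.
for group in ("Middle age", "Old age"):
    others = set().union(
        *(set(codes) for name, codes in top_features.items() if name != group)
    )
    print(group, [code for code in top_features[group] if code not in others])
```

Because the Total list contains 20 variables and the age-specific lists 15 each, overlap counts are descriptive only and should not be read as a statistical comparison.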