Background
Methods
Description of patients
Anonymity of data
No. | De-identified information |
---|---|
1 | Unique identification information (resident/alien registration number, passport number) |
2 | Names (including Chinese characters, English name, pen name, etc.)* |
3 | Detailed address (detailed address below eup/myeon/dong) |
4 | All phone numbers (including mobile phone/home/company/fax number) |
5 | E-mail addresses |
6 | Medical record number |
7 | Patient registration number |
8 | Health insurance card number, Welfare recipient number |
9 | Accounts number, Credit card number |
10 | Certificate/License number, Student number |
11 | Vehicle number, registration number & serial number of various devices |
12 | Full-face photographs or equivalent (still photo, video, CCTV, video) |
13 | Identification code (member ID, employee number) |
14 | IP (Internet Protocol) address, Mac (Media Access Control) address |
15 | URLs (Universal Resource Locators) |
16 | Biometric identifiers: fingerprint, iris, vein, voice, handwriting, personally identifiable genetic information |
17 | Any other personally identifiable information (pathological number) |
18 | Date of birth** |
19 | Any other unique identifying information (military number, registration number of the individual business operator) |
20 | The indirect identification information contained in the information collection is also deleted in principle if it is not related to the purpose of data use. |
Class | Total codes (N) | Mapped code (N) | Mapping ratio (%) | Remark |
---|---|---|---|---|
Diagnosis | 10728 | 10708 | 99.81 | SNOMED-CT |
Surgery | 1554 | 1544 | 99.36 | SNOMED-CT |
Laboratory test | 705 | 599 | 84.96 | LOINC* |
Image pathology | 247 | 245 | 99.19 | SNOMED-CT |
Medication | 4631 | 4600 | 99.33 | RxNorm |
Blood transfusion | 23 | 23 | 100 | SNOMED-CT |
Procedures and materials | 386 | 382 | 98.96 | SNOMED-CT |
Description | Features (N) | Number of records | Number of patients | ||
---|---|---|---|---|---|
AMC | UUH | AMC | UUH | ||
Demographics | 9 | 572811 | 175663 | 572811 | 175663 |
Demographics of those visiting ER | 20 | 502055 | 171489 | 214393 | 72423 |
Vital signs of those visiting ER | 13 | 1865348 | – | 185447 | – |
Physical measurement | 14 | 46768559 | 5485196 | 511061 | 130361 |
Visits | 23 | 18967703 | 8935764 | 571163 | 172169 |
Diagnosis | 13 | 28328713 | 8089345 | 553031 | 174403 |
Schedule of operation | 12 | 434085 | – | 245159 | – |
Summary of operation | 14 | 3404439 | 88760 | 348939 | 52852 |
Six-minute walk test | 74 | 32158 | 1210 | 8871 | 665 |
Coronary artery CT | 97 | 97585 | – | 79046 | – |
Thallium SPECT | 26 | 198711 | – | 156615 | – |
Echocardiography | 112 | 726187 | 178386 | 428004 | 110626 |
Holter monitoring test | 75 | 66366 | 21035 | 46636 | 15135 |
Pulmonary function test | 135 | 4634091 | 63593 | 265817 | 38933 |
PACS | 12 | 12410683 | 4490786 | 551280 | 169801 |
Pediatric echocardiography | 63 | 4017 | – | 1720 | – |
Cardiac rehabilitation | 80 | 2912 | – | 1990 | – |
Treadmill test | 29 | 110094 | 31741 | 68203 | 25979 |
Laboratory test | 7 | 344908032 | 143847546 | 489278 | 175663 |
Medication | 26 | 129804022 | 57639868 | 500444 | 162750 |
Procedures and materials | 21 | 105739326 | 13201735 | 417407 | 136128 |
Order of blood transfusion | 10 | 1090115 | 219804 | 192169 | 43814 |
Result of blood transfusion | 11 | 2764232 | 625574 | 100215 | 28621 |
Human-derived materials | 13 | 46760 | – | 43412 | – |
Human-derived bonemarrow | 13 | 5757 | – | 2983 | – |
Patient history | 10 | 673143 | – | 307681 | – |
Smoking information | 12 | 608441 | – | 280492 | – |
AMC (N = 572811) | UUH (N = 175663) | Total (N = 748474) | |
---|---|---|---|
Gender ([F,M]) | [257160, 315651] | [79988, 95675] | [337148, 411315] |
Age (Year) | 56.32 \(\pm\) 14.72 | 52.11 \(\pm\) 18.09 | 55.78 \(\pm\) 15.20 |
Systolic blood pressure (mmHg)* | 123.06 \(\pm\) 12.61 | 129.05 \(\pm\) 13.38 | 124.14 \(\pm\) 12.95 |
Diastolic blood pressure (mmHg)* | 74.29 \(\pm\) 7.94 | 75.96 \(\pm\) 9.07 | 74.59 \(\pm\) 8.18 |
BMI (kg/m\(^{2}\))** | 24.11 \(\pm\) 3.50 | 24.04 \(\pm\) 3.55 | 24.100 \(\pm\) 3.513 |
CV/CS Encounter(N) *** | |||
0 | 250160 | 14925 | 265085 |
1 | 68037 | 19489 | 87526 |
2 | 78406 | 19101 | 97507 |
\(\ge\) 3 | 174560 | 118654 | 293214 |
Test (N(%)) | |||
Echocardiography | 428004 (74.71%) | 110626 (62.97%) | 538630 (71.96%) |
Pulmonary function | 265817 (46.40%) | 38933 (22.16%) | 304750 (40.71%) |
Thallium SPECT | 156615 (27.34%) | – | 156615 (20.92%) |
Treadmill | 68203 (11.90%) | 25979 (14.78%) | 94182 (12.58%) |
CT | 79064 (13.80%) | – | 79064 (10.56%) |
Holter monitoring | 46636 (8.14%) | 15135 (8.61%) | 61771 (8.25%) |
Six-minute walk test | 8871 (1.54%) | 665 (0.37%) | 9536 (1.27%) |
Cardiac rehabilitation | 1990 (0.34%) | – | 1990 (0.26%) |
Pediatric echocardiography | 1720 (0.30%) | – | 1720 (0.22%) |
AMC (N = 321003) | UUH (N = 157244) | Total (N = 478247) | |
---|---|---|---|
Age (Year) | 59.85 \(\pm\) 13.21 | 57.28\(\pm\) 15.00 | 58.80 \(\pm\)14.03 |
Outpatients | 2548245 | 1854432 | 4402677 |
Inpatients | 134846 | 71012 | 205858 |
ER | 86429 | – | 86429 |
Diagnosis | AMC | UUH | Total |
---|---|---|---|
(N = 357910) | (N = 87877) | (N = 445787) | |
Hypertension | 200109 (55.91%) | 37886 (43.11%) | 237995 (53.38%) |
Pain in throat and chest | 142567 (39.83%) | 38690 (44.02%) | 181257 (40.66%) |
Diabetes mellitus | 112381 (31.39%) | 30236 (34.40%) | 142617 (31.99%) |
Angina pectoris | 61789 (17.26%) | 9694 (11.03%) | 71483 (16.03%) |
Ischaemic heart disease | 47836 (13.36%) | 5847 (6.65%) | 53683 (12.04%) |
Cerebral infarction | 24752 (6.91%) | 8958 (10.19%) | 33710 (7.56%) |
Heart failure | 15345 (4.28%) | 4825 (5.49%) | 20170 (4.52%) |
Acute myocardial infarction | 10543 (2.94%) | 3853 (4.38%) | 14396 (3.22%) |
Cardiac arrest | 1213 (0.003%) | 1196 (0.013%) | 2409 (0.005%) |
Laboratory test | AMC(%) | UUH(%) | Total(%) |
---|---|---|---|
Creatinine | 83.64 | 91.24 | 85.42 |
Cholesterol | 83.60 | 93.25 | 85.86 |
ALT | 83.53 | 92.21 | 85.57 |
AST | 83.53 | 92.26 | 85.58 |
Bilirubin (total) | 83.03 | 91.29 | 84.97 |
Albumin | 83.02 | 91.34 | 84.97 |
Protein | 83.01 | 91.30 | 84.96 |
Glucose | 83.00 | 90.47 | 84.75 |
ALP | 82.97 | 91.25 | 84.91 |
Hb | 82.88 | 92.51 | 85.14 |
Platelet | 82.88 | 92.48 | 85.13 |
Calcium | 82.77 | 87.37 | 83.84 |
Uric acid | 82.68 | 89.44 | 84.27 |
Potassium | 80.54 | 87.91 | 82.27 |
Sodium | 80.51 | 87.95 | 82.25 |
BUN | 75.36 | 91.08 | 79.05 |
Chloride | 65.94 | 87.42 | 70.98 |
CO2 (total) | 65.89 | 83.48 | 70.02 |
Phosphorus | 62.40 | 87.40 | 68.27 |
Triglyceride | 53.61 | 37.55 | 49.84 |
HDL-Cholesterol | 52.96 | 37.17 | 49.26 |
LDL-Cholesterol | 44.09 | 31.45 | 41.12 |
CRP (quantity) | 42.94 | 67.49 | 48.71 |
ESR | 42.79 | 42.70 | 42.77 |
Hb A1c | 38.66 | 32.00 | 37.09 |
CK | 27.76 | 43.13 | 31.37 |
Troponin-I | 25.48 | 11.62 | 22.23 |
hsCRP | 17.64 | 14.67 | 16.94 |
Description | AMC (N = 428,004) | UUH (N=110,626) |
---|---|---|
LVESD | 30.25 \(\pm\) 6.43 | 30.30 \(\pm\) 5.76 |
LVEDD | 47.77 \(\pm\) 8.74 | 47.20 \(\pm\) 5.57 |
LVPWES | 13.93 \(\pm\) 2.90 | 14.32 \(\pm\) 2.25 |
LVPWED | 9.02 \(\pm\) 1.87 | 9.25 \(\pm\) 1.55 |
LVIVSES | 13.14 \(\pm\) 2.88 | 13.42 \(\pm\) 2.26 |
LVIVSED | 9.09 \(\pm\) 2.01 | 9.52 \(\pm\) 1.74 |
LAd | 37.39 \(\pm\) 8.72 | 36.04 \(\pm\) 6.23 |
LVESV | 35.99 \(\pm\) 18.94 | 36.83 \(\pm\) 23.61 |
LVEDV | 88.97 \(\pm\) 32.39 | 80.82 \(\pm\) 33.07 |
E/A ratio | 0.93 \(\pm\) 0.52 | 0.44 \(\pm\) 0.54 |
E/E ratio | 8.52 \(\pm\) 7.18 | 9.64 \(\pm\) 3.63 |
LVEF | 58.83 \(\pm\) 11.93 | 62.18 \(\pm\) 8.47 |
LV mass | 163.78 \(\pm\) 57.80 | 149.31 \(\pm\) 45.74 |
Details of data
-
Patients who had visited the Departments of Cardiology or Thoracic Surgery.
-
Patients who had visited the ER and were assigned International Classification of Diseases, 10th version (ICD-10) codes related to CVDs.
-
The codes I00-I99 were related to the diseases of the circulatory system, while R00-R03, R06, R068, R073, and R074 were related to symptoms and signs involving the circulatory and respiratory systems.
-
Patients who had undergone coronary artery CT as part of their health screening procedures.
-
Patients who had undergone one of the following clinical examinations: thallium single-photon emission computed tomography (SPECT), 2D-echocardiography, treadmill test, and Holter monitoring test.
Data extracted
-
Demographics: date of birth, sex, national code, address, blood type (ABO, RH), death date, and death date of a cancer patient (one row per patient).
-
Vital signs: measurement time and date, reason for absence of measurement, body temperature, blood pressure, respiratory rate, pulse, oxygen saturation, and consciousness status (one row for each patient seen in the ER).
-
Physical information: age, height, weight, blood pressure, pulse, respiratory rate, body mass index (BMI), body surface area, and measurement date (one row per encounter).
-
Visits: date of visit, date of admission and discharge, type of visit, medical department, hospitalization, duration of stay in the intensive care unit, type of discharge, and the result of treatment (one row per encounter).
-
Diagnosis: date of diagnosis, type of visit, medical department, and ICD-10th code (one row per encounter).
-
Surgery: date of admission and discharge, date of surgery or treatment, sequential number of surgery, surgery type (before/after the surgery), diagnosis (before/after the surgery), surgery category, surgical department, and method and time of anesthesia (one row per encounter).
-
Digital tests: date of visit, age, department of examination, code of examination, date of order, and reports or readings of the result. (one row per test).
-
Laboratory test result: code of pathology examination, number of work, test result, and unit of result (one row per test).
-
Medication: medical department, type of visit, date of prescription, code of prescription, active ingredient in medication, indication, category of medicinal effect, and duration of treatment (one row per encounter).
-
Procedure: medical department, date of order, code of order, time of order, material code, capacity of materials, and place of patient and materials (one row per procedure).
-
Blood transfusion: date of order, code of order, ordering department, sequential number of blood, quantity of prescribed/released blood, and time released (one row per order).
-
Human-derived material: date of extraction, code of diagnosis, name of diagnosis, tissue sample description (status and amount of cancer/normal, plasma/buffy coat, type of organ), and information on bone marrow (status and amount of cerebrospinal fluid/bone marrow/blood stored) (one row per patient).
-
Patient history: marital status, religion, education, exercise habits over the last three months, lifestyle and habits information (e.g., alcohol, smoking), and personal and family medical history (one row per encounter).
Data processing
Structured data
-
Systolic blood pressure, diastolic blood pressure between 0 and 300 mmHg
-
The respiratory rate is between 0 and 100 breaths per minute.
-
The pulse rate is between 0 and 300 beats per minute.
-
The body temperature is between 0 and 50 \(^\circ\)C.
-
To determine the range of the plausible body weight and height, we divided the data into three groups: patients younger than 12 months, younger than 20 years, and older than 20 years. We manually calculated the mean of values and \(\pm\) 3 standard deviations for each group.