Introduction
-
propose a general processing pipeline to identify robust radiomics features for muscle tissue in large MRI cohorts comprised of 10672 eligible subjects,
-
show that a diverse set of selected features can be ranked and thereby ordered based on their relative importance,
-
analyze important features for sarcopenia by means of their statistical differences between subgroups based on available auxiliary information such as age and sex.
Methods
Medical data
Radiomics pipeline
Feature generation
glcm
), gray level run length matrix (glrlm
), gray level size zone matrix (glszm
), neighboring gray tone difference matrix (ngtdm
), gray level dependence matrix (gldm
) were created. Different from studies as in [11], we also include common shape features (e.g. volume, area, sphericity, compactness, and elongation) alongside the texture features due to the availability of generated muscle masks. Border regions that may contain unwanted texture variations due to partial volume effects resulting in voxel bleeding or information from neighboring content, are excluded by a morphological erosion filter. Images were normalized prior to the feature extraction and bilinear resampled to a consistent resolution of \(3\textrm{mm} \times 3\textrm{mm} \times 3\textrm{mm}\). Besides original features, variations of the image content by application of Laplacian of Gaussian (LoG) and Wavelet transformations were included, resulting in an overall number of 4060 available texture features per muscle group per side for each individual subject. In 10 subjects features couldn’t be generated due to an underlying corruption of the respective raw data. Leaving 10672 subjects for the subsequent feature selection process.Feature selection
Statistical evaluation
Results
Feature selection
contrast | muscle | age | BMI | FF |
---|---|---|---|---|
water | all | 34.6 [32, 37] (45) | 31.0 [29, 36] (40) | 13.8 [12, 16] (18) |
gluteus | 37.6 [35, 39] (49) | 37.8 [37, 39] (50) | 24.2 [21, 27] (38) | |
psoas | 26.4 [25, 28] (35) | 24.8 [22, 26] (34) | 21.2 [19, 25] (34) | |
extensors | 33.6 [29, 35] (48) | 28.6 [26, 33] (40) | 31.6 [28, 37] (48) | |
adductors | 39.6 [33, 44] (61) | 23.0 [20, 28] (48) | 33.8 [28, 42] (61) | |
fat | all | 44.2 [41, 47] (58) | 30.6 [21, 40] (52) | 25.2 [21, 28] (36) |
gluteus | 45.4 [42, 48] (60) | 45.6 [41, 49] (58) | 39.4 [35, 43] (56) | |
psoas | 32.0 [31, 33] (39) | 31.6 [30, 33] (39) | 22.6 [19, 25] (27) | |
extensors | 31.4 [29, 33] (51) | 30.4 [25, 34] (52) | 42.6 [38, 51] (70) | |
adductors | 34.6 [32, 37] (60) | 37.0 [30, 41] (64) | 36.0 [29, 41] (57) |
Feature importance ranking
shape
, gldm
, glrlm
and their wavelet and log-sigma variants scoring low overall ranks. For example, for the second water contrast features, we see a shape feature, namely the original_shape_elongation
which is highly dependent on the muscle shape and can thus vary greatly for different muscles. Considering all muscles, it scores nonetheless with a low rank. For the fat contrast the ranking becomes noisier indicated by larger bounding boxes and in some cases strong differences between the mean and median rank as seen e.g. for original_ngtdm_strength and original_firstorder_energy
.original_gldm_largedependencehighgraylevelemphasis
for all investigated muscles across a coronal and two axial views in Fig. 6. We note, that in contrast to this visualization, the processed texture features used for the ranking are based on mean values, which are aggregated across respective masked regions for each muscle.original_gldm_largedependencehighgraylevelemphasis
in one subject for a a coronal view in all muscle groups, b axial views in gluteus and psoas, and c axial views through adductors and extensorsgldm_largedependencehighgraylevelemphasis
and glszm_largeareahighgraylevelemphasis
scored low on all four muscle groups, indicating that these features are important to identify at least one of the auxiliary targets (age, BMI or the FF). The shape elongation is less important for each individual muscle, as it does not aid in differentiating characteristics within the same muscle group compared to the presence of multiple muscles (thereby indirectly allowing for a better estimation of the FF). Again, we see the original as well as LoG and wavelet features across varying different subgroups. The gluteus, psoas and the extensors showed tight boxplots with close mean and median values leading to robust rankings. The ranking for the adductors showed two good performing features for the lowest ranks and higher variability for subsequent features. We note, that this muscle group was partially cut off in some cases (due to the FOV placement of the imaging data), which may lead to lower prediction accuracy and thereby higher mask coverage errors.gldm_largedependencehighgraylevelemphasis
feature remains the most important feature. For BMI a wavelet variant of the firstorder_minimum
and for FF a wavelet variant of the glrlm_graylevelnonuniformitynormalized
achieved the lowest rank.Subgroup differences
original_gldm_largedependencehighgraylevelemphasis
for individual targets age (left column), body mass index (BMI) (middle column) and fat fraction (FF) (right column) separated by sex (top row: male, bottom row: female)wavelet-lhl_gldm_largedependencehighgraylevelemphasis
for individual targets age (left column), body mass index (BMI) (middle column) and fat fraction (FF) (right column) separated by sex (top row: male, bottom row: female)original_gldm_largedependencehighgraylevelemphasis
for age (left column), wavelet-lll_firstorder_minimum for body mass index (BMI) (middle column) and wavelet-llh_glrlm_graylevelnonuniformitynormalized
for fat fraction (FF) (right column) in male (top row) and female (bottom row) subjectscontrast | target | p-value | |||||
---|---|---|---|---|---|---|---|
age | BMI | FF | |||||
male | female | male | female | male | female | ||
water | all | \(< 0.001\) | 0.034 | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) |
individual | \(< 0.001\) | 0.034 | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) | |
fat | all | \(< 0.001\) | 0.034 | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) |
individual | \(< 0.001\) | 0.034 | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) | \(< 0.001\) |
gldm_largedependencehighgraylevelemphasis
across all muscles and all targets on the water contrast for male and female subgroups. The main body of the boxes indicates the first (lower bound) and third quartile (upper bound). Values of neighboring subgroups can overlap substantially. However, the mean (green triangles) and median (red lines) show monotonic trends of its values. These trends are present for male and female subjects as well as for different targets. The Kruskal-Wallis p-value for the original_gldm_largedependencehighgraylevelemphasis
feature consistently showed a p-value < 0.001 indicating statistically significant differences between subgroups for all available auxiliary targets. In addition, all of the lowest ranked ten features of the water contrast identified in the selection pipeline including the final ranking, exceeded a p-value of 0.005 for males (see Table 2). For females, only one of the ten features (original_shape_elongation
) did not show significant changes between age subgroups with a maximum p-value of 0.034 (being above the corrected threshold of 0.005). Note that all other nine texture features remained below the significance threshold of 0.005 for females. For the Mann-Whitney-U test, most subgroup comparisons showed a p-value < 0.001, showing significant differences between male and female texture feature values for features identified for the water contrast and all targets. Prominently, there are exceptions, where the p-value exceeds the significance thresholds. These outliers are especially for the first and last subgroup bins (where subjects in the long tails of the distributions are aggregated) with highest p-values of 0.059 for age, 0.465 for BMI and 0.003 for the FF. Respective features (indicated by the their rank from 1 to 10) and the number of the histogram bins associated with the highest p-value are reported in Table 3.contrast | target | age | BMI | FF | ||||||
---|---|---|---|---|---|---|---|---|---|---|
p-value | feature | bin | p-value | feature | bin | p-value | feature | bin | ||
water | all | 0.059 | 8 | 1 | 0.465 | 3 | 1 | 0.003 | 1 | 6 |
individual | 0.059 | 3 | 1 | 0.465 | 1 | 1 | 0.003 | 2 | 6 | |
fat | all | 0.380 | 6 | 6 | 0.498 | 8 | 1 | 0.440 | 4 | 6 |
individual | 0.380 | 10 | 6 | 0.470 | 4 | 6 | 0.440 | 2 | 6 |
gldm_largedependencehighgraylevelemphasis
feature on the fat contrast. It follows similar trends as the first ranked variant of the water contrast, yet the decrease for all targets across bins is less linear. Notably, there is a large difference between median and mean values for males indicating a distribution skewed to higher values. This is also present, albeit in a less prominent fashion, for females. Nonetheless, clear differences between each subgroup can be seen again with a Kruskal-Wallis p-value < 0.001 for the shown feature values for males and females alike. As illustrated in Table 2, this held also true for the ten lowest ranked texture features of the fat contrast for the BMI and FF target. For age, the p-value is again 0.034 since the shape_elongation feature is contrast independent and was thereby also present in the ranking for the fat contrast. p-values of the lowest ten ranked features in the Mann-Whitney U test between male and female subgroups are higher compared to the water contrast, with values of 0.380 for age, 0.498 for BMI and 0.440 for FF exceeding the significance thresholds of 0.008, 0.007, and 0.008 respectively.gldm_largedependencehighgraylevelemphasis
, for the BMI we depict the wavelet-lll variant of the firstorder_minimum
and for the fat fraction we show the wavelet-llh variant of the glrlm_graylevelnonuniformitynormalized
. In all cases, the features show similar trends independent of their vastly different range of values. For BMI, the means decreases for male and female subjects alike. All ten lowest ranked texture features remained below a p-value of 0.001 for the Kruskal-Wallis test. For BMI, the significant p-value of 0.007 for the Mann-Whitney U test was exceeded for the first subgroup (\(p= 0.465\)). The other subgroups all showed significant differences between male and female subjects. Highest p-values are again noted in Table 2 and 3 for both contrasts.largeareahighgraylevelemphasis
were present (original for the psoas, wavelet variants for the gluteus and the extensors) indicating the robustness and sensitivity of this feature with respect to the auxiliary targets. In addition, first-order features such as energy and the 10th percentile were able to distinguish well between subgroups of the BMI target.Prediction performance on subgroups
contrast | muscle | train | train 10 | val | val 10 |
---|---|---|---|---|---|
age | |||||
water | all | \(80.30\pm 0.33\%\) | \(78.53\pm 0.14\%\) | \(73.03\pm 0.70\%\) | \(72.24\pm 0.55\%\) |
gluteus | \(91.41\pm 0.34\%\) | \(88.37\pm 0.21\%\) | \(76.43\pm 0.50\%\) | \(74.90\pm 0.81\%\) | |
psoas | \(91.46\pm 0.09\%\) | \(89.29\pm 0.28\%\) | \(78.54\pm 0.67\%\) | \(77.60\pm 0.53\%\) | |
extensors | \(91.10\pm 0.28\%\) | \(89.02\pm 0.10\%\) | \(76.94\pm 0.73\%\) | \(76.34\pm 0.65\%\) | |
adductors | \(91.44\pm 0.36\%\) | \(88.58\pm 0.28\%\) | \(76.09\pm 0.94\%\) | \(74.93\pm 0.88\%\) | |
fat | all | \(80.45\pm 0.20\%\) | \(78.06\pm 0.17\%\) | \(73.63\pm 0.70\%\) | \(72.32\pm 0.74\%\) |
gluteus | \(91.50\pm 0.36\%\) | \(88.84\pm 0.27\%\) | \(76.70\pm 0.58\%\) | \(75.79\pm 0.62\%\) | |
psoas | \(91.45\pm 0.16\%\) | \(89.57\pm 0.16\%\) | \(79.79\pm 0.47\%\) | \(79.37\pm 0.45\%\) | |
extensors | \(91.04\pm 0.10\%\) | \(89.73\pm 0.16\%\) | \(78.49\pm 0.63\%\) | \(78.17\pm 0.82\%\) | |
adductors | \(91.17\pm 0.16\%\) | \(87.79\pm 0.22\%\) | \(76.47\pm 0.98\%\) | \(74.82\pm 0.78\%\) | |
BMI | |||||
water | all | \(86.80\pm 0.21\%\) | \(85.05\pm 0.11\%\) | \(80.68\pm 0.30\%\) | \(79.87\pm 0.09\%\) |
gluteus | \(94.54\pm 0.21\%\) | \(92.72\pm 0.14\%\) | \(84.63\pm 0.46\%\) | \(93.92\pm 0.31\%\) | |
psoas | \(92.47\pm 0.09\%\) | \(91.17\pm 0.21\%\) | \(79.14\pm 0.66\%\) | \(78.99\pm 0.73\%\) | |
extensors | \(94.78\pm 0.26\%\) | \(93.00\pm 0.06\%\) | \(82.26\pm 0.43\%\) | \(81.39\pm 0.41\%\) | |
adductors | \(95.91\pm 0.18\%\) | \(95.10\pm 0.22\%\) | \(88.31\pm 0.28\%\) | \(88.37\pm 0.18\%\) | |
fat | all | \(91.89\pm 0.46\%\) | \(90.02\pm 2.02\%\) | \(88.03\pm 0.89\%\) | \(86.42\pm 2.32\%\) |
gluteus | \(96.78\pm 0.05\%\) | \(95.97\pm 0.03\%\) | \(90.13\pm 0.37\%\) | \(90.48\pm 0.31\%\) | |
psoas | \(96.49\pm 0.05\%\) | \(95.83\pm 0.06\%\) | \(88.84\pm 0.28\%\) | \(89.93\pm 0.18\%\) | |
extensors | \(97.12\pm 0.08\%\) | \(96.37\pm 0.07\%\) | \(91.00\pm 0.50\%\) | \(91.34\pm 0.35\%\) | |
adductors | \(96.66\pm 0.38\%\) | \(94.97\pm 0.29\%\) | \(89.38\pm 1.05\%\) | \(88.37\pm 0.59\%\) | |
FF | |||||
water | all | \(98.91\pm 0.03\%\) | \(98.90\pm 0.00\%\) | \(98.36\pm 0.03\%\) | \(98.36\pm 0.04\%\) |
gluteus | \(99.07\pm 0.07\%\) | \(98.61\pm 0.48\%\) | \(97.12\pm 0.33\%\) | \(96.80\pm 0.95\%\) | |
psoas | \(98.18\pm 0.05\%\) | \(97.85\pm 0.03\%\) | \(95.83\pm 0.14\%\) | \(95.66\pm 0.13\%\) | |
extensors | \(99.99\pm 0.00\%\) | \(99.98\pm 0.00\%\) | \(99.96\pm 0.00\%\) | \(99.96\pm 0.00\%\) | |
adductors | \(99.65\pm 0.04\%\) | \(99.47\pm 0.02\%\) | \(98.86\pm 0.05\%\) | \(98.77\pm 0.08\%\) | |
fat | all | \(99.13\pm 0.02\%\) | \(98.73\pm 0.03\%\) | \(98.52\pm 0.09\%\) | \(98.11\pm 0.05\%\) |
gluteus | \(99.74\pm 0.02\%\) | \(99.35\pm 0.02\%\) | \(98.40\pm 0.05\%\) | \(97.95\pm 0.06\%\) | |
psoas | \(99.18\pm 0.02\%\) | \(99.06\pm 0.04\%\) | \(97.62\pm 0.07\%\) | \(97.70\pm 0.06\%\) | |
extensors | \(99.99\pm 0.00\%\) | \(99.98\pm 0.00\%\) | \(99.97\pm 0.00\%\) | \(99.96\pm 0.00\%\) | |
adductors | \(99.62\pm 0.04\%\) | \(99.38\pm 0.01\%\) | \(98.82\pm 0.08\%\) | \(98.56\pm 0.11\%\) |
contrast | muscle | train | train 10 | val | val 10 |
---|---|---|---|---|---|
age | |||||
water | all | \(46.69\pm 0.83\%\) | \(42.71\pm 0.43\%\) | \(30.61\pm 1.05\%\) | \(29.11\pm 1.01\%\) |
gluteus | \(68.99\pm 1.09\%\) | \(62.34\pm 1.17\%\) | \(35.01\pm 079\%\) | \(32.78\pm 1.09\%\) | |
psoas | \(68.28\pm 0.17\%\) | \(62.85\pm 0.44\%\) | \(38.59\pm 1.43\%\) | \(37.01\pm 1.47\%\) | |
extensors | \(68.40\pm 0.49\%\) | \(63.20\pm 0.43\%\) | \(36.01\pm 0.88\%\) | \(34.90\pm 0.97\%\) | |
adductors | \(68.33\pm 0.79\%\) | \(62.03\pm 0.72\%\) | \(35.08\pm 1.32\%\) | \(34.04\pm 1.01\%\) | |
fat | all | \(45.49\pm 0.67\%\) | \(39.94\pm 0.68\%\) | \(30.30\pm 0.81\%\) | \(27.73\pm 0.96\%\) |
gluteus | \(68.37\pm 0.56\%\) | \(61.69\pm 0.45\%\) | \(34.71\pm 1.00\%\) | \(34.12\pm 0.93\%\) | |
psoas | \(67.09\pm 0.63\%\) | \(61.58\pm 0.72\%\) | \(39.40\pm 1.33\%\) | \(38.30\pm 1.48\%\) | |
extensors | \(66.71\pm 0.19\%\) | \(63.43\pm 0.19\%\) | \(37.28\pm 1.04\%\) | \(37.17\pm 1.30\%\) | |
adductors | \(67.14\pm 0.63\%\) | \(59.29\pm 0.78\%\) | \(35.02\pm 1.20\%\) | \(33.08\pm 0.96\%\) | |
BMI | |||||
water | all | \(53.15\pm 0.62\%\) | \(47.88\pm 0.35\%\) | \(35.92\pm 0.40\%\) | \(34.96\pm 0.37\%\) |
gluteus | \(72.63\pm 0.54\%\) | \(65.84\pm 0.37\%\) | \(42.87\pm 0.97\%\) | \(41.40\pm 0.32\%\) | |
psoas | \(69.68\pm 0.36\%\) | \(65.60\pm 0.84\%\) | \(34.37\pm 1.00\%\) | \(34.37\pm 1.00\%\) | |
extensors | \(75.55\pm 0.80\%\) | \(69.31\pm 0.34\%\) | \(38.35\pm 0.99\%\) | \(37.27\pm 0.92\%\) | |
adductors | \(75.36\pm 0.88\%\) | \(71.13\pm 0.89\%\) | \(48.97\pm 0.94\%\) | \(48.84\pm 0.81\%\) | |
fat | all | \(61.90\pm 0.81\%\) | \(56.18\pm 3.75\%\) | \(48.96\pm 2.26\%\) | \(45.81\pm 3.66\%\) |
gluteus | \(77.09\pm 0.21\%\) | \(72.37\pm 0.21\%\) | \(53.02\pm 1.32\%\) | \(53.75\pm 1.01\%\) | |
psoas | \(78.01\pm 0.30\%\) | \(72.47\pm 0.14\%\) | \(50.77\pm 0.91\%\) | \(52.68\pm 0.65\%\) | |
extensors | \(78.88\pm 0.67\%\) | \(73.59\pm 0.26\%\) | \(55.17\pm 1.55\%\) | \(56.02\pm 1.59\%\) | |
adductors | \(77.53\pm 1.39\%\) | \(69.88\pm 1.04\%\) | \(50.81\pm 2.76\%\) | \(48.84\pm 1.39\%\) | |
FF | |||||
water | all | \(86.89\pm 0.28\%\) | \(86.72\pm 0.07\%\) | \(83.62\pm 0.36\%\) | \(83.52\pm 0.30\%\) |
gluteus | \(87.79\pm 0.49\%\) | \(84.86\pm 2.31\%\) | \(78.17\pm 1.46\%\) | \(77.19\pm 3.77\%\) | |
psoas | \(82.90\pm 0.29\%\) | \(81.01\pm 0.35\%\) | \(73.36\pm 0.64\%\) | \(72.85\pm 0.58\%\) | |
extensors | \(98.64\pm 0.19\%\) | \(98.25\pm 0.16\%\) | \(97.64\pm 0.19\%\) | \(97.32\pm 0.11\%\) | |
adductors | \(92.89\pm 0.47\%\) | \(90.92\pm 0.26\%\) | \(86.48\pm 0.28\%\) | \(85.60\pm 0.63\%\) | |
fat | all | \(88.21\pm 0.15\%\) | \(85.55\pm 0.25\%\) | \(84.28\pm 0.53\%\) | \(82.04\pm 0.22\%\) |
gluteus | \(94.03\pm 0.28\%\) | \(89.76\pm 0.25\%\) | \(84.10\pm 0.52\%\) | \(81.86\pm 0.61\%\) | |
psoas | \(88.67\pm 0.15\%\) | \(87.76\pm 0.23\%\) | \(80.37\pm 0.60\%\) | \(80.99\pm 0.33\%\) | |
extensors | \(98.78\pm 0.10\%\) | \(98.41\pm 0.12\%\) | \(98.03\pm 0.17\%\) | \(97.71\pm 0.09\%\) | |
adductors | \(92.62\pm 0.38\%\) | \(90.25\pm 0.03\%\) | \(86.10\pm 0.61\%\) | \(84.26\pm 0.81\%\) |