Introduction

Thigh muscle deficits [1, 2] and accumulation of (local) adipose tissue [3,4,5] are important pathophysiological events in the context of the clinical science and management of musculoskeletal diseases such as knee osteoarthritis (OA) [1]. Muscles play an essential role in stabilizing the joints [1, 6], while excessive adipose tissue may induce a chronic inflammatory state by producing adipokines and inflammatory cytokines. Both are suggested to be involved in cartilage degradation, synovial inflammation, and bone erosion [3, 4]. Magnetic resonance imaging (MRI)-based analysis is increasingly used to study the association of thigh muscle and adipose tissue composition with knee OA [7,8,9,10,11,12]. Further, it has permitted investigation of the impact of training interventions on thigh tissue composition [13], as well as on functional and clinical outcomes of knee OA [5, 14]. Yet, evaluation of thigh muscle morphology and adipose tissue composition requires image segmentation, and the time needed for manual segmentation of thigh muscle and adipose tissue cross-sectional areas (CSAs) precludes the analysis of large databases and image repositories such as the Osteoarthritis Initiative (OAI) [15].

Several semi-automated [16,17,18,19,20,21] and fully automated [22,23,24,25] tools exist for thigh tissue volume and CSA segmentation. They aim to overcome the challenges in capturing the complex morphology and texture of thigh muscle and adipose tissue, which are complicated by considerable inter-subject variability (Fig. 1) and potentially also by artefacts such as intensity distortions: imperfections in MRI systems and interactions between the imaged subject and the electromagnetic field cause the sensitivity, and hence the image intensity scale, to vary over the image.

Fig. 1
figure 1

Thigh MRIs from eight OAI participants illustrating the inter-subject variability of thigh muscle and adipose tissue morphology, as well as intensity distortions

Therefore, previously published studies mainly relied on continuous methods [18, 19, 24, 26, 27] and/or applied intensity inhomogeneity correction prior to segmentation [19, 21, 22, 24]. Several of the discrete methods used k-means [19, 27] or fuzzy c-means clustering [24, 28] for adipose tissue classification and focused on atlas-based methods for the segmentation [18, 24, 25, 27] of individual muscle heads or lean muscle tissue from whole MRIs. Similar segmentation techniques have also been applied to other body tissues [29,30,31]. With more data becoming available and recent advances in machine learning and computing infrastructure, segmentation techniques based on deep convolutional neural networks (CNN) are emerging as the new state-of-the-art [32, 33]. For this reason, CNNs have recently been examined for musculoskeletal tissue segmentation of the knee joint [34,35,36] and of thigh muscle MRIs by Ahmad et al. [37] and by our group [38]. Ahmad et al. explored five pre-trained fully convolutional networks (FCN) with initialized weights for transfer learning to segment the quadriceps muscle (including the femoral bone and the medulla). The authors reached high Dice similarity coefficients (DSC) of 0.95. While in that paper [37] the FCN-8s showed the most accurate results, a modified 2D U-Net architecture achieved an even better performance when applied to segmentation of cardiac MRI data [39]. While, in general, these CNN methods show great potential for thigh muscle and adipose tissue segmentation, particularly in large clinical image repositories, it is important to demonstrate whether clinical observations can be reproduced using the segmentations generated, and thus to “clinically” validate the methodology developed. More specifically, our study aims to close this important gap between medical imaging innovation and its clinical application by reproducing a clinical effect observed in a previously published study and by comparing effect sizes. A fully automated segmentation method that is clinically validated will enable segmentation of large imaging repositories such as the OAI, in which the MRIs of several thousand patients can be analyzed and used for the development of imaging biomarkers, and ultimately for the diagnosis and treatment of disease.

The aim of the current study was therefore: (i) to determine the agreement between a fully automated thigh muscle and adipose tissue segmentation method based on a 2D U-Net technique and manual segmentation, and (ii) to test whether the findings of a previously published clinical study [40], which showed that patients display lower quadriceps CSAs in limbs with frequently painful knees than in pain-free contralateral limbs, can be reproduced using the CNN algorithm. The latter “clinical evaluation” is particularly important to show the value of the fully automated algorithm for detecting small differences between groups and within subjects over time.

Material and methods

U-Net architecture

Convolutional neural networks are a type of multilayered artificial neural network that can be used to analyze imaging data. The U-Net architecture is built upon the fully convolutional network (FCN), which consists only of successive locally connected convolution and pooling layers and a final upsampling layer. In contrast to FCNs, the U-Net (1) has symmetric downsampling and upsampling paths with deep skip connections, and (2) the skip connections between the downsampling path and the upsampling path apply a concatenation operator instead of a sum. These skip connections are intended to provide local spatial cues to the upsampling operator. Because of its symmetry, the network has a large number of feature maps in the upsampling path, which allows information to be transferred. Our proposed method also relies on data augmentation, adding random spatial transformations of existing data as additional training examples, to use the available annotated samples more efficiently and yield higher segmentation performance [32]. In this work, we used a modified 2D U-Net architecture, in which the number of feature maps in the transpose convolutions of the upsampling path was set to the number of classes; this architecture has been used previously for segmenting cardiac tissue [39]. We trained the architecture to optimize a pixel-wise multi-class cross-entropy loss, where each pixel \(i\) in image \(X\) is assigned a label \(y_{i} \in L_{a} = \{ l_{0}, \ldots, l_{L} \}\), \(p\) denotes the ground-truth probability distribution, and \(q\) denotes the network's softmax output (Eq. 1), using mini-batch stochastic gradient descent with the ADAM optimizer [41] and a learning rate of 0.01. The network was trained on an Nvidia Titan Xp GPU for 24 h.

$$C = \sum_{i} m_{i} \sum_{l \in L_{a}} p\left( y_{i} = l \right) \log q\left( y_{i} = l \mid X \right).$$
(1)
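As an illustration of the training objective, the following minimal sketch implements a weighted pixel-wise multi-class cross-entropy loss and the optimizer settings described above. The deep learning framework is not specified in the original work; PyTorch is assumed here, and the model name `UNet2D`, the class weights, and the tensor shapes are hypothetical placeholders.

```python
import torch
import torch.nn.functional as F

def pixelwise_cross_entropy(logits, target, class_weights=None):
    """Weighted pixel-wise multi-class cross-entropy (cf. Eq. 1).

    logits : (N, L, H, W) raw network outputs before softmax
    target : (N, H, W)    ground-truth label index per pixel
    class_weights : optional (L,) tensor of per-class weights m
    """
    log_q = F.log_softmax(logits, dim=1)                      # softmax output q(y_i = l | X)
    return F.nll_loss(log_q, target, weight=class_weights)    # -sum_i sum_l p(y_i = l) log q(y_i = l | X)

# Hypothetical training loop: 'UNet2D' stands in for the modified 2D U-Net;
# the ADAM optimizer and the learning rate of 0.01 follow the description above.
# model = UNet2D(in_channels=1, num_classes=9)
# optimizer = torch.optim.Adam(model.parameters(), lr=0.01)
# for images, labels in train_loader:        # mini-batch stochastic gradient descent
#     optimizer.zero_grad()
#     loss = pixelwise_cross_entropy(model(images), labels)
#     loss.backward()
#     optimizer.step()
```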

U-Net training

In principle, the following steps were performed (Fig. 2):

  1. (a)

    The network was trained with a set of images and corresponding manual segmentations made by one reader (1st reader)

  2. (b)

    Technical evaluation was performed using another dataset with corresponding manual segmentations made by the same reader (1st reader)

  3. (c)

    Technical evaluation was repeated using 96 manual segmentations from another reader (2nd reader), which had been acquired manually in the clinical study with 48 patients described under (d)

  4. (d)

    Clinical evaluation was performed in comparison with data previously generated by the 2nd reader.

Fig. 2
figure 2

Graphical abstract and method overview: a the network was trained with a set of images and corresponding manual segmentations made by one reader (1st reader); b technical evaluation was performed using another dataset with corresponding manual segmentations made by the same reader (1st reader); c technical evaluation was repeated using 96 manual segmentations from another reader (2nd reader), which had been acquired manually in the clinical study with 48 patients described under (d); d clinical evaluation was performed in comparison with data previously generated by the 2nd reader

Ad (a) training

The training set consisted of axial MR images of 250 thighs (202 left; 48 right) from 222 OAI participants (male: 44%, age 65.5 ± 10.1 years, BMI 28.8 ± 4.8 kg/m2). In these, the various muscle groups (i.e. quadriceps, hamstrings, adductors, and sartorius), adipose tissue (i.e. subcutaneous fat [SCF] and intermuscular fat [IMF]), and the femoral bone (including the cortex and the medulla) had been manually segmented to study the impact of pain [7, 8] and radiographic disease stage [42] on thigh muscle. The MR images had been acquired using a T1-weighted spin echo MRI sequence from the OAI (slice thickness 5 mm; in-plane resolution 0.98 mm; no inter-slice gap; repetition time 500 ms; echo time 10 ms) [15, 43] on a 3-T scanner (Siemens Trio, Siemens AG, Erlangen, Germany). Image acquisition encompassed 15 slices at a fixed distance from the distal femoral metaphysis [32]. Segmentation was performed at a single slice located at 33% of the femoral bone length (from distal to proximal), the anatomical location that was consistently covered in all cases according to previously established criteria [44]. All MRI datasets were manually segmented by one reader (1st reader). Since the two thighs are almost left–right symmetric, the right and left thighs were mirrored to increase the number of training samples and randomly divided into a training set (N = 225 left and N = 225 right thighs) used to adjust the U-Net weights and a validation set (N = 25 right and N = 25 left thighs) used to determine when to stop training to avoid overfitting. Note that the validation set was not used for the evaluations, but only to determine when to stop training.
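A minimal sketch of the left–right mirroring augmentation and the random training/validation split described above is given below. The array names, the flip along the image width axis, the random seed, and the loader function are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(seed=42)  # assumed seed for reproducibility

def mirror_thighs(images, masks):
    """Mirror each thigh along the left-right axis to double the training samples.

    images : (N, H, W) MRI slices; masks : (N, H, W) label maps
    """
    flipped_images = images[:, :, ::-1]
    flipped_masks = masks[:, :, ::-1]
    return (np.concatenate([images, flipped_images], axis=0),
            np.concatenate([masks, flipped_masks], axis=0))

# images, masks = load_oai_slices(...)              # hypothetical loader for the 250 thighs
# images, masks = mirror_thighs(images, masks)      # 250 -> 500 samples
# indices = rng.permutation(len(images))
# val_idx, train_idx = indices[:50], indices[50:]   # 50 validation, remainder training
# train_set = (images[train_idx], masks[train_idx])
# val_set = (images[val_idx], masks[val_idx])       # used only for early stopping
```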

Ad (b) technical evaluation with data from the same reader

The trained U-Net segmentation method was first applied to ten previously manually segmented MRI datasets from the OAI, segmented by the same reader (1st reader), that were not part of the training and validation sets.

To determine the agreement between the manual (M) and the fully automated (A) technique, the segmentations were compared using three different metrics. First, the Dice similarity coefficient (DSC) was determined to measure the pixel overlap between A and M, normalized to their respective sizes (Eq. 2). The DSC is commonly used to evaluate the agreement between segmentation methods and relates the overlap of the segmentations to their total area. It weighs pixel misclassification more strongly in smaller structures than in larger ones. Therefore, the average symmetric surface distance (ASSD) and the Hausdorff distance (HD) were also included: the ASSD was determined as an indicator of the average segmentation error, computed as the average of all distances from each pixel on the boundary \(b_{M}\) of M to the boundary \(b_{A}\) of A and vice versa (Eq. 3). Finally, we determined the HD as an indicator of the largest segmentation error, measuring the maximal distance from a pixel in A to the nearest pixel in M (Eq. 4):

$$DSC\left( M, A \right) = \frac{2\left| M \cap A \right|}{\left| M \right| + \left| A \right|}.$$
(2)
$$ASSD\left( M, A \right) = \frac{1}{\left| b_{M} \right| + \left| b_{A} \right|}\left( \sum_{x \in b_{M}} d\left( x, b_{A} \right) + \sum_{y \in b_{A}} d\left( y, b_{M} \right) \right).$$
(3)
$$HD\left( M, A \right) = \max_{x \in A} \min_{y \in M} d\left( x, y \right).$$
(4)
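The three agreement metrics can be computed, for example, as in the following sketch. Binary masks, boundary extraction via morphological erosion, and the pixel spacing (derived from the 0.98 mm in-plane resolution, expressed in cm per pixel) are assumptions; the evaluation pipeline actually used may differ, e.g. in how the Hausdorff distance is restricted to boundary pixels.

```python
import numpy as np
from scipy import ndimage

def dice(m, a):
    """Dice similarity coefficient between binary masks M and A (Eq. 2)."""
    m, a = m.astype(bool), a.astype(bool)
    return 2.0 * np.logical_and(m, a).sum() / (m.sum() + a.sum())

def _boundary_distances(src, dst, spacing):
    """Distances from boundary pixels of src to the nearest boundary pixel of dst."""
    src_border = src ^ ndimage.binary_erosion(src)
    dst_border = dst ^ ndimage.binary_erosion(dst)
    dist_to_dst = ndimage.distance_transform_edt(~dst_border, sampling=spacing)
    return dist_to_dst[src_border]

def assd(m, a, spacing=(0.098, 0.098)):
    """Average symmetric surface distance (Eq. 3); spacing in cm per pixel (assumed)."""
    d_ma = _boundary_distances(m.astype(bool), a.astype(bool), spacing)
    d_am = _boundary_distances(a.astype(bool), m.astype(bool), spacing)
    return (d_ma.sum() + d_am.sum()) / (len(d_ma) + len(d_am))

def hausdorff(m, a, spacing=(0.098, 0.098)):
    """Hausdorff distance (Eq. 4): largest distance from A's boundary to M's boundary."""
    return _boundary_distances(a.astype(bool), m.astype(bool), spacing).max()
```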

Ad (c) technical evaluation with data from another reader (2nd reader)

As interobserver differences between readers have been previously reported for thigh muscle and adipose tissue segmentations [45], this step was used to elucidate how the technical evaluation parameters differ when the neural network trained on data from one reader is applied to data from another reader. To this end, the algorithm was applied to 48 different MRIs from the OAI dataset, which had been manually segmented by another reader (2nd reader) who was involved in the clinical study described under (d). The same measures of similarity were used as described under (b).

The differences between the manual segmentations by the 2nd reader and the fully automated segmentation results in the pain study were examined using Bland–Altman analyses.
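A Bland–Altman analysis of paired CSA measurements reduces to the mean difference (bias) and the 1.96 SD limits of agreement; a minimal sketch is shown below (variable names and the sign convention are illustrative assumptions).

```python
import numpy as np

def bland_altman(manual_csa, automated_csa):
    """Return bias and lower/upper 1.96 SD limits of agreement for paired CSAs (cm^2)."""
    manual_csa = np.asarray(manual_csa, dtype=float)
    automated_csa = np.asarray(automated_csa, dtype=float)
    diff = automated_csa - manual_csa      # automated minus manual: sign convention assumed
    bias = diff.mean()
    loa = 1.96 * diff.std(ddof=1)
    return bias, bias - loa, bias + loa
```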

Ad (d) clinical evaluation

Finally, the trained U-Net segmentation method was applied to the 48 patients from a previously published pain study segmented by the 2nd reader [40]. This study had aimed to determine whether thigh muscles differ in subjects with unilateral pain, i.e. between limbs with frequent knee pain (for at least one month during the past 12 months) and contralateral limbs without any knee pain over the past 12 months. These 48 subjects (31 women; 17 men; age 45–78 years) had been drawn from 4796 OAI participants in whom both knees displayed the same radiographic stage of osteoarthritis according to the Kellgren–Lawrence grade (KLG) system and had been classified as either bilaterally KLG2 or bilaterally KLG3 [40]. Twenty-one participants displayed KLG2 (6 men, 15 women) and 27 displayed KLG3 (11 men, 16 women) in both knees. Axial MR images were used to determine quadriceps, hamstring, and adductor CSAs at 33% of the femoral length (distal to proximal).

As in the previously published paper [40], side differences between knees were determined, and the standard deviation of these side differences was calculated. Paired t-tests were used to determine whether significant side differences occurred in the quadriceps, hamstrings, and adductors, and the effect size of significant differences was described using Cohen's d.
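The side-difference statistics can be reproduced with a paired t-test and Cohen's d for paired samples, as in the following sketch (the exact Cohen's d variant used in the original analysis is not specified; the version based on the standard deviation of the paired differences is assumed here).

```python
import numpy as np
from scipy import stats

def side_difference_stats(painful_csa, painfree_csa):
    """Paired t-test and Cohen's d for within-subject side differences in CSA (cm^2)."""
    painful_csa = np.asarray(painful_csa, dtype=float)
    painfree_csa = np.asarray(painfree_csa, dtype=float)
    diff = painful_csa - painfree_csa
    t_stat, p_value = stats.ttest_rel(painful_csa, painfree_csa)
    cohens_d = diff.mean() / diff.std(ddof=1)   # effect size of the paired difference (assumed variant)
    return t_stat, p_value, cohens_d
```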

All statistical analyses were performed using SPSS version 24 (IBM Corp., USA) and Python 3.4 (Python Software Foundation, Delaware, United States).

Results

Figure 3 shows examples of different segmentations and some of the variability observed in the training dataset.

Fig. 3
figure 3

Example thigh MRI (33% distal–proximal) segmentation results from five OAI participants: original image (upper); manual segmentation results (middle); U-Net segmentation results (lower)

The agreement between manual and U-Net segmentation on the technical evaluation set (segmented by the same reader whose segmentations had been used in the training and testing of the U-Net) was consistently high for all segmented structures (overall DSC (mean ± SD): 0.96 ± 0.02, overall ASSD: 0.004 ± 0.001, overall HD: 0.022 ± 0.001; Table 1). The DSC agreement was particularly high for the SCF, the quadriceps, the hamstrings, the femoral bone circumference, and the sartorius, and somewhat lower for the femoral medulla, the adductors, and the IMF (Table 1).

Table 1 Agreement between manual and fully automated U-Net segmentation in the technical evaluation set; manual segmentations were acquired by the same reader (1st reader)

The agreement between manual and U-Net segmentation using training and evaluation data from different readers was also very high, with an overall DSC of 0.94 ± 0.04, an overall ASSD of 0.006 ± 0.004, and an overall HD of 0.085 ± 0.097 (Table 2). Yet, the measures of similarity were slightly lower than those of the technical evaluation based on data from the same reader (Table 1).

Table 2 Agreement between manual and fully automated U-Net segmentation in the technical evaluation set; manual segmentations were acquired by another reader (2nd reader)

In the technical evaluation (Fig. 4), the Bland–Altman analysis applied to data from the 1st reader showed high agreement between the automated and manual segmentation results, with differences (absolute values in cm2; percent values relative to the size of the structure) of − 1.1 cm2 (− 0.8%) for the SCF, + 0.8 cm2 (+ 2.1%) for the quadriceps, + 0.3 cm2 (− 1.0%) for the hamstrings, + 0.2 cm2 (+ 1.5%) for the adductors, + 0.02 cm2 (+ 0.7%) for the sartorius, and − 0.5 cm2 (− 3.0%) for the IMF CSAs (Fig. 4). When applied to data from the 2nd reader, the Bland–Altman analysis showed good agreement between the automated and manual segmentation results, with differences of − 1.2 cm2 (1.5%) for the SCF, − 1.0 cm2 (− 2.0%) for the quadriceps, − 0.45 cm2 (− 1.3%) for the hamstrings, + 0.2 cm2 (+ 1.5%) for the adductors, and + 0.02 cm2 (+ 0.7%) for the sartorius CSAs (Fig. 4). A systematic deviation between the automated and the manual segmentation methods was observed for the IMF CSAs, with a difference of + 2.3 cm2 (+ 14.6%) (Fig. 4).

Fig. 4
figure 4

Bland–Altman plots showing the mean difference in cm2 between the manual and the fully automated U-Net segmentation results from the pain study (segmented by the 1st reader: green; segmented by the 2nd reader: blue). The limits of agreement (± 1.96 SD) are shown as dashed lines

In the clinical evaluation study (Fig. 5), painful knees displayed significantly lower quadriceps CSAs than pain-free contralateral knees with both segmentation methods (manual: − 5.7 ± 7.9%, p < 0.001, effect size: 0.69; fully automated: − 5.6 ± 7.6%, p < 0.001, effect size: 0.73) (Table 3). The CSAs of the hamstrings and adductors, in contrast, did not show any significant side differences with either segmentation method (Table 3). No statistically significant differences were observed between the two segmentation techniques for the other muscle groups, the SCF, or the IMF, and as can be appreciated from Table 3, the mean values and standard deviations on either side were very similar between the two methods. Values of IMF CSAs obtained with the automated method tended, however, to be somewhat (approx. 15%) smaller than the CSAs obtained from manual segmentation (Table 3).

Fig. 5
figure 5

Side differences using manual and U-Net segmentation techniques of thigh MRI (33% distal–proximal) in bilateral knees with the same radiographic disease stage, but unilateral frequent pain; painful knee (right side); pain-free knee (left side); manual segmentation (upper) and U-Net segmentation (lower)

Table 3 Measured side differences in muscle and adipose tissue cross-sectional areas (CSA) between manual and U-Net segmentation techniques of thigh MRI (33% distal–proximal) in OAI participants with the same radiographic disease stage in both knees, but unilateral frequent pain; painful knee vs. contralateral pain-free knee

Discussion

The aim of the current study was to evaluate a rapid, fully automated U-Net method for thigh muscle and adipose tissue CSA segmentation from MRI that is suitable for the analysis of large imaging databases in clinical trials. For this purpose, we evaluated the agreement between the fully automated segmentations and previously performed manual segmentations, using data from the reader who performed the manual segmentations used to train the U-Net as well as data from a second reader that were not part of the training or validation set. In a second step, the current U-Net segmentation method was able to reproduce the results of a previous clinical study, in which we had observed that the quadriceps of limbs with frequently painful knees displays lower CSAs than that of contralateral limbs without knee pain.

The results of the current study showed high agreement (DSCs > 0.95) between the fully automated U-Net and the manual segmentation approach for SCF, quadriceps, hamstrings, and femoral bone segmentations, independent of whether the algorithm was compared with segmentations of the same or of a different reader. The agreement for the adductors, medulla, and sartorius was still high, but slightly lower (DSCs > 0.91), and in a similar range for both readers. The agreement for the IMF was still good (DSCs > 0.90) when the U-Net was applied to segmentations from the same reader, and notably lower (DSCs > 0.80) when the U-Net was applied to segmentations from a different reader. This was consistent with the outcome of the Bland–Altman plots: the fully automated method showed good agreement with the manual segmentations from both readers for most structures. Only the measurement of the IMF CSAs showed a considerable bias when the U-Net was applied to data from a different reader. As outlined previously, the method presented here is fully automated and not dependent on a specific reader. The difference between automated and manual segmentation observed here, however, is well within the range of the interobserver variability of manual segmentation reported in previous studies [45]. When applied to the data from the clinical study in subjects with unilateral pain, the proposed fully automated algorithm detected similar side differences in quadriceps CSAs, but with substantially less time needed for the analysis (< 1 s) than current (semi-)automated (3–6 min) or manual segmentation techniques (60–90 min, depending on the reader and the image quality).

Prescott et al. used a numerical analysis-based level set approach and reported DSCs ranging from 0.69 ± 0.16 (vastus medialis) to 0.82 ± 0.08 (vastus lateralis) for the individual quadriceps heads [17]. Trotter et al. also focused on the individual quadriceps heads, reaching a DSC of 0.87 ± 0.11 with a fully automated multi-atlas framework [18]. Baudin et al. reported an average DSC of 0.86 ± 0.07 for individual thigh muscle heads, combining a statistical shape atlas with a random walks graph [26]. Andrews et al. presented a probabilistic shape model framework and reported a mean DSC of 0.81 ± 0.07 for the segmentation of all individual thigh muscle heads [23]. Yang et al. used a voxel classifier combined with morphological operations on four-contrast Dixon MR images; the authors reached a DSC of 0.96 ± 0.03 for the SCF, 0.80 ± 0.03 for the IMF, and 0.97 ± 0.01 for the combined thigh muscles [22]. Karlsson et al. based their work on a multi-atlas segmentation approach for muscle tissue segmentation from whole-body MRI and reached true positive classification rates from 0.93 ± 0.01 to 0.93 ± 0.03 [24]. Orgiu et al. introduced a discrimination of muscle and adipose tissue from T1-weighted MRIs of the thigh using a fuzzy c-means algorithm and morphological operators, reporting a mean sensitivity above 96% and mean relative area differences of 1.8%, 2.7%, and 2.5%, respectively [28]. A first attempt at quadriceps MRI segmentation based on deep neural networks was undertaken by Ahmad et al. [37]. The authors explored five pre-trained deep learning models (FCN-AlexNet, FCN-32s, FCN-16s, and FCN-8s with initialized weights for transfer learning, as well as PSPNet) with two different optimizers (stochastic gradient descent and ADAM) for the quadriceps (including the femoral bone and medulla as one large segmentation label); FCN-8s combined with ADAM emerged as the best all-around deep learning model, with a DSC of 0.95 for the quadriceps and a fast inference time of 0.117 s per image. In our study, we obtained an average DSC of 0.98 for the quadriceps, both when using evaluation data segmented by the same reader and by another reader (relative to the training dataset), and hence were able to improve upon this previous approach. In addition, we included other and far more complex thigh MRI structures, such as the other muscle groups and the IMF, and reached an overall DSC of 0.96 for the evaluation dataset from the same reader and an overall DSC of 0.94 for that from a different reader, improving upon the current state of the art.

More importantly, in the previous approaches [17, 18, 22,23,24, 26, 28, 37] performance was not evaluated in the setting of a clinical study. The ability to reproduce the relatively small side differences in the quadriceps muscle shown previously is promising for the application of the automated method in future studies, in particular for the thigh muscle and adipose tissue compartments that are in the focus of knee OA research [2, 13].

A potential limitation of the study is that the proposed fully automated segmentation method was trained only for a particular anatomical location (the 33% level of the femoral bone, distal–proximal) and not for other CSAs or a volumetric analysis. However, muscle CSAs acquired at the 33% level were shown to be strongly correlated with 3D muscle volume [46] and were found to be sensitive to longitudinal change or cross-sectional differences in several clinical studies [12, 47, 48]. In addition, a longitudinal reduction of CSAs acquired at 33% of the femoral length was shown to be associated with muscle strength loss in patients with a concurrent increase in knee OA pain [10].

Another potential limitation is that the fully automated method showed a bias relative to manual segmentation for the IMF that was greater when the method was applied to data from the 2nd reader (14.6%, DSC: 0.80), whose segmentations were not part of the training and validation sets, than when applied to data from the 1st reader (− 3.0%, DSC: 0.90), whose data were used to train the U-Net. This difference between the two readers is consistent with results from previous studies that reported an interobserver variability between manual segmentations of two different readers of 18.4%/14.7% for the IMF before/after quality control, respectively [45]. Yet, the DSC observed for the IMF using data from the same reader compares quite favorably with the literature, while the DSC achieved with data from the 2nd reader is still comparable with the best result of 0.80 achieved in a study by Yang et al. using Dixon MRIs [22]. Further, since the observed effect was systematic and therefore similar for all patients, measuring (side) differences or longitudinal change in the IMF may not be strongly affected. Yet, future studies will have to establish the sensitivity to change for the IMF and SCF, for instance during weight gain or loss.

The strength of the current study was that it not only assessed the agreement between manual and automated segmentation, but also showed that the results of a clinical study could be reproduced using this new method.

Conclusion

Our novel U-Net-based approach to muscle segmentation was shown to be accurate and can thus be applied to the fully automated evaluation of large datasets, requiring considerably less time (< 1 s) than current (semi-)automated (3–6 min) or manual segmentation techniques (60–90 min). More importantly, the effect shown in a clinical study, namely that knees with unilateral frequent pain display lower CSAs of the quadriceps (but not of other thigh muscles) than contralateral knees without knee pain, was reproduced with a comparable effect size to that of manual segmentation.