1 Introduction
Sarcopenia is characterized by an extreme age-related decline in skeletal muscle mass and strength [
1]. In particular, it is a registered disease under the International Classification of Diseases (ICD). However, its unified diagnostic criteria remain open [
2]. The Asian Working Group for Sarcopenia (AWGS) has proposed the use of appendicular skeletal muscle mass, grip strength, and gait speed as diagnostic criteria [
3]. Furthermore, frailty (physical frailty) represents an intermediate state between independence and the requirement of nursing care [
4]. Consequently, five items have been suggested as the diagnostic criteria for frailty: weight loss, exhaustion, low physical activity, slow gait speed, and weakness [
5]. Both sarcopenia and frailty are associated with decreased skeletal muscle quantity and function; thus, skeletal muscle mass is a crucial indicator for care prevention. Skeletal muscle analysis using computed tomography (CT) images has been recognized as an effective method for diagnosing sarcopenia [
6]. Hanaoka et al. proposed an analysis of the psoas major muscle using cross-sectional CT images at the third lumbar vertebra (L3) level [
7]. Similarly, the skeletal muscles at level L3 [
8] and the surface muscles of the entire body [
9] have been segmented in several studies. In diagnostic support based on skeletal muscle segmentation and analysis, the site-specific muscles are important, as well as the entire skeletal musculature [
10].
This study focuses on the sternocleidomastoid muscle, which is a powerful muscle that runs along the lateral neck [
11]. The sternocleidomastoid muscle plays a crucial role in various movements of the head and neck, such as rotation of the head to the opposite side and flexion and extension of the neck. Electromyographic analysis of the sternocleidomastoid muscle can effectively differentiate amyotrophic lateral sclerosis [
12] from cervical spondylotic myelopathy, which presents with similar initial symptoms [
13]. The distribution of Hounsfield units (HU) for site-specific skeletal muscles, including the sternocleidomastoid muscle, overlaps with that of organs and other skeletal muscles in CT images. This overlap makes it difficult to distinguish between these regions using threshold-based segmentation techniques. Automatic segmentation of the sternocleidomastoid muscle has been attempted in body CT images using bone positional information at the anatomical attachment sites and a probabilistic atlas [
14]. Thus, automatic segmentation of the sternocleidomastoid muscle has been researched for computer-aided diagnosis (CAD) [
15]. However, the sternocleidomastoid muscle is not entirely visible in body CT images, which presents a challenge to landmark acquisition and alignment for atlas-based recognition methods. Consequently, the average recognition accuracy across 20 cases remains limited to 65.4%.
In contrast, the psoas major muscle, which is another specific skeletal muscle, is fully visible in body CT images and has a simpler shape than does the sternocleidomastoid muscle. This has enabled model-based recognition with an accuracy of 72.3% [
16]. In recent years, deep learning approaches, particularly methods based on U-Net [
17], have been successfully applied to muscle segmentation. Hashimoto et al. achieved segmentation of the psoas major muscle in low-dose CT images [
18]. Furthermore, Castiglione et al. achieved skeletal muscle segmentation at the third lumbar vertebra with U-Net-based methods [
8]. The potential of deep-learning-based automatic feature selection has been noted for skeletal muscle segmentation and could produce more accurate musculoskeletal analysis than conventional methods that manually select features [
19]. Furthermore, U-Net-based segmentation models for 104 anatomical structures have been proposed [
20]. Thus, deep learning-based methods for medical image segmentation have demonstrated their effectiveness not only for skeletal muscles but also in various other areas including organs and bones. Given this background, we conducted a study to compare between the conventional atlas-based method and a novel deep-learning-based approach for the three-dimensional (3D) automatic segmentation of the sternocleidomastoid muscle in body CT images.
The aim of this study is to achieve the joint segmentation of the sternocleidomastoid muscle and other skeletal muscles by applying deep learning techniques to the automatic segmentation of the sternocleidomastoid muscle. It is important not only to recognize the sternocleidomastoid muscle with higher accuracy than that achieved by conventional probability atlas-based recognition methods, but also to simultaneously enable comprehensive analysis of skeletal muscles. To accomplish this, we propose a joint segmentation method for the sternocleidomastoid muscle and other skeletal muscles using multiclass learning with a 2D U-Net [
17] architecture. The aim is to segment the sternocleidomastoid muscle region. Unlike existing deep learning approaches that focus solely on the target muscle region, the proposed method simultaneously learns the sternocleidomastoid muscle and the complete skeletal musculature, and it facilitates the acquisition of 3D segmentation results for both the target sternocleidomastoid muscle and the entire skeletal muscle compartment. By incorporating knowledge from the surrounding muscle context, the proposed method enhances the segmentation performance for the sternocleidomastoid muscle region. Our method represents an advancement in applying U-Net architectures to skeletal muscle segmentation and marks a breakthrough in addressing the accuracy challenges that have confronted previous skeletal muscle segmentation methods. The proposed method simultaneously learns the sternocleidomastoid muscle and the entire skeletal musculature, and it facilitates the acquisition of 3D segmentation results for both the target sternocleidomastoid muscle and the entire skeletal muscle compartment. Therefore, the proposed method is expected to achieve site-specific skeletal muscle segmentation and comprehensive skeletal muscle analyses. The effectiveness of the proposed method was validated through a comparison with conventional methods.
5 Conclusion
In previous research, the 3D automatic recognition of the sternocleidomastoid muscle has been limited to recognition with the use of an established atlas, and no deep learning approaches have been explored. Additionally, although deep learning has been applied to the automatic recognition of skeletal muscles as a whole, it has been limited to the L3 cross-section, and 3D recognition remains open. This study proposed a multiclass learning approach for the joint segmentation of the sternocleidomastoid and skeletal muscles using a 2D U-Net architecture. The proposed method achieved segmentation accuracies of 82.94% for the sternocleidomastoid muscle and 92.73% for the entire skeletal muscle compartment. The proposed method demonstrated not only higher accuracy than the conventional atlas-based method for automatic recognition of the sternocleidomastoid muscle [
14], but also surpassed the performance of deep learning-based methods that have shown effectiveness in recognizing other specific skeletal muscles [
22,
28]. Furthermore, the proposed method was able to realize high-accuracy segmentation regardless of the input range of slices, owing to the fact that skeletal muscles are depicted in all cross-sectional slices of CT images. These results suggest that the proposed method is effective for recognizing skeletal muscles, leading to a comprehensive analysis of the skeletal muscular system in the trunk region. However, cases in which the mandible was depicted, exhibited lower segmentation accuracy compared to other cases, albeit with improvements over conventional methods. Future work will focus on enhancing robustness by not only applying U-Net-based successor networks, but also expanding the training dataset by including open data, and normalizing posture variations.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.