Introduction
Otitis media with effusion (OME) typically occurs due to persistent negative middle ear pressure and poor ventilation in the middle ear. Atelectasis and attic retraction pocket are the result of tympanic retraction in the pars tensa and pars flaccida, respectively, and nearly always occur concurrently with OME [
1], although they could be the sequela of OME, especially since atelectasis is more frequently observed in cases of OME surgery [
2,
3]. Atelectasis and attic retraction pocket are structural problems involving the tympanic membrane and include the possibility of chronic complaints and severity that progresses over time. Determining whether atelectasis or attic retraction pocket is present is an important feature of OME diagnosis [
4]. Early diagnosis with appropriate follow-up enables practical methods for managing OME cases that include mild atelectasis or attic retraction pocket, and to promote natural self-healing [
5,
6]. Severe atelectasis and attic retraction pocket nearly always requires additional surgery to treat the lesions [
5,
7], as severe atelectasis and attic retraction pocket could predispose the affected individual to complications such as adhesive otitis, cholesteatoma formation, and erosion of the ossicles [
8‐
12]. Many objective auxiliary diagnostic tools for OME are available, including tympanometry and pneumatic otoscopy [
4]. However, the diagnosis of atelectasis and attic retraction pocket is typically based on otoscopy and assessment by an expert clinician. Several types of smartphone adaptable otoscopes can be used to acquire tympanic membrane images by either non-specialists or non-clinicians [
13‐
16]. Diagnosis of ear disease made solely with manual examination and otoscopic images, however, has a low rate of accuracy, which may lead to an improper referral, delayed or improper treatment, and unnecessary follow-up. Previous research [
17,
18] has found that the rate of correctly diagnosed otitis media by pediatricians was only 50% compared to that of 73% by otolaryngologists.
The progressive use of telemedicine and artificial intelligence in the otologic setting may gradually change the current approach to disease management. Previous studies have established machine learning models for the diagnosis of ear diseases that have achieved high diagnostic accuracy [
19‐
27]. However, these studies regarded atelectasis and attic retraction pocket as one condition, rather than two distinct disorders. Moreover, OME and tympanic retraction were regarded as two separate diseases, overlooking the fact that these lesions can co-exist. When OME and tympanic retraction co-exist, standard diagnostic models diagnose only one lesion, such as OME [
20,
21]. No DL studies have focused on dividing OME into different types according to the presence of atelectasis or attic retraction pocket.
In the present study, we developed and validated a DL model to identify the presence of attic retraction pocket and atelectasis in OME cases with the use of multi-center otoscopic images. We further classified OME into different types based on atelectasis and attic retraction pocket, which may be used to improve the procedures for accurate OME diagnosis and management.
Discussion
In this study, we developed and validated a DL model to get an accurate diagnosis of OME by identifying the presence of attic retraction pocket and atelectasis with multiple centers of OME otoscopic images. The DL model could be used to determine further classification of OME by attic retraction pocket and atelectasis. This CNN algorithm obtained an AUC of 0.89 for the identification of attic retraction pocket and 0.87 for atelectasis in OME otoscopic images. The CAM of the DL model showed a consistent discriminative region of tympanic membranes to otolaryngologists.
Attic retraction pocket and atelectasis are the most commonly observed tympanic changes in OME, which currently lacks an auxiliary diagnostic tool. Different types of otoscopes (e.g., smartphone-based imaging otoscopes [
13,
14,
16]) are the preferred diagnostic modality for diagnosing attic retraction pocket and atelectasis, but these approaches are limited for use by clinicians who lack sufficient diagnostic experience, particularly in clinicians who are not specialists in otolaryngology. During the procedures carried out for OME management, attic retraction pocket and atelectasis could be detected by our DL model. In clinical practice, the DL model could be useful in diagnosing attic retraction pocket and atelectasis and for assisting clinicians in determining a more accurate diagnosis. In addition, the DL diagnostic model could increase the performance of clinicians in diagnosing these disorders. Then, precise diagnosis with DL model could save the economic cost of misdiagnosis and missed diagnosis. Including avoiding unnecessary referral and surgery, and facilitating the screening of severe OME cases. Besides, DL diagnostic model could be loaded on the website, and could be used by patients, while tympanometry and otoscope always need additional professionals, which will increase the cost of disease diagnosis.
When the attic retraction pocket or atelectasis occurs in OME cases, patients should be immediately referred to otolaryngologists for further evaluation. Moreover, for young otolaryngologists and those not specialized in otolaryngology, this model could be used as a study tool to increase their knowledge of attic retraction pocket and atelectasis.
Previous studies have established DL models for the diagnosis of tympanic retraction and achieved an average level of accuracy ranging from 85.78 to 88.06% [
20,
22]. However, these previous studies regarded tympanic retraction and OME as different conditions, which ignores those cases in which tympanic retraction and OME are both potentially present. Cha et al. [
20] proposed a DL model that detected attic retraction pocket or adhesive otitis media with an accuracy of 85.78%. However, these authors merged atelectasis and attic retraction pocket into a single class rather than distinguish them as two different conditions. Moreover, a disease labeling approach was used when there was more than one feature in otoscopic images, in which only the more severe feature was identified. For example, when OME and attic retraction pocket were both present, the DL model only provided an output for the diagnosis of OME. As a result, such an approach will result in disregarding cases of milder disease. Shie et al. [
21] extracted color, geometric, and textural features to develop a classification system for differentiating most types of otitis media, achieving an accuracy of 88.06%; however, the machine learning model was developed in a small dataset that included 865 otoscopic images. Moreover, the color of otoscopic images may vary with diverse conditions for illumination and different otoscope systems; thus, color is not a stable variable in an accurate diagnostic model. During the course of disease, attic retraction pocket is likely to progress to cholesteatoma, and atelectasis may evolve to include ossicular erosions [
31]. In addition, the surgical approach to severe tympanic membranes in pars tensa and pars flaccida widely differs. Therefore, we divided tympanic retraction into atelectasis and attic retraction pocket. Compared to previous models, we were able to detect the atelectasis and attic retraction pocket separately. This image classification system, therefore, was the first to individually diagnose two types of tympanic membrane lesions. The performance of the DL model in the present study has been found to be better than the diagnostic performance of pediatricians, general practitioners, and otolaryngologists, with rates of accuracy ranging from 50 to 80% [
17,
32,
33]. This DL model could provide an objective second opinion to assist otolaryngologists in making a correct diagnosis. The performance of this DL model is comparable to the diagnostic accuracy of tympanometry in diagnosing OME, with a degree of sensitivity ranging from 76 to 96% [
34,
35].
Our results showed that DL models could identify different regions (pars tensa and pars flaccida) of retraction on the tympanic membrane with varying degrees of performance. Based on clinical experience, it is reasonable to suggest that attic retraction pocket is easier to identify than atelectasis because the normal tympanic membrane shows a mild retraction in the pars tensa without a retraction in the pars flaccida. Thus, in cases of mild retraction, atelectasis may be subtle and difficult to determine, whether it is normal or at stage I atelectasis. Our image dataset was representative and collected from three hospitals with different types of otoscopes, photo conditions, and record systems.
Considering our experience [
19] and that of other teams [
20,
24,
26,
27], Google Inception-V3 demonstrated high performance in developing a diagnostic DL model based on otoscopic images. Therefore, the Google Inception-V3 CNN model was adopted as the backbone network, and subsequently trained, tuned, and evaluated. Based on our findings, the foremost approach to OME otoscopic images with atelectasis by CAM was to focus on pars tensa and attic retraction pocket by CAM on pars flaccida, particularly large and deep retraction pockets and atelectasis, which is consistent with current practice by otologists.
Limitations
Some limitations in our study should be noted. Although the CNN algorithm could identify mild and severe attic retraction pocket and atelectasis, due to the low incidence of severe attic retraction pocket and atelectasis [
36‐
38], however, there were not enough images to develop and validate a DL model to identify the different stages of attic retraction pocket and atelectasis. This limitation meant that clinicians were required to complete the task of further classification. Moreover, the retrospective nature of the design created some limitations with regard to data collection, such as inconsistent illumination of the otoscopes. The accuracy of the DL model was considerable affected by the quality of otoscopic images, which is associated with the operators’ examination skills and with the cooperation of the children being examined. Larger, prospective studies with more detailed data for collection rules are needed to improve the performance of the DL model. Finally, non-medical history and hearing information were provided for the DL model and to the otolaryngologists, which may have affected the accuracy of diagnosis. Clinicians could improve the accuracy of diagnosis by combining information from disease history.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.