To date, there is no consensus on how to standardize the assessment of ototoxicity in serial measurements. DPOAE paradigms currently used in the clinic cannot differentiate between the two DPOAE components arising from continuous sound stimulation. Nor do the paradigms take into account the individual middle-ear transfer function. Pulsed DPOAEs with individually optimal stimulus levels lead to improved validity and lower variability of test results in subjects with normal hearing. A combined analysis of pulsed DPOAEs and hearing thresholds best resolves changes in hearing status.
Background
The aim of follow-up assessments of the functional state of the cochlear amplifier is to track changes with high sensitivity and specificity. In everyday clinical practice, serial measurements are used, for example, for the timely recognition of the influence of ototoxic substances on hearing function or to establish the efficacy of regenerative therapy. To date, however, there is no international consensus on how to assess ototoxicity or regeneration in a standardized way. The American Academy of Audiology considers the determination of the pure-tone hearing threshold, especially in the high-frequency range, and the measurement of distortion-product otoacoustic emissions (DPOAEs) to be the most reliable, clinically applicable methods [
10].
DPOAEs represent intermodulation products resulting from the simultaneous stimulation of the cochlea with two stimulus tones of frequencies
f1 and
f2 (typically,
f2/f1 ≈ 1.2) with stimulus levels
L1 and
L2. DPOAEs are directly based on the nonlinearity of the mechanoelectrical transduction of the outer hair cells located near the characteristic place of the stimulus frequency
f2. Therefore, DPOAEs provide frequency-specific information about the functional state of the cochlear amplifier [
2].
A previous recommendation for monitoring and evaluating ototoxicity in children and adolescents includes the medical history, pure-tone audiometry for the frequencies 1–8 kHz, DPOAE, and tympanometry [
6]. Accordingly, a test battery of different methods should be performed, as individual methods are not sufficiently informative. Preliminary studies suggest that DPOAEs detect changes in hearing earlier than pure-tone audiometry and have a higher sensitivity to subtle or subclinical changes [
7]. DPOAE thresholds showed higher sensitivity than single DPOAE levels in two studies [
13,
22]. High-frequency audiometry (HFA) at 9–16 kHz can detect hearing changes more often than pure-tone audiometry [
1]. In children, DPOAEs are used to detect early ototoxic, cisplatin-induced decreases in amplitude or signal-to-noise ratio (SNR) of DPOAEs [
14].
In audiological follow-up examinations, a high test–retest reliability of the measurement method is essential in order to distinguish systematic pathological or regenerative changes from random measurement inaccuracies; the validity of the measurement method is equally important. For example, DPOAE level changes (< 6 dB) observed with clinically used DPOAE protocols alone cannot predict, with sufficient sensitivity and specificity, an ototoxic hearing-threshold increase verified by pure-tone audiometry [
16]. Multivariate analyses that take into account DPOAE levels at neighboring frequencies, SNR, and the dose–response relationship increase the predictive power for detecting ototoxic hearing damage, but have not yet become established in clinical practice [
16]. Consequently, there is currently no clinically validated, significant DPOAE change that predicts potential cochlear damage [
15,
22].
DPOAEs are currently regarded in the clinic as a useful, supplementary method for the diagnosis of cochlear function, but they have limitations in their diagnostic value [
11]. There are three main limiting factors: (1) DPOAEs essentially consist of two components, the nonlinear distortion component and the coherent reflection component, which are generated at different locations along the organ of Corti by different mechanisms [
25]. Depending on both the level and the phase differences between the components, the waves can variously interfere and thus lead to artifact-prone measurement results [
29]. (2) DPOAE signals are influenced by individual middle-ear characteristics, particularly of retrograde transmission [
17]. (3) DPOAE levels show a relatively limited correlation with cochlear hearing loss, with the relationship being nonlinearly dependent on both level and frequency [
4,
12].
An extended DPOAE diagnostic approach is provided by DPOAE growth functions, which semi-logarithmically map the sound pressure of the DPOAE amplitude as a function of the stimulus level
L2 for each frequency. Extrapolation of a regression line to the
L2 axis yields the so-called estimated distortion-product threshold (EDPT), whereby its level is denoted by
LEDPT. The EDPT correlates approximately 1:1 with the hearing threshold [
5]. Diagnostic precision is significantly improved by the artifact-free acquisition and analysis of DPOAEs in the time domain using pulsed stimuli [
8,
27,
29,
30], together with the application of individually optimal, frequency-specific stimulus levels, which are acquired using DPOAE level maps [
28]. DPOAE level maps depict the growth behavior of DPOAE amplitude as a function of stimulus-level combinations that sample an extended area in
L1,
L2 space and allow for the derivation of
LEDPT by numerically fitting a nonlinear mathematical function to the DPOAE amplitude samples. Importantly, this procedure does not require a priori choice of a stimulus path to yield the maximum DPOAE amplitudes for a given subject [
28].
The
LEDPT provides a promising method for ascertaining true changes in the functional state of the cochlear amplifier. The
LEDPT can quantify hearing loss with high accuracy [
28] and has a high test–retest reliability [
3], yielding high sensitivity and specificity in serial monitoring. DPOAE levels, which predominate in studies of serial monitoring compared with DPOAE thresholds, also exhibit particularly high test–retest reliability [
9,
20,
23]. However, given that the hearing threshold appears to be approximately proportional to the DPOAE level with a slope of 2 [
18], then significant differences in the test–retest reliabilities of DPOAE levels and thresholds—either DPOAE thresholds or hearing thresholds—can only be estimated after first multiplying the changes of DPOAE level by 2 [
3]. When comparing test–retest reliabilities, not only must the measurement times be kept in mind, but also that DPOAE levels primarily contain information about the suprathreshold behavior of the cochlear amplifier, whereas DPOAE thresholds characterize behavior near neural threshold and thus assess the maximum amount of cochlear amplification.
The aim of the present study was to reduce the influence of measurement inaccuracies in the respective methods and to increase the test–retest reliability by using a combined analysis paradigm of pure-tone threshold (LTA), estimated distortion-product threshold (LEDPT), and DPOAE level (LDP).
Discussion
DPOAE level: significance and test–retest reliability
In general, DPOAE levels show a relatively limited correlation with cochlear-induced hearing loss, and the complex relationship between DPOAE level and the associated hearing loss is nonlinearly dependent on stimulus level and frequency [
4,
12]. Interestingly, after using the individually optimal stimulus level
L1,opt at a moderate stimulus level
L2 = 45 dB SPL, a significantly higher correlation of
LDP and
LTA and a lower scatter was found, especially in the high-frequency range
f2 = 8–14 kHz (Fig.
3). Since the inclusion criterion for the study was defined as PTA
4 (0.5–4 kHz) < 20 dB HL, there was hardly any hearing loss for some frequencies in the range 1–6 kHz and, therefore, the
LTA range was insufficient to detect potential correlation between
LDP and
LTA and consequently any associated middle-ear or noise influences.
The idea behind the use of optimal stimulus levels is to achieve an ideal overlap between the travelling-wave envelopes of the two stimulus tones
f1 and
f2 near the characteristic place of the
f2 tone by taking into account the different compression states of the two travelling waves near the
f2 characteristic place [
24], so that the distortion produced by nonlinear mechanoelectrical transduction is maximal near that place [
2]. Although these two aims were originally introduced by Kummer et al. [
18], the algorithms for attaining an optimal
L1 for a given
L2 differ significantly: In our case, the optimization parameters are derived individually for each session, subject, and
f2 and, as such, our algorithm is a major development of the earlier optimizing algorithm (the “scissor’s” algorithm) where the parameters were independent of the subject and
f2 [
18]. Individually optimizing
L1 led to a reduction in the inter-subject variability of
LDP, especially for
f2 = 8–14 kHz (Fig.
3b).
Not only the inter-subject variability of LDP, but also the intra-subject variability, the so-called test–retest reliability, improved significantly by selecting frequency-specific, individually optimal stimulus levels. Consequently, the frequently quoted reference range for an intra-subject DPOAE change from examination to examination at the stimulus level of L2 = 65 dB SPL was reduced from approximately 6–8 dB to 4–5 dB. In future clinical examinations, pulsed DPOAE signals excited using frequency-specific, individually optimal stimulus levels could therefore be a valuable method for detecting early signs of changes in the functional state of the cochlear amplifier, before they become visible with conventional DPOAE level measurements.
Estimated DPOAE thresholds: significance and test–retest reliability
DPOAE level maps capture—with high precision—the intensity behavior of the cochlear amplifier near the
f2 place for different stimulus level pairs
L1,
L2.
LEDPT derived from such maps incorporate information from multiple DPOAE amplitudes and thus allow for a more precise and extended diagnosis of the functional state of the cochlear amplifier, as has already been shown by us and other authors for DPOAE growth functions and their properties [
26]. With numerical extrapolation of the DPOAE amplitudes, the growth behavior of the DPOAE amplitudes can also be derived at low stimulus levels. It is precisely at such levels that the test significance of the functional state of the cochlear amplifier is highest due to nonlinear amplification being highest at low stimulus levels [
18].
In addition, provided that there is no damage to the inner hair cells and neural pathways,
LEDPT enables an objective quantification of the hearing threshold [
8,
28,
30]. Such quantification is not possible using conventional protocols in the clinic, where suprathreshold DPOAEs are measured at one or two stimulus levels. Moreover, changes in the individual transmission characteristics of the middle ear can be captured by DPOAE level maps, whereby losses in anterograde middle-ear transmission shift and distort the DPOAE level maps and losses of retrograde middle-ear transmission reduce DPOAE amplitude [
19]. The DPOAE growth function is extrapolated along the ridge of the three-dimensional surface of the model DPOAE level map (Fig.
2) and is thus based on maximum DPOAE amplitudes generated using individual, near-ideal stimulus levels.
LEDPT based on DPOAE level maps estimate hearing thresholds more accurately than conventional DPOAE growth functions that are excited with predetermined stimulus levels [
28]. In this study,
LTA for
f2 = 1–14 kHz correlated with the
LEDPT derived from DPOAE level maps with a standard deviation of 7.7 dB (Fig.
5). This error estimate is slightly higher than the standard deviation of 6.5 dB reported by Zelle et al. [
30], and is attributable to a factor four reduction in the averaging time per DPOAE signal in the present study together with the extension of the frequency measurement range from 1–8 kHz to 1–14 kHz. In addition, for reasons of the still limited quantity of hearing-loss data, the regression analysis has not been performed at single frequencies, in which case the standard deviation of the estimate of
LTA predicted from
LEDPT for a given frequency can be significantly reduced [
30]. Moreover, it is expected that the implementation of a modern calibration procedure such as IPL (integrated pressure level) or FPL (forward pressure level) would further reduce the standard deviation, particularly at high frequencies [
20].
LEDPT not only estimate individual hearing thresholds accurately, but are also stable for follow-up measurements in a given ear [
3]. The test–retest reliability of
LEDPT for the entire frequency range
f2 = 1–14 kHz with a median AD of 3.3 dB is comparable to that of
LTA (median AD = 3.2 dB), whereas for the high-frequency range,
f2 = 11–14 kHz,
LEDPT are superior to
LTA [
3]. The reference range corresponding to the 90th percentile, above which an ear must be considered in need of control in follow-up examinations, is approximately 10 dB for both
LEDPT and
LTA for
f2 = 1–14 kHz. When
LDP differences are doubled to correct for
LTA being proportional to
LDP with a slope of 2,
LDP show a comparable test–retest reliability; namely, with a median AD of 2.8–3.6 dB and a 90th percentile of 8–12 dB when using
L1,opt (Table
2). Since
LDP and
LEDPT are partly subject to different confounding factors (e.g., middle-ear pathology, noise sources) and physiological mechanisms, the strategy introduced in this study was to combine the two DPOAE parameters and the auditory threshold parameter into a single parameter that is as sensitive and reliable as possible.
Combined analysis paradigm: significance and test–retest reliability
To date, changes in the pure-tone hearing threshold and DPOAE level (typically, measured at
L2 = 65 dB SPL,
L1,std = 75 dB SPL) have generally been considered separately in everyday clinical practice for the monitoring of ototoxicity. To the best of our knowledge, the test–retest reliability of concurrent changes in hearing thresholds and DPOAE levels has not been reported in the literature. Only multivariate statistical DPOAE analyses that consider DPOAE level and SNR simultaneously have been presented for predicting hearing threshold [
11] and ototoxic hearing loss [
16]. As predictors of hearing status, multivariate DPOAE analyses achieve better test quality compared with univariate approaches using either DPOAE level or SNR. However, even with multivariate analyses, there is still considerable overlap between the distributions for normal-hearing and hearing-impaired people, which was found to be more pronounced for the frequency range 0.75–3 kHz than for 4–8 kHz [
11]. Multivariate DPOAE analyses also lead to improved test performance for predicting an increase of ototoxic-induced hearing threshold, but only when the cumulative cisplatin dose is included in the analysis [
16]. Using a 6-dB change in the DPOAE level as a metric allows for little to no improvement over an analysis based on cumulative cisplatin dose and pre-exposure hearing threshold [
16]. The analysis paradigm presented here, which combines changes in
LEDPT, suprathreshold
LDP, and fine-structure-reduced
LTA, significantly improved the test–retest reliability (Fig.
6 and Table
3). It is expected that this approach will lead to higher sensitivity and specificity in future studies for detecting pathological or regenerative changes in the outer hair cells.
Since this study focused on the validation of the methodology of pulsed DPOAEs in follow-up measurements in normal-hearing subjects, there are few data for mild-to-moderate hearing loss for
f2 = 1–6 kHz. Therefore, for the purpose of specifying a metric that combines the various parameters, we assumed that
LDP is correlated with
LTA with a ratio of 1:2, the assumption being mainly based on the findings of Kummer et al. [
18]. Given the nonlinear dependence of
LDP on stimulus frequency and level, future applications of the combined analysis paradigm should quantify the relationship between
LDP and
LTA as a function of frequency and level using individually optimal stimulus levels
L1.
Although it was shown here that the combined analysis paradigm together with the pulsed DPOAE protocol yields a higher test–retest reliability than reported to date, it still has to be established that, for example, ototoxic hearing impairment in follow-up examinations of patients receiving chemotherapy with cisplatin can be detected earlier and more sensitively by using DPOAE level maps and combined analysis paradigms compared with other audiological test procedures.
In addition, the procedure could be optimized through further technical adjustments. A modern calibration procedure for sound pressure could be implemented that avoids erroneous stimulus levels due to standing waves within the auditory canal and thus facilitates the detection of DPOAEs in still higher numbers and quality. It would also be advantageous to develop an adaptive algorithm that enables the detection of DPOAE level maps within an L1,L2 space depending on the SNR, in order to reliably construct DPOAE level maps in as many patients with residual cochlear hearing as possible in a time-efficient manner.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.