Introduction

Hallucinations are perceptual experiences in the absence of an adequate external stimulus, and auditory verbal hallucinations (AVH) are a key symptom of psychosis. In schizophrenia, the 1-month prevalence of these hallucinations exceeds 70%,1 and in 25–30% of patients these perceptions are resistant to medication, resulting in functional disability and a low quality of life.2, 3 With limited therapeutic options, the development of new treatment strategies4 is needed. One such intervention is repetitive transcranial magnetic stimulation (TMS). Hoffman was the first to report that low-frequency TMS decreased the severity of medication-resistant AVH when the coil is directed at the left temporoparietal5 area. Since that initial report, a number of studies have been conducted with a similar treatment paradigm and focus.6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 Nevertheless, in a recent study with the largest sample size so far, no difference between sham and stimulation has been found.22 However, it has been suggested that patients respond differentially to TMS,23 making the exploration for possible predictors desirable. Predictors may be either found in the clinical phenomenology of symptoms or in brain function. With respect to symptoms, only minimal response to TMS has been reported in continuous hallucinators.6, 14 With regard to brain function, only functional magnetic imaging (fMRI), but not resting perfusion, has been used to predict response to TMS in AVH. Using temporoparietal TMS, lower levels of hallucination-related activation in Broca’s area predicted the response rate in intermittent hallucinators, and reduced coupling between Wernicke’s and a right homolog of Broca’s area predicted the response-rate in continuous hallucinators.15 fMRI requires detecting the symptoms of hallucinations in the scanner, which is extremely challenging depending on many uncontrollable factors, thus making it clinically practical only in a few patients. Therefore, predictors, such as resting neuronal activity, not requiring symptom and patient cooperation, would clearly be clinically more useful. We recently showed that the improvement of symptoms after 10 days of TMS treatment was associated with decreased neuronal activity in the primary auditory cortex, Broca’s area and cingulate gyrus,24 suggesting that resting perfusion, reflecting neuronal activity, is an appropriate marker for the effectiveness of TMS. In the current study, we applied a different approach. Our aim was to identify responders and nonresponders post hoc using resting-state perfusion and not to test the effectiveness of TMS itself. Considering that there is a hyperactivity in language-related regions in AVH patients, which is related to symptom severity,25, 26, 27 we postulated that cerebral-resting perfusion before TMS treatment is higher in responders compared with nonresponders. To achieve a larger sample size, we here pooled patients using two different TMS protocols. A repetitive TMS protocol that has recently been introduced into clinical research is theta-burst stimulation.28, 29, 30 Compared with the often-used 1-Hz stimulation protocol, theta-burst stimulation has the advantage that the application duration is very short, making the treatment more convenient for the patient. In a recent study, we found theta-burst and 1-Hz TMS to be equally effective,24 and therefore we randomly assigned patients to receive either a 1-Hz or theta-burst TMS protocol. Before treatment, we measured whole-brain resting perfusion using magnetic resonance (MR) arterial spin labeling (ASL), an MR technique that provides a direct quantitative measure of cerebral blood flow (CBF).31, 32, 33 ASL is a noninvasive technique that reliably measures whole-brain CBF with converging results to those obtained with invasive PET(Positron emission tomography) perfusion imaging.34

Materials and methods

Patients and clinical investigation

Inclusion criteria were the diagnosis of schizophrenia or schizoaffective disorder according to ICD-10 (F20, F25), age between 18 and 65 years, right-handedness and medication-resistant AVH. Therapy refractoriness was defined as having no response to at least two different antipsychotic treatment options within licensed dose ranges, each administered for at least 8 weeks. Medication dose remained stable during the treatment. Exclusion criteria were history of epileptic seizures, signs of elevated neuronal activity by electroencephalography, MR contraindications and medical disorders other than schizophrenia or schizoaffective disorder. Substance misuse in the 4 weeks before treatment was ruled out by a urinary drug screen before treatment. All patients underwent identical diagnostic procedures and identical MR protocols. The diagnostic procedure was performed on the basis of clinical interviews and psychiatric history. The assessment of psychopathology, consisting of the auditory hallucination rating scale (AHRS),35 the hallucination change scale35 and the positive and negative symptom scale (PANSS),36 was performed before and after 10 days of TMS treatment. To detect clinically relevant improvements in symptoms, we used a conservative level of at least 50% in the hallucination change scale to define response to TMS, similar to the one used in previous studies.14, 35 The Edinburgh handedness scale37 was assessed to ensure that all patients were right-handed. The investigation was conducted in accordance with the Declaration of Helsinki and approved by the local ethics committee. Patients provided informed written consent to participate in the study.

Experimental design

MRI was conducted on a 3-T whole-body MRI system (Magnetom Trio, Siemens Medical Systems, Erlangen, Germany) with a standard 12-channel radiofrequency head coil. High-resolution three-dimensional structural (s)MRI and ASL were acquired before TMS. T1-weighted three-dimensional magnetization-prepared rapid gradient echo scans were recorded (number of slices=176, matrix=256 × 256, slice thickness=1 mm, voxel size=1 × 1 × 1 mm3), serving as a high-resolution three-dimensional anatomical template for coregistration with functional data. A pseudo-continuous ASL technique was used to measure CBF.38 In this gradient-echo echo-planar imaging sequence, interleaved images with and without labeling were acquired. A delay of 1250 ms was applied between the end of the labeling pulse (label time=1720 ms) and image acquisition (slice acquisition time=45 ms), in order to reduce transit artifact (field of view=220 mm2, matrix=64 × 64, repetition time/echo time=4000/18 ms and flip angle=25°). A total of 14 slices (voxel size=3.4 × 3.4 × 6 mm3, gaP=1.5 mm) were acquired in the anterior and posterior commissure line, from inferior to superior, in a sequential order. The pseudo-continuous ASL scan comprised 80 acquisitions. ASL data analysis was performed using aslm39 (downloadable at http://aslm.sourceforge.net), a newly developed toolbox that allows multiple subjects to be analyzed at once and to perform region of interest analysis of ASL data. aslm is an object-oriented toolbox based on Matlab (Matlab version 8, release 14; The MathWorks, Natick, MA, USA), SPM 8 (Wellcome Department of Imaging Neuroscience, London, England; http://www.fil.ion.ucl.ac.uk/spm/software/spm8/) and marsbar.40 All ASL time series were first realigned to correct for motion artifacts. We calculated flow-time series by subtracting the labeling images from the control images, and subsequently computed mean CBF images for each subject.41 Each individual subject’s T1 anatomy was segmented into gray matter (GM) and white matter. Mean ASL images were then coregistered to GM-segmented T1. T1, GM, white matter and ASL images were normalized to the SPM MNI T1 template. ASL images were spatially smoothed with a three-dimensional 8-mm Full Width at Half Maximum Gaussian kernel. Data were z-transformed (z=(voxel CBF-global GM CBF)/s.d.) to remove sources of variance caused by differences of global mean CBF between acquisitions and corrected for GM using GM segments as inclusive masks. For visualization, CBF maps and clusters of the whole-brain analysis were overlaid onto T1-weighted MR image slices using aslm.

Neuronavigated TMS

We chose area Spt (Sylvian parietotemporal), located in the sylvian fissure at the parietotemporal boundary, as the target region for TMS. Area Spt is a sensorimotor interface between the sensory and motor speech systems,42 is strongly left dominant,43 and is activated equally by the perception and reproduction of aurally or visually presented words. There is evidence of functional abnormalities within area Spt in patients with schizophrenia.44 Because area Spt is connected to the frontal and temporal language regions that are involved in the generation of AVH, it qualifies as an ideal target region for TMS modulation of AVH. A modified version of an evaluated language-processing task introduced by Hickok et al.43 was used to localize the target region (Spt) for TMS treatment in an fMRI scan of each patient in the first MR measurement. This measurement took place immediately after the resting perfusion measurement with ASL. After superimposing the individual’s fMRI activation map on its anatomical mesh (three dimensional reconstruction of T1 image), a TMS target point was created at area Spt. A custom TMS stimulator (MagPro R 100, Medtronic Functional Diagnostics, Skovlunde, Denmark) was used to generate repetitive biphasic magnetic pulses. Magnetic pulses were delivered with a figure-8 coil (Magnetic Coil Transducer MC-B70, Medtronic). Stimulation of the motor cortex with single TMS pulses until a movement of the contralateral thumb was detected permitted the identification of the individual resting motor thresholds.45 The center of the coil was held tangentially to the skull. TMS pulse intensity was adjusted to 90% of the resting motor threshold. Patients were randomly assigned to receive a 1-Hz (n=12) or theta-burst (n=12) TMS protocol. In both treatment groups, the target area was stimulated for 10 consecutive days. Stimulation at 1 Hz was applied once a day, according to the protocol used by Hoffman et al.35 In the theta-burst group, each burst contained three pulses at 30 Hz, repeated with an interburst interval of 100 ms.29 Theta burst was applied in double trains with a 15-min intertrain interval. On the first 3 days, 2 double trains of theta burst were applied (based on the protocol in Nyffeler et al.29), whereas on days 3–10 one double-theta-burst train was applied. Safety protocols were in accordance with international safety standards of TMS experimentation.46 A frameless, ultrasound-based stereotactic system was used for neuronavigation (BrainvoyagerTMS Neuronavigator System, Brain Innovation, BV 2006,47 Maastricht, The Netherlands). After superimposing the individual’s fMRI activation map on its anatomical mesh (three-dimensional reconstruction of T1 image), a TMS target point was created at area Spt. Details on the fMRI stimulation paradigm have recently been published elsewhere.24

Statistical analysis: CBF

Average mean GM CBF values of responders and nonresponders were compared in a two-sample t-test. A global and assumption-free second-level random-effect analysis was then performed using a two-sample t-test on GM CBF images obtained in each participant (degree of freedom (df)=22). We report clusters with a voxel-level threshold of P<0.001 (uncorrected) at a cluster-level threshold of P<0.05, whole-brain corrected for familywise error.

Results

Clinical data

Twenty-four patients were included in the study. Clinical and demographic characteristics of the subject sample are detailed in Table 1. After 10 days of TMS treatment, patients were rated again by a blinded clinician. Nine patients showed a clinical improvement of at least 50% and were therefore classified as responders; the other 15 patients were classified as nonresponders. Responders and nonresponders did not differ in the levels of AHRS and PANSS before treatment (Table 1). In addition, the frequency item (P=0.8), the reality item (P=0.4) and the loudness item (P=0.1) of the AHRS did not differ in responders and nonresponders. Responders and nonresponders were not different in age, gender, age of onset, disease duration or medication levels (Table 1). No differences were detected between the 1-Hz (mean: 0.32±0.39) and theta-burst (mean: 0.32±0.32) protocols comparing the amount of improvement as measured with the hallucination change scale (P=1). There was no difference in the relationships of responders to nonresponders in the 1-Hz group (4:8) and in the theta-burst group (5:7, P=1).

Table 1 Patient characteristics with mean values, s.d., statistical differences with P-values

ASL data

The average mean global CBF in the responders and nonresponders was 58.3 (±12.5) and 52.0 (±9.0) ml 100 g−1 min−1, respectively (Table 1). The average mean global CBF images of responders and nonresponders are shown in Figure 1. As average mean global CBF did not differ between responders and nonresponders (P=0.2), analyses were performed with normalized data to reduce variability introduced by nonspecific fluctuations in global CBF. In the whole-brain random-effect analysis, two contrasts were tested: first, responders>nonresponders, that is, voxels that were significantly higher perfused in responders than in nonresponders before treatment, and second the inverse contrast (nonresponders>responders). In the first contrast, one single cluster of 120 voxels survived a cluster-level whole-brain correction for familywise error in the left superior temporal gyrus (STG) (Brodmann area 22, peak: t[22]=7.28, x/y/z=−58/−26/4, Figure 2). No clusters survived in the inverse contrast (nonresponders>responders).

Figure 1
figure 1

Average mean global CBF images of responders (top) and nonresponders (bottom). CBF values were z-transformed.

Figure 2
figure 2

Contrast responders>nonresponders, that is,voxels that were significantly higher perfused in responders than in nonresponders before treatment. Single cluster of 120 voxels survived a cluster-level whole-brain correction for familywise error in the left superior temporal gyrus (Brodmann area 22, peak: t[22]=7.28, x/y/z=−58/−26/4). No clusters survived in the inverse contrast (nonresponders>responders).

ASL data and psychopathology

The normalized levels of CBF in the left STG before treatment were correlated (Spearman’s rank correlation) with the amount of improvement, as measured with the hallucination change scale (r=0.7, df: 22, P<0.0001, Figure 3). There was no relationship between the level of perfusion in STG in responders and nonresponders and the levels of symptoms as measured by the PANSS (r=−0.04, df: 22, P=0.9) before treatment. In addition, pretreatment scores of the AHRS items frequency (P=0.9), reality (P=0.7) and loudness (P=0.5) were not correlated with the level of STG perfusion.

Figure 3
figure 3

Scatter plot of improvement in percent, as measured with the hallucination change scale and CBF in the left superior temporal gyrus before treatment (Spearman’s rank correlation, r=0.7, df: 22, P<0.0001). CBF values were z-transformed.

Reciever operator characteristic (ROC) analysis

To further investigate the discriminative power of CBF in the left STG, we conducted a receiver operator characteristic (ROC) analysis. That way, we measured how well CBF in the left STG could be used to distinguish between responders and nonresponders. The area under the ROC curve (AUC) was 0.96 (s.e.: 0.05, 95% confidence interval: 0.87–1.0, P<0.0001, Figure 4). The cutoff point for best sensitivity and specificity was calculated at z=0.26 (Figure 4).

Figure 4
figure 4

ROC analysis to test the discriminative power of the CBF in the left superior temporal gyrus before treatment in responders and nonresponders to transcranial magnetic stimulation. The AUC was 0.96 (s.e.: 0.05, 95% confidence interval: 0.87–1.0, P<0.0001). The cutoff point for best sensitivity and specificity was calculated at z=0.26.

Discussion

Here we could demonstrate that CBF predicts TMS therapeutic response in patients with schizophrenia suffering from AVH. Responders showed a higher resting perfusion in the left STG before treatment. The clinical relevance of this finding derives from the fact that possible responders to TMS might be identified before the treatment by a single noninvasive MR measurement of resting perfusion.

In the present study, a sample of 24 patients with AVH who had been treated for 10 days with TMS was examined to test whether resting perfusion measurement with ASL before treatment could be used to distinguish between responders and nonresponders. A single cluster of 120 voxels in the left STG was detected that was significantly higher perfused in responders than in nonresponders and that was very close to the region targeted by TMS, suggesting that the efficacy of TMS depends on the level of brain activation in the area close to the target region. Patients with higher brain activity in the left STG might constitute a specific clinical subgroup of patients responsive to TMS. This is in line with recent findings. The reduced but still significant effect size of TMS on refractory AVH, when compared with previous quantitative reviews that has recently been published,23 suggests the possibility that TMS may be effective in clinical subgroups of patients only. In addition, a study with the largest sample size (n=62) so far showed that low-frequency TMS administered to the left temporoparietal cortex or to the site of maximal hallucinatory neuronal activation is not more effective for medication-resistant AVH than sham treatment.22 Although the authors reported that mean severity of AVH significantly decreased over time, no significant difference in reduction of AVH or any other psychotic symptom was revealed among the three treatment conditions. They described that even when the two groups receiving guided and nonguided TMS were combined (n=42), TMS was not superior to sham treatment in reducing AVH or other psychotic symptoms. However, resting brain perfusion was not measured in their study, and thus the missing effect might be explained by lower levels of CBF corresponding to a low neuronal activity in the target region. On the other hand, in the aforementioned study, an online fMRI paradigm was used during the experience of AVH to determine the TMS target region. As BOLD signal changes represent relative changes of neuronal activity, they may be associated with unresponsiveness to TMS. Thus, the present study extends the previous ones by suggesting that baseline resting CBF in specific language-related cerebral areas is the key factor determining the efficacy of TMS therapy. More specifically, a nonresponder showing statistically relevant BOLD activations during AVH might still be ‘under-perfused’ in the same area compared with a responder. Our finding of an elevation in the left STG perfusion in responders to TMS gives further evidence for the involvement of primary language production and perception areas in AVH. It is widely believed that AVH arise from a disorder of inner verbal experiences and its monitoring.48, 49 Altered activation in speech production areas (that is inferior frontal gyrus), and altered coupling with monitoring areas (anterior cingulate) and language reception areas (Wernicke's area), has been proposed in a model of bottom-up dysfunction through overactivation in the secondary (and occasionally primary) sensory cortices, which leads to the experience of vivid perceptions in the absence of sensory stimuli.50, 51 In line with this model, TMS delivered to the left temporoparietal cortex possibly reduces AVH.6, 35, 52 Recently, we published a case of a schizophrenia patient with medication-resistant AVH that was successfully stimulated at the same region with transcranial direct current stimulation.4 These findings suggest that AVH require the activation of speech perception neurocircuitry, consistent with what has been proposed as a model of underconstrained perception.53 Interestingly, none of the covariates demonstrated any prediction power, suggesting that neurobiological factors are more closely related to the possibility of modulating AVH with TMS. In addition, there was no statistical difference between the efficacy of theta-burst and 1-Hz TMS protocols, which is in line with our recent finding that theta burst and 1-Hz TMS are equally effective in the treatment of AVH.24

Several limitations of our study design merit comment. One must consider that no direct relationship between brain activity and the occurrence of hallucinations during scanning was investigated, because in ASL the mean cerebral perfusion over the scanning time is calculated and therefore temporal resolution is low. In addition, these results will have to be replicated in a sham-controlled design that applies the cutoff point found in our study. Both sham and stimulation groups should include an even number of patients with a high-enough level of STG perfusion before treatment. Thus, a decision-making algorithm for future studies could be developed following the ROC analysis. A level of normalized CBF in the left STG of at least 0.26 could then be used to decide whether to include or exclude a patient because of expected treatment response in TMS.

How can the higher STG perfusion in responders be explained? Given the fact that the target region for TMS was also situated in the STG, one might speculate that the efficacy of TMS relies on a certain level of brain activity in the area close to the target region.29 This is in line with the suggestion that the disturbance of the AVH-generating network by TMS is the key factor.24 In the condition of relatively low brain activity in patients, an inhibitive TMS treatment might not be the appropriate treatment approach. Future studies should be aimed at investigating whether there are alternative brain regions in such nonresponders that could be involved in the generation of AVH, and that could be considered the possible target regions for TMS.

In conclusion, this study showed that only patients with higher STG perfusion benefited from TMS as adjuvant treatment. Being noninvasive, relatively easy to use and independent of the occurrence of AVH during scanning, the measurement of resting perfusion with ASL before treatment might be a clinically relevant way to identify possible responders and nonresponders to TMS.