1 Introduction

As a nuclear imaging technology, positron emission tomography (PET) is increasingly used in clinics for disease diagnosis and intervention. It visualizes metabolic processes in the human body by detecting pairs of gamma rays emitted indirectly from a radioactive tracer injected into the body. However, the radiation exposure inevitably raises concerns about potential health hazards, while lowering the tracer dose introduces noise and artifacts that degrade the PET image quality. It is therefore of great interest to synthesize high-quality full-dose PET (F-PET) images from low-dose PET (L-PET) images, reducing radiation exposure while maintaining image quality. Modern PET scans are usually accompanied by other modalities, such as computed tomography (CT) and magnetic resonance imaging (MRI). By combining functional and morphological information, a PET/MRI system can increase diagnostic accuracy for various malignancies. Previous research also indicates the benefit that multi-modality data brings to PET image quality enhancement [1,2,3].

Several methods have been proposed for F-PET image synthesis. Most of them, however, are voxel-wise estimation methods, e.g., the random forest regression method [1], the mapping-based sparse representation method [2], the semi-supervised tripled dictionary learning method [4], and the multi-level canonical correlation analysis (CCA) framework [5]. These methods all operate on small patches, and the final estimate of each voxel is obtained by averaging the overlapping patches, resulting in over-smoothed images that lack the texture of a typical F-PET image.

In recent years, deep learning has been used to improve image synthesis. Dong et al. [6] proposed a convolutional neural network (CNN) model for image super-resolution. With a similar architecture, Li et al. [7] estimated the missing PET image from the MRI of the same subject. More recently, generative adversarial networks (GANs) have also shown superior performance in many image synthesis tasks [8]. In the literature, multi-modality data are usually incorporated into deep learning models in a global manner. For example, in CNN-based models such as two recent multi-channel GANs [9], the modalities are treated as multiple input channels, and for each channel a unified kernel (invariant to image locations) is convolved over the whole image. We refer to this kind of multi-modality fusion as the multi-channel method in this paper. However, we argue that the contributions of different modalities can vary across image locations, so a unified kernel for the whole image is not appropriate.

In this paper, inspired by the success of GANs and motivated to address the limitation of current multi-channel deep architectures, we propose a “locality adaptive” multi-modality GANs (LA-GANs) model to synthesize the F-PET image from both the L-PET image and the accompanying T1-weighted MRI image. The contributions of our method are as follows. (1) We propose a new mechanism to fuse multi-modality information in deep neural networks. The weight of each imaging modality varies with image location to better serve the synthesis of F-PET. (2) Using multiple modalities, especially in a locality adaptive way, could introduce many additional parameters to learn. We therefore utilize 1 × 1 × 1 kernels to learn the locality adaptive fusion, minimizing the increase in the number of parameters. Doing so also naturally yields a fused image that acts as a pseudo input for the subsequent learning stages. (3) We develop a 3D conditional GANs model for PET image synthesis and jointly learn the proposed locality adaptive fusion with the synthesis process in an end-to-end manner. Our 3D GANs model generates high-quality PET images by employing large image patches and hierarchical features.

2 Methodology

The proposed LA-GANs model is illustrated in Fig. 1 and consists of three parts: (1) the locality adaptive fusion network, (2) the generator network, and (3) the discriminator network. Concretely, the locality adaptive fusion network takes both an L-PET image and a T1-MRI image as input and generates a fused image by learning different convolutional kernels at different image locations. The generator network then produces a synthesized F-PET image from the fused image, and the discriminator network takes a pair of images as input, i.e., the L-PET and the real or synthetic F-PET, and aims to distinguish the real pairs from the synthetic ones. If the discriminator can easily distinguish them, the synthesized F-PET does not yet resemble the real one well, and the fusion and generator networks should be further improved to produce more realistic syntheses; otherwise, the discriminator should be enhanced instead. Therefore, the three networks are trained jointly: the discriminator network tries to correctly distinguish between real and synthetic F-PET, while the fusion and generator networks try to produce realistic images that fool the discriminator. Note that we use 3D operations in all the networks to better model 3D spatial information.

Fig. 1. Overview of the proposed pipeline for full-dose PET (F-PET) synthesis from its low-dose counterpart (L-PET) and the accompanying T1-MRI image.

2.1 Architecture

Locality Adaptive Fusion Network:

This is a module for multi-modality information fusion. As mentioned before, in most multi-channel based networks, image convolution is performed in a global manner, i.e., for each modality the same filter is applied at all image locations to generate the feature maps that are combined in higher layers. This cannot effectively handle the location-varying contributions of different imaging modalities. To tackle this problem, locality adaptive convolution should be enforced. However, if locality adaptive convolution were simply conducted within the multi-channel framework, many additional parameters would have to be learned when new imaging modalities are added. This is unfavorable for medical applications, where the number of training samples is often limited. Therefore, we add a module that produces a fused image from the multi-modality images and use the fused image as the pseudo input to the generator network. In this way, increasing the number of modalities does not increase the number of parameters in the generator. Moreover, we utilize 1 × 1 × 1 kernels for the locality adaptive convolution to minimize the number of parameters to learn in this fusion module. The fusion network is jointly learned with the generator and the discriminator so that they can effectively negotiate with each other to achieve the best possible synthesis performance. Specifically, the entire L-PET and T1-MRI images are partitioned, respectively, into N non-overlapping small patches, i.e., \( P_{i}^{L} \) and \( P_{i}^{T1} \) \( (i = 1, \ldots, N) \), as indicated by the patches in different colors in Fig. 1. Then, the two patches at the same location (indicated by the same color) from the two modalities, i.e., \( P_{i}^{L} \) and \( P_{i}^{T1} \), are convolved, respectively, with two different 1 × 1 × 1 filters with parameters \( w_{i}^{L} \) and \( w_{i}^{T1} \). For instance, in the fusion block in Fig. 1, the two gray filters are respectively applied to the two gray patches of the L-PET and T1-MRI images to generate the corresponding combined patch. Formally, the combined patch \( P_{i}^{C} \) is obtained as follows:

$$ \begin{aligned} & P_{i}^{C} = w_{i}^{L} * P_{i}^{L} + w_{i}^{T1} * P_{i}^{T1}, \\ & \text{s.t.}\; w_{i}^{L} + w_{i}^{T1} = 1;\; w_{i}^{L} > 0;\; w_{i}^{T1} > 0,\; i = 1, \ldots, N \\ \end{aligned} $$
(1)

In this way, we learn N pairs of different convolution kernels for the N local patches. The fused patches are then assembled into an entire fused image, which serves as the input to the following generator network.
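To make the fusion mechanism concrete, the following is a minimal PyTorch sketch of how such a locality adaptive fusion module could look (PyTorch is the framework used in our experiments). The class name, the sigmoid re-parameterization used to enforce the sum-to-one and positivity constraint of Eq. (1), and the zero initialization (equal weights) are our own assumptions; the paper itself specifies only the constraint and the 1 × 1 × 1 kernels.

```python
import torch
import torch.nn as nn


class LocalityAdaptiveFusion(nn.Module):
    """Fuses an L-PET patch and a T1-MRI patch with region-wise 1x1x1 kernels."""

    def __init__(self, volume_size=64, region_size=4):
        super().__init__()
        assert volume_size % region_size == 0
        self.region_size = region_size
        n = volume_size // region_size     # 16 regions per axis -> 16^3 = 4096 regions
        # One free parameter per region; w_T1 = 1 - w_L keeps each pair on the constraint.
        # Zero initialization gives equal weights (0.5 / 0.5) at the start of training.
        self.alpha = nn.Parameter(torch.zeros(n, n, n))

    def forward(self, l_pet, t1):
        # l_pet, t1: (B, 1, D, H, W) patches from the two modalities.
        w_l = torch.sigmoid(self.alpha)    # w_L in (0, 1), so w_L + w_T1 = 1 and both > 0
        r = self.region_size
        # Broadcast each region weight to every voxel inside that region.
        w_l = w_l.repeat_interleave(r, 0).repeat_interleave(r, 1).repeat_interleave(r, 2)
        return w_l * l_pet + (1.0 - w_l) * t1   # fused image, the pseudo input to G
```

With a 64³ training patch and 4 × 4 × 4 regions, this yields 16³ = 4096 regions and hence 4096 additional parameters, consistent with the numbers reported in the experiments.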

Generator Network:

In our generator network, we adopt both convolutional and de-convolutional layers to ensure that the output has the same size as the input. Since the L-PET and F-PET images belong to the same modality, they share a lot of low-level information. We therefore follow the U-Net design and add skip connections between the convolutional and de-convolutional layers, combining hierarchical features for better synthesis. The skip connections also mitigate the vanishing gradient problem, allowing the network to be much deeper.

Our network architecture (more details in the supplementary material) consists of repeated Convolution-BatchNormalization-LeakyReLU components, with 12 3D convolutional layers in total. In the encoder part, which comprises the first 6 convolutional layers, we use 4 × 4 × 4 filters with a stride of 2 for convolution and a negative slope of 0.2 for the leaky ReLU. The number of feature maps increases from 64 in the 1st layer to 512 in the 6th layer. In the decoder part, we perform up-sampling by a factor of 2.
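For illustration, here is a condensed PyTorch sketch of such a U-Net-style 3D generator. The exact channel progression between 64 and 512, the use of transposed convolutions for the ×2 up-sampling, the absence of normalization in the first layer, and all class/function names are our assumptions; only the filter size, stride, leaky-ReLU slope, the 12-layer count and the 64-to-512 range come from the text.

```python
import torch
import torch.nn as nn


def enc_block(c_in, c_out, norm=True):
    """4x4x4 convolution with stride 2, optional BatchNorm, LeakyReLU(0.2)."""
    layers = [nn.Conv3d(c_in, c_out, kernel_size=4, stride=2, padding=1)]
    if norm:
        layers.append(nn.BatchNorm3d(c_out))
    layers.append(nn.LeakyReLU(0.2, inplace=True))
    return nn.Sequential(*layers)


def dec_block(c_in, c_out, activation=True):
    """4x4x4 transposed convolution that doubles the spatial size (x2 up-sampling)."""
    layers = [nn.ConvTranspose3d(c_in, c_out, kernel_size=4, stride=2, padding=1)]
    if activation:
        layers += [nn.BatchNorm3d(c_out), nn.ReLU(inplace=True)]
    return nn.Sequential(*layers)


class Generator3D(nn.Module):
    def __init__(self, channels=(64, 128, 256, 512, 512, 512)):   # assumed progression
        super().__init__()
        self.encoders = nn.ModuleList()
        c_prev = 1                                    # the fused image has a single channel
        for i, c in enumerate(channels):
            self.encoders.append(enc_block(c_prev, c, norm=(i > 0)))
            c_prev = c
        rev = list(reversed(channels))
        self.decoders = nn.ModuleList(
            dec_block(rev[i] if i == 0 else rev[i] * 2, rev[i + 1])  # x2 channels from skip concat
            for i in range(len(rev) - 1))
        self.out = dec_block(rev[-1] * 2, 1, activation=False)       # 12th and final conv layer

    def forward(self, x):                             # x: (B, 1, 64, 64, 64) fused patch
        skips = []
        for enc in self.encoders:
            x = enc(x)
            skips.append(x)
        for dec, skip in zip(self.decoders, reversed(skips[:-1])):
            x = torch.cat([dec(x), skip], dim=1)      # U-Net skip connection
        return self.out(x)                            # (B, 1, 64, 64, 64) synthetic F-PET patch
```

With a 64³ input, the six stride-2 convolutions shrink the volume to 1³ at the bottleneck, and the six up-sampling layers restore it to 64³, each decoder layer concatenating the feature maps of its mirrored encoder layer.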

Discriminator Network:

The discriminator network is a typical CNN architecture consisting of 4 convolutional layers, each using 4 × 4 × 4 filters with a stride of 2, similar to the encoder of the generator. The first convolutional layer produces 64 feature maps, and this number is doubled at each subsequent convolutional layer. On top of the convolutional layers, a fully connected layer followed by a sigmoid activation determines whether the input pair is real or synthetic.
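A minimal sketch of this discriminator follows, again assuming PyTorch and a 64³ input patch; the BatchNorm/LeakyReLU choices simply mirror the generator's encoder and are our assumption rather than something stated in the text.

```python
import torch
import torch.nn as nn


class Discriminator3D(nn.Module):
    def __init__(self, patch_size=64):
        super().__init__()
        chans = [2, 64, 128, 256, 512]            # input = (L-PET, real/synthetic F-PET) pair
        blocks = []
        for c_in, c_out in zip(chans[:-1], chans[1:]):
            blocks += [nn.Conv3d(c_in, c_out, kernel_size=4, stride=2, padding=1),
                       nn.BatchNorm3d(c_out) if c_in != 2 else nn.Identity(),  # no norm on raw input
                       nn.LeakyReLU(0.2, inplace=True)]
        self.features = nn.Sequential(*blocks)
        s = patch_size // 2 ** 4                  # spatial size after four stride-2 convs
        self.classifier = nn.Sequential(nn.Flatten(),
                                        nn.Linear(512 * s ** 3, 1),
                                        nn.Sigmoid())

    def forward(self, l_pet, f_pet):
        x = torch.cat([l_pet, f_pet], dim=1)      # condition the decision on the L-PET input
        return self.classifier(self.features(x))  # probability that the pair is real
```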

2.2 Objective Functions

Let us denote by \( x_{L} \) an L-PET image, by \( x_{T1} \) the accompanying T1-MRI image, and by \( y_{F} \) the corresponding real F-PET image (i.e., the ground truth). In this study, we learn three mappings. The first mapping \( F_{\alpha}: (x_{L} \in {\mathbb{R}}_{Low}, x_{T1} \in {\mathbb{R}}_{T1\text{-}MRI}) \to \bar{y}_{F} \in {\mathbb{R}}_{Fused} \) is the locality adaptive fusion network, which produces a fused image \( \bar{y}_{F} \) from \( x_{L} \) and \( x_{T1} \). The second mapping \( G_{\beta}: \bar{y}_{F} \in {\mathbb{R}}_{Fused} \to \overline{\overline{y}}_{F} \in {\mathbb{R}}_{Synthetic} \) is the generator network, which maps the fused image \( \bar{y}_{F} \) to a synthetic F-PET image \( \overline{\overline{y}}_{F} \). The third mapping is the discriminator network \( D_{\gamma}: (x_{L} \in {\mathbb{R}}_{Low}, x_{T1} \in {\mathbb{R}}_{T1\text{-}MRI}, Y_{F} \in {\mathbb{R}}_{Full}) \to d \in [0,1] \), whose task is to distinguish the synthetic pair \( Y_{F} := (x_{L}, \overline{\overline{y}}_{F}) \) (ideally \( d \to 0 \)) from the real pair \( Y_{F} := (x_{L}, y_{F}) \) (ideally \( d \to 1 \)). The symbols \( \alpha \), \( \beta \) and \( \gamma \) denote the parameter sets of the three networks, respectively, and are learned automatically from a training set \( \{ (x_{L}^{i}, x_{T1}^{i}, y_{F}^{i}) \}_{i=1}^{m} \). Formally, we solve the following optimization problem

$$ \begin{aligned} & \mathop {\hbox{min} }\limits_{\alpha } \,\mathop {\hbox{min} }\limits_{\beta } \mathop {\hbox{max} }\limits_{\gamma } V\left( {F_{\alpha } ,G_{\beta } ,D_{\gamma } } \right) = \\ & {\mathbb{E}}\left[ {\log D_{\gamma } \left( {x_{L} ,y_{F} } \right)} \right] + {\mathbb{E}}\left[ {\log \left( {1 - D_{\gamma } \left( {x_{L} ,G_{\beta } \left( {F_{\alpha } (x_{L} ,x_{T1} )} \right)} \right)} \right)} \right] + \lambda V_{L1} \left( {F_{\alpha } ,G_{\beta } } \right), \\ \end{aligned} $$
(2)

with λ > 0 being a trade-off constant. The last term is an L1 loss, used to ensure that the synthetic F-PET image stays close to its real counterpart. The L1 loss is defined as

$$ {V_{L1}}\left( {{F_\alpha },{G_\beta }} \right) = {\mathbb{E}}\left[ {{{\left\| {{y_F} - {G_\beta }\left( {{F_\alpha }\left( {{x_L},{x_{T1}}} \right)} \right)} \right\|}_1}} \right]. $$
(3)

Please note that the fusion network \( F \) and the generator network \( G \) can, in a sense, be regarded as a single network whose goal is to synthesize realistic-looking F-PET images that fool the discriminator network \( D \). Following the approximation scheme in [10], minimizing the term \( \log(1 - D_{\gamma}(x_{L}, G_{\beta}(F_{\alpha}(x_{L}, x_{T1})))) \) can be replaced by minimizing the simpler form \( -\log D_{\gamma}(x_{L}, G_{\beta}(F_{\alpha}(x_{L}, x_{T1}))) \). Therefore, training the fusion network \( F \) and the generator network \( G \) amounts to minimizing

$$ L_{{\mathcal{F}},{\mathcal{G}}} \left( F_{\alpha}, G_{\beta} \right) = - \sum\nolimits_{i} \log D_{\gamma} \left( x_{L}^{i}, G_{\beta} \left( F_{\alpha} \left( x_{L}^{i}, x_{T1}^{i} \right) \right) \right) + \lambda \sum\nolimits_{i} \left\| y_{F}^{i} - G_{\beta} \left( F_{\alpha} \left( x_{L}^{i}, x_{T1}^{i} \right) \right) \right\|_{1} . $$
(4)

On the other hand, the discriminator network D tries to tell the real pair \( \left( {x_{L} , y_{F} } \right) \) from the synthetic pair \( (x_{L} ,\overline{\overline{y}}_{F} ) \) by maximizing Eq. (2). Therefore, training the discriminator network corresponds to maximizing

$$ L_{{\mathcal{D}}} \left( D_{\gamma} \right) = \sum\nolimits_{i} \left( \log D_{\gamma} \left( x_{L}^{i}, y_{F}^{i} \right) + \log \left( 1 - D_{\gamma} \left( x_{L}^{i}, G_{\beta} \left( F_{\alpha} \left( x_{L}^{i}, x_{T1}^{i} \right) \right) \right) \right) \right) . $$
(5)

2.3 Training LA-GANs

The fusion network \( F \), the generator network \( G \), and the discriminator network \( D \) are trained in an alternating manner, as in [10]. Specifically, we first fix \( F \) and \( G \) to train \( D \), and then fix \( D \) to train \( F \) and \( G \). As shown in Eq. (2), the training of \( F \), \( G \) and \( D \) is a min-max game: \( F \) and \( G \) try to minimize the loss function while \( D \) tries to maximize it, until an equilibrium is reached. At test time, only the fusion and generator networks are needed for synthesis. All networks are trained with the Adam solver using mini-batch stochastic gradient descent, with a mini-batch size of 128. The training runs for 200 epochs; the learning rate is set to 0.0002 for the first 100 epochs and then linearly decays to 0 over the second 100 epochs.
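The alternating scheme can be sketched as a single training iteration in PyTorch, reusing the hypothetical module names from the earlier sketches (LocalityAdaptiveFusion, Generator3D, Discriminator3D). The explicit log-based losses are our rendering of Eqs. (4) and (5), including the −log D substitution; the value of λ is left as an argument because it is not specified here, and the ε term and the LambdaLR schedule are our assumptions about how the stated linear decay could be implemented.

```python
import torch
import torch.nn.functional as nnf


def train_step(fusion, gen, disc, opt_fg, opt_d, l_pet, t1, f_pet, lam):
    eps = 1e-8                                         # numerical stability for the logs

    # 1) Fix F and G, update D by maximizing Eq. (5) (i.e., minimizing its negative).
    with torch.no_grad():
        fake = gen(fusion(l_pet, t1))
    loss_d = -(torch.log(disc(l_pet, f_pet) + eps).mean()
               + torch.log(1.0 - disc(l_pet, fake) + eps).mean())
    opt_d.zero_grad()
    loss_d.backward()
    opt_d.step()

    # 2) Fix D, update F and G by minimizing Eq. (4): -log D(...) + lambda * L1 (mean-reduced).
    fake = gen(fusion(l_pet, t1))
    loss_fg = -torch.log(disc(l_pet, fake) + eps).mean() + lam * nnf.l1_loss(fake, f_pet)
    opt_fg.zero_grad()
    loss_fg.backward()
    opt_fg.step()
    return loss_d.item(), loss_fg.item()


# Optimizers and schedule matching the text: Adam with lr = 2e-4, constant for the
# first 100 epochs and then linearly decayed to 0 over the second 100 epochs.
# opt_fg = torch.optim.Adam(list(fusion.parameters()) + list(gen.parameters()), lr=2e-4)
# opt_d  = torch.optim.Adam(disc.parameters(), lr=2e-4)
# decay  = lambda e: 1.0 if e < 100 else (200 - e) / 100.0
# sched_fg = torch.optim.lr_scheduler.LambdaLR(opt_fg, decay)
# sched_d  = torch.optim.lr_scheduler.LambdaLR(opt_d, decay)
```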

3 Experiments and Results

We validate our proposed method on a real human brain dataset consisting of 8 normal control (NC) subjects and 8 mild cognitive impairment (MCI) subjects, each with an L-PET image, a T1-MRI image and an F-PET image. Subjects were administered an average of 203 MBq of [18F]FDG, and the PET scans were acquired on a Siemens Biograph mMR PET-MR scanner. For each subject, the PET images are aligned to the T1-MRI via affine transformation to establish voxel-level correspondence. Each aligned image has a voxel size of 2.09 × 2.09 × 2.03 mm³ and an image size of 128 × 128 × 128. Considering the small number of training samples, we extract 125 large 3D patches of size 64 × 64 × 64 from each image, rather than using the entire 3D image directly, to train the deep model. In addition, to make full use of the available samples, we follow the widely used “Leave-One-Subject-Out” strategy. To train the proposed locality adaptive fusion network, we further partition each large image patch into 4096 non-overlapping 4 × 4 × 4 regions for fusion. Our method is implemented in PyTorch, and all experiments are carried out on an NVIDIA GeForce GTX 1080 Ti GPU with 11 GB of memory.
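For concreteness, here is a NumPy sketch of how 125 overlapping 64³ patches can be drawn from a 128³ volume. The stride of 16 voxels is our assumption: the text states only that 125 large patches are extracted per image, and a stride of 16 gives exactly 5 start positions per axis (5³ = 125).

```python
import numpy as np


def extract_patches(volume, patch=64, stride=16):
    """volume: (128, 128, 128) array; returns an array of shape (125, 64, 64, 64)."""
    starts = range(0, volume.shape[0] - patch + 1, stride)   # 0, 16, 32, 48, 64
    return np.stack([volume[x:x + patch, y:y + patch, z:z + patch]
                     for x in starts for y in starts for z in starts])
```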

Comparison with the State-of-the-Art PET Estimation Methods:

We compare our method with the following state-of-the-art multi-modality PET estimation methods: (1) the mapping-based sparse representation method (m-SR) [2], (2) the tripled dictionary learning method (t-DL) [4], (3) the multi-level CCA method (m-CCA) [5], and (4) the auto-context CNN method [3]. The average PSNR values are given in Fig. 2(a), from which we can see that our proposed method outperforms all the competing methods, demonstrating its effectiveness and advantages. In Fig. 2(b), we show an example visual result of our method compared with the two methods (m-CCA and auto-context CNN) that produce the top results in the literature. The images estimated by the m-CCA method are over-smoothed compared with the real F-PET images, due to the averaging of overlapping patches when constructing the final output. Compared with the auto-context CNN, our model better preserves detailed information in the estimated F-PET images, as indicated by the red arrows. We attribute this to the adversarial training, which constrains the synthesized images to be similar to the real ones.

Fig. 2. Comparison with the state-of-the-art PET estimation methods.

Comparison with Our Method Using Single-Modality or Multi-Channel Strategy:

To study the contribution of the anatomical information from MRI to PET synthesis and to evaluate the effectiveness of the locality adaptive fusion network in our proposed model, we further compare our method with its two variants: one using only the single modality of L-PET [11], and the other using the common multi-channel strategy [9] for multi-modality fusion. The results are reported in Table 1. First, our method obtains higher PSNR than the single-modality L-PET variant (paired t-test p-values: 0.0092 for NC and 0.0036 for MCI), indicating that the anatomical information from MRI provides important cues for PET synthesis. Second, compared with the multi-channel variant, our method boosts the average PSNR by approximately 0.28 and 0.2 for the NC and MCI groups, respectively. The standard deviation of our method is also smaller than that of the multi-channel GANs, while the median is higher. The paired t-test further indicates that our improvement over the multi-channel variant is statistically significant (p < 0.05). Moreover, the number of additional learnable parameters induced by adding T1-MRI is 4096 for our method and 8192 for the multi-channel GANs, suggesting that our model achieves better performance with fewer parameters to learn.

Table 1. Quantitative comparison with two variants of our method (single-modality GANs and multi-channel GANs) in terms of PSNR. Here, Med. means median.
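For reference, a short NumPy/SciPy sketch of the evaluation protocol used above: per-subject PSNR followed by a paired t-test between two methods on the same subjects. Using the peak intensity of the ground-truth F-PET as the PSNR data range, and the variable names, are our assumptions.

```python
import numpy as np
from scipy import stats


def psnr(pred, target):
    """PSNR (in dB) of a synthesized volume against its ground-truth F-PET volume."""
    mse = np.mean((pred.astype(np.float64) - target.astype(np.float64)) ** 2)
    peak = np.max(target)                 # peak intensity used as the data range (assumption)
    return 10.0 * np.log10(peak ** 2 / mse)


# Given per-subject PSNR lists of two methods over the same leave-one-subject-out folds:
# t_stat, p_value = stats.ttest_rel(psnr_ours, psnr_multichannel)   # paired t-test
# p_value < 0.05 indicates a statistically significant improvement.
```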

We also provide a visual comparison in Fig. 3, where the two leftmost images are the input T1-MRI and L-PET images and the rightmost image is the ground-truth F-PET. The synthesized F-PET image from our proposed model clearly has fewer artifacts than those from the single-modality and multi-channel methods, as indicated by the red arrows.

Fig. 3. Visual comparison with single-modality and multi-channel GANs methods.

4 Conclusion

In this work, we proposed a 3D locality adaptive multi-modality GANs model (LA-GANs) for synthesizing high-quality F-PET images from L-PET and T1-MRI images. Both qualitative and quantitative results demonstrate that our method significantly outperforms the traditional multi-modality fusion strategy used in deep networks, as well as the state-of-the-art PET estimation approaches. Our model could also be applied to related problems, such as mapping one or two modalities to another modality. In the future, we will investigate the potential of our model for general synthesis tasks.