Data-driven shape parameterization for segmentation of the right ventricle from 3D+t echocardiography

doi:10.1016/j.media.2014.12.002

Medical Image Analysis

Volume 21, Issue 1, April 2015, Pages 29-39

https://doi.org/10.1016/j.media.2014.12.002 Get rights and content

Highlights

•
The right ventricle is segmented jointly across multiple echocardiography sequences.
•
Segmentation is constrained by a linear basis shape model.
•
The linear basis shape model is optimized during segmentation.
•
The framework is applied to multiple-view and multiple-subject datasets.

Abstract

Model-based segmentation facilitates the accurate measurement of geometric properties of anatomy from ultrasound images. Regularization of the model surface is typically necessary due to the presence of noisy and incomplete boundaries. When simple regularizers are insufficient, linear basis shape models have been shown to be effective. However, for problems such as right ventricle (RV) segmentation from 3D+t echocardiography, where dense consistent landmarks and complete boundaries are absent, acquiring accurate training surfaces in dense correspondence is difficult.

As a solution, this paper presents a framework which performs joint segmentation of multiple 3D+t sequences while simultaneously optimizing an underlying linear basis shape model. In particular, the RV is represented as an explicit continuous surface, and segmentation of all frames is formulated as a single continuous energy minimization problem. Shape information is automatically shared between frames, missing boundaries are implicitly handled, and only coarse surface initializations are necessary.

The framework is demonstrated to successfully segment both multiple-view and multiple-subject collections of 3D+t echocardiography sequences, and the results confirm that the linear basis shape model is an effective model constraint. Furthermore, the framework is shown to achieve smaller segmentation errors than a state-of-art commercial semi-automatic RV segmentation package.

Graphical abstract

Introduction

Segmentation of the left ventricle (LV) and right ventricle (RV) from 3D+t echocardiography is an important task for quantifying cardiac function. In practice, this can be challenging because of missing anatomical boundaries and ambiguous edge features. 3D analysis of the LV has received most attention in literature, and model-based methods have been shown to be successful at LV segmentation from 3D echocardiography (Mitchell et al., 2002, Noble and Boukerroui, 2006). The RV literature is less developed, in part as the shape is harder to model, but also because RV ultrasound (US) scans are generally more incomplete due to the relative position of the RV with respect to the transthoracic probe.

Commonly, a surface representing the endocardium is deformed frame-to-frame so that it adheres to the blood-tissue boundary (e.g. Orderud et al., 2007). Specifically, the surface is represented either explicitly or implicitly and an energy minimization problem is formulated which captures the notion of “fitting” the surface to the blood-tissue boundary subject to priors or regularization. An explicit representation directly models the geometry of the segmentation surface (Blake and Isard, 2000). Examples include point distribution models (Cootes et al., 1995), truncated ellipsoids (Orderud, 2006), spline and subdivision surfaces (Piegl and Tiller, 1995, Stam, 1998), and B-spline Explicit Active Surfaces (BEAS) (Barbosa et al., 2012). Implicit representations model the segmentation surface as the level-set of a higher-dimensional function over the image domain (Caselles et al., 1997, Osher and Sethian, 1988).

Implicit representations of the LV and RV have the advantage that they allow for more complex appearance models because they naturally define interior and exterior segmentation regions (Lankton and Tannenbaum, 2008, Huang et al., 2014). The primary drawback of implicit representations, and level sets in particular, is that direct optimization over the entire function is slow. Level sets were made practical by the development of fast marching and narrowband methods, which only update the level set function near the implicit interface boundary or zero level set (Lankton, 2009, Sethian, 1999, Whitaker, 1998). Common to these approaches is that a first-order gradient-based evolution equation is derived from the energy and used to update the level set function. Since the surface fit is specified directly against image intensities—a result of the more complex appearance model—an accurate initialization is necessary so that the first-order optimization does not converge to an unwanted local minimum.

Like level set methods, the surface fit for explicit representations can also be formulated directly against the image (e.g. Kass et al., 1988), but specification of interior and exterior regions is, with the exception of BEAS (Barbosa et al., 2013), typically more difficult. However, if a discrete set of boundary candidates is available—detected independently or based on the current surface state—then the surface fit can instead be defined as the distance between these points and their corresponding points on the explicit surface. In this case, explicit representations are more amenable to non-linear continuous optimization algorithms more powerful than gradient descent.

To handle missing boundaries, regularization is used to constrain the segmentation surface to be smooth and physically plausible. Simple spatio-temporal regularizers are physically motivated and straightforward to implement for explicit or implicit surface representations. However, for large boundary gaps, such as those encountered when acquiring 3D+t US sequences of the RV, the simple interpolating action of these regularizers is insufficient. Instead, shape models can be used to constrain the model surface.

In medical image analysis, point distribution models and triangle meshes have been popular model surface representations. For these definitions, linear basis shape models have been constructed using Principal Component Analysis (PCA) on a set of training surfaces which have been aligned semi-automatically (Bosch et al., 2002, Cootes et al., 1995). The advantage of these Active Shape Models (ASMs) is that the dimensionality of the model surface is reduced. The disadvantage is that the resulting parameterization can be too restrictive and prevent modeling of local deformation unseen in the training examples.

To remedy this, hierarchical ASMs have been proposed which recover scale- and location-specific linear basis shape models. In Davatzikos et al. (2003), this is achieved in 2D by first computing wavelet coefficients of the x- and y-components of the training model contours. The coefficients are then partitioned into bands based on scale and spatial location, and PCA is applied to each. In 3D, spherical wavelets with adaptively selected bands (Nain et al., 2005, Nain et al., 2006), Catmull-Clark subdivision wavelets with fixed bands (Li et al., 2007), and diffusion wavelets (Essafi et al., 2009) with orthomax PCA (Kaiser, 1958, Stegmann et al., 2006), have been proposed. Each of these algorithms produces a linear basis shape model which enables “legal” deformations of the dense model points to be specified with a small number of parameters, where each parameter (by design) controls local shape deformation only.

A key challenge when constructing any of the aforementioned shape models is acquiring accurate training surfaces which are in dense correspondence. Automatically detecting correspondences based on shape features (e.g. positions of high curvature) has been proposed (Brett and Taylor, 2000, Wang et al., 2000), but this is difficult for the RV because of the absence of dense consistent landmarks across the surface, complete boundaries, and increased shape variability (Caudron et al., 2012, Petitjean and Dacher, 2011). Furthermore, while training surfaces are rigidly aligned using Procrustes analysis (Bosch et al., 2002), local incorrect correspondences can remain which introduce artificial shape variation into the model. Therefore, it is necessary to model the parameterization differences between the dense points of the training surfaces. In Davies et al. (2002), this is achieved by mapping each surface to the unit sphere and optimizing predefined parameter transformations for each training surface to construct the ASM of minimum description length (Rissanen, 1983).

In this paper, a framework is described which performs joint 3D segmentation of the RV from multiple 3D+t echocardiography sequences, while simultaneously optimizing all correspondences and an underlying linear basis shape model (Fig. 1). This framework is a modification of Cashman and Fitzgibbon (2013)—where 3D linear basis shape models of animals are learned from 2D exterior silhouettes—to 3D segmentation of multiple-subject and multiple-view collections of 3D+t echocardiography sequences. The key differences are:

•
In Cashman and Fitzgibbon (2013), the exterior silhouettes of animals are recovered by semi-automatic segmentation. Therefore, all boundary candidates are valid and ordered, and dynamic programming is used to initialize the boundary candidate correspondences (preimages). Here, boundary candidates are derived using a simple edge detector—only a subset are valid. Furthermore, the boundary candidate positions are noisy and missing boundaries are common. Therefore, a “model-to-data” approach is adopted: the boundary candidate correspondences are initialized by sampling the model surface uniformly, and boundary candidates are subsequently selected based on the current model surface geometry. Robust fitting terms are also used.
•
In Cashman and Fitzgibbon (2013), independent rigid transformations and shape parameters are modeled for each frame. Here, scales are introduced for each subject and rigid transformations are introduced for each sequence. Shape similarity is also enforced between frames which are of the same subject at the same point in the cardiac cycle.

Conceptually, the proposed framework is also similar to Zhou et al. (2013), where in 2D, explicit model contours are simultaneously fitted to multiple images in a sequence and constrained to be of similar shape. In Zhou et al. (2013), shape similarity is achieved by minimizing the nuclear norm of the matrix composed of the x- and y-components of all model contours. Here, shape similar is enforced explicitly using a linear basis shape model.

As will be shown, the described framework is suitable for the proposed application for three reasons. First, a Loop subdivision surface is used for the model surface, which by construction, has a small number of parameters but is flexible and can realize local shape deformations. Second, joint optimization of all continuous parameters—including the linear basis shape model control vertices, rigid transformations, and boundary candidate correspondences—mitigates the requirement for any registration or fusion of the input images (e.g. Rajpoot et al., 2011), or accurate training surfaces that are in dense correspondence. Third, it naturally handles missing boundaries.

The structure of the paper is as follows. In Section 2 the complete model energy and optimization algorithm for our framework is presented. In Section 3 and Section 4 we then demonstrate the application to two problems which are hard to solve using prior approaches. First, we show how multiple 3D+t sequences acquired from different viewpoints for a single subject can be segmented jointly while optimizing a subject-specific shape model. Second, we show how multiple 3D+t sequences acquired from multiple subjects—with potentially different viewpoints—can be jointly segmented. Conclusions are given in Section 5.

Section snippets

Method

In this article we denote matrices with uppercase letters (X) and vectors with bold-face lowercase letters ( $x$ ). Column vectors from matrices are denoted by indexed bold-face letters. For example, $x_{i}$ is column i of the matrix X. Similarly, the $j$ th element of a vector $x$ is denoted by $x_{j}$ . Cursive uppercase letters ( $X$ ) denote sets.

The input to our framework is a collection of 3D+t sequences from one or more subjects. The 3D echocardiogram of the $k$ th frame of the $j$ th sequence for the $i$ th subject is

Experiments

Experiments were performed to demonstrate the application of our framework for different use cases, to assess its overall segmentation performance, and to determine the usefulness of the underlying linear basis shape model. The datasets used for test and validation are described in the next subsection and the details of each use case follow.

Single subject, multiple views

Example slices of echocardiography frames segmented using our framework (Linear Basis Shape RV, LBSRV) and the thin-plate regularization baseline (BL) for SSMV are shown in Fig. 6. When all boundaries are available (Fig. 6(a)), the segmentation surfaces for both methods reasonably delineate the RV. However, for a temporally aligned frame from a different view (Fig. 6(b)), LBSRV implicitly utilizes the information from the other views which plausibly interpolates the missing boundary.

Conclusions

In this article, a framework to perform model-based segmentation of multiple 3D+t sequences while jointly optimizing an underlying linear basis shape model has been described.

The framework was motivated by difficulties specific to RV segmentation from 3D+t echocardiography. Specifically, large regions of missing boundary candidates are common when imaging the RV due to the relative position of the RV with respect to the transthoracic US probe. Simple model surface regularizers are incapable of

Acknowledgments

The first author would like to thank the Rhodes Trust for funding this research. Data acquired on EPSRC Grant EP/G030693/1 was used in this research.

References (41)

D. Barbosa et al.
Fast and fully automatic 3-D echocardiographic segmentation using B-spline explicit active surfaces: feasibility study and validation in a clinical setting
Ultrasound Med. Biol.
(2013)
A.D. Brett et al.
A method of automated landmark generation for automated 3D PDM construction
Image Vis. Comput.
(2000)
J. Caudron et al.
Cardiac MRI assessment of right ventricular function in acquired heart disease: factors of variability
Acad. Radiol.
(2012)
Y. Chen et al.
Algorithm 887: CHOLMOD, supernodal sparse Cholesky factorization and update/downdate
ACM TOMS
(2008)
T.F. Cootes et al.
Active shape models—their training and application
Comput. Vis. Image Understand.
(1995)
X. Huang et al.
Contour tracking in echocardiographic sequences via sparse representation and dictionary learning
Med. Image Anal.
(2014)
S. Osher et al.
Fronts propagating with curvature-dependent speed: algorithms based on Hamilton–Jacobi formulations
J. Comput. Phys.
(1988)
C. Petitjean et al.
A review of segmentation methods in short axis cardiac MR images
Med. Image Anal.
(2011)
K. Rajpoot et al.
The evaluation of single-view and multi-view fusion 3d echocardiography using image-driven segmentation and tracking
Med. Image Anal.
(2011)
Agarwal, S., Mierle, K., et al., 2010. Ceres solver....

P.R. Amestoy et al.

An approximate minimum degree ordering algorithm

SIAM J. Matrix Anal. Appl.

(1996)

D. Barbosa et al.

B-spline explicit active surfaces: an efficient framework for real-time 3-D region-based segmentation

IEEE TIP

(2012)

A. Blake et al.

(2000)

J.G. Bosch et al.

Automatic segmentation of echocardiographic sequences by active appearance motion models

IEEE TMI

(2002)

V. Caselles et al.

Geodesic active contours

IJCV

(1997)

T.J. Cashman et al.

What shape are dolphins? Building 3D morphable models from 2D images

IEEE TPAMI

(2013)

C. Davatzikos et al.

Hierarchical active shape models, using the wavelet transform

IEEE TMI

(2003)

R.H. Davies et al.

3D statistical shape models using direct optimisation of description length

Essafi, S., Langs, G., Paragios, N., 2009. Hierarchical 3D diffusion wavelet shape priors. In: IEEE CVPR, pp....

H.F. Kaiser

The varimax criterion for analytic rotation in factor analysis

Psychometrika

(1958)

Cited by (19)

Artificial Intelligence in Cardiovascular Imaging: JACC State-of-the-Art Review
2019, Journal of the American College of Cardiology
Citation Excerpt :
As methods start to be tested in clinical practice, terminology is also developing around how AI can be used as a tool. Typically these uses include the use of a quantitative strategy to automatically generate measures (25), enabling notification devices to flag up particular problems, and as diagnostic support tools to generate recommendations and related information that can be used by a clinician to reach a conclusion. Fully automated diagnostic tools will ultimately provide medical opinions (12,26), but will also require extensive validation before regulatory approval.
Data science is likely to lead to major changes in cardiovascular imaging. Problems with timing, efficiency, and missed diagnoses occur at all stages of the imaging chain. The application of artificial intelligence (AI) is dependent on robust data; the application of appropriate computational approaches and tools; and validation of its clinical application to image segmentation, automated measurements, and eventually, automated diagnosis. AI may reduce cost and improve value at the stages of image acquisition, interpretation, and decision-making. Moreover, the precision now possible with cardiovascular imaging, combined with “big data” from the electronic health record and pathology, is likely to better characterize disease and personalize therapy. This review summarizes recent promising applications of AI in cardiology and cardiac imaging, which potentially add value to patient care.
SiSSR: Simultaneous subdivision surface registration for the quantification of cardiac function from computed tomography in canines
2018, Medical Image Analysis
Citation Excerpt :
The objective of our method is conceptually similar to Pourmorteza et al. (2012) in that we derive SQUEEZ from a series of registered meshes. However, rather than registering meshes using CPD, we adapt and extend a method which has been successfully used for cardiac modeling and segmentation in the context of 3-D echocardiography (Stebbing, 2014; Stebbing et al., 2015). In Stebbing (2014), a subdivision surface is registered to an arbitrary point set obtained using an off-the-shelf edge detector.
Recent improvements in cardiac computed tomography (CCT) allow for whole-heart functional studies to be acquired at low radiation dose (<2mSv) and high-temporal resolution (<100ms) in a single heart beat. Although the extraction of regional functional information from these images is of great clinical interest, there is a paucity of research into the quantification of regional function from CCT, contrasting with the large body of work in echocardiography and cardiac MR. Here we present the Simultaneous Subdivision Surface Registration (SiSSR) method: a fast, semi-automated image analysis pipeline for quantifying regional function from contrast-enhanced CCT. For each of thirteen adult male canines, we construct an anatomical reference mesh representing the left ventricular (LV) endocardium, obviating the need for a template mesh to be manually sculpted and initialized. We treat this generated mesh as a Loop subdivision surface, and adapt a technique previously described in the context of 3-D echocardiography to register these surfaces to the endocardium efficiently across all cardiac frames simultaneously. Although previous work performs the registration at a single resolution, we observe that subdivision surfaces naturally suggest a multiresolution approach, leading to faster convergence and avoiding local minima. We additionally make two notable changes to the cost function of the optimization, explicitly encouraging plausible biological motion and high mesh quality. Finally, we calculate an accepted functional metric for CCT from the registered surfaces, and compare our results to an alternate state-of-the-art CCT method.
Reflections on ultrasound image analysis
2016, Medical Image Analysis
Citation Excerpt :
Recent ultrasound based Challenges include CLUST15 (liver ultrasound tracking), CETUS 2014 (endocardial 3D ultrasound segmentation) and ChallengeUS 2012 (fetal biometry, Rueda et al., 2013). Since that time, the sophistication, in terms of the cardiac deformations that can modelled (Zhi et al., 2010), underlying segmentation and tracking algorithms (Schneider et al., 1999; de Craene et al., 2012), and speed of analysis have significantly advanced, and solutions are starting to be developed for the harder challenge of cardiac right ventricle analysis (Stebbing et al., 2015, see also Fig. 1b). Additionally, we have seen the successful emergence of well-validated model-based fully automated and assistive ultrasound quantification tools in clinical application areas which include 2D and 3D echocardiographic image quantification, quantification of intra-vascular ultrasound (IVUS), ovarian follicle counting, and fetal biometry, some of which have gone on to be embedded in commercial products.
Ultrasound (US) image analysis has advanced considerably in twenty years. Progress in ultrasound image analysis has always been fundamental to the advancement of image-guided interventions research due to the real-time acquisition capability of ultrasound and this has remained true over the two decades. But in quantitative ultrasound image analysis - which takes US images and turns them into more meaningful clinical information - thinking has perhaps more fundamentally changed. From roots as a poor cousin to Computed Tomography (CT) and Magnetic Resonance (MR) image analysis, both of which have richer anatomical definition and thus were better suited to the earlier eras of medical image analysis which were dominated by model-based methods, ultrasound image analysis has now entered an exciting new era, assisted by advances in machine learning and the growing clinical and commercial interest in employing low-cost portable ultrasound devices outside traditional hospital-based clinical settings. This short article provides a perspective on this change, and highlights some challenges ahead and potential opportunities in ultrasound image analysis which may both have high impact on healthcare delivery worldwide in the future but may also, perhaps, take the subject further away from CT and MR image analysis research with time.
Big Data in cardiac surgery: real world and perspectives
2022, Journal of Cardiothoracic Surgery
MODELING SINGLE VENTRICLE MORPHOLOGY WITH A HLHS-SPECIFIC BIVENTRICULAR TEMPLATE TO ENHANCE STATISTICAL SHAPE AND BIOMECHANICS ANALYSES
2022, ASME International Mechanical Engineering Congress and Exposition, Proceedings (IMECE)
AI and The Cardiologist-When Mind, Heart and Machine Unite
2022, Communications in Computer and Information Science

View all citing articles on Scopus

View full text

Data-driven shape parameterization for segmentation of the right ventricle from 3D+t echocardiography

Highlights

Abstract

Graphical abstract

Introduction

Section snippets

Method

Experiments

Single subject, multiple views

Conclusions

Acknowledgments

Ultrasound Med. Biol.

Image Vis. Comput.

Acad. Radiol.

ACM TOMS

Comput. Vis. Image Understand.

Med. Image Anal.

J. Comput. Phys.

Med. Image Anal.

Med. Image Anal.

An approximate minimum degree ordering algorithm

SIAM J. Matrix Anal. Appl.

B-spline explicit active surfaces: an efficient framework for real-time 3-D region-based segmentation

IEEE TIP

Automatic segmentation of echocardiographic sequences by active appearance motion models

IEEE TMI

Geodesic active contours

IJCV

What shape are dolphins? Building 3D morphable models from 2D images

IEEE TPAMI

Hierarchical active shape models, using the wavelet transform

IEEE TMI

3D statistical shape models using direct optimisation of description length

The varimax criterion for analytic rotation in factor analysis

Psychometrika