Data-driven shape parameterization for segmentation of the right ventricle from 3D+t echocardiography

https://doi.org/10.1016/j.media.2014.12.002Get rights and content

Highlights

  • The right ventricle is segmented jointly across multiple echocardiography sequences.

  • Segmentation is constrained by a linear basis shape model.

  • The linear basis shape model is optimized during segmentation.

  • The framework is applied to multiple-view and multiple-subject datasets.

Abstract

Model-based segmentation facilitates the accurate measurement of geometric properties of anatomy from ultrasound images. Regularization of the model surface is typically necessary due to the presence of noisy and incomplete boundaries. When simple regularizers are insufficient, linear basis shape models have been shown to be effective. However, for problems such as right ventricle (RV) segmentation from 3D+t echocardiography, where dense consistent landmarks and complete boundaries are absent, acquiring accurate training surfaces in dense correspondence is difficult.

As a solution, this paper presents a framework which performs joint segmentation of multiple 3D+t sequences while simultaneously optimizing an underlying linear basis shape model. In particular, the RV is represented as an explicit continuous surface, and segmentation of all frames is formulated as a single continuous energy minimization problem. Shape information is automatically shared between frames, missing boundaries are implicitly handled, and only coarse surface initializations are necessary.

The framework is demonstrated to successfully segment both multiple-view and multiple-subject collections of 3D+t echocardiography sequences, and the results confirm that the linear basis shape model is an effective model constraint. Furthermore, the framework is shown to achieve smaller segmentation errors than a state-of-art commercial semi-automatic RV segmentation package.

Introduction

Segmentation of the left ventricle (LV) and right ventricle (RV) from 3D+t echocardiography is an important task for quantifying cardiac function. In practice, this can be challenging because of missing anatomical boundaries and ambiguous edge features. 3D analysis of the LV has received most attention in literature, and model-based methods have been shown to be successful at LV segmentation from 3D echocardiography (Mitchell et al., 2002, Noble and Boukerroui, 2006). The RV literature is less developed, in part as the shape is harder to model, but also because RV ultrasound (US) scans are generally more incomplete due to the relative position of the RV with respect to the transthoracic probe.

Commonly, a surface representing the endocardium is deformed frame-to-frame so that it adheres to the blood-tissue boundary (e.g. Orderud et al., 2007). Specifically, the surface is represented either explicitly or implicitly and an energy minimization problem is formulated which captures the notion of “fitting” the surface to the blood-tissue boundary subject to priors or regularization. An explicit representation directly models the geometry of the segmentation surface (Blake and Isard, 2000). Examples include point distribution models (Cootes et al., 1995), truncated ellipsoids (Orderud, 2006), spline and subdivision surfaces (Piegl and Tiller, 1995, Stam, 1998), and B-spline Explicit Active Surfaces (BEAS) (Barbosa et al., 2012). Implicit representations model the segmentation surface as the level-set of a higher-dimensional function over the image domain (Caselles et al., 1997, Osher and Sethian, 1988).

Implicit representations of the LV and RV have the advantage that they allow for more complex appearance models because they naturally define interior and exterior segmentation regions (Lankton and Tannenbaum, 2008, Huang et al., 2014). The primary drawback of implicit representations, and level sets in particular, is that direct optimization over the entire function is slow. Level sets were made practical by the development of fast marching and narrowband methods, which only update the level set function near the implicit interface boundary or zero level set (Lankton, 2009, Sethian, 1999, Whitaker, 1998). Common to these approaches is that a first-order gradient-based evolution equation is derived from the energy and used to update the level set function. Since the surface fit is specified directly against image intensities—a result of the more complex appearance model—an accurate initialization is necessary so that the first-order optimization does not converge to an unwanted local minimum.

Like level set methods, the surface fit for explicit representations can also be formulated directly against the image (e.g. Kass et al., 1988), but specification of interior and exterior regions is, with the exception of BEAS (Barbosa et al., 2013), typically more difficult. However, if a discrete set of boundary candidates is available—detected independently or based on the current surface state—then the surface fit can instead be defined as the distance between these points and their corresponding points on the explicit surface. In this case, explicit representations are more amenable to non-linear continuous optimization algorithms more powerful than gradient descent.

To handle missing boundaries, regularization is used to constrain the segmentation surface to be smooth and physically plausible. Simple spatio-temporal regularizers are physically motivated and straightforward to implement for explicit or implicit surface representations. However, for large boundary gaps, such as those encountered when acquiring 3D+t US sequences of the RV, the simple interpolating action of these regularizers is insufficient. Instead, shape models can be used to constrain the model surface.

In medical image analysis, point distribution models and triangle meshes have been popular model surface representations. For these definitions, linear basis shape models have been constructed using Principal Component Analysis (PCA) on a set of training surfaces which have been aligned semi-automatically (Bosch et al., 2002, Cootes et al., 1995). The advantage of these Active Shape Models (ASMs) is that the dimensionality of the model surface is reduced. The disadvantage is that the resulting parameterization can be too restrictive and prevent modeling of local deformation unseen in the training examples.

To remedy this, hierarchical ASMs have been proposed which recover scale- and location-specific linear basis shape models. In Davatzikos et al. (2003), this is achieved in 2D by first computing wavelet coefficients of the x- and y-components of the training model contours. The coefficients are then partitioned into bands based on scale and spatial location, and PCA is applied to each. In 3D, spherical wavelets with adaptively selected bands (Nain et al., 2005, Nain et al., 2006), Catmull-Clark subdivision wavelets with fixed bands (Li et al., 2007), and diffusion wavelets (Essafi et al., 2009) with orthomax PCA (Kaiser, 1958, Stegmann et al., 2006), have been proposed. Each of these algorithms produces a linear basis shape model which enables “legal” deformations of the dense model points to be specified with a small number of parameters, where each parameter (by design) controls local shape deformation only.

A key challenge when constructing any of the aforementioned shape models is acquiring accurate training surfaces which are in dense correspondence. Automatically detecting correspondences based on shape features (e.g. positions of high curvature) has been proposed (Brett and Taylor, 2000, Wang et al., 2000), but this is difficult for the RV because of the absence of dense consistent landmarks across the surface, complete boundaries, and increased shape variability (Caudron et al., 2012, Petitjean and Dacher, 2011). Furthermore, while training surfaces are rigidly aligned using Procrustes analysis (Bosch et al., 2002), local incorrect correspondences can remain which introduce artificial shape variation into the model. Therefore, it is necessary to model the parameterization differences between the dense points of the training surfaces. In Davies et al. (2002), this is achieved by mapping each surface to the unit sphere and optimizing predefined parameter transformations for each training surface to construct the ASM of minimum description length (Rissanen, 1983).

In this paper, a framework is described which performs joint 3D segmentation of the RV from multiple 3D+t echocardiography sequences, while simultaneously optimizing all correspondences and an underlying linear basis shape model (Fig. 1). This framework is a modification of Cashman and Fitzgibbon (2013)—where 3D linear basis shape models of animals are learned from 2D exterior silhouettes—to 3D segmentation of multiple-subject and multiple-view collections of 3D+t echocardiography sequences. The key differences are:

  • In Cashman and Fitzgibbon (2013), the exterior silhouettes of animals are recovered by semi-automatic segmentation. Therefore, all boundary candidates are valid and ordered, and dynamic programming is used to initialize the boundary candidate correspondences (preimages). Here, boundary candidates are derived using a simple edge detector—only a subset are valid. Furthermore, the boundary candidate positions are noisy and missing boundaries are common. Therefore, a “model-to-data” approach is adopted: the boundary candidate correspondences are initialized by sampling the model surface uniformly, and boundary candidates are subsequently selected based on the current model surface geometry. Robust fitting terms are also used.

  • In Cashman and Fitzgibbon (2013), independent rigid transformations and shape parameters are modeled for each frame. Here, scales are introduced for each subject and rigid transformations are introduced for each sequence. Shape similarity is also enforced between frames which are of the same subject at the same point in the cardiac cycle.

Conceptually, the proposed framework is also similar to Zhou et al. (2013), where in 2D, explicit model contours are simultaneously fitted to multiple images in a sequence and constrained to be of similar shape. In Zhou et al. (2013), shape similarity is achieved by minimizing the nuclear norm of the matrix composed of the x- and y-components of all model contours. Here, shape similar is enforced explicitly using a linear basis shape model.

As will be shown, the described framework is suitable for the proposed application for three reasons. First, a Loop subdivision surface is used for the model surface, which by construction, has a small number of parameters but is flexible and can realize local shape deformations. Second, joint optimization of all continuous parameters—including the linear basis shape model control vertices, rigid transformations, and boundary candidate correspondences—mitigates the requirement for any registration or fusion of the input images (e.g. Rajpoot et al., 2011), or accurate training surfaces that are in dense correspondence. Third, it naturally handles missing boundaries.

The structure of the paper is as follows. In Section 2 the complete model energy and optimization algorithm for our framework is presented. In Section 3 and Section 4 we then demonstrate the application to two problems which are hard to solve using prior approaches. First, we show how multiple 3D+t sequences acquired from different viewpoints for a single subject can be segmented jointly while optimizing a subject-specific shape model. Second, we show how multiple 3D+t sequences acquired from multiple subjects—with potentially different viewpoints—can be jointly segmented. Conclusions are given in Section 5.

Section snippets

Method

In this article we denote matrices with uppercase letters (X) and vectors with bold-face lowercase letters (x). Column vectors from matrices are denoted by indexed bold-face letters. For example, xi is column i of the matrix X. Similarly, the jth element of a vector x is denoted by xj. Cursive uppercase letters (X) denote sets.

The input to our framework is a collection of 3D+t sequences from one or more subjects. The 3D echocardiogram of the kth frame of the jth sequence for the ith subject is

Experiments

Experiments were performed to demonstrate the application of our framework for different use cases, to assess its overall segmentation performance, and to determine the usefulness of the underlying linear basis shape model. The datasets used for test and validation are described in the next subsection and the details of each use case follow.

Single subject, multiple views

Example slices of echocardiography frames segmented using our framework (Linear Basis Shape RV, LBSRV) and the thin-plate regularization baseline (BL) for SSMV are shown in Fig. 6. When all boundaries are available (Fig. 6(a)), the segmentation surfaces for both methods reasonably delineate the RV. However, for a temporally aligned frame from a different view (Fig. 6(b)), LBSRV implicitly utilizes the information from the other views which plausibly interpolates the missing boundary.

Conclusions

In this article, a framework to perform model-based segmentation of multiple 3D+t sequences while jointly optimizing an underlying linear basis shape model has been described.

The framework was motivated by difficulties specific to RV segmentation from 3D+t echocardiography. Specifically, large regions of missing boundary candidates are common when imaging the RV due to the relative position of the RV with respect to the transthoracic US probe. Simple model surface regularizers are incapable of

Acknowledgments

The first author would like to thank the Rhodes Trust for funding this research. Data acquired on EPSRC Grant EP/G030693/1 was used in this research.

References (41)

  • P.R. Amestoy et al.

    An approximate minimum degree ordering algorithm

    SIAM J. Matrix Anal. Appl.

    (1996)
  • D. Barbosa et al.

    B-spline explicit active surfaces: an efficient framework for real-time 3-D region-based segmentation

    IEEE TIP

    (2012)
  • A. Blake et al.
    (2000)
  • J.G. Bosch et al.

    Automatic segmentation of echocardiographic sequences by active appearance motion models

    IEEE TMI

    (2002)
  • V. Caselles et al.

    Geodesic active contours

    IJCV

    (1997)
  • T.J. Cashman et al.

    What shape are dolphins? Building 3D morphable models from 2D images

    IEEE TPAMI

    (2013)
  • C. Davatzikos et al.

    Hierarchical active shape models, using the wavelet transform

    IEEE TMI

    (2003)
  • R.H. Davies et al.

    3D statistical shape models using direct optimisation of description length

  • Essafi, S., Langs, G., Paragios, N., 2009. Hierarchical 3D diffusion wavelet shape priors. In: IEEE CVPR, pp....
  • H.F. Kaiser

    The varimax criterion for analytic rotation in factor analysis

    Psychometrika

    (1958)
  • Cited by (19)

    • Artificial Intelligence in Cardiovascular Imaging: JACC State-of-the-Art Review

      2019, Journal of the American College of Cardiology
      Citation Excerpt :

      As methods start to be tested in clinical practice, terminology is also developing around how AI can be used as a tool. Typically these uses include the use of a quantitative strategy to automatically generate measures (25), enabling notification devices to flag up particular problems, and as diagnostic support tools to generate recommendations and related information that can be used by a clinician to reach a conclusion. Fully automated diagnostic tools will ultimately provide medical opinions (12,26), but will also require extensive validation before regulatory approval.

    • SiSSR: Simultaneous subdivision surface registration for the quantification of cardiac function from computed tomography in canines

      2018, Medical Image Analysis
      Citation Excerpt :

      The objective of our method is conceptually similar to Pourmorteza et al. (2012) in that we derive SQUEEZ from a series of registered meshes. However, rather than registering meshes using CPD, we adapt and extend a method which has been successfully used for cardiac modeling and segmentation in the context of 3-D echocardiography (Stebbing, 2014; Stebbing et al., 2015). In Stebbing (2014), a subdivision surface is registered to an arbitrary point set obtained using an off-the-shelf edge detector.

    • Reflections on ultrasound image analysis

      2016, Medical Image Analysis
      Citation Excerpt :

      Recent ultrasound based Challenges include CLUST15 (liver ultrasound tracking), CETUS 2014 (endocardial 3D ultrasound segmentation) and ChallengeUS 2012 (fetal biometry, Rueda et al., 2013). Since that time, the sophistication, in terms of the cardiac deformations that can modelled (Zhi et al., 2010), underlying segmentation and tracking algorithms (Schneider et al., 1999; de Craene et al., 2012), and speed of analysis have significantly advanced, and solutions are starting to be developed for the harder challenge of cardiac right ventricle analysis (Stebbing et al., 2015, see also Fig. 1b). Additionally, we have seen the successful emergence of well-validated model-based fully automated and assistive ultrasound quantification tools in clinical application areas which include 2D and 3D echocardiographic image quantification, quantification of intra-vascular ultrasound (IVUS), ovarian follicle counting, and fetal biometry, some of which have gone on to be embedded in commercial products.

    • Big Data in cardiac surgery: real world and perspectives

      2022, Journal of Cardiothoracic Surgery
    • MODELING SINGLE VENTRICLE MORPHOLOGY WITH A HLHS-SPECIFIC BIVENTRICULAR TEMPLATE TO ENHANCE STATISTICAL SHAPE AND BIOMECHANICS ANALYSES

      2022, ASME International Mechanical Engineering Congress and Exposition, Proceedings (IMECE)
    • AI and The Cardiologist-When Mind, Heart and Machine Unite

      2022, Communications in Computer and Information Science
    View all citing articles on Scopus
    View full text