Literature DB >> 23845949

Endoluminal surface registration for CT colonography using haustral fold matching.

Thomas Hampshire1, Holger R Roth, Emma Helbren, Andrew Plumb, Darren Boone, Greg Slabaugh, Steve Halligan, David J Hawkes.   

Abstract

Computed Tomographic (CT) colonography is a technique used for the detection of bowel cancer or potentially precancerous polyps. The procedure is performed routinely with the patient both prone and supine to differentiate fixed colonic pathology from mobile faecal residue. Matching corresponding locations is difficult and time consuming for radiologists due to colonic deformations that occur during patient repositioning. We propose a novel method to establish correspondence between the two acquisitions automatically. The problem is first simplified by detecting haustral folds using a graph cut method applied to a curvature-based metric applied to a surface mesh generated from segmentation of the colonic lumen. A virtual camera is used to create a set of images that provide a metric for matching pairs of folds between the prone and supine acquisitions. Image patches are generated at the fold positions using depth map renderings of the endoluminal surface and optimised by performing a virtual camera registration over a restricted set of degrees of freedom. The intensity difference between image pairs, along with additional neighbourhood information to enforce geometric constraints over a 2D parameterisation of the 3D space, are used as unary and pair-wise costs respectively, and included in a Markov Random Field (MRF) model to estimate the maximum a posteriori fold labelling assignment. The method achieved fold matching accuracy of 96.0% and 96.1% in patient cases with and without local colonic collapse. Moreover, it improved upon an existing surface-based registration algorithm by providing an initialisation. The set of landmark correspondences is used to non-rigidly transform a 2D source image derived from a conformal mapping process on the 3D endoluminal surface mesh. This achieves full surface correspondence between prone and supine views and can be further refined with an intensity based registration showing a statistically significant improvement (p<0.001), and decreasing mean error from 11.9 mm to 6.0 mm measured at 1743 reference points from 17 CTC datasets.
Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  CT colonography; Haustral fold; Landmark; Markov random field; Registration

Mesh:

Year:  2013        PMID: 23845949      PMCID: PMC3807796          DOI: 10.1016/j.media.2013.04.006

Source DB:  PubMed          Journal:  Med Image Anal        ISSN: 1361-8415            Impact factor:   8.545


Introduction

Motivation

Colorectal cancer is a leading cause of cancer mortality with 1.23 million individuals developing the disease and 608,000 deaths annually (World Health Organisation, 2009). Population screening offers the best prospects to reduce in mortality and aims to prevent the development of advanced cancers by early detection and removal of both localised cancers or premalignant adenomas, from which more than 80% of cancers are thought to arise (Cunningham et al., 2010). Optical colonoscopy is the current gold standard method to inspect the whole-colon; however colonoscopy is time consuming and uncomfortable for the patient, and is occasionally associated with serious complications (Pignone and Sox, 2008) such as colonic perforation. Computed tomographic colonography (CTC) is now widely considered the preferred radiological technique for detecting cancer and polyps, and has comparable sensitivity to optical colonoscopy (Johnson et al., 2008), while being more acceptable to patients (von Wagner et al., 2012) and relatively safe (Burling et al., 2006). Patients undergo a full bowel preparation to cleanse the colon, which is then insufflated with gas immediately before helical CT imaging of the abdomen and pelvis (gas insufflation maximises attenuation contrast between endoluminal surface and intraluminal space). Graphics rendering software is used to generate high resolution ‘virtual colonoscopy’ three-dimensional images of the endoluminal surface, simulating those obtained using conventional colonoscopy. The procedure is performed routinely with the patient both prone and supine to redistribute gas and residue within the colon (Taylor et al., 2007). This helps differentiate fixed colonic pathology from mobile faecal residue because abnormalities whose position remains fixed in both acquisitions are likely to be true polyps. Also, using two data acquisitions increase the chance of discovering pathology occluded by retained fluid or hidden by luminal collapse. Matching corresponding locations between prone and supine endoluminal colonic surfaces is therefore an essential aspect of interpretation by radiologists. However, interpretation can be difficult and time-consuming due to the considerable colonic deformations that often occur during repositioning of the patient (Punwani et al., 2009). These deformations can induce diagnostic error and increase interpretation time. Hence, a method for automatic registration of prone and supine datasets has the potential to improve efficiency and diagnostic accuracy.

Related work

A number of methods have been proposed to find correspondence between prone and supine positions. For example, centreline-based methods extract and align colonic centrelines by stretching and shrinking based on path geometries (Li et al., 2004; Wang et al., 2009). However, these methods are inherently restricted to achieving a registration along a single dimension and do not give any information about the degree of torsion of the colon wall between views. Anatomical landmarks can be used to help align the two datasets by first identifying a stable set of anatomical features, such as the caecum, rectum and flexures (Wang et al., 2009; Näppi et al., 2005). However, these do not account for the varying compression and expansion along the length of the colon nor the degree of colonic torsion between acquisitions. Voxel-based methods provide a further possibility for registration (Suh and Wyatt, 2009). However, these methods rely to varying extents upon continuous prone and supine colonic segmentations, free from luminal occlusion by fluid or collapse; a situation achieved infrequently in clinical practice, despite optimal bowel preparation (Taylor et al., 2003). Fukano et al. (2010) proposed a registration method based on haustral fold matching. A second-order derivative difference filter was used to extract folds; their volume and relative positions along the centreline with respect to a set of locations of high centreline curvature were then used to establish correspondence. The method relied on prior automatic identification of a set of landmark locations for registration. They reported correct registration of 65.1% of large folds (9.3% could not be judged), and 13.3% of small folds (32.7% could not be judged). Methods that involve conformal mapping of the endoluminal colonic surface have been proposed in order to reduce the complexity of the three-dimensional task. For example, Zeng et al. (2010a) combined conformal mapping with feature matching between the prone and supine endoluminal surfaces. The prone and supine colonic segmentations were mapped onto five rectangle pairs. Correspondences were established using a feature matching method based upon mean curvature. The method relied on accurately determining five matching segments in the prone and supine datasets, which is difficult to achieve and may be impossible with local endoluminal collapse. The method proposed by Roth et al. (2011), aims to overcome these limitations by mapping the entire endoluminal surface to a cylinder. Dense surface correspondence was then achieved by a conformal mapping of the prone and supine endoluminal surfaces to 2D cylindrical domains using Ricci flow (Jin et al., 2008; Zeng et al., 2010b), followed by a non-rigid cylindrical intensity based registration using a B-spline method (Rueckert et al., 1999) with a sum-of-squared-differences similarity metric based on shape index (SI) (Koenderink, 1990). However, this method can be susceptible to mis-registration of contiguous sections by one or two haustral folds due to the repetitive similarity of neighbouring features and may not achieve sufficient accuracy if the prone and supine acquisition differ considerably in terms of distension or endoluminal collapse. Most recently, Wang et al. (2012) used a graph matching algorithm to register a set of features extracted from colon segmentations. A set of key points are detected by the n-SIFT algorithm and matched using an algorithm based on mean field theory. The main advantage of the method is that definition of a colon centreline is unnecessary; however reported mean registration error was 37.6 mm. We propose a novel method to establish correspondence between the two acquisitions automatically. Previous methods attempt to match corresponding haustral folds based on spatial location and size alone, e.g. (Fukano et al., 2010; Zeng et al., 2010a). We use a novel fold-matching metric, based on depth map images of the endoluminal surface at fold positions as well as local geometric information. The problem is first simplified by detecting haustral folds using a graph cut method applied to a surface mesh generated from the segmentation of the colonic lumen. A virtual camera is used to create a set of images which provide a metric for matching pairs of folds between the prone and supine acquisitions. Image patches are generated at the fold positions using an endoluminal surface mesh depth map rendering and optimised by performing a virtual camera registration over a restricted set of degrees of freedom. The intensity difference between image pairs, along with additional neighbourhood information to enforce geometric constraints over a 2D parameterisation of the 3D space, are used as unary and pair-wise costs respectively, and included in a Markov Random Field (MRF) model to estimate the maximum a posteriori fold labelling assignment. This new haustral fold matching method is the principal contribution of the paper and is described in detail in Section 2. A new initialisation method is also introduced. First the sparse positions and displacements of the corresponding fold matches are mapped onto a 2D domain created by performing a conformal mapping using the Ricci flow algorithm (Jin et al., 2008; Zeng et al., 2010b), to construct an underlying function based on multilevel B-splines that can be evaluated at any point to give a transformation from the prone to the supine data. This transformation is refined further by the intensity based registration in Roth et al. (2011). This method explicitly addresses the problem of colonic collapse. Full details of the full surface-based registration and initialisation methods are provided in Section 3.

Haustral fold matching

Haustral fold segmentation

Haustral folds are elongated, ridgelike endoluminal structures and can be identified by extracting curvature measurements from a triangular mesh representation of the colonic wall. The maximum and minimum values of the normal curvature at a point are called the principal curvatures, k1 and k2 respectively. A metric based on the principal curvatures is used to perform a binary classification of each vertex as fold, or non-fold:This recognises that at a fold, one expects k1 ≫ 0 and k2 ≈ 0. The γ parameter penalises the metric against curvature in any direction other than in the maximum, helping to separate the folds at the tenaie coli. The surface mesh is first simplified using an edge collapse transformation process (Hoppe, 1999) to a resolution of ∼0.2 polygons/mm2, at which level the haustal folds are still clearly visible. Following this, the mesh is treated as a graph, with graph nodes defined by the mesh vertices and graph edges defined by the mesh edges. Using a virtual sink and source with the given weighting, a graph cut segmentation (Boykov and Kolmogorov, 2004) is performed which minimises the energy function:where f and f are the binary labels of neighbouring nodes p and q, corresponding to fold or non-fold, and N is the neighbour set of p defined by the directly connected vertices. The function E is defined as:and δ is a Potts energy function smoothing term: This model considers only the equality or inequality of labels and captures the assumption that labelling should be piecewise constant. This results in a label assignment of fold or non-fold over the entire surface mesh. The centre of each fold is taken as the vertex with the shortest maximum distance to any vertex lying on the border of the segmented region. Qualitative visual results are promising (Fig. 1).
Fig. 1

Virtual colonoscopy (left), external (right) and internal (bottom) views of segmented haustral folds with marked centres. Red and blue sections represent fold and non-fold labelled vertices respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Markov random field modelling

The matching of prone and supine haustral folds is formulated as a labelling problem. First, m haustral folds are detected in the supine data and these are uniquely labelled; the objective is to then assign labels to the detected prone folds, achieved by solving an MRF defined as such: A set of sites S = {1, …, n} corresponding to the haustral folds in the prone data set. A neighbourhood system N = {N∣∀i ∈ S}, defining the extent of local connections between sites. A pair-wise clique defined on N and S, C2 = {{i, i′}∣i ∈ S, i′ ∈ N}, allowing the inclusion of a priori knowledge of geometric dependencies between label configurations. A set of random variables F = {F1, …, F} taking on a discrete label f = {f1, …, f} taken from the set of haustral folds identified in the supine data set. The maximum a posteriori (MAP) estimate of the optimum labelling is computed, which is equivalent to minimising the energy function:where Θ(f) is the unary term, a cost function for assigning the label f to site i. Similarly, the pair-wise term Θ(f, f) is the cost for assigning neighbouring sites i and j labels f and f.

Unary cost function

The aim is to calculate an n × m unary cost matrix, where n = ∥S∥ is the number of sites or prone folds, and m is the number of labels or supine folds. To calculate the cost Θ(f), depth map images are rendered at the fold positions, visualising the internal colon wall (Fig. 2). The depth map images provide a description of the colon surface at the position of haustral folds, as well as the configuration of colonic pathology in the local region. The perspective projections provide a degree of invariance to colon wall deformation due to patient repositioning, whilst also allowing for a direct comparison between images. The images are generated by exploiting the Z-buffer from the graphics rendering model. Normally this buffer contains values that relate to the distance of an object from the camera position and is used to specify the order that polygons should be rendered in order to generate an image of the scene. Using a perspective camera model, the z buffer value z′ is a value in the range [−1, 1] specified in terms of the distances of near and far clipping planes of the viewing frustum with respect to the real object distance z (Lengyel, 2004):This can be rearranged in terms of z to retrieve true depth values:The resulting images are then compared using a mean-squared-error (MSE) metric. An optimisation over the external parameters of the virtual camera used to visualise the supine dataset accounts for any inaccuracies in the fold point identification. Restricting the number of degrees of freedom of camera search ensures that the camera focus remains on the correct fold. The degrees of freedom (see Fig. 3) are as follows:
Fig. 2

Internal views of rendered haustral fold in the prone (a) and supine (c) view, with their corresponding depth map images (b) and (d).

Fig. 3

Virtual camera with restricted set of degrees of freedom for optimisation. Image shows view along (left) and perpendicular to (right) the axis of the colon.

Elevation (θ) – the fold centre and camera right vector give a position and axis about which the camera is rotated Roll (ϕ) – rotation around the camera view direction Dollying (τ) – translation along the camera right vector Zoom (η) – the distance from camera position to fold centre Given the four parameters θ, ϕ, τ, η; the optimisation finds the local minimum in a MSE between the depth map images I1 and I2 using Powell’s gradient descent method (Fletcher and Powell, 1963). The camera position is initialised to the closest intra-luminal centreline point to the fold with the camera ‘up’ vector set to the tangent to the centreline curve and viewpoint centred on the fold. A multi-resolution approach is used to increase the chance of the optimisation converging on the correct minima. Adding a scaling parameter W allows the weighting of unary to pair-wise costs:Additionally, a constraint is added so that the matching folds must lie in a similar region. The normalised fold centreline positions in the prone and supine , are used to limit corresponding fold matches to a fraction ω of the total colon centreline length. Finally, a constant unary cost α is associated with the assignment of the null label f0 to any given node, allowing for missing labels. The unary costs are then defined:where min (R(I1, ·)) is the minimum cost for depth map I1 over all depth maps.

Pair-wise cost function

To improve labelling performance, geometric information about neighbouring fold positions can be used. In this work an adapted frame (t, u, v) on the centreline curve r(ξ) is used. An adapted frame is a set of orthonormal vectors, where t is the unit tangent and u and v span the curve normal plane. The most familiar case of an adapted frame is the Frenet frame which defines u and v as the curve normal n and binormal b respectively:Unfortunately, the Frenet frame suffers from indeterminacies at inflexions where r″ is parallel to r′, or vanishes, and therefore n and b are undefined. As such, the rotation of such a frame about the tangent of a general curve often leads to undesirable twists in frame orientation (Fig. 4). In this work a Rotation Minimising Frame (RMF), specifically the double inflexion method in Wang and Joe (1997) is employed to define a moving frame that does not rotate about the instantaneous tangent of the centreline curve r(ξ) (Fig. 4) by minimising the global error E of the magnitude of the angle between the reference vectors of frames U and U:The resulting set of reference frames can then be use to describe the relative position of each fold to its neighbours:  = [ν, ν]; where ν is the difference in fold position along the centreline and ν = [π/2, −π/2] is the difference in angle of rotation around the centreline with respect to the RMF. For each fold position p, the corresponding centreline position c can be found with a surface to centreline correspondence method. We use the conformal mapping technique described in (Roth et al., 2010). A degree of rotation θ can be defined by interpolating a RMF corresponding to centreline position c, and projecting the vector from centreline position to fold p onto the curve normal plane defined by u and v (Fig. 5). The angle θ between this vector p′ and the RMF normal u then gives a relative degree of rotation with respect to the RMF and can be used to compare neighbouring folds.
Fig. 4

Parameterisation based on Frenet-Serret (left) and Rotation Minimising (right) frames.

Fig. 5

Calculation of relative angle of rotation of haustral fold position with respect to an adapted frame.

This 2D parameterisation simplifies the description of the translation between corresponding pairs of folds between the prone and supine as the centreline ν and rotational ν displacement should be similar ( ≈ ). Alternatively we can state  =  + ∊, where ∊ represents some uncertainty, and can be modelled with a zero mean bivariate normal distribution , with . Finally we recognise that the position of a neighbouring site becomes more uncertain as the displacement along the centreline increases. In order to model this uncertainty we look at the distribution of stretching and rotation of corresponding fold pairs between the prone and supine views, with respect to the centreline distance between the two folds. By observing the variance in centreline and rotational (Fig. 6) displacement over a set of training data (described in detail in Section 4.2), a second order polynomial function can be fitted to model the error:With this information, a pair-wise cost for assigning neighbouring sites i and j label configurations f and f, is defined by the negative log-normal distribution:As the angle element of vector ( − ) can only be in the range [π/2, −π/2], ±π is added until this requirement is met. A local neighbourhood system is defined in order to enforce local geometric constraints on neighbouring fold positions. The local neighbourhood of a site is set to be:where k is a threshold distance. Pairs of sites that are separated by a local colonic collapse are removed from the neighbourhood set. A uniqueness constraint is also enforced so any two sites may not be assigned the same label. This is included in the pair-wise cost function by connecting each site with every other site in a global neighbourhood system:and defining the pair-wise cost of assigning the same label to two different nodes to be infinity, except in the case of a null label assignment. The full pair-wise cost function is:
Fig. 6

Distance between folds against variance in centreline (left) and rotational displacement (right) between corresponding fold pairs.

MRF inference

Submodularity influences the choice of inference algorithm that may be applied to the problem of generating the optimal labelling. A submodular energy function is one which, for all labels, satisfies:which must hold for all labels f, f, f ∈ f. The uniqueness constraint on the pair-wise costs means the problem of solving the MRF is non-submodular as Θ(f, f) = ∞, which restricts the possible algorithm choice for MAP inference. The min-sum Belief Propagation (BP) algorithm is suitable for this purpose (Weiss and Freeman, 2002). It is known that BP is exact on acyclic tree-like graphical models, but has been shown to give a good MAP estimate in graphs with loops. The BP algorithm works by passing messages between nodes of a graph defined by the set of sites S, with edges defined by the site neighbourhoods N. Each message M is an i dimensional vector, with i equal to the number of possible labels. At each iteration at time t, every node sends messages to each of its neighbours in parallel, whilst also receiving messages itself. Let be the message that node p passes to node q at iteration t. All entries in are initialised to zero. At each iteration, new messages are computed as follows:where N⧹j denotes all neighbours of i other than j. After T iterations, the belief vector for each node may be computed:The belief vector b(f) expresses the negative cost of assigning each label f to site j. The algorithm will converge on a solution if the system reaches a state where the messages stabilise; however, this is not guaranteed in a graph with loops as the system may flip between a pair of states in each iteration. Convergence can be aided with the use of message damping, where the actual messages at time t are computed:Once the algorithm has terminated, each node is assigned the label having the maximum belief:

Inverse consistency

In the methods presented, the algorithm is configured with the haustral folds in the prone view as the set of nodes in the MRF, and the haustral folds in the supine view as the labels (or vice versa). The algorithm is run in both directions and the intersection of the labelling results is used. For each node where null labels are present in both directions or the labelling in one direction differs from the labelling in the other direction, the null label is assigned. This results in an increase in the accuracy of the fold labelling at the expense of an increase in null label assignments.

Parameter training

The parameters to model the error in the pair-wise cost function are derived from the statistical analysis of reference standard fold matches in the training data described in Section 4.2. W, k, α, β and τ; the weighting of unary to pair-wise cost functions, node neighbourhood threshold, costs for assigning the null label in the unary and pair-wise cost function, and the message damping parameter respectively, are found using a gradient ascent optimisation. Parameters are trained on separate datasets to those used for validation and are listed below.

Surface based registration

There are scenarios where obtaining a one-to-one surface correspondence is required, such as to locate a possible polyp position in both the prone and supine views. In this case, the results of this fold matching method can be used to provide automated initialization for a surface-based registration technique (Roth et al., 2011). The intensity-based registration (IBR) algorithm proposed by Roth et al. (2011) recognises that the colon is topologically cylindrical and reduces the complexity of the registration by mapping each point on the endoluminal surface onto a cylindrical representation with the use of a conformal mapping technique (Hamilton, 1982). This allows the registration to account for the large 3D deformations between the prone and supine views as a more simple 2D cylindrical deformation. The registration is then represented as a transformation between the two cylinders and includes non-linear stretching along the cylinder, and local torsion and rotation. A shape index metric is calculated at each point of the cylindrical image and used to drive a B-spline intensity based registration (Rueckert et al., 1999) in a cylindrical domain to achieve correspondence between the two views. The following different methods are compared: Linear Scaling Initialisation with Intensity Based Registration (LSI w/ IBR): We use the locations of the hepatic and splenic flexures as in Roth et al. (2011). These are automatically detected based on local maxima of the z-coordinate of the centreline. The flexure positions may be discarded if their centreline distance vary by more than a certain threshold. The positions of found flexure are mapped onto the cylindrical images and used to provide a linear scaling of prone image in the direction of the centreline. This is used as an initialisation to the intensity-based registration in Roth et al. (2011) B-Spline Initialisation without Intensity Based Registration (BSI w/o IBR): The positions of the detected haustral folds and the corresponding locations in the prone and supine views are used to perform a multilevel B-spline point based deformation of the prone image (see Section 3.1). B-Spline Initialisation with Intensity Based Registration (BSI w/ IBR): The same as above, but used as an initialisation to the intensity-based registration. To determine the registration error, each reference standard point is transformed from one dataset to the other using the registration result, and the 3D Euclidean distance between this and the corresponding reference standard point is measured.

B-Spline Initialisation (BSI)

We wish to approximate a smooth function f which relates the (x, y) points in the prone unfolded image, to their (x′, y′) positions in the supine image over domain Ω = (x, y)∣0 ⩽ x < m,0 ⩽ y < n. To do this we use the set of folds P = {(x, y, v)} where (x, y) is a point in Ω and v is the value at (x, y). We define the mapping in terms of two functions: x′ = X(x, y) and y′ = Y(x, y). As these functions can be derived simultaneously, we use the notation v = (x′, y′) = f(x, y). To approximate the data P, we use function f as a uniform bicubic B-spline, defined by control lattice Φ overlaid on domain Ω using the method in Lee et al. (1997). We also assume Φ is an (m + 3) × (n + 3) lattice, where m and n are the image dimensions defined in lattice control points. We define ϕ as the value of ij-th control point on lattice Ω for i = −1, 0, …, m + 1 and j = −1, 0, …, n + 1. We can then define the approximation function f:where i = ⌊x⌋ − 1, j = ⌊y⌋ − 1, s = x − ⌊x⌋, and t = y − ⌊y⌋. B and B are basis functions:where 0 ⩽ t < 1. For every point in P = {(x, y, v)} a different value ϕ of each of the control points ϕ is defined:where w = w = B(s)B(t), k = (i + 1) − ⌊x⌋, l = (j + 1) − ⌊y⌋, s = x − ⌊x⌋, t = y − ⌊y⌋. Only data points in the 4 × 4 neighbourhood of each control point are taken into consideration. To choose a value for each ϕ from the contributions from each point ϕ the error is minimised by differentiating e(ϕ) with respect to ϕ giving:To allow for a smooth function over the entire domain and more accurate local deformations, a multilevel B-spline approximation is used to generate a hierarchy of control lattices from coarse to fine. A refinement process is used to reduce the sum of these functions into one B-spline function. For each level of control lattice Ψ we can derive a finer control lattice such that and where f0(x, y) = (x, y). We then derive control lattice Φ to approximate data P = {(x, y, Δv)}, where , and Δ0v = v. Each function defined by control lattice Φ serves to remove the residual error from the refined coarser lattice at each level. We can now define a progressive control lattice from the coarsest to finest levels. As the end points of the colon are fixed, the image should not be transformed in the direction of the x-axis at the x-axis extrema, so at each refinement we calculate the residual error Φ and set the x-component of the data Δx′ = 0 at points {ϕ∣i = −1, 0, m, m + 1; j = −1, 0, …, n + 1}. We apply this technique to the images created by a conformal mapping (Roth et al., 2011) of the endoluminal surfaces of prone and supine images onto a rectangular domain. The sparse set of data points P = {(x, y, v)} have their positional information {(x, y)} taken from the positions of haustral folds mapped onto the 2D domain, and the vertical and horizontal displaced positions of the corresponding positions in the supine image. To allow for a pseudo-continuous function over the y-axis, the image is repeated and stacked over the y-axis. Due to the true cylindrical nature of the registration problem, there is an ambiguity over the direction of vertical displacement in the 2D images, resulting from the periodicity of the boundary in the vertical dimension. To create a smooth displacement, the B-spline fitting is repeated and at each iteration the datum P with the maximum error between the y component of the estimated and true displacement e = ∣(F(x, y) − v)∣ is adjusted such that where y is the size of the image in the y-direction. The image is then shifted in the y-direction so as to minimise v and the full multi-level B-spline fitting is repeated to give the final function F(Ψ) with control lattice Ψ. Now for every position in the prone image P = {(x, y)} ∈  Ω we can use the function F to find the corresponding position in the supine image P = {(x, y)} ∈ Ω. We can use this transformation as an initialisation to the intensity-based B-spline registration function presented in Roth et al. (2011) to create a finer composite registration.

Experimental results

Clinical validation

Ethical approval and informed consent were obtained to use anonymised CT colonography data. Colonic cleansing and insufflations had been performed in accordance with current recommendations (Taylor et al., 2007) (see Fig. 7).
Fig. 7

Images of the endoluminal surface produced from the conformal mapping technique (case 11) showing the full registration with B-spline initialisation. The colour scheme shows the Shape Index (SI) and the vectors show the displacements generated from the landmark registration. Images show: (a) the source (prone) image; (b) the ambiguous vector direction on the source image; (c) the sorted displacements; (d) the source image vertically aligned to reduce displacements; (e) the source image with displacement vectors and regular grid; (f) the result of the landmark B-spline initialisation with transformed image and grid; (g) the refinement with the intensity based registration (with same grid); (h) the target (supine) image.

Haustral fold matching

For the purpose of establishing a fold labelling between the prone and supine acquisitions, we selected the same 34 patient cases used in a previous publication (Roth et al., 2011). In 24 of the cases, the colon was optimally distended in both views, and where fluid tagging (allowing for digital cleaning of residual fluid) was used or little fluid remained. This allowed a continuous segmentation over the full length of the colon using the methods described in Roth et al. (2011). The other 10 cases exhibit local colonic collapse and are used to further validate the method. The datasets were randomly allocated into training and validation sets using a random permutation, resulting in training and validation sets both with 17 prone and supine cases of which 5 exhibited one or more areas of local colonic collapse (see Table 1 for details), and 4 cases which had been excluded from the previous study due to marked differences in local distension and therefore different surface features (cases 9–12). A subset of the cases used in this study are shown in Fig. 8.
Table 1

Information of cases exhibiting local luminal collapse. For each case, the number of collapsed regions in the prone and supine images are displayed, along with the Euclidean distance across each region from centreline end-point to centreline start-point. Locations of collapse are given (DC: descending colon; SC: sigmoid colon).

CaseProneSupine
No. collapsesLocationDistance (mm)No. collapsesLocationDistance (mm)

131DC65.00
141DC245.11DC272.4
1501SC26.0
163DC6.50
DC34.4
SC8.0
1701DC18.3
Fig. 8

Surface rendered examples of a subset of the cases used for validation. Top row shows prone view. Bottom row shows supine view. Cases shown from left to right are: 9, 14 and 16.

Two radiologists (EH, AP) and a computer scientist (TH) with experience of reading CTC images, independently established a reference standard by matching haustral folds using virtual colonoscopic reconstructions, external renderings of the endoluminal surface, and unfolded images achieved by performing a conformal mapping of the endoluminal surface mesh onto a plane. Any folds where confident manual correspondences between the two views could not be established were excluded from the derived reference standard. All readers were unaware of the algorithm results. The reference standards were compared for consistency, and any discrepancies were resolved by the three readers in consensus. This resulted in a total of 1743 corresponding fold pairs over the 17 validation datasets. To assess algorithm performance, for each case the maximum a posteriori labelling solution is compared against the reference standard described above. The number of reference standard points that were correctly matched by the algorithm is given by the number of ’hits’. Similarly, the number of reference standard points that were incorrectly matched by the algorithm is given by the number of ’misses’. The ’misses’ consist of the union of the set of reference standard points that have been assigned the incorrect label, and the set of labels that have been assigned to the incorrect reference standard point. The resulting accuracy for each case is calculated as hits/(hits + misses). Table 2 shows the performance of the algorithm using the unary cost function alone. Table 3 shows the performance using the pairwise cost function only, with and without inverse consistency. In all cases the unary prior based on centreline distance and the pairwise uniqueness constraint are used to allow for a fair comparison. Table 4 shows the results using the full MRF cost function, with and without the inverse consistency constraint. Table 7 and summarise the performance of the individual cost functions with respect to the case features (fully distended or collapsed), in terms of accuracy and total number of output labels. It is apparent that across all datasets, using only the unary cost function results in a poor labelling accuracy with a mean of 44.2%, which is unsurprising since neighbouring haustral folds appear similar. Performance by using the pair-wise cost function alone is considerably better with mean accuracy of 81.6%. It is interesting to compare the pair-wise cost function results with the full model: using the subset of cases with a fully distended colon, mean accuracy only increases from 87.1% to 90.1%, compared to the much larger difference in the subset of cases exhibiting local colonic collapse where mean accuracy increases from 68.2% to 79.4%. There are two possible explanations: Firstly, the pair-wise cost function model will more likely converge to the correct solution when there is similarity in the pattern of segmented folds in the 2D parameterised space. In cases where local collapse is exhibited, the folds are less likely to be segmented correctly and therefore the likelihood of pattern similarity between the two acquisitions decreases. Secondly, the pair-wise cost function is more likely to converge on the correct solution when there is a fully connected graph along the entire colonic length. The local neighbourhood network of edges that make up the graph structure in the MRF are disconnected where there is local collapse as the geometric relationships between pairs of folds across this space cannot be as easily determined. This creates two subgraphs which may converge on the incorrect labelling solution without the unary costs in place. The inverse consistency constraint gives a better performance in terms of accuracy for the unary cost only, pair-wise cost only, and full model increasing accuracy from 44.3% to 75.1%, 81.5% to 91.8%, and 86.9% to 96.1% respectively. This is however, achieved by reducing the mean number of labels: from 106.2 to 23.5, 99.6 to 77.2, and in the case of the full model from 119.4 to 99.0. It is of interest to note that applying the pair-wise only model with inverse consistency to case 4 gives no labels, as when running algorithm (prone to supine, supine to prone) it converges on entirely different solutions. Also, using the full model with inverse consistency gives a 29% increase in the mean number of output labels compared to using pair-wise cost only.
Table 2

Fold labelling performance using the unary costs only, with and without inverse consistency. The number of folds identified in the Reference Standard (RS) are shown along with the number of algorithm output labels. From the intersection of the reference standard folds and the algorithm labelled folds, the number of correct hits and incorrect misses are displayed along with the accuracy calculated from these.

CaseRS pointsUnary costs only
Unary costs with inverse consistency
LabelsHitsMissesAccuracy (%)LabelsHitsMissesAccuracy (%)
Fully distended
17677241955.82816769.6
2134106374246.82414863.6
3127103443953.03931488.6
464101332358.920130100.0
58595373352.92212380.0
6137121354643.23316866.7
7166137426439.62819482.6
87679232251.11512192.3
992114164625.8207750.0
107089132732.5137187.5
11177164348029.836171160.7
127280211656.8186460.0
Subset mean106.3105.529.938.145.524.714.24.875.1



Collapsed
139688312654.42219386.4
149494174427.9145838.5
156187202742.61915288.2
16107122225030.6147187.5
17109148363550.73421775.0
Subset mean93.4107.825.236.441.220.613.44.275.1
Total mean102.5106.228.537.644.323.513.94.675.1
Table 3

Fold labelling performance using the pair-wise costs only, with and without inverse consistency. The number of folds identified in the Reference Standard (RS) are shown along with the number of algorithm output labels. From the intersection of the reference standard folds and the algorithm labelled folds, the number of correct hits and incorrect misses are displayed along with the accuracy calculated from these.

CaseRS pointsPair-wise costs only
Pair-wise costs with inverse consistency
LabelsHitsMissesAccuracy (%)LabelsHitsMissesAccuracy (%)
Fully distended
1767139981.35934685.0
213410487693.510387594.6
3127132101793.5131101793.5
46472301271.4000N/A
58579730100.075690100.0
613714598892.512381792.0
7166150136298.6149135298.5
8767859198.37758198.3
9928547787.065440100.0
107074351077.84034197.1
11177156793966.910577692.8
12727638784.44633489.2
Subset mean106.3101.868.59.087.181.162.83.394.6



Collapsed
139688591184.37959690.8
149478223240.723220100.0
15616939883.05839392.9
16107100244037.569232547.9
1710913685495.511083495.4
Subset mean93.494.245.819.068.267.845.27.685.4
Total mean102.599.661.811.981.577.257.64.591.8
Table 4

Fold labelling performance using the full MRF model, with and without inverse consistency. The number of folds identified in the Reference Standard (RS) are shown along with the number of algorithm output labels. From the intersection of the reference standard folds and the algorithm labelled folds, the number of correct hits and incorrect misses are displayed along with the accuracy calculated from these.

CaseRS pointsFull model
Full model with inverse consistency
LabelsHitsMissesAccuracy (%)LabelsHitsMissesAccuracy (%)
Fully distended
1769557986.49357887.7
2134137115595.8136115595.8
3127143118199.2140116199.1
4648550492.67849394.2
5859782198.894820100.0
6137168116794.3152115496.6
7166176154398.1175153398.1
8769569198.69469198.6
992106431870.561390100.0
107010158395.169560100.0
111771761241887.3151123298.4
127283271564.34527584.4
Subset mean106.3121.884.47.190.1107.383.42.796.1



Collapsed
1396110651977.48263592.6
149476262848.118180100.0
15618749590.76845197.8
16107148841188.410481396.4
1710914786792.512384693.3
Subset mean93.4113.662.014.079.479.058.23.096.0
Total mean102.5119.477.89.186.999.076.02.896.1
Table 7

Summary of mean values from Tables 2–4 with respect to case feature and cost function used.

Case featureCost functionRS pointsLabelsHitsMissesAccuracy (%)
Fully distendedUnary only106.3106.329.938.145.5
Unary with IC106.324.714.24.875.1
Pair-wise only106.3101.868.59.087.1
Pair-wise with IC106.381.162.83.394.6
Full model106.3121.884.47.190.1
Full model with IC106.3107.383.42.796.1



CollapsedUnary only93.4107.825.236.441.2
Unary with IC93.420.613.44.275.1
Pair-wise only93.494.245.819.068.2
Pair-wise with IC93.467.845.27.685.4
Full model93.4113.662.014.079.4
Full model with IC93.479.058.23.096.0



TotalUnary only102.5106.228.537.644.3
Unary with IC102.523.513.94.675.1
Pair-wise only102.599.661.811.981.5
Pair-wise with IC102.577.257.64.591.8
Full model102.5119.477.89.186.9
Full model with IC102.599.076.02.896.1
In Tables 5 and 6 the performance of the algorithm is compared to the performance of the individual readers. It is clear that the algorithm achieves a very similar level of mean accuracy at 96.1% compared to the mean reader accuracy of 97.5%. It also labels a similar mean number of folds at 99.0 per case, compared to the mean reader number of folds at 93.9 per case. To analyse the distributions of fold labelling in the reference standard and in the output of the algorithm, areas of the colon have been discretised with respect to the normalised centreline distance and the distribution of labelled folds expressed as a percentage of total folds in that region. Fig. 9 shows that the percentage of labelled folds in the reference standard (total 58.5%) and in the algorithm output (total 56.5%) follow similar distributions, with a higher confidence of fold labelling in the caecum (at 0.0) and rectum (at 1.0), and lower in the transverse and descending colon. This is due to less ambiguity in the positions of corresponding haustral folds at the terminations of the colon as a result of a reduced level of deformation and more straightforward identification of stable anatomical landmarks for which the relative positions of folds can be located.
Fig. 9

Normalised distributions of labelled folds with respect to normalised centreline distance (caecum at 0, rectum at 1) in the reference standard (left) and the MAP labelling (right).

Full surface registration

Registration error was assessed by performing the full surface registration using the Linear Scaling Initialisation with Intensity Based Registration (LSI w/ IBR) and the new B-Spline Initialisation (BSI) with and without subsequent IBR using the same cases and reference standard as in Section 4.2. Results are shown in Table 8. It is clear that the BSI w/ IBR method outperforms both the BSI w/o IBR and LSI w/ IBR registration methods with a mean Euclidean error of 6.0 mm (±1.9 mm), compared to 8.5 mm (±3.8 mm) and 11.9 mm (±11.1 mm) respectively. Using a Related Samples Wilcoxon Signed Rank Test on the Euclidean error of all 1743 fold pairs in the final reference standard, the differences in error between the BSI w/o IBR and LSI w/ IBR, BSI w/ IBR and BSI w/o IBR, and BSI w/ IBR and LSI w/ IBR methods are statistically significant with p = 0.043, p < 0.001 and p < 0.001 respectively.
Table 8

Mean fold registration error (mm) for each of the validation cases. Results are shown individually for the full registration with the Linear Scaling Initialisation with Intensity Based Registration (LSI w/ IBR) and the new B-Spline Initialisation (BSI) with and without subsequent IBR.

CaseLSI w/ IBSBSI w/o IBSBSI w/ IBS
Fully distended
16.97.16.4
26.97.45.4
34.65.44.5
49.24.34.3
55.04.45.1
63.54.63.6
75.05.55.0
85.26.04.9
Subset mean5.85.64.9
Subset Std1.71.10.8



Previously excluded cases
948.118.89.1
1012.99.07.8
1119.79.95.8
1214.29.96.8
Subset mean23.711.97.4
Subset Std14.34.01.2



Collapsed
1327.514.911.7
144.812.25.7
155.19.55.0
1617.48.15.9
176.26.75.4
Subset mean12.210.36.7
Subset Std8.93.02.5



Total mean11.98.56.0
Total Std11.13.81.9

Conclusions

We present a novel method for establishing correspondence between two CT colonography acquisitions with the patient in prone and supine positions. First, haustral folds are segmented with a graph-cut method applied to a triangular mesh representation of the colonic lumen segmentation. The method uses depth map images to drive a virtual camera optimisation to provide a unary cost value for the matching of folds between the two views. An additional pair-wise cost function compares the geometric relationship between pairs of haustral folds in the prone and supine CTC images. A parametrisation of the image space exploits the quasi-cylindrical form of the colon and simplifies the description of this geometric relationship by reducing the dimensionality from 3D to 2D. The problem is modelled using a Markov Random Field, and a Belief Propagation algorithm used to estimate the optimal labelling. This process can establish an accurate correspondence between the a set of positions in the two views even in cases where endoluminal collapse occurs, which is very common in clinical practice. We have also given an example of how this method can be applied to initialise a dense intensity-based registration technique, in this case a surface-based method, and show that it significantly reduces mean registration error. While the intensity-based registration method alone can be susceptible to misregistration by one or more haustral folds (Fig. 10), or by a degree of rotation around the tenaie coli, the new composite method defends against this problem by using a landmark-based initialisation. Moreover, the initialisation could be generalisable to other registration methods.
Fig. 10

External surface renderings of the transverse colon in the supine image of case 16. The set of reference standard points in the supine view (blue) and the corresponding points transformed from the prone view (green) and shown using the results from the LSI w/ IBS (left) and BSI w/ IBS (right) registration methods. The red lines show the Euclidean distance error. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

The work flow presented is fully automated, taking as input a prone and supine colon lumen segmentation, and in disconnected cases, the ordering of those segments. The consistency of results across cases exhibiting varying characteristics indicates that the composite method is more robust than those previously reported, especially in more ‘difficult’ cases showing marked differences in distension, or exhibiting areas of endoluminal collapse. This situation is very common in routine practice and algorithms must be able to cope in order to have clinical utility. Although algorithm performance is similar for both well-distended cases and those cases exhibiting one or more areas of endoluminal collapse, the number of identified corresponding folds can decrease. In future work we will investigate inclusion of an additional unary prior based on the relative location of stable anatomical landmarks to aid MRF convergence and increase the number of output correspondences in difficult cases. The current method relies on manual ordering of collapsed segments, so a proposed extension will automatically find their order using the same MRF model. It would also be interesting to investigate an interactive system such that landmark points could be selected manually on the virtual colonoscopic views, and used in the MRF optimisation. We will also research MRF optimisation schemes that include the inverse consistency constraint directly in the optimisation, rather than after algorithm convergence. To conclude, we have presented a novel method for matching haustral folds between prone and supine CT colonography acquisition. We give an example of how this may be used to improve the results of a full surface-based registration by validation using a set of 1743 reference standard points over 17 patient cases exhibiting a variety of characteristics.
a2 – 8.19 × 10−4
a1 – 0.331
a0 – 11.8
b2 – 6.67 × 10−3
b1 – 3.76
b0 – 650
W – 8.87 × 10−3
k – 200
α – 6.23 × 10−2
β – 0.839
τ – 0.813
Table 5

Comparison of the individual readers with the consensus reference standard. The number of folds identified in the consensus Reference Standard (RS) are shown along with the number of reader output labels. From the intersection of the reference standard folds and the reader labelled folds, the number of correct hits and incorrect misses are displayed along with the accuracy calculated from these.

CaseRS pointsReader 1 performance
Reader 2 performance
LabelsHitsMissesAccuracy (%)LabelsHitsMissesAccuracy (%)
Fully distended
1766862296.97661198.4
21341331260100.010394297.9
31271421240100.01091040100.0
46458450100.06145295.7
5857974198.784800100.0
61371361200100.0120100892.6
71661351340100.0157148199.3
87686740100.07459493.7
99285750100.09266593.0
107077660100.08145590.0
111771881680100.01381111091.7
127264550100.08959296.7
Subset mean106.3104.393.60.399.698.781.03.395.8



Collapsed
139675730100.09880693.0
149452520100.097850100.0
156161560100.07653493.0
1610796850100.011383198.8
171091331020100.010387594.6
Subset mean93.483.473.60.0100.097.477.63.295.9
Total mean102.598.187.70.299.798.380.03.395.8
Table 6

Comparison of the third reader and algorithm output with the consensus reference standard. The number of folds identified in the consensus Reference Standard (RS) are shown along with the number of reader/algorithm output labels. From the intersection of the reference standard folds and the reader/algorithm labelled folds, the number of correct hits and incorrect misses are displayed along with the accuracy calculated from these.

CaseRS pointsReader 3 performance
Algorithm performance
LabelsHitsMissesAccuracy (%)LabelsHitsMissesAccuracy (%)
Fully distended
1767060296.89357887.7
21341131080100.0136115595.8
3127102940100.0140116199.1
46455520100.07849394.2
5859482198.894820100.0
61379789198.9152115496.6
7166136125199.2175153398.1
8766260296.89469198.6
99260550100.061390100.0
10707263198.469560100.0
11177137121695.3151123298.4
12727663198.44527584.4
Subset mean106.389.581.01.398.5107.383.42.796.1



Collapsed
139675720100.08263592.6
149467441574.618180100.0
15616352394.56845197.8
1610784790100.010481396.4
171098677198.712384693.3
Subset mean93.475.064.83.893.679.058.23.096.0
Total mean102.585.276.22.097.199.076.02.896.1
  19 in total

1.  Nonrigid registration using free-form deformations: application to breast MR images.

Authors:  D Rueckert; L I Sonoda; C Hayes; D L Hill; M O Leach; D J Hawkes
Journal:  IEEE Trans Med Imaging       Date:  1999-08       Impact factor: 10.048

2.  An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision.

Authors:  Yuri Boykov; Vladimir Kolmogorov
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2004-09       Impact factor: 6.226

3.  European Society of Gastrointestinal and Abdominal Radiology (ESGAR): consensus statement on CT colonography.

Authors:  Stuart A Taylor; Andrea Laghi; Philippe Lefere; Steve Halligan; Jaap Stoker
Journal:  Eur Radiol       Date:  2007-02       Impact factor: 5.315

4.  Screening for colorectal cancer: U.S. Preventive Services Task Force recommendation statement.

Authors: 
Journal:  Ann Intern Med       Date:  2008-10-06       Impact factor: 25.391

5.  Discrete surface Ricci flow.

Authors:  Miao Jin; Junho Kim; Feng Luo; Xianfeng Gu
Journal:  IEEE Trans Vis Comput Graph       Date:  2008 Sep-Oct       Impact factor: 4.579

6.  Potentially serious adverse events at CT colonography in symptomatic patients: national survey of the United Kingdom.

Authors:  David Burling; Steve Halligan; Andrew Slater; Michael J Noakes; Stuart A Taylor
Journal:  Radiology       Date:  2006-03-28       Impact factor: 11.105

7.  Region-based supine-prone correspondence for the reduction of false-positive CAD polyp candidates in CT colonography.

Authors:  Janne Näppi; Akihiko Okamura; Hans Frimmel; Abraham Dachman; Hiroyuki Yoshida
Journal:  Acad Radiol       Date:  2005-06       Impact factor: 3.173

8.  Matching 3-D prone and supine CT colonography scans using graphs.

Authors:  Shijun Wang; Nicholas Petrick; Robert L Van Uitert; Senthil Periaswamy; Zhuoshi Wei; Ronald M Summers
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-04-27

9.  Accuracy of CT colonography for detection of large adenomas and cancers.

Authors:  C Daniel Johnson; Mei-Hsiu Chen; Alicia Y Toledano; Jay P Heiken; Abraham Dachman; Mark D Kuo; Christine O Menias; Betina Siewert; Jugesh I Cheema; Richard G Obregon; Jeff L Fidler; Peter Zimmerman; Karen M Horton; Kevin Coakley; Revathy B Iyer; Amy K Hara; Robert A Halvorsen; Giovanna Casola; Judy Yee; Benjamin A Herman; Lawrence J Burgart; Paul J Limburg
Journal:  N Engl J Med       Date:  2008-09-18       Impact factor: 91.245

10.  Registration of central paths and colonic polyps between supine and prone scans in computed tomography colonography: pilot study.

Authors:  Ping Li; Sandy Napel; Burak Acar; David S Paik; R Brooke Jeffrey; Christopher F Beaulieu
Journal:  Med Phys       Date:  2004-10       Impact factor: 4.071

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.