Literature DB >> 35463693

Efficient Johnson-S_B Mixture Model for Segmentation of CT Liver Image.

Abstract

To overcome the problem that the traditional Gaussian mixture model (GMM) cannot well describe the skewness distribution of the gray-level histogram of a liver CT slice, we propose a novel segmentation method for liver CT images by introducing the Johnson-SB mixture model (JSBMM). The Johnson-SB model not only has a flexible asymmetrical distribution but also covers a variety of other distributions as well. In this article, the parameter optimization formulas for JSBMM were derived by employing the expectation-maximization (EM) algorithm and maximum likelihood. The implementation process of the JSBMM-based segmentation algorithm is provided in detail. To make better use of the skewness of Johnson-SB and improve the segmentation accuracy, we devise an idea to divide the histogram into two parts and calculate the segmentation threshold for each part, respectively, which is called JSBMM-TDH. By analyzing and comparing the segmentation thresholds with different cluster numbers, it is illustrated that the segmentation threshold of JSBMM-TDH will tend to be stable with the increasing of cluster number, while that of GMM is sensitive to different cluster numbers. The proposed JSBMM-TDH is applied to segment four randomly obtained abdominal CT image sequences, and the segmentation results and robustness have been compared between JSBMM-TDH and GMM. It is verified that JSBMM-TDH has preferable segmentation results and better robustness than GMM for the segmentation of liver CT images.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35463693 PMCID： PMC9023182 DOI： 10.1155/2022/5654424

Source DB: PubMed Journal: J Healthc Eng ISSN： 2040-2295 Impact factor: 3.822

1. Introduction

Liver cancer is a common malignant neoplasm worldwide, and the incidence of primary liver cancer is still on the rise at the global level [1]. The accurate understanding of the shape of the liver, the location and size of lesions in the liver tissue, and the relationship between the liver and surrounding blood vessels can help doctors to develop more effective treatment options. In addition, accurate liver segmentation is also conducive to three-dimensional reconstruction for liver and virtual surgery. At present, manual delineation of each slice by experts is still the standard clinical practice for liver demarcation [2]. Because the segmentation of organs and lesions has to be carried out layer by layer in CT slices, it is pretty cumbersome and time-consuming for doctors or experts to do this repetitive work. In practice, CT slices are grayscale images. The pixels in the image reflect the X-ray absorption coefficient of the corresponding voxels. Black areas represent low absorption areas, that is, low-density tissues and organs of human body, such as lungs; white areas represent high-absorption areas, that is, high-density body parts, such as bones. The absorption value in CT images is given in Hounsfield units (HUs). Compared with ordinary X-ray images, CT images have a higher density resolution. Therefore, CT images can better show organs composed of soft tissue, such as lung, liver, gallbladder, pancreas, and pelvis, and can well demonstrate pathological changes in the tomography image. At present, the original pixel size of the abdominal CT image is 512 × 512, and the HU value of the liver varies widely. For example, a healthy liver has smooth contours and uniform density, and the absorption values are 60 ± 6 HU (or 64 ± 5 HU). However, when the liver has about 80% steatosis, the absorption values will be reduced to about −50 HU. In contrast, the density of the liver parenchyma will increase due to the accumulation of iron in patients with hemochromatosis, whose CT scan showed that the liver parenchyma was clearly dense and bright, with an absorption value as high as +140 HU (so-called white liver) [3]. There is no doubt that it is quite difficult to accurately segment the liver within such a large gray range. To solve the above problems, many scholars have proposed a variety of liver CT image segmentation methods [4-7]. Some of these algorithms require human-computer interaction, such as active contour [8] and Livewire (intelligent scissors) [9, 10]. Some are semi-automatic, for example, graph-cut [11] and region growth [12, 13] methods. Some other methods focusing on fully automatic segmentation include statistical shape models (SSMs) [14, 15] and thresholding algorithm [16]. Neural network algorithms aiming to achieve automatic feature extraction have also been applied to the segmentation of medical images in recent years [17, 18]. The ultimate common goal of different methods is to segment images accurately and automatically, but this goal is still a bottleneck problem in liver CT image segmentation, due to the complexity of abdominal CT images and the differences between different liver morphologies. In this study, we aim to study on the finite mixture model (FMM), one kind of threshold segmentation algorithm, to improve the segmentation accuracy of liver CT images, and try hard to segment automatically at the same time. According to the idea of the threshold segmentation algorithm, if the grayscale threshold of the liver in the CT slice can be accurately determined, it is possible to realize the liver segmentation automatically. In 1893, Karl Pearson made an experiment using the method of moments to fit a mixture of two normal components to the crabs' data, which proved the FMM could improve the accuracy of clustering [19]. Since then, FMM was adopted to improve the accuracy of threshold segmentation methods. In 1972, Chow and Kaneko applied FMM in medical images to segment the left ventricle from cine angiograms with two Gaussian distributions [20]. There are two core points in FMM, one is the selection of the probability density function of the mixed components and the other is the parameter estimation of the mixture model. The most common mixed component probability distribution used in FMM is the Gaussian distribution, because in many cases there is a normal distribution in univariate and multivariate data. Therefore, the Gaussian mixture model (GMM) has been widely used in the segmentation of the images [21-24]. In addition to the Gaussian distribution, gamma distribution, Student's t distribution, exponential distribution, and Rayleigh distribution also commonly appear in FMM [25-27], and their probability density diagrams with different parameters are shown in Figure 1, respectively.

Figure 1

Probability density of normal, gamma, exponential, Student's t, and Rayleigh distributions. (a) μ : mean, σ : standard deviation. (b) α : shape parameter, β : scale parameter. (c) μ : mean. (d) x : position parameter, ν : degrees of freedom. (e) B: scale parameter.

The upper, middle, and lower slices of the liver CT image sequence are shown in Figures 2(a)∼2(c)), respectively. Figures 2(d)∼2(f) give the corresponding gray-level histograms of Figures 2(a)∼2(c)). Although the peak shape of gray-level histogram has symmetry to some extent, its asymmetry is also very obvious, which is different from the symmetric characteristic of Gaussian distribution and the asymmetric characteristic of the exponential, Rayleigh distribution, etc. Therefore, it is very difficult to fit the peak shape accurately with any single distribution shown in Figure 1.

Figure 2

Upper, middle, and lower slices of the liver CT image and their histograms. (a) The upper slice. (b) The middle slice. (c) The lower slice. (d) Gray-level histogram of (a). (e) Gray-level histogram of (b). (f) Gray-level histogram of (c).

To solve the problem, many researchers have focused on forming a mixture model using distribution functions that can better fit the shape of a single peak. The research ideas mainly concentrate on the following three kinds of mixture models. The first kind of mixture model is forming an asymmetric generalized Gaussian distribution (AGGD) by introducing shape parameters or functions into the generalized normal distribution to describe the skew characteristic [28, 29], so that it can describe not only a symmetric distribution but also an asymmetric distribution. However, the expression of the AGGD is complicated by embedding the gamma function. In the case of using the EM algorithm for maximum-likelihood estimation (MLE), all AGGD parameters in the mixture model are represented by highly nonlinear equations, which makes the numerical solutions cumbersome and sensitive to initial EM values [30]. Combining Gauss with other distributions to form a new model is the second idea. Wilson selected two Gaussian distributions and one uniform distribution to fit the low-gray and high-gray regions of the brain MRA data histogram, respectively [31]. Hassouna proposed a linear combination of a finite mixture model using one Rayleigh distribution and two Gaussian distributions [32]. Hence, for different problems, people need to determine in advance which existing models can be used to form a new probability distribution model, and the number of each distribution also needs to be determined in advance. Therefore, this kind of model is not flexible, and it is also difficult to realize segmentation automatically. The third way is to combine the components of the mixed model with the non-Gaussian distribution. Lee and McLachlan introduced a finite mixture of canonical fundamental deviation t (CFUST) distributions for asymmetric and possibly long-tailed clusters [33]. Sefidpour and Bouguila proposed and investigated the segmentation of spatial color images using the Dirichlet and Beta-Liouville distributions [34]. The normal inverse Gaussian distribution (NIG) is chosen by Karlis and Santourian to deal with skewed subpopulations [35]. Franczak et al. studied the asymmetric Laplace distribution (ALD) for clustering and classification [36]. NIG and ALD belong to the family of generalized hyperbolic (GH) distributions designed by Barndorff-Nielsen [37]. Browne and McNicholas extended a special case for the generalized hyperbolic distribution [38]. Wraith and Forbes studied the properties of these distributions in multiscale and their application in multivariate clustering [39]. Although there are many combination methods of non-Gaussian distribution, only a few methods are used for the segmentation in medical images, especially in liver CT images. Therefore, the study on the segmentation of liver CT images still needs further research. Considering the asymmetric skew characteristic of the gray histogram of the liver CT image, and inspired by the skew characteristic of Johnson-SB distribution, we propose a novel mixture model with the combination of Johnson-SB distribution to segment the liver CT image. In the following paper, the Johnson-SB mixture model and the optimized parameters with the EM algorithm are introduced firstly in Section 2. Secondly, Section 3 gives the implementation details of the segmentation algorithm based on the Johnson-SB mixture model, and the effects of cluster number on the segmentation threshold are also analyzed and compared between GMM and Johnson-SB mixture model in this section. Then, the segmentation experimental results of four randomly obtained abdominal CT image sequences from different image databases are provided in Section 4. Finally, conclusions are drawn in Section 5.

2. Finite Johnson-SB Mixture Model

In 1949, Johnson deduced a curve system called the Johnson system, which contains Johnson-SB, Johnson-SL, and Johnson-SU. The symbol SL means “log-normal system,” SB means “bounded system,” and SU means “unbounded system” [40]. The Johnson system can closely approximate many continuous distributions by one of the three functional forms, so it is very flexible to fit variety curves. Many of the commonly used continuous distributions, such as normal, log-normal, gamma, beta, and exponential, are special cases of the Johnson system; therefore, it has more advantages to fit curves with the Johnson system than any other single distribution [41].

2.1. Johnson-SB Distribution

Johnson-SB is one of the three distributions of the Johnson system. It corresponds to the distribution of a continuous random variable x in which a particular transformation is applied to obtain a normal distribution. The transformation is as follows:where x is a given continuous random variable. In this study, x is the grayscale value of the pixel in the CT image. x ∈ (ξ, ξ + λ), ξ = min (x), λ = max (x) − min (x), and γ and δ are shape parameters, δ > 0, γ ∈ (−∞, +∞). Z is a standard normal random variable, and its probability density function is as follows: We writewhere z is the inverse function of y. According to the transformations of continuous random variable, Then, the probability density function (PDF) with regard to x is as follows: Figure 3 shows some typical probability density function curves of Johnson-SB with different parameters. The range of horizontal coordinates is from 0 to 255, and the vertical coordinates are the corresponding probability density function values. Figure 3(a) presents the curves with different δ and γ = 0, and it is a normal distribution in the middle. Figures 3(b)∼3(e) present the curves with different δ and γ = 0, respectively, and they have better skew characteristics, especially at the two ends of the abscissa. It can be seen that γ controls the position of the function, and the distributions of the function are normal distribution, negatively skewed distribution, and positively skewed distribution when γ = 0, γ > 0, and γ < 0, respectively.

Figure 3

Probability density function curves with different parameters.

2.2. Johnson-SB Mixture Model (JSBMM) and Optimizing Parameters

2.2.1. Johnson-SB Mixture Model (JSBMM)

The finite mixture model (FMM) refers to the linear superposition of distribution functions of the same type but with different parameters. In the discrete case, the probability density function of a finite mixture distribution can be expressed as a p-dimensional random vector X [42].where x ⊂ X is the pixel grayscale value in the CT image, θ = (δ, γ), p (x|θ) is the density of the kth component, and Θ = (Φ1,…, Φ; θ1,…, θ) is the vector of parameters. Note that p (x| Θ) in Eqn (3) defines a density that is called a K-component finite mixture density. Here, the weight of the kth mixing proportion Φ satisfies the following relations:and According to Eqn (3), the probability density of a Johnson-SB mixture model is defined as follows:Here,is the probability density of a random variable X for a Johnson-SB distribution with the parameters δ and γ.

2.2.2. Optimizing Parameters with EM Algorithm

The vector of parameters Θ typically introduced by the log-likelihood function is defined as follows: Here, x is the ith discrete grayscale value and N is the number of discrete dots of CT image. The detailed derivation of the parameters given in equations (6)–(8) is presented in Appendix A.

3. Segmentation Algorithm Based on JSBMM

3.1. Implementation Details

There are mainly four steps to implement the segmentation algorithm based on JSBMM, and the details of each step in Algorithm 2 are introduced as follows.

3.1.1. Obtaining the Approximate Gray Value (LV_A)

The Hounsfield unit (HU) value of liver tissue varies from patient to patient. In addition, the X-ray tube of the CT machine will age with longtime use, which results in the intensity decrease in the X-ray source. Therefore, the liver HU values of different CT image sequences are usually different. Clearly, it is not possible to identify all liver scan sequences with the same HU range value. We can make full use of the continuity of CT scan section images to solve this problem. CT scans require patients to hold their breath during data collection, and this process usually takes no more than 20 seconds [43]. Therefore, in the same CT image sequence, the range of gray values between all liver slices varied a little. It is feasible to obtain the liver HU value of this sequence using the slice image with the largest liver area. Figure 4(a) shows the frontal outline of the liver, and Figures 4(b)∼4(d) give the upper, middle, and lower slice images of the liver CT scan sequence, respectively. It can be seen that the liver is almost completely surrounded by the ribs, and the liver image at about 1/3 of the entire sequence is the largest one [44]. For example, if a complete sequence of CT scans of the liver has 90 slices, the liver slice near the 30th slice will be more significant than others.

Figure 4

Frontal outline and the upper, middle, and lower sections of the liver CT scan sequence.

Figures 5(a) and 5(c) show two liver slices taken from two different CT scan sequences, which locate at 1/3 of each sequence. Grayscale-level statistics were performed at about 1/4 of the body section, shown as the dotted line in Figures 5(a) and 5(c). The approximate grayscale value of the liver (LV_A) can be easily calculated after removing the grayscale values of black and white zones. Figures 5(b) and 5(d) are the corresponding gray-level histograms with a sampling width of 10 pixels for the positions shown as the dotted lines in Figures 5(a) and 5(c), respectively. Here, the horizontal coordinates are the grayscale values, and the vertical coordinates are the statistical numbers of the corresponding grayscale values. The grayscale values with maximum statistical numbers are 131 and 178 as shown in Figures 5(b) and 5(d), respectively, which are taken as the approximate grayscale value of the liver (LV_A) in Figures 5(a) and 5(c), respectively. It should be noted that the maximum statistical number of the grayscale value 0 should be omitted, although it is more than 2000, because it means the black zone and is useless for the segmentation of the liver.

Figure 5

Slices of 1/3 of the liver and the corresponding LV_A. (a) A slice of 1/3 of the liver in the first sequence. (b) The LV_A of (a) is 131. (c) A slice of 1/3 of the liver in the second sequence. (d) The LV_A of (b) is 178.

3.1.2. Finding the Maximum Gray Value (LV_M)

Although liver HU values do not change significantly during the scan process, there also exist some slight changes. To segment each slice accurately, it is necessary to further locate the maximum grayscale value of the liver (LV_M) close to the LV_A on the gray-level histogram of each slice. For instance, Figure 6(a) shows the slice of the upper part of the liver CT sequence shown in Figure 5(c). Figure 6(c) displays the slice of the lower part in the same sequence. Figure 6(b) gives the gray-level histogram of Figure 6(a), and the gray-level histogram of Figure 6(c) is shown in Figure 6(d). Because the value of LV_A shown in Figure 5(d) is 178, the corresponding LV_M values of Figures 6(a) and 6(c) are 179 and 176, as shown in Figures 6(b) and 6(d), respectively. Here, it needs to be stated that we should choose the peak value closest to 178 as LV_M, because 178 is determined in Figure 5(d).

Figure 6

Maximum gray value of the liver (LV_M) in CT slice image. (a) The upper section. (b) LV_M of Figure 6(a), (c) The lower section. (d) LV_M of Figure 6(c).

3.1.3. Determining the Segmentation Points

The position of peak and skewness characteristics of the gray-level histogram of liver slices are uncertain and nonuniqueness, so it is difficult to take full advantage of the JSBMM in describing skewness if JSBMM is applied directly to fit the whole gray-level histogram. To make full use of the advantages of Johnson-SB in describing the skew characteristics, we divided the gray-level histogram into the left and right parts at LV_M to ensure that the gray level of the liver is just at the boundary of the histogram, to improve the Johnson-SB segmentation accuracy. The LV_M of Figure 6(c) is 176. At the LV_M point, the gray-level histogram shown in Figure 6(d) is divided into the left part and the right part, and the grayscale values of the segmentation points are obtained using JSBMM to fit the gray-level histograms. Figures 7(a) and 7(b) give the fitting results of the left part and the right part, respectively. The grayscale value of the segmentation point of the left part is 157, which is the intersection of the two rightmost fitting curves shown in Figure 7(a). The grayscale value of the segmentation point of the right part is 190, which is the intersection of the two leftmost fitting curves shown in Figure 7(b). However, if there is a peak of one fitting curve appearing between the intersection point of the other two fitting curves and the LV_M, the grayscale value of this peak will be taken as the segmentation point, which can get better segmentation results. For example, the grayscale value of the segmentation point of the right part should be 188, because the grayscale value of the peak is 188, which appears before the intersection of 190. Therefore, the range of the grayscale value for liver segmentation in Figure 6(c) is set from 157 to 188, which is the range of the segmentation thresholds for Figure 6(c).

Figure 7

Grayscale values of segmentation points and the binarized result.

3.1.4. Binarizing the Images and Processing with Mathematical Morphology

Mathematical morphology is a tool for extracting image components. Erosion, dilation, and filling are three basic morphological set transformations. These transformations involve the interaction between image and structuring element [45]. Figure 7(c) is obtained by binarizing Figure 6(c). The final boundary of the liver segmentation can be drawn as shown in Figure 7(d) after the procession of filling, erosion, and expansion algorithms in mathematical morphology.

3.2. Effect of Cluster Number on the Segmentation Thresholds

A key point of FMM is how to select the cluster numbers to realize a better fitting, and underfitting or overfitting may occur if the cluster number is selected inappropriately [46]. In this study, the cluster number means the number of curves in a cluster used to fit the gray-level histogram. To analyze the effect of different cluster numbers on segmentation thresholds, we choose two different slices to be segmented by GMM and JSBMM in the following Section 3.2.1 and Section 3.2.2, respectively. The initial location of the grayscale value is another consideration to affect the segmentation threshold. In this study, the initial locations are evenly arranged within the range of the grayscale values of the entire image.

3.2.1. Segmentation Thresholds of GMM with Different Cluster Numbers

Table 1 gives the segmentation thresholds of the upper, middle, and lower slices from two different CT image sequences. By analyzing the segmentation thresholds with different cluster numbers n, it can be found that the segmentation thresholds are changeable with the change in n, while they do not tend to be stable with the increasing n. In this study, n denotes the cluster numbers used to fit the gray-level histogram, and n is the same as K in Algorithm 1.

Table 1

Segmentation thresholds of two different CT image sequences using GMM.

Cluster number (n)	Segmentation thresholds
	Slices of sequence S1			Slices of sequence S2
	Upper	Middle	Lower	Upper	Middle	Lower
6	153∼201	168∼196	158∼204	115∼149	117∼149	115∼147
7	157∼201	168∼195	162∼194	109∼150	117∼150	115∼148
8	153∼202	165∼197	157∼196	122∼148	121∼145	120∼144
9	175∼193	169∼197	175∼188	112∼149	117∼152	116∼148
10	154∼201	172∼190	158∼196	124∼150	122∼150	121∼145
11	173∼195	169∼193	168∼191	113∼148	118∼153	116∼149
12	174∼194	168∼194	169∼191	126∼150	124∼150	124∼147
13	177∼193	172∼184	168∼189	115∼147	117∼150	116∼149
14	173∼194	169∼190	166∼191	127∼150	124∼150	125∼146
15	173∼194	168∼193	168∼191	117∼144	118∼142	117∼145

To illustrate the effect on the segmentation results with different cluster numbers, we give some segmentation results of the upper slices of the sequences S1 and S2. Here, Figure 8 shows the threshold segmentation results of the upper slices with n = 6, 8, 10, 12, and 14 using GMM, respectively. The first row and the second row give the segmentation results and thresholds for the upper slice of the first sequence S1. The third row and the fourth row give the results and thresholds for the upper slice of the second sequence S2.

Figure 8

Segmentation results and thresholds of GMM with different cluster numbers. (a) n = 6. (b) n = 8. (c) n = 10. (d) n = 12. (e) n = 14.

By comparing the segmentation results in the first row, it is not difficult to find that the results of the first three columns with n = 6, 8, and 10 are similar, because the three segmentation results include some other organs that do not belong to the liver. Although the segmentation result in the fourth column with n = 12 only includes the liver, the segmentation result is a little smaller liver than that of the fifth column with n = 14. Therefore, the segmentation result with n = 14 is relatively more reasonable by comparing with the other four results for the first sequence. Comparing the results of row 3 for the second sequence, it is obvious that the results of the first two columns with n = 6 and 8 are not correct, and the segmentation result of the third column with n = 10 is a little smoother than that of the fourth one with n = 12, while the segmentation result of the fifth column with n = 14 turns to be rougher boundary due to overfitting. Thus, the better segmentation result can be obtained when n is 10 for the second sequence. As can be seen from Figure 8, the segmentation results of GMM are sensitive to cluster numbers, and it is not easy for GMM to use fixed cluster number to get accurate segmentation results, which is a difficulty in the application of GMM, especially for a large number of images that need to be segmented. Here, it is needed to be stated that the red contours in the figures in Figure 8 indicate the error zones of the segmentation. It is the same meaning in the following figures.

3.2.2. Segmentation Thresholds of JSBMM with Different Cluster Numbers

Figures 9∼11 give the segmentation thresholds and the results of the upper, middle, and lower slices of the first liver CT sequence S1, and here, the segmentation thresholds are obtained with JSBMM-TDH, which means the gray-level histogram is divided into left and right parts, and it is denoted as thresholds of dividing histogram (TDH). While thresholds of whole histogram (TWH) mean that the segmentation thresholds are obtained with JSBMM under the whole gray-level histogram, and it is denoted as JSBMM-TWH. The segmentation results of JSBMM-TWH are not given in Figures 9∼11, and only the segmentation thresholds are summarized in the TWH column given in Table 2. The segmentation results of JSBMM-TDH with n = 6, 8, 11, and 13 are shown in Figures 9(a)∼9(d), respectively. Figures 9(e)∼9(n) show the segmentation thresholds of the left and right parts with n = 6∼15, respectively. Figures 10(a)∼10(n) and 11(a)∼11(n) give the corresponding segmentation results and thresholds of JSBMM-TDH for the middle and lower slices, respectively.

Figure 9

Segmentation thresholds of JSBMM-TDH with different cluster numbers for the upper slice. (a) n = 6. (b) n = 8. (c) n = 11. (d) n = 13. (e) n = 6. (f) n = 7. (g) n = 8. (h) n = 9. (i) n = 10. (j) n = 11. (k) n = 12. (l) n = 13. (m) n = 14. (n) n = 15.

Figure 11

Segmentation thresholds of JSBMM-TDH with different cluster numbers for the lower slice. (a) n = 6. (b) n = 8. (c) n = 11. (d) n = 13. (e) n = 6. (f) n = 7. (g) n = 8. (h) n = 9. (i) n = 10. (j) n = 11. (k) n = 12. (l) n = 13. (m) n = 14. (n) n = 15.

Table 2

Segmentation thresholds of S1 with different segmentation methods.

n	The upper slice			The middle slice			The lower slice
	J_SBMM		GMM (Figure 8)	J_SBMM		GMM (Figure 8)	J_SBMM		GMM (Figure 8)
	TDH (Figure 9)	TWH	GMM (Figure 8)	TDH (Figure 10)	TWH	GMM (Figure 8)	TDH (Figure 11)	TWH	GMM (Figure 8)
6	143∼194	157∼194	153∼201	166∼200	167∼195	168∼196	149∼189	160∼194	158∼204
7	147∼194	156∼198	157∼201	162∼199	171∼190	168∼195	148∼195	161∼193	162∼194
8	159∼194	164∼193	153∼202	162∼198	168∼195	165∼197	151∼195	165∼192	157∼196
9	160∼194	176∼192	175∼193	162∼196	168∼196	169∼197	153∼196	170∼190	175∼188
10	160∼194	176∼192	154∼201	161∼195	173∼190	172∼190	156∼196	171∼187	158∼196
11	161∼194	175∼192	173∼195	162∼194	170∼187	169∼193	156∼190	168∼189	168∼191
12	163∼193	175∼193	174∼194	162∼194	171∼188	168∼194	157∼189	170∼189	169∼191
13	164∼194	176∼193	177∼193	161∼193	171∼189	172∼184	157∼188	171∼189	168∼189
14	164∼194	177∼192	173∼194	161∼193	171∼185	169∼190	158∼188	169∼186	166∼191
15	164∼194	172∼185	173∼194	162∼193	171∼186	168∼193	158∼188	170∼188	168∼191

It is obvious that the segmentation thresholds almost tend to be stable when n is bigger than 12, which is shown as the bold values shown in TDH columns.

Figure 10

Segmentation thresholds of JSBMM-TDH with different cluster numbers for the middle slice. (a) n = 6. (b) n = 8. (c) n = 11. (d) n = 13. (e) n = 6. (f) n = 7. (g) n = 8. (h) n = 9. (i) n = 10. (j) n = 11. (k) n = 12. (l) n = 13. (m) n = 14. (n) n = 15.

The segmentation thresholds with n = 6∼15 for the upper, middle, and lower slices are summarized in the corresponding TDH column given in Table 2. It is obvious that the segmentation thresholds almost tend to be stable when n is bigger than 12. While the segmentation thresholds in TWH column are also changeable, even n is bigger than 12, which is similar to the trend of GMM. By analyzing and comparing the range of segmentation thresholds between JSBMM-TDH and JSBMM-TWH with the same n, it can be seen that most of the upper limit values are close to each other, but the lower limit values of JSBMM-TDH are smaller than that of JSBMM-TWH, and the differences are mostly around 10. Analyzing and comparing the segmentation results of (a)∼(d) shown in Figures 9∼11, respectively, the better segmentation results can be obtained for the upper, middle, and lower slices when n is 13. Therefore, considering the trend of segmentation thresholds given in Table 2, we determine to take n = 13 as the fixed cluster number for JSBMM-TDH to segment different CT image sequences in the following segmentation experiments in Section 4. As a matter of convenience in comparison, Table 2 also gives the segmentation thresholds of GMM for liver CT sequence S1 shown in Table 1. By analyzing and comparing the range of segmentation thresholds between JSBMM-TWH and GMM with the same n, it is found that most of the upper and lower limit values are close to each other, which means the segmentation results of JSBMM-TWH are similar to that of GMM. Therefore, the segmentation results of JSBMM-TDH are both better than the results of JSBMM-TWH and GMM. Table 3 gives the segmentation calculation time with different cluster numbers as shown in Figures 9–11. The segmentation time is less affected by the different cluster numbers, and it almost tends to decrease for the middle slice of Figure 10 with a large liver area. In this study, only the parameters Φ, δ, and γ are used to calculate the threshold value of analytic solution, so the segmentation speed is quite fast, and the effect of segmentation time with different cluster numbers can be ignored.

Table 3

Segmentation calculation time for the JSBMM with TDH for the upper, middle, and lower slices when n = 6∼15, respectively (time unit is second).

n	6	7	8	9	10	11	12	13	14	15
Upper (Figure 9)	2.62	11.7	5.71	3.05	3.30	5.60	3.01	4.89	5.15	1.99
Middle (Figure 10)	5.09	6.26	5.51	4.58	4.97	4.46	4.03	3.64	3.94	3.64
Lower (Figure 11)	2.41	7.57	4.38	2.59	3.09	2.95	2.56	4.10	5.41	3.47

4. Segmentation Experimental Results

The JSBMM-TDH was applied to segment four randomly obtained abdominal CT image sequences from different image databases [47,48], and the segmentation thresholds and results of JSBMM-TDH are compared with that of GMM, as shown in Figures 12∼14, which are the upper, middle, and lower slices of the four different CT image sequences, respectively. In these figures, the first row shows the original image from four different CT image sequences, the second row gives the threshold segmentation results of GMM, and the third row is the binary images obtained according to the segmentation results of GMM. The fourth row shows the binary images obtained depending on the segmentation results of JSBMM-TDH. The fifth row is the segmentation results of GMM, and the sixth row gives the results of JSBMM-TDH. The seventh row and the last row provide the left and right segmentation thresholds of JSBMM-TDH, respectively.

Figure 12

Binary images, segmentation thresholds, and results of JSBMM-TDH and GMM for the upper slice.

Figure 14

Binary images, segmentation thresholds, and results of JSBMM-TDH and GMM for the lower slice.

Table 4 gives the comparison of segmentation thresholds and quantitative evaluation for Figures 12–14, respectively. Here, it needs to be stated that the segmentation results of GMM are fitted with different cluster numbers, which is the best one chosen from the segmentation results with n = 6∼15 for each slice for the sequences S2, S3, S4, and S5, respectively, while all segmentation results of JSBMM-TDH are fitted with the same cluster number n = 13. However, most segmentation results of JSBMM-TDH are better than those of GMM, and the other few results are similar to each other. On the whole, it is illustrated that not only the segmentation results of JSBMM-TDH are better than those of GMM, but also the cluster numbers can be fixed at n = 13 without worrying about overfitting. This is good for the implementation of automatic segmentation of liver CT images.

Table 4

Comparison of segmentation thresholds and quantitative evaluation for Figures 12–14.

Liver slice position	Sequence	Segmentation thresholds		Quantitative evaluation
		GMM	JSBMM-TDH(n = 13)	Jaccard index			Dice's coefficient
		GMM	JSBMM-TDH(n = 13)	GMM	JSBMM-TDH	Difference	GMM	JSBMM-TDH	Difference
Upper slice (Figure 12)	S2	126∼150 (n = 12)	126∼146	0.9063	0.8854	−0.0209	0.9509	0.9392	−0.0117
	S3	152∼169 (n = 12)	146∼171	0.8793	0.9483	0.069	0.9358	0.9735	0.0377
	S4	127∼143 (n = 12)	122∼152	0.5955	0.7608	0.1653	0.7465	0.8641	0.1176
	S5	196∼226 (n = 12)	181∼212	0.5818	0.7805	0.1987	0.7356	0.8767	0.1411

Middle slice (Figure 13)	S2	123∼154 (n = 12)	117∼144	0.9173	0.9298	0.0125	0.9569	0.9636	0.0067
	S3	152∼173 (n = 13)	149∼177	0.9549	0.948	−0.0069	0.9769	0.9733	−0.0036
	S4	130∼145 (n = 12)	108∼151	0.6523	0.9527	0.3004	0.7895	0.9758	0.1863
	S5	188∼221 (n = 11)	172∼213	0.9744	0.9744	0.0	0.9501	0.9501	0.0

Lower slice (Figure 14)	S2	125∼151 (n = 12)	125∼140	0.8852	0.8794	−0.0058	0.9391	0.9358	−0.0033
	S3	158∼170 (n = 14)	149∼175	0.8234	0.904	0.0806	0.9031	0.9496	0.0465
	S4	100∼126 (n = 13)	113∼153	0.8715	0.8777	0.0062	0.9314	0.9349	0.0035
	S5	192∼223 (n = 11)	169∼212	0.9422	0.9719	0.0297	0.8907	0.9454	0.0547

Average value				0.832	0.9011	0.0691	0.8922	0.9402	0.048

Figure 13

Binary images, segmentation thresholds, and results of JSBMM-TDH and GMM for the middle slice.

The Jaccard index and Dice coefficient are common indexes for quantitative evaluation of image segmentation. The Jaccard index is a statistic used for comparing the similarity of sample sets. The Dice coefficient is another similarity measure index. The Jaccard index and Dice coefficient are calculated to quantitatively evaluate the segmentation results of GMM and JSBMM-TDH, respectively. The values of the Jaccard index and Dice coefficient are given in Table 4, and the values in the column of difference mean the difference value between JSBMM-TDH and GMM, that is, the Jaccard index (or Dice coefficient) of JSBMM-TDH minus that of GMM. The last row gives the average value. The maximum value of difference column in the Jaccard index is 0.1987, and the average value of this column is 0.0691. The maximum value of difference column in the Dice coefficient is 0.1863, and the average value of this column is 0.048. By comprehensive quantitative comparison, it is found that the segmentation results of JSBMM-TDH are better than that of GMM. The website addresses of the data sets S1, S2, S3, S4, and S5 are provided in Appendix B.

5. Conclusions

The JSBMM-TDH with flexibly skewed characteristics proposed in this study is suitable for fitting the skewness distribution of the gray-level histogram of liver CT images. The parameter optimization algorithm employing EM and the implementation process of the segmentation algorithm has been given in detail. The effects of cluster number on segmentation threshold were discussed and compared for GMM, JSBMM-TWH, and JSBMM-TDH, respectively. It is shown that the JSBMM-TDH threshold will tend to be stable at cluster number 13, while the threshold of GMM and the threshold of JSBMM-TWH are similar and sensitive to different cluster numbers. The proposed JSBMM-TDH with cluster number 13 is applied to segment four random CT image sequences, and the segmentation results are compared with those of GMM. Analyzing the segmentation results and quantitative evaluations, it is further illustrated that JSBMM-TDH does not have the overfitting phenomenon with the increase in cluster number, which verifies that JSBMM-TDH has preferable segmentation results and better robustness than GMM. JSBMM-TDH makes it possible to realize the automatic segmentation of live CT image due to the robustness with the fixed cluster number. The JSBMM-TDH can be used not only for liver CT image segmentation, but also for other CT image segmentation as well.

16 in total

1. Mixtures of Shifted AsymmetricLaplace Distributions.

Authors: Brian C Franczak; Ryan P Browne; Paul D McNicholas
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2014-06 Impact factor: 6.226

2. Improved interactive medical image segmentation using Enhanced Intelligent Scissors (EIS).

Authors: Akshaya Mishra; Alexander Wong; Wen Zhang; David Clausi; Paul Fieguth
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2008

3. Rayleigh mixture model for plaque characterization in intravascular ultrasound.

Authors: José C Seabra; Francesco Ciompi; Oriol Pujol; Josepa Mauri; Petia Radeva; João Sanches
Journal: IEEE Trans Biomed Eng Date: 2011-01-17 Impact factor: 4.538

4. Automatic 3D liver location and segmentation via convolutional neural network and graph cut.

Authors: Fang Lu; Fa Wu; Peijun Hu; Zhiyi Peng; Dexing Kong
Journal: Int J Comput Assist Radiol Surg Date: 2016-09-07 Impact factor: 2.924

5. Automatic boundary detection of the left ventricle from cineangiograms.

Authors: C K Chow; T Kaneko
Journal: Comput Biomed Res Date: 1972-08

6. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository.

Authors: Kenneth Clark; Bruce Vendt; Kirk Smith; John Freymann; Justin Kirby; Paul Koppel; Stephen Moore; Stanley Phillips; David Maffitt; Michael Pringle; Lawrence Tarbox; Fred Prior
Journal: J Digit Imaging Date: 2013-12 Impact factor: 4.056

7. The trends in incidence of primary liver cancer caused by specific etiologies: Results from the Global Burden of Disease Study 2016 and implications for liver cancer prevention.

Authors: Zhenqiu Liu; Yanfeng Jiang; Huangbo Yuan; Qiwen Fang; Ning Cai; Chen Suo; Li Jin; Tiejun Zhang; Xingdong Chen
Journal: J Hepatol Date: 2018-12-11 Impact factor: 25.083