Literature DB >> 27857845

An improved teaching-learning based robust edge detection algorithm for noisy images.

Sasirooba Thirumavalavan1, Sasikala Jayaraman1.   

Abstract

This paper presents an improved Teaching Learning Based Optimization (TLO) and a methodology for obtaining the edge maps of the noisy real life digital images. TLO is a population based algorithm that simulates the teaching-learning mechanism in class rooms, comprising two phases of teaching and learning. The 'Teaching Phase' represents learning from the teacher and 'Learning Phase' indicates learning by the interaction between learners. This paper introduces a third phase denoted by "Avoiding Phase" that helps to keep the learners away from the worst students with a view of exploring the problem space more effectively and escaping from the sub-optimal solutions. The improved TLO (ITLO) explores the solution space and provides the global best solution. The edge detection problem is formulated as an optimization problem and solved using the ITLO. The results of real life and medical images illustrate the performance of the developed method.

Entities:  

Keywords:  Canny and Sobel operators; Edge detection; Evolutionary algorithms; Teaching–learning based optimization

Year:  2016        PMID: 27857845      PMCID: PMC5106463          DOI: 10.1016/j.jare.2016.04.002

Source DB:  PubMed          Journal:  J Adv Res        ISSN: 2090-1224            Impact factor:   10.479


Introduction

Edge Detection (ED) that provides continuous contours of the object boundaries is low-level feature detection in image analysis and computer vision such as shape recognition, 3D reconstruction and defect detection on mechanical parts. Precise information about edges is vital to the success of such systems. Edges are sets of pixels in the image regions with sharp intensity changes and correspond to visible contour features of objects in an image. Normally, ED is a process that inputs a grey scale image and then results in a binary edge map to indicate the edges of objects [1], [2]. The shape of edges depends on many parameters, such as geometrical and optical properties of an image, illumination conditions and noise level in the image [3]. Several ED theories and algorithms have been suggested in the recent decades [1]. They can be grouped into two categories, Gradient and Laplacian operators [1]. There are other ED methods such as snake methods [4] and mathematical morphology [5]. Methods based on Gradient operators mainly include the Roberts operator [6], the Prewitt operator [7] and the Sobel operator [8]. Methods based on Laplacian operators mainly include Laplacian of Gaussian Method [9] and Canny edge detector [10]. Both gradient-based and Laplacian based ED methods have some disadvantages such as noise sensitivity, illumination sensitivity and non-adaptive parameters [1]. Some new approaches that include a multi-scale method for ED based on increasing Gaussian smoothing and edge tracking [11] and a model based on the multi-scale and multi-expert analyses inspired by common vector approach and the concept of Gaussian scale [12] have been outlined. An objective performance analysis of statistical tests for ED of textured or cluttered images has been performed [13]. Most of the existing algorithms are based on first and second derivatives, Gaussian filters, statistics, soft computing techniques and different transforms. They employ a thresholding technique to classify a pixel as an edge or non-edge based on its magnitude, a pixel with a weak magnitude may be recognized as non-edge and accordingly the edges become broken. Noise phenomena is an important hindrance to the detection of continuous edges [14]. It causes some variation of pixel intensities and accordingly reduces the performance of an ED algorithm in noisy images. Another vital barrier that complicates the operation of ED is illumination phenomena that cause the magnitude of the edges in the illuminated areas to become weak [15]. Though most of the classical ED algorithms are computationally efficient and perform well while the image has good quality and the object contours are distinct, they are susceptible to noise and suffer from producing broken edges. Besides, these algorithms often may not effectively detect the object boundaries for complex objects with noise or with complex texture such as medical images, which are often vague, especially for skin lesions. Evolutionary algorithms such as harmony search optimization [16], ant colony optimization (ACO) [17], cuckoo search optimization [18] and particle swarm optimization [19] have been applied for ED with a view of overcoming the drawbacks of classical approaches. More recently Teaching–Learning-Based Optimization (TLO) has been suggested from the inspiration of teaching–learning mechanism in class rooms by Rao et al. [20], [21] and Rao and Patel [22] for solving complex optimization problems, and applied for real world optimization problems such as parameter optimization of modern machining processes [23], optimal power flow [24] and unit commitment [25], to date, it has not been applied to ED. The focus of this article was to develop an improved TLO (ITLO) algorithm for ED of digital noisy images with a view of effectively obtaining continuous and thin edges besides reducing broken and jagged edges. The results of the developed algorithm are compared with those of the ACO, Sobel and Canny edgy detection algorithms with a view of exhibiting the superiority of the algorithm.

Methodology

Improved TLO

TLO is developed from the inspiration of teaching–learning mechanism in class rooms for solving optimization problems and involves two crucial mechanisms, represented as teaching and learning phases.

Teaching phase

The teaching phase denotes the global search process of the TLO. The knowledgeable teacher attempts to enhance the performance of the learners through teaching. He aims to improve the mean grade point of each subject of all the learners to his level. The change in the grade point of the -th subject at -th iteration, , is expressed aswhere indicates grade of the -th subject of the teacher at -th iteration represents the mean grade of the -th subject at -th iteration and is computed by indicates the number of students denotes the teaching factor and is computed by The grades of each learner is updated bywhere is the grade point of the -th subject of the -th learner at -th iteration

Learning phase

The learning phase represents the local search mechanism of TLO. Each learner in the class room attempts to enhance his performance by acquiring knowledge through interaction with other learners. The grades of -th learner after interaction with -th learner are updated by the following:where is the performance measure of the -th learner.

Avoiding phase

The interactions in the learning phase may lead to inappropriate knowledge exchange between learners in such a way that the solution can be trapped at local minima. Another phase, represented as avoiding phase, is required to come out from the sub-optimal traps in addition to searching unexplored regions in the solution space. This phase is inspired from the fact that the learners in general intend to move with the teacher for learning and avoid the worst students with a view of keeping themselves away from the mischief activities of the worst students. The behaviour of learners in respect of worst students helps to explore the problem space more effectively and escape from the sub-optimal solutions. The behaviour of the worst student can be modelled bywhere denotes grade points of the worst student at -th iteration represents the modified grade points of the worst student at -th iteration is the maximum number of iterations The grade points of the learner as a result of avoiding the worst student can be modeled by the following equationswhere is the Euclidean distance between worst student and the learner and represents the avoiding rate. Eq. (7) permits the learners to avoid the worst student, thereby escaping from sub-optimal solution traps in the search space and improving the capability of exploration. It forces the population to arrive at the global best solution.

Proposed method

Many of the existing ED algorithms convolve a convolution matrix on an image to calculate the edge magnitude only for a single pixel at a time and then classify it as an edge or a non-edge by comparing with a thresholding technique, thereby falsely classifying the pixels with weak magnitudes as non-edges and a few noisy pixels with high magnitude as edges. It may cause discontinuous edges or some speckles to appear on a resulting edge map. The proposed method attempts to search the best possible segment of a given length of edge with a view of correcting the discontinues caused due to the presence of noises and illumination. The proposed method involves representation of decision variables associated with an edge segment and formation of a performance function.

Representation of control variables

The connectivity between a chosen pixel and its neighbouring pixel of an edge can be denoted by an angle that varies in the range of (0–360°) in steps of 45°, as marked in Fig. 1(a). An example edge segment, centred around a chosen pixel , is represented by a set of angles that represent directions to the next pixel and are encoded as indicated in Fig.1(b) and (c) respectively. The grade points of -th learner in the proposed method are tailored to denote the control variables associated with an edge segment for a chosen pixel as follows:where represents angle direction of the previous pixel to -th pixel of the edge segment
Fig. 1

(a) Eight movement direction, (b) representation of an Edge Segment centred around a pixel, (c) encoding.

In this representation, the first row and second row of entries indicate the first and second half of the edge segment, starting from the chosen pixel respectively.

Performance function

The ITLO algorithm searches for global best solution by maximizing a performance function , which is to be formulated for each of a chosen pixel . In the light of the fact that an edge is a set of continuous pixels that result in two regions: the light and dark regions, as indicated in Fig.1(b), the proposed method processes a set of pixels at a time instead of a single pixel with a view to extract the real edge. The set of consecutive pixels is identified as an edge, when they maximize the interset distance between the pixel intensities of the two regions and minimize the interset distances within the regions. The edge magnitude of a chosen pixel in a movement direction in terms of interest and intraset distances can be formulated as a maximization function [19] aswhere indicates edge magnitude of in a movement direction- and denote average intensity of the dark and light regions in movement direction- for pixel respectively and are weight factors represents intensity of the neighbouring pixel and indicate dark and light regions around the chosen pixel denotes total number of pixels in each half of the edge segment around the chosen pixel The edge magnitude of a chosen pixel in a movement direction can be thinned [26] by employing the criterion of non-maxima suppressionwhere indicates non-maxima suppression factor of pixel in a movement direction- and is evaluated by represents thinned edge magnitude of in a movement direction- indicates edge magnitude of in a movement direction- The probability of pixel lying on an edge in a movement direction can be represented by a sigmoid function as The probability score of the edge segment of the chosen pixel can be written aswhere is the probability score of the edge segment represents the probability of pixel lying on an edge in a movement direction- indicates a threshold value obtained by Otsu’s method denotes the similarity index of the edge segment and is computed by The smoothness of the edge segment can be written aswhere is smoothness of the edge segment represents a smoothness measure between two consecutive pixels based on movement direction and is written as The performance function of -th learner, , can be tailored as

Detection process

An initial population of learners is obtained by generating random values within their respective limits to every individual in the population, for each pixel, whose value is greater than Otsu’s threshold value of . The is calculated by considering grade points of each learner as connectivity angles, and the teaching, learning and avoiding phases are performed for all the learners in the population with a view of maximizing their performances. The iterative process is continued till convergence. The flow of the proposed method for obtaining the optimal edge map is shown in Fig. 2.
Fig. 2

Flow chart of the proposed method.

Results and discussion

The proposed method has been tested on a few real life images of airplane, egg, lifting body and Saturn [27], which are shown in Fig. 3. The size of these images is 256 × 256 pixels and the resolution is 8 bits per pixel. With a view of comparing and studying the performances of the proposed method, a metaheuristic robust method involving ACO [17] and two classical operators of Canny [10] and Sobel [8] is also applied to these test images for obtaining the edge maps. The heuristically chosen parameters , and , required in Eqs. (9), (16), the scale of sigma parameter and the threshold values for Canny and Sobel operators are given in Table 1. These parameters are found to yield satisfactory results for all the chosen test images even under noisy environment.
Fig. 3

Results of real life images. (a) Without any noise; (b) with Gaussian noise; and (c) with Impulse noise.

Table 1

List of parameters.

w1w2w3SigmaThreshold values
Real life images
Skin images
LowHighLowHigh
Proposed method90401800
Canny1.41420.040.10.1620.42
Sobel0.1
The resulting edge maps, obtained by the proposed method for real life images without any artificial noises, are presented in Fig.3(a). The results of the ACO, Canny and Sobel operators are also included in the figure. The visual comparison of these edge maps clearly indicates that the edges detected by the proposed method are more complete and thin. The performance of the ACO, Canny and Sobel operators is found to be good for these test images but the edge obtained by ACO is not thin. In order to study the performance under noisy environment, these images are corrupted by Gaussian and Impulse noises with a variance of 0.05. The ED algorithms are then applied to these corrupted images without applying any filtering with a view of studying the performance under noisy environment. The edge maps of the corrupted real life images are presented in Fig.3(b) and (c) for Gaussian and Impulse noises respectively. The visual comparison of these figures clearly indicates that the proposed method and ACO are able to reject both the Gaussian and impulse noises in obtaining the true edge maps, which are found to be similar to edge maps of uncorrupted images of Fig.3(a). The edge maps obtained by Canny operator are unclear, found to be distorted and deviate widely from the true edges for all the corrupted images with Gaussian and impulse noises. The deviations, while comparing with noiseless case, are more pronounced in Gaussian noises, while for impulse noises, they are comparatively lower. In case of Sobel operator, the distortions in the edge maps are comparatively lower than those of Canny operator. It can also be observed from these figures that the performance of Sobel is better for Gaussian noises than impulse noise environment. The qualitative visual analysis clearly indicates that the proposed method is complete, thin and robust in rejecting the both Gaussian and impulse noises. Though the ACO is reasonably good in rejecting both Gaussian and Impulse noises, it cannot produce thin edge maps. In the light of the fact that the proposed method performs much better than those of the existing methods, it is necessary to quantitatively analyse the results. The objective performance of ED was generally performed as a measurement of accuracy of the edge maps against an ideal ground truth image. As the ground truth images are not available for these real life images, the objective comparison is not made for these edge maps. In order to quantitatively measure the accuracy of the edge maps, the proposed method is applied to another set of medical images containing skin lesions with ground truth as shown in Fig. 4. The figure also includes the test images with Gaussian and Impulse noises. The edge maps are also obtained by ACO [17], Canny [10] and Marr_Hildreth method [28], [29] with a view of studying the performances.
Fig. 4

Results of skin lesions. (a) Without any noise; (b) with Gaussian noise; and (c) with Impulse noise.

The resulting edge maps, obtained by the proposed method, ACO, Canny and Marr_Hildreth methods for the medical images without any noises, with Gaussian and impulse noises are presented in Fig.4(a)–(c) respectively. The visual qualitative analyses of these figures confirm the findings of the aforesaid study on real life images. Many methods exist for performing the objective measurement, each aiming to provide the optimal method of measuring similarity to the ideal output. Among them, Pratt’s Figure of Merit (FOM) has been popularly used [30]. It lies in the range of (0–1) and can be evaluated by the following equation. A larger value, nearer to 1, indicates good performance.where and denote ideal and actual edge points in the ground truth and estimated edge points respectively represents the total number of pixels in image, is the distance between the pixel- in the estimated edge map and the nearest edge point in the idea edge map denotes a constant scale factor, typically set to 1/9. The FOM of these edge maps obtained by all the methods for images with different Gaussian and Impulse noise levels is evaluated and presented through Fig. 5. The FOM of the proposed method for all test images is very nearer to unity and the variation is almost flat. However, the FOMs of Canny and Marr_Hildreth methods are smaller than the proposed method and rapidly decrease with increase in noise level. The decay of FOM of ACO is slightly inferior to proposed method but better than Canny and Marr_Hildreth. It is very clear from these results that the proposed method is less affected by the increased noises compared to other methods, thereby establishing that the proposed method is robust.
Fig. 5

Quantitative performance comparison.

The edge maps are also obtained by varying the scale of sigma parameter of Canny operator in the range of 0–2.6 and their FOM values are evaluated for the three skin lesions with and without Gaussian and Impulse noises. The FOM values are graphically compared with those of the proposed method in Fig. 6. The results clearly indicate that the performance of Canny operator with different scale of sigma parameters is inferior to the proposed method. The aforesaid discussions clearly indicate that the proposed method outperforms the existing approaches and is suitable for ED of digital images, especially in noisy environments. The average execution times of all the methods are given in Table 2. It is well known that Canny and Sobel operators are very efficient as they involve first order derivatives. The Marr_Hildreth method involves second order derivatives and takes little higher execution time. The evolutionary algorithms such as ACO and ITLO involve huge computations over sufficient number of iterations and require huge execution time. While comparing the execution time of the proposed method with ACO based method, the proposed method is 1.39 times faster, besides offering robust solution.
Fig. 6

Performance variation with scale factor ‘sigma’ for skin images.

Table 2

Comparison of average execution time.

Average execution time (s)
Proposed method6.7
ACO9.3
Canny0.54
Sobel0.54
Marr_Hildreth1.76

Conclusions

TLO, comprising two phases of teaching and learning, is a population based algorithm that simulates the teaching–learning process in the classroom. The ‘Teaching Phase’ represents learning from the teacher and ‘Learning Phase’ indicates learning by the interaction between learners. The ITLO has been developed by including a third phase denoted by “Avoiding Phase” that helps to keep the learners away from the worst students with a view of exploring the problem space more effectively and escaping from the sub-optimal solutions. The ED problem of digital images has been formulated as an optimization problem and solved using the ITLO. The developed method has been applied on both the real life and medical images and the edge maps have been obtained. The results clearly exhibit that the developed method is robust in producing the edge maps even under noisy environment.

Conflict of interest

The authors have declared no conflict of interest.

Compliance with Ethics Requirements

This article does not contain any studies with human or animal subjects.
  3 in total

1.  The Max Roberts Operator is a Hueckel-Type Edge Detector.

Authors:  A Rosenfeld
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1981-01       Impact factor: 6.226

2.  A computational approach to edge detection.

Authors:  J Canny
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1986-06       Impact factor: 6.226

3.  Theory of edge detection.

Authors:  D Marr; E Hildreth
Journal:  Proc R Soc Lond B Biol Sci       Date:  1980-02-29
  3 in total
  1 in total

1.  Automatic extraction channel of space debris based on wide-field surveillance system.

Authors:  Ping Jiang; Chengzhi Liu; Wenbo Yang; Zhe Kang; Cunbo Fan; Zhenwei Li
Journal:  NPJ Microgravity       Date:  2022-05-05       Impact factor: 4.970

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.