Knowledge based object
extraction technique Mohamed
Abdel-salam PhD candidate & Research Assistant Department of Geomatics Engineering, The University of Calgary 2500 University Drive N.W. Calgary, AB, T2N 1N4, Canada Email: mabdel@ucalgary.ca Phone +1 (403)-282-9308 Fax +1 (403)284-1980 Abstract Since the beginning of remote sensing technology, bottom up methods for interpretation of scenes monopolize remote sensing activities. For example, in the conventional per-pixel classification, which is a bottom up approach, the system uses the information hidden in spectral data only. Such techniques suffer from drawbacks caused by mixed pixels and spectral overlap. Knowledge about objects, sensor and environment is hardly considered. Therefore a lot of problems appear in the conventional methods are related to insufficient representation of the available knowledge. The difference between human interpretation and any computer assiste d image analysis is that humans use spatial information like texture, shape, shade, size, site, association and topology, while computers are powerful in handling the gray values in the image. In this paper a new technique for object recognition and identification is proposed. The proposed technique employs knowledge about objects and edge spread function. These two types of knowledge are hardly included in the conventional methods. The heart of proposed technique is the integration of the existing knowledge and numerical optimization techniques. A new proposed cost function, which is used by the optimizer, and a rectangular class of objects have been studied in this paper. The high performance of the method is clearly shown in the obtained results. Varieties of objects have been extracted with the proposed method some of them as will be shown are almost impossible for bottom up approach. Introduction Bottom approaches for remote sensing data interpretation use only spectral knowledge represented in pixels. Other existing knowledge in the scene is not considered at all. These exiting knowledge can be categorized into three main branches. The first branch is related to the object: geometric and radiometric. The second branch is related to the sensor: imaging technique, bluer model, noise model, image acquisition model and projection model. The third branch is related to the environment: path and illumination model (Mulder and Fang, 1994), (Van der Heijden, 1994). For most farms, the agriculture fields enjoy certain shapes and border thickness, and even orientations. Integrating this simple existing knowledge in the interpretation process can enhance the outcome. The way to integrate different types of knowledge in known as model based image analysis. It is popular in different areas like industrial applications, medical applications and computer vision. It tries to model a scene in terms of objects and parameters and then it estimates these parameters. Therefore, using model based approach for interpreting imagery can be powerful. Proposed Technique The proposed technique starts with the initiation of a hypothetical object model and it checks the corresponding statistical information of the multispectral information inside this object. Based on this statistical characteristic, the object parameters are adjusted through a numerical optimizer till a given satisfactory statistical criterion is satisfied. This criterion acts as a cost function. Then the object will be stored as identified object. The method will start again to search for another object, avoiding the positions of the previous objects.
Several experiments with real data are implemented. These experiments are divided into two main categories; the first category deals with scenes consisting of rectangular shapes only, while the second x.category deals with scenes consisting of rectangular shapes in addition to other shapes. For numerical optimization, gradient information, steepest descent optimizer, is used at the beginning to move the initial parameters to a good location, a low cost area, then different simplex iterations deliver the result to each other. The simplex iterations start work by beginning with population, initial values, of a big step and ending with population with a small step. The reason for this is to overcome local minima as much as possible. 1. Scenes with Rectangular Shapes Only Three different scenes are selected. The first scene (154x172 pixels) consists of two different objects, the second (114x107) consists of three objects with the same crop type and the third (373x342) contains six objects with three different crops. The scenes are shown in figure (1). Before going to further processing it is necessary to obtain information about the width of the edge spread function. Since there is no information available about it, the edge-spread function is assumed to be a piecewise linear symmetrical function. A width of three pixels is selected for edge spread function. This value is used in the real data experiments. It should be noted that experiment 1 and 3 contain an object with a repeated pattern texture, a glass house, which is a problematic issue in bottom up approaches. Fig (1) shows an example of the output of our MBIA algorithm. The figure shows an automatic overlay of DXF layer that contains the extracted object and the input image. In addition to the DXF layer, the algorithm produces five geometrical parameters for each object. Figure (1) Result of real data (scenes with only rectangular shapes) The results of the three experiments show that the method detected and extracted the existing objects. Some error appears in experiment 1 due do inaccurate modeling of ESF (the given value of ESF width was too big). In experiment 2, the existence of noise near the corners makes one of the objects to be extracted with some error. The reason is that the optimizer could not overcome the local minimum due to this noise. In experiment 3 the overlaying of the extracted objects by the image shows that all objects are extracted successfully. Only the green house object is too large. The reason is that the similarity between the glasshouse and the surrounding which makes both of them appear as one object. The similarity is mainly, between the neighbor object and one of the textures that construct the glasshouse. 2. Irregular Shapes To consider agriculture fields as only rectangles is hardly realistic. The existence of irregular shapes beside the rectangular shapes is more probable. In this section a method for detection of only rectangular shapes from a scene is explained. Considering an irregular shape as shown in figure (2) left, Noise.until now the proposed method will try to find a rectangle with minimum cost inside this irregular shape. For example consider the solid rectangle, candidate object, as identified object inside the actual irregular object. This object should be rejected. The rejection method is based on the detection of any irregular extension of the candidate rectangle. Studying the two hypothetical thin objects (ob1 and ob2), it is evident that Ob2 has lower cost value compared with Ob1. The reason is that since an object is homogeneous, increasing the size of the object reduces the cost function value. This previous remark is considered the criteria for rejection of an irregular object. To reject a rectangle, in other words not to extract a candidate rectangle with minimum cost function, several hypothetical thin rectangles inside and just outside this candidate object are checked. Figure (2) right shows the checks location. Since we are interested in detecting any small amount of change across the border, the StdM as cost function for check is more convenient in this case. Figure 2 Irregular shapes rejection strategy and checks locations The parameters of those thin rectangles depend on the definition of the rectangular shape. In the current context the rectangular shape is identified as any candidate rectangle (with minimum cost function) and satisfies the following properties.
Scenes with Rectangular Shapes in Addition to non Rectangular Shapes Three different scenes are used in this stage. The first scene contains only one rectangle. The second one contains two rectangles; the third one contains three rectangles. The edge spread function width adopted in these scenes is 1.5 pixels. Figure (3) shows that the entire expected rectangular object was successfully detected and extracted. One more object (the third object in third experiment) was expected to be detected; the object fails to satisfy the rules of the rectangular shapes. The reason for that is the existence of a homogeneous zone near the end of the object, which considers extension for the object. Figure 3 Results of second type of scenes Analysis of the results A twofold criterion is used to analyse the result. It is qualitative in terms of number of extracted objects compared to the number of actual objects in the scene and the running time and quantitative in terms of numerical figures, which describe the results. The qualitative result of the experiments shows that the technique identified all expected rectangles in the scenes. Only one rectangle was expected to be extracted in the third experiment of the irregular shapes and later this rejection is explained. This object was not concordant with the rules that define the rectangular shapes. The running time as component of qualitative assessment points out that the Matlab code is not optimised and it is time consuming. The quantitative component of the assessment is explored in the following sections. 1. Method of quantitative evaluation To evaluate quantitative success of the procedure, we have to find truth data to assess the results. To do that the images that are us ed in the experiments are digitised using on screen digitising. The digitising is done in Arcview (GIS software). The delineating of boundaries is done in such a way that it is as close as possible to the reality. It is necessary to say that subjectivity and human delineating errors are inevitable. The extracted objects are overlaid by the output of on screen digitising. After overlying the results and the truth data, Arcinfo (GIS software) is used to process the data. Arcinfo is mainly used to build topology and to calculate the overlapped area. Mainly three quantities are measured; wrongly predicted parts, parts not predicted, and correctly predicted parts as shown in figure 4. These measurements are done for each object for each test. The chosen quantita tive criterion for assessment is twofold. The first is related to correctness of extraction and the other relates to the error in extraction. For the first part the correctness in extraction is considered as the ratio between the correctly extracted areas to the area of the digitised object. The second part, which is related to the error in extraction, is divided into two subparts. The first subpart is the ratio of the wrongly predicted area to the.digitised area; the second subpart is the ratio of the not predicted part to the digitised area. It should be clear that the summation of the three quantities does not produce 100% because these ratios are normalised by the digitised area. The three quantities: the ratio of correct prediction, the ratio of wrong prediction and the ratio of the not predicted parts are 97.01,7.56, and 2.99% respectively for the six experiments. Figure 4 Assessment procedures This paper has introduced a method for geometrical object parameter estimation of model shapes from multispectral images. A rectangle model shapes is used. The paper presents a new cost function. The cost function avoids the problems of using training data in the extraction of agriculture fields. The nature of the cost function makes the technique independent of assuming any radiometric distribution of the agriculture field. The output results of several scenes show that the method can detect and extract the objects with high accuracy. The output of the technique is easy to integrate with GIS. It is produced in two forms, the first is DXF file and the second is the five parameters of each object. ACKNOWLEDGEMENTS This work was done under the Netherlands fellowship program, at ITC, the Netherlands. ITC is deeply appreciated. References
|