Academia.eduAcademia.edu

Seabed segmentation using optimized statistics of sonar textures

2008

In this paper, we propose and compare two super-4 vised algorithms for the segmentation of textured sonar images, 5 with respect to seafloor types. We characterize seafloors by a set of 6 empirical distributions estimated on texture responses to a set of 7 different filters. Moreover, we introduce a novel similarity measure 8 between sonar textures in this feature space. Our similarity mea-9 sure is defined as a weighted sum of Kullback-Leibler divergences 10 between texture features. The weight setting is twofold. First, 11 each filter is weighted according to its discrimination power: The 12 computation of these weights are issued from a margin maxi-13 mization criterion. Second, an additional weight, evaluated as an 14 angular distance between the incidence angles of the compared 15 texture samples, is considered to take into account sonar-image 16 acquisition process that leads to a variability of the backscattered 17 value and of the texture aspect with the incidence-angle range. 18 A Bayesian framework is used in the first algorithm where the 19 conditional likelihoods are expressed using the proposed similarity 20 measure between local pixel statistics and the seafloor prototype 21 statistics. The second method is based on a variational framework 22

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 2 Seabed Segmentation Using Optimized Statistics of Sonar Textures 3 Imen Karoui, Ronan Fablet, Jean-Marc Boucher, Member, IEEE, and Jean-Marie Augustin 1 Abstract—In this paper, we propose and compare two supervised algorithms for the segmentation of textured sonar images, with respect to seafloor types. We characterize seafloors by a set of empirical distributions estimated on texture responses to a set of different filters. Moreover, we introduce a novel similarity measure between sonar textures in this feature space. Our similarity measure is defined as a weighted sum of Kullback–Leibler divergences between texture features. The weight setting is twofold. First, each filter is weighted according to its discrimination power: The computation of these weights are issued from a margin maximization criterion. Second, an additional weight, evaluated as an angular distance between the incidence angles of the compared texture samples, is considered to take into account sonar-image acquisition process that leads to a variability of the backscattered value and of the texture aspect with the incidence-angle range. A Bayesian framework is used in the first algorithm where the conditional likelihoods are expressed using the proposed similarity measure between local pixel statistics and the seafloor prototype statistics. The second method is based on a variational framework as the minimization of a region-based functional that involves the similarity between global-region texture-based statistics and the predefined prototypes. IE E Pr E oo f 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1 Fig. 1. Typical sidscan sonar image (Rebent, IFREMER). 26 Index Terms—Active regions, angular backscattering, feature 27 selection, level sets, maximum marginal probability (MMP), seg28 mentation, sonar images, texture. I. I NTRODUCTION 29 COUSTIC remote sensing, such as high-resolution multibeam and sidescan sonars, provides new means for in situ observation of the seabed. The characterization of these high33 resolution sonar images is important for a number of practical 34 applications such as marine geology, commercial fishing, off35 shore oil prospecting, and drilling [1]–[4]. 36 The segmentation and the classification of sonar images with 37 respect to seafloor types (rocks, mud, sand, . . .) is the key 38 goal behind the analysis of these acoustic images. This task, 39 however, raises two major difficulties. The first task is to deal 40 with texture in these images. Previous methods are generally 41 based only on backscattered (BS) intensity models, and several 42 parametric families of probabilistic distribution functions have 30 31 32 A Manuscript received January 4, 2008; revised May 20, 2008 and August 11, 2008. I. Karoui, R. Fablet, and J.-M. Boucher are with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, 29238 Brest Cedex 3, France (e-mail: [email protected]). J.-M. Augustin is with the Department of Acoustics and Seismics, Institut Français de Recherche et d’Exploitation de la Mer, 29280 Brest, France. Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TGRS.2008.2006362 Fig. 2. BS evolution with the incidence angles for the three seafloor types of Fig. 1: sand, mud, and sand ripples. been suggested (Rayleigh distribution, K distribution, Weibull 43 distribution, etc.) [3]–[8]. These first-order statistics are not 44 sufficient when high-resolution sonar images involve textures, 45 which is the case of most sonar images (Fig. 1). 46 The other important issue arising in seabed texture charac- 47 terization is a built-in feature of sonar observation: The value 48 of BS measure depends both on the seafloor type and on the 49 incident angle of the reflected acoustic signal, ranging typically 50 from −85◦ to +85◦ . Fig. 2 shows the BS evolution for three 51 different seafloor types: sand, sand ripples, and mud. In addition 52 to the BS variability within incidence angles, seafloor textures 53 0196-2892/$25.00 © 2008 IEEE 2 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 Fig. 3. Sonar image composed of sand ripples. (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. 55 56 57 for two angular sectors. (Left) [5◦ , 40◦ ]. segmentation schemes. We first state the segmentation issue 93 as a Bayesian pixel-based labeling according to local texture 94 features. The second approach relies on a region-level vari- 95 ational framework, which resorts to a level-set minimization 96 of an energy criterion involving global-region-based seafloor 97 statistics. 98 This paper is organized as follows. The seafloor similarity 99 measure is introduced in Section II. The Bayesian segmentation 100 method is detailed in Section III. The region-based segmen- 101 tation algorithm is described in Section IV. Experiments are 102 reported and discussed in Section V, and conclusions are drawn 103 in Section VI. 104 IE E Pr E oo f are dependent on the incidence angles. Figs. 3 and 4 show a sonar image composed by sand ripples and rocks, respectively, for two angular sectors: [80◦ , 85◦ ] and [5◦ , 40◦ ]. The texture of sand ripples shows a loss of contrast in the specular sec58 tor [5◦ , 40◦ ]: The steep grazing angle reduces the backscatter 59 differences between facing and trailing slopes, while at low 60 incidence angles, much of the variation is lost due to increasing 61 sonar shadow. A similar loss of contrast is observable in the 62 rock samples (Fig. 4). BS behavior according to the incidence 63 angles has been of wide interest for sonar imaging [3], [9]–[12]. 64 Parametric and nonparametric techniques have been proposed 65 to model sonar-image behavior with respect to the incidence 66 angle variations. The effect of the incidence angle on the BS 67 has also been explored as a discriminating feature for seafloor 68 recognition [4], [13], [14]. However, no studies have proposed 69 a method to accurately compensate this phenomena because 70 of the joint dependence of the seafloor types and on the local 71 bathymetry which is generally unknown for sidescan sonar 72 images. To our knowledge, the effects of the incidence angle on 73 textured seabed features have not been addressed for segmen74 tation issues. Only some studies were interested in simulating 75 the behavior of oriented and textured seafloor types [15]–[18]. 76 These methods are mainly based on shape-from-shading [19] 77 and were restricted to synthetic images or to real sonar images 78 involving only one seafloor type. 79 In this paper, we aim at using texture information within 80 sonar seabed images and at developing segmentation algo81 rithms that take into account angular variability of BS and 82 textural features. We propose to characterize seafloors by a 83 wide set of marginal distributions of their filter responses, and 84 we measure seafloor similarities according to a weighted sum 85 of Kullback–Leibler divergences [28] in this feature space. 86 To cope with seafloor angular dependence, we introduce an 87 additional weighting factor, evaluated as an angular distance 88 between the compared texture samples: This angular distance 89 is measured according to Gaussian kernels, whose variance 90 sets the level of the angular variability, depending on textures 91 and seafloor types. The proposed incidence-angle-and-texture92 based similarity measure is exploited to develop two different 54 Fig. 4. Rock texture (Right) [80◦ , 85◦ ]. II. S ONAR -T EXTURE S IMILARITY M EASURE 105 Texture-based segmentation of seabed sonar images gener- 106 ally relies on Haralick [30] parameters or scalar spectral and 107 filter coefficients to model textures [20]–[22]. Recently, in the 108 field of texture analysis, features computed as statistics of local 109 filter responses have been shown to be relevant and discriminant 110 texture descriptors [23]–[27]. Motivated by these studies, we 111 propose to use texture features computed as marginal distrib- 112 utions of a wide set of filter seafloor responses. Each seafloor 113 type denoted by Tk is characterized by a set Qk composed of 114 the marginal distributions of the seafloor with respect to the 115 predefined filters. The following conditions are issued. 116 1) 121 cooccurrence distributions with horizontal and ver- 117 tical displacements denoted by dx and dy , respectively, 118 (dx , dy ) ∈ {0, 1, . . . , 10}. 119 2) 50 distributions of the magnitude of Gabor filter 120 responses, computed for combination of parameters 121 (f0 , σ, θ), where f0 is the radial frequency, σ is 122 the standard√deviation, and θ is the √ orientation, such 123 that f0 ∈ { 2/2k }k=1:6 , σ ∈ {2k 2}k=2:5 , and θ ∈ 124 {0◦ , 25◦ , 45◦ , 90◦ , 135◦ }. 125 3) 48 distributions of the energy of the image wavelet packet 126 coefficient computed for different bands (we used three 127 wavelet types: Haar, Daubechies, and Coiflet). 128 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES 3 Gaussian mixture, and the assignment likelihood p(Θj /zs ) is 147 given by 148   −(Θj −θs )2 exp σ2 .  j p(Θj /zs ) =  (1) −(Θi −θs )2 J exp 2 i=1 σ i Hence, the assignment of a given region R to an angular 149 subdomain is given by 150  1 p(Θj /zs ), ds. (2) πR (j) = |R| s IE E Pr E oo f Fig. 5. Cooccurrence distributions computed for parameters (d, θ) = (6, 0◦ ). (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. The angular bounds, defined by the parameters Θj and σj , 151 are set experimentally, and each seafloor type is character- 152 ized by Qk = {Qf,j (Tk )}f =1:F,j=1:J , its filter responses es- 153 timated on the J angular domains. Note that f accounts 154 both for filter types and various associated parameteri- 155 zations: f = 1 → 121 refers to cooccurrence distributions, 156 f = 122 → 171 to Gabor filter, and f = 172 → 219 to 157 wavelet-based distributions. These statistics are computed us- 158 ing Parzen window estimation [29]. Formally, Qf,j (Tk , α) = 159 (1/πTk (j)) Tk p(Θj /zs )gσf (hf (s) − α)ds, where hf is the 160 filter response indexed by f (for a cooccurrence matrix with pa- 161 rameters μ = (d, θ), hf : Ω → [1, N g] × [1, N g], and hf (s) = 162 (I(s), I(s + μ)), where I(s) is the gray value at pixel s and N g 163 is the total gray level number) and gσf is a Gaussian kernel with 164 zero mean and standard deviation σf . 165 We define the similarity measure between a texture sample 166 T with feature set D(T ) = {Df,j (T )}f =1:F,j=1:J and a given 167 seafloor type Tk as follows: 168 J  k  KLΘ w Q , D(T ) = Fig. 6. Cooccurrence distributions computed for parameters (d, θ) = (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. (6◦ , 0). Sonar-texture variability with respect to the incidence angles induces a variability in texture features. Figs. 5 and 6 show the cooccurrence matrices of the images involving sand ripples 132 and rocks shown in Figs. 3 and 4, respectively. A change in 133 cooccurrence distributions can be noticed. In the angular sector 134 [5◦ , 40◦ ], the cooccurrence is bimodal due to the alternation of 135 dark and light values in the image of sand ripples, whereas in 136 the sector [80◦ , 85◦ ], the loss of contrast between dark and light 137 values in the image of sand ripples leads to unimodal distribu138 tion. For cooccurrence distributions related to rock samples, a 139 change in the variance is depicted. 140 To deal with this problem, we propose to define angular 141 subdomains, in which texture characteristics can be regarded 142 as homogeneous. Each angular sector is indexed by j and 143 is characterized by a mean incidence-angle value Θj . For144 mally, this leads in introducing a state variable zs assign145 ing pixel s to a given angular subdomain. (θs , zs ), where 146 θs is the incidence angle of pixel s, is then modeled as a 129 130 131 F j=1 f =1   πT (j)wf2 KL Qkf,j , Df,j (T ) (3) where {wf2 }f =1:F are the feature weights such that 169 F 2 f =1 wf = 1, Df,j (T ) is the distribution related to 170 filter f and estimated on  region T for angular sector j: 171 Df,j (T, α) = (1/πT (j)) T p(Θj /zs )gσf (hf (s) − α)ds, and 172 KL denotes for the Kullback–Leibler divergence [28]; 173 for  two probability distributions Q and D, KL(Q, D) = 174 Q(α)log(Q(α)/D(α))dα. 175 The resulting weighting factors {wf2 }f =1:F are exploited on 176 the one hand for filter selection to keep only the distributions 177 corresponding to the significant weights and, on the other hand, 178 for the definition of an optimized texture-based similarity mea- 179 sure KLΘ w given the selected distributions. In a supervised con- 180 text, the weights are estimated from a training set T composed 181 of N -labeled texture samples: T = {(D(T ), sT )}, where sT is 182 the class of sample T . Formally, {wf2 }f =1:F are issued from the 183 maximization of the global margin defined as follows: 184 MwT = Mw (T ) (4) T ∈T where  d  Θ sT T Mw (T ) = KLΘ w Q , D(T ) − KLw (Q , D(T )) 185 (5) 4 186 187 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 where dT is the nearest class to T different from sT according to the similarity measure KLΘ w  k  (6) dT = arg min KLΘ w Q , D(T ) . k=sT iterative conditional estimation procedure [42]. Using Bayes 227 rule, the posterior distribution is expressed as follows: 228 p(X, Y ) = p(Y /X)p(X) = p(X) The maximization of the margin criterion MwT is carried out using a stochastic gradient method as detailed in our previous 190 work [39]. 188 189 191 III. B AYESIAN S ONAR -I MAGE S EGMENTATION As far as Bayesian image segmentation is concerned, the most popular criteria are the maximum a posteriori (MAP) 194 [31] and the maximum marginal probability (MMP) [32]. It 195 has been shown that the MMP estimation criterion is more 196 appropriate for image segmentation than the MAP criterion 197 [32]. The MAP estimate assigns the same cost to every incorrect 198 segmentation regardless the number of pixels at which the 199 estimated segmentation differs from the true one, whereas the 200 MMP algorithm minimizes the expected value of the number of 201 misclassified pixels. As shown in [32], the MMP procedure is 202 equivalent to maximizing the marginal of the class labels. Let 203 us introduce the following notations: 204 1) S, the image lattice composed of N pixels; 205 2) X = {xs }s∈S , the label random field; 206 3) Y = {ys }s∈S , the random field of the observations, the 207 textural feature in our case. 208 Formally, the MMP scheme is equivalent to the maximi209 zation of x̂MMP = arg max p(xs = k/y). s k∈{1:K} (7) In general, pixel conditional likelihood is computed according to local texture features like Gabor or wavelet coefficients. Here, we aim at using the proposed similarity measure KLΘ w. We associate to each pixel s a set of features D(Ws ) = 214 {Df (Ws )}f =1:F estimated according to a Parzen estimation 215 method [29], within a square window Ws centered at s. The 216 window size that we denote by TW is set by the user according 217 to texture coarseness. In our case, the observation denoted 218 by a random field Y is specified by {ys = D(Ws )}s∈S . The 219 conditional likelihood at each pixel s with respect to class k is 220 then defined from the similarity measure KLΘ w by 210 211 212 213 k −KLΘ w (Q ,D(Ws )) 221 222 223 exp . p(ys /xs = k) = K i −KLΘ w (Q ,D(Ws )) i=1 exp s∈S t∈cs −αc (1−δ(xs ,xt ))+log(p(ys /xs )) . (10) The maximization of local probabilities p(xs = k/y) is car- 229 ried out as follows [32]: We use the Gibbs sampler to gen- 230 erate a discrete-time Markov chain X(t) which converges in 231 distribution to a random field with probability mass functions 232 P (X/Y ). The marginal conditional distributions p(xs = k/y) 233 are then approximated as the fraction of time the Markov chain 234 spends in state k for each class k and each pixel s. The MMP 235 segmentation steps are the following. 236 1) Simulation of Tmax realizations of x1 , x2 , . . . , xTmax of 237 X using Gibbs sampler. 238 2) Using the realizations x1 , x2 , . . . , xTmax , p(xs /y) is esti- 239 mated using the frequency of each realizations 240     δ x1s − k + · · · + δ x1s − k p(xs = k/y) = . Tmax s∈S t∈cs αc (1 − δ(xs , xt )) 3) Choose as xs the class that maximizes p(xs = k/y). 241 IV. V ARIATIONAL S ONAR -I MAGE S EGMENTATION 242 Unlike the Bayesian scheme, the second approach is stated 243 at a region level as the minimization of a constrained energy 244 criterion E({Ωk }k=1:K ) = E1 + E2 , where Ωk is the domain 245 composed of all pixels attributed to the class k, E1 is a texture- 246 based data-driven term, and E2 is a regularization term detailed 247 as follows. 248 A. Functional Terms E ({Ωk }k=1:K ) is evaluated as the log-likelihood of a given 250 partition with respect to texture models. It is evaluated as the 251 sum of the similarities according to the measure KLΘ w between 252 each region Ωk and its corresponding class Tk 253 K E1 ({Ωk }k=1:K ) = k=1  k  KLΘ w Q , D(Ωk ) (11) where Df,j (Ωk ) is the marginal distribution of the image 254 response to the filter indexed by f . For angular domain j, 255 Df,j (Ωk ) is estimated according to Parzen method [29]. 256 E2 is a regularization term, it penalizes the lengths of region 257 contours and is expressed by 258 (9) where δ is the delta function and αc ∈ {αH , αV , αD } are real parameters assigned, respectively, to horizontal, vertical, and 226 diagonal cliques. These parameters are estimated using the 249 1 (8) As a prior PX , we consider a Markov random field associated to an eight-neighborhood system with potential functions given by U2 (x) = 224 225 ≈ exp   IE E Pr E oo f 192 193 p(ys /xs ) s K E2 = k=1 γk |Γk |, γk ∈ ℜ+ (12) where |Γk | is the length of the contour Γk associated to the 259 region Ωk . 260 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES 261 B. Computation of the Evolution Equation We solve for the minimization of the functional E using a gradient-descent technique. It relies on the computation of 264 the first derivative of E according to regions {Ωk }k=1:K . The 265 evolution equation of region contours {Γk }k=1:K is then given 266 by the following dynamic scheme [33]: 262 263 ∂Γk (x,t) = Fk (x, t)Nk ∂t Γk (x, 0) = Γ0k (13) 5 where ∂ϕik /∂t, i = 1, 2, 3 are the evolution equation terms associated, respectively, to functionals Ei , i = 1, 2, 3. The derivatives of the energy terms E2 and E3 are directly estimated from level-set functions [35], [36] 287 288 289 290 ∇ϕk ∂ϕ2k (s, t) = γk δα (ϕk )div ∀k ∈ {1 : K} (19) ∂t |∇ϕk | K  ∂ϕ3k (s, t) =−δα (ϕk )λ (Hα (ϕk )−1) ∀k ∈ {1 : K}. ∂t k=1 where Nk is the unit inward normal to Γk at pixel x and at time t and Fk is the velocity field (in our case, Fk = ∇Ek , the 269 derivative of E with respect to Γk ). 270 The explicit implementation of the curve evolution according 271 to the latter dynamic scheme using a difference-approximation 272 scheme cannot deal with topological changes of the moving 273 front. This could be avoided by introducing the level-set method 274 proposed by Osher and Sethian [34]. The basic idea of the 275 method is the implicit representation of the moving interface Γ 276 by a higher dimensional hypersurface ϕ (the level-set function) 277 such that the zero level set of ϕ is actually the set of Γ and 267 268 K γk lim k=1 281 ∀s ∈ Ωk . (14) E2 can be expressed using level-set functions ϕk [36] E2 = α→0  δα (ϕk )|∇ϕk |ds (15) Ω where δα is a regularized delta function    1 1 + cos πs , 2α α δα (s) = 0, if |s| ≤ α if |s| < α. (16) In order to cope with multiclass segmentation and to fulfill the image-partition constraint, we use an additional term E3 given 284 by the following functional [38]: 282 283 λ E3 = 2   Ω 285 286 K k=1 F f =1    k wf2 p(Θj /zs ) KL Qkf,j , Df,j  − Qkf,j /Df,j ∗ gσf (hf (s)) − 1 |∇ϕk | The evolution of the contours {Γk }k=1:K (13) is then equivalent to the evolution of the level-set functions {ϕk }k=1:K [34] ∂ϕk (s, t) = Fk (s, t) |∇ϕk (s, t)| ∂t 280 J ∂ϕ1k (s, t) = − ∂t j=1 IE E Pr E oo f 278 The evolution equation related to E1 is more complex, since it 291 involves computations over the spatial support of each region. 292 To differentiate E1 , we use shape-derivative tools, particularly 293 the Gâteaux derivative theorem given in [37]. As detailed in the 294 Appendix, it leads to 295  Ωinside = {s ∈ Ω/ϕ(s) > 0} Ωoutside = {s ∈ Ω/ϕ(s) < 0} . 279 (20) 2 Hα (ϕk ) − 1 ds, λ ∈ ℜ+ where Hα is a regularized Heaviside function ⎧    ⎨ 12 1 + αs + π1 sin πs , if |s| ≤ α α Hα (s) = 1, if s > α ⎩ 0, if s < −α. As E = E1 + E2 + E3 , we have ∂ϕk ∂ϕ1k ∂ϕ2k ∂ϕ3k (s, t) = (s, t) + (s, t) + (s, t) ∂t ∂t ∂t ∂t (17) (21) where ∗ is the convolution symbol. 296 The evolution equation related to the energy E1 (21) is 297 composed of two terms. 298 J  F 2 k 1) A global term − j=1 f =1 wf p(Θj /zs )KL(Qf,j , 299 Df,j ): This term is always negative or null. It is a contrac- 300 tion force that  reducesthe size of heterogeneous regions. 301 F J 2 k 2) A local term f =1 wf p(Θj /zs )((Qf,j /Df,j ) − 302 j=1 1) ∗ gσf (hf (s)): This term locally compares the features 303 values at each pixel. This term can be positive or neg- 304 ative and aims at readjusting the statistics inside the 305 regions Df,j (Ωk ) to fit to prototype models Qkf,j . The 306 contribution of each descriptor f is weighted by wf2 and 307 p(Θj /zs ), the relative contribution of descriptor f and of 308 angular sector j. 309 The overall evolution equations of the contours {Γk }k=1:K 310 are the following: 311 ∂ϕk (s, t) ∂t = −δα (ϕk ) ⎡ J ×⎣ F wf2 p(Θj /zs ) j=1 f =1 ×  KL(Qkf,j , Df,j )− (18) + γk div ∇ϕk −λ |∇ϕk |  Qkf,j ∗ gσf (hf (s))+1 Df,j  K (Hα (ϕk )−1) k=1 ∀k. (22) We apply these coupled evolution equations until convergence. 312 6 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 Fig. 8. Feature weights. 1 → 121 cooccurrence distributions, 122 → 171 Gabor filter-based features and 172 → 219 wavelet-based distributions. (a) I1. (b) I5. IE E Pr E oo f TABLE I SEGMENTATION ERROR RATES Fig. 7. Test images and their manual segmentation (in black line). 313 V. E XPERIMENTAL R ESULTS AND D ISCUSSION In previous work, we have tested the method on various optic textures (Brodatz textures). The method was compared 316 to other texture-classification methods, and some results are 317 reported in [39] and [40]. Here, we evaluate the proposed 318 seabed-segmentation technique for different real sonar images 319 acquired by a sidescan sonar, as part of a natural seabed320 mapping project (IFREMER, Rebent Project) [41]. A reference 321 interpretation by an expert is available [41]. Fig. 7 shows the 322 set of images on which we carried out the experiments. We 323 superimposed on these images the manual expert segmentation. 324 Image I1 is composed of three seafloor types: rock, mud, and 325 marl ripples [41]; I2 of rock, marl ripples, and mud seafloors; 326 I3 of mud, sand, and marl ripples; I4 of marl and marl ripples; 327 and I5 of sand, sand ripples, and rock. For I2 and I3, the angular 328 variability of the sefloors, particularly marl ripples (for I2) and 329 marl (for I3), is visually clear. 330 For all these images, we first determine the most discriminant 331 features among the initial set of 219 features: We apply the 332 algorithm described in Section II and detailed in [39], and 333 we keep only the feature set such that the cumulative sum 334 of weights exceeds 0.9. Only a small number of features are 335 retained. For example, Fig. 8 shows the plot of feature weights 336 computed for image I1 and I5. For I1, the two cooccurrence 337 matrices account for more than 90% of the total weight sum; 338 these cooccurrence are computed for parameters (dx , dy ) ∈ 339 {(1, 4), (2, 1)}. For I5, the cooccurrence distributions com340 puted for parameters (dx , dy ) ∈ {(2, 1), (2, 2)} are selected. 341 For sonar images, we noticed that cooccurrence matrices are the 314 315 most selected features. In previous work on Brodatz textures 342 [39], [40], we remarked that Gabor and wavelet filters were 343 selected for oriented textures, whereas cooccurrence distribu- 344 tions, which, in addition to the detection of texture structures, 345 detect the intensity values change, are selected in the case of 346 texture having different intensity values and texture with regular 347 motifs. 348 For the five test images, three segmentation algorithms are 349 compared. 350 1) The maximum-likelihood (ML) segmentation denoted by 351 ML. This method consists in maximizing at each pixel the 352 conditional probability p(ys /xs ) given by (8) 353 xˆs = arg max p(ys /xs = k). k (23) Several analysis-window sizes are compared: TW ∈ {7 × 354 7, 17 × 17, 33 × 33}. 355 2) The MMP segmentation described in Section III, applied 356 for several analysis-window sizes: TW ∈ {7 × 7, 17 × 357 17, 33 × 33}. 358 3) The region-based variational segmentation described in 359 Section IV that we denote by V ar. 360 Table I summarizes the different error classification rates for all 361 segmentations. 362 All segmentation methods give quite good results according 363 to the mean classification error rates. MMP and variational 364 approaches are more efficient than the ML-based segmentation 365 because they take into account the spatial dependence between 366 pixels. The difference between these later approaches (MMP 367 and variational methods) mainly lies in the accuracy of the 368 localization of region boundaries and in the dependence of 369 MMP-based segmentation on the window size TW . In fact, 370 for the MMP segmentation, the use of small neighborhood 371 (TW = 7 × 7) leads to more accurate region frontiers but 372 small misclassified patches appear because of the neighborhood 373 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES MMP segmentation of I1 using TW = 7 × 7, τ = 12.5%. Fig. 10. MMP segmentation of I1 using TW = 17 × 17, τ = 9.2%. inefficiency for texture characterization. Conversely, the use of large window sizes (TW = 33 × 33) resorts to a lack of 376 accuracy in the localization of the boundaries of the seabed 377 regions because texture features extracted for pixels close to 378 region boundaries involve a mixture of texture characteris379 tics. The variational region-based approach does not need the 380 choice of an analysis window and operates globally on region 381 composed of pixels belonging to the same class. It resorts 382 to a tradeoff between segmentation accuracy and region ho383 mogeneity. Figs. 9–13 show examples of the dependence of 384 MMP segmentation on the sizes of analysis window and on the 385 robustness of the variational approach. In Fig. 14, we plot the 386 mean error rate for different window sizes. 387 We note that MMP and variational segmentations give sim388 ilar results when the analysis-window size is well chosen for 389 instance, TW = 17 × 17 for image I2 (Figs. 15 and 16), but 390 its performance depends a lot on the choice of this parameter. 391 This method can however be appropriate when the aim of the 392 segmentation is to detect texture regions and without seeking 393 for accurate boundaries. 394 The variational approach is also interesting because it is 395 much faster than the MMP segmentation. Being deterministic, 396 the variational approach can be very fast particularly if we use 397 appropriate initialization such as an initial segmentation based 398 on the ML criterion. Whereas the MMP segmentation needs 374 375 Fig. 11. MMP segmentation of I1 using TW = 33 × 33, τ = 13%. IE E Pr E oo f Fig. 9. 7 Fig. 12. Region-based segmentation of I1, τ = 7.5%. a large number of iterations, each iteration is also complex 399 and involves Gibbs sampling. For our implementations, the 400 convergence time of a variational image typically corresponds 401 to one iteration of the MMP algorithm. 402 To stress the interest of taking into account texture variability 403 with respect to incidence angles, additional segmentation re- 404 sults are reported for image I2 and I3 for which the seafloor 405 texture variability is clearer. In Table II, we summarize the 406 segmentation error rates for the segmentation with (J = 3) and 407 with no angular weighting, i.e., using only one angular domain 408 J = 1, for ML, MMP, and variational methods. In Figs. 17 and 409 18 are reported the associated segmentation results. It can be 410 noticed that a classical segmentation (without taking into ac- 411 count texture variability within incidence angles) cannot distin- 412 guish between visually similar seafloors (mud and marl ripples 413 for I2 and marl and sand for I3) near the specular domain. 414 VI. C ONCLUSION 415 We proposed two segmentation algorithms for sonar-image 416 segmentation: a Bayesian algorithm using local statistics and a 417 region-based variational algorithm, both based on a novel sim- 418 ilarity measure between seafloor-type images according to the 419 statistics of their responses to a large set of filters. This similar- 420 ity measure is expressed as a weighted sum of Kullback–Leibler 421 8 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 IE E Pr E oo f Fig. 13. Segmentations of I4. (a) MMP: TW = 7, τ = 4%. (b) MMP: TW = 33, τ = 6%. (c) Variational method: τ = 3%. Fig. 16. Variational-based segmentation of I2, τ = 3%. Fig. 14. Mean segmentation error rate for several window sizes. TABLE II SEGMENTATION ERROR RATES Fig. 15. MMP-based segmentation of I2 TW = 17, τ = 4%. divergences between individual seafloor filter-response statistics. The resulting weighting factors are exploited on the one hand for filter selection and, on the other hand, for taking into account the incidence angular dependence of seafloor tex426 tures. The conclusion is the cooccurrence matrices outperform 427 the other features for our sonar images. The results show 428 that the performance of the Bayesian approach depends on 429 the size of the analysis window. For pixel-based segmenta430 tion (ML and MMP segmentations), the size must be tuned 431 according to the coarseness of given textures. The results 432 also stress the suitability of the region-based approach as 433 compared to the Bayesian pixel-based scheme for texture seg434 mentation and demonstrate the interest of taking into account 422 423 424 425 Fig. 17. I2 region-based segmentation with and without angular weighting. (a) J = 1, τ = 23.5%. (b) J = 3, τ = 3%. Fig. 18. I3 MMP-based segmentation with and without angular weighting. (a) J = 1, τ = 16%. (b) J = 3, τ = 8%. the angular backscatter variabilities to discriminate between 435 seafloor types particularly near the specular sector. 436 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES A PPENDIX E VOLUTION E QUATION C OMPUTATION 437 438 441 Using the shape-derivative tools, we want to differentiate the functional  k  F (Ωk ) = KLΘ w Q , D(Ωk ) which can be written as follows: J F πΩk (j)wf2 j=1 f =1 442 443 444  Qkf,j (α) log Rf Qkf,j (α) Df,j (Ωk , α)  Then, we have J F wf2 πΩk (j) j=1 f =1 J F Finally, we get 457 ) dHjk (Ωk , V   = p(Θj /zs )  Γk Qkf,j (α) Qf (α) gσf (hf (s) − α) − G1 (f, α, Ωk ) πΩk (j) )  Hjk . Γk + πΩk (j) Hjk dα  )dα. dHjk (Ωk , V  · Nk )da(s) × (V   )dα dHjk (Ωk , V Rf 1 = πΩk (j) Rf    · Nk )da(s) × (V    Qkf,j (α) 1 = gσ (hf (s)−α) Qf (α) p(Θj /zs ) πΩk (j) Df,j (Ωk , α) f Rf  wf2 dπΩk (j)(Ωk , V j=1 f =1 Qkf,j ∂Hjk = . ∂πΩk (j) πΩk (j) Γk The Gâteaux derivative of F (Ωk ) in the direction of a vector  is given by field V )= dF (Ωk , V Qkf,j ∂Hjk =− ∂G1 G1 According to the previous theorem, we get 456   · Nk )da(s).  ) = − p(Θj /zs )gσ (hf (s) − α) (V dG1 (Ωk , V f p(Θj /zs )gσf (hf (s) − α) ds. F (Ωk ) = 446 dα. 2) Ωk 447  Let us introduce the following notations. 1)   Qkf,j (α) k k Hj = Qf,j (α) log . Df,j (Ωk , α) G1 = 445  455 Using the chain rule, we get   ∂Hjk ∂Hjk  )=  )+ ) dHjk (Ωk , V dG1(Ωk , V dπΩk(j)(Ωk , V ∂G1 ∂πΩk (j) IE E Pr E oo f 439 440 9  Γk   Qkf,j ∗ gσf (hf (s)) − 1 p(Θj /zs ) Df,j (Ωk )  · Nk )da(s). × (V (24) Finally, according to (24) 458 Rf 448 Theorem: See [37]. 449  The Gâteaux derivative 450 Ω k(s, Ω)ds is given by )= dK(Ω, V  ) dF (Ωk , V of a functional of the type K(Ω) =  )ds − ksh (s, Ω, V  N  )da(s) k(s, Ω)(V Γ Ω J =− F p(Θj /zs )wf2 j=1 f =1       k  Qkf,j × KL Qf,j , Df,j − −1 ∗ gσf (hf (s)) Df,j (Ωk ) Γk  ) is where Ω denoted a region and Γ its boundary; ksh (s, Ω, V the shape derivative of k(s, Ω). 453 In our case   ) = − p(Θj /zs )(V  · Nk )da(s) dπΩk (j)(Ωk , V 451 452 Γk 454 Hjk can be written as Hjk = Qkf,j (α) log  πΩk (j)Qkf,j (α) G1 (Ωk , f, j, α)  .    · Nk da(s). × V R EFERENCES 459 [1] P. Brehmer, F. Gerlotto, J. Guillard, F. Sanguinède, Y. Guénnegan, and D. Buestel, “New applications of hydroacoustic methods for monitoring shallow water aquatic ecosystems: The case of mussel culture grounds,” Aquat. Living Resour., vol. 16, no. 3, pp. 333–338, Jul. 2003. [2] M. Mignotte, C. Collet, P. Perez, and P. Bouthemy, “Hybrid genetic optimization and statistical model based approach for the classification of shadow shapes in sonar imagery,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 2, pp. 129–141, Feb. 2000. 460 461 462 463 464 465 466 467 10 [3] L. Hellequin, J. M. Boucher, and X. Lurton, “Processing of highfrequency multibeam echo sounder data for seafloor characterization,” IEEE J. Ocean. Eng., vol. 28, no. 1, pp. 78–89, Jan. 2003. [4] G. Le Chenadec and J. M. Boucher, “Sonar image segmentation using the angular dependence of backscattering distributions,” in Proc. Oceans—Europe, Brest, France, 2005, vol. 1, pp. 147–152. [5] G. Le Chenadec, J.-M. Boucher, and X. Lurton, “Angular dependence of K-distributed sonar data,” IEEE Trans. Geosci. Remote Sens., vol. 45, no. 5, pp. 1224–1236, May 2007. [6] E. Jakeman, “Non-Gaussian models for the statistics of scattered waves,” Adv. Phys., vol. 37, no. 5, pp. 471–529, Oct. 1988. [7] A. P. Lyons and D. A. Abraham, “Statistical characterization of highfrequency shallow-water seafloor backscatter,” J. Acoust. Soc. Amer., vol. 106, no. 3, pp. 1307–1315, Sep. 1999. [8] D. A. Abraham and A. P. Lyons, “Novel physical interpretations of k-distributed reverberation,” IEEE J. Ocean. Eng., vol. 27, no. 4, pp. 800– 813, Oct. 2002. [9] N. C. Mitchell, “Processing and analysis of Simrad multibeam sonar data,” Mar. Geophys. Res., vol. 18, no. 6, pp. 729–739, Dec. 1996. [10] G. Le Chenadec, J. M. Boucher, X. Lurton, and J. M. Augustin, “Angular dependence of statistical distributions for backscattered signals: Modeling and application to multibeam echosounder data,” in Proc. Oceans, Sep. 2003, vol. 2, pp. 897–903. [11] D. R. Jackson, D. P. Winbrenner, and A. Ishimaru, “Application of the composite roughness model to high-frequency bottom backscattering,” J. Acoust. Soc. Amer., vol. 79, no. 5, pp. 1410–1422, May 1986. [12] C. de Moustier and D. Alexandrou, “Angular dependence of 12-kHz seafloor acoustic backscatter,” J. Acoust. Soc. Amer., vol. 90, no. 1, pp. 522–531, Jul. 1991. [13] J. E. Hughes-Clarke, “Toward remote seafloor classification using the angular response of acoustic backscatter: A case study from multiple overlapping GLORIA data,” IEEE J. Ocean. Eng., vol. 19, no. 1, pp. 112– 127, Jan. 1994. [14] E. Pouliquen and X. Lurton, “Automated sea-bed classification system for echo-sounders,” in Proc. Oceans, Oct. 1992, vol. 1, pp. 317–321. [15] D. Langer and M. Herbert, “Building qualitative elevation maps from side scan sonar data for autonomous underwater navigation,” in Proc. IEEE Int. Conf. Robot. Autom., Apr. 1991, vol. 3, pp. 2478–2483. [16] R. Li and S. Pai, “Improvement of bathymetric data bases by shape from shading technique using side-scan sonar images,” in Proc. OCEANS, Oct. 1991, vol. 1, pp. 320–324. [17] E. Dura, J. Bell, and D. Lane, “Reconstruction of textured seafloors from side-scan sonar images,” Proc. Inst. Elect. Eng.—Radar Sonar Navig., vol. 151, no. 2, pp. 114–125, Apr. 2004. [18] A. E. Johnson and M. Hebert, “Seafloor map generation for autonomous underwater vehicle navigation,” Auton. Robots, vol. 3, no. 2, pp. 145–168, Jun. 1996. [19] R. Zhang, P. Tsai, J. E. Cryer, and M. Shah, “Shape from shading: A survey,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 21, no. 8, pp. 690– 705, Aug. 1999. [20] T. H. Eggen, “Acoustic sediment classification experiment by means of multibeam echosounders,” Acoust. Classification Mapping Seabed, vol. 15, no. 2, pp. 437–444, 1993. [21] L. M. Linnett, S. J. Clarke, and D. R. Carmichael, “The analysis of sidescan sonar images for seabed types and objects,” in Proc. 2nd Eur. Conf. Underwater Acoust., 1994, vol. 22, pp. 187–194. [22] M. Lianantonakis and Y. R. Petillot, “Sidescan sonar segmentation using active contours and level set method,” in Proc. OCEANS—Europe, Brest, France, 2005, vol. 1, pp. 719–724. [23] L. Xiuwen and W. DeLiang, “Texture classification using spectral histograms,” IEEE Trans. Image Process., vol. 12, no. 6, pp. 661–670, Jun. 2003. [24] O. G. Cula and K. Dana, “3D texture recognition using bidirectional feature histograms,” Int. J. Comput. Vis., vol. 59, no. 1, pp. 33–60, Aug. 2004. [25] R. Fablet and P. Bouthemy, “Motion recognition using nonparametric image motion models estimated from temporal and multiscale co-occurrence statistics,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, no. 12, pp. 1619–1624, Dec. 2003. [26] P. Nammalwar, O. Ghita, and P. F. Whelan, “Integration of feature distributions for colour texture segmentation,” in Proc. Int. Conf. Pattern Recog., Aug. 2005, vol. 1, pp. 716–719. [27] Q. Xu, J. Yang, and S. Ding, “Texture Segmentation using LBP embedded region competition,” Electron. Lett. Comput. Vis. Image Anal., vol. 5, no. 1, pp. 41–47, 2004. [28] S. Kullback, “On information and sufficiency,” in The Annals of Mathematical Statistics, vol. 22. New York: Wiley, 1951. [29] E. Parzen, “On estimation of a probability density function and mode,” Ann. Math. Stat., vol. 33, no. 3, pp. 1065–1076, Sep. 1962. [30] R. M. Haralick, “Statistical and structural approaches to texture,” Proc. IEEE, vol. 67, no. 5, pp. 786–804, May 1979. [31] S. Geman and G. Geman, “Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. PAMI-6, no. 6, pp. 721–741, Nov. 1984. [32] J. Marroquin, S. Mitter, and T. Poggio, “Probabilistic solution of ill-posed problems in computational vision,” J. Am. Stat. Assoc., vol. 82, no. 397, pp. 76–89, Mar. 1987. [33] M. Kass, A. Withkin, and D. Terzopoulos, “Snakes: Active contour models,” Int. J. Comput. Vis., vol. 1, no. 4, pp. 321–331, Jan. 1988. [34] J. A. Sethian, Level Set Methods. Cambridge, U.K.: Cambridge Univ. Press, 1996. [35] J. F. Aujol, G. Aubert, and L. Blanc-Féraud, “Wavelet-based level set evolution for classification of textured images,” IEEE Trans. Image Process., vol. 12, no. 12, pp. 1634–1641, Dec. 2003. [36] C. Samson, L. Blanc-Féraud, G. Aubert, and J. Zerubia, “A level set method for image classification,” Int. J. Comput. Vis., vol. 40, no. 3, pp. 187–197, 2000. [37] S. Jehan-Besson, M. Barlaud, and G. Aubert, “Image segmentation using active contours: Calculus of variations or shape gradients?” SIAM J. Appl. Math., vol. 63, no. 6, pp. 2128–2154, 2003. [38] H. K. Zhao, T. Chan, B. Merriman, and S. Osher, “A variational level set approach to multiphase motion,” J. Comput. Phys., vol. 127, no. 1, pp. 179–195, Aug. 1996. [39] I. Karoui, R. Fablet, J. M. Boucher, and J. M. Augustin, “Separabilitybased Kullback divergence weighting and filter selection for texture classification and segmentation,” in Proc. 18th ICPR, Hong Kong, Aug. 2006. [40] I. Karoui, R. Fablet, J. M. Boucher, W. Pieczynski, and J. M. Augustin, “Fusion of textural statistics using a similarity measure: Application to texture recognition and segmentation,” Pattern Anal. Appl., vol. 11, no. 3/4, pp. 425–434, Sep. 2008. Computer Science. [41] A. Ehrhold, D. Hamon, and B. Guillaumont, “The Rebent monitoring network, a spatially integrated, acoustic approach to surveying nearshore macrobenthic habitats: Application to the bay of Concarneau (South Brittany, France),” ICES J. Mar. Sci., vol. 63, no. 9, pp. 1604– 1615, Jan. 2006. [42] W. Pieczynski, “Statistical image segmentation,” Mach. Graph. Vis., vol. 1, no. 2, pp. 261–268, 1992. 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 Imen Karoui received the M.Sc. degree in telecommunications from École Polytechnique de Tunis, Tunisia, in 2002, the D.E.A. degree in STIR (signal, télécommunications, images, radar) from the Université de Rennes 1, Rennes, France, in 2003, and the Ph.D. degree in signal processing and telecommunications from the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne (Telecom Bretagne), Brest Cedex 3, France, in 2007. She is currently a Postdoctor with Telecom Bretagne. Her research interests include texture analysis and segmentation with application to acoustic imaging and fisheries. 585 586 587 588 589 590 591 592 593 594 595 596 597 Ronan Fablet receive the M.Sc. degree in signal processing and telecommunications from École Nationale Supérieure de l’Aéronautique et de l’Espace, Toulouse Cedex 4, France, in 1997 and the Ph.D. degree in signal processing and telecommunications from the University of Rennes 1, Rennes, France, in 2001. In 2002, he was an INRIA Postdoctoral Fellow with Brown University, Providence, RI. From 2003 to 2007, he had a full-time research position with the Institut Français de Recherche et d’Exploitation de la Mer, Brest, France, in the field of biological oceanography. Since 2008, he has been an Assistant Professor with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, Brest Cedex 3, France. His main interests include sonar and radar imaging particularly applied to biological and physical oceanography and signal and image processing applied to biological archives. 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 IE E Pr E oo f 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES Jean-Marc Boucher (M’83) was born in 1952. He received the M.Sc. degree in telecommunications from École Nationale Supérieure des Télécommunications, Paris, France, in 1975 and the Habilitation à Diriger des Recherches degree from the University of Rennes 1, Rennes, France, in 1995. He is currently a Professor with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, Brest Cedex 3, France, where he is also an Education Deputy Director. He is also the Deputy Manager of a National Scientific Research Center Laboratory (Institut Telecom-Telecom Bretagne-UMR CNRS 3192 lab-STICC, Université Européenne de BretagneFrance). His current researches are related to statistical signal analysis, including estimation theory, Markov models, blind deconvolution, wavelets, and multiscale-image analysis. These methods are applied to radar and sonar images, seismic signals, and electrocardiographic signals. He published about a hundred technical articles in these areas in international journals and conferences. Jean-Marie Augustin received the Master degree in signal processing from the University of Rennes 1, Rennes, France, in 1982. Since 1984, he has been with the Institut Français de Recherche et d’Exploitation de la Mer, Brest, France, the French public institute for marine research, where he is currently a Senior Engineer with the Department of Acoustics and Seismics. His main interests include software development for sonar seafloor mapping and backscatter reflectivity analysis. IE E Pr E oo f 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 11 634 635 636 637 638 639 640 641 642 643 644 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 2 Seabed Segmentation Using Optimized Statistics of Sonar Textures 3 Imen Karoui, Ronan Fablet, Jean-Marc Boucher, Member, IEEE, and Jean-Marie Augustin 1 Abstract—In this paper, we propose and compare two supervised algorithms for the segmentation of textured sonar images, with respect to seafloor types. We characterize seafloors by a set of empirical distributions estimated on texture responses to a set of different filters. Moreover, we introduce a novel similarity measure between sonar textures in this feature space. Our similarity measure is defined as a weighted sum of Kullback–Leibler divergences between texture features. The weight setting is twofold. First, each filter is weighted according to its discrimination power: The computation of these weights are issued from a margin maximization criterion. Second, an additional weight, evaluated as an angular distance between the incidence angles of the compared texture samples, is considered to take into account sonar-image acquisition process that leads to a variability of the backscattered value and of the texture aspect with the incidence-angle range. A Bayesian framework is used in the first algorithm where the conditional likelihoods are expressed using the proposed similarity measure between local pixel statistics and the seafloor prototype statistics. The second method is based on a variational framework as the minimization of a region-based functional that involves the similarity between global-region texture-based statistics and the predefined prototypes. IE E Pr E oo f 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1 Fig. 1. Typical sidscan sonar image (Rebent, IFREMER). 26 Index Terms—Active regions, angular backscattering, feature 27 selection, level sets, maximum marginal probability (MMP), seg28 mentation, sonar images, texture. I. I NTRODUCTION 29 COUSTIC remote sensing, such as high-resolution multibeam and sidescan sonars, provides new means for in situ observation of the seabed. The characterization of these high33 resolution sonar images is important for a number of practical 34 applications such as marine geology, commercial fishing, off35 shore oil prospecting, and drilling [1]–[4]. 36 The segmentation and the classification of sonar images with 37 respect to seafloor types (rocks, mud, sand, . . .) is the key 38 goal behind the analysis of these acoustic images. This task, 39 however, raises two major difficulties. The first task is to deal 40 with texture in these images. Previous methods are generally 41 based only on backscattered (BS) intensity models, and several 42 parametric families of probabilistic distribution functions have 30 31 32 A Manuscript received January 4, 2008; revised May 20, 2008 and August 11, 2008. I. Karoui, R. Fablet, and J.-M. Boucher are with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, 29238 Brest Cedex 3, France (e-mail: [email protected]). J.-M. Augustin is with the Department of Acoustics and Seismics, Institut Français de Recherche et d’Exploitation de la Mer, 29280 Brest, France. Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TGRS.2008.2006362 Fig. 2. BS evolution with the incidence angles for the three seafloor types of Fig. 1: sand, mud, and sand ripples. been suggested (Rayleigh distribution, K distribution, Weibull 43 distribution, etc.) [3]–[8]. These first-order statistics are not 44 sufficient when high-resolution sonar images involve textures, 45 which is the case of most sonar images (Fig. 1). 46 The other important issue arising in seabed texture charac- 47 terization is a built-in feature of sonar observation: The value 48 of BS measure depends both on the seafloor type and on the 49 incident angle of the reflected acoustic signal, ranging typically 50 from −85◦ to +85◦ . Fig. 2 shows the BS evolution for three 51 different seafloor types: sand, sand ripples, and mud. In addition 52 to the BS variability within incidence angles, seafloor textures 53 0196-2892/$25.00 © 2008 IEEE 2 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 Fig. 3. Sonar image composed of sand ripples. (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. 55 56 57 for two angular sectors. (Left) [5◦ , 40◦ ]. segmentation schemes. We first state the segmentation issue 93 as a Bayesian pixel-based labeling according to local texture 94 features. The second approach relies on a region-level vari- 95 ational framework, which resorts to a level-set minimization 96 of an energy criterion involving global-region-based seafloor 97 statistics. 98 This paper is organized as follows. The seafloor similarity 99 measure is introduced in Section II. The Bayesian segmentation 100 method is detailed in Section III. The region-based segmen- 101 tation algorithm is described in Section IV. Experiments are 102 reported and discussed in Section V, and conclusions are drawn 103 in Section VI. 104 IE E Pr E oo f are dependent on the incidence angles. Figs. 3 and 4 show a sonar image composed by sand ripples and rocks, respectively, for two angular sectors: [80◦ , 85◦ ] and [5◦ , 40◦ ]. The texture of sand ripples shows a loss of contrast in the specular sec58 tor [5◦ , 40◦ ]: The steep grazing angle reduces the backscatter 59 differences between facing and trailing slopes, while at low 60 incidence angles, much of the variation is lost due to increasing 61 sonar shadow. A similar loss of contrast is observable in the 62 rock samples (Fig. 4). BS behavior according to the incidence 63 angles has been of wide interest for sonar imaging [3], [9]–[12]. 64 Parametric and nonparametric techniques have been proposed 65 to model sonar-image behavior with respect to the incidence 66 angle variations. The effect of the incidence angle on the BS 67 has also been explored as a discriminating feature for seafloor 68 recognition [4], [13], [14]. However, no studies have proposed 69 a method to accurately compensate this phenomena because 70 of the joint dependence of the seafloor types and on the local 71 bathymetry which is generally unknown for sidescan sonar 72 images. To our knowledge, the effects of the incidence angle on 73 textured seabed features have not been addressed for segmen74 tation issues. Only some studies were interested in simulating 75 the behavior of oriented and textured seafloor types [15]–[18]. 76 These methods are mainly based on shape-from-shading [19] 77 and were restricted to synthetic images or to real sonar images 78 involving only one seafloor type. 79 In this paper, we aim at using texture information within 80 sonar seabed images and at developing segmentation algo81 rithms that take into account angular variability of BS and 82 textural features. We propose to characterize seafloors by a 83 wide set of marginal distributions of their filter responses, and 84 we measure seafloor similarities according to a weighted sum 85 of Kullback–Leibler divergences [28] in this feature space. 86 To cope with seafloor angular dependence, we introduce an 87 additional weighting factor, evaluated as an angular distance 88 between the compared texture samples: This angular distance 89 is measured according to Gaussian kernels, whose variance 90 sets the level of the angular variability, depending on textures 91 and seafloor types. The proposed incidence-angle-and-texture92 based similarity measure is exploited to develop two different 54 Fig. 4. Rock texture (Right) [80◦ , 85◦ ]. II. S ONAR -T EXTURE S IMILARITY M EASURE 105 Texture-based segmentation of seabed sonar images gener- 106 ally relies on Haralick [30] parameters or scalar spectral and 107 filter coefficients to model textures [20]–[22]. Recently, in the 108 field of texture analysis, features computed as statistics of local 109 filter responses have been shown to be relevant and discriminant 110 texture descriptors [23]–[27]. Motivated by these studies, we 111 propose to use texture features computed as marginal distrib- 112 utions of a wide set of filter seafloor responses. Each seafloor 113 type denoted by Tk is characterized by a set Qk composed of 114 the marginal distributions of the seafloor with respect to the 115 predefined filters. The following conditions are issued. 116 1) 121 cooccurrence distributions with horizontal and ver- 117 tical displacements denoted by dx and dy , respectively, 118 (dx , dy ) ∈ {0, 1, . . . , 10}. 119 2) 50 distributions of the magnitude of Gabor filter 120 responses, computed for combination of parameters 121 (f0 , σ, θ), where f0 is the radial frequency, σ is 122 the standard√deviation, and θ is the √ orientation, such 123 that f0 ∈ { 2/2k }k=1:6 , σ ∈ {2k 2}k=2:5 , and θ ∈ 124 {0◦ , 25◦ , 45◦ , 90◦ , 135◦ }. 125 3) 48 distributions of the energy of the image wavelet packet 126 coefficient computed for different bands (we used three 127 wavelet types: Haar, Daubechies, and Coiflet). 128 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES 3 Gaussian mixture, and the assignment likelihood p(Θj /zs ) is 147 given by 148   −(Θj −θs )2 exp σ2 .  j p(Θj /zs ) =  (1) −(Θi −θs )2 J exp 2 i=1 σ i Hence, the assignment of a given region R to an angular 149 subdomain is given by 150  1 p(Θj /zs ), ds. (2) πR (j) = |R| s IE E Pr E oo f Fig. 5. Cooccurrence distributions computed for parameters (d, θ) = (6, 0◦ ). (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. The angular bounds, defined by the parameters Θj and σj , 151 are set experimentally, and each seafloor type is character- 152 ized by Qk = {Qf,j (Tk )}f =1:F,j=1:J , its filter responses es- 153 timated on the J angular domains. Note that f accounts 154 both for filter types and various associated parameteri- 155 zations: f = 1 → 121 refers to cooccurrence distributions, 156 f = 122 → 171 to Gabor filter, and f = 172 → 219 to 157 wavelet-based distributions. These statistics are computed us- 158 ing Parzen window estimation [29]. Formally, Qf,j (Tk , α) = 159 (1/πTk (j)) Tk p(Θj /zs )gσf (hf (s) − α)ds, where hf is the 160 filter response indexed by f (for a cooccurrence matrix with pa- 161 rameters μ = (d, θ), hf : Ω → [1, N g] × [1, N g], and hf (s) = 162 (I(s), I(s + μ)), where I(s) is the gray value at pixel s and N g 163 is the total gray level number) and gσf is a Gaussian kernel with 164 zero mean and standard deviation σf . 165 We define the similarity measure between a texture sample 166 T with feature set D(T ) = {Df,j (T )}f =1:F,j=1:J and a given 167 seafloor type Tk as follows: 168 J  k  KLΘ w Q , D(T ) = Fig. 6. Cooccurrence distributions computed for parameters (d, θ) = (Left) [5◦ , 40◦ ]. (Right) [80◦ , 85◦ ]. (6◦ , 0). Sonar-texture variability with respect to the incidence angles induces a variability in texture features. Figs. 5 and 6 show the cooccurrence matrices of the images involving sand ripples 132 and rocks shown in Figs. 3 and 4, respectively. A change in 133 cooccurrence distributions can be noticed. In the angular sector 134 [5◦ , 40◦ ], the cooccurrence is bimodal due to the alternation of 135 dark and light values in the image of sand ripples, whereas in 136 the sector [80◦ , 85◦ ], the loss of contrast between dark and light 137 values in the image of sand ripples leads to unimodal distribu138 tion. For cooccurrence distributions related to rock samples, a 139 change in the variance is depicted. 140 To deal with this problem, we propose to define angular 141 subdomains, in which texture characteristics can be regarded 142 as homogeneous. Each angular sector is indexed by j and 143 is characterized by a mean incidence-angle value Θj . For144 mally, this leads in introducing a state variable zs assign145 ing pixel s to a given angular subdomain. (θs , zs ), where 146 θs is the incidence angle of pixel s, is then modeled as a 129 130 131 F j=1 f =1   πT (j)wf2 KL Qkf,j , Df,j (T ) (3) where {wf2 }f =1:F are the feature weights such that 169 F 2 f =1 wf = 1, Df,j (T ) is the distribution related to 170 filter f and estimated on  region T for angular sector j: 171 Df,j (T, α) = (1/πT (j)) T p(Θj /zs )gσf (hf (s) − α)ds, and 172 KL denotes for the Kullback–Leibler divergence [28]; 173 for  two probability distributions Q and D, KL(Q, D) = 174 Q(α)log(Q(α)/D(α))dα. 175 The resulting weighting factors {wf2 }f =1:F are exploited on 176 the one hand for filter selection to keep only the distributions 177 corresponding to the significant weights and, on the other hand, 178 for the definition of an optimized texture-based similarity mea- 179 sure KLΘ w given the selected distributions. In a supervised con- 180 text, the weights are estimated from a training set T composed 181 of N -labeled texture samples: T = {(D(T ), sT )}, where sT is 182 the class of sample T . Formally, {wf2 }f =1:F are issued from the 183 maximization of the global margin defined as follows: 184 MwT = Mw (T ) (4) T ∈T where  d  Θ sT T Mw (T ) = KLΘ w Q , D(T ) − KLw (Q , D(T )) 185 (5) 4 186 187 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 where dT is the nearest class to T different from sT according to the similarity measure KLΘ w  k  (6) dT = arg min KLΘ w Q , D(T ) . k=sT iterative conditional estimation procedure [42]. Using Bayes 227 rule, the posterior distribution is expressed as follows: 228 p(X, Y ) = p(Y /X)p(X) = p(X) The maximization of the margin criterion MwT is carried out using a stochastic gradient method as detailed in our previous 190 work [39]. 188 189 191 III. B AYESIAN S ONAR -I MAGE S EGMENTATION As far as Bayesian image segmentation is concerned, the most popular criteria are the maximum a posteriori (MAP) 194 [31] and the maximum marginal probability (MMP) [32]. It 195 has been shown that the MMP estimation criterion is more 196 appropriate for image segmentation than the MAP criterion 197 [32]. The MAP estimate assigns the same cost to every incorrect 198 segmentation regardless the number of pixels at which the 199 estimated segmentation differs from the true one, whereas the 200 MMP algorithm minimizes the expected value of the number of 201 misclassified pixels. As shown in [32], the MMP procedure is 202 equivalent to maximizing the marginal of the class labels. Let 203 us introduce the following notations: 204 1) S, the image lattice composed of N pixels; 205 2) X = {xs }s∈S , the label random field; 206 3) Y = {ys }s∈S , the random field of the observations, the 207 textural feature in our case. 208 Formally, the MMP scheme is equivalent to the maximi209 zation of x̂MMP = arg max p(xs = k/y). s k∈{1:K} (7) In general, pixel conditional likelihood is computed according to local texture features like Gabor or wavelet coefficients. Here, we aim at using the proposed similarity measure KLΘ w. We associate to each pixel s a set of features D(Ws ) = 214 {Df (Ws )}f =1:F estimated according to a Parzen estimation 215 method [29], within a square window Ws centered at s. The 216 window size that we denote by TW is set by the user according 217 to texture coarseness. In our case, the observation denoted 218 by a random field Y is specified by {ys = D(Ws )}s∈S . The 219 conditional likelihood at each pixel s with respect to class k is 220 then defined from the similarity measure KLΘ w by 210 211 212 213 k −KLΘ w (Q ,D(Ws )) 221 222 223 exp . p(ys /xs = k) = K i −KLΘ w (Q ,D(Ws )) i=1 exp s∈S t∈cs −αc (1−δ(xs ,xt ))+log(p(ys /xs )) . (10) The maximization of local probabilities p(xs = k/y) is car- 229 ried out as follows [32]: We use the Gibbs sampler to gen- 230 erate a discrete-time Markov chain X(t) which converges in 231 distribution to a random field with probability mass functions 232 P (X/Y ). The marginal conditional distributions p(xs = k/y) 233 are then approximated as the fraction of time the Markov chain 234 spends in state k for each class k and each pixel s. The MMP 235 segmentation steps are the following. 236 1) Simulation of Tmax realizations of x1 , x2 , . . . , xTmax of 237 X using Gibbs sampler. 238 2) Using the realizations x1 , x2 , . . . , xTmax , p(xs /y) is esti- 239 mated using the frequency of each realizations 240     δ x1s − k + · · · + δ x1s − k p(xs = k/y) = . Tmax s∈S t∈cs αc (1 − δ(xs , xt )) 3) Choose as xs the class that maximizes p(xs = k/y). 241 IV. V ARIATIONAL S ONAR -I MAGE S EGMENTATION 242 Unlike the Bayesian scheme, the second approach is stated 243 at a region level as the minimization of a constrained energy 244 criterion E({Ωk }k=1:K ) = E1 + E2 , where Ωk is the domain 245 composed of all pixels attributed to the class k, E1 is a texture- 246 based data-driven term, and E2 is a regularization term detailed 247 as follows. 248 A. Functional Terms E ({Ωk }k=1:K ) is evaluated as the log-likelihood of a given 250 partition with respect to texture models. It is evaluated as the 251 sum of the similarities according to the measure KLΘ w between 252 each region Ωk and its corresponding class Tk 253 K E1 ({Ωk }k=1:K ) = k=1  k  KLΘ w Q , D(Ωk ) (11) where Df,j (Ωk ) is the marginal distribution of the image 254 response to the filter indexed by f . For angular domain j, 255 Df,j (Ωk ) is estimated according to Parzen method [29]. 256 E2 is a regularization term, it penalizes the lengths of region 257 contours and is expressed by 258 (9) where δ is the delta function and αc ∈ {αH , αV , αD } are real parameters assigned, respectively, to horizontal, vertical, and 226 diagonal cliques. These parameters are estimated using the 249 1 (8) As a prior PX , we consider a Markov random field associated to an eight-neighborhood system with potential functions given by U2 (x) = 224 225 ≈ exp   IE E Pr E oo f 192 193 p(ys /xs ) s K E2 = k=1 γk |Γk |, γk ∈ ℜ+ (12) where |Γk | is the length of the contour Γk associated to the 259 region Ωk . 260 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES 261 B. Computation of the Evolution Equation We solve for the minimization of the functional E using a gradient-descent technique. It relies on the computation of 264 the first derivative of E according to regions {Ωk }k=1:K . The 265 evolution equation of region contours {Γk }k=1:K is then given 266 by the following dynamic scheme [33]: 262 263 ∂Γk (x,t) = Fk (x, t)Nk ∂t Γk (x, 0) = Γ0k (13) 5 where ∂ϕik /∂t, i = 1, 2, 3 are the evolution equation terms associated, respectively, to functionals Ei , i = 1, 2, 3. The derivatives of the energy terms E2 and E3 are directly estimated from level-set functions [35], [36] 287 288 289 290 ∇ϕk ∂ϕ2k (s, t) = γk δα (ϕk )div ∀k ∈ {1 : K} (19) ∂t |∇ϕk | K  ∂ϕ3k (s, t) =−δα (ϕk )λ (Hα (ϕk )−1) ∀k ∈ {1 : K}. ∂t k=1 where Nk is the unit inward normal to Γk at pixel x and at time t and Fk is the velocity field (in our case, Fk = ∇Ek , the 269 derivative of E with respect to Γk ). 270 The explicit implementation of the curve evolution according 271 to the latter dynamic scheme using a difference-approximation 272 scheme cannot deal with topological changes of the moving 273 front. This could be avoided by introducing the level-set method 274 proposed by Osher and Sethian [34]. The basic idea of the 275 method is the implicit representation of the moving interface Γ 276 by a higher dimensional hypersurface ϕ (the level-set function) 277 such that the zero level set of ϕ is actually the set of Γ and 267 268 K γk lim k=1 281 ∀s ∈ Ωk . (14) E2 can be expressed using level-set functions ϕk [36] E2 = α→0  δα (ϕk )|∇ϕk |ds (15) Ω where δα is a regularized delta function    1 1 + cos πs , 2α α δα (s) = 0, if |s| ≤ α if |s| < α. (16) In order to cope with multiclass segmentation and to fulfill the image-partition constraint, we use an additional term E3 given 284 by the following functional [38]: 282 283 λ E3 = 2   Ω 285 286 K k=1 F f =1    k wf2 p(Θj /zs ) KL Qkf,j , Df,j  − Qkf,j /Df,j ∗ gσf (hf (s)) − 1 |∇ϕk | The evolution of the contours {Γk }k=1:K (13) is then equivalent to the evolution of the level-set functions {ϕk }k=1:K [34] ∂ϕk (s, t) = Fk (s, t) |∇ϕk (s, t)| ∂t 280 J ∂ϕ1k (s, t) = − ∂t j=1 IE E Pr E oo f 278 The evolution equation related to E1 is more complex, since it 291 involves computations over the spatial support of each region. 292 To differentiate E1 , we use shape-derivative tools, particularly 293 the Gâteaux derivative theorem given in [37]. As detailed in the 294 Appendix, it leads to 295  Ωinside = {s ∈ Ω/ϕ(s) > 0} Ωoutside = {s ∈ Ω/ϕ(s) < 0} . 279 (20) 2 Hα (ϕk ) − 1 ds, λ ∈ ℜ+ where Hα is a regularized Heaviside function ⎧    ⎨ 12 1 + αs + π1 sin πs , if |s| ≤ α α Hα (s) = 1, if s > α ⎩ 0, if s < −α. As E = E1 + E2 + E3 , we have ∂ϕk ∂ϕ1k ∂ϕ2k ∂ϕ3k (s, t) = (s, t) + (s, t) + (s, t) ∂t ∂t ∂t ∂t (17) (21) where ∗ is the convolution symbol. 296 The evolution equation related to the energy E1 (21) is 297 composed of two terms. 298 J  F 2 k 1) A global term − j=1 f =1 wf p(Θj /zs )KL(Qf,j , 299 Df,j ): This term is always negative or null. It is a contrac- 300 tion force that  reducesthe size of heterogeneous regions. 301 F J 2 k 2) A local term f =1 wf p(Θj /zs )((Qf,j /Df,j ) − 302 j=1 1) ∗ gσf (hf (s)): This term locally compares the features 303 values at each pixel. This term can be positive or neg- 304 ative and aims at readjusting the statistics inside the 305 regions Df,j (Ωk ) to fit to prototype models Qkf,j . The 306 contribution of each descriptor f is weighted by wf2 and 307 p(Θj /zs ), the relative contribution of descriptor f and of 308 angular sector j. 309 The overall evolution equations of the contours {Γk }k=1:K 310 are the following: 311 ∂ϕk (s, t) ∂t = −δα (ϕk ) ⎡ J ×⎣ F wf2 p(Θj /zs ) j=1 f =1 ×  KL(Qkf,j , Df,j )− (18) + γk div ∇ϕk −λ |∇ϕk |  Qkf,j ∗ gσf (hf (s))+1 Df,j  K (Hα (ϕk )−1) k=1 ∀k. (22) We apply these coupled evolution equations until convergence. 312 6 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 Fig. 8. Feature weights. 1 → 121 cooccurrence distributions, 122 → 171 Gabor filter-based features and 172 → 219 wavelet-based distributions. (a) I1. (b) I5. IE E Pr E oo f TABLE I SEGMENTATION ERROR RATES Fig. 7. Test images and their manual segmentation (in black line). 313 V. E XPERIMENTAL R ESULTS AND D ISCUSSION In previous work, we have tested the method on various optic textures (Brodatz textures). The method was compared 316 to other texture-classification methods, and some results are 317 reported in [39] and [40]. Here, we evaluate the proposed 318 seabed-segmentation technique for different real sonar images 319 acquired by a sidescan sonar, as part of a natural seabed320 mapping project (IFREMER, Rebent Project) [41]. A reference 321 interpretation by an expert is available [41]. Fig. 7 shows the 322 set of images on which we carried out the experiments. We 323 superimposed on these images the manual expert segmentation. 324 Image I1 is composed of three seafloor types: rock, mud, and 325 marl ripples [41]; I2 of rock, marl ripples, and mud seafloors; 326 I3 of mud, sand, and marl ripples; I4 of marl and marl ripples; 327 and I5 of sand, sand ripples, and rock. For I2 and I3, the angular 328 variability of the sefloors, particularly marl ripples (for I2) and 329 marl (for I3), is visually clear. 330 For all these images, we first determine the most discriminant 331 features among the initial set of 219 features: We apply the 332 algorithm described in Section II and detailed in [39], and 333 we keep only the feature set such that the cumulative sum 334 of weights exceeds 0.9. Only a small number of features are 335 retained. For example, Fig. 8 shows the plot of feature weights 336 computed for image I1 and I5. For I1, the two cooccurrence 337 matrices account for more than 90% of the total weight sum; 338 these cooccurrence are computed for parameters (dx , dy ) ∈ 339 {(1, 4), (2, 1)}. For I5, the cooccurrence distributions com340 puted for parameters (dx , dy ) ∈ {(2, 1), (2, 2)} are selected. 341 For sonar images, we noticed that cooccurrence matrices are the 314 315 most selected features. In previous work on Brodatz textures 342 [39], [40], we remarked that Gabor and wavelet filters were 343 selected for oriented textures, whereas cooccurrence distribu- 344 tions, which, in addition to the detection of texture structures, 345 detect the intensity values change, are selected in the case of 346 texture having different intensity values and texture with regular 347 motifs. 348 For the five test images, three segmentation algorithms are 349 compared. 350 1) The maximum-likelihood (ML) segmentation denoted by 351 ML. This method consists in maximizing at each pixel the 352 conditional probability p(ys /xs ) given by (8) 353 xˆs = arg max p(ys /xs = k). k (23) Several analysis-window sizes are compared: TW ∈ {7 × 354 7, 17 × 17, 33 × 33}. 355 2) The MMP segmentation described in Section III, applied 356 for several analysis-window sizes: TW ∈ {7 × 7, 17 × 357 17, 33 × 33}. 358 3) The region-based variational segmentation described in 359 Section IV that we denote by V ar. 360 Table I summarizes the different error classification rates for all 361 segmentations. 362 All segmentation methods give quite good results according 363 to the mean classification error rates. MMP and variational 364 approaches are more efficient than the ML-based segmentation 365 because they take into account the spatial dependence between 366 pixels. The difference between these later approaches (MMP 367 and variational methods) mainly lies in the accuracy of the 368 localization of region boundaries and in the dependence of 369 MMP-based segmentation on the window size TW . In fact, 370 for the MMP segmentation, the use of small neighborhood 371 (TW = 7 × 7) leads to more accurate region frontiers but 372 small misclassified patches appear because of the neighborhood 373 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES MMP segmentation of I1 using TW = 7 × 7, τ = 12.5%. Fig. 10. MMP segmentation of I1 using TW = 17 × 17, τ = 9.2%. inefficiency for texture characterization. Conversely, the use of large window sizes (TW = 33 × 33) resorts to a lack of 376 accuracy in the localization of the boundaries of the seabed 377 regions because texture features extracted for pixels close to 378 region boundaries involve a mixture of texture characteris379 tics. The variational region-based approach does not need the 380 choice of an analysis window and operates globally on region 381 composed of pixels belonging to the same class. It resorts 382 to a tradeoff between segmentation accuracy and region ho383 mogeneity. Figs. 9–13 show examples of the dependence of 384 MMP segmentation on the sizes of analysis window and on the 385 robustness of the variational approach. In Fig. 14, we plot the 386 mean error rate for different window sizes. 387 We note that MMP and variational segmentations give sim388 ilar results when the analysis-window size is well chosen for 389 instance, TW = 17 × 17 for image I2 (Figs. 15 and 16), but 390 its performance depends a lot on the choice of this parameter. 391 This method can however be appropriate when the aim of the 392 segmentation is to detect texture regions and without seeking 393 for accurate boundaries. 394 The variational approach is also interesting because it is 395 much faster than the MMP segmentation. Being deterministic, 396 the variational approach can be very fast particularly if we use 397 appropriate initialization such as an initial segmentation based 398 on the ML criterion. Whereas the MMP segmentation needs 374 375 Fig. 11. MMP segmentation of I1 using TW = 33 × 33, τ = 13%. IE E Pr E oo f Fig. 9. 7 Fig. 12. Region-based segmentation of I1, τ = 7.5%. a large number of iterations, each iteration is also complex 399 and involves Gibbs sampling. For our implementations, the 400 convergence time of a variational image typically corresponds 401 to one iteration of the MMP algorithm. 402 To stress the interest of taking into account texture variability 403 with respect to incidence angles, additional segmentation re- 404 sults are reported for image I2 and I3 for which the seafloor 405 texture variability is clearer. In Table II, we summarize the 406 segmentation error rates for the segmentation with (J = 3) and 407 with no angular weighting, i.e., using only one angular domain 408 J = 1, for ML, MMP, and variational methods. In Figs. 17 and 409 18 are reported the associated segmentation results. It can be 410 noticed that a classical segmentation (without taking into ac- 411 count texture variability within incidence angles) cannot distin- 412 guish between visually similar seafloors (mud and marl ripples 413 for I2 and marl and sand for I3) near the specular domain. 414 VI. C ONCLUSION 415 We proposed two segmentation algorithms for sonar-image 416 segmentation: a Bayesian algorithm using local statistics and a 417 region-based variational algorithm, both based on a novel sim- 418 ilarity measure between seafloor-type images according to the 419 statistics of their responses to a large set of filters. This similar- 420 ity measure is expressed as a weighted sum of Kullback–Leibler 421 8 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 IE E Pr E oo f Fig. 13. Segmentations of I4. (a) MMP: TW = 7, τ = 4%. (b) MMP: TW = 33, τ = 6%. (c) Variational method: τ = 3%. Fig. 16. Variational-based segmentation of I2, τ = 3%. Fig. 14. Mean segmentation error rate for several window sizes. TABLE II SEGMENTATION ERROR RATES Fig. 15. MMP-based segmentation of I2 TW = 17, τ = 4%. divergences between individual seafloor filter-response statistics. The resulting weighting factors are exploited on the one hand for filter selection and, on the other hand, for taking into account the incidence angular dependence of seafloor tex426 tures. The conclusion is the cooccurrence matrices outperform 427 the other features for our sonar images. The results show 428 that the performance of the Bayesian approach depends on 429 the size of the analysis window. For pixel-based segmenta430 tion (ML and MMP segmentations), the size must be tuned 431 according to the coarseness of given textures. The results 432 also stress the suitability of the region-based approach as 433 compared to the Bayesian pixel-based scheme for texture seg434 mentation and demonstrate the interest of taking into account 422 423 424 425 Fig. 17. I2 region-based segmentation with and without angular weighting. (a) J = 1, τ = 23.5%. (b) J = 3, τ = 3%. Fig. 18. I3 MMP-based segmentation with and without angular weighting. (a) J = 1, τ = 16%. (b) J = 3, τ = 8%. the angular backscatter variabilities to discriminate between 435 seafloor types particularly near the specular sector. 436 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES A PPENDIX E VOLUTION E QUATION C OMPUTATION 437 438 441 Using the shape-derivative tools, we want to differentiate the functional  k  F (Ωk ) = KLΘ w Q , D(Ωk ) which can be written as follows: J F πΩk (j)wf2 j=1 f =1 442 443 444  Qkf,j (α) log Rf Qkf,j (α) Df,j (Ωk , α)  Then, we have J F wf2 πΩk (j) j=1 f =1 J F Finally, we get 457 ) dHjk (Ωk , V   = p(Θj /zs )  Γk Qkf,j (α) Qf (α) gσf (hf (s) − α) − G1 (f, α, Ωk ) πΩk (j) )  Hjk . Γk + πΩk (j) Hjk dα  )dα. dHjk (Ωk , V  · Nk )da(s) × (V   )dα dHjk (Ωk , V Rf 1 = πΩk (j) Rf    · Nk )da(s) × (V    Qkf,j (α) 1 = gσ (hf (s)−α) Qf (α) p(Θj /zs ) πΩk (j) Df,j (Ωk , α) f Rf  wf2 dπΩk (j)(Ωk , V j=1 f =1 Qkf,j ∂Hjk = . ∂πΩk (j) πΩk (j) Γk The Gâteaux derivative of F (Ωk ) in the direction of a vector  is given by field V )= dF (Ωk , V Qkf,j ∂Hjk =− ∂G1 G1 According to the previous theorem, we get 456   · Nk )da(s).  ) = − p(Θj /zs )gσ (hf (s) − α) (V dG1 (Ωk , V f p(Θj /zs )gσf (hf (s) − α) ds. F (Ωk ) = 446 dα. 2) Ωk 447  Let us introduce the following notations. 1)   Qkf,j (α) k k Hj = Qf,j (α) log . Df,j (Ωk , α) G1 = 445  455 Using the chain rule, we get   ∂Hjk ∂Hjk  )=  )+ ) dHjk (Ωk , V dG1(Ωk , V dπΩk(j)(Ωk , V ∂G1 ∂πΩk (j) IE E Pr E oo f 439 440 9  Γk   Qkf,j ∗ gσf (hf (s)) − 1 p(Θj /zs ) Df,j (Ωk )  · Nk )da(s). × (V (24) Finally, according to (24) 458 Rf 448 Theorem: See [37]. 449  The Gâteaux derivative 450 Ω k(s, Ω)ds is given by )= dK(Ω, V  ) dF (Ωk , V of a functional of the type K(Ω) =  )ds − ksh (s, Ω, V  N  )da(s) k(s, Ω)(V Γ Ω J =− F p(Θj /zs )wf2 j=1 f =1       k  Qkf,j × KL Qf,j , Df,j − −1 ∗ gσf (hf (s)) Df,j (Ωk ) Γk  ) is where Ω denoted a region and Γ its boundary; ksh (s, Ω, V the shape derivative of k(s, Ω). 453 In our case   ) = − p(Θj /zs )(V  · Nk )da(s) dπΩk (j)(Ωk , V 451 452 Γk 454 Hjk can be written as Hjk = Qkf,j (α) log  πΩk (j)Qkf,j (α) G1 (Ωk , f, j, α)  .    · Nk da(s). × V R EFERENCES 459 [1] P. Brehmer, F. Gerlotto, J. Guillard, F. Sanguinède, Y. Guénnegan, and D. Buestel, “New applications of hydroacoustic methods for monitoring shallow water aquatic ecosystems: The case of mussel culture grounds,” Aquat. Living Resour., vol. 16, no. 3, pp. 333–338, Jul. 2003. [2] M. Mignotte, C. Collet, P. Perez, and P. Bouthemy, “Hybrid genetic optimization and statistical model based approach for the classification of shadow shapes in sonar imagery,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 2, pp. 129–141, Feb. 2000. 460 461 462 463 464 465 466 467 10 [3] L. Hellequin, J. M. Boucher, and X. Lurton, “Processing of highfrequency multibeam echo sounder data for seafloor characterization,” IEEE J. Ocean. Eng., vol. 28, no. 1, pp. 78–89, Jan. 2003. [4] G. Le Chenadec and J. M. Boucher, “Sonar image segmentation using the angular dependence of backscattering distributions,” in Proc. Oceans—Europe, Brest, France, 2005, vol. 1, pp. 147–152. [5] G. Le Chenadec, J.-M. Boucher, and X. Lurton, “Angular dependence of K-distributed sonar data,” IEEE Trans. Geosci. Remote Sens., vol. 45, no. 5, pp. 1224–1236, May 2007. [6] E. Jakeman, “Non-Gaussian models for the statistics of scattered waves,” Adv. Phys., vol. 37, no. 5, pp. 471–529, Oct. 1988. [7] A. P. Lyons and D. A. Abraham, “Statistical characterization of highfrequency shallow-water seafloor backscatter,” J. Acoust. Soc. Amer., vol. 106, no. 3, pp. 1307–1315, Sep. 1999. [8] D. A. Abraham and A. P. Lyons, “Novel physical interpretations of k-distributed reverberation,” IEEE J. Ocean. Eng., vol. 27, no. 4, pp. 800– 813, Oct. 2002. [9] N. C. Mitchell, “Processing and analysis of Simrad multibeam sonar data,” Mar. Geophys. Res., vol. 18, no. 6, pp. 729–739, Dec. 1996. [10] G. Le Chenadec, J. M. Boucher, X. Lurton, and J. M. Augustin, “Angular dependence of statistical distributions for backscattered signals: Modeling and application to multibeam echosounder data,” in Proc. Oceans, Sep. 2003, vol. 2, pp. 897–903. [11] D. R. Jackson, D. P. Winbrenner, and A. Ishimaru, “Application of the composite roughness model to high-frequency bottom backscattering,” J. Acoust. Soc. Amer., vol. 79, no. 5, pp. 1410–1422, May 1986. [12] C. de Moustier and D. Alexandrou, “Angular dependence of 12-kHz seafloor acoustic backscatter,” J. Acoust. Soc. Amer., vol. 90, no. 1, pp. 522–531, Jul. 1991. [13] J. E. Hughes-Clarke, “Toward remote seafloor classification using the angular response of acoustic backscatter: A case study from multiple overlapping GLORIA data,” IEEE J. Ocean. Eng., vol. 19, no. 1, pp. 112– 127, Jan. 1994. [14] E. Pouliquen and X. Lurton, “Automated sea-bed classification system for echo-sounders,” in Proc. Oceans, Oct. 1992, vol. 1, pp. 317–321. [15] D. Langer and M. Herbert, “Building qualitative elevation maps from side scan sonar data for autonomous underwater navigation,” in Proc. IEEE Int. Conf. Robot. Autom., Apr. 1991, vol. 3, pp. 2478–2483. [16] R. Li and S. Pai, “Improvement of bathymetric data bases by shape from shading technique using side-scan sonar images,” in Proc. OCEANS, Oct. 1991, vol. 1, pp. 320–324. [17] E. Dura, J. Bell, and D. Lane, “Reconstruction of textured seafloors from side-scan sonar images,” Proc. Inst. Elect. Eng.—Radar Sonar Navig., vol. 151, no. 2, pp. 114–125, Apr. 2004. [18] A. E. Johnson and M. Hebert, “Seafloor map generation for autonomous underwater vehicle navigation,” Auton. Robots, vol. 3, no. 2, pp. 145–168, Jun. 1996. [19] R. Zhang, P. Tsai, J. E. Cryer, and M. Shah, “Shape from shading: A survey,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 21, no. 8, pp. 690– 705, Aug. 1999. [20] T. H. Eggen, “Acoustic sediment classification experiment by means of multibeam echosounders,” Acoust. Classification Mapping Seabed, vol. 15, no. 2, pp. 437–444, 1993. [21] L. M. Linnett, S. J. Clarke, and D. R. Carmichael, “The analysis of sidescan sonar images for seabed types and objects,” in Proc. 2nd Eur. Conf. Underwater Acoust., 1994, vol. 22, pp. 187–194. [22] M. Lianantonakis and Y. R. Petillot, “Sidescan sonar segmentation using active contours and level set method,” in Proc. OCEANS—Europe, Brest, France, 2005, vol. 1, pp. 719–724. [23] L. Xiuwen and W. DeLiang, “Texture classification using spectral histograms,” IEEE Trans. Image Process., vol. 12, no. 6, pp. 661–670, Jun. 2003. [24] O. G. Cula and K. Dana, “3D texture recognition using bidirectional feature histograms,” Int. J. Comput. Vis., vol. 59, no. 1, pp. 33–60, Aug. 2004. [25] R. Fablet and P. Bouthemy, “Motion recognition using nonparametric image motion models estimated from temporal and multiscale co-occurrence statistics,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, no. 12, pp. 1619–1624, Dec. 2003. [26] P. Nammalwar, O. Ghita, and P. F. Whelan, “Integration of feature distributions for colour texture segmentation,” in Proc. Int. Conf. Pattern Recog., Aug. 2005, vol. 1, pp. 716–719. [27] Q. Xu, J. Yang, and S. Ding, “Texture Segmentation using LBP embedded region competition,” Electron. Lett. Comput. Vis. Image Anal., vol. 5, no. 1, pp. 41–47, 2004. [28] S. Kullback, “On information and sufficiency,” in The Annals of Mathematical Statistics, vol. 22. New York: Wiley, 1951. [29] E. Parzen, “On estimation of a probability density function and mode,” Ann. Math. Stat., vol. 33, no. 3, pp. 1065–1076, Sep. 1962. [30] R. M. Haralick, “Statistical and structural approaches to texture,” Proc. IEEE, vol. 67, no. 5, pp. 786–804, May 1979. [31] S. Geman and G. Geman, “Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. PAMI-6, no. 6, pp. 721–741, Nov. 1984. [32] J. Marroquin, S. Mitter, and T. Poggio, “Probabilistic solution of ill-posed problems in computational vision,” J. Am. Stat. Assoc., vol. 82, no. 397, pp. 76–89, Mar. 1987. [33] M. Kass, A. Withkin, and D. Terzopoulos, “Snakes: Active contour models,” Int. J. Comput. Vis., vol. 1, no. 4, pp. 321–331, Jan. 1988. [34] J. A. Sethian, Level Set Methods. Cambridge, U.K.: Cambridge Univ. Press, 1996. [35] J. F. Aujol, G. Aubert, and L. Blanc-Féraud, “Wavelet-based level set evolution for classification of textured images,” IEEE Trans. Image Process., vol. 12, no. 12, pp. 1634–1641, Dec. 2003. [36] C. Samson, L. Blanc-Féraud, G. Aubert, and J. Zerubia, “A level set method for image classification,” Int. J. Comput. Vis., vol. 40, no. 3, pp. 187–197, 2000. [37] S. Jehan-Besson, M. Barlaud, and G. Aubert, “Image segmentation using active contours: Calculus of variations or shape gradients?” SIAM J. Appl. Math., vol. 63, no. 6, pp. 2128–2154, 2003. [38] H. K. Zhao, T. Chan, B. Merriman, and S. Osher, “A variational level set approach to multiphase motion,” J. Comput. Phys., vol. 127, no. 1, pp. 179–195, Aug. 1996. [39] I. Karoui, R. Fablet, J. M. Boucher, and J. M. Augustin, “Separabilitybased Kullback divergence weighting and filter selection for texture classification and segmentation,” in Proc. 18th ICPR, Hong Kong, Aug. 2006. [40] I. Karoui, R. Fablet, J. M. Boucher, W. Pieczynski, and J. M. Augustin, “Fusion of textural statistics using a similarity measure: Application to texture recognition and segmentation,” Pattern Anal. Appl., vol. 11, no. 3/4, pp. 425–434, Sep. 2008. Computer Science. [41] A. Ehrhold, D. Hamon, and B. Guillaumont, “The Rebent monitoring network, a spatially integrated, acoustic approach to surveying nearshore macrobenthic habitats: Application to the bay of Concarneau (South Brittany, France),” ICES J. Mar. Sci., vol. 63, no. 9, pp. 1604– 1615, Jan. 2006. [42] W. Pieczynski, “Statistical image segmentation,” Mach. Graph. Vis., vol. 1, no. 2, pp. 261–268, 1992. 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 Imen Karoui received the M.Sc. degree in telecommunications from École Polytechnique de Tunis, Tunisia, in 2002, the D.E.A. degree in STIR (signal, télécommunications, images, radar) from the Université de Rennes 1, Rennes, France, in 2003, and the Ph.D. degree in signal processing and telecommunications from the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne (Telecom Bretagne), Brest Cedex 3, France, in 2007. She is currently a Postdoctor with Telecom Bretagne. Her research interests include texture analysis and segmentation with application to acoustic imaging and fisheries. 585 586 587 588 589 590 591 592 593 594 595 596 597 Ronan Fablet receive the M.Sc. degree in signal processing and telecommunications from École Nationale Supérieure de l’Aéronautique et de l’Espace, Toulouse Cedex 4, France, in 1997 and the Ph.D. degree in signal processing and telecommunications from the University of Rennes 1, Rennes, France, in 2001. In 2002, he was an INRIA Postdoctoral Fellow with Brown University, Providence, RI. From 2003 to 2007, he had a full-time research position with the Institut Français de Recherche et d’Exploitation de la Mer, Brest, France, in the field of biological oceanography. Since 2008, he has been an Assistant Professor with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, Brest Cedex 3, France. His main interests include sonar and radar imaging particularly applied to biological and physical oceanography and signal and image processing applied to biological archives. 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 IE E Pr E oo f 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 12, DECEMBER 2008 KAROUI et al.: SEABED SEGMENTATION USING OPTIMIZED STATISTICS OF SONAR TEXTURES Jean-Marc Boucher (M’83) was born in 1952. He received the M.Sc. degree in telecommunications from École Nationale Supérieure des Télécommunications, Paris, France, in 1975 and the Habilitation à Diriger des Recherches degree from the University of Rennes 1, Rennes, France, in 1995. He is currently a Professor with the Department of Signal and Communications, École Nationale Supérieure des Télécommunications de Bretagne, Brest Cedex 3, France, where he is also an Education Deputy Director. He is also the Deputy Manager of a National Scientific Research Center Laboratory (Institut Telecom-Telecom Bretagne-UMR CNRS 3192 lab-STICC, Université Européenne de BretagneFrance). His current researches are related to statistical signal analysis, including estimation theory, Markov models, blind deconvolution, wavelets, and multiscale-image analysis. These methods are applied to radar and sonar images, seismic signals, and electrocardiographic signals. He published about a hundred technical articles in these areas in international journals and conferences. Jean-Marie Augustin received the Master degree in signal processing from the University of Rennes 1, Rennes, France, in 1982. Since 1984, he has been with the Institut Français de Recherche et d’Exploitation de la Mer, Brest, France, the French public institute for marine research, where he is currently a Senior Engineer with the Department of Acoustics and Seismics. His main interests include software development for sonar seafloor mapping and backscatter reflectivity analysis. IE E Pr E oo f 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 11 634 635 636 637 638 639 640 641 642 643 644