Ahmad 2014
Ahmad 2014
Ahmad 2014
Arbab Masood Ahmad, Gul Muhammad Khan, and Sahibzada Ali Mahmud
1 Introduction
Breast cancer is a leading cause of death in women worldwide. The disease has
often times no symptoms till it advances to a dangerous level. It is therefore
highly recommended to carry out screening tests after a certain age. The best
screening method to date is the mammography, in which both the breasts are im-
aged with low dose x-ray radiation. Nowadays digital mammography has become
the standard screening practice.
As mammography is highly subject to the expertise of radiologist and prone to
errors, a decision support system based on machine learning is highly desirable.
There are two main indicators for breast cancer using mammography. These
are masses and microcalcifications. Often times before a suspicious mass ap-
pears in the mammogram, very fine specs appear in the image, which is caused
by micro-calcifications (MCs). The MCs that are 1 mm or smaller in diameter
L. Iliadis et al. (Eds.): AIAI 2014, IFIP AICT 436, pp. 203–213, 2014.
c IFIP International Federation for Information Processing 2014
204 A.M. Ahmad, G.M. Khan, and S.A. Mahmud
and appear in clusters are more likely to be cancerous while those that are larger
in size and scattered randomly are often benign. A mass on the other hand is
considered benign if its shape is round or oval and the margins are circumscribed,
while it is malignant if it is irregular, stellate or micro-lobulated in shape. The
density of breast is such that the mammogram shows a very low contrast between
parenchymal tissue and a mass and hence very difficult to isolate the two, visu-
ally. A number of methods have been developed for computer based classification
of the masses and microcalcifications. Before classification the mammogram im-
age is preprocessed using digital filters to enhance the different regions of the
image. The masses and microcalcifications are then outlined using segmentation
techniques. Before these segmented images could be classified using machine, a
machine learning system is trained with a large set of mammograms that are
classified by expert radiologists. There are a number of mammogram databases
available on the internet. The two most popular of them are Digital Database
for Screening Mammography (DDSM) at the University of South Florida1 and
the MIAS 2
In this paper we present our work in classifying the masses and MCs seen
in mammograms, as either benign or malignant. The system that we devel-
oped assumes that the physician segments the masses or MCs manually using
the CROP function, found in most windows based graphic software packages.
In order to train our system for classification we had to train it with sample
mammogram images from the Digital Database for Screening Mammography
(DDSM). These images are available in compressed lossless JPEG format. The
Windows versions of the uncompressed images were downloaded under a trans-
fer agreement with Dr. Thomas Deserno (nee Lehmann) Department of Medical
Informatics Aachen University of Technology D-52057 Aachen GERMANY, un-
der the IRMA (Image Retrieval in Medical Applications) project. The database
contains 2620 cases comprising of Normal, Benign and Malignant, high resolu-
tion mammograms. Each case further consists of two views of each breast, the
cranio-caudal view and the oblique medio-lateral view. Besides the images, each
case has a .ics file that contains information about the patient and the image
file. Section 4 gives the details of the work done in preparing the training and
testing data for the CGPANN. After training the CGPANN with the former the
system is then evaluated with the later.
The paper is organized as follows: The next section titled ”Literature Sur-
vey” describes the latest research done in the field of mammogram classification.
The sections ”Cartesian Genetic Programming Evolved Artificial Neural Net-
work (CGPANN)” and ”Evolution Strategy” describe the evolutionary compu-
tation system that we used for the classification task. The section ”Experimental
setup” describes the preprocessing and statistical parameters extraction steps
needed before mammogram classification. It also describes the network parame-
ters that we chose. The section ”Results and Analyses” compares the results of
1
http://marathon.csee.usf.edu/Mammography/Database.html
2
http://peipa.essex.ac.uk/info/mias.html
Classification of Mammograms Using CGPANN 205
our experiments with those of the competitors. Finally, the section ”Conclusion
and Future work” summarizes the paper and states our intentions for future
work in this field.
2 Literature Survey
A brief review of the work done in the field of mammogram classification is
presented below. As the abnormality in mammograms can be in the form of
microcalcifications or masses, separate subsections review each of them.
2.1 Microcalcifications
In [1] Xuejun et al. presented a number of ANN architectures for detecting
microcalcifications (MC) in mammogram images. A back propagation neural
network responds to an MC lying in the middle of a region of interest. To reduce
the image size, FFT of the image was also determined. Another network with
multiple outputs had one of its outputs as 1 when its corresponding input had a
microcalcification at its position in the image. A Shift Invariant ANN (SIANN)
was also experimented with. The performance of the networks was compared
with Triple Ring Filter (TRF) which is a rule based filter [2]. A sensitivity of
95% was achieved by using SIANN and TRF together.
In [3] Issam et al. presented an SVM based microcalcification (MC) classifica-
tion technique. In [4] Rolando et al. presented a technique in which a mammo-
gram is first preprocessed with a median filter to remove noise. This is followed
by binarization. The resulting image is then applied to two Gaussian filters. The
optimized difference of Gaussian (DoG) filters is used to enhance those regions
in the image that contain bright points. A number of techniques are then applied
to this image to detect potential MCs.
In [5] Walker et al. discussed a multi-chromosome Cartesian Genetic Pro-
gramming in general and its application for classification of microcalcifications
(MCs) in mammograms, in particular. The process is termed multi-chromosome
Evolution (MCE). About 80% test images were classified correctly. In [6] Zhang
et al. presented a technique in which areas marked by radiologists in the DDSM
database were resized and fourteen features extracted from each area. These fea-
tures are represented by genes in a chromosome of a neural-Genetic Algorithm. A
subset of features is applied as input and the NN trained. A random population
of subsets is generated and each individual is evaluated. Genetic operators of
crossover and mutation are applied. The NN with the best classification rate to-
gether with the selected feature subset is chosen. It was observed that the highest
classification result using the proposed algorithm with NN classifier was 85%.
2.2 Masses
In [7]the authors preprocessed and segmented images from popular databases on
the basis of grey level distribution and texture using Otsu’s method. They ex-
tracted the intensity histogram and Gray Level Co-occurrence Matrix (GLCM)
206 A.M. Ahmad, G.M. Khan, and S.A. Mahmud
In [18] the authors present a mass detection technique in which the pec-
toral muscles are segmented and removed. The mass inside the breast is then
segmented using intensity thresholding. The segmented mass is analyzed for its
textural features, proposed by Haralick, using the gray level co-occurence matrix
(GLCM). The textural indices are applied to a support vector machine (SVM)
for training and classification. An average classification rate of 95% was achieved.
Inputs Outputs
III1
1 4 I
O1
Node 1 Node 4 Node 7 I
7
2 8 I
III2 Node 2 Node 5 5 Node 8 O2
I
I
O3
I
I3II Node 3 3 Node 6
6
Node 9
9
F, I1, W1, C1, I2, W2, C2 F, I1, W1, C1, I3, W2, C2 ,……………………, 1, 8, 2
(b)
(a)
Fig. 1. (a) A typical CGPANN phenotype. Nodes 3, 5, and 9 are inactive and the
remaining nodes are active. (b) Internal view of a single CGPANN neuron.
4 Experimental Setup
We trained our classifier system with 1000 mammogram images
of benign and malignant types each. Using the Matlab command
GET DDSM GROUNDTRUTH we extracted the image information
from the groundtruth file. The overlay file inside the groundtruth file contains
the following information: lesion type, assessment, subtlety, pathology, anno-
tations and a chain code representing the region of interest (ROI) marked by
expert radiologists. The function is normally used to get a binary image of
the ROI but we modified it to return a rectangular image that contains this
ROI (see fig.2c). The network trained with parameters from these rectangular
images, containing masses and MCs, makes it capable to accept parameters
from manually segmented images (using the CROP function) easier. A Gray
Level Co-occurrence Matrix (GLCM) of an image is then formed. The matrix
Classification of Mammograms Using CGPANN 209
contains information about the frequency of gray level repetition between pixels
at a certain offset distance and angle and can be used to extract statistical
parameters of the image. Out of the many possible second order statistical tex-
ture descriptors, proposed by Haralick [24], we extracted only contrast, energy,
homogeneity and correlation. Each of these four parameters were determined
for four different angles and a certain distance, using the graycoprops function
in matlab. The sample parameters were divided into a training set and a testing
set for training and testing our CGPANN. See equations (1) to (4) for the four
Haralick’s parameters.
N
−1
Contrast = Pij (i − j)2 (1)
i,j=0
N
N
Energy = p(i, j)2 (2)
i=1 j=1
N
−1
Pij
Homogeneity = (3)
i,j=0
1 + (i − j)2
[ (ij)p(i, j)] − μx μy
Correlation = (4)
σx σy
Where p(i, j) is the normalized entry in row i and column j of the GLCM, i is
the intensity of one pixel while j is that of the next pixel making the pair for
GLCM [13]. We get a total of 16 parameters. A 10-fold cross validation strategy
was adapted, where ten distinct data sets are formed. Each set has nine parts
for training and one for testing. We experimented with the following six distance
offsets (D): 4, 8, 12, 16, 20 and 24, and found that for D=20 we got the best
average 10-fold accuracy result i.e. Accuracy=90.85%, Sensitivity=86.2% and
Specificity=95.5%. We performed five experiments with this offset value, each
time using a different seed for random number generator in the initial population
in CGPANN. The tenfold results for these experiments are shown in table 1.
The CGPANN network that we used has the following features: Number of
nodes= 100 (10rows × 10columns), Inputs per node= 3 and Number of out-
puts= 1(Average of ten outputs). An input-output set contains the 16 statistical
parameters as inputs and a target output that is 0 for benign and 1 for ma-
lignant. In the process of evolution, an initial population of 50 networks was
formed randomly. Each network is applied all the 2000 sample parameters and
its fitness determined using the following metrics:
True Positive (TP):A=1, T=1
False Positive (FP):A=1, T=0
True Negative (TN):A=0, T=0
False Negative (FN):A=0, T=1
Accuracy=(TP+TN)/N
Sensitivity =TP/(TP + FN)
Specificity=TN/(FP+TN)
210 A.M. Ahmad, G.M. Khan, and S.A. Mahmud
Table 1. Average values for training and testing results for five independent evolution
runs with offset:20 and 10-fold cross validation; Acc: accuracy, Sen: Sensitivity, Spec:
Specificity
Table 2. Comparative results from other authors and the overall 10-fold average from
the proposed method; MC: Micro-calcifications, Acc: accuracy, Sen: Sensitivity, Spec:
Specificity
1 2 3 4 5 6 7 8 90°
[-1 0]
1 1 2 0 0 1 0 0 0
135° 45°
1 1 5 6 8 0 0 1 0 1 0 0 0 [-1 -1]
2 [-1 1]
2 3 5 7 1 3 0 0 0 0 1 0 0 0
0 0 0 0 1 0 0 0 0°
4
[0 1]
4 5 7 1 2
5 1 0 0 0 0 1 2 0
8 5 1 2 5 6 0 0 0 0 0 0 0 1
7 2 0 0 0 0 0 0 0
Pixel of
8 interest
0 0 0 0 1 0 0 0
Fig. 2. Conceptual Illustration of (a) Gray Level Co-occurence Matrix (b) GLCM off-
sets (c) A mammogram with a malignant mass, outlined by expert (Red) and generated
by software (White Rectangle). Courtesy of TM Deserno, Dept. of Medical Informatics,
RWTH Aachen, Germany for the mammogram image
proposed work. All these authors have tried to classify either masses or micro-
calcifications alone. In comparison, our method classifies a sample set containing
both masses and microcalcifications. Amongst the authors who worked in this
area, Al Mutaz et al. [13] used the same Haralick’s statistical texture descrip-
tors as we did and the same database for their system training and testing. The
main difference however is that they used an MLP ANN as the computational
intelligence system for classification while we used CGPANN.
References
1. Zhang, X., Fujita, H., Chen, J., Zhang, Z.: Effect of training artificial neural net-
works on 2d image: An example study on mammography. In: International Confer-
ence on Artificial Intelligence and Computational Intelligence, AICI 2009, vol. 4,
pp. 214–218. IEEE (2009)
2. Ibrahim, N., Fujita, H., Hara, T., Endo, T.: Automated detection of clustered
microcalcifications on mammograms: Cad system application to mias database.
Physics in Medicine and Biology 42(12), 2577 (1997)
3. El-Naqa, I., Yang, Y., Wernick, M.N., Galatsanos, N.P., Nishikawa, R.: Support
vector machine learning for detection of microcalcifications in mammograms. In:
Proceedings of the 2002 IEEE International Symposium on Biomedical Imaging,
pp. 201–204. IEEE (2002)
4. Hernández-Cisneros, R.R., Terashima-Marın, H.: Feature selection for the classifi-
cation of both individual and clustered microcalcifications in digital mammograms
using genetic algorithms. In: A Recombination of the 15th International Confer-
ence Genetic Algorithms (ICGA) and the 11th Genetic Programming Conference
(GP), Seattle, WA, USA (2006)
5. Walker, J.A., Völk, K., Smith, S.L., Miller, J.F.: Parallel evolution using multi-
chromosome cartesian genetic programming. Genetic Programming and Evolvable
Machines 10(4), 417–445 (2009)
6. Zhang, P., Verma, B., Kumar, K.: Neural vs. statistical classifier in conjunction
with genetic algorithm based feature selection. Pattern Recognition Letters 26(7),
909–919 (2005)
7. Vasantha, M., Subbiah Bharathi, V.: Classifications of mammogram images using
hybrid features. European Journal of Scientific Research, 87–96 ISSN
8. Khuwaja, G.A.: Breast cancer detection using mammography. WSEAS Transac-
tions on Mathematics 3, 317–321 (2004)
9. Martı́, J., Freixenet, J., Munoz, X., Oliver, A.: Active region segmentation of mam-
mographic masses based on texture, contour and shape features. In: Perales, F.J.,
Campilho, A.C., Pérez, N., Sanfeliu, A. (eds.) IbPRIA 2003. LNCS, vol. 2652,
pp. 478–485. Springer, Heidelberg (2003)
10. Hong, B.-W., Brady, J.M.: A topographic representation for mammogram seg-
mentation. In: Ellis, R.E., Peters, T.M. (eds.) MICCAI 2003. LNCS, vol. 2879,
pp. 730–737. Springer, Heidelberg (2003)
11. Hong, B.-W., Sohn, B.-S.: Segmentation of regions of interest in mammograms
in a topographic approach. IEEE Transactions on Information Technology in
Biomedicine 14(1), 129–139 (2010)
12. Saidin, N., Sakim, H.A.M., Ngah, U.K., Shuaib, I.L.: Segmentation of breast regions
in mammogram based on density: A review. arXiv preprint arXiv:1209.5494 (2012)
13. Mutaz, A.A., Dress, S., Zaki, N.: Detection of masses in digital mammogram us-
ing second order statistics and artificial neural network. International Journal of
Computer Science & Information Technology (IJCSIT) 3(3), 176–186 (2011)
14. Zhang, Y., Tomuro, N., Furst, J., Raicu, D.S.: Image enhancement and edge-based
mass segmentation in mammogram. In: Proc. SPIE, vol. 7623, pp. 76234P–8 (2010)
15. Delogu, P., Fantacci, M.E., Kasae, P., Retico, A.: Characterization of mammo-
graphic masses using a gradient-based segmentation algorithm and a neural clas-
sifier. Computers in Biology and Medicine 37(10), 1479–1491 (2007)
16. Tahmasbi, A., Saki, F., Shokouhi, S.B.: Mass diagnosis in mammography images
using novel ftrd features. In: 2010 17th Iranian Conference on Biomedical Engi-
neering (ICBME), pp. 1–5. IEEE (2010)
Classification of Mammograms Using CGPANN 213
17. Elfarra, B.K., Abuhaiba, I.S.: Mammogram computer aided diagnosis. Interna-
tional Journal of Signal Processing, Image Processing and Pattern 5(4), 1–30 (2012)
18. Eddaoudi, F., Regragui, F., Mahmoudi, A., Lamouri, N.: Masses detection using
svm classifier based on textures analysis. Applied Mathematical Sciences 5(8),
367–379 (2011)
19. Miller, J.F., Thomson, P.: Cartesian genetic programming. In: Poli, R., Banzhaf,
W., Langdon, W.B., Miller, J., Nordin, P., Fogarty, T.C. (eds.) EuroGP 2000.
LNCS, vol. 1802, pp. 121–132. Springer, Heidelberg (2000)
20. Ahmad, A.M., Khan, G.M., Mahmud, S.A., Miller, J.F.: Breast cancer detection
using cartesian genetic programming evolved artificial neural networks. In: Pro-
ceedings of the Fourteenth International Conference on Genetic and Evolutionary
Computation Conference, pp. 1031–1038. ACM (2012)
21. Ahmad, A.M., Khan, G.M.: Bio-signal processing using cartesian genetic program-
ming evolved artificial neural network (cgpann). In: 2012 10th International Con-
ference on Frontiers of Information Technology (FIT), pp. 261–268. IEEE (2012)
22. Ahmad, A.M., Muhammad Khan, G., Mahmud, S.A.: Classification of arrhythmia
types using cartesian genetic programming evolved artificial neural networks. In:
Iliadis, L., Papadopoulos, H., Jayne, C. (eds.) EANN 2013, Part I. CCIS, vol. 383,
pp. 282–291. Springer, Heidelberg (2013)
23. Miller, J.F., Thomson, P.: Cartesian genetic programming. In: Poli, R., Banzhaf,
W., Langdon, W.B., Miller, J., Nordin, P., Fogarty, T.C. (eds.) EuroGP 2000.
LNCS, vol. 1802, pp. 121–132. Springer, Heidelberg (2000)
24. Haralick, R., Shanmugam, K., Dinstein, I.: Texture features for image classification.
IEEE Transactions on Systems, Man, and Cybernetics 3(6) (1973)