Sandip A. Kharat et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 6 (5) , 2015, 4381-4384
Cotton Crop Discrimination Using Landsat-8 Data
Sandip A. Kharat, Vijaya B. Musande
Dept. of Computer Science & Engineering
MGM’s Jawaharlal Nehru Engineering College, N-6 CIDCO, Aurangabad 431003
Dr. Babasaheb Ambedkar Marathwada University, Aurangabad (MH), India
Abstract‐ The classification and recognition of
agricultural crop types is an important application of
remote sensing. This paper includes the approach and
technique of Remote Sensing (RS) based single crop
identification, based on multispectral temporal data. In
this study multispectral time series images of Landsat-8
has been used for identification of cotton crop for
Aurangabad region (MH) in India. The pixel based
Unsupervised K-Means classification technique is used
for discriminate the land cover distributions. Accuracy
and efficiency of the pixel based classification technique
is compared using kappa statistics and confusion
matrix.
Keywords: Crops Classification, Multispectral,
Confusion matrix, Kappa Coefficient.
I. INTRODUCTION
Agriculture is backbone of Indian economy providing
livelihood to 67% population and contributing approx 35%
to Gross National Product. So keeping track of agricultural
information is essential, remote sensing systems with their
synoptic viewing capability and variety of temporal and
spatial resolution helps in the same. Remote sensing
methods are superior to conventional methods since it is
fast and economic. Remote sensing plays significant role in
agriculture and crop management applications such as crop
inventory, crop production forecasts, drought, flood
damage assessment and crop classification [1]. We have
mainly emphasized on crop classification using
multispectral temporal data. In crop classification
geographic area, crop diversity, field size, crop phenology
and soil condition plays important role.
Cotton crop is main cash crop in India. For an efficient
production and management of cotton crop up to date
information is needed. An early detection of crop plays an
important role and helps policy maker in finding acreage,
crop yield production if further goes in depth it helps in
stress detection, crop disease identification etc.
II. LITERATURE REVIEW
Literature exploited supervised as well as unsupervised
classification of multispectral images. The multispectral
airborne as well as satellite remote sensing technologies
have been utilized as a widespread source for the purpose
of remote classification of vegetation. During the literature
it is found that for crop classification mostly exploited
vegetative indices are NDVI and TNDVI [2]. These VI’s
has its effect on temporal data. Vyas et al. recently
www.ijcsit.com
proposed a multi temporal crop type classification that
successfully classified crops in India [3].They have worked
on multiple crops with multi date data and compare the
result with single date. Multi temporal data improve the
accuracy of classification. Sujay Datta et al. have used
LISS-I Data for wheat crop classification by combining two
dates data using PCA (Principle Component Analysis) &
derived there first two principle components they have got
94% classification accuracy [4]. V.B.Musande et.al has
worked on cotton crop classification using fuzzy approach;
the image to image maximum classification accuracy
observed was 96.5%, data used for this study was AWIFS
for soft classification and LISS-III data for soft testing [5].
Md. Rejaur Rahman et.al used IRS LISS II digital data and
NDVI to identify the sugarcane area and its condition
assessment, the accuracy achieved was 85.25% [6]. The
literature also emphasized on finding best classification
approach for image segmentation. Xiaofang Liu and
Xiaowen Li used dot density weighted fuzzy c-means
clustering (WFCM) to overcome the limitation of FCM (i.e.
equal partition trend for data set.) [7]. Comparisons of
different classification algorithms in the multi-date
classification category have been extensively studied. For
example, Chan et al (2001) compared four classifiers,
namely Multi-Layer Perceptron (MLP), Learning Vector
Quantization (LVQ), Decision Tree (DT) and MaximumLikelihood Classifier (MLC). Seto and Liu (2003)
compared ARTMAP neural network with MLC and
observed that ARTMAP neural network classifiers were
more accurate than MLC classifiers. It is hard to make a
conclusion that some classifiers are always better than the
rests when multiple criteria are used to evaluate the
suitability of algorithms. The proposed work uses
unsupervised K-Means classifier for crop discrimination.
III. PROPOSED METHOD
This section provides proposed crop classification system
using pixel based K-Means unsupervised Classifier. The
use of this technique for specific crop identification follows
a sequence of steps which are explained as below. The flow
chart of the proposed methodology is given in figure 1.
The overall sequence of the proposed methodology is given
as follows:
1) Acquisition of remote sensing image from multispectral
source
2) Image processing
3) Sample set selection
4) Image classification for various land cover features
5) Accuracy assessment of classified image using ground
truth data/Field information.
4381
Sandip A. Kharat et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 6 (5) , 2015, 4381-4384
Output:
Cl: cluster of pixels
Begin
Step 1: cluster centroids are initialized.
Step 2: compute the closest cluster for each pixel and
classify it to that cluster, i.e. the objective is to minimize
the sum of squares of the distances given by the following:
N C
Δij = || xi-cj ||. arg min
ΣΣ
Δij2
(1)
i=1 j=1
Step 3: Compute new centroids after all the pixels are
clustered. The new centroids of a cluster is calculated by
the following
-Σ
xi , where xi belongs to cj
(2)
cj =
Step 4: Repeat steps 2-3 till the sum of squares given in
equation is minimized.
End
Figure 1: Flow Chart of Proposed Methodology
A. Multispectral Image Acquisition:
For the crop identification using remote sensing, remote
sensing images are needed to acquire. In our case we are
only concerned with multispectral sensors hence we can
consider the images from available satellite sensors like
AWIFS, LISS (IRS series), SPOT 5 and also LANDSAT,
MODIS which are good sources of multispectral data. But
here in this study we have used Landsat-8 images of
Aurangabad region.
B. Image Processing :
In this phase we can enhance and restore the image. Image
enhancement may include contrast stretching, edge
enhancement, etc. While in image restoration we can
consider the geometric correction, radiometric correction as
per the need of acquired image. Once the data is made
ready and georeferenced, it would be pre-processed using
various indices and then applied to the classifier.
C. Sample Set Selection:
Here in this phase we have considered the temporal data of
specific region (Mali Sagaj) and nearby villages related
with cotton crop and evaluated the variation in accuracy
over a period of time.
D. Classification Technique:
K-MEANS CLUSTERING ALGORITHM
K-means is one of the basic clustering methods introduced
by Hartigan [8]. This method is applied to segment the
remote sensing image in recent years. The K-means
clustering algorithm for classification of remote sensing
image is summarized as follows:
Algorithm K-means(x, n, c)
Input:
N: number of pixels to be clustered; x={x1, x2, x3… xN}:
pixels of remote sensing image
c = {c1, c2, c3… cj}: clusters respectively.
www.ijcsit.com
E. Accuracy Assessment:
An accuracy assessment of classification is undertaken
using confusion matrices and Kappa statistics. The
accuracy of the classified image was the assessed using a
range of reference data including field data collected in the
study area during the seasonal period. Producer and user
accuracies for each class were calculated along with the
overall accuracies and Kappa statistics.
IV. STUDY AREA AND DATA USED
The study area chosen for the present study is located
between
19031’57.03” N - 19057’45.74”N latitude and
0
74 49’44.46” E - 75016’9.37” E longitude in the state of
Maharashtra, India. The important places in study are Mali
Sagaj, Vaijapur and Aurangabad as shown in Figure 2.
Bajara, Jawar, Soyabin crops are grown in this region in
Kharip Season. Cotton is the main crop in this region
having homogeneous field.
Figure 2: Location of study Area
In this study remotely sensed multispectral images from
Landsat-8 are considered for cotton crop identification.
Landsat-8 has 11-bands with different resolutions given as
follows:
Band (1 to 7) and Band 9 =30m.
Band 8 (Panchromatic) =15m.
Band 10 and Band 11(TIRS) =100m.
But study has considered only three bands namely Green
(B3), Red (B4) and NIR (B5) for classification purpose
which has spatial resolution of 30 m. The details of datasets
used for the study are shown in Table I.
4382
Sandip A. Kharat et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 6 (5) , 2015, 4381-4384
TABLE I: Data Set Used
Date
21-Sep-2014
07-Oct-2014
Date Number
1
2
Phenological Stage
Leaves formation
Flowering Stage.
In this study the field visit is carried out on 16-17,
November 2014 to collect ground truth data so that the
exact positions of training and testing Cotton fields could
identify.
V. RESULT AND DISCUSSION
Color composite image extracted from the Landsat-8
reveals the crops and other cover types in this area Figure
3(a). Figure 3(b), (c) is classification map of study image
on two different dates.
Figure 3(a): Various land cover types in study Area.
Figure 3(b): classification map image Dated 07-10-14 (at
K=10)
Figure 3(c): classification map image dated 08-11-14 (at
K=10)
Visual comparison with color composites indicates
classification maps provided very little separation between
various cover types due to pixel based classification
approach. The image is classified using K-means clustering
for different values of K. According to D.T. Pham et.al for
value K>9 results converges [9]. So result obtained in the
work are consider for (K=10, 11).
The accuracy assessment of classified images is mapped
with ground truth images and classification accuracy
statistics including overall accuracy, producer’s accuracy
and user’s accuracy were calculated based on the confusion
matrices.
Among the classified temporal images we have got both
accuracies (i.e. Producer’s & User’s) good (>82 %). The
image taken on 07-10-2014 provided overall 98.81%
accuracy with Kappa coefficient 0.9801 and image taken
on 08-11-2014 has provided overall 96.18% accuracy with
Kappa coefficient 0.9436 for K=10. The confusion matrices
shows number of pixels classified by classifier with respect
to ground truth data as shown in Table II for both the dates
at different values of K.
TABLE II: ERROR MATRICES AND ACCURACY MEASURES FOR CLASSIFICATION MAPS FOR STUDY SITE
(a) Result of image 07-10-2014 (For K=10 )
Actual category
(Ground Truth)
Classified Category
Water Body
Cotton
Residential A.
Dry Veg
Total
Producer’s
Accuracy (%)
Water Body
Cotton
95360
0
2179
0
97539
0
186025
0
0
186025
Residential
Area
0
0
33925
0
33925
97.77
100
100
(b) Result of image 07-10-2014 (For K=11 )
Actual category
Total
0
39344
0
222234
261578
95360
225369
36104
222234
579067
Overall
Accuracy (%)
84.96
User’s
Accuracy (%)
100
82.54
93.96
100
-92.82
(Ground Truth)
Classified Category
Water Body
Cotton
Water Body
Cotton
Residential A.
Dry Veg
Total
Producer’s
Accuracy (%)
116463
0
2275
0
118738
0
108386
0
0
108386
Residential
Area
0
0
26144
0
26144
98.08
100
100
www.ijcsit.com
Dry Veg.
Dry Veg.
Total
0
20045
0
205126
225171
116463
128431
28419
205126
478439
Overall
Accuracy (%)
91.10
User’s
Accuracy (%)
100
84.39
91.99
100
-95.33
4383
Sandip A. Kharat et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 6 (5) , 2015, 4381-4384
Result of image 08-11-2014
Actual category
(For K=10)
(Ground Truth)
Classified Category
Water Body
Cotton
Water Body
Cotton
Residential A.
Dry Veg
Total
Producer’s
Accuracy (%)
107376
0
0
0
107376
0
57692
0
0
57692
Residential
Area
0
0
144431
467
144898
100
100
99.68
(c) Result of image 08-11-2014
Actual category
Classified
Category
Water Body
Cotton
Residential A.
Dry Veg
Total
Producer’s
Accuracy (%)
[1]
[2]
[5]
[6]
[7]
Total
0
0
22272
263249
285521
107376
57692
166703
263716
595487
Overall
Accuracy (%)
92.20
User’s
Accuracy (%)
100
100
86.64
99.82
-96.18
(For K=11)
(Ground Truth)
Water Body
Cotton
96187
3
0
0
96190
0
36829
0
0
36829
Residential
Area
0
0
106338
0
106338
100
100
100
REFERENCES
Vijaya Musande, Anil Kumar and Karbhari Kale, “Effects of Indices
on Temporal Data for Specific Crop Identification by using
Possibilistic-c means” November, 2011.
Sujay Dutta, N K Patel, T T Medhavy, S K Srivastava, Naveen
Mishra and K R P Singh, “Wheat Crop Classification Using
Multidate IRS LISS-I Data”, Journal of the Indian Society of Remote
Sensing, Vol. 26, No. l & 2, 1998.
Zhang Miao, Li Qiangzi, Wu Bingfang, “Investigating the capability
of multi-temporal Landsat images for crop identification in high
farmland fragmentation regions”. IEEE 2012 First International
Conference on Agro-Geoinformatics - Shanghai, China.
Erik Zillmann, Horst Weichelt, “Crop identification by means of
seasonal statistics of RapidEye time series”, IEEE 2014 Third
International Conference on Agro-Geoinformatics - Beijing, China.
Rabindra K. Panigrahy, S. S. Ray, S. Panigrahy, “Study on the
Utility of IRS-P6 AWiFS SWIR Band for Crop Discrimination and
www.ijcsit.com
Dry Veg.
Dry Veg.
Total
0
0
53857
224742
278599
96187
36832
160195
224742
517956
Overall
Accuracy (%)
80.67
[3]
[4]
[8]
[9]
User’s
Accuracy (%)
100
99.99
66.38
100
-89.60
Md. Rejaur Rahman, A.H.M. Hedayutul Islam, Md. Ataur Rahman,
“NDVI Derived Sugarcane Area Identification and Crop Condition
Assessment”.
Sujit Kumar BALA, Mohammad ALI, A.K.M. Saiful ISLAM,
“Estimation of Potato Yield In and Around Munshigonj Using
Remote Sensing and Gis Techniques”, International Conference on
Water & Flood Management (ICWFM-2007).12 - 14 March 2007,
Dhaka, Bangladesh.
Classification”, J. Indian Soc. Remote Sensing. (June 2009) 37:325–
333.
Babawuro Usman,“Satellite Imagery Land Cover Classification
using K-Means Clustering Algorithm: Computer Vision for
Environmental Information Extraction”, Elixir Comp. Sci. & Engg.
63 (2013) 18671-18675.
D T Pham, S S Dimov, and C D Nguyen,” Selection of K in Kmeans clustering”, Proc. IMechE Vol. 219 Part C: J. Mechanical
Engineering Science.
4384