Abstract
Land use pattern classification of remote sensing imagery data is imperative to research that is used in remote sensing applications. Remote sensing (RS) technologies were exploited to mine some of the significant spatially variable factors, such as land cover and land use (LCLU), from satellite images of remote arid areas in Karnataka, India. Four diverse classification techniques unsupervised, and supervised (Maximum likelihood, Mahalnobis Distance, and Minimum Distance) are applied in Bellary district in Karnataka, India for the classification of the raw satellite images. The developed maps are then visually compared with each other and accuracy evaluations make using of ground-truths are carried out. It was initiated that the Maximum likelihood technique gave the finest results and both Minimum distance and Mahalnobis distance methods overvalued agricultural land areas. In spite of missing a few insignificant features due to the low resolution of the satellite images, a high-quality accord between parameters extracted automatically from the developed maps and field observations was found.
Keywords: Remote sensing (RS); land cover and land use (LCLU)
INTRODUCTION
Remote sensing has become a foreseeable tool for reserve inventory and environmental monitoring at local, provincial, and universal scales in current years. It has become a vital element of information knowledge and provides the capability for sustainable progress of natural resources and environmental fortification. Image data classification and accuracy complexity estimation of the classified images are imperative aspects of remote sensing image analysis. Numerous investigative mechanisms conceded using some algorithms with their complexity of accuracy are reviewed in this study. This is the science and art of obtaining information about an object, geographic area, or phenomenon through the analysis of data acquired by a device that is not in contact with the object, area, or observable fact under investigation. This study deals with remote sensing data acquired through earth observation satellites. Remote sensing image analysis is done to extract useful information about the earth's surfaces. An important step in the analysis of such data is the process of land cover classification. In this, the image pixels are assigned to different land cover classes based on the spectral measurements of each pixel. Pattern recognition technique such as Mahalnobis distance classification is followed in this process. Machine learning techniques can be applied in a supervised and unsupervised approach. Supervised Classification is a procedure for identifying spectrally similar areas on an image by identifying ‘training’ sites of known targets and then extrapolating those spectral signatures to other areas of unknown targets.
DESCRIPTION OF STUDY AREA
The area selected to study in this work is Koluru Hobli of Bellary Taluk and District located in the Karnataka state central southern part of India covering which lies between 150 9΄ to 150 15΄N latitude and 760 55΄ to 760 92΄E longitudes.
RELATED WORK
Rao and Vaidyanadhan,(1981) in agricultural domain land use pattern in spatial data over occupied rising regions but across the world because of its association with dissimilar human marvels. It has been pragmatic that, remote sensing practice is the most competent systematic tool in amalgamation with ground truth and toposheet for assemblage of spatial evidence and actual use in empathy, classification, and mapping of the land use units. Bruzzone et al., (2002) The rising accessibility of remote sensing images, acquired sporadically by satellite sensors on the identical geographical area, makes it exceptionally motivating to extend the monitoring systems competent without human intervention producing and recurrently updating LULC maps of the measured site. Lu and Weng, (2007) remote sensing satellite image data classification based on environmental and socioeconomic have long concerned the significance of the remote sensing community research since most applications are based on the classification results. Satio et al. (2003) Mangroves reach their optimal development in the wet tropics although some little-known mangrove stands are reported in subtropical arid coastlines especially from the Red Sea to Pakistan where they form one of the driest mangrove habitats in the world. Because they constitute the only available evergreen forest in hyper-arid warm coastal areas, the main wetlands for migratory birds, and essential nursery ground for many species of fish, it is imperative to produce a sufficiently accurate map for monitoring their changes and for their protection. The main objective of the present work is to test and select the best methodological approach to discriminate and map the mangroves and related coastal ecosystems in the United Arab Emirates (UAE), between Abu Dhabi and Dubai, a coastal stretch about 750 km long. It was found that the best practical results were produced by the Maximum Likelihood and Mahalnobis classifications although some limitations remain unsolved, especially in open ecosystems, which are common in arid areas. Goel et al. (2003) have evaluated the potential of hyperspectral aerial imagery for the detection of weed infestation and nitrogen fertilization levels in a corn (Zea mays L) crop. A Compact Airborne Spectrographic Imager (CASI) was used to acquire hyperspectral data over a field experiment laid out at the Lods Agronomy Research Centre of Macdonald Campus, McGill University, Canada. Corn was grown under four weed management strategies (no weed control, control of grasses, control of broadleaf weeds, and full weed control) factually combined with nitrogen fertilization rates of 60, 120, and 250 N kg./ha. The aerial image was acquired at the tussle stage, which was 66 days after planting. The classification algorithms (maximum likelihood, minimum distance, Mahalanobis distance parallelepiped, and binary coding) and more sophisticated classification approaches (spectral angle mapped and linear spectral no mixing) were investigated. It was difficult to distinguish the combined effect of both weed and nitrogen treatments when only one factor, either wee or nitrogen treatment, was considered. With different classifiers, depending on the factors considered for the classifier, accuracies ranged from 65.84 percent to 99.46 percent. No single classifier was found useful for all the conditions. Shanmugam et al. (2003) have studied bare sand and semi-fixed dunes to represent ideal conditions for succession ally young slack habitats that support rare species of coastal dune flora such as fen orchid (Liparis loeselii) and liverworts (e.g., Petallophyllum ralfsii). To map this habitat, mapping algorithms are applied to Compact Airborne Spectrographic Imager (CASI) data. Per-pixel mapping was performed using the Minimum distance, Maximum likelihood, and Mahalnobis distance classification algorithms with training data extracted for habitats at various levels of the National Vegetation Classification (NVC) scheme. Sub-pixel mapping was performed using a linear mixture model, fuzzy membership functions, and neural networks, and the sub-pixel proportions of the spectral end members viz. sand, vegetation, and shade/moisture were defined. Results indicate that per-pixel mapping can only be achieved for broad habitat categories that correspond to the level I of the NVC. of the algorithms used, the minimum distance, with an overall mapping accuracy of 92 percent, outperforms both maximum likelihood and Mahalnobis distance. Perumal and Bhaskaran (2010) have studied remote sensing imagery for a wide range of applications from military applications to farm development. The images may be panchromatic, multispectral, hyperspectral, or even ultra spectral of terra bytes. A few image classification algorithms have proved good precision in classifying remote sensing data. But, of late, due to the increasing spatiotemporal dimensions of the remote sensing data, traditional classification algorithms have exposed weaknesses necessitating further research in the field of remote sensing image classification. So an efficient classifier is needed to classify the remote sensing imageries to extract information. The experiment included both supervised and unsupervised classification. Here, different classification methods and their performances are compared. It is found that the Mahalnobis classifier performed the best among the classification. Kun Wang and Youchuan Wan (2009) have studied fuzzy methods that have been widely applied in image classification, which is believed to be more appropriate for handling uncertainty in remote sensing. This research presents an algorithm integrating fuzzy multi-classifiers in classification. Traditional Mahalnobis distance and maximum likelihood classification are fuzzified by using fuzzy means and fuzzy covariance matrices, resulting in two fuzzy partitioned matrices. Experimental results indicate that this new method can increase classification accuracy. Hiremath and Kodge (2010) studied the visualization techniques for data mining in the Latur district of Maharashtra using the ENVI tools framework (ITTVIS, 2006). In the maximum likelihood classification algorithms for checking training data sets in the normal distribution, pixels are classified according to the closest mean point in feature space. Patil et al, (2014) Machine learning techniques have delivered improvements in the accuracy of classification of patterns of features. Remote sensing color-based imageries have hard clustering color pixels with variability in the intensity of colors. It helps the estimation of the crop yield predictions through satellite imageries. Achieved converge accuracy of estimation of vegetation crop yield of fields. Kappa coefficient to achieve high degree accuracy estimation of crop-wise with suitable thresh hold to ground truth data. Validation of schemes of classifiers of the imageries data using various distance measures
MATERIAL AND METHODS
The study area consists of Koluru Hobli of Bellary Taluk and District of Karnataka, which lies between 150 9΄ to 150 15΄N latitude and 760 55΄ to 760 92΄E longitudes.
IRS (Indian Remote Sensing Satellite)-P6 LISS-III(Linear Imaging Self Scanner) imageries of 21st November 2010 are used for the study. The geometrically corrected imageries are obtained from the National Remote Sensing Agency, Department of Space, Government of India, Hyderabad. The topographical map of the study area is overlaid on this image to extract the digital image of the study area (fig.2.). The spatial resolution of the images is 23.5mt. The images were recorded in three spectral bands. Among these only the first two namely Green and Red are in the small range of the electromagnetic spectrum and the third one namely the blue band is useful in the identification of green vegetation-like crops. Ground truth data is collected during field visits in the study area and the toposheet is used to accomplish the task of selecting training sites for each category for training the classifier in supervised classification. A part of the data was used as test sites for assessing classification accuracy.
Details of land cover classes considered
The categories of interest were carefully selected and defined to successfully perform digital image classification. In the present study, a broad land use/land cover classification system is adopted with six categories for each study area as follows. Land use/Land cover categories of Koluru Hobli, Bellary Taluk.
Methods of Image Classification
In the present study, both supervised and unsupervised methods are used for image classification. All classifications are done using ERDAS imagine 9.1 software at the Karnataka State Remote Sensing Applications Centre (KSRSAC), Department of IT and BT, Government of Karnataka, Major Sandeep Unnikrishnan Road, Doddabettahalli, Vidyaranyapura Post, Bengaluru, Karnataka 560097
study area satellite imageries (IRS P6 LISS-III from NRSA Hyderabad dated: 21st November, 2010)
Unsupervised classification algorithm
Unsupervised classification algorithms are commonly used in remote sensing. The ISODATA algorithm has some further refinements by splitting and merging clusters. Clusters are merged if either the number of members (pixel) in a cluster is less than a certain threshold or if the centers of two clusters are closer than a certain threshold. Clusters are split into two different clusters if the cluster standard deviation exceeds a predefined value and the number of members (pixels) is twice the threshold for the minimum number of members. The number of classes (clusters), in the classification processes (was set to 6 classes), the maximum number of iterations (was set to 30), and the convergence threshold, which is the maximum percentage of pixels whose class values are allowed to be unchanged between iterations (was set to 0.95). These values were the same for all the areas. After the execution of the algorithm, the assigned classes (6 classes) were grouped into a number of categories according to their spectral appearance on screen.
The maximum likelihood classification algorithm
Maximum likelihood is one of the most popular supervised classification methods used with remote sensing image data. This method is based on the probability that a pixel belongs to a particular class. The basic theory assumes that these probabilities are equal for all classes and that the input bands have normal distributions. From a statistical viewpoint, the clusters obtained by k-mean can be interpreted as the Maximum Likelihood Estimates (MLE) for the cluster means if we assume that each cluster comes from a spherical Normal distribution with different means but identical variance (and zero covariance). However, this method needs a long time of computation, relies heavily on a normal distribution of the data in each input band, and tends to over-classify signatures with relatively large values in the covariance matrix. The distance (spectral distance) method calculates the spectral distance between the measurement vector for the candidate pixel and the mean vector for each signature, and the equation for classifying by spectral distance is based on the equation for Euclidean distance. It requires the least computational time among other supervised methods, however, the pixels that should not be unclassified become classified, and it does not consider class variability.
Mahalnobis distance classification Algorithm
Mahalnobis distance is similar to the minimum distance, except that the covariance matrix is used instead. Unlike minimum distance, this method takes the variability of classes into account. It could be more useful than the minimum distance in cases where statistical criteria must be taken into account, but the weighting factors that are available with the maximum likelihood option are not needed. However, this method tends to over classify signatures with relatively large values in the covariance matrix. Also, it is slower to compute than the minimum distance; and it relies heavily on a normal distribution of the data in each input band. In this algorithm, each pixel follows a multivariate normal distribution. Let µ1, µ2, . . . , µm and Σ1, Σ2,. . . , Σm denote the population mean vectors and population variance-covariance matrices for m classes respectively. The observation vector Xr at the pixel or when it belongs to class c is a multivariate normal distribution with mean µc and covariance matrix Σc. Then,
Give the Mahalnobis distance of pixel r belonging to class c. Since the class mean vectors µc and covariance matrix Σc are unknown, the sample estimates are obtained from the training set. Let X_1, X_2 ,…., X_m be the sample mean vectors and V1, V2 ,…., Vm be the sample variance-covariance matrices estimated from the training data for m classes respectively. The classification is performed for the whole data set on a pixel-by-pixel basis. The pixel assignment is made based on the value of drc
CLASSIFICATION ACCURACY
The extent to which a manual or automatic processing system correctly identifies selected classes. Overall accuracy (O’s ac) is obtained as the ratio of total correct classifications for all the classes to the total test sample units. The individual class accuracies can be expressed in two ways: by calculating the producer’s accuracy and the user’s accuracy. Producer’s accuracy (P’s ac) is the ratio of correct classifications to the actual data. User’s accuracy (U’s ac) is the ratio of the correct classifications to the total number of pixels classified in specific classes.
Overall accuracy = , User's accuracy = ,
Producer's accuracy =
where, nii = correctly classified pixels,
ni = Marginal totals of classification categories, ni = Marginal totals of reference categories
Kappa Coefficient
Kappa statistic gives the proportion of agreement after the chance agreement is removed and it may be calculated as,
An approximation of variance of Kappa statistic is given by:
Test of significance can be performed to determine if the Kappa coefficient is significantly different from zero using Z test.Ho : K = 0, H1 : K ≠ 0
follows Normal distribution with zero mean and unit variance. Decision: if observed value of │Z│>1.96, the null hypothesis is rejected, at 0.05 level of significance, indicating that the confusion matrix is significantly different from a random result. The classified images were used to estimate the area under different land cover classes. This is done by aggregation. Area underclass c = (No. of pixels in class c) x (area of a pixel)
RESULTS AND DISCUSSION
Classification results obtained with Mahalnobis distance classification: Using Mahalnobis distance classification 86.15 percent of overall accuracy was achieved. The user’s accuracy and producer’s accuracy for individual classes ranged between 74.28 to 92.50 percent and 82.00 to 92.50 percent for different categories using the maximum likelihood algorithm. And producer's and user's accuracy is obtained and presented in the below table for all the classes under study.
Table glossary: F_ll= Fallow land, W_b= Water bodies; Pr’s A=Producer’s Accuracy, Ur’s A=User’s accuracy, f_m=f_measure; CA=Classification Accuracy, OCA= Overall Classification Accuracy, OKS= Overall Kappa Statistics
Test of significance of Kappa coefficients for Koluru Hobli, Bellary Taluk:The kappa coefficient is found to be 0.8321 and the variance of kappa is found to be 0.000675. The Kappa coefficient for Mahalnobis distance classification was highly significant, implying that the classifier produced classification significantly different from a random assignment.
In this study, classifiers were predicted by utilizing remote sensing in the arid region located in northeast Karnataka state. Classification of raw remote sensing satellite imageries data was applied to these with four methods of classification; these are the Unsupervised classification method and three different supervised classification methods. After classified features, accuracy was tested by different methods of accuracy assessment, and the post-classification process was implemented. By using the unsupervised technique it was found that features can be divided into main classes: Paddy, Cotton, Jowar, Chilly, and Fallow land, Water bodies constitute one land cover class achieving a good amount of time complexity and optimal level of classification accuracy.
Three supervised classification methods were utilized in this work; these are maximum likelihood, minimum distance, and Mahalnobis distance. It was noticed that applying the supervised classification methods in the study area enhances the extraction of production accuracy of outcrops. It can be shown also that both maximum likelihood and Mahalnobis distance methods agree with land truths in terms of production and user accuracy. Their predictions of the distribution of Paddy, Cotton, Jowar, Chilly, Fallow land, Water bodies’ outcrops areas were very reasonable, but the minimum distance method overestimated the Paddy, Cotton, Jowar, Chilly, Fallow land, Water bodies’ area in the study area. Error matrices produced to evaluate the classification methods show that the best overall classification accuracy method was the maximum likelihood for all the study areas; with an average accuracy of about 91.15%. The second best overall classification accuracy method was Mahalnobis distance; with an average accuracy of 86.15% and the least overall classification accuracy method was minimum distance with an average accuracy of 82%, and unsupervised maximum likelihood with an average accuracy of 76.15%.
CONCLUSIONS
The pairwise comparison between Kappa coefficients has shown a highly significant difference in maximum likelihood classification. Supervised classification using maximum likelihood, Mahalanobis distance, and minimum distance classifications, performing respective order. Unsupervised classification using maximum likelihood classification outperforms optimum. User’s accuracy classified in specific classes achieved significant improvement in Cotton, Paddy, Chilly, and Fallow land. Jowar and Water body classes are more common variations in Producer’s accuracy. Mahalnobis distance classification is estimated significantly converges to actual results.
REFERENCES
- Bruzzone, L., Cossu, R. and Vernazza, G. 2002. Combining parametric and non-parametric algorithms for partially unsupervised classification of multitemporal remote sensing images. Information Fusion, 3: 289-297.
- Goel, P. K., Prasher, S. O., Landry, J. A., Patel, R. M. and Viau, A. A., 2003. Hyperspectral image classification to detect weed infestations and nitrogen status in corn. Transactions of Americans Society of Agricultural and Biological Engineers,46(2): 539-550.
- Graney, M. J., and Engle, V. F., 2000. Stability of performance of activities of daily living using the MDS.Stat Med., 19(14): 1889-99.
- Hiremath, P. S., and Kodge, B. G., 2010. Visualization techniques for data mining of Latur district satellite imagery. Advances in computational research, 2 (1): 21-24.
- Kun Wang and Youchuan Wan, 2009. Classification of remote sensing images using Fuzzy multi-classifiers.Yangze River Scientific Research Institute, 978(1): 4244-4738.
- Lu, D., Weng, Q., 2007. A survey of image classification methods and techniques for improving classification performance". International Journal of Remote Sensing, 28(5): 823-870.
- Nageshwara R., and Vaidyanadhan, R.,1981. LandUse Capability Studies from Aerial PhotoInterpretation-A Case Study from Krishna Delta, India. Geographical Review of India, 3:226-236.
- Patil, S.S., Sachidananda, Angadi, U.B., Prabhuraj, D.K.,2014, Machine Learning Technique Approaches Versus Statistical Methods in Classification of Multispectral Remote Sensing Data Using Maximum Likelihood Classification: Koluru Hobli, Bellary Taluk, District, Karnataka State, INDIA. International of Advanced Remote Sensing and GIS, 3(1):525-531.
- Perumal, K. and Bhaskaran, R., 2010. Supervised classification performance of multispectral images, Journal of Computing, 2(2): 2151-9617.
- Nageswara Rao K and Vaidyanathan R (1981). Land use capability studies from aerial photo interpretation - A case study from Krishna Delta, India. Geog. Rev. of India, 43(3).
- Satio, H., Bellan, M. F., Al-Habshi, A., Aizpuru, M. and Blasco, F., 2003.Mangrove research and coastal ecosystem studies with SPOT-4 HRVIR and terra aster in the Arabian Gulf. 24(21): 4073 - 4092.
- Shanmugam, S., Lucas, N., Phipps, P., Richards. A., and Barnsley, M., 2003. Assessment of remote sensing techniques for habitat mapping in coastal dune ecosystems, Journal of Coastal Research, 19(1): 64-75.
- Sheldon MR, Fillyaw MJ, Thompson WD.,1996. The use and interpretation of the Friedman test in the analysis of ordinal-scale data in repeated measures designs. Physiother Res Int, 1(4):221-8
Publication Dates
-
Publication in this collection
19 Feb 2024 -
Date of issue
2024
History
-
Received
10 Oct 2023 -
Accepted
15 Nov 2023 -
Published
10 Jan 2024