STUDY OF SOME IMFORMATION EXTRACTION TECHNIQUES FOR HYPERSPECTRAL IMAGING

Prabhu, N.

STUDY OF SOME IMFORMATION EXTRACTION TECHNIQUES FOR HYPERSPECTRAL IMAGING

Prabhu, N.

URI: http://hdl.handle.net/123456789/14735

Date: 2014-10

Abstract:

Remote sensing is an art and science of acquiring information about earth surface without actual contact with the earth surface. It gives data in the form of images which comprises of finite number of elements known as pixels. Such images which are obtained in from remote sensing needs to be processed and the easy and effective way is to do it digitally. Hence the field of digital image processing refers to processing of digital images by means of a computer. Digital image processing plays a key role in the field of image processing, image analysis, image enhancement and coding with a wide range of applications such as, classification, target/anomaly detection, mineral identification, super resolution mapping, etc. In remote sensing, the digital image processing methods are used for deriving the information from airborne and satellite borne images. Remote sensing community has utilized the digital images obtained from airborne and satellite borne sensors and has made enormous progress in recent years. Based on the sensor specifications the digital image produced by each sensor varies in spatial, spectral and radiometric resolutions. Other components like dimension of the image, temporal resolution of the image, swath width of the image, etc., plays secondary role in some types of applications. Due to variation in spectral resolution we have panchromatic (single band), multi-spectral (few bands less than 10) and hyperspectral (100s of bands) data are available. In this research, the study has been done for hyperspectral data. The land use land cover classes on the earth's surface have different physical characteristics. Particularly, for a hyperspectral data, due to coarse spatial resolution pixels may contain two or more classes. Hence classification of hyperspectral data is a way to extract useful information from it. But due to the large spectral dimension of hyperspectral data, it suffers while doing processing. Redundant information from hyperspectral data are carried every time. Also one of the important properties of hyperspectral data is that the neighbouring bands of hyperspectral data are highly correlated. Hence due to the collective reasons the hyperspectral data needs to be reduced. To reduce the hyperspectral data, in this research, the feature extraction techniques have been employed. The two feature extraction techniques employed here are wavelet based feature extraction and PCA based feature extraction. In wavelet based feature extraction three wavelet transforms have been used, namely, Haar wavelets, Daubechies wavelets and Coiflets wavelets. These wavelets along with its sub classes have also been utilized. For each of the sub classes, decomposition [ii] upto 4 levels have been performed. In case of PCA based feature extraction two prominent techniques have been used, segmented PCA and spectrally segmented PCA. The evaluation of the feature extraction techniques has been done by calculating the classification accuracy. Also a study has been made to observe the duration of feature extraction, duration of classification, number of extracted features, relation between classification accuracy and number of reduced features, etc. Once the features have been extracted from hyperspectral data, now it is ready for performing classification. Even though the number of bands reduced to significant level the algorithms used for multi-spectral classification are incapable of producing useful information. Hence for classification of hyperspectral data few algorithms exist in literature. For hyperspectral data also, the classification may be per pixel and sub pixel. By incorporating training pixels if the classification is done then it is called as supervised classification while with no help from training data if classification is done then it is unsupervised classification. Also if the statistical parameter extracted from the data are used then it is parametric classification and if not it is called as non-parametric classification. In this research all types of these classifications are touched and they are support vector machines (SVM) which is supervised, per pixel and non-parametric classifier while linear mixture model (LMM) is also supervised and non-parametric but sub pixel classification. Finally, the independent component analysis (ICAMM) which is parametric classification unsupervised but produces sub pixel outputs. The information extractions by each of these classification techniques are useful in one or the other way. To have a study about the performance of these techniques, three hyperspectral datasets have been taken. Two AVIRIS datasets with different spatial resolutions 4m and 20 m while one HYPERION dataset with spatial resolution 30m. The 20m spatial resolution dataset is named as dataset II covers by most of the region by vegetation and the classes are almost crisp. The 4m resolution data from HYPERON is named as dataset III which covers Roorkee and its surroundings covers by urban area and vegetation. Also some small classes like barren land, sand, etc., are also present in the dataset. Moreover the classes are not crisp enough to retrieve them back easily because of the nature of the land cover and the coarse spatial resolution. The next dataset is again from AVIRIS sensor but its airborne and hence a fine spatial resolution of 4m data is available. This data is over San Diego Naval Station and has many classes whose spectral signatures are almost [iii] similar. The study of classification algorithms has been made on these three hyperspectral datasets. When per pixel classification produces thematic map in which each pixel has been allotted to one and only one class while in sub pixel classification the fraction abundance of each class present in each pixel has been estimated. But that is not suffice until the spatial location has been estimated at sub pixel level. A solution to this problem may be achieved by super resolution mapping. In super resolution mapping, every pixel is divided into a specified zoom factor and each divided portion (sub pixel location) is filled by unique class thus making the original data into much finer spatial resolution. In this research, a novel super resolution mapping algorithm has been proposed, named as Pixel Filling Algorithm, to split each of the pixels and by gathering fractional abundance information from neighbouring pixels each sub pixel location may be filled by class values. This algorithm has been compared with existing super resolution mapping algorithm named as Pixel Swapping Algorithm. Due to the novelty of the pixel filling algorithm, a synthetic dataset has been generated of dimension 45x60. This dataset has been reduced by two low-pass filters, 3x3 and 5x5, to convert the data to dimensions 15x20 and 9x12 respectively. Now if super resolution mapping algorithm has been applied on the reduced dataset by appropriate zoom factors, 3 and 5, respectively then it is expected to get back the original data of dimension 45x60. At every stage of information extraction, the accuracy assessment has been performed. The feature extracted hyperspectral data are classified by SVM and then the conventional error matrix based accuracy assessment has been used. For sub pixel classification techniques the fuzzy error matrix based accuracy assessment has been performed. Finally, for super resolution mapping algorithms, the conventional error matrix based accuracy assessment has been performed but by taking three types of testing samples. Among the feature extraction techniques studied here, Daubechies wavelets perform better in extracting useful information from hyperspectral data. The second level decomposition is better in both accuracy wise and feature reduction wise. Since for all the datasets only the first two decomposition levels give better overall classification accuracies the sub pixel classifications have been performed only to the first two decomposition levels [iv] and that too only for Daubechies and its sub classes. In LMM has two solutions, constrained and unconstrained and there is no significant classification accuracies between them. The accuracy comes to be around 67%, 62% and 61% respectively for dataset II, dataset III and dataset IV. The ICAMM too extracts sub pixel information but due to unsupervised nature it retrieves some classes which are small enough to collect pure pixels from the three hyperspectral datasets. From dataset II, ICAMM retrieves railway track and non-metallic road and from dataset III, it retrieves noise free water body while from dataset IV it retrieves cylindrical drum, aircrafts, etc. In this way, each algorithm proves that it is superior over the other. The results for super resolution mapping have been analysed for the two algorithms, the proposed pixel filling algorithm and pixel swapping algorithm. The pixel filling algorithm performs well in super resolving mixed pixels which are having complicated boundaries and simple boundaries while the pixel swapping works well for only classes having linear boundaries. The accuracy for pixel filling and pixel swapping algorithms has been given in ordered pairs (for easy comparison) (92%, 98%), (96%, 90%), (97%, 91%) for datasets II, III, IV respectively for zoom factor 3 while for zoom factor 5 it is (91%, 94%), (90%, 70%), (90%, 70%) respectively. For dataset I, the overall accuracy of super resolved image by pixel filling algorithm gives 90.7% and 72.7% for zoom factors 3 and 5 respectively. Also the time required to perform super resolution mapping via pixel resolution mapping takes less than 10 seconds while by pixel swapping algorithm it takes more than a minute. Both in accuracy wise and duration for super resolution wise pixel filling algorithm is better than pixel swapping algorithm.

Show full item record