TY - GEN
T1 - KNN-kernel based clustering for spatio-temporal database
AU - Musdholifah, Aina
AU - Mohd Hashim, Siti Zaiton Bt
AU - Wasito, Ito
PY - 2010
Y1 - 2010
N2 - Extracting and analyzing the interesting patterns from spatio-temporal databases, have drawn a great interest in various fields of research. Recently, a number of experiments have explored the problem of spatial or temporal data mining, and some clustering algorithms have been proposed. However, not many studies have been dealing with the integration of spatial data mining and temporal data mining. Moreover, the data in spatial temporal database can be categorized as high-dimensional data. Current density-based clustering might have difficulties with complex data sets including high-dimensional data. This paper presents Iterative Local Gaussian Clustering (ILGC), an algorithm that combines K-nearest neighbour (KNN) density estimation and Kernel density estimation, to cluster the spatiotemporal data. In this approach, the KNN density estimation is extended and combined with Kernel function, where KNN contributes in determining the best local data iteratively for kernel density estimation. The local best is defined as the set of neighbour data that maximizes the kernel function. Bayesian rule is used to deal with the problem of selecting the best local data. This paper utilized Gaussian kernel which has been proven successful in the clustering. To validate the KNN-kernel based algorithm, we compare its performance againts other popular algorithms, such as Self Organizing Maps (SOM) and K-Means, on Crime database. Results show that KNN-kernel based clustering has outperformed others.
AB - Extracting and analyzing the interesting patterns from spatio-temporal databases, have drawn a great interest in various fields of research. Recently, a number of experiments have explored the problem of spatial or temporal data mining, and some clustering algorithms have been proposed. However, not many studies have been dealing with the integration of spatial data mining and temporal data mining. Moreover, the data in spatial temporal database can be categorized as high-dimensional data. Current density-based clustering might have difficulties with complex data sets including high-dimensional data. This paper presents Iterative Local Gaussian Clustering (ILGC), an algorithm that combines K-nearest neighbour (KNN) density estimation and Kernel density estimation, to cluster the spatiotemporal data. In this approach, the KNN density estimation is extended and combined with Kernel function, where KNN contributes in determining the best local data iteratively for kernel density estimation. The local best is defined as the set of neighbour data that maximizes the kernel function. Bayesian rule is used to deal with the problem of selecting the best local data. This paper utilized Gaussian kernel which has been proven successful in the clustering. To validate the KNN-kernel based algorithm, we compare its performance againts other popular algorithms, such as Self Organizing Maps (SOM) and K-Means, on Crime database. Results show that KNN-kernel based clustering has outperformed others.
KW - Bayesian rule
KW - Gaussian Kernel
KW - Iterative Local Gaussian Clustering
KW - KNN
KW - Kernel clustering
KW - Spatio-temporal database
UR - http://www.scopus.com/inward/record.url?scp=77957758160&partnerID=8YFLogxK
U2 - 10.1109/ICCCE.2010.5556805
DO - 10.1109/ICCCE.2010.5556805
M3 - Conference contribution
AN - SCOPUS:77957758160
SN - 9781424462346
T3 - International Conference on Computer and Communication Engineering, ICCCE'10
BT - International Conference on Computer and Communication Engineering, ICCCE'10
T2 - International Conference on Computer and Communication Engineering, ICCCE'10
Y2 - 11 May 2010 through 12 May 2010
ER -