n-means: Adaptive Clustering Microaggregation of Categorical Medical Data

Imran Daud
Abstract:
A huge amount of information is managed and shared publicly by individuals and data controllers. Publicly shared data contains information that can reveal the identity of users, thus affecting the privacy of individuals. To mitigate these disclosure risks, Statistical Disclosure Control (SDC) methods are applied to the data before it is released. Microaggregation is an SDC method that aggregates similar records into clusters and then transforms them into m indistinguishable records. K-means is a well-known data mining clustering algorithm for continuous data, which iteratively maps similar elements into k clusters until the assignments converge. However, adapting the k-means algorithm to categorical multivariate data is a challenging task due to the high dimensionality of the attributes. In this paper, we extend the k-means clustering algorithm to microaggregate structured data. Moreover, to preserve data utility, we extend the fixed clustering nature of this algorithm to adaptive-size clusters. For this purpose, we introduce the n-means clustering approach, which constructs clusters based on the semantics of the datasets. In experiments, we demonstrate the significance of our proposed system by measuring the cohesion of clusters and the information loss for utility purposes.
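To make the general idea of microaggregation on categorical records concrete, here is a minimal sketch. It is not the n-means algorithm from the paper, whose details are not given in this abstract. It assumes a simple greedy fixed-size grouping, Hamming distance as the dissimilarity measure, and the attribute-wise mode as the cluster representative. The function names (`hamming`, `mode_record`, `microaggregate`) and the toy records are hypothetical and chosen only for illustration.

```python
from collections import Counter


def hamming(a, b):
    """Number of attributes on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))


def mode_record(records):
    """Attribute-wise most frequent value (ties go to the value seen first)."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def microaggregate(records, k):
    """Greedy fixed-size grouping (an assumption, not the paper's method).

    Every cluster ends up with at least k records, and each record is
    replaced by the mode record of its cluster.
    """
    if k < 1 or len(records) < k:
        raise ValueError("need k >= 1 and at least k records")

    remaining = list(range(len(records)))
    clusters = []
    # Peel off groups of exactly k while at least k records would be left over.
    while len(remaining) >= 2 * k:
        seed = remaining[0]
        nearest = sorted(
            remaining, key=lambda i: hamming(records[seed], records[i])
        )[:k]
        clusters.append(nearest)
        remaining = [i for i in remaining if i not in nearest]
    # The last cluster holds between k and 2k - 1 records.
    clusters.append(remaining)

    masked = [None] * len(records)
    for cluster in clusters:
        centroid = mode_record([records[i] for i in cluster])
        for i in cluster:
            masked[i] = centroid
    return masked, clusters


if __name__ == "__main__":
    # Toy (diagnosis, age group) records, invented for this example.
    data = [
        ("flu", "adult"), ("flu", "adult"), ("flu", "child"),
        ("asthma", "child"), ("asthma", "child"), ("asthma", "adult"),
    ]
    masked, clusters = microaggregate(data, k=3)
    for original, released in zip(data, masked):
        print(original, "->", released)
```

After masking, every released record is shared by at least k original records, which is the indistinguishability property the abstract refers to. An adaptive scheme such as the one the paper proposes would let cluster sizes vary with the data rather than fixing them in advance.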
Year: 2019
Type of Publication: Article
Journal: Springer: Intelligent Computing. CompCom 2019. Advances in Intelligent Systems and Computing
Pages: 13-28
Month: 7
