Publications

Journal

A meta-learning approach for determining the number of clusters with consideration of nearest neighbors

페이지 정보

profile_image
작성자 관리자
조회 1,569회 작성일 20-12-19 15:37

본문

Journal Information Sciences, 232, 208-224.
Name Lee, J.-S. and Olafsson, S.
Year 2013

An important and challenging problem in data clustering is the determination of the best number of clusters. A variety of estimation methods has been proposed over the years to address this problem. Most of these methods depend on several nontrivial assumptions about the data structure; and such methods may thus fail to discover the true clusters in a dataset that does not satisfy those assumptions. We develop a new approach that takes as a starting point the simple and intuitive observation that close objects should fall within the same cluster, whereas distant ones should not. Based on this simple notion we utilize a new measurement of good clustering called disconnectivity as well as existing goodness measurements; and we embed these measures into a meta-learning approach for estimating the number of clusters. A simulation experiment based on 13 representative models and an application to real world datasets are conducted to show the effectiveness of the proposed method.