KRIBB Repository

검색

Open Access@KRIBB1. Journal Articles Journal Articles

MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering

Cited 58 time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.contributor.author	E Y Kim	-
dc.contributor.author	Seon-Young Kim	-
dc.contributor.author	D Ashlock	-
dc.contributor.author	D Nam	-
dc.date.accessioned	2017-04-19T09:14:33Z	-
dc.date.available	2017-04-19T09:14:33Z	-
dc.date.issued	2009	-
dc.identifier.issn	1471-2105	-
dc.identifier.uri	10.1186/1471-2105-10-260	ko
dc.identifier.uri	https://oak.kribb.re.kr/handle/201005/9101	-
dc.description.abstract	Background: Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. Results: We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. Conclusion: The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors.	-
dc.publisher	Springer-BMC	-
dc.title	MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering	-
dc.title.alternative	MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering	-
dc.type	Article	-
dc.citation.title	BMC Bioinformatics	-
dc.citation.number	0	-
dc.citation.endPage	260	-
dc.citation.startPage	260	-
dc.citation.volume	10	-
dc.contributor.affiliatedAuthor	Seon-Young Kim	-
dc.contributor.alternativeName	김은윤	-
dc.contributor.alternativeName	김선영	-
dc.contributor.alternativeName	Ashlock	-
dc.contributor.alternativeName	남덕우	-
dc.identifier.bibliographicCitation	BMC Bioinformatics, vol. 10, pp. 260-260	-
dc.identifier.doi	10.1186/1471-2105-10-260	-
dc.description.journalClass	Y	-

Appears in Collections:: 1. Journal Articles > Journal Articles

Files in This Item:

8342.pdf 876.06 kB / Adobe PDF
Download

Show simple item record

qrcode

트윗하기

Open Access@KRIBB was built as a OAK to the National Central Library.