

Gene selection is important for cancer classification based on gene expression data, because of high dimensionality and small sample size. In this paper, we present a new gene selection method based on clustering, in which dissimilarity measures are obtained through kernel functions. It searches for best weights of genes iteratively at the same time to optimize the clustering objective function. Adaptive distance is used in the process, which is suitable to learn the weights of genes during the clustering process, improving the performance of the algorithm. The proposed algorithm is simple and does not require any modification or parameter optimization for each dataset. We tested it on eight publicly available datasets, using two classifiers (support vector machine, k-nearest neighbor), compared with other six competitive feature selectors. The results show that the proposed algorithm is capable of achieving better accuracies and may be an efficient tool for finding possible biomarkers from gene expression data. © 2016 Elsevier Inc.
| Engineering controlled terms: | Classification (of information)Cluster analysisClustering algorithmsDiseasesGenesIterative methodsNearest neighbor searchOptimizationText processing |
|---|---|
| Engineering uncontrolled terms | Adaptive distanceCancer classificationGene Expression DataGene selectionKernel-based clustering |
| Engineering main heading: | Gene expression |
| EMTREE medical terms: | accuracyalgorithmArticlecancer classificationcancer diagnosiscontrolled studygene expressiongenetic selectionhumanintermethod comparisonk nearest neighborkernel methodmolecular biologypriority journalprocess optimizationsupport vector machinealgorithmcluster analysisgene expressiongene expression profilingsupport vector machine |
| MeSH: | AlgorithmsCluster AnalysisGene ExpressionGene Expression ProfilingHumansSupport Vector Machine |
| Funding sponsor | Funding number | Acronym |
|---|---|---|
| Natural Science Foundation of Shandong Province | ZR2015AM017,ZR2015AM019 |
This work was supported in part by the Shandong Natural Science Foundation ( ZR2015AM017 , ZR2015AM019 ).
Zhang, Y.; School of Mathematics and Statistics, Shandong University at Weihai, Weihai, China;
© Copyright 2017 Elsevier B.V., All rights reserved.