Skip to main content
Journal of Biomedical InformaticsVolume 62, 1 August 2016, Pages 12-20

A kernel-based clustering method for gene selection with gene expression data(Article)(Open Access)

  Save all to author list
  • aSchool of Mathematics and Statistics, Shandong University at Weihai, Weihai, 264209, China
  • bFaculty of Science, University of Kragujevac, P.O. Box 60, Kragujevac, 34000, Serbia

Abstract

Gene selection is important for cancer classification based on gene expression data, because of high dimensionality and small sample size. In this paper, we present a new gene selection method based on clustering, in which dissimilarity measures are obtained through kernel functions. It searches for best weights of genes iteratively at the same time to optimize the clustering objective function. Adaptive distance is used in the process, which is suitable to learn the weights of genes during the clustering process, improving the performance of the algorithm. The proposed algorithm is simple and does not require any modification or parameter optimization for each dataset. We tested it on eight publicly available datasets, using two classifiers (support vector machine, k-nearest neighbor), compared with other six competitive feature selectors. The results show that the proposed algorithm is capable of achieving better accuracies and may be an efficient tool for finding possible biomarkers from gene expression data. © 2016 Elsevier Inc.

Author keywords

Adaptive distanceCancer classificationGene expression dataGene selectionKernel-based clustering

Indexed keywords

Engineering controlled terms:Classification (of information)Cluster analysisClustering algorithmsDiseasesGenesIterative methodsNearest neighbor searchOptimizationText processing
Engineering uncontrolled termsAdaptive distanceCancer classificationGene Expression DataGene selectionKernel-based clustering
Engineering main heading:Gene expression
EMTREE medical terms:accuracyalgorithmArticlecancer classificationcancer diagnosiscontrolled studygene expressiongenetic selectionhumanintermethod comparisonk nearest neighborkernel methodmolecular biologypriority journalprocess optimizationsupport vector machinealgorithmcluster analysisgene expressiongene expression profilingsupport vector machine
MeSH:AlgorithmsCluster AnalysisGene ExpressionGene Expression ProfilingHumansSupport Vector Machine

Funding details

Funding sponsor Funding number Acronym
Natural Science Foundation of Shandong ProvinceZR2015AM017,ZR2015AM019
  • 1

    This work was supported in part by the Shandong Natural Science Foundation ( ZR2015AM017 , ZR2015AM019 ).

  • ISSN: 15320464
  • CODEN: JBIOB
  • Source Type: Journal
  • Original language: English
  • DOI: 10.1016/j.jbi.2016.05.007
  • PubMed ID: 27215190
  • Document Type: Article
  • Publisher: Academic Press Inc.

  Zhang, Y.; School of Mathematics and Statistics, Shandong University at Weihai, Weihai, China;
© Copyright 2017 Elsevier B.V., All rights reserved.

Cited by 62 documents

Shikalgar, A. , Sonavane, S.
Deep kernelized dimensionality reducer for multi-modality heterogeneous data
(2024) Journal of Ambient Intelligence and Humanized Computing
Bhonde, S.B. , Wagh, S.K. , Prasad, J.R.
Meticulous predictive modelling for classification of cancerous molecular profiles
(2024) Multimedia Tools and Applications
Gulati, A. , Taneja, A. , Rawat, S.
Cancer prediction using graph database
(2024) Machine Learning Techniques and Industry Applications
View details of all 62 citations
{"topic":{"name":"Feature Selection; Support Vector Machine; Data Mining","id":188,"uri":"Topic/188","prominencePercentile":99.17134,"prominencePercentileString":"99.171","overallScholarlyOutput":0},"dig":"6f6f157ff23bd9462ab8fbc48b5487896187d6a62969362c3f21fe391c241c4e"}

SciVal Topic Prominence

Topic:
Prominence percentile: