Skip to main content
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Volume 10563 LNCS, 2017, Pages 248-2617th International Conference on Model and Data Engineering, MEDI 2017; Barcelona; Spain; 4 October 2017 through 6 October 2017; Code 199259

A feature selection method based on feature correlation networks(Conference Paper)

  Save all to author list
  • aDepartment of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Novi Sad, Serbia
  • bFaculty of Computer and Information Science, Univeristy of Ljubljana, Ljubljana, Slovenia

Abstract

Feature selection is an important data preprocessing step in data mining and machine learning tasks, especially in the case of high dimensional data. In this paper we present a novel feature selection method based on complex weighted networks describing the strongest correlations among features. The method relies on community detection techniques to identify cohesive groups of features. A subset of features exhibiting a strong association with the class feature is selected from each identified community of features taking into account the size of and connections within the community. The proposed method is evaluated on a high dimensional dataset containing signaling protein features related to the diagnosis of Alzheimer’s disease. We compared the performance of seven widely used classifiers that were trained without feature selection, with correlation-based feature selection by a state-of-the-art method provided by the WEKA tool, and with feature selection by four variants of our method determined by four different community detection techniques. The results of the evaluation indicate that our method improves the classification accuracy of several classification models while drastically reducing the dimensionality of the dataset. Additionally, one variant of our method outperforms the correlation-based feature selection method implemented in WEKA. © 2017, Springer International Publishing AG.

Author keywords

Alzheimer’s diseaseCommunity detectionFeature correlation networksFeature selection

Indexed keywords

Engineering controlled terms:Clustering algorithmsData miningDiagnosisFeature extractionLearning systemsPopulation dynamics
Engineering uncontrolled termsAlzheimerClassification accuracyCommunity detectionFeature correlationFeature selection methodsHigh dimensional dataHigh-dimensional datasetState-of-the-art methods
Engineering main heading:Classification (of information)

Funding details

Funding sponsor Funding number Acronym
Javna Agencija za Raziskovalno Dejavnost RSOI174023ARRS
Ministarstvo Prosvete, Nauke i Tehnološkog RazvojaMPNTR
  • 1

    Acknowledgments. This work is supported by the bilateral project “Intelligent computer techniques for improving medical detection, analysis and explanation of human cognition and behavior disorders” between the Ministry of Education, Science and Technological Development of the Republic of Serbia and the Slovenian Research Agency. M. Savić, V. Kurbalija and M. Ivanović also thank the Ministry of Education, Science and Technological Development of the Republic of Serbia for additional support through project no. OI174023, “Intelligent techniques and their integration into wide-spectrum decision support.”

  • ISSN: 03029743
  • ISBN: 978-331966853-6
  • Source Type: Book Series
  • Original language: English
  • DOI: 10.1007/978-3-319-66854-3_19
  • Document Type: Conference Paper
  • Volume Editors: Abello A.,Ouhammou Y.,Bellatreche L.,Ivanovic M.
  • Sponsors:
  • Publisher: Springer Verlag

  Savić, M.; Department of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Novi Sad, Serbia;
© Copyright 2017 Elsevier B.V., All rights reserved.

Cited by 16 documents

Sheuly, S.S. , Deivard, J. , Seceleanu, T.
Feature Selection Based on Membrane Clustering
(2025) Communications in Computer and Information Science
Gomez-Cravioto, D.A. , Diaz-Ramos, R.E. , Cantu-Ortiz, F.J.
Data Analysis and Forecasting of the COVID-19 Spread: A Comparison of Recurrent Neural Networks and Time Series Models
(2024) Cognitive Computation
Almutairi, M.S.
Evolutionary Multi-Objective Feature Selection Algorithms on Multiple Smart Sustainable Community Indicator Datasets
(2024) Sustainability (Switzerland)
View details of all 16 citations
{"topic":{"name":"Feature Selection; Support Vector Machine; Data Mining","id":188,"uri":"Topic/188","prominencePercentile":99.17134,"prominencePercentileString":"99.171","overallScholarlyOutput":0},"dig":"6f6f157ff23bd9462ab8fbc48b5487896187d6a62969362c3f21fe391c241c4e"}

SciVal Topic Prominence

Topic:
Prominence percentile: