Skip to main navigation Skip to search Skip to main content

ECFS-DEA: an ensemble classifier-based feature selection for differential expression analysis on expression profiles

  • Xudong Zhao
  • , Qing Jiao
  • , Hangyu Li
  • , Yiming Wu
  • , Hanxu Wang
  • , Shan Huang
  • , Guohua Wang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Various methods for differential expression analysis have been widely used to identify features which best distinguish between different categories of samples. Multiple hypothesis testing may leave out explanatory features, each of which may be composed of individually insignificant variables. Multivariate hypothesis testing holds a non-mainstream position, considering the large computation overhead of large-scale matrix operation. Random forest provides a classification strategy for calculation of variable importance. However, it may be unsuitable for different distributions of samples. Results: Based on the thought of using an ensemble classifier, we develop a feature selection tool for differential expression analysis on expression profiles (i.e., ECFS-DEA for short). Considering the differences in sample distribution, a graphical user interface is designed to allow the selection of different base classifiers. Inspired by random forest, a common measure which is applicable to any base classifier is proposed for calculation of variable importance. After an interactive selection of a feature on sorted individual variables, a projection heatmap is presented using k-means clustering. ROC curve is also provided, both of which can intuitively demonstrate the effectiveness of the selected feature. Conclusions: Feature selection through ensemble classifiers helps to select important variables and thus is applicable for different sample distributions. Experiments on simulation and realistic data demonstrate the effectiveness of ECFS-DEA for differential expression analysis on expression profiles. The software is available at http://bio-nefu.com/resource/ecfs-dea.

Original languageEnglish
Article number203388
JournalBMC Bioinformatics
Volume21
Issue number1
DOIs
StatePublished - 5 Feb 2020
Externally publishedYes

Keywords

  • Accumulation
  • Classification
  • Differential expression analysis
  • Expression profiles
  • Feature selection

Fingerprint

Dive into the research topics of 'ECFS-DEA: an ensemble classifier-based feature selection for differential expression analysis on expression profiles'. Together they form a unique fingerprint.

Cite this