A multi-index ROC-based methodology for high throughput experiments in gene discovery

Dimitri Kagaris, Constantin T. Yiannoutsos

Research output: Contribution to journalArticle

1 Scopus citations

Abstract

We address the problem of ranking differentially expressed genes in high throughput experiments using Receiver Operating Characteristic (ROC) curves. As it is generally unknown whether large expression values constitute 'positive' or 'negative' results or which group is 'healthy' or 'diseased', we generate four ROC curves per gene. We then consider classification indices based on all or part of the four ROC curves and identify genes ranked low by the area under the curve (AUC) but high by at least one alternative index, invariably resulting to the discovery of genes that would otherwise be missed by the AUC index.

Original languageEnglish (US)
Pages (from-to)42-65
Number of pages24
JournalInternational Journal of Data Mining and Bioinformatics
Volume8
Issue number1
DOIs
StatePublished - Jul 8 2013

    Fingerprint

Keywords

  • Breast cancer
  • Gene expression
  • Gene selection
  • Microarray data analysis
  • Ovarian cancer
  • Receiver operating characteristic (ROC) curve

ASJC Scopus subject areas

  • Library and Information Sciences
  • Information Systems
  • Biochemistry, Genetics and Molecular Biology(all)

Cite this