Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study

Zhigeng Geng, Sijian Wang, Menggang Yu, Patrick O. Monahan, Victoria Champion, Grace Wahba

Research output: Contribution to journalArticle

5 Scopus citations

Abstract

In many scientific and engineering applications, covariates are naturally grouped. When the group structures are available among covariates, people are usually interested in identifying both important groups and important variables within the selected groups. Among existing successful group variable selection methods, some methods fail to conduct the within group selection. Some methods are able to conduct both group and within group selection, but the corresponding objective functions are non-convex. Such a non-convexity may require extra numerical effort. In this article, we propose a novel Log-Exp-Sum(LES) penalty for group variable selection. The LES penalty is strictly convex. It can identify important groups as well as select important variables within the group. We develop an efficient group-level coordinate descent algorithm to fit the model. We also derive non-asymptotic error bounds and asymptotic group selection consistency for our method in the high-dimensional setting where the number of covariates can be much larger than the sample size. Numerical results demonstrate the good performance of our method in both variable selection and prediction. We applied the proposed method to an American Cancer Society breast cancer survivor dataset. The findings are clinically meaningful and may help design intervention programs to improve the qualify of life for breast cancer survivors.

Original languageEnglish (US)
Pages (from-to)53-62
Number of pages10
JournalBiometrics
Volume71
Issue number1
DOIs
StatePublished - Mar 1 2015

Keywords

  • Breast cancer survivor
  • Finite sample bound
  • Group variable selection
  • High-dimensional data
  • Penalized estimation
  • Sparsity recovery

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)
  • Agricultural and Biological Sciences(all)
  • Applied Mathematics

Fingerprint Dive into the research topics of 'Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study'. Together they form a unique fingerprint.

  • Cite this