Integrative analysis of somatic mutations and transcriptomic data to functionally stratify breast cancer patients

Jie Zhang, Zachary Abrams, Jeffrey D. Parvin, Kun Huang

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.

Original languageEnglish (US)
Article number513
JournalBMC Genomics
Volume17
DOIs
StatePublished - Aug 22 2016
Externally publishedYes

Fingerprint

Breast Neoplasms
Mutation
Biomarkers
Transcriptome
Genes
Neoplasms
Gene Regulatory Networks
Gene Expression
Survival

Keywords

  • Breast cancer patient stratification
  • Distance correlation
  • Functional analysis of somatic mutation
  • Integrative analysis

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Integrative analysis of somatic mutations and transcriptomic data to functionally stratify breast cancer patients. / Zhang, Jie; Abrams, Zachary; Parvin, Jeffrey D.; Huang, Kun.

In: BMC Genomics, Vol. 17, 513, 22.08.2016.

Research output: Contribution to journalArticle

@article{7da0b803b829402e8f080ae9a427aa1e,
title = "Integrative analysis of somatic mutations and transcriptomic data to functionally stratify breast cancer patients",
abstract = "Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.",
keywords = "Breast cancer patient stratification, Distance correlation, Functional analysis of somatic mutation, Integrative analysis",
author = "Jie Zhang and Zachary Abrams and Parvin, {Jeffrey D.} and Kun Huang",
year = "2016",
month = "8",
day = "22",
doi = "10.1186/s12864-016-2902-0",
language = "English (US)",
volume = "17",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Integrative analysis of somatic mutations and transcriptomic data to functionally stratify breast cancer patients

AU - Zhang, Jie

AU - Abrams, Zachary

AU - Parvin, Jeffrey D.

AU - Huang, Kun

PY - 2016/8/22

Y1 - 2016/8/22

N2 - Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.

AB - Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.

KW - Breast cancer patient stratification

KW - Distance correlation

KW - Functional analysis of somatic mutation

KW - Integrative analysis

UR - http://www.scopus.com/inward/record.url?scp=84983084203&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84983084203&partnerID=8YFLogxK

U2 - 10.1186/s12864-016-2902-0

DO - 10.1186/s12864-016-2902-0

M3 - Article

C2 - 27556157

AN - SCOPUS:84983084203

VL - 17

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - 513

ER -