Analysis of exome sequences with and without incorporating prior biological knowledge

Junghyun Namkung, Paola Raska, Jia Kang, Yunlong Liu, Qing Lu, Xiaofeng Zhu

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

Next-generation sequencing technology provides new opportunities and challenges in the search for genetic variants that underlie complex traits. It will also presumably uncover many new rare variants, but exactly how these variants should be incorporated into the data analysis remains a question. Several papers in our group from Genetic Analysis Workshop 17 evaluated different methods of rare variant analysis, including single-variant, gene-based, and pathway-based analyses and analyses that incorporated biological information. Although the performance of some of these methods strongly depends on the underlying disease model, integration of known biological information is helpful in detecting causal genes. Two work groups demonstrated that use of a Bayesian network and a collapsing receiver operating characteristic curve approach improves risk prediction when a disease is caused by many rare variants. Another work group suggested that modeling local rather than global ancestry may be beneficial when controlling the effect of population structure in rare variant association analysis.

Original languageEnglish
JournalGenetic Epidemiology
Volume35
Issue numberSUPPL. 1
DOIs
StatePublished - 2011

Fingerprint

Exome
Sequence Analysis
ROC Curve
Genes
Technology
Education
Population

Keywords

  • Association analysis
  • Bayesian network
  • Biological information
  • Population structure
  • Rare variant
  • Receiver operating characteristic
  • Risk prediction model

ASJC Scopus subject areas

  • Genetics(clinical)
  • Epidemiology

Cite this

Analysis of exome sequences with and without incorporating prior biological knowledge. / Namkung, Junghyun; Raska, Paola; Kang, Jia; Liu, Yunlong; Lu, Qing; Zhu, Xiaofeng.

In: Genetic Epidemiology, Vol. 35, No. SUPPL. 1, 2011.

Research output: Contribution to journalArticle

Namkung, Junghyun ; Raska, Paola ; Kang, Jia ; Liu, Yunlong ; Lu, Qing ; Zhu, Xiaofeng. / Analysis of exome sequences with and without incorporating prior biological knowledge. In: Genetic Epidemiology. 2011 ; Vol. 35, No. SUPPL. 1.
@article{46dabc5bdc6048e9b680fbe4e52c2e0e,
title = "Analysis of exome sequences with and without incorporating prior biological knowledge",
abstract = "Next-generation sequencing technology provides new opportunities and challenges in the search for genetic variants that underlie complex traits. It will also presumably uncover many new rare variants, but exactly how these variants should be incorporated into the data analysis remains a question. Several papers in our group from Genetic Analysis Workshop 17 evaluated different methods of rare variant analysis, including single-variant, gene-based, and pathway-based analyses and analyses that incorporated biological information. Although the performance of some of these methods strongly depends on the underlying disease model, integration of known biological information is helpful in detecting causal genes. Two work groups demonstrated that use of a Bayesian network and a collapsing receiver operating characteristic curve approach improves risk prediction when a disease is caused by many rare variants. Another work group suggested that modeling local rather than global ancestry may be beneficial when controlling the effect of population structure in rare variant association analysis.",
keywords = "Association analysis, Bayesian network, Biological information, Population structure, Rare variant, Receiver operating characteristic, Risk prediction model",
author = "Junghyun Namkung and Paola Raska and Jia Kang and Yunlong Liu and Qing Lu and Xiaofeng Zhu",
year = "2011",
doi = "10.1002/gepi.20649",
language = "English",
volume = "35",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "SUPPL. 1",

}

TY - JOUR

T1 - Analysis of exome sequences with and without incorporating prior biological knowledge

AU - Namkung, Junghyun

AU - Raska, Paola

AU - Kang, Jia

AU - Liu, Yunlong

AU - Lu, Qing

AU - Zhu, Xiaofeng

PY - 2011

Y1 - 2011

N2 - Next-generation sequencing technology provides new opportunities and challenges in the search for genetic variants that underlie complex traits. It will also presumably uncover many new rare variants, but exactly how these variants should be incorporated into the data analysis remains a question. Several papers in our group from Genetic Analysis Workshop 17 evaluated different methods of rare variant analysis, including single-variant, gene-based, and pathway-based analyses and analyses that incorporated biological information. Although the performance of some of these methods strongly depends on the underlying disease model, integration of known biological information is helpful in detecting causal genes. Two work groups demonstrated that use of a Bayesian network and a collapsing receiver operating characteristic curve approach improves risk prediction when a disease is caused by many rare variants. Another work group suggested that modeling local rather than global ancestry may be beneficial when controlling the effect of population structure in rare variant association analysis.

AB - Next-generation sequencing technology provides new opportunities and challenges in the search for genetic variants that underlie complex traits. It will also presumably uncover many new rare variants, but exactly how these variants should be incorporated into the data analysis remains a question. Several papers in our group from Genetic Analysis Workshop 17 evaluated different methods of rare variant analysis, including single-variant, gene-based, and pathway-based analyses and analyses that incorporated biological information. Although the performance of some of these methods strongly depends on the underlying disease model, integration of known biological information is helpful in detecting causal genes. Two work groups demonstrated that use of a Bayesian network and a collapsing receiver operating characteristic curve approach improves risk prediction when a disease is caused by many rare variants. Another work group suggested that modeling local rather than global ancestry may be beneficial when controlling the effect of population structure in rare variant association analysis.

KW - Association analysis

KW - Bayesian network

KW - Biological information

KW - Population structure

KW - Rare variant

KW - Receiver operating characteristic

KW - Risk prediction model

UR - http://www.scopus.com/inward/record.url?scp=82455172162&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=82455172162&partnerID=8YFLogxK

U2 - 10.1002/gepi.20649

DO - 10.1002/gepi.20649

M3 - Article

VL - 35

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - SUPPL. 1

ER -