Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment

Qian Gao, Jian Hua, Rich Kimura, Jeffery J. Headd, Xin Yuan Fu, Eugene Y. Chin

Research output: Contribution to journalArticle

20 Citations (Scopus)

Abstract

The availability of large volumes of genomic sequences presents an unprecedented proteomic challenge to characterize the structure and function of various protein motifs. Primary structural alignment is often unable to accurately identify a given motif due to sequence divergence; however, with the aid of secondary structural prediction for analysis, it becomes feasible to explore protein motifs on a proteome-wide scale. Here we report the use of secondary structural alignment to characterize the Src homology 2 (SH2) domains of both conventional and divergent sequences and divide them into two groups, Src-type and STAT-type. In addition to the basic "αβββα" structure (β B), the Src-type SH2 domain contains an extra β-strand (βE or βE-βF motif). Alternatively, the linker domain-conjugated SH2 domain in STAT contains the αB′ motif. Combining BLAST data from βB core motif sequences with predicted secondary structural alignment, we have screened for SH2 domains in various eukaryotic model systems including Arabidopsis, Dictyostelium, and Saccharomyces. Two novel genes carrying the linker-SH2 domain of STAT were discovered and subsequently cloned from Arabidopsis. These genes, designated as STAT-type linker-SH2 domain factors (STATL), are found in a wide array of vascular and nonvascular plants, suggesting that the linker-SH2 domain evolved prior to the divergence of plants and animals. Using this approach, we expanded the number of putative SH2 domain-bearing genes in Dictyostelium and comparatively studied the secondary structural profiles of both typical and atypical SH2 domains. Our results indicate that the linker-SH2 domain of the transcription factor STAT is one of the most ancient and fully developed functional domains, serving as a template for the continuing evolution of the SH2 domain essential for phosphotyrosine signal transduction.

Original languageEnglish (US)
Pages (from-to)704-714
Number of pages11
JournalMolecular & cellular proteomics : MCP
Volume3
Issue number7
DOIs
StatePublished - Jul 2004
Externally publishedYes

Fingerprint

src Homology Domains
Genes
Bearings (structural)
STAT Transcription Factors
Signal transduction
Phosphotyrosine
Proteome
Animals
Proteins
Availability
Amino Acid Motifs
Dictyostelium
Arabidopsis
Saccharomyces
Proteomics
Blood Vessels
Signal Transduction

ASJC Scopus subject areas

  • Biochemistry

Cite this

Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment. / Gao, Qian; Hua, Jian; Kimura, Rich; Headd, Jeffery J.; Fu, Xin Yuan; Chin, Eugene Y.

In: Molecular & cellular proteomics : MCP, Vol. 3, No. 7, 07.2004, p. 704-714.

Research output: Contribution to journalArticle

Gao, Qian ; Hua, Jian ; Kimura, Rich ; Headd, Jeffery J. ; Fu, Xin Yuan ; Chin, Eugene Y. / Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment. In: Molecular & cellular proteomics : MCP. 2004 ; Vol. 3, No. 7. pp. 704-714.
@article{d7200db3bd194b678aede4666530d978,
title = "Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment",
abstract = "The availability of large volumes of genomic sequences presents an unprecedented proteomic challenge to characterize the structure and function of various protein motifs. Primary structural alignment is often unable to accurately identify a given motif due to sequence divergence; however, with the aid of secondary structural prediction for analysis, it becomes feasible to explore protein motifs on a proteome-wide scale. Here we report the use of secondary structural alignment to characterize the Src homology 2 (SH2) domains of both conventional and divergent sequences and divide them into two groups, Src-type and STAT-type. In addition to the basic {"}αβββα{"} structure (β B), the Src-type SH2 domain contains an extra β-strand (βE or βE-βF motif). Alternatively, the linker domain-conjugated SH2 domain in STAT contains the αB′ motif. Combining BLAST data from βB core motif sequences with predicted secondary structural alignment, we have screened for SH2 domains in various eukaryotic model systems including Arabidopsis, Dictyostelium, and Saccharomyces. Two novel genes carrying the linker-SH2 domain of STAT were discovered and subsequently cloned from Arabidopsis. These genes, designated as STAT-type linker-SH2 domain factors (STATL), are found in a wide array of vascular and nonvascular plants, suggesting that the linker-SH2 domain evolved prior to the divergence of plants and animals. Using this approach, we expanded the number of putative SH2 domain-bearing genes in Dictyostelium and comparatively studied the secondary structural profiles of both typical and atypical SH2 domains. Our results indicate that the linker-SH2 domain of the transcription factor STAT is one of the most ancient and fully developed functional domains, serving as a template for the continuing evolution of the SH2 domain essential for phosphotyrosine signal transduction.",
author = "Qian Gao and Jian Hua and Rich Kimura and Headd, {Jeffery J.} and Fu, {Xin Yuan} and Chin, {Eugene Y.}",
year = "2004",
month = "7",
doi = "10.1074/mcp.M300131-MCP200",
language = "English (US)",
volume = "3",
pages = "704--714",
journal = "Molecular and Cellular Proteomics",
issn = "1535-9476",
publisher = "American Society for Biochemistry and Molecular Biology Inc.",
number = "7",

}

TY - JOUR

T1 - Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment

AU - Gao, Qian

AU - Hua, Jian

AU - Kimura, Rich

AU - Headd, Jeffery J.

AU - Fu, Xin Yuan

AU - Chin, Eugene Y.

PY - 2004/7

Y1 - 2004/7

N2 - The availability of large volumes of genomic sequences presents an unprecedented proteomic challenge to characterize the structure and function of various protein motifs. Primary structural alignment is often unable to accurately identify a given motif due to sequence divergence; however, with the aid of secondary structural prediction for analysis, it becomes feasible to explore protein motifs on a proteome-wide scale. Here we report the use of secondary structural alignment to characterize the Src homology 2 (SH2) domains of both conventional and divergent sequences and divide them into two groups, Src-type and STAT-type. In addition to the basic "αβββα" structure (β B), the Src-type SH2 domain contains an extra β-strand (βE or βE-βF motif). Alternatively, the linker domain-conjugated SH2 domain in STAT contains the αB′ motif. Combining BLAST data from βB core motif sequences with predicted secondary structural alignment, we have screened for SH2 domains in various eukaryotic model systems including Arabidopsis, Dictyostelium, and Saccharomyces. Two novel genes carrying the linker-SH2 domain of STAT were discovered and subsequently cloned from Arabidopsis. These genes, designated as STAT-type linker-SH2 domain factors (STATL), are found in a wide array of vascular and nonvascular plants, suggesting that the linker-SH2 domain evolved prior to the divergence of plants and animals. Using this approach, we expanded the number of putative SH2 domain-bearing genes in Dictyostelium and comparatively studied the secondary structural profiles of both typical and atypical SH2 domains. Our results indicate that the linker-SH2 domain of the transcription factor STAT is one of the most ancient and fully developed functional domains, serving as a template for the continuing evolution of the SH2 domain essential for phosphotyrosine signal transduction.

AB - The availability of large volumes of genomic sequences presents an unprecedented proteomic challenge to characterize the structure and function of various protein motifs. Primary structural alignment is often unable to accurately identify a given motif due to sequence divergence; however, with the aid of secondary structural prediction for analysis, it becomes feasible to explore protein motifs on a proteome-wide scale. Here we report the use of secondary structural alignment to characterize the Src homology 2 (SH2) domains of both conventional and divergent sequences and divide them into two groups, Src-type and STAT-type. In addition to the basic "αβββα" structure (β B), the Src-type SH2 domain contains an extra β-strand (βE or βE-βF motif). Alternatively, the linker domain-conjugated SH2 domain in STAT contains the αB′ motif. Combining BLAST data from βB core motif sequences with predicted secondary structural alignment, we have screened for SH2 domains in various eukaryotic model systems including Arabidopsis, Dictyostelium, and Saccharomyces. Two novel genes carrying the linker-SH2 domain of STAT were discovered and subsequently cloned from Arabidopsis. These genes, designated as STAT-type linker-SH2 domain factors (STATL), are found in a wide array of vascular and nonvascular plants, suggesting that the linker-SH2 domain evolved prior to the divergence of plants and animals. Using this approach, we expanded the number of putative SH2 domain-bearing genes in Dictyostelium and comparatively studied the secondary structural profiles of both typical and atypical SH2 domains. Our results indicate that the linker-SH2 domain of the transcription factor STAT is one of the most ancient and fully developed functional domains, serving as a template for the continuing evolution of the SH2 domain essential for phosphotyrosine signal transduction.

UR - http://www.scopus.com/inward/record.url?scp=4043165581&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=4043165581&partnerID=8YFLogxK

U2 - 10.1074/mcp.M300131-MCP200

DO - 10.1074/mcp.M300131-MCP200

M3 - Article

VL - 3

SP - 704

EP - 714

JO - Molecular and Cellular Proteomics

JF - Molecular and Cellular Proteomics

SN - 1535-9476

IS - 7

ER -