Evidence for a strong correlation between transcription factor protein disorder and organismic complexity

Inmaculada Yruela, Christopher J. Oldfield, Karl J. Niklas, A. Dunker

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Studies of diverse phylogenetic lineages reveal that protein disorder increases in concert with organismic complexity but that differences nevertheless exist among lineages. To gain insight into this phenomenology, we analyzed all of the transcription factor (TF) families for which sequences are known for 17 species spanning bacteria, yeast, algae, land plants, and animals and for which the number of different cell types has been reported in the primary literature. Although the fraction of disordered residues in TF sequences is often moderately or poorly correlated with organismic complexity as gauged by cell-type number (r2<0.5), an unbiased and phylogenetically broad analysis shows that organismic complexity is positively and strongly correlated with the total number of TFs, the number of their spliced variants and their total disordered residues content (r2>0.8). Furthermore, the correlation between the fraction of disordered residues and cell-type number becomes stronger when confined to the TF families participating in cell cycle, cell size, cell division, cell differentiation, or cell proliferation, and other important developmental processes. The data also indicate that evolutionarily simpler organisms allow for the detection of subtle differences in the conserved IDRs of TFs as well as changes in variable IDRs, which can influence the DNA recognition and multifunctionality of TFs through direct or indirect mechanisms. Although strong correlations cannot be taken as evidence for causeand-effect relationships, we interpret our data to indicate that increasing TF disorder likely was an important factor contributing to the evolution of organismic complexity and not merely a concurrent unrelated effect of increasing organismic complexity.

Original languageEnglish (US)
Pages (from-to)1248-1265
Number of pages18
JournalGenome Biology and Evolution
Volume9
Issue number5
DOIs
StatePublished - 2017

Fingerprint

Transcription Factors
transcription factors
protein
Cell Count
algae
Proteins
proteins
cells
Embryophyta
embryophytes
Cell Size
cell differentiation
Cell Division
yeast
cell division
Cell Differentiation
cell cycle
cell proliferation
Cell Cycle
Yeasts

Keywords

  • Cell-type number
  • Complexity
  • Evolution
  • Intrinsically disordered protein (IDP)
  • Transcription factors

ASJC Scopus subject areas

  • Medicine(all)
  • Ecology, Evolution, Behavior and Systematics
  • Genetics

Cite this

Evidence for a strong correlation between transcription factor protein disorder and organismic complexity. / Yruela, Inmaculada; Oldfield, Christopher J.; Niklas, Karl J.; Dunker, A.

In: Genome Biology and Evolution, Vol. 9, No. 5, 2017, p. 1248-1265.

Research output: Contribution to journalArticle

Yruela, Inmaculada ; Oldfield, Christopher J. ; Niklas, Karl J. ; Dunker, A. / Evidence for a strong correlation between transcription factor protein disorder and organismic complexity. In: Genome Biology and Evolution. 2017 ; Vol. 9, No. 5. pp. 1248-1265.
@article{9e3510842bd84d879c46ba3b1d585107,
title = "Evidence for a strong correlation between transcription factor protein disorder and organismic complexity",
abstract = "Studies of diverse phylogenetic lineages reveal that protein disorder increases in concert with organismic complexity but that differences nevertheless exist among lineages. To gain insight into this phenomenology, we analyzed all of the transcription factor (TF) families for which sequences are known for 17 species spanning bacteria, yeast, algae, land plants, and animals and for which the number of different cell types has been reported in the primary literature. Although the fraction of disordered residues in TF sequences is often moderately or poorly correlated with organismic complexity as gauged by cell-type number (r2<0.5), an unbiased and phylogenetically broad analysis shows that organismic complexity is positively and strongly correlated with the total number of TFs, the number of their spliced variants and their total disordered residues content (r2>0.8). Furthermore, the correlation between the fraction of disordered residues and cell-type number becomes stronger when confined to the TF families participating in cell cycle, cell size, cell division, cell differentiation, or cell proliferation, and other important developmental processes. The data also indicate that evolutionarily simpler organisms allow for the detection of subtle differences in the conserved IDRs of TFs as well as changes in variable IDRs, which can influence the DNA recognition and multifunctionality of TFs through direct or indirect mechanisms. Although strong correlations cannot be taken as evidence for causeand-effect relationships, we interpret our data to indicate that increasing TF disorder likely was an important factor contributing to the evolution of organismic complexity and not merely a concurrent unrelated effect of increasing organismic complexity.",
keywords = "Cell-type number, Complexity, Evolution, Intrinsically disordered protein (IDP), Transcription factors",
author = "Inmaculada Yruela and Oldfield, {Christopher J.} and Niklas, {Karl J.} and A. Dunker",
year = "2017",
doi = "10.1093/gbe/evx073",
language = "English (US)",
volume = "9",
pages = "1248--1265",
journal = "Genome Biology and Evolution",
issn = "1759-6653",
publisher = "Oxford University Press",
number = "5",

}

TY - JOUR

T1 - Evidence for a strong correlation between transcription factor protein disorder and organismic complexity

AU - Yruela, Inmaculada

AU - Oldfield, Christopher J.

AU - Niklas, Karl J.

AU - Dunker, A.

PY - 2017

Y1 - 2017

N2 - Studies of diverse phylogenetic lineages reveal that protein disorder increases in concert with organismic complexity but that differences nevertheless exist among lineages. To gain insight into this phenomenology, we analyzed all of the transcription factor (TF) families for which sequences are known for 17 species spanning bacteria, yeast, algae, land plants, and animals and for which the number of different cell types has been reported in the primary literature. Although the fraction of disordered residues in TF sequences is often moderately or poorly correlated with organismic complexity as gauged by cell-type number (r2<0.5), an unbiased and phylogenetically broad analysis shows that organismic complexity is positively and strongly correlated with the total number of TFs, the number of their spliced variants and their total disordered residues content (r2>0.8). Furthermore, the correlation between the fraction of disordered residues and cell-type number becomes stronger when confined to the TF families participating in cell cycle, cell size, cell division, cell differentiation, or cell proliferation, and other important developmental processes. The data also indicate that evolutionarily simpler organisms allow for the detection of subtle differences in the conserved IDRs of TFs as well as changes in variable IDRs, which can influence the DNA recognition and multifunctionality of TFs through direct or indirect mechanisms. Although strong correlations cannot be taken as evidence for causeand-effect relationships, we interpret our data to indicate that increasing TF disorder likely was an important factor contributing to the evolution of organismic complexity and not merely a concurrent unrelated effect of increasing organismic complexity.

AB - Studies of diverse phylogenetic lineages reveal that protein disorder increases in concert with organismic complexity but that differences nevertheless exist among lineages. To gain insight into this phenomenology, we analyzed all of the transcription factor (TF) families for which sequences are known for 17 species spanning bacteria, yeast, algae, land plants, and animals and for which the number of different cell types has been reported in the primary literature. Although the fraction of disordered residues in TF sequences is often moderately or poorly correlated with organismic complexity as gauged by cell-type number (r2<0.5), an unbiased and phylogenetically broad analysis shows that organismic complexity is positively and strongly correlated with the total number of TFs, the number of their spliced variants and their total disordered residues content (r2>0.8). Furthermore, the correlation between the fraction of disordered residues and cell-type number becomes stronger when confined to the TF families participating in cell cycle, cell size, cell division, cell differentiation, or cell proliferation, and other important developmental processes. The data also indicate that evolutionarily simpler organisms allow for the detection of subtle differences in the conserved IDRs of TFs as well as changes in variable IDRs, which can influence the DNA recognition and multifunctionality of TFs through direct or indirect mechanisms. Although strong correlations cannot be taken as evidence for causeand-effect relationships, we interpret our data to indicate that increasing TF disorder likely was an important factor contributing to the evolution of organismic complexity and not merely a concurrent unrelated effect of increasing organismic complexity.

KW - Cell-type number

KW - Complexity

KW - Evolution

KW - Intrinsically disordered protein (IDP)

KW - Transcription factors

UR - http://www.scopus.com/inward/record.url?scp=85026630385&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85026630385&partnerID=8YFLogxK

U2 - 10.1093/gbe/evx073

DO - 10.1093/gbe/evx073

M3 - Article

C2 - 28430951

AN - SCOPUS:85026630385

VL - 9

SP - 1248

EP - 1265

JO - Genome Biology and Evolution

JF - Genome Biology and Evolution

SN - 1759-6653

IS - 5

ER -