Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions

Hongbo Xie, Slobodan Vucetic, Lilia M. Iakoucheva, Christopher J. Oldfield, A. Dunker, Vladimir N. Uversky, Zoran Obradovic

Research output: Contribution to journalArticle

360 Citations (Scopus)

Abstract

Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200 000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.

Original languageEnglish
Pages (from-to)1882-1898
Number of pages17
JournalJournal of Proteome Research
Volume6
Issue number5
DOIs
StatePublished - May 2007

Fingerprint

Biological Phenomena
Proteins
Intrinsically Disordered Proteins
Databases
Bioinformatics
Bioactivity
Computational Biology
Redundancy
Amino Acid Sequence
Research Personnel
Amino Acids

Keywords

  • Bioinformatics
  • Disorder prediction
  • Intrinsic disorder
  • Intrinsically disordered proteins
  • Protein function
  • Protein structure

ASJC Scopus subject areas

  • Genetics
  • Biotechnology
  • Biochemistry

Cite this

Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. / Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A.; Uversky, Vladimir N.; Obradovic, Zoran.

In: Journal of Proteome Research, Vol. 6, No. 5, 05.2007, p. 1882-1898.

Research output: Contribution to journalArticle

Xie, Hongbo ; Vucetic, Slobodan ; Iakoucheva, Lilia M. ; Oldfield, Christopher J. ; Dunker, A. ; Uversky, Vladimir N. ; Obradovic, Zoran. / Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. In: Journal of Proteome Research. 2007 ; Vol. 6, No. 5. pp. 1882-1898.
@article{724e2d263d3c419d98fa8825ce368676,
title = "Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions",
abstract = "Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200 000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86{\%}. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.",
keywords = "Bioinformatics, Disorder prediction, Intrinsic disorder, Intrinsically disordered proteins, Protein function, Protein structure",
author = "Hongbo Xie and Slobodan Vucetic and Iakoucheva, {Lilia M.} and Oldfield, {Christopher J.} and A. Dunker and Uversky, {Vladimir N.} and Zoran Obradovic",
year = "2007",
month = "5",
doi = "10.1021/pr060392u",
language = "English",
volume = "6",
pages = "1882--1898",
journal = "Journal of Proteome Research",
issn = "1535-3893",
publisher = "American Chemical Society",
number = "5",

}

TY - JOUR

T1 - Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions

AU - Xie, Hongbo

AU - Vucetic, Slobodan

AU - Iakoucheva, Lilia M.

AU - Oldfield, Christopher J.

AU - Dunker, A.

AU - Uversky, Vladimir N.

AU - Obradovic, Zoran

PY - 2007/5

Y1 - 2007/5

N2 - Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200 000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.

AB - Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200 000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.

KW - Bioinformatics

KW - Disorder prediction

KW - Intrinsic disorder

KW - Intrinsically disordered proteins

KW - Protein function

KW - Protein structure

UR - http://www.scopus.com/inward/record.url?scp=33847768609&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33847768609&partnerID=8YFLogxK

U2 - 10.1021/pr060392u

DO - 10.1021/pr060392u

M3 - Article

VL - 6

SP - 1882

EP - 1898

JO - Journal of Proteome Research

JF - Journal of Proteome Research

SN - 1535-3893

IS - 5

ER -