DISOPRED3: precise disordered region predictions with annotated protein-binding activity

DT Jones, D Cozzetto - Bioinformatics, 2015 - academic.oup.com
Bioinformatics, 2015academic.oup.com
Motivation: A sizeable fraction of eukaryotic proteins contain intrinsically disordered regions
(IDRs), which act in unfolded states or by undergoing transitions between structured and
unstructured conformations. Over time, sequence-based classifiers of IDRs have become
fairly accurate and currently a major challenge is linking IDRs to their biological roles from
the molecular to the systems level. Results: We describe DISOPRED3, which extends its
predecessor with new modules to predict IDRs and protein-binding sites within them. Based …
Abstract
Motivation: A sizeable fraction of eukaryotic proteins contain intrinsically disordered regions (IDRs), which act in unfolded states or by undergoing transitions between structured and unstructured conformations. Over time, sequence-based classifiers of IDRs have become fairly accurate and currently a major challenge is linking IDRs to their biological roles from the molecular to the systems level.
Results: We describe DISOPRED3, which extends its predecessor with new modules to predict IDRs and protein-binding sites within them. Based on recent CASP evaluation results, DISOPRED3 can be regarded as state of the art in the identification of IDRs, and our self-assessment shows that it significantly improves over DISOPRED2 because its predictions are more specific across the whole board and more sensitive to IDRs longer than 20 amino acids. Predicted IDRs are annotated as protein binding through a novel SVM based classifier, which uses profile data and additional sequence-derived features. Based on benchmarking experiments with full cross-validation, we show that this predictor generates precise assignments of disordered protein binding regions and that it compares well with other publicly available tools.
Availability and implementation:  http://bioinf.cs.ucl.ac.uk/disopred
Contact:  d.t.jones@ucl.ac.uk
Supplementary information:  Supplementary data are available at Bioinformatics online.
Oxford University Press