Deep Learning Approaches for the Prediction of Protein Functional Sites

Knowing which residues of a protein are important for its function is of paramount importance for understanding the molecular basis of this function and devising ways of modifying it for medical or biotechnological applications. Due to the difficulty in detecting these residues experimentally, predi...

Full description

Saved in:
Bibliographic Details
Published inMolecules (Basel, Switzerland) Vol. 30; no. 2; p. 214
Main Authors Pitarch, Borja, Pazos, Florencio
Format Journal Article
LanguageEnglish
Published Switzerland MDPI AG 01.01.2025
MDPI
Subjects
Online AccessGet full text
ISSN1420-3049
1420-3049
DOI10.3390/molecules30020214

Cover

More Information
Summary:Knowing which residues of a protein are important for its function is of paramount importance for understanding the molecular basis of this function and devising ways of modifying it for medical or biotechnological applications. Due to the difficulty in detecting these residues experimentally, prediction methods are essential to cope with the sequence deluge that is filling databases with uncharacterized protein sequences. Deep learning approaches are especially well suited for this task due to the large amounts of protein sequences for training them, the trivial codification of this sequence data to feed into these systems, and the intrinsic sequential nature of the data that makes them suitable for language models. As a consequence, deep learning-based approaches are being applied to the prediction of different types of functional sites and regions in proteins. This review aims to give an overview of the current landscape of methodologies so that interested users can have an idea of which kind of approaches are available for their proteins of interest. We also try to give an idea of how these systems work, as well as explain their limitations and high dependence on the training set so that users are aware of the quality of expected results.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Review-3
content type line 23
ISSN:1420-3049
1420-3049
DOI:10.3390/molecules30020214