Rodella, Chiara; Lazaridi, Symela; Lemmin, Thomas (2024). TemBERTure: advancing protein thermostability prediction with deep learning and attention mechanisms. Bioinformatics advances, 4(1) Oxford University Press 10.1093/bioadv/vbae103
|
Text
vbae103.pdf - Published Version Available under License Creative Commons: Attribution (CC-BY). Download (1MB) | Preview |
MOTIVATION
Understanding protein thermostability is essential for numerous biotechnological applications, but traditional experimental methods are time-consuming, expensive, and error-prone. Recently, deep learning (DL) techniques from natural language processing (NLP) was extended to the field of biology, since the primary sequence of proteins can be viewed as a string of amino acids that follow a physicochemical grammar.
RESULTS
In this study, we developed TemBERTure, a DL framework that predicts thermostability class and melting temperature from protein sequences. Our findings emphasize the importance of data diversity for training robust models, especially by including sequences from a wider range of organisms. Additionally, we suggest using attention scores from Deep Learning models to gain deeper insights into protein thermostability. Analyzing these scores in conjunction with the 3D protein structure can enhance understanding of the complex interactions among amino acid properties, their positioning, and the surrounding microenvironment. By addressing the limitations of current prediction methods and introducing new exploration avenues, this research paves the way for more accurate and informative protein thermostability predictions, ultimately accelerating advancements in protein engineering.
AVAILABILITY AND IMPLEMENTATION
TemBERTure model and the data are available at: https://github.com/ibmm-unibe-ch/TemBERTure.
Item Type: |
Journal Article (Original Article) |
---|---|
Division/Institute: |
04 Faculty of Medicine > Pre-clinic Human Medicine > Institute of Biochemistry and Molecular Medicine |
Graduate School: |
Graduate School for Cellular and Biomedical Sciences (GCB) |
UniBE Contributor: |
Rodella, Chiara, Lazaridi, Symela, Lemmin, Thomas Max |
Subjects: |
500 Science > 570 Life sciences; biology 600 Technology > 610 Medicine & health |
ISSN: |
2635-0041 |
Publisher: |
Oxford University Press |
Language: |
English |
Submitter: |
Pubmed Import |
Date Deposited: |
23 Jul 2024 17:01 |
Last Modified: |
23 Jul 2024 17:10 |
Publisher DOI: |
10.1093/bioadv/vbae103 |
PubMed ID: |
39040220 |
BORIS DOI: |
10.48350/199154 |
URI: |
https://boris.unibe.ch/id/eprint/199154 |