TemBERTure: advancing protein thermostability prediction with deep learning and attention mechanisms.

Rodella, Chiara; Lazaridi, Symela; Lemmin, Thomas (2024). TemBERTure: advancing protein thermostability prediction with deep learning and attention mechanisms. Bioinformatics advances, 4(1) Oxford University Press 10.1093/bioadv/vbae103

[img]
Preview
Text
vbae103.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (1MB) | Preview

MOTIVATION

Understanding protein thermostability is essential for numerous biotechnological applications, but traditional experimental methods are time-consuming, expensive, and error-prone. Recently, deep learning (DL) techniques from natural language processing (NLP) was extended to the field of biology, since the primary sequence of proteins can be viewed as a string of amino acids that follow a physicochemical grammar.

RESULTS

In this study, we developed TemBERTure, a DL framework that predicts thermostability class and melting temperature from protein sequences. Our findings emphasize the importance of data diversity for training robust models, especially by including sequences from a wider range of organisms. Additionally, we suggest using attention scores from Deep Learning models to gain deeper insights into protein thermostability. Analyzing these scores in conjunction with the 3D protein structure can enhance understanding of the complex interactions among amino acid properties, their positioning, and the surrounding microenvironment. By addressing the limitations of current prediction methods and introducing new exploration avenues, this research paves the way for more accurate and informative protein thermostability predictions, ultimately accelerating advancements in protein engineering.

AVAILABILITY AND IMPLEMENTATION

TemBERTure model and the data are available at: https://github.com/ibmm-unibe-ch/TemBERTure.

Item Type:

Journal Article (Original Article)

Division/Institute:

04 Faculty of Medicine > Pre-clinic Human Medicine > Institute of Biochemistry and Molecular Medicine

Graduate School:

Graduate School for Cellular and Biomedical Sciences (GCB)

UniBE Contributor:

Rodella, Chiara, Lazaridi, Symela, Lemmin, Thomas Max

Subjects:

500 Science > 570 Life sciences; biology
600 Technology > 610 Medicine & health

ISSN:

2635-0041

Publisher:

Oxford University Press

Language:

English

Submitter:

Pubmed Import

Date Deposited:

23 Jul 2024 17:01

Last Modified:

23 Jul 2024 17:10

Publisher DOI:

10.1093/bioadv/vbae103

PubMed ID:

39040220

BORIS DOI:

10.48350/199154

URI:

https://boris.unibe.ch/id/eprint/199154

Actions (login required)

Edit item Edit item
Provide Feedback