1. Academic Validation
  2. Systematic prediction of degrons and E3 ubiquitin ligase binding via deep learning

Systematic prediction of degrons and E3 ubiquitin ligase binding via deep learning

  • BMC Biol. 2022 Jul 14;20(1):162. doi: 10.1186/s12915-022-01364-6.
Chao Hou 1 2 Yuxuan Li 1 2 Mengyao Wang 3 4 Hong Wu 3 4 5 Tingting Li 6 7
Affiliations

Affiliations

  • 1 Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China.
  • 2 Key Laboratory for Neuroscience, Ministry of Education/National Health Commission of China, Peking University, Beijing, 100191, China.
  • 3 The MOE Key Laboratory of Cell Proliferation and Differentiation, School of Life Sciences, Peking University, Beijing, 100871, China.
  • 4 Peking-Tsinghua Center for Life Sciences, Beijing, China.
  • 5 Institute for Cancer Research, Shenzhen Bay Laboratory, Shenzhen, China.
  • 6 Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China. litt@hsc.pku.edu.cn.
  • 7 Key Laboratory for Neuroscience, Ministry of Education/National Health Commission of China, Peking University, Beijing, 100191, China. litt@hsc.pku.edu.cn.
Abstract

Background: Degrons are short linear motifs, bound by E3 ubiquitin ligase to target protein substrates to be degraded by the ubiquitin-proteasome system. Mutations leading to deregulation of degron functionality disrupt control of protein abundance due to mistargeting of proteins destined for degradation and often result in pathologies. Targeting degrons by small molecules also emerges as an exciting drug design strategy to upregulate the expression of specific proteins. Despite their essential function and disease targetability, reliable identification of degrons remains a conundrum. Here, we developed a deep learning-based model named Degpred that predicts general degrons directly from protein sequences.

Results: We showed that the BERT-based model performed well in predicting degrons singly from protein sequences. Then, we used the deep learning model Degpred to predict degrons proteome-widely. Degpred successfully captured typical degron-related sequence properties and predicted degrons beyond those from motif-based methods which use a handful of E3 motifs to match possible degrons. Furthermore, we calculated E3 motifs using predicted degrons on the substrates in our collected E3-substrate interaction dataset and constructed a regulatory network of protein degradation by assigning predicted degrons to specific E3s with calculated motifs. Critically, we experimentally verified that a predicted SPOP binding degron on CBX6 prompts CBX6 degradation and mediates the interaction with SPOP. We also showed that the protein degradation regulatory system is important in tumorigenesis by surveying degron-related mutations in TCGA.

Conclusions: Degpred provides an efficient tool to proteome-wide prediction of degrons and binding E3s singly from protein sequences. Degpred successfully captures typical degron-related sequence properties and predicts degrons beyond those from previously used motif-based methods, thus greatly expanding the degron landscape, which should advance the understanding of protein degradation, and allow exploration of uncharacterized alterations of proteins in diseases. To make it easier for readers to access collected and predicted datasets, we integrated these data into the website http://degron.phasep.pro/ .

Keywords

Cancer driver mutation; Deep learning; Degron; E3 Ubiquitin ligase; Protein degradation.

Figures
Products