1. Academic Validation
  2. Natural compounds for Alzheimer's prevention and treatment: Integrating SELFormer-based computational screening with experimental validation

Natural compounds for Alzheimer's prevention and treatment: Integrating SELFormer-based computational screening with experimental validation

  • Comput Biol Med. 2025 Feb:185:109523. doi: 10.1016/j.compbiomed.2024.109523.
Junyu Zhou 1 Yong Kwan Kim 2 Chen Li 3 Sunmin Park 4
Affiliations

Affiliations

  • 1 Institute of Advanced Clinical Medicine, Peking University, Beijing, 100191, China; Department of Bioconvergence, Hoseo University, Asan, South Korea.
  • 2 Department of Information and Communication Engineering, Hoseo University, Asan, South Korea.
  • 3 Institute of Advanced Clinical Medicine, Peking University, Beijing, 100191, China.
  • 4 Department of Bioconvergence, Hoseo University, Asan, South Korea; Dept. of Food and Nutrition, Obesity/Diabetes Research Center, Hoseo University, Asan, South Korea. Electronic address: smpark@hoseo.edu.
Abstract

Background: This study aimed to develop and apply a novel computational pipeline combining SELFormer, a transformer architecture-based chemical language model, with advanced deep learning techniques to predict natural compounds (NCs) with potential in Alzheimer's disease (AD) treatment. The NCs were identified based on activity related to seven AD-specific genes, including acetylcholinesterase (AChE), amyloid precursor protein (APP), Beta-secretase 1 (BACE1), and presenilin-1 (PSEN1).

Methods: We implemented a computational pipeline using SELFormer and deep learning techniques, conducted optimal clustering and quantitative structure-activity relationship (QSAR) analyses, and performed a uniform manifold approximation and projection (UMAP) to categorize compounds based on bioactivity levels. Molecular docking analysis was carried out on selected compounds. To validate the computational predictions, we conducted in vitro studies using nerve growth factor (NGF)-differentiated PC12 cells. Finally, we mapped the relationships between food sources containing the identified compounds and their target proteins.

Results: Optimal clustering analysis revealed five distinct groups of NCs, while QSAR analysis highlighted variations in molecular properties across clusters. The UMAP projection identified 17 highly active NCs (pIC50>7). Molecular docking analysis showed that cowanin, β-caryophyllene, and L-citronellol demonstrated decreased binding energy across target proteins. In vitro studies confirmed significant biological activities of these compounds, including increased cell viability, decreased AChE activity, reduced lipid peroxidation and tumor necrosis factor (TNF)-α mRNA expression, and increased brain-derived neurotrophic factor (BDNF) mRNA expression compared to the control. The study also identified natural sources of these compounds, such as anatidae, mangosteen, and celery, providing insights into potential dietary interventions.

Conclusion: This integrated computational and experimental approach offers a promising framework for identifying potential NCs for AD treatment. The results contribute to exploring effective therapeutic strategies against AD.

Keywords

Alzheimer's disease; Computational pipeline; Natural compounds; Selformer; UMAP.

Figures
Products