Abstract Diabetes is a complex illness requiring long-term therapy. Cyclocarya paliurus, a recently confirmed new food resource, shows significant hypoglycemic and hypolipidemic effects in type II diabetes. Triterpenoid saponins are considered as the effective medicinal components of C. paliurus and are useful for the treatment of diabetes mellitus. However, little is known regarding their specific mechanism of actions. In this study, we used active ingredient screening and target prediction techniques to determine the components of C. paliurus responsible for its anti-diabetic effects as well as their targets. In addition, we used bioinformatics technology and molecular docking analysis to determine the mechanisms underlying their anti-diabetic effects. A total of 39 triterpenes were identified through a literature search and 1 triterpene compound by experiments. In all, 33 potential target proteins associated with 36 pathways were predicted to be related to diabetes. Finally, 7 compounds, 15 target proteins, and 15 signaling pathways were found to play important roles in the therapeutic effects of C. paliurus against diabetes. These results provide a theoretical framework for the use of C. paliurus against diabetes. Moreover, molecular docking verification showed that more than 90% of the active ingredients had binding activity when tested against key target proteins, and a literature search showed that the active ingredients identified had anti-diabetic effects, indicating that the results were highly reliable. __________________________________________________________________ Active ingredient screening and target prediction techniques were used to determine the components of Cyclocarya paliurus responsible for its anti-diabetic effects as well as their targets. [32]graphic file with name d0ra06846b-ga.jpg 1. Introduction Diabetes is a disorder of glucose metabolism that leads to high blood sugar levels and to a series of complications which reduce the quality of life of patients, increasing their mortality.^[33]1 Based on its pathogenesis, it can be divided into type 1 diabetes (T1DM) and type 2 diabetes (T2DM), with T2DM accounting for more than 90% of cases.^[34]2 At present, diabetes is treated with drugs, such as sulfonylureas, DPP-4 inhibitors, oral antidiabetic agents, sodium-glucose co-transport inhibitors, and glucokinase activators,^[35]3–7 but many drugs have side effects.^[36]8 Therefore, it is urgent to find new, safe and effective early intervention medicinal compounds for diabetes.^[37]9 Traditional botanical and herbal medicines have been widely used for the treatment of diabetes,^[38]10,11e.g. Lobelia chinensis Lour.,^[39]7Hydrangea macrophylla var. thunbergii,^[40]12 and Cyclocarya paliurus (Batal.) Iljinskaja.^[41]13C. paliurus is a medicinal plant native to southern China. Its leaves have been traditionally used in the form of herbal tea for their beneficial effects against diabetes. Various bioactive compounds have been extracted from the leaves of C. paliurus, including flavonoids, triterpenoids, polysaccharides, and organic acids, which may contribute to its antihyperglycemic, antihyperlipidemic, and antihypertensive effects.^[42]13–24 Although the mechanisms underlying the anti-diabetic effects of the ethanol and aqueous extracts of C. paliurus leaves have been explored,^[43]18 the specific bioactive constituents of C. paliurus and their targets are still unknown. Due to their safety and efficacy, plant-derived triterpenoids have attracted attention for the treatment of diabetes.^[44]25–27 For example, new cucurbitane-type triterpenoids have shown potential for the prevention and management of diabetes by improving insulin sensitivity and glucose homeostasis.^[45]28 Triterpenoids, such as cyclocaric acid B and cyclocarioside H, extracted from C. paliurus leaves, promote glucose uptake in the absence of insulin, and ameliorate inflammation by inhibiting the insulin receptor substrate 1 (IRS-1)/phosphoinositide 3-kinase (PI3K)/protein kinase B (Akt) pathway.^[46]29 However, no C. paliurus triterpenoids with anti-diabetic effects or their specific targets have been described. Network pharmacology is a new systematic method to study target-drug interactions and their influence on disease. Combined with pharmacology and pharmacodynamics, network pharmacology has been successfully applied to explore the mechanisms of action of traditional botanical or herbal medicines at the molecular level,^[47]30 to dissect the relationship between medicines and diseases, and to identify signaling pathways. This approach makes it feasible to explore the therapeutic mechanisms of multi-component and multi-target traditional botanical or herbal medicines.^[48]13 Investigated the beneficial effects of C. paliurus against diabetic dyslipidemia and its mechanisms of action using lipidomics, serum pharmacochemistry, and network pharmacology approaches.^[49]13 Based on network pharmacology analysis, they identified the predicted targets of C. paliurus' s active ingredients (ALOX12, APP, BCL2, CYP2C9, PTPN1) and their linked lipidome targets [PLD2, PLA2G(s), and PI3K(s) families], which could be responsible for the effects of C. paliurus leaf extracts against diabetic dyslipidemia.^[50]13 In this study, we used network pharmacology to explore the effects of C. paliurus triterpenoids on diabetes and elucidate their mechanisms of action. The detailed procedures are shown in [51]Fig. 1. Fig. 1. The flowchart of the network pharmacological analysis approach. [52]Fig. 1 [53]Open in a new tab 2. Materials and methods 2.1. Plant species C. paliurus is commonly called “tian sha Chun” in China, meaning “sweat tree”, due to the flavor of its leaves. This tree can reach 30 m and grows in moist forests on mountains at altitudes between 400 and 2500 m. It is distributed in many regions of China with temperatures between 7.8 °C and 19.9 °C and with precipitation between 825.9 mm and 2394.5 mm,^[54]31,32 including Anhui, Fujian, Guangdong, Guangxi, Guizhou, Hainan, Hubei, Hunan, Jiangsu, Jiangxi, Sichuan, Taiwan, Yunnan, and Zhejiang Provinces. The leaves of this plant are used in traditional Chinese medicine (TCM) for treating disorders like hypertension, high cholesterol, and diabetes, as well as for improving the function of the immune system.^[55]15,33 The first health tea from China approved by the U. S. Food and Drug Administration (FDA) was made with the leaves of C. paliurus.^[56]31 2.2. Chemical ingredients database Leaves of C. paliurus obtained from Taizhou University in October 2018 were air-dried (total weight, 1.6 kg). Dried leaves of C. paliurus (1.6 kg) were pulverized and extracted three times with 90% ethanol (8.0 L) at room temperature. After filtration, the solvents were removed under vacuum to obtain a crude extract (1.1 kg), which was resuspended in H[2]O (1.0 L) and then partitioned successively with petroleum ether (1.0 L), ethyl acetate (EtOAc, 1.0 L) and butyl alcohol (1.0 L) three times. After removal of solvent, the entire EtOAc extract was fractionated by silica gel column chromatography using a gradient of petroleum ether (PE)/EtOAc (20 : 1 to 0 : 1, v/v) and then methanol to obtain a pure component. With the exception of the previous triterpenoids, other triterpenoids compounds from C. paliurus were identified by reviewing previously published studies (ESI Table 1[57]†) and the Traditional Chinese Medicine Systems Pharmacology database (TCMSP, [58]http://lsp.nwu.edu.cn/tcmsp.php).^[59]34 The specific structures were obtained from Pubchem databases ([60]https://pubchem.ncbi.nlm.nih.gov/). If the specific structures could not be found in the databases, the original research articles were reviewed. In addition, two natural products were isolated from the C. paliurus extract. Then, all triterpenes were subjected to network pharmacologic prediction. ChemDraw 14.0 software ([61]https://www.chemdraw.com.cn/xiazai.html) was used to draw the structures, and the files were saved in mol and mol2 formats. 2.3. Target protein database Proteins involved in diabetes were selected from the literature and from the following public databases: PubChem ([62]https://pubchem.ncbi.nlm.nih.gov/), which provides information about chemical substances and their biological activities as a public repository; Genecards database ([63]https://www.genecards.org/), which is a unique bioinformatics and chemical informatics resource; Potential Drug Target Database ([64]http://dddc.ac.cn/pdtd/index.php), which is a protein database for drug target recognition; and RCSB Database ([65]http://www.rcsb.org/pdb/home/home.do), which provides information about therapeutic proteins, nucleic acid targets, targeted diseases, pathways, and the corresponding drugs for each of these targets. Based on the determination of the protein–ligand complex and crystal structures, protein targets were obtained. The crystal structures of the target protein complexes with the original ligands were downloaded from the Protein Data Bank (PDB) database, which is a free database containing structural data of biological macromolecules. These proteins were used as receptors in molecular docking experiments. The structures of the compounds were imported into the Prediction Target of Swiss Target Prediction network database as a *.mol format^[66]35 ([67]http://www.swisstargetprediction.ch/), setting the species to “Homo sapiens” to predict active targets. This platform automatically conducts a docking simulation between the compound and the background target database, selecting 15 potential active targets with the highest affinity. Predicted targets showing high probability were selected and duplicates were deleted. Diabetes mellitus was searched in the GeneCards online database ([68]http://www.genecards.org/) to identify genes related to diabetes. Common targets between the database and C. paliurus were identified by using excel. The results were imported into Cytoscape 3.6.1 software and the component-target interaction network was constructed and analyzed. 2.4. Component-target network To elucidate the relationship between C. paliurus chemical constituents and diabetes targets, a “component-target-disease” network was analyzed by linking the selected chemical constituents, component-related targets, diabetes-related targets, and the corresponding targets of interactive proteins. The networks were visually analyzed and screened using Cytoscape 3.6.1 software. The screening criteria were as follows: potential targets for the treatment of diabetes with C. paliurus had a node value (degree) and BC (betweenness centrality) greater than the median values of all points, and the closeness centrality was greater than the median values of all points. The “degree” and “betweenness” were calculated using the analysis function of Cytoscape 3.6.1 to evaluate its topological features.^[69]36 2.5. Protein–protein interaction (PPI) network To explain interactions between target proteins of related C. paliurus compounds, these were uploaded to the STRING database ([70]https://string-db.org/). The reliability of protein interactions analyzed with the STRING database is divided into multiple levels, with interaction scores between 0.15 and 0.4 considered of low reliability, between 0.4 and 0.7 of medium reliability, and greater than 0.7 of high reliability.^[71]37 Targets showing interactions greater than medium reliability were selected to construct the interaction network of C. paliurus anti-diabetic targets. The node interaction relationship parameters were uploaded into Cytoscape 3.6.1 software to draw the interaction network, and the internet was used to obtain the final protein interaction diagram. 2.6. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) enrichment analysis of target proteins To elucidate the role played by target proteins interacting with the active ingredients of C. paliurus in terms of gene function and signaling pathways, the Database for Annotation, Visualization and Integrated Discovery (DAVID, [72]http://david.ncifcrf.gov/) 6.8 was used to analyze GO function and KEGG pathway enrichment. The biological process (BP), molecular function (MF) and cell components (CC) of the total targets of C. paliurus were analyzed by GO enrichment, and the column graphs were drawn according to the P value. The KEGG pathway was also analyzed using DAVID6.8, setting P ≤ 0.05 and introducing Gene ID from the total targets of C. paliurus. According to the enrichment degree and P value, the column graphs of the KEGG pathway were drawn. The main signaling pathways of diabetes mellitus were selected and imported into Cytoscape 3.6.1 software for the construction and visualization of the “component-target-pathway” interaction network. 2.7. Component-target-pathway network After obtaining all the interaction information of the active ingredient compounds, the target proteins, and the pathways, the “Component-Target-Pathway” network was constructed using Cystoscope 3.6.1 software. In this network, nodes represent components, targets, and pathways, and edges represent interactions between each other. A hypothetical schematic diagram of the target proteins involved in the pathways was drawn. Based on the network model map, a pathway involving active components and target proteins for this disease is initially explored to provide a preliminary theoretical basis for the design of targeted drugs. 2.8. Molecular docking Target proteins were modified using Pymol software, including hydrogenation, water content, optimization, and repair of amino acid residues. The target proteins' docking pockets were constructed to test the ligands. Interaction between active ingredients and targets was evaluated based on binding energy values by means of AutoDock. The smaller the docking binding energy, the better the molecule will bind to the target protein. When the binding energy is less than 0, the ligand and receptor can bind spontaneously. It is generally believed that energy values below 0 indicate there is a certain binding activity between the target and the ligand; energy values below −5.0 kJ mol^−1 indicate that the molecule has a good binding activity for the target, and values below −10.0 kJ mol^−1 indicate a strong binding activity.^[73]38 The 3D structures of the prepared compounds were used as ligands for molecular docking, and the receptors were the target proteins.^[74]39–44 A size of CASP3′ grid box was set as x, y, z center coordinate i.e. 41.391, 14.109, 77.896 and x, y, z size coordinate i.e. 15, 15, 15. A size of MAPK14′ grid box was set as x, y, z center coordinate i.e. 20.647, 13.217, 31.550 and x, y, z size coordinate i.e. 15, 15, 15. 3. Results 3.1. Screening of potential targets Based on an extensive survey of the scientific literature, a total of 39 triterpenoids were identified; this includes one compound isolated and identified in our study and 38 other reported compounds. These were used to create a library of compounds for the network analysis ([75]Table 1). The ADME data for the compounds is shown in ESI Table 2.[76]† We used SwissTargetPrediction to predict the targets of the active compounds, and a total of 585 targets proteins were identified. A total of 33 potential target proteins of C. paliurus anti-diabetes compounds were identified, as shown in [77]Table 2. The information of 39 triterpenoid compounds in Cyclocarya paliurus[78]^a. MOL ID Compound Degree Betweenness centrality Molecular formula CAS M1 2α,3α,23-Trihydroxyurs-12-en-28-oic acid 5 0.0017618 C[30]H[48]O[5] 103974-74-9 M2 3β-Hydroxy-urs-11-en-28,13-lactone 5 0.0017618 C[30]H[46]O[3] 35959-05-8 M3 2α-Hydroxyursolic acid 5 0.0017618 C[30]H[48]O[4] 72881-13-1 M4 2α,3α,23-Trihydroxyurs-12,20(30)-dien-28-oic acid 5 0.0017618 C[30]H[46]O[5] 143839-01-4 M5 2α,3β,23-Trihydroxy-12-ene-28-ursolic acid 5 0.0017618 C[30]H[48]O[5]464-92-6 M6 3β-O-trans-Caffeoyl-morolic acid 4 0.00594168 C[39]H[54]O[6] 97534-10-6 M7 Actinidic acid 4 0.00111799 C[30]H[46]O[5] 341971-45-7 M8 Arjunolic acid 4 0.00111799 C[30]H[48]O[5] 465-00-9 M9 Corosolic acid 5 0.00508509 C[30]H[48]O[4] 4547-24-4 M10 Cyclocaric acid B 5 0.0017618 C[30]H[46]O[5] 182315-46-4 M11 Daucosterol 5 0.00979396 C[35]H[60]O[6] 474-58-8 M12 Hederagenin 5 0.0017618 C[30]H[48]O[4] 465-99-6 M13 Maslinic acid 5 0.0017618 C[30]H[48]O[4] 4373-41-5 M14 Olean-12-en-28-oic acid 5 0.0017618 C[30]H[48]O[2] 17990-43-1 M15 Oleanolic acid 5 0.0017618 C[30]H[48]O[3] 508-02-1 M16 Taraxerol 5 0.00598173 C[30]H[50]O[0] 127-22-0 M17 β-Amyrin 5 0.00590412 C[30]H[50]O 559-70-6 M18 β-Amyrone 4 0.00397423 C[30]H[48]O 638-97-1 M19 β-Sitosterol 5 0.00270752 C[29]H[50]O 83-46-5 M20 Cyclocariosides I 6 0.01121781 C[35]H[56]O[8] 1644624-82-7 M21 Cyclocarioside N 2 0.00019562 C[44]H[74]O[13] 2093058-24-1 M22* 3β,23-Dihydroxy-1,12-dioxo-olean-28-oic acid 6 0.01175987 C[30]H[46]O[6] 2093058-20-7 M23* 3β,23,27-Trihydroxy-1-oxo-olean-12-ene-28-oic acid 6 0.01175987 C[30]H[46]O[6] 2093058-21-8 M24* 2α,3β,23-Trihydroxyurs-11-oxo-12-ene-28-oic acid 6 0.00987916 C[30]H[46]O[6] 107302-99-8 M25 Cyclocarioside II 3 0.00464055 C[35]H[36]O[8] 173294-76-3 M26* Arjunglucoside II 7 0.01868779 C[36]H[58]O[10] 62369-72-6 M27 Quadranoside IV 6 0.00588117 C[36]H[58]O[10] 267001-55-8 M28* α-Boswellic acid 7 0.0127971 C[30]H[48]O[3] 471-66-9 M29 3α,4β,18α-3-Hydroxyurs-12-en-23-oic acid 6 0.00860482 C[30]H[48]O[3]2243454-82-0 M30 β-Boswellic acid 6 0.00860482 C[30]H[48]O[3] 631-69-6 M31 Cyclocarioside H 2 0.0001863 C[43]H[72]O[13] 1403937-87-0 M32 Cyclocarioside J 2 0.0001863 C[35]H[58]O[9] 1644624-86-1 M33* 2α,3β,23-Trihydroxyoleana-11,13(18)-dien-28-oic acid 5 0.01046971 C[30]H[46]O[5] 6790-76-7 M34 Cyclocaric acid A 5 0.00786033 C[30]H[46]O[3] 21754-17-6 M35 (+)-Betulinic acid 4 0.00368899 C[30]H[48]O[3] 472-15-1 M36* Cyclocarioside K 5 0.01444275 C[36]H[58]O[8] 1644624-87-2 M37 Cyclocarioside III 2 0.00041134 C[36]H[60]O[9] 173294-77-4 M38 Cyclocarioside L 2 0.00041134 C[38]H[62]O[12] 2093058-22-9 M39 Cyclocarioside M 1 0 C[40]H[64]O[13] 2093058-23-0 [79]Open in a new tab ^a * means the core active ingredients. The information of the target proteins used in the manuscript. UniProt IDs Protein name Gene name Betweenness centrality Degree [80]P24666 Low molecular weight phosphotyrosine protein phosphatase ACP1 0.04820819 16 [81]P15121 Aldose reductase AKR1B1 0.00271300 2 [82]P10275 Androgen receptor AR 0.01510033 6 [83]P42574 Caspase-3 CASP3 0.00164939 2 [84]P21554 Cannabinoid receptor 1 (by homology) CNR1 0.01369327 2 [85]P11511 Cytochrome P450 19A1 CYP19A1 0.02352865 8 [86]P03372 Estrogen receptor alpha ESR1 0.01221787 5 [87]P00734 Thrombin F2 0.00971275 5 [88]P12104 Fatty acid binding protein intestinal FABP2 0.00305104 3 [89]P15090 Fatty acid binding protein adipocyte FABP4 0.00487387 4 [90]P09038 Basic fibroblast growth factor FGF2 0.02471924 3 [91]P04035 HMG-CoA reductase HMGCR 0.01564161 6 [92]P28845 11-Beta-hydroxysteroid dehydrogenase 1 HSD11B1 0.14538756 24 [93]P80365 11-Beta-hydroxysteroid dehydrogenase 2 HSD11B2 0.00419333 3 [94]O14920 Inhibitor of nuclear factor kappa B kinase beta subunit IKBKB 0.00271300 2 [95]P60568 Interleukin-2 IL2 0.01315293 6 [96]P35968 Vascular endothelial growth factor receptor 2 KDR 0.00265707 2 [97]Q16539 MAP kinase p38 alpha MAPK14 0.01369327 2 [98]P35228 Nitric oxide synthase, inducible NOS2 0.03106597 13 [99]Q96RI1 Bile acid receptor FXR NR1H4 0.00625265 4 [100]P04150 Glucocorticoid receptor NR3C1 0.00661281 4 [101]P42336 PI3-kinase p110-alpha subunit PIK3CA 0.00164939 2 [102]Q07869 Peroxisome proliferator-activated receptor alpha PPARA 0.00487387 4 [103]Q03181 Peroxisome proliferator-activated receptor delta PPARD 0.00487387 4 [104]P37231 Peroxisome proliferator-activated receptor gamma PPARG 0.00487387 4 [105]P35354 Cyclooxygenase-2 PTGS2 0.00176559 2 [106]P18031 Protein-tyrosine phosphatase 1B PTPN1 0.20514580 28 [107]P10586 Receptor-type tyrosine-protein phosphatase F (LAR) PTPRF 0.03748757 15 [108]O00767 Acyl-CoA desaturase SCD 0.00139742 2 [109]P04278 Testis-specific androgen-binding protein SHBG 0.01774612 7 [110]P40763 Signal transducer and activator of transcription 3 STAT3 0.04628878 9 [111]P11473 Vitamin D receptor VDR 0.07454926 10 [112]P15692 Vascular endothelial growth factor A VEGFA 0.03145277 4 [113]Open in a new tab 3.2. Analysis of interactions between active components and target proteins Cytoscape was used to visualize the component-target interactions. The component-target network consists of a total of 73 nodes and 213 edges; 33 of these nodes represent target proteins and 39 nodes active ingredients directly related to the anti-diabetic effects of C. paliurus ([114]Fig. 2). The network shows that the anti-diabetic effects of C. paliurus may be the result of multi-component and multi-target protein effects. The top 7 components based on degree values were arjunglucoside II (M26), α-boswellic acid (M28), 3β,23-trihydroxyurs-11-oxo-12-ene-28-oic acid (M24), 3β,23,27-trihydroxy-1-oxo-olean-12-ene-ene-28-oic acid (M24), dihydroxy-1,12-dioxo-olean-28-oic acid (M22), 23-3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid (M23), 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid (M33) and cyclocarioside K (M36). Of these, four compounds, M23, M24, M25 and M34, complied with the “Lipinski's Rule of Five”. These components are the key active ingredients underlying the antipyretic effects of C. paliurus and their structural formulas are shown in ESI Fig. 2.[115]† The component-targets network revealed that several target proteins could interact with multiple compounds. Network values for active components and target proteins are listed in [116]Tables 1 and [117]2. Fig. 2. Cyclocarya paliurus's component-target network. The red node represents the active ingredients, the green node represents the target proteins, and the size of the nodes represent the size of the degree value. [118]Fig. 2 [119]Open in a new tab 3.3. Analysis of target protein PPI network The interaction network for the anti-diabetic target proteins of C. paliurus, including 33 nodes and 153 edges, was constructed based on an interaction reliability greater than mid-range ([120]Fig. 3). In protein interaction networks, the position of node proteins is not the same. The most important nodes are key nodes. The network node degree parameter is a common method for evaluating key nodes of the network. The PPI network showed that the median value of betweenness centrality and closeness centrality was greater than the median with high degree for a total of 15 target proteins. These 15 target proteins ([121]Table 3) can be considered as core target proteins for the treatment of diabetes and may play an important role in the treatment of diabetes. Among them, PTGS2, VEGFA, CASP3, PPARG, STAT3, and NR3C1 are centrally located in the PPI network ([122]Fig. 3), indicating that these proteins are involved in the pathogenesis of diabetes mellitus. Fig. 3. Cyclocarya paliurus's potential target protein interaction network for treatment of diabetes. The size and color of the node represent the size of the degree value. The darker the color, the larger the node and the more important it is in the network. [123]Fig. 3 [124]Open in a new tab Features of Cyclocarya paliurus's protein–protein interaction network. Gene symbol Name Betweenness centrality Closeness centrality Degree PTGS2 Prostaglandin-endoperoxide synthase 2 0.15607784 0.76190476 22 VEGFA Vascular endothelial growth factor A 0.05924204 0.71111111 19 CASP3 Caspase 3 0.04400629 0.68085106 18 PPARG Peroxisome proliferator activated receptor gamma 0.14458119 0.68085106 17 STAT3 Signal transducer and activator of transcription 3 0.02315961 0.66666667 16 NR3C1 Nuclear receptor subfamily 3 group C member 1 0.07062907 0.62745098 15 MAPK14 Mitogen-activated protein kinase 14 0.01666107 0.64000000 14 ESR1 Estrogen receptor 1 0.04629238 0.62745098 14 KDR Kinase insert domain receptor 0.04561908 0.60377358 13 PIK3CA Phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha 0.01214958 0.60377358 13 CYP19A1 Cytochrome P450 family 19 subfamily a member 1 0.07381888 0.60377358 13 IL2 Interleukin 2 0.01306559 0.57142857 12 AR Androgen receptor 0.01695036 0.58181818 12 PTPN1 Protein tyrosine phosphatase non-receptor type 1 0.09420179 0.59259259 11 PPARA Peroxisome proliferator activated receptor alpha 0.03537093 0.54237288 9 [125]Open in a new tab 3.4. GO analysis of target proteins GO enrichment analysis results ([126]Fig. 4) showed that the numbers of target proteins involved in the BP, MF, and CC categories were 116 (70.7%), 37 (22.5%), and 11 (6.7%), respectively. In the BP category, the target proteins were mainly involved in positive regulation of transcription from RNA polymerase II promoter (13, 39.4%), signal transduction (9, 29.3%), and transcription, DNA-templated (9, 29.3%). In the MF category, the target proteins were mainly involved in protein binding (23, 69.7%), zinc ion binding (9, 27.3%) and transcription factor activity, sequence-specific DNA binding (9, 27.3%). In the CC category, target proteins were mainly associated with the nucleus (17, 51.5%), cytoplasm (14, 42.4%), nucleoplasm (12, 36.4%) and cytosol (12, 36.4%). These results demonstrate that C. paliurus acts on diabetes probably by engaging the above mentioned pathways. Fig. 4. GO and KEGG pathways analyses of the target proteins by using the DAVID database. Data show the top 10 remarkably enriched items in the biological process (A), cell component (B), molecular function (C), and KEGG pathways (D). [127]Fig. 4 [128]Open in a new tab 3.5. KEGG classification of target proteins KEGG pathway annotation showed that 27 of the 33 (81.8%) potential target proteins were enriched and involved in 41 pathways; of these, 35 pathways were significantly correlated with the target proteins (P ≤ 0.05). The following pathways included the largest number of proteins: cancer pathways (11, 40.7%), proteoglycans in cancer (8, 29.6%), insulin resistance (6, 22.2%), acute myeloid leukemia (6, 22.2%), PI3K-Akt signaling pathway (6, 22.2%), PPAR signaling pathway (6, 22.2%), and so on. The top 10 pathways with the largest number of proteins involved are shown in [129]Fig. 4, and the complete KEGG classification results are shown in [130]Table 4. The target proteins involved in cancer pathways included The result of KEGG Classification of target proteins[131]^a. Term Count Percent% P value Genes hsa05200:Pathways in cancer 11 33.33 0.00 AR, CASP3, PPARD, PTGS2, VEGFA, PPARG, PIK3CA, NOS2, IKBKB, FGF2, STAT3 hsa05205:Proteoglycans in cancer 8 24.24 0.00 CASP3, MAPK14, VEGFA, ESR1, PIK3CA, FGF2, STAT3, KDR hsa03320:PPAR signaling pathway 6 18.18 0.00 PPARA, PPARD, SCD, PPARG, FABP4, FABP2 hsa04931:Insulin resistance 6 18.18 0.00 PPARA, PTPRF, PIK3CA, PTPN1, IKBKB, STAT3 hsa04015:Rap1 signaling pathway 6 18.18 0.00 MAPK14, CNR1, VEGFA, PIK3CA, FGF2, KDR hsa04151:PI3K-Akt signaling pathway 6 18.18 0.02 VEGFA, PIK3CA, IKBKB, FGF2, KDR, IL2 hsa04370:VEGF signaling pathway 5 15.15 0.00 PTGS2, MAPK14, VEGFA, PIK3CA, KDR hsa05142:Chagas disease 5 15.15 0.00 MAPK14, PIK3CA, NOS2, IKBKB, IL2 hsa04668:TNF signaling pathway 5 15.15 0.00 CASP3, PTGS2, MAPK14, PIK3CA, IKBKB hsa05145:Toxoplasmosis 5 15.15 0.00 CASP3, MAPK14, NOS2, IKBKB, STAT3 hsa05160:Hepatitis C 5 15.15 0.00 PPARA, MAPK14, PIK3CA, IKBKB, STAT3 hsa04014:Ras signaling pathway 5 15.15 0.02 VEGFA, PIK3CA, IKBKB, FGF2, KDR hsa05206:MicroRNAs in cancer 5 15.15 0.03 CASP3, PTGS2, VEGFA, IKBKB, STAT3 hsa05221:Acute myeloid leukemia 4 12.12 0.00 PPARD, PIK3CA, IKBKB, STAT3 hsa05212:Pancreatic cancer 4 12.12 0.00 VEGFA, PIK3CA, IKBKB, STAT3 hsa04917:Prolactin signaling pathway 4 12.12 0.00 MAPK14, ESR1, PIK3CA, STAT3 hsa05222:Small cell lung cancer 4 12.12 0.01 PTGS2, PIK3CA, NOS2, IKBKB hsa04066:HIF-1 signaling pathway 4 12.12 0.01 VEGFA, PIK3CA, NOS2, STAT3 hsa04660:T cell receptor signaling pathway 4 12.12 0.01 MAPK14, PIK3CA, IKBKB, IL2 hsa05169:Epstein-Barr virus infection 4 12.12 0.02 MAPK14, PIK3CA, IKBKB, STAT3 hsa04152:AMPK signaling pathway 4 12.12 0.02 HMGCR, SCD, PPARG, PIK3CA hsa04380:Osteoclast differentiation 4 12.12 0.02 MAPK14, PPARG, PIK3CA, IKBKB hsa04068:FoxO signaling pathway 4 12.12 0.02 MAPK14, PIK3CA, IKBKB, STAT3 hsa04910:Insulin signaling pathway 4 12.12 0.02 PTPRF, PIK3CA, PTPN1, IKBKB hsa04550:Signaling pathways regulating pluripotency of stem cells 4 12.12 0.02 MAPK14, PIK3CA, FGF2, STAT3 hsa05161:Hepatitis B 4 12.12 0.02 CASP3, PIK3CA, IKBKB, STAT3 hsa04932:Non-alcoholic fatty liver disease (NAFLD) 4 12.12 0.03 PPARA, CASP3, PIK3CA, IKBKB hsa05152:Tuberculosis 4 12.12 0.04 VDR, CASP3, MAPK14, NOS2 hsa04923:Regulation of lipolysis in adipocytes 3 9.09 0.02 PTGS2, PIK3CA, FABP4 hsa00140:Steroid hormone biosynthesis 3 9.09 0.03 HSD11B1, HSD11B2, CYP19A1 hsa04210:Apoptosis 3 9.09 0.03 CASP3, PIK3CA, IKBKB hsa05120:Epithelial cell signaling in Helicobacter pylori infection 3 9.09 0.03 CASP3, MAPK14, IKBKB hsa04920:Adipocytokine signaling pathway 3 9.09 0.04 PPARA, IKBKB, STAT3 hsa05140:Leishmaniasis 3 9.09 0.04 PTGS2, MAPK14, NOS2 hsa05133:Pertussis 3 9.09 0.04 CASP3, MAPK14, NOS2 [132]Open in a new tab ^a Bold text means the important pathways reported and involved in diabetes. AR, CASP3, PPARD, PTGS2, VEGFA, PPARG, PIK3CA, NOS2, IKBKB, FGF2 and STAT3; the target proteins involved in insulin resistance included PPARA, PTPRF, PIK3CA, PTPN1, IKBKB and STAT3; and the target proteins involved in the PI3K-Akt signaling pathway included VEGFA, PIK3CA, IKBKB, FGF2, KDR and IL2. As can be seen, there are multiple target proteins in a pathway and the same target protein exists in multiple pathways. These results suggest that C. paliurus may exert its effects on diabetes by regulating these pathways via core target proteins. 3.6. Component-target-pathway network The component-target-pathway network was constructed to visualize all interactions between target proteins and the anti-diabetic-related pathways. Based on the previous KEGG enrichment analysis, a component-target-pathway network was generated by connecting compounds, targets and pathways ([133]Fig. 5). This network included 106 nodes (39 active compound nodes, 33 composite target protein nodes and 34 pathways nodes) and 745 edges. The component-target-pathway network results are shown in ESI Table 2 and ESI Fig. 1.[134]† Fig. 5. Cyclocarya paliurus's component-target-pathway network. The green diamond-shaped node represents the active ingredient, the blue triangular node represents the target, the yellow circular node represents the pathway, and the size of the node represent of the degree value. Lines represent the relationships between the compounds, targets, and pathways. [135]Fig. 5 [136]Open in a new tab The network analysis showed that the median value of betweenness centrality and closeness centrality was greater than the median with high degree for a total of 15 pathways, including cancer (hsa05200), insulin resistance (hsa04931), HIF-1 signaling pathway (hsa04066), PI3K-Akt signaling pathway (hsa04151) etc. These 15 pathways with several diabetes-associated target proteins can be considered as core pathways for the treatment of diabetes. Among these pathways, insulin resistance, HIF-1 signaling, and PI3K-Akt signaling pathways have been shown to have a clear link with the occurrence of diabetes. As shown in [137]Fig. 6, the active anti-inflammatory and anti-diabetic components present in C. paliurus can synergize with multiple target proteins within these pathways to form a multi-component-multi-target-multi-pathway mechanism. Fig. 6. Distribution of the target proteins of Cyclocarya paliurus on the predicted pathway. The orange boxes are potential target proteins of C. paliurus, while the blue boxes are relevant targets in the pathway. [138]Fig. 6 [139]Open in a new tab 3.7. Docking analysis The protein structure was set to a rigid macromolecule, and the algorithm to local search parameters. Then, the object of molecular docking visualization was selected according to the lowest binding energy for docking. The core compounds and diabetes target proteins were ranked according to the docking binding energy. The top 10 core target proteins in the previous ranking were docked with 7 key medicinal components ([140]Table 5). The docking results showed that the lowest binding energy for the 7 core medicinal components and CASP3 were all below −5.0 kJ mol^−1, indicating that the 7 key medicinal components bound better to the CASP3 target protein. With the exception of 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid, the other 6 components showed the lowest docking binding energies with MAPK14 target protein when compared with the other 9 target proteins. Interactions between the 7 core components of C. paliurus and the previously mentioned 10 core target proteins with the lowest binding energy and the smallest binding energy were visualized using Pymol ([141]Fig. 7 and [142]8). The lowest binding energy of the active ingredients of Cyclocarya paliurus to the ten core target proteins for treating diabetes (unit: kcal mol^−1)[143]^a. No. Compound name PTGS2 VEGFA CASP3 PPARG STAT3 NR3C1 MAPK14 ESR1 KDR PIK3CA 1 3β,23-Dihydroxy-1,12-dioxo-olean-28-oic acid −5.2 −4.5 −6.1 2.9 −5.8 −2.4 −6.7 −4.7 −5 −5.3 2 3β,23,27-Trihydroxy-1-oxo-olean-12-ene-28-oic acid −5.2 −4.5 −6.1 3 −5.8 −2.4 −6.7 −4.7 −4.9 −5.3 3 2α,3β,23-Trihydroxyurs-11-oxo-12-ene-28-oic acid −4.9 −4.6 −5.2 0 −5.2 −3.6 −8.7 −2.9 −5 −6.1 4 Arjunglucoside II −4.8 −4.6 −5.5 2.3 −5.4 8.2 −6.7 −4.4 −4.6 −5.1 5 α-Boswellic acid −5.7 −5.3 −6.1 −1.7 −6 −6.3 −9.1 −6.2 −5.2 −5.4 6 2α,3β,23-Trihydroxyoleana-11,13(18)-dien-28-oic acid −4.5 −4 −6 1.7 −5.3 16.5 −0.3 3.1 −4.7 −2.4 7 Cyclocarioside K −4.3 −4.9 −4.9 −2.9 −5.7 4.5 −8 −5.4 −4.1 −6.7 8 Ligand −3.1 −3.2 −3.4 −5.5 −6.1 −11.9 −12.7 −11.4 −3 −7.9 [144]Open in a new tab ^a Ligand means a substance that has the ability to recognize the receptor and can bind to it. The docking method is rigid docking. Algorithm is all local search parameters. Fig. 7. The 3D docking results of ten core protein molecules and seven core components are visualized. The structure of the compound is represented by a stick, the different branches of the protein are represented by different colors, and the yellow dotted line represents its hydrogen bond, which marks the position of the hydrogen bond and the compound in the compound. (A) Interaction between 3β,23-dihydroxy-1,12-dioxo-olean-28-oic acid and MAPK14; (B) interaction between 3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid and MAPK14; (C) interaction between 2α,3β,23-trihydroxyurs-11-oxo-12-ene-28-oic acid and MAPK14; (D) interaction between arjunglucoside II and MAPK14; (E) interaction between α-boswellic acid and MAPK14; (F) interaction between 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid and CASP3; (G) interaction between cyclocarioside K and MAPK14. [145]Fig. 7 [146]Open in a new tab Fig. 8. The 2D docking results of ten core protein molecules and seven core components are visualized. (A) Interaction between 3β,23-dihydroxy-1,12-dioxo-olean-28-oic acid and MAPK14; (B) interaction between 3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid and MAPK14; (C) interaction between 2α,3β,23-trihydroxyurs-11-oxo-12-ene-28-oic acid and MAPK14; (D) interaction between arjunglucoside II and MAPK14; (E) interaction between α-boswellic acid and MAPK14; (F): interaction between 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid and CASP3; (G) interaction between cyclocarioside K and MAPK14. [147]Fig. 8 [148]Open in a new tab 4. Discussion Used as a health tea, C. paliurus has a long history in the treatment of diabetes. In this study, 39 triterpenoids were selected to create a library of compounds, and network pharmacology technology was employed to explore the multiple pharmacological effects of triterpenoids from C. paliurus in the treatment of diabetes. We found that seven core ingredients found in the leaves of C. paliurus, including arjunglucoside II, α-boswellic acid, 3β,23-dihydroxy-1,12-dioxo-olean-28-oic acid, 23-trihydroxyurs-11-oxo-12-ene-28-oic acid, 3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid, 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid and cyclocarioside K, showed potential for the treatment of diabetes. Among them, α-boswellic acid has been shown to exert some anti-diabetic effects by suppressing the expression of proinflammatory cytokines.^[149]45 Cyclocarioside K is a new epoxydammarane triterpenoid saponin isolated from the ethanol leaf extracts of C. paliurus.^[150]46 Arjunglucoside II is a common glucoside derivative of arjunic acid, found in Terminalia arjuna.^[151]47 The DPP-IV inhibitory activity of arjunic acid and its derivatives may have demonstrable therapeutic benefits in diabetes patients with cardiovascular comorbidities.^[152]48 3β,23-Dihydroxy-1,12-dioxo-olean-28-oicacid and 3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid are new triterpenoid saponins isolated from the CH[3]Cl-soluble extract of C. paliurus leaves, whereas 23-trihydroxyurs-11-oxo-12-ene-28-oic acid and 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid are also known triterpenoid saponins isolated from the CH[3]Cl-soluble extract of the leaves of C. paliurus.^[153]49 These triterpenoid saponins may have an effect on diabetes by inhibiting apolipoprotein B48 overproduction.^[154]49,50 Based on the interaction network of C. paliurus's anti-diabetes target proteins, we found that STAT3, PTPN1, PTGS2, VEGFA, CASP3, etc. were the key target proteins. These targets may play an important role in C. paliurus's anti-diabetes effects. According to the results of the component-target and PPI networks, STAT3, CASP3, ACP1, PTPN1, HSD11B1 and other targets were regulated by multiple components, such as arjunglucoside II, cyclocarioside K, quadranoside IV, corosolic acid, cyclocariosides I and so on. Some of these ingredients have been proven to have strong anti-diabetic effects. For example, corosolic acid can reduce glucose levels in human hepatocellular carcinoma cells, as well as in zebrafish and in rats.^[155]51 Molecular docking is one of the most widely used methods to investigate binding of a compound to a target protein.^[156]39–42 We used AutoDock software to analyze molecular docking. Compounds with docking energies below −5 kJ mol^−1 were considered to bind well to their targets. In the PPI network diagram, betweenness is defined as the number of shortest paths between pairs of nodes that run through nodes, and degree centrality is the most direct measure of the importance of a network node. The larger the node degree value, the more important the node is in the network. We used two indicators (betweenness centrality and closeness centrality) as the criteria for screening targets. Only target proteins with betweenness centrality and closeness centrality values greater than the median for all target proteins were selected. Then, all target proteins were sorted according to the degree value. Based on the docking results between the top ten target proteins and core compounds, we observed that although the docking energy of some core compounds to the target proteins was very small and the docking results were good, there were also poor docking results. Only STAT3 showed docking energies for all compounds below −5.0 kJ mol^−1. STAT3 is an important signal transduction factor that participates in the signal transduction process of various cytokines, such as interferon, interleukins and growth factors, and it forms part of the JAK2/STAT3 signaling pathway.^[157]52,53 Many studies have shown that the JAK2/STAT3 signaling pathway is an important pathway that mediates the signal transduction process of various cytokines and growth factors, and can regulate cell growth, differentiation, migration, apoptosis, autophagy, immunity, and metabolism.^[158]54,55 It is considered to play an important role in the development of diabetes.^[159]56,57 STAT3 may regulate diabetes by acting on SOCS3 and thereby affecting the activity of insulin receptor substrate 1 (IRS-1).^[160]58 SOCS3 can mediate central leptin resistance in obese patients.^[161]59 Leptin signal transduction in the hypothalamus can regulate liver glucose and lipid metabolism^[162]60 as well as liver gluconeogenesis.^[163]61 It seems evident that STAT3 plays an important role regulating disorders of glucose metabolism. In network pharmacology analysis, component-target networks are mainly used to screen core compounds and targets. Betweenness centrality and closeness centrality values greater than the median are also the main criteria for screening core target proteins. All eligible target proteins were sorted according to the degree value, and the one with the highest value was PTPN1. In component-target networks, the higher the degree of a target protein, the more the compounds that regulate it. The more active compounds bind to a protein target, the greater the possibility of producing a drug effect, and the higher the possibility of becoming the key node. PTPN1 is a non-transmembrane protein tyrosine phosphatase which functions mainly by regulating the levels of protein tyrosine phosphorylation in cells, and is involved in the regulation of multiple cell signaling pathways modulating cell proliferation, growth and migration.^[164]62 Many studies have found correlations between DNA sequence variations in the PTPN1 gene, SNP and T2DM.^[165]63 PTPN1 can regulate insulin signaling by modulating the expression of the PTP1B enzyme. PTP1B is a key factor that regulates various metabolic processes in the body and is closely associated with the occurrence and development of diseases, especially type 2 diabetes.^[166]64 Studies have shown that inhibiting the expression of the PTP1B enzyme can regulate insulin sensitivity in diabetic mice and improve insulin resistance, producing a therapeutic effect in diabetes.^[167]65 Therefore, we can study the potential pharmacological effects of C. paliurus on diabetes by analyzing the relationship between active compounds and core targets. According to the results of the component-target-protein network analysis, the insulin resistance pathway, the PI3K-Akt signaling pathway and the HIF-1 pathway may play an important role in the anti-diabetes effects of C. paliurus. As shown in [168]Fig. 5, the active anti-diabetes ingredients of C. paliurus can synergize with various target proteins in these pathways, resulting in a multi-component, multi-target and multi-pathway mechanism of action. Insulin resistance, a condition in which cells are resistant to the action of insulin, is often found in obese and diabetic patients and is closely related with the occurrence of diabetes.^[169]66 It is associated with increased activity of phosphatases, including PTPs, PTEN, and PP2A, and decreased activation of signaling molecules, such as PI3K/AKT, resulting in reduced GLUT4 translocation, glucose uptake and glycogen synthesis in skeletal muscle, as well as increased hepatic gluconeogenesis and decreased glycogen synthesis in the liver.^[170]67,68 Many studies have shown that this pathway is activated by changes in the levels of enzymes such as IL-6, Akt and IRS-1, or other abnormalities.^[171]69,70 HIF-1, a transcription factor which functions as a major regulator of oxygen homeostasis,^[172]71 is associated with diabetic nephropathy. HIF-1 expression indirectly affects renal oxygen metabolism.^[173]72,73 In hypoxic conditions, it can reduce damage to the body by regulating the expression of target genes coding for downstream factors related with the hypoxic response, and by modulating cell energy metabolism, glucose metabolism and apoptosis.^[174]74 Studies have shown that oxidative stress and microcirculatory disorders are important mechanisms underlying the development of diabetic nephropathy and other diseases.^[175]75 Antioxidant therapy can increase the expression of medullary HIF in diabetic kidneys and improve renal oxidative damage.^[176]76 HIF activation can attenuate changes in renal oxygen metabolism and mitochondrial function caused by diabetes, thereby reducing proteinuria and improving renal tubular interstitial damage.^[177]77 The PI3K-Akt signaling pathway plays an important role maintaining insulin stability.^[178]75 This signaling pathway is activated by many types of cellular stimuli or toxic insults and regulates fundamental cellular functions, such as transcription, translation, proliferation, growth, and survival.^[179]76 At present, there are studies showing that selective activation of the PI3K/AKT signaling pathway can modulate neuroprotection, angiogenesis, and islet β cell survival in diabetic rats.^[180]75 Many studies indicate that after insulin binds to its receptor, it will self-phosphorylate and phosphorylate IRS-2 tyrosine sites.^[181]77 Phosphorylated IRS-2 binds to the p85 subunit of PI3K to further activate PI3K and to activate Akt after phosphorylation.^[182]78 Akt2 is a subtype of Akt, a serine/threonine kinase and an important signaling molecule located downstream of PI3K^[183]79,80 which can be regulated by modulating Gsk-3β, GLUT-4, etc. A series of downstream molecules promote glycogen synthesis, glucose transport, and other pathways to regulate glucose metabolism, thereby regulating diabetes.^[184]70 Receptor theory is the basic theory of pharmacodynamics. It postulates that the combination of drugs and target proteins is the basis of pharmacodynamics. Molecular docking is a method to evaluate binding of active drug ingredients to target proteins. In network pharmacology research, molecular docking is commonly used to study interactions between small molecules and key target proteins of the network, and to validate key target proteins identified in the network based on the interaction between active ingredients and target proteins.^[185]81 As shown in [186]Fig. 7 and [187]8, molecular docking analysis showed that hydrogen bonds can form spontaneously between the 7 core compounds and the target proteins. These studies showed that the number of hydrogen bonds between the 4 core compounds and the target proteins surpassed 4, indicating that the core components bound well to the key target proteins of the network. Although the molecular docking results support the key node network findings, more in vivo or in vitro experiments are needed to verify the effects of ingredients on target proteins. 5. Conclusions In summary, in this study we applied the network pharmacological approach to investigate the main target proteins of C. paliurus compounds with anti-diabetes activity by constructing a target interaction network, and used molecular docking methods to validate the key findings. Our results indicate that seven compounds, including arjunglucoside II, α-boswellic acid, 3β,23-dihydroxy-1,12-dioxo-olean-28-oic acid, 23-trihydroxyurs-11-oxo-12-ene-28-oic acid, 3β,23,27-trihydroxy-1-oxo-olean-12-ene-28-oic acid, 2α,3β,23-trihydroxyoleana-11,13(18)-dien-28-oic acid and cyclocarioside K show potential for the treatment of diabetes. These triterpene compounds were predicted to interact with PTGS2, VEGFA, CASP3, and other enzymes. These results provide valuable insights into the synergistic mechanism of action of natural medicines. In addition, biological process and pathway enrichment analysis of the target proteins of the active ingredients of C. paliurus enhanced our understanding of its mechanism of action in diabetes. Our study provides scientific evidence supporting the use of C. paliurus for the treatment of diabetes and confirms that modern technologies can be used to explore the therapeutic value of natural medicines. Data availabIlity statement All datasets generated for this study are included in the article/ESI.[188]† Author contributions statement Z. L., Y. T., N. L., Z. Z., and J. L. conceived the experiment. Z. L. collected data, conducted the analytical part, wrote the first version of the manuscript. Y. T., N. L., Z. Z., and J. L. revised the manuscript. J. L. finalized the manuscript. All authors read and approved the final manuscript. Conflicts of interest The authors declare that they have no conflict of interest. Supplementary Material RA-010-D0RA06846B-s001 [189]RA-010-D0RA06846B-s001.pdf^ (345.9KB, pdf) RA-010-D0RA06846B-s002 [190]RA-010-D0RA06846B-s002.xls^ (27.5KB, xls) Acknowledgments