Abstract The content of terpenoids in tobacco can alter its resistance to TMV. NtSPS1, a pivotal structural gene in tobacco, is capable to regulate the terpenoid content. In this study, we investigated the effect of NtSPS1 knockout in HD on the content of terpenoids and the anti-TMV activity of this mutant using gene editing, widely targeted metabolomics, network pharmacology, and molecular docking. 48 terpenoids (six up-regulated and five down-regulated) in NtSPS1 knockout tobacco compared with WT leaves. Notably, solanesol was remarkable downregulation which was lowered by fourfold and compounds 1 (log[2]FC = 18.2), 8 (log[2]FC = 16.7) were significant upregulation between the mutants and wild-type line leaves. The 46 terpenoid’s target network encompassed 150 nodes, 509 edges and their underlying mechanisms in the therapeutic management of TMV are discussed. Furthermore, the network pharmacology and molecular docking revealed that compounds 16, 18, 23, 27, and 36 exhibited significant affinity in their respective interactions. Ultimately, five compounds were assayed for their anti-TMV effects, noteworthily, compounds 36 showed potential anti-TMV activity. Above all, we adopted a multifaceted approach to gain a comprehensive understanding of the terpenoid content and anti-TMV properties in NtSPS1 knockout HD. It enlightens the therapeutic potential of NtSPS1 knockout tobacco and it is helpful to find undescribed anti-TMV activity inhibitors, as well as searching for new anti-TMV candidates from the mutants. Supplementary Information The online version contains supplementary material available at 10.1038/s41598-025-98745-y. Keywords: Nicotiana tabacum, CRISPR/Cas9, NtSPS1, Terpenoid, Network pharmacology, anti-TMV activities Subject terms: Biochemistry, Lipids Introduction Nicotiana tabacum (N. tabacum) is well known for its medicinal value purposes^[42]1–[43]3. Previous phytochemical findings on N. tabacum showed terpenoids play a crucial role in multiple plant functions, for instance response to against biotic and abiotic stresses^[44]4–[45]6. Tobacco mosaic virus (TMV) has a wide host range and naturally causes extensive damage in tobacco (N. tabacum)^[46]7–[47]10, tomato (S. lycopersicum)^[48]11, and pepper (C. annuum)^[49]12,[50]13. Plant metabolites, particularly terpenoids, have demonstrated the ability to counter TMV infection^[51]14–[52]17. Such as solanesol, a non-cyclic trisesquiterpenoid alcohol, is mainly found in solanaceous plants: tobacco^[53]18–[54]20, potato^[55]21,[56]22, and tomato^[57]23, which might participate in the immune response to TMV infection^[58]24,[59]25. NtSPS1 functions as a key enzyme in the biosynthetic pathway of terpenoids, especially solanesol^[60]26–[61]28. After NtSPS1 overexpression in N. tabacum the total content of solanesol increase significantly^[62]29,[63]30. The specific accumulation patterns and regulatory processes of terpenoids in NtSPS1-deficient Hong Hua Da Jin Yuan tobacco (Nicotiana tabacum L., HD) are still unclear. Additionally, there have been no prior studies on the terpenoid levels and anti-TMV effectiveness in this variety following NtSPS1 gene knockout. The present study aimed to systematically compare the terpenoid content and anti-TMV activities of NtSPS1 knockout HD and WT lines by multiple methods. NtSPS1 knockout HD resulted in 48 terpenoids: six up-regulated, five down-regulated, especially remarkable downregulation of solanesol which was lowered by fourfold and significantly upregulation of compound 1 and 8 in mutants compared with the wild-type lines. The 48 terpenoids were divided into five classes, involving eight monoterpenes (two up-regulated metabolites), 23 sesquiterpenes (four up-regulated, one down-regulated metabolites), ten diterpenoids, six triterpenoids (three down-regulated metabolites) and one trisesquiterpenoid (down-regulated metabolites) between the mutants and wild-type lines. Network pharmacology provides a comprehensive framework for examining the complex relationships between terpenoids and TMV, thus pioneering a new approach to decipher the therapeutic principles^[64]31. Guided by network pharmacology and molecular docking, we selected compounds 16, 18, 23, 27, and 36 from our laboratory for testing their anti-TMV activities. The results showed that compound 36 exhibited a protective response against HD, with inhibiting rates of 36.7% (IC[50] = 32.1 µM). Materials and methods Plant material The methods employed for construction of NtSPS1 knockout Nicotiana tabacum L. variety Hong Hua Da Jin Yuan (HD) and growth conditions were the same as those described by Zhang et al. (Supplementary Method S1). Chemical profiles LC-MS/MS methods are employed to determine the total terpenoid content (Supplementary Method S2). Extraction of solanesol from mature leaves of NtSPS1 knockout and WT HD was performed according to the method described by Zhao et al. (Supplementary Method S2.1.1, S2.2)^[65]32. The LC-MS/MS-based widely targeted metabolomics analysis was used to analyze the terpenoid content in NtSPS1 knockout HD, as reported by Li et al. (Supplementary Method S2.1.2, S2.3)^[66]33. The T[1] plants (T[1]mutant HD), which were free of transgenes and carried a homozygous mutation resulting in premature stop codons, were allowed to self-fertilize for further inheritance in the T[2] generation (T[2] mutant HD). To determine the effect of NtSPS1 knockout on HD, T[2] ntsps1 mutants (T[2] mutant HD) and wild-type (WT) HD were cultivated in a glasshouse. The terpenoid content in the leaves of each HD at maturity stage were determined by analyzing their MS fragmentations and/or comparing them with reference samples (Supplementary Tables S1, S2). The PCA scatter plots were generated for all the samples to identify potential biomarkers after data normalization using Principal Component Analysis (PCA). This analysis provides an overview of the chemical relationship between leaves of NtSPS1 knockout and WT HD. Statistical analysis The orthogonal partial least squares discriminant analysis (OPLS-DA) and principal component analysis (PCA) were carried out on all the samples to identify the putative biomarkers after data normalization. Unsupervised PCA (principal component analysis) was performed by statistics function prcomp within R ([67]www.r-project.org). The data was unit variance scaled before unsupervised PCA. VIP values were extracted from OPLS-DA result, which also contain score plots and permutation plots, was generated using R package MetaboAnalystR. The data was log transform and mean centering before OPLS-DA. In order to avoid overfitting, a permutation test (200 permutations) was performed. The HCA (hierarchical cluster analysis) results of samples and metabolites were presented as heatmaps with dendrograms, while pearson correlation coefficients (PCC) between samples were caculated by the cor function in R and presented as only heatmaps. Both HCA and PCC were carried out by R package ComplexHeatmap. For HCA, normalized signal intensities of metabolites (unit variance scaling) are visualized as a color spectrum. For two-group analysis, differential metabolites were determined by VIP (VIP > 1), P-value (P-value < 0.05, Student’s t test) and absolute Log[2]FC (|Log[2]FC| ≥ 0.58). Network pharmacological analysis Screening natural product and targets The database of Swiss Target Prediction ([68]http://www.swisstargetprediction.ch/) (accessed on 1 October 2024) was utilized to search for the compounds and their respective targets of N. tabacum. Subsequently, the molecules and their corresponding target proteins of N. tabacum were determined. By accessing the Uniprot database ([69]https://www.uniprot.org) (accessed on 1 October 2024), the gene names linked to these protein targets were efficiently obtained. Construction of compound-target network Cytoscape is a powerful implement for visualizing network pharmacology and analyzing associated results^[70]34,[71]35. By employing Cytoscape 3.10.0 ([72]https://github.com/cytoscape/cytoscape/releases/3.10.0/), we successfully integrated the terpenoids of N. tabacum with their corresponding targets, thereby robustly constructing a compound-target network for this medicinal plant. The integrated target network is constructed with nodes representing either compounds or targets, and the interactions between them are illustrated by lines. Identification of predicted targets of tobacco mosaic virus The disease targets of tobacco mosaic virus were retrieved in the Genecards databases ([73]https://www.genecards.org/) (accessed on 1 October 2024)^[74]36, and OMIM databases ([75]https://www.omim.org/)(accessed on 1 October 2024)^[76]37. The procured targets from the two databases were merged, removing any duplicate entries in the process for further investigation. Construction of PPI network We imported the target data of those molecules as well as for TMV, into the Venn Diagram ([77]http://bioinformatics.psb.ugent.be/webtools/Venn/) (accessed on 1 October 2024). This program was utilized to pinpoint the relationship between the compound targets and the TMV targets. The overlapping parts, which are involved in both the disease pathology of TMV and the molecular pharmacodynamics, can be tentatively identified as the effective therapeutic targets where the medicinal impact of drugs is genuinely manifested. We constructed an integrated network of molecules and disease targets to identify the interactions between the compounds and their associated therapeutic targets. Subsequently, he integrated network was uploaded to the STRING database ([78]https://string-db.org/) (accessed on 1 October 2024) to obtain information on the protein-protein interaction network^[79]38. The application of this method is relatively rare in TMV, hence no core targets were identified. So, we utilized the previously identified targets for the subsequent work^[80]39–[81]41. Molecular docking We accessed ChemDraw to obtain the molecule structures, the 3D structural files were converted by Chem3D version 18.0. The RSK structure, identified by the PDB code 2OM3, was chosen and prepared for further analysis. Hydrogen was removed, and polar hydrogen atoms were introduced using AutoDock version 4.2.6 ([82]https://autodock.scripps.edu/download-autodock4/) tools. Active sites were constructed, and each one was subsequently docked with the molecule. Active pockets were built, and each active site was docked with the molecule^[83]42. Anti-tobacco mosaic virus activity assays Using half-leaf method detecte the anti-TMV activities and using ningnanmycin as a positive control (Supplementary Method S2.4). The antiviral inhibition rates of molecules 16, 18, 23, 27, and 36 at 20.0 µM are shown in Table [84]1. Table 1. Anti-TMV activities of terpenoids^a. No. % inhibition at 20 µM IC[50] (µM) 16 20.5 ± 2.6 53.9 18 25.2 ± 3.0 65.6 23 25.8 ± 2.8 65.8 27 26.8 ± 3.0 48.9 36 36.7 ± 3.2 32.1 Ningnanmycin 33.2 ± 2.2 36.1 [85]Open in a new tab ^aAll results are expressed as mean ± SD; n = 3. Results NtSPS1 mutants obtained by using CRISPR/Cas9 gene editing The HD plants with NtSPS1 gene knockout were successfully generated using CRISPR/Cas9 technology. An optimized gene editing vector for tobacco, developed by our group, was used to drive the expression of the NtSPS1 sgRNA and the 2 × 35 S promoter was used to drive the expression of Streptococcus pyogenes Cas9 (SpCas9)^[86]43. NPTII, neomycin phosphotransferase gene; LB, left border; RB, right border. (Fig. [87]1A–E). The sequencing data revealed that two distinct lines exhibited two different types of mutation sites, which deletion of 2 bases (Fig. [88]1F). Fig. 1. [89]Fig. 1 [90]Open in a new tab Authentic NtSPS1 of tobacco mutants and wild-type HD. (A) Schematic diagram of pOREU3TR vectors, (B–E) The phenotype of T[0] mutant plants (T[0] mutant HD) and wild type HD in different periods. WT, wild-type HD. ntsps1, NtSPS1 editing HD. ntsps1-Ros1, a transgenic plant carrying a red marker. (F) Mutations for the 56 green tobacco in the T[1] generation (T[1] mutant HD). The protospacer adjacent motif (PAM) site “AGG” needed for Cas9 fragmentation is marked in blue. Hom homozygous. #, related plant. No. number. No off-target effects were observed for the three genes in the ntlht1 mutant strains, including Ntab0818090 and the NtLHT1 gene (Fig. [91]2A–C). Based on the results, NtSPS1 mutants were successfully obtained, and a dark-red marker was introduced to facilitate the generation of transgene-free plants in the T[1] generation and obtained enough homozygous T[2] population. The phenotypes of the NtSPS1 mutants in the T[2] generation were same as those of WT HD (Fig. S2). Fig. 2. [92]Fig. 2 [93]Open in a new tab Exploration of the off-target sites. (A) Blastn result for the NtSPS1 sgRNA in China Tobacco Genome Database. (B) DNA sequences cognition of the 3 off-target genes among the sgRNA site. (C) There were no off-target items detected for the 4 genes in ntsps1 mutants. Ntab0214390, NtSPS1 gene. Chemical profiles Comparison of the metabolic profiles between HDK and HDS group based on widely targeted metabolomics The overall metabolic profiles of T[2] mutant plants (HDK) and WT plants (HDS) at the mature stage show a clear separation tendency, indicating that the terpenoids in the tobacco leaves changed after the knockout of the NtSPS1 gene (Fig. [94]3A). And the largest principal component PC1 contributes 43.67%, and PC2 and PC3 make the contributions of 12.10% and 10.70%, respectively. The cumulative contribution rate of TOP3 principal components is greater than 60%, which indicated the overall explanation ability of the variance of original variables was acceptable. Fig. 3. [95]Fig. 3 [96]Open in a new tab The largest principal component, pathway enrichment, heat map analysis of HDS and HDK based on untargeted metabolomics. (A) The largest principal component plot of HDS (in green) and HDK (in red). (B) Pathway enrichment bubble plot related to HDS. The bubbles’ size and darkness pf the color represented the count and level of the enrichment for terpenoids. The mutation types (HDK) and wild types (HDS) at the mature stage. Functional pathway analysis related to NtSPS1 knockout The pathway enrichment analysis of 11 metabolic pathways including biosynthesis of flavonoids, alkaloids and terpenoids metabolism, etc. based on metabolic were performed (Fig. [97]3B). NtSPS1 knockout leads to a significant decrease in the content of downstream terpenoids such as solanesol. Due to limitations like incomplete database coverage and insufficient terpenoid concentrations leading to a low detection rate, the terpenoids enrichment factor is lower than that of flavonoid and alkaloid. Nevertheless, the enriched result the terpenoid pathway ranked TOP3 as its p-value was 0.053 (almost significant) and the FDR is lower than 0.2, demonstrating the regulation of terpenoid pathway after the NtSPS1 knockout. Moreover, since there are currently few reports on the study of terpenoids and related pathways, further study was carried out. Identification and confirmation of terpenoids based on LC-MS/MS analysis HDK and HDS were measured by LC-MS/MS based on widely targeted metabolomics analysis in positive ion mode. The results demonstrated that the mutants showed obvious regulation of terpenoid content. The assignment of the peaks for 47 terpenoids were shown in Fig. [98]4 and Table [99]S1. These molecules including 8 monoterpenes (1, 5, 6, 7, 11, 13, 22, 29), 23 sesquiterpenes (2, 3, 4, 8, 9, 10, 12, 14, 15, 16, 17, 18, 20, 23, 25, 26, 27, 30, 31, 32, 33, 34, 36), 10 diterpenoids (19, 21, 24, 28, 35, 38, 41, 42, 43, 44), 6 triterpenoids (37, 39, 40, 45, 46, 47) were characterized based on MS^n fragmentation and reference materials (Fig. [100]4A and Tables S1, S2, Fig. S2). Fig. 4. [101]Fig. 4 [102]Open in a new tab Untargeted metabolomics analysis. (A) LC-MS/MS-base multiple reaction monitoring mode (MRM) of one NtSPS1 knockout HD (HDK) in positive ion mode. (B) PCA loading plot of the 47 terpenoids. HDS, the wild types. Numbers are the IDs of 47 terpenoids. (C) Heatmap analysis was conducted on 10 terpenoids which change significantly between HDS and HDK, the content of terpenoids were normalized and visualized where the red and green color represented up- and down-regulation. (D) The variation in terpenoid content (Log[2]FC) between HDS and HDK. For each test, three specialized replicates were managed. At least 6 tobacco were evaluated in each sample. Data interpreted are mean ± SE of three specialty tests. The principal component analysis (PCA) loading plot based on all terpenoids was shown in Fig. [103]4B. NtSPS1 knockout leads to a significant decrease in the content of downstream terpenoids such as solanesol. We have examined solanesol content using the method reported by Zhang et al. (see Supplementary Method S2.1.1), and we found that solanesol content has decreased by fourfold. There was a clear distinction between the HDK and HDS groups, with the replicate samples of each cluster closely grouped together. This suggests that the variations in terpenoids were consistent across the samples within each group and strongly associated with the NtSPS1 knockout of the HDK group. Metabolic changes in HDS and HDK The changes of 48 terpenoids before and after NtSPS1 knockout in HD are presented. Among them, 10 terpenoids were differential between HDS and HDK (|Fold change| ≥ 2 and VIP ≥ 1), 6 terpenoids were upregulated and 5 were downregulated, 37 did not show significant changes (Fig. [104]4C, and D, Supplementary Method S2.2). Network pharmacological analysis Compound-target network and analysis We have created a detailed network of component-target interactions for these molecules, considering their unique structural features and specific receptors as identified by Swiss target prediction. As shown in Fig. [105]5A, a total of 150 nodes and 509 edges were assigned, with green ellipses indicating potential targets for anti-TMV. We obtained 36 compounds and their respective targets for N. tabacum from the Swiss target prediction database. Fig. 5. [106]Fig. 5 [107]Open in a new tab Network pharmacological analysis of terpenoids in TMV. (A) Compound-target network (36 yellow diamonds represent the terpenoids of N. tabacum, the circles around the diamonds represent drug targets). (B) Venn diagram of the target of compounds and the target of TMV. PPI network construction and analysis After screening the active targets of TMV through the disease database and eliminating duplicate values, we identified 390 disease targets. Subsequently, we employed this data to create a Venn chart (Fig. [108]5B), comparing the active target compounds associated with the disease to those found in the compound formulations. The overlap between these sets revealed 115 common targets between the molecules and TMV. To clarify the complex interactions between pharmaceutical constituents and TMV targets, we established an extensive network linking compounds to disease targets. To depict these shared targets, we constructed a PPI network featuring 7 nodes, yet without any connecting edges. So, we utilized the previously identified targets for the subsequent work. Molecular docking Utilizing network pharmacology, we have previously pinpointed five essential active ingredients: compounds 16, 18, 23, 27 and 36. Subsequently, we performed molecular docking analysis of these compounds with RSK protein (Fig. [109]6). This analysis sought to confirm the critical function of the compounds in treating TMV. Following that, we conducted a detailed examination of the docking outcomes for proteins and ligands individually. Subsequently, we scrutinized the docking results of proteins and ligands separately, molecular docking revealed that 16, 18, 23, 27, and 36 were well located within the bonding site of anti-TMV by contacting ARG92, ILE93, SER15, VAL11, SER14, PHE10, GLY85, ASP116, and ALA86 residues. Our results show that the compounds have a strong affinity for binding to the RSK protein. Additionally, the interactions between each ligand-macromolecule complex are facilitated by hydrogen bonds, ensuring the stability of these interactions. Fig. 6. [110]Fig. 6 [111]Open in a new tab Interactions between TMV-helicase (sticks are used to represent key residues while solid lines indicate H-bonds) and 16 (A), 18 (B), 23 (C), 27 (D), and 36 (E). Anti-tobacco mosaic virus activity assays Research indicates that solanesol could be involved in the resistance anti-TMV, but upon TMV infection, mutants (solanesol content has decreased by fourfold) and wild-type plants exhibit comparable characteristics (Fig. [112]8A and B). Other substances might be responsible for anti-TMV effects; to verify this hypothesis, guided by network pharmacology and molecular docking, the anti-TMV properties of compounds 16, 18, 23, 27, and 36 were evaluated (Fig. [113]7). The HD was pre-treated with 20.0 µM solutions of the compounds or a DMSO solution for 6 h prior to inoculation with TMV. The consequences clearly show that 36 offered protection against TMV, with an IC[50] value of 32.1 µM (Table [114]1). For the practical application of our lab findings to tobacco cultivation, we chose mutant tobacco at the 7–8 leaf stage to evaluate the therapeutic efficacy of 36 against TMV with ningnanmycin as positive control. The negative control group showed a severe infection of TMV, the lesions were clearly visible, the growth rate was slow, and the leaves exhibited noticeable deformities (Fig. [115]8B), this may substantially affect the tobacco leaf yield and quality. However, after therapy with 36 (compared with ningnanmycin, Fig. [116]8C), the lesions showed a decrease compared to that of ningnanmycin (Fig. [117]8D). This result demonstrates that 36 exerts a significant therapeutic effect on HDK infected with TMV, and it can substantially alleviate the detrimental effects of TMV on the host plants. Fig. 8. [118]Fig. 8 [119]Open in a new tab The curative effect of compound 36 on TMV after Photos were taken 20 days after treatment (down). (A) Normal mutations, (B) HDK and HDS at the 7–8 leaf stage infected with TMV to be treated, (C) HDK infection of TMV treated with 100.0 µg/mL ningnanmycin (NNM), (D) HDK infection of TMV treated with compound 36. Fig. 7. [120]Fig. 7 [121]Open in a new tab Structures of compounds for anti-TMV evaluation. Discussion We identified 48 terpenoid metabolites and categorized them into five distinct classes: monoterpenoids, sesquiterpenes, diterpenoids, triterpenoids, and polyisoprenoid alcohol. Among these, compounds 1, 8, 9, 11, 25, 32, 33, 39, 45, 46, and solanesol exhibited differential accumulation between the mature leaves of the NtSPS1 knockout plants and wild-type plants (Tables S1, S2, and S3). Knockdown of NtSPS1 may inhibit the synthesis of compounds 9, 39, 45, 46, and solanesol, potentially leading to the accumulation of compounds 1, 8, 11, 25, 32, and 33, which could consequently result in the observed effects. The PCA scatter plots summarize the chemical connection between mutants and WT plants at the mature stage. Meanwhile, it can be observed from Fig. [122]4B that HDK samples were clustered together, while HDS samples were grouped together, which indicated that our analysis was stable and repeatable. Additionally, we investigated the potential healing properties of the 36 metabolites for anti-TMV infections. These discoveries highlight the key role of terpenoids from N. tabacum in managing TMV. By extracting TMV-related targets from the Genecards database, we pinpointed 115 potential targets that overlapped with the compounds. The analysis of docking poses indicated that compounds 16, 18, 23, 27, and 36 exhibited strong interactions with the catalytic pocket of TMV (Fig. [123]6). Compound 16, the oxygen of the carbonyl group formed hydrogen bond with ARG92 and ILE93 residues (Fig. [124]6A). Compound 18, the hydroxyl groups on the ligand backbone and sugar formed hydrogen bonds with SER15 and VAL11 residues (Fig. [125]6B). Compound 23, the hydroxyl groups on the ligand backbone formed hydrogen bonds with SER14 and PHE10 residues (Fig. [126]6C). Compound 27, the hydroxyl groups on the ligand backbone and sugar formed hydrogen bonds with GLY85, ASP116, and ALA86 residues (Fig. [127]6D). Compound 36, the hydroxyl group on the ligand backbone formed hydrogen bond with ALA86 residue (Fig. [128]6E). This research inevitably has some inherent limitations. First, the chemical and pharmacological basis for NtSPS1 knockout on tobacco is still to be fully understood. In addition, the leaves of N. tabacum are also not fully used due to the absence of chemical and pharmacological supports. In this investigation, NtSPS1 silencing significantly downregulated solanesol content (decreased to 25% compared with wild type control) in tobacco leaves. As well as we used LC-MS/MS-based widely targeted metabolomics to understand terpenoid content of mutant and WT plants at the mature stage. But the precision and currency of database information need to be enhanced, given that the predictive results of network pharmacology heavily depend on comprehensive bioinformatics data. Moreover, although network pharmacology mainly concentrates on investigating the interactions between natural products and biological systems, it does not completely account for the intricate and holistic nature of N. tabacum. In addition, molecular docking is primarily used to simulate drug-target interactions, but N. tabacum often involve multiple targets and therapeutic pathways. Moreover, our study has only identified the key targets of terpenoids from N. tabacum in TMV treatment, hinting at the biological mechanisms and important pathways without further exploration. Conclusion 48 terpenoids, including six up-regulated and five down-regulated terpenoids compared with the wild type leaves control. We carried out an extensive investigation into the use of terpenoids for anti-TMV infections. Employing a combination of network pharmacology and molecular docking techniques, this exhaustive study sought to thoroughly comprehend the medicinal properties of N. tabacum. In the future, guided by network pharmacology and molecular docking, compounds 16, 18, 23, and 27 were found to have moderate anti-TMV activities. Furthermore, compound 36 exhibited potential anti-TMV activity, with inhibition rates in the range of 36.7% (IC[50] = 32.1 µM, Table [129]1). This detailed analysis of gene knockout mutants and LC-MS/MS analysis will offer new insights into the chemical differences between the mutants and the wild type (WT), as well as their extensive application. In this study, we investigated the effect of NtSPS1 knockout in HD on the content of terpenoids and the anti-TMV activity of this mutant. This provides valuable insight into the regulation of terpenoids, specifically solanesol, biosynthesis in tobacco. In recent years, research on terpenoid metabolism has been accelerated by advancements in synthetic biology. Advancements have led to engineering platforms for high-value terpenoid production, but engineering terpenoids in HD still face challenges due to complex metabolic and regulatory networks. Due to the limited number of standard substances, we did not obtain more comprehensive information about the correlation between NTSPS1 and other terpenoid content. In further research, we will isolate and identify more compounds from NtSPS1 knockout HD to search for relevant connections of the knockout and WT plants. Electronic supplementary material Below is the link to the electronic supplementary material. [130]Supplementary Material 1^ (2.3MB, pdf) Abbreviations NtSPS1 A solanesyl diphosphate synthases gene HD Nicotiana tabacum L. variety Hong Hua Da Jin Yuan TMV Tobacco mosaic virus WT Wild-type Author contributions JL, MLC, and QFH conceived and designed the project; JMD performed the experiment of T0 mutant plants; XL, LFZ, JDZ, and JW performed the experiment of T1 and T2 mutant plants; YX and JL performed the experiment of LC-MS/MS analysis; GYY completed the biological evaluation; JMD, XL, JDZ, JW and GYY analyzed the experimental data; JMD wrote the initial draft; All the authors discussed the results and contributed to the preparation of the final paper. Funding The authors are grateful to the project of Yunnan Daguan Laboratory (No. YNDG202301YZ04 and No. YNDG202401YZ01), the key projects of China Tobacco Yunnan Industrial (No. 2022JY02 and No. 2023YJ05) and the Project from Yunnan Key Laboratory of Tobacco Chemistry (No. 2021539200340239) for the financial support. The authors declare that they have no competing interests with the funding company. Data availability The datasets generated and/or analyzed during the current study are available in the PDB data of 2OM3 (https://www.rcsb.org/structure/2OM3), Full wwPDB EM Validation Report of 2OM3 (https://files.rcsb.org/pub/pdb/validation_reports/om/2om3/2om3_full_va lidation.pdf). The datasets used and/or analyzed during the present study are available from the corresponding author on reasonable request. Declarations Competing interests The authors declare no competing interests. Footnotes Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. Contributor Information Jing Li, Email: lijing_1107@163.com. Ming-Li Chen, Email: chenmingli@caas.cn. Qiu-Fen Hu, Email: huqiufena@aliyun.com. References