Abstract Objective The aim of the study was to propose a signature based on genes associated with antigen processing and presentation (APscore) to predict prognosis and response to immune checkpoint inhibitors (ICIs) in advanced gastric cancer (aGC). Background How antigen presentation-related genes affected the immunotherapy response and whether they could predict the clinical outcomes of the immune checkpoint inhibitor (ICI) in aGC remain largely unknown. Methods In this study, an aGC cohort (Kim cohort, RNAseq, N=45) treated by ICIs, and 467 aGC patients from seven cohorts were conducted to investigate the value of the APscore predicting the prognosis and response to ICIs. Subsequently, the associations of the APscore with the tumor microenvironment (TME), molecular characteristics, clinical features, and somatic mutation variants in aGC were assessed. The area under the receiver operating characteristic curve (AUROC) of the APscore was analyzed to estimate response to ICIs. Cox regression or Log-rank test was used to estimate the prognosis of aGC patients. Results The APscore constructed by principal component analysis algorithms was an effective predictive biomarker of the response to ICIs in the Kim cohort and 467 aGC patients (Kim: AUC =0.85, 95% CI: 0.69–1.00; 467 aGC: AUC =0.69, 95% CI: 0.63–0.74). The APscore also was a prognostic biomarker in 467 aGC patients (HR=1.73, 95% CI: 1.21−2.46). Inhibitory immunity, decreased TMB and low stromal scores were observed in the high APscore group, while activation of immunity, increased TMB, and high stromal scores were observed in the low APscore group. Next, we evaluated the value of several central genes in predicting the prognosis and response to ICIs in aGC patients, and verified them using immunogenic, transcriptomic, genomic, and multi-omics methods. Lastly, a predictive model built successfully discriminated patients with vs. without immunotherapy response and predicted the survival of aGC patients. Conclusions The APscore was a new biomarker for identifying high-risk aGC patients and patients with responses to ICIs. Exploration of the APscore and hub genes in multi-omics GC data may guide treatment decisions. Keywords: antigen processing and presentation, immune checkpoint inhibitor, immunotherapy, tumor microenvironment, survival Introduction The morbidity and mortality of gastric cancer (GC) rank fifth and fourth among all malignant tumors ([39]1). In recent years, immune checkpoint inhibitors (ICIs), such as anti-cytotoxic T-lymphocyte-associated protein 4 (CTLA-4) inhibitors and anti-programmed death 1 (PD-1)/programmed death-ligand 1 (PD-L1) inhibitors, in combination with chemotherapy and immunotherapy have made significant progress in multiple types of cancers ([40]2). Several ICIs have been approved for the clinical treatment of advanced GC (aGC) patients ([41]3, [42]4). But the overall response to ICIs is only 20% to 40% for GC patients ([43]2, [44]5). So it is urgent to accurately identify effective biomarkers to select GC patients with responses to ICIs. The tumor mutation burden (TMB), neoantigen load and clonality, copy number alterations (CNA), microsatellite instability (MSI) status, tumor microenvironment (TME), especially T cell inflammation, PD-L1 expression, and mutations in specific genes are now deemed to be predictive markers for immunotherapy in aGC patients. These indicators do, however, have limitations that have hindered their clinical application ([45]6–[46]9). Tumor immunogenicity mainly consists of tumor antigenicity and antigen presentation, playing a crucial part in response to ICIs in most types of cancer ([47]10, [48]11). But accuracy of identifying aGC patients with response to ICIs according to TMB is low ([49]12–[50]14). Tumor antigens from mutated proteins of tumor cells enable tumor cells to be recognized and killed by CD8^+ T cells through antigen-presentation mechanisms ([51]11, [52]15). Therefore, antigen presentation significantly affects the effect of ICIs treatment. Nevertheless, we do not find relevant studies on how antigen presentation-related genes affect the immunotherapy response and whether they can predict the clinical effect of ICIs therapy in aGC patients. Thus, we proposed a signature based on antigen processing and presentation genes to predict prognosis and response to ICI in aGC patients. To the best of our knowledge, this study was the first to explore the role of antigen processing and presentation in aGC and select hub genes to understand progressive tumor mechanisms and offer promising and personalized strategies to diagnose and treat GC. Methods Data source The study used GC-related data from two public platforms, the Cancer Genome Atlas (TCGA) Genomic Data Commons Data Portal ([53]https://portal.gdc.cancer.gov/) and the Gene Expression Omnibus database (GEO) ([54]https://www.ncbi.nlm.nih.gov/geo/). TCGA provided mRNA expression, copy number mutation, and clinicopathologic profiles including age, gender, Lauren type, tumor grade, tumor stage, and survival. GEO provided six GC-related cohorts including [55]GSE84437 ([56]16), [57]GSE57303 ([58]17), [59]GSE34942 ([60]18), [61]GSE29272 ([62]19), [63]GSE15459 ([64]20) and ACRG/[65]GSE62254 ([66]21). The aGC was defined as the presence of metastatic disease or tumor stages higher than IV. Patients without any survival data as well as those with less stage IV or primary tumors were excluded. The final merged aGC cohort (FMAC) consisted of 467 patients, encompassing 103 from the TCGA-STAD dataset and 364 from six GEO datasets. All gene expression or transcriptome data were transformed with log2 (x +1) and then were batch rectified by SVA Package of R ([67]22). Next, this study included two cohorts of cancer patients treated with immunotherapy. 45 aGC patients treated with ICIs in Korea ([68]5) (The Kim cohort, PRJEB25780, [69]https://www.ebi.ac.uk/ena/data/view/PRJEB25780) were included in the first cohort. The cohort collected some data about RNAseq data, immunotherapy regimens, response to ICIs, TCGA subtype, microsatellite instability (MSI), Epstein‐Barr virus (EBV), Mesenchymal subtype, single nucleotide variants (SNVs) and immune signature ([70]5). The second cohort, the IMvigor210 cohort ([71]23) ([72]http://research-pub.gene.com/IMvigor210CoreBiologies) of bladder cancer (BC), provided gene expression data, survival, immunotherapy regimens and response. Partial response (PR) and complete response (CR) to ICIs were considered to be immunotherapy responses (responses to ICIs). Moreover, patients without immunotherapy responses included those with progressive disease (PD) and stable disease (SD). Weighted gene co-expression network analysis To identify antigen processing and presentation genes related to immunotherapy response, WGCNA package of R was used to calculate Pearson correlation coefficient among genes, and select an appropriate soft threshold β according to the scale-free topological fitting index of R^2 >0.9 and average connectivity ([73]24). So, the constructed network could be in line with the standard of scale-free network. Next, the gene network was constructed by one-step method, and the adjacency matrix of expression data was transformed into a topological overlap matrix. The hierarchical clustering method was used to plot the hierarchical cluster tree to cluster genes into different color modules. The feature values of gene modules were calculated, and then Pearson correlation coefficient and correlation between the feature vector and clinical information were calculated. The gene set modules with the highest correlation with immunotherapy response, antigen processing and presentation were selected for further analysis. We first calculated the eigenvalues of gene modules, and then calculated the correlation coefficients between the feature vectors of the modules and clinical features (immunotherapy and antigen processing and presentation). The gene set modules with significant correlation with immunotherapy response and antigen processing and presentation were available for further analysis. Analysis of biological function and pathways of genes The Gene Ontology (GO; molecular function, cellular component, and biological process) database and informatics resource ([74]http://www.geneontology.org) were used to annotate gene function enrichment. Kyoto Encyclopedia of Genes and Genomes (KEGG) database ([75]http://www.genome.ad.jpl/kegg/) with analysis of cells or organisms senior functional behavior was used to annotate various pathways enrichment of genes. Additionally, the identification of key pathways enriched by hub genes was conducted by the GSCALite([76]http://bioinfo.life.hust.edu.cn/web/GSCALite/) and the Metascape web server([77]http://metascape.org/) ([78]25, [79]26). Construction of APscore The WGCNA selected genes associated with immunotherapy response and antigen processing and presentation for further analysis. Above genes often involved hundreds of genes, of which significant interrelationships were inevitable. To better analyze the expression data of these genes, we synthesized multiple variables with multiple correlations into several representative variables, representing the majority of the original variables’ information but unrelated to each other. Therefore, we performed principal component analysis (PCA) to reduce the dimension of the above genes expression data set ([80]27). Then we obtained principal component 1 and 2, of which the sum was defined as the APscore ([81]28). The specific formula is as follows: APscore= [MATH: (PC1i+PC2i) :MATH] , where i represents the expression of genes. PCA scores for some important tumor-related pathways including antigen processing machinery, immune checkpoint, CD8 T effector, DNA damage repair, epithelial-mesenchymal transition (EMT) and so on were calculated according to the expression of genes enriched in these pathways ([82]29–[83]31). Assessment of immune microenvironment and immunotherapy response Twenty-two immune cell invasions according to gene expression were assessed by the CIBERSORT algorithm ([84]32). According to gene expression, the absolute abundance of eight immune and two stromal cell populations were evaluated by the microenvironment cell populations-counter (MCP-counter) method ([85]33). The ESTIMATED algorithm assessed each patient’s immune and stromal scores according to gene expression ([86]34). The TIDE algorithm platform was used to evaluate immunotherapy response according to standardized gene expression in FMAC ([87]35). Selection of hub genes association with prognosis and immunotherapy response The ten most crucial hub genes associated with immunotherapy response and antigen processing and presentation were identified using the MCODE and Cytohubba of the Cytoscape software ([88]36, [89]37). Then we assessed the efficacy of hub genes predicting prognosis in FMAC and immunotherapy response in the Kim cohort. The predictive efficacy was also validated in another aGC cohort ([90]GSE26253) treated with chemoradiotherapy ([91]38). Construction of predictive model for prognosis and immunotherapy response First, we selected target differently expressed genes (DEGs) that were associated with immunotherapy response and enriched in the antigen processing and presentation in the Kim cohort. Second, those target DEGs were intersected with genes from FMAC, and then shared genes were obtained and used to construct the predictive model for prognosis by Cox regression in FMAC and for immunotherapy response by logistics regression in the Kim cohort. Last, the risk score of the predictive model was calculated based on the coefficient of Cox regression or logistics regression and gene expression. The specific formula is as follows: [MATH: risk sc ore= (expression of genei×coefficienti) :MATH] Construction of nomogram for prognosis We used Cox regression to select some independent variables including several signaling pathways, the risk score of the predictive model for prognosis and immunotherapy response, and the ratio of M1 Macrophages to M2 macrophages, which had significant associations with the prognosis of GC patients. According to the above independent variables, we built a nomogram to predict the survival of aGC patients. Additionally, each patient’s risk score and 3, 5 and 8-year survival were calculated. The Cox regression was conducted by rms or survival package of R; the nomogram was plotted by regplot package of R ([92]39). The timeROC package of R was used to plot receiver operating characteristics (ROC) curves for 3, 5 and 8-year survival, respectively ([93]40). The calibration and decision curves were plotted by the rms package and rmda of R, respectively. Specimen collection Twenty-five patients with GC who underwent surgical resection in the Affiliated Hospital of Jiangnan University were collected. None of the patients received any chemotherapy, radiotherapy or biotherapy before surgery. This study was approved by the ethics committee of affiliated hospital of Jiangnan University and followed the guidelines of the Declaration of Helsinki. Informed consent was obtained from all participants. All specimens were diagnosed as GC by two independent pathological diagnosis doctors. All samples were immersed in formalin solution and fixed for paraffin embedding (FFPE). 4 μm sections were prepared for immunohistochemical staining. Hematoxylin-eosin staining The FFPE section was dewaxed with xylene. Xylene was then deoxidized with gradient alcohol. The tissue was stained for 3min by Hematoxylin. The nuclear staining was observed under the microscope. The tissue was further stained for 90s by the Eosin solution, and the staining was examined under the microscope. Immuno-histochemistry Quantitative analysis methods performed IHC and multi-color immunofluorescence. First, FFPE was dewaxed and hydrated, and the EDTA method repaired tissue antigen. H[2]O[2] was used to inactivate endogenous peroxidase in tissues. Second, the tissue was covered entirely with primary antibody solution and placed in a wet box at 4°C for 12 hours. Then, the tissue retemperature at room temperature for 30min. The second antibody (horseradish peroxidase-labeled) was added and incubated for 60 min at room temperature for Tetramethylbenzidine color rendering. An optical microscope observed the brown-yellow particles as a positive color reaction. The primary antibody used in this study: CK (ab52625, Abcam, 1: 1000), CD8 (14-0081-82, Invitrogen Antibodies, 1:200), PD-L1(ab205921, Abcam PD-L1(ab205921, Abcam, 1: 1000), STAT1: (ab29045, Abcam, 1:2000), IFIT3 (ab76818, Abcam, 1:000), TAPBM (ab13518:Abcam, 1: 400). Immune classifications of GC were defined by the CD8^+ T cells infiltrating the extent and number of tissues ([94]41, [95]42). The immune-inflamed subtype was characterized by CD8^+ T cells spread throughout the tumor parenchyma and surrounding stroma. The excluded-immune subtype was characterized by CD8^+ T cells infiltration in the peritumor stroma, not in the parenchyma. The deserted-immune subtype was characterized by the absence of CD8^+ T cells in tumor parenchyma and stroma. Multi-color immunofluorescence After being dewaxed, the (FFPE section was repaired by EDTA antigen repair buffer (PH 9.0). The tissues were added with PBS (PH7.4) and primary antibody and incubated overnight at 4°C. The tissues were then covered with secondary antibodies and incubated at room temperature for 50min. DAPI dye was dropped and incubated for 10min at room temperature, away from light. The slices were briefly shaken dry and sealed with anti-fluorescence quenching sealing tablets. The fluorescence microscope was used to observe the protein’s location, shape and quantity, and images were collected. Statistical analysis The t-test or Wilcoxon signed-rank test was used to compare two groups of continuous variables. One-way analysis of variance or Kruskal-Wallis H test was used for comparison among multiple (n>2) groups. The chi-square test or nonparametric rank-sum test was used to compare categorical variables. Pearson or Spearman correlation analysis conducted correlation analysis of two continuous variables. Kaplan-Meiers curve was used to show the survival of the sample over time, and the log-rank test was used to test whether there were significant differences in the survival of multiple groups. The Cox regression was used to select factors affecting survival. Logistic regression was used to screen factors affecting immunotherapy response. All statistical analysis was conducted in R software (version 4.0.3). All analyses were two-sided and tested at the nominal 0.05 significance level. Results Selection of key modules associated with prognosis and response to ICIs We identified 2759 DEGs in patients with and without response to ICIs in the Kim cohort ([96] Figure 1A ; [97]Table S1 ). According to the optimal soft-thresholding power of 26, WGCNA of these DEGs was conducted ([98] Figures S1A, B ) and showed that these DEGs were divided into four different modules ([99] Figures 1B, C ). Of which the blue module (r = 0.4, p = 0.007), the brown module (r =0.56, p = 6e–05) and the grey module (r = −0.65, p = 1e–06) were significantly related with immunotherapy response ([100] Figure 1C ), respectively. Furthermore, the number of brown module members (n=85) significantly correlated with the gene signature (r=0.3, p=0.005, [101]Figure 1D ). Similar result was found in the grey module (n=1737, r=0.55, p=6.5e–138; [102]Figure 1E ). However, there were no significant correlations of the gene signature with the blue module and turquoise module (blue module: r= −0.13, p=0.21; turquoise module: r= −0.078, p=0.35; [103]Figures S1C, D ). Figure 1. [104]Figure 1 [105]Open in a new tab Identification of genes associated with immunotherapy response. (A) The volcano figure of differentially expressed genes (DEGs) associated with immunotherapy response in the Kim cohort. logFC: log-fold changes. (B) Weighted gene coexpression network analysis (WGCNA) of 2759 DEGs with a soft threshold β = 26. (C) Heat maps showing the gene modules associated with immunotherapy response. The relationship between the gene signature and brown module genes (D) or grey module genes (E). Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis for genes in brown module (F) or grey module (G). GO analysis showed that brown module genes were enriched in several immune functions, such as immune receptor activity, major histocompatibility complex (MHC) protein binding, T cell activation and cytokine activity ([106] Figure S1E ; [107]Table S2 ). Those grey module genes were enriched in the extracellular organization and MHC protein complex functions ([108] Figure S1F ; [109]Table S3 ). Moreover, KEGG analysis showed that brown module genes were enriched in cytokine-cytokine receptor interaction, antigen processing and presentation, T cell receptor signalling pathway, NOD-like receptor signalling pathway, JAK-STAT signalling pathway, TNF signalling pathway and Toll-like receptor signalling pathway, which affected the occurrence, progression and response to ICIs in multiple types of tumors ([110]29–[111]31) ([112] Figure 1F ; [113]Table S4 ). Those grey module genes were enriched in the cGMP-PKG signaling pathway, antigen processing and presentation, Gastric acid secretion and Rap1 signaling pathway ([114] Figure 1G ; [115]Table S5 ). Notably, five brown module genes and 23 grey module genes were enriched in antigen processing and presentation ([116] Tables S4, 5 ), implying that antigen presentation status may significantly contribute to immunotherapy. Next, of 1822 genes (85 brown and 1737 grey genes), 637 genes (turquoise module) were significantly related to antigen processing and presentation by WGCNA (r=0.54, p=1e–04; [117]Figures S2A, B ). Furthermore, 637 genes significantly correlated with the gene signature (r=0.56, p=7.3e–54; [118]Figure S2C ). The APscore predicts immunotherapy in the Kim cohort After intersecting 637 genes above with 10936 genes from FMAC ([119]GSE84437, [120]GSE57303, [121]GSE34942, [122]GSE29272, [123]GSE15459, ACRG/[124]GSE62254 and TCGA-STAD), altogether 366 genes were used to construct a scoring system based on the PCA algorithm, which we called APscore ([125] Figures 2A, B ). To estimate the relationships of the APscore with target genes, immunotherapy response, immune signature, mesenchymal subtype, TCGA subtype, number of SNVs, EBV subtype and MSI subtype in GC, a combined heat map was plotted ([126] Figure 2A ). Figure 2. [127]Figure 2 [128]Open in a new tab The characteristics of the APscore as an indicator of immunotherapy response in aGC patients. (A) The relationship of the APscore with other subtypes of GC. (B) Venn showing 366 genes associated with the APscore. (C) Receiver operating characteristics (ROC) for the APscore in predicting immunotherapy response. Comparison of the APscore in different groups of responses to immunotherapy (D), TCGA subtypes (E), MSI subtypes (F), EBV subtypes (G), Mesenchymal subtypes (H), Number of SNVs subtypes (I) and immune signature subtypes (J). Based on the cutoff thresholds derived by the Youden index for continuous variables—the APscores of all patients, each patient was regrouped into the high or low-APscore group ([129]43). It was found that most of the 28 genes enriched in antigen processing and presentation were expressed higher in the high-APscore group than in the low-APscore group. The APscore could distinguish between patients with vs. without response to ICIs [area under curve (AUC) =0.85, 95% CI (confidence interval): 0.69–1.00; [130]Figure 2C ]). Moreover, higher APscore was observed in patients with immunotherapy response (p=0.00019, [131]Figure 2D ). Next, the APscore was significantly different among four TCGA subtypes, two MSI subtypes, two EBV subtypes and three Mesenchymal subtypes ([132] Figures 2E–I , all p<0.05). High-APscore patients were likely to have positive EBV status, high MSI status and high mutational load, which were known as beneficial factors of immunotherapy, implying that the APscore may be a valuable indicator of immunotherapy. Similarly, higher APscore was observed in the high immune signature subtype (P= 7.5e–05, [133]Figure 2J ). The immune signature involved 12 genes, CCL2, CCL3, CCL4, CCL5, CCL8, CXCL9, CXCL10, CXCL11, CXCL13, CL18, CCL19 and CCL21, of which expression levels reflected tumor microenvironment in GC ([134]44). So, it was worth further exploring microenvironment features in different APscore groups. Immune microenvironment characteristics of different APscore groups To assess the immunotherapy response, standardized genes expression of all patients in FMAC were entered into TIDE algorithm platform. The APscore of each patient was calculated based on the standardized expression of 366 genes, which were associated with antigen processing and presentation. The APscore showed significant negative correlations with TIDE, Dysfunction, CD8, Merck18, IFNG and MSI Expr Sig, and positive correlations with CAF, TAM M2, Exclusion and MDSC (all p< 0.05; [135]Figures 3A , [136]S3A ). A high TIDE score represented poor efficacy of immune checkpoint blocking therapy (ICB) and short survival. Moreover, a higher APscore was observed in patients with response to ICIs predicted by the TIDE algorithm ([137] Figure 3A , p=1.2e–10), consistent with [138]Figure 2D . Expected, the APscore distinguished patients with response to ICIs from patients without response to ICIs in FMAC (AUC=0.69, 95% CI: 0.63–0.74; [139]Figure S3C ). These results show that the APscore can predict immunotherapy response in aGC patients. Figure 3. [140]Figure 3 [141]Open in a new tab The associations of the APscore with immunotherapy response, immune microenvironment, and survival in aGC patients. (A) The relationship of the APscore with indicators of immunotherapy response predicted by the TIDE algorithm. (B) The relationship of the APscore with infiltration of the 22 immune cells calculated by the CIBERSORT algorithm. (C) The interrelations of the APscore with PCA genes scores of important biological pathways of GC. The white squares represent no statistically significant. (D) Kaplan-Meier survival plots in both groups of 467 aGC patients. (E) Comparison of the tumor purity, estimate, immune, and stromal scores between the high-APscore and low APscore-groups. (F) Comparison of the 22 immune cells proportions between the high-APscore and low APscore-groups. Asterisks denote significant differences (*p< 0.05; **p< 0.01; ***p< 0.001). Next, we used the CIBERSORT algorithm to estimate the infiltration of the 22 immune cells. The APscore had significantly positive correlations with regulatory T cells (Tregs), memory resting CD4 T cells, M0 Macrophages, activated mast cells and activated NK cells, and significantly negative correlations with gamma delta T cells, follicular helper T cells, plasma cells, resting dendritic cells, memory B cells, eosinophils, activated dendritic cells, neutrophils and CD8 T cells (all p< 0.05, [142]Figure 3B ). Of note, the correlations of the APscore with PCA genes scores of crucial biological pathways showed that the APscore had significantly positive correlations with cell cycle, TGF beta signaling pathway, Wnt signaling pathway, antigen processing machinery, antigen processing and presentation, DNA damage repair, angiogenesis, EMT1 and nucleotide excision repair scores, and significantly negative correlations with mismatch repair, DNA replication, EMT2, MAPK signaling pathway, JAK/STAT signaling pathway, NF kappa B signaling pathway, ECM receptor interaction and base excision repair scores ([143] Figure 3C , all p< 0.05). Patients exceeding a certain APscore threshold, defined by the Youden index, were classed as a high-APscore group, the opposite as a low-APscore group. By log-rank test, patients in the high-APscore group had poorer survival than those in the low-APscore group (p=0.0022, HR=1.73, 95% CI: 1.21−2.46, [144]Figure 3D ). In addition, we used ESTIMATED algorithms to estimate immune and stromal scores for each patient. The estimate, immune, and stromal scores were significantly higher in the low APscore-group than that in the high APscore-group, while the opposite trend was observed for the tumor purity (all p<0.05, [145]Figure 3E ). Similarly, representative cells for anti-tumor immunity such as NK cells, memory resting CD4 T cells, Tregs, Monocytes and M0 Macrophages were more abundant in the high-APscore group (all p<0.05, [146]Figure 3F ). These results further verify that the APscore may influence the prognosis and response to ICIs through classifying tumor subtypes according to the tumor microenvironment. SNVs of different APscore groups Previous studies suggest that accumulated somatic mutations induced the immune system to produce anticancer cells, and tumor mutation load affected immunotherapy response and the survival of GC patients ([147]10, [148]11, [149]44). To explore the genetic imprints of different APscore groups, we used SNV data of 101 aGC patients from TCGA-STAD to analyze the relationship between the TMB and the APscore. The APscore had a significantly negative correlation with TMB (r=−0.42, p=1.2e–05, [150]Figure 3SD ). The high-APscore group had lower TMB than the low−APscore group (p=0.001, [151]Figure 3SE ). Moreover, the high-TMB group had longer survival than the low-TMB group (log-rank test, p = 0.015, [152]Figure 3SF ). However, we could not find significantly different survival with a p-value of 0.05 among four subgroups formed by the APscore and TMB (p=0.05, [153]Figure 3SG ). This result may be largely due to limited samples (n=101). Next, the APscore as a binary variable entered into Cox regression model, and then was an independent prognostic predictor of aGC patients after adjusting other clinicopathologic characteristics, including Lauren subtype, age, gender, cancer grade, CD274 expression, TMB and the ratio of M1 Macrophages to M2 Macrophages (HR=4.42, 95% CI:1.36−14.28, [154]Figure 3SH ). Moreover, the efficacy of the APscore predicting survival of aGC patients was explored in 14 independently thermal TCGA cohorts including 6673 patients ([155] Table S6 ). The APscore was a significant factor in the prognosis of patients with breast invasive carcinoma (BRCA), head and neck squamous cell carcinoma (HNSC), brain lower grade glioma (LGG) or skin cutaneous melanoma (SKCM), which were generally considered thermal tumors with infiltration by a variety of T cells ([156] Figure S3I ). Therefore, it needs to explore further the interaction mechanism of the APscore and T-cell infiltration to influence the prognosis. To assess the driver mutations in different APscore groups, the mutated genes from TCGA-STAD were listed using maftools. In the high-APscore group, the top five mutated genes were TP53, TTN, MUC16, LRP1B, and DNAH5, respectively, whereas the top five were TTN, MUC16, ARID1A, PIK3CA and KMT2D, respectively, in the low-APscore group ([157] Figure S3J ). Of note, compared with the low-APscore group, the high-APscore group had lower frequencies of mutated genes ARID1A, PCLO, KMT2D, OBSCN and PIK3CA (all p<0.05, [158]Figure S3J ). A previous study used the TCGA-STAD dataset to estimate the correlation of PIK3CA with sensitivity or resistance ([159]8). Our results provide new insights into the role of the APscore in GC with special emphasis on its effect on individual mutations to offer a reference for the effect of tumor immunotherapy. Selection of hub genes associated with prognosis To assess novel targets for GC immunotherapy, we selected the ten most important hub genes (STAT1, IRF1, ISG15, IRF9, BST2, PSMB8, STAT2, IFI35, IFIT3 and OAS2) from above 366 genes being used to generate the APscore. Next, we plotted the interaction map of hub genes and pathways using the Cytoscape software ([160] Figures 4A, B ). Of note, ten genes were all enriched in interferon-alpha (IFNα)/beta (IFNβ) signaling using the Metascape ([161] Table S7 ). Some studies suggest that IFNα and IFNβ are potential anti-tumor immune cytokines, resulting in improved outcomes for patients with malignancies of heterogeneous histologies ([162]45, [163]46). In the study, hub genes activated four signaling pathways: cell cycle, EMT, ER Hormone and apoptosis, and inhibited five signaling pathways: RTK, RAS/MAPK, TSC/mTOR, PI3K/AKT and AR Hormone pathway ([164] Figure 4B ). This result further highlights the potential value of hub genes to predict immunotherapy. It needs to explore the interaction between hub genes and T cells to influence immunotherapy and prognosis in GC. Figure 4. [165]Figure 4 [166]Open in a new tab Selection and characteristics of hub genes associated with immunotherapy response and prognosis. (A) A network of protein-protein interactions for 10 hub genes. (B) The profiles of hub genes affecting canonical signaling pathways. The Kaplan-Meier survival curves of STAT1 (C) and IFIT3 (D) expression affecting prognosis. (E) ROC for the STAT1 and IFIT3 predicting immunotherapy response in the Kim cohort. (F) The correlations of hub genes with abundance of seven immune and two stromal cells, calculated by the MCP-counter method. (G) Immunohistochemistry detected the expression of CD8, PD-L1, STAT1, and IFIT3 in the immune-inflamed subtype, immune-excluded subtype, and immune-deserted subtype of GC. The scale corresponds to 50μm. (H) Multi-color fluorescence staining detected the spatial distribution of STAT1, IFIT3 and CD8 in the immune-inflamed subtype, immune-excluded subtype, and immune-deserted subtype of GC. The scale corresponds to 20μm. Next, it was found that ten genes affected the survival of patients in FMAC ([167] Figures 4C, D , [168]4SA ). The expression level of STAT1 and IFIT3 was significantly associated with longer survival (STAT1: log-rank test, p = 0.021; IFIT3: log-rank test, p = 0.00064). In the Kim cohort, patients responding to ICIs had higher expression levels of ten genes than those (all P< 0.05, [169]Figure S4B ). Intriguingly, STAT1 and IFIT3 distinguished individuals with response to ICIs from individuals without response to ICIs in the Kim cohort (STAT1: AUC =0.84, 95% CI: 0.66−1.00; IFIT3: AUC =0.77, 95% CI: 0.59−0.95; [170]Figure 4E ). Survival analysis further showed that high expression level of STAT1 and IFIT3 predicted progression-free survival (PFS) benefit for aGC patients in the [171]GSE26253 cohort treated with chemoradiotherapy (all p<0.05; [172]Figures S4C, D ). In addition, the expression level of STAT1 and IFIT3 was positively related to seven types of immune cells and negatively associated with fibroblasts and endothelial cells calculated by the MCPcounter (p< 0.05, [173]Figure 4F ; [174]Table S8 ). We observed similar correlations of the expression level of STAT1 and IFIT3 with activated dendritic cells, activated NK cells, CD8 T cells and memory activated CD4 T cells, which were representative cells for anti-tumor immunity estimated by the CIBERSORT. (all p< 0.05; [175]Figure 4SE ). To further explore the relationship between the expression of STAT1 and IFIT3 and the immune grade of GC, we selected 25 FFPE samples to quantify the potential relationship between the above two genes and tumor immune microenvironment using HE staining, immunohistochemistry and multi-colour immunofluorescence. The GC was divided into three immune subtypes: immune-inflamed subtype, immune-excluded subtype, immune-deserted subtype according to CD8^+ T cell infiltration in the tissues ([176] Figure 4G ). At the same time, the relationship between the two genes and immune cell localization and PD-L1 expression was also shown in [177]Figure 4G . It was found that the highest expression of PD-L1, STAT1 and IFIT3 in the immune-inflamed subtype, followed by the immune-excluded subtype and immune-deserted subtype. Moreover, we further experimented with multi-color fluorescence staining to explore the spatial relationship between the two genes and CD8 in different immune subtypes of GC ([178] Figure 4H ). Expected, STAT1 and IFIT3 were highly expressed in tumors with high CD8 infiltration levels but lowly expressed in tumors with low CD8 infiltration levels. These results suggest that two genes may influence the role of CD8^+ T cells in GC immunotherapy. Association of hub genes variants with prognosis Many studies have shown that individual altered genes affect the survival and response to ICIs in multiple types of tumors ([179]12, [180]47). The distributions of SNVs from the TCGA-STAD cohort showed that altered frequencies of STAT1 and IFIT3 came first and third among 48 gastric cancer patients ([181] Figure 4SF ). Similarly, the altered frequencies of STAT1 and IFIT3 came second and fourth among 525 samples from the pan-cancer TCGA cohort ([182] Figure 4SF ). TIMER web serve allowed us to assess the relationship between six immune cell infiltrates and hub gene somatic copy number variations (CNVs) according to the TCGA-STAD dataset ([183]48, [184]49). It was found that there were significant interactions between five various CNVs of each hub gene and the infiltration level of six immune cells (all p<0.05; [185]Figure S5A ). Moreover, heterozygous amplification was found to dominate in uterine carcinosarcoma (UCS), esophageal carcinoma (ESCA), adrenocortical carcinoma (ACC) and testicular germ cell tumors (TGCT); and heterozygous deletion was present in ovarian serous cystadenocarcinoma (OV) and breast invasive carcinoma (BRCA, [186]Figure S5B ). Heterozygous amplification and heterozygous deletion were the top two presents in the TCGA-STAD dataset. Next, survival analysis of pan-cancer showed that hub genes might be risk factors of survival for five TCGA cohorts, including LGG, kidney renal papillary cell carcinoma (KIRP), kidney renal clear cell carcinoma (KIRC), uveal melanoma (UVM) and thymoma (THYM; HR > 1, p< 0.05; [187]Figure S6A ), while the opposite tendency was found for the SKCM cohort (HR<1, P<0.05). Hub genes mainly inhibited apoptosis, EMT, and ER hormone signaling pathways and activated RTK, AR Hormone, PI3KAKT and TSCmTOR signaling pathways in pan-cancer cohort ([188] Figure S6B ). Moreover, Hub genes expression had significantly positive correlations with the frequencies of CNV in most diverse cancer types, implying that CNV may positively regulate the hub genes expression in the process of invasion, metastasis and recurrence of a variety of tumors ([189] Figure S6C ). A predictive model for prognosis and immunotherapy response By intersecting 28 genes being enriched in antigen processing and presentation ([190] Figures 1F, G ), with 10511 genes from 467 patients in FMAC, 20 candidate genes were obtained ([191] Figure 5A ). After stepwise backward Cox regression, the final predictive model consisting of nine genes (B2M, CTSB, HLA-A, HLA-C, HLA-DRA, HLA-F, HSPA2, KLRD1 and TAPBP) was constructed. The formula of the predictive model calculated the risk score of each patient. Risk scores in dead patients were significantly higher than that in alive patients (p=2e−05, [192]Figure 5B ). Those patients with risk scores exceeding the median risk score were classified as high-risk patients. On the contrary, other patients were classified as low-risk patients. The survival of high-risk patients was poorer than low-risk patients (p< 0.0001, [193]Figure 5C ). In particular, the risk score could identify dead patients from alive patients (AUCs for one-year, three-year, and five-year survival were 0.628, 0.643 and 0.676, respectively; [194]Figures 5D, E ). Figure 5. [195]Figure 5 [196]Open in a new tab Construction and validation of the predictive model for prognosis and immunotherapy response. (A) Venn map showing hub genes used to construct the predictive model. (B) Comparison of the standardized risk score between dead and alive patients in FMAC. (C) The Kaplan-Meier survival curves between the high-risk and low-risk patients in FMAC. (D) Standardized risk score distribution plot in FMAC. (E) ROC for the risk score predicting prognosis in FMAC. (F) Comparison of the standardized risk score between tumor progression and tumor progression-free patients in the [197]GSE26253 dataset. (G) The Kaplan-Meier progression-free survival (PFS) curves between high-risk and low-risk patients in [198]GSE26253 dataset. (H) Standardized risk score distribution plot in [199]GSE26253 dataset. (I) ROC for the risk score predicting prognosis in [200]GSE26253 dataset. (J) Comparison of the standardized risk score between patients with immunotherapy response and patients without immunotherapy response in the Kim cohort. (K) Standardized risk score distribution plot in the Kim cohort. (L) ROC for the risk score predicting immunotherapy response in the Kim cohort. (M) Comparison of the standardized risk score between patients with immunotherapy response and patients without immunotherapy response in the IMvigor210 cohort. (N) Standardized risk score distribution plot in the IMvigor210 cohort. (O) ROC for the risk score predicting immunotherapy response in the IMvigor210 cohort. (P) The Kaplan-Meier survival curves between the high-risk and low-risk patients in the IMvigor210 cohort. Next, we used the [201]GSE26253 dataset to validate the predictive model for prognosis and immunotherapy response and found that risk scores in tumor progression samples were significantly higher than those of tumor progression-free samples (p= 0.0021, [202]Figure 5F ). The PFS of high-risk patients was poorer than low-risk patients (log-rank test, p=0.00087, [203]Figure 5G ). Additionally, the risk score identified tumor progression samples from tumor progression-free samples (AUCs for one-year, three-year, and five-year survival were 0.782, 0.728 and 0.787, respectively; [204]Figures 5H, I ). To further explore whether the above nine genes can predict the response to ICIs in the Kim cohort, the risk score of each patient was calculated using the formula of logistic regression analyses. Expectedly, patients with response to ICIs had higher risk scores than those without response to ICIs (p=2.6e−08, [205]Figure 5J ). Of note, the risk score perfectly discriminated patients with vs. without response to ICIs as evidenced from the ROC curve and corresponding high AUC value (AUC=0.97, 95% CI: 0.92−1.00; [206]Figures 5K, L ). Moreover, we used the IMvigor210 cohort of BC to estimate the ability of the risk score to predict response to ICIs. Consistent with the results of the Kim cohort, risk scores in BC patients with response to ICIs were significantly higher than those without response to ICIs (p=5.2e−05, [207]Figure 5M ). The risk score also could discriminate BC patients with vs. without response to ICIs (AUC=0.63, 95% CI: 0.57−0.69; [208]Figures 5N, O ). Lastly, the survival of high-risk patients was poorer than low-risk patients (p=0.0036, [209]Figure 5P ) for the IMvigor210 cohort. In brief, the risk score consisted of nine genes associated with antigen processing and presentation is an effective indicator of prognosis and response to ICIs for aGC patients. Predictive model genes for prognosis and immunotherapy response In the Kim cohort, patients with response to ICIs had lower HSPA2 expression than that without response to ICIs, whereas the opposite was observed for the other eight genes (all p<0.05, [210]Figure 6A ). Of nine genes, eight could discriminate patients with vs. without response to ICIs except for HLA-A ([211] Figure 6B ). Figure 6. [212]Figure 6 [213]Open in a new tab TAPBP was related to the immunotherapy response and the tumor immune microenvironment. (A) Comparison of nine predictive model genes between CR/PR group and SD/PD group in the Kim cohort. (B) ROCs for nine genes predicting immunotherapy response in the Kim cohort. (C) The Kaplan-Meier survival curves between the Kim cohort’s high-TAPBP expression group and the low-TAPBP expression group. (D) HE and IHC staining results in tumor and normal tissues in GC. The expression of TAPBP in between tumor tissues and normal tissues (E) or adjacent normal tissues (F) from the TCGA-STAD cohort. (G) The multi-color fluorescence staining detected the spatial distribution of TAPBP and CD8 in the immune-inflamed subtype, immune-excluded subtype, and immune-deserted subtype of GC. The distributions of nine predictive model genes somatic alterations from TCGA-STAD cohort (H) and pan-cancer TCGA cohort (I). Asterisks denote significant differences (*p< 0.05; **p< 0.01; ***p< 0.001). We further assessed the correlations between nine predictive model genes and 22 immune cells ([214] Figure S7A ). Eight genes (TAPBP, KLRD1, HSPA2, HLA-DRA, CTSB, B2M, HLA-C and HLA-A) were significantly associated with more than nine types of immune cells in 467 patients (all p<0.05). This result implied that nine genes might regulate immune cell infiltration in tumor tissue. Notably, TAPBP was significantly correlated with most immune cells (n=16) and was positively correlated with CD8 T cells, activated memory CD4 T cells and activated NK cells, which could kill tumor cells. The MCP-counter was used to estimate the absolute abundance of eight immune and two stromal cell populations in GC tissues from TCGA-STAD transcriptomic data, showing that TAPBP had positive relationships with most immune cells except for neutrophils ([215] Figure S7B ). Additionally, the survival of patients with high TAPBP expression was better than low TAPBP expression, implying that this gene might be a protective factor (p = 0.019, [216]Figure 6C ). The hematoxylin-eosin staining (HE) showed that the infiltrating gastric carcinoma cells in tumor tissues were higher than in adjacent tissues ([217] Figure 6D ). Moreover, immunohistochemistry (IHC) was used to measure the expression of TAPBP protein in tumor tissues and adjacent normal tissues. The staining intensity score of TAPBP in tumor tissues was significantly higher than that in adjacent normal tissues ([218] Figure 6D ). According to the TCGA-STAD cohort, it was found that the expression of TAPBP mRNA in tumor tissues was higher than that in normal tissues (Mann-Whitney U test, p<0.0001; [219]Figure 6E ). Pair Wilcoxon signed-rank test also suggested that the expression of TAPBP mRNA in tumor tissues was higher than in adjacent normal tissues (p<0.0001; [220]Figure 6F ). The multi-color fluorescence staining revealed the highest expression of TAPBP and CD8 in the immune-inflamed subtype, followed by the immune-excluded subtype and immune-deserted subtype ([221] Figure 6G ). Moreover, patients with high HLA-C, HLA-DRA, HLA-F and KLRD1 expression had better survival, while the opposite trend was observed for high B2M, CTSB, HLA-A and HSPA2 expression ([222] Figure 7SC ). We used 1146 individuals with more stage III from 16 independent TCGA cohorts to assess the prognostic value of TAPBP. Although TAPBP was not significantly associated with survival in 16 types of tumors, the meta-analysis showed that patients with high-expression TAPBP had better survival than low-expression TAPBP (p<0.0001, [223]Figure 8SA ). The distributions of somatic alterations from the TCGA-STAD cohort showed that altered frequencies of B2M and TAPBP came top two among 56 GC patients ([224] Figure 6H ). Similarly, the altered frequencies of B2M and TAPBP came first and fourth among 484 samples from pan-cancer TCGA cohort ([225] Figure 6I ). We further analyzed the difference of nine predictive model genes between subtypes in the pan-cancer TCGA cohort, and found six types of hot cancers including BRCA, lung adenocarcinoma (LUAD), glioblastoma multiforme (GBM), KIRC, lung squamous cell carcinoma (LUSC) and HNSC were the most significant ([226] Figure 8SB ). KIRC has the largest fold difference of nine predictive model genes expression between tumor and normal tissue ([227] Figure S8C ). Similar changes in nine predictive model genes were observed for BRCA and HNSC, whereas the opposite was observed for LUSC. The heatmap showed that nine predictive model genes initiated apoptosis, EMT, hormones ER and RAS/MAPK signaling pathways, while the opposite function was observed in cell cycle, hormones AR and DNA damage response signaling pathways ([228] Figure S8D ). The above results revealed that nine predictive model genes might interact with immune cells in the tumor microenvironment to affect a variety of cancers invasion, metastasis and recurrence. A nomogram for prognosis The univariate Cox analysis identified the risk score of nine prediction model genes and nine signaling pathways, including ECM receptor interaction, DNA damage repair, EMT2, Nucleotide excision repair, Base excision repair, Immune checkpoint, CD8 T effector and JAK-STAT signaling pathways, significantly associated with the prognosis of GC patients (all p<0.05; [229]Figure 7A ). The multivariate Cox analysis further showed that the risk score of nine prediction model genes, MAPK signaling pathway, Wnt signaling pathway, the ratio of M1 Macrophages to M2 macrophages were risk factors of the survival in GC, while ECM receptor was a predictive factor (all p<0.05; [230]Figure 7B ). Although CD8 T effector did not have a significant association with the survival (p =0.055), it was an essential indicator of immunotherapy and was enter into the multivariate Cox analysis. Figure 7. [231]Figure 7 [232]Open in a new tab Construction of a nomogram for prognosis in FMAC. (A) The univariate and (B) multivariable Cox regression for the risk score and essential signaling pathways in FMAC. (C) The nomogram constructed by six variables. The red dot represents the risk score corresponding to the value of the independent variable. According to the total scores of the nomogram, the three-year, five-year and eight-year death probability in randomly selected the 400th patient being 0.901, 0.958 and 0.977, respectively. (D) ROCs for three-year, five-year and eight-year death probability of the nomogram. (E) Calibration curves of the predicted five-year survival probability and actual five-year survival probability. The gray line indicates ideal fit; the circle indicates the predicted five-year survival probability of nomogram; the stars indicate bootstrap correction estimates, and the error bars indicate the 95% CI of these estimates. (F) Decision curves for the nomogram, the risk score of nine prediction model genes and other four factors. Next, we built a nomogram model of six significant variables in the multivariate Cox regression ([233] Figure 7C ). The AUCs for predicting three-year, five-year, and eight-year survival of GC patients were 0.657, 0.689 and 0.709, respectively ([234] Figure 7D ). When compared to previously published prognostic biomarkers of various measurements, including mRNA expression, lncRNA expression, microRNA expression, protein, and DNA methylation level for predicting the prognosis of GC patients ([235]50–[236]55), the AUC of our nomogram model for predicting eight-year survival appeared to be higher. The capacity to predict outcomes for more patients is likely increased by the addition of more clinical characteristics to our model, or it may be facilitated by the inclusion of more samples and a longer period of follow-up in our study. Calibration curves showed that the predicted 5-year survival probability and actual 5-year survival probability were hovering near the 45-degree diagonal line, implying that the nomogram model has a superior ability to predict GC prognosis ([237] Figure 7E ). Moreover, decision curve analysis showed that the curves of the nomogram model and risk score of nine prediction model genes were isolated from angles of treatment for all and treatment for none, highlighting the clinical potential of the nomogram model in the prognosis of aGC ([238] Figure 7F ). Discussion The antigen processing and presentation plays a crucial role in effector T cells recognition of tumor cells, influencing the identification of patients susceptible to immunotherapy ([239]11). This study identified a novel biomarker of genes related to antigen processing and presentation, predicting response to ICIs for aGC patients. Currently, most studies use TCGA or GEO datasets to build models for predicting prognosis rather than response to ICIs for aGC patients due to the lack of ICIs treatment results. Therefore, we explored the response to ICIs for aGC patients in the Kim cohort treated by ICIs to make the results have direct clinical translational significance. In this study, we built a scoring system to evaluate the efficacy of immunotherapy for aGC patients. We validated the results—the APscore distinguished patients with or without response to ICIs in the Kim cohort. Moreover, patients with response to ICIs had higher APscore than those without response to ICIs. Several subtypes of GC, such as high MSI subtype, positive EBV subtype, high TMB subtype and high immune signature subtype, benefited from ICIs treatment ([240]5). These subtypes also had high Apscore in the Kim cohort, further revealing that APscore might effectively predict immunotherapy efficacy. Thus, the critical biomarkers involved in the mechanism of immunotherapy need to be identified and assessed. Ten hub genes, STAT1, IRF1, ISG15, IRF9, BST2, PSMB8, STAT2, IFI35, IFIT3 and OAS2 from 366 antigen processing and presentation-related genes were further analyzed. The Metascape shows that ten hub genes are enriched in interferon-alpha/beta signaling, which has been reported to influence APM gene expression ([241]56, [242]57). This result suggests that we can enhance antigen presentation efficiency of initially unresponsive patients with ICIs by stimulating interferon signal transduction to improve immunotherapy efficacy, especially in aGC patients with low APscore, which has potential transformation significance. A large body of evidence has been provided about STAT1 inhibiting proliferation and promoting apoptosis of tumor cells, and it is considered a potential tumor inhibitor ([243]58, [244]59). STAT can induce tumor cell apoptosis by interfering with the MARK signaling pathway or down-regulating interleukin (IL-6) expression and STAT3 of interferon-alpha/beta signaling ([245]60, [246]61). Furthermore, new studies show that down-regulated STAT1 expression in peripheral blood of GC patients results in immune escape from gastric epithelial cancer caused by decreased anti-tumor immune function ([247]62, [248]63). However, there are few experimental studies on the interaction between immune microenvironment and tumor antigenicity and antigen presentation in GC immunotherapy. This study used multi-color immunofluorescence staining to accurately locate positive spatial associations of IFIT3 and STAT1 expression with PD-L1 and CD8 expression, representative immunotherapy targets in different immunotypes of aGC tissues. Hub genes selected to share a common pathway enrichment of interferon signaling pathway, and the relationship between specific genes and immunotherapy may provide new insight into the ICB therapy for GC. According to nine genes (B2M, CTSB, HLA-A, HLA-C, HLA-DRA, HLA-F, HSPA2, KLRD1 and TAPBP) enriched in antigen processing and presentation, we built a predictive survival model for aGC patients with high accuracy and sensitivity. In addition, a nomogram based on the risk score and several signatures of critical signaling pathways was plotted and showed that the risk score had robust survival prediction ability. Currently, the relationships between nine genes and the survival of various tumors may be easily explored using TCGA or GEO data. However, how they affect the immunotherapy efficiency of GC has rarely been studied. We found that the prediction model of the above nine genes predicted response to ICIs in aGC patients with a value of AUC 0.97. These results suggest that the nine genes are valuable for further study in the immunotherapy mechanism of GC. In addition, several other genes have already been linked to cancer-specific immune responses. For example, mutation of B2M gene resulted in defective expression of MHC-1 molecular antigen, which could not deliver antigen to CD8^+T cells through TCR and disturbed the positive selection of CD8 cells in the thymus to affect the development of CD8 T cells ([249]64, [250]65). Several studies showed that HLA-I, including HLA-A, HLA-B and HLA-C positive tumors may be more susceptible to immune checkpoint inhibitors ([251]66, [252]67). In contrast, HLA-I negative tumors might be associated with acquired drug resistance to PD-1 blockade in cancer patients. It was found that the increased cell apoptosis in gastric cancer leads to the release and degradation of CTSB protein from the lysosome, resulting in gastric cancer cell death ([253]68). The cleavage of Caspase-1 by CTSB activates caspase-1, which promotes the secretion of interleukin-1β (IL-1β), leading to an inflammatory response ([254]69, [255]70). Compared with atezolizumab monotherapy for biliary tract cancer, TAPBP expression was higher in the combined treatment group ([256]71). Compared with attezolzumab monotherapy for biliary tract cancer, TAPBP expression was higher in the combined treatment group (atezolizumab + MEK inhibitor). As an important biomarker, TAPBP from TIGS is an effective intrinsic tumor biomarker and can predict ICIs response in pan-cancers ([257]10). The TAPBP effectively predicted immunotherapy response in aGC patients and stratified patients into groups with significantly different survival in this study. In addition, the positive correlation between TAPBP expression and tumor microenvironment was confirmed by multi-color immunofluorescence staining, providing a clue for further study on the effect of TAPBP on ICB therapy. There are some limitations to this study. First, we did not conduct external validation because it was challenging to gather clinical patient data for ICIs treatment in aGC patients. Nonetheless, 467 patients from TCGA and GEO datasets were used to validate our results by estimating immunotherapy response by TIDE. Second, even though IHC and fluorescence experiments were carried out to validate the model results, large-scale protein sequencing analysis would be a better choice. Third, we analyzed the effects of APscore on the immune microenvironment, SNV, TMB and signaling pathways, but the underlying mechanism remained unclear. Therefore, to better understand the significance of APscore in predicting GC immunotherapy response, further in vivo and in vitro experimental research are required. In summary, our work presents for the first time a novel signature based on genes associated with antigen processing and presentation, that can help identify response to immunotherapy, allow identification of high-risk patients and predict prognosis in aGC patients. Further studies using gene expression profiles may represent a powerful approach to explore the biological mechanism of tumor immunotherapy escape, and may provide new targets for transforming anti-PD-1 resistant tumors into a responsive state. Data availability statement The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. Ethics statement This study was reviewed and approved by the ethics committee of affiliated hospital of Jiangnan University (approval number: LS2020050). Written informed consent was obtained from all participants for their participation in this study. Authors contributions KW and FY conceived the study. MW, BH, XW and QY participated in study selection, data extraction and statistical analysis. MW and ZL performed Hematoxylin-eosin staining (HE) and organized sample data. BH provided FFPE tumor samples. KW, ZY and JW made the figures. KW and FY wrote the first draft of the manuscript. All authors interpreted the results, and approved the final version for submission. Funding This work was supported by the top Talent Support Program for young and middle-aged people of Wuxi Health Committee (No. HB2020040), Mega-project of Wuxi Commission of Health (No. Z202007). Acknowledgments