Abstract

Objective

   The aim of the study was to propose a signature based on genes
   associated with antigen processing and presentation (APscore) to
   predict prognosis and response to immune checkpoint inhibitors (ICIs)
   in advanced gastric cancer (aGC).

Background

   How antigen presentation-related genes affected the immunotherapy
   response and whether they could predict the clinical outcomes of the
   immune checkpoint inhibitor (ICI) in aGC remain largely unknown.

Methods

   In this study, an aGC cohort (Kim cohort, RNAseq, N=45) treated by
   ICIs, and 467 aGC patients from seven cohorts were conducted to
   investigate the value of the APscore predicting the prognosis and
   response to ICIs. Subsequently, the associations of the APscore with
   the tumor microenvironment (TME), molecular characteristics, clinical
   features, and somatic mutation variants in aGC were assessed. The area
   under the receiver operating characteristic curve (AUROC) of the
   APscore was analyzed to estimate response to ICIs. Cox regression or
   Log-rank test was used to estimate the prognosis of aGC patients.

Results

   The APscore constructed by principal component analysis algorithms was
   an effective predictive biomarker of the response to ICIs in the Kim
   cohort and 467 aGC patients (Kim: AUC =0.85, 95% CI: 0.69–1.00; 467
   aGC: AUC =0.69, 95% CI: 0.63–0.74). The APscore also was a prognostic
   biomarker in 467 aGC patients (HR=1.73, 95% CI: 1.21−2.46). Inhibitory
   immunity, decreased TMB and low stromal scores were observed in the
   high APscore group, while activation of immunity, increased TMB, and
   high stromal scores were observed in the low APscore group. Next, we
   evaluated the value of several central genes in predicting the
   prognosis and response to ICIs in aGC patients, and verified them using
   immunogenic, transcriptomic, genomic, and multi-omics methods. Lastly,
   a predictive model built successfully discriminated patients with vs.
   without immunotherapy response and predicted the survival of aGC
   patients.

Conclusions

   The APscore was a new biomarker for identifying high-risk aGC patients
   and patients with responses to ICIs. Exploration of the APscore and hub
   genes in multi-omics GC data may guide treatment decisions.

   Keywords: antigen processing and presentation, immune checkpoint
   inhibitor, immunotherapy, tumor microenvironment, survival

Introduction

   The morbidity and mortality of gastric cancer (GC) rank fifth and
   fourth among all malignant tumors ([39]1). In recent years, immune
   checkpoint inhibitors (ICIs), such as anti-cytotoxic
   T-lymphocyte-associated protein 4 (CTLA-4) inhibitors and
   anti-programmed death 1 (PD-1)/programmed death-ligand 1 (PD-L1)
   inhibitors, in combination with chemotherapy and immunotherapy have
   made significant progress in multiple types of cancers ([40]2). Several
   ICIs have been approved for the clinical treatment of advanced GC (aGC)
   patients ([41]3, [42]4). But the overall response to ICIs is only 20%
   to 40% for GC patients ([43]2, [44]5). So it is urgent to accurately
   identify effective biomarkers to select GC patients with responses to
   ICIs.

   The tumor mutation burden (TMB), neoantigen load and clonality, copy
   number alterations (CNA), microsatellite instability (MSI) status,
   tumor microenvironment (TME), especially T cell inflammation, PD-L1
   expression, and mutations in specific genes are now deemed to
   be predictive markers for immunotherapy in aGC patients. These
   indicators do, however, have limitations that have hindered their
   clinical application ([45]6–[46]9). Tumor immunogenicity mainly
   consists of tumor antigenicity and antigen presentation, playing a
   crucial part in response to ICIs in most types of cancer ([47]10,
   [48]11). But accuracy of identifying aGC patients with response to ICIs
   according to TMB is low ([49]12–[50]14). Tumor antigens from mutated
   proteins of tumor cells enable tumor cells to be recognized and killed
   by CD8^+ T cells through antigen-presentation mechanisms ([51]11,
   [52]15). Therefore, antigen presentation significantly affects the
   effect of ICIs treatment. Nevertheless, we do not find relevant studies
   on how antigen presentation-related genes affect the immunotherapy
   response and whether they can predict the clinical effect of ICIs
   therapy in aGC patients.

   Thus, we proposed a signature based on antigen processing and
   presentation genes to predict prognosis and response to ICI in aGC
   patients. To the best of our knowledge, this study was the first to
   explore the role of antigen processing and presentation in aGC and
   select hub genes to understand progressive tumor mechanisms and offer
   promising and personalized strategies to diagnose and treat GC.

Methods

Data source

   The study used GC-related data from two public platforms, the Cancer
   Genome Atlas (TCGA) Genomic Data Commons Data Portal
   ([53]https://portal.gdc.cancer.gov/) and the Gene Expression Omnibus
   database (GEO) ([54]https://www.ncbi.nlm.nih.gov/geo/). TCGA provided
   mRNA expression, copy number mutation, and clinicopathologic profiles
   including age, gender, Lauren type, tumor grade, tumor stage, and
   survival. GEO provided six GC-related cohorts including [55]GSE84437
   ([56]16), [57]GSE57303 ([58]17), [59]GSE34942 ([60]18), [61]GSE29272
   ([62]19), [63]GSE15459 ([64]20) and ACRG/[65]GSE62254 ([66]21). The aGC
   was defined as the presence of metastatic disease or tumor stages
   higher than IV. Patients without any survival data as well as those
   with less stage IV or primary tumors were excluded. The final merged
   aGC cohort (FMAC) consisted of 467 patients, encompassing 103 from the
   TCGA-STAD dataset and 364 from six GEO datasets. All gene expression or
   transcriptome data were transformed with log2 (x +1) and then were
   batch rectified by SVA Package of R ([67]22).

   Next, this study included two cohorts of cancer patients treated with
   immunotherapy. 45 aGC patients treated with ICIs in Korea ([68]5) (The
   Kim cohort, PRJEB25780,
   [69]https://www.ebi.ac.uk/ena/data/view/PRJEB25780) were included in
   the first cohort. The cohort collected some data about RNAseq data,
   immunotherapy regimens, response to ICIs, TCGA subtype, microsatellite
   instability (MSI), Epstein‐Barr virus (EBV), Mesenchymal subtype,
   single nucleotide variants (SNVs) and immune signature ([70]5). The
   second cohort, the IMvigor210 cohort ([71]23)
   ([72]http://research-pub.gene.com/IMvigor210CoreBiologies) of bladder
   cancer (BC), provided gene expression data, survival, immunotherapy
   regimens and response. Partial response (PR) and complete response (CR)
   to ICIs were considered to be immunotherapy responses (responses to
   ICIs). Moreover, patients without immunotherapy responses included
   those with progressive disease (PD) and stable disease (SD).

Weighted gene co-expression network analysis

   To identify antigen processing and presentation genes related to
   immunotherapy response, WGCNA package of R was used to calculate
   Pearson correlation coefficient among genes, and select an appropriate
   soft threshold β according to the scale-free topological fitting index
   of R^2 >0.9 and average connectivity ([73]24). So, the constructed
   network could be in line with the standard of scale-free network.

   Next, the gene network was constructed by one-step method, and the
   adjacency matrix of expression data was transformed into a topological
   overlap matrix. The hierarchical clustering method was used to plot the
   hierarchical cluster tree to cluster genes into different color
   modules. The feature values of gene modules were calculated, and then
   Pearson correlation coefficient and correlation between the feature
   vector and clinical information were calculated. The gene set modules
   with the highest correlation with immunotherapy response, antigen
   processing and presentation were selected for further analysis. We
   first calculated the eigenvalues of gene modules, and then calculated
   the correlation coefficients between the feature vectors of the modules
   and clinical features (immunotherapy and antigen processing and
   presentation). The gene set modules with significant correlation with
   immunotherapy response and antigen processing and presentation were
   available for further analysis.

Analysis of biological function and pathways of genes

   The Gene Ontology (GO; molecular function, cellular component,
   and biological process) database and informatics resource
   ([74]http://www.geneontology.org) were used to annotate gene function
   enrichment. Kyoto Encyclopedia of Genes and Genomes (KEGG) database
   ([75]http://www.genome.ad.jpl/kegg/) with analysis of cells or
   organisms senior functional behavior was used to annotate various
   pathways enrichment of genes. Additionally, the identification of key
   pathways enriched by hub genes was conducted by the
   GSCALite([76]http://bioinfo.life.hust.edu.cn/web/GSCALite/) and the
   Metascape web server([77]http://metascape.org/) ([78]25, [79]26).

Construction of APscore

   The WGCNA selected genes associated with immunotherapy response and
   antigen processing and presentation for further analysis. Above genes
   often involved hundreds of genes, of which significant
   interrelationships were inevitable. To better analyze the expression
   data of these genes, we synthesized multiple variables with multiple
   correlations into several representative variables, representing the
   majority of the original variables’ information but unrelated to each
   other. Therefore, we performed principal component analysis (PCA) to
   reduce the dimension of the above genes expression data set ([80]27).
   Then we obtained principal component 1 and 2, of which the sum was
   defined as the APscore ([81]28). The specific formula is as follows:
   APscore=
   [MATH: <mrow><msup><mo>∑</mo><mo>​</mo></msup><mo
   stretchy="false">(</mo><mi>P</mi><mi>C</mi><msub><mn>1</mn><mi>i</mi></
   msub><mo>+</mo><mi>P</mi><mi>C</mi><msub><mn>2</mn><mi>i</mi></msub><mo
   stretchy="false">)</mo></mrow> :MATH]
   , where i represents the expression of genes. PCA scores for some
   important tumor-related pathways including antigen processing
   machinery, immune checkpoint, CD8 T effector, DNA damage repair,
   epithelial-mesenchymal transition (EMT) and so on were calculated
   according to the expression of genes enriched in these pathways
   ([82]29–[83]31).

Assessment of immune microenvironment and immunotherapy response

   Twenty-two immune cell invasions according to gene expression were
   assessed by the CIBERSORT algorithm ([84]32). According to gene
   expression, the absolute abundance of eight immune and two stromal cell
   populations were evaluated by the microenvironment cell
   populations-counter (MCP-counter) method ([85]33). The ESTIMATED
   algorithm assessed each patient’s immune and stromal scores according
   to gene expression ([86]34). The TIDE algorithm platform was used to
   evaluate immunotherapy response according to standardized gene
   expression in FMAC ([87]35).

Selection of hub genes association with prognosis and immunotherapy response

   The ten most crucial hub genes associated with immunotherapy response
   and antigen processing and presentation were identified using the MCODE
   and Cytohubba of the Cytoscape software ([88]36, [89]37). Then we
   assessed the efficacy of hub genes predicting prognosis in FMAC and
   immunotherapy response in the Kim cohort. The predictive efficacy was
   also validated in another aGC cohort ([90]GSE26253) treated with
   chemoradiotherapy ([91]38).

Construction of predictive model for prognosis and immunotherapy response

   First, we selected target differently expressed genes (DEGs) that were
   associated with immunotherapy response and enriched in the antigen
   processing and presentation in the Kim cohort. Second, those target
   DEGs were intersected with genes from FMAC, and then shared genes were
   obtained and used to construct the predictive model for prognosis by
   Cox regression in FMAC and for immunotherapy response by logistics
   regression in the Kim cohort. Last, the risk score of the predictive
   model was calculated based on the coefficient of Cox regression or
   logistics regression and gene expression. The specific formula is as
   follows:
   [MATH:
   <mrow><mi>r</mi><mi>i</mi><mi>s</mi><mi>k</mi><mo> </mo><mi>s</mi><mi>c
   </mi><mi>o</mi><mi>r</mi><mi>e</mi><mo>=</mo><msup><mo>∑</mo><mo>​</mo>
   </msup><mrow><mo>(</mo><mrow><mi>e</mi><mi>x</mi><mi>p</mi><mi>r</mi><m
   i>e</mi><mi>s</mi><mi>s</mi><mi>i</mi><mi>o</mi><mi>n</mi><mo> </mo><mi
   >o</mi><mi>f</mi><mo> </mo><mi>g</mi><mi>e</mi><mi>n</mi><msub><mi>e</m
   i><mi>i</mi></msub><mo>×</mo><mi>c</mi><mi>o</mi><mi>e</mi><mi>f</mi><m
   i>f</mi><mi>i</mi><mi>c</mi><mi>i</mi><mi>e</mi><mi>n</mi><msub><mi>t</
   mi><mi>i</mi></msub></mrow><mo>)</mo></mrow></mrow> :MATH]

Construction of nomogram for prognosis

   We used Cox regression to select some independent variables including
   several signaling pathways, the risk score of the predictive model for
   prognosis and immunotherapy response, and the ratio of M1 Macrophages
   to M2 macrophages, which had significant associations with the
   prognosis of GC patients. According to the above independent variables,
   we built a nomogram to predict the survival of aGC patients.
   Additionally, each patient’s risk score and 3, 5 and 8-year survival
   were calculated.

   The Cox regression was conducted by rms or survival package of R; the
   nomogram was plotted by regplot package of R ([92]39). The timeROC
   package of R was used to plot receiver operating characteristics (ROC)
   curves for 3, 5 and 8-year survival, respectively ([93]40). The
   calibration and decision curves were plotted by the rms package and
   rmda of R, respectively.

Specimen collection

   Twenty-five patients with GC who underwent surgical resection in the
   Affiliated Hospital of Jiangnan University were collected. None of the
   patients received any chemotherapy, radiotherapy or biotherapy before
   surgery. This study was approved by the ethics committee of affiliated
   hospital of Jiangnan University and followed the guidelines of the
   Declaration of Helsinki. Informed consent was obtained from all
   participants. All specimens were diagnosed as GC by two independent
   pathological diagnosis doctors. All samples were immersed in formalin
   solution and fixed for paraffin embedding (FFPE). 4 μm sections were
   prepared for immunohistochemical staining.

Hematoxylin-eosin staining

   The FFPE section was dewaxed with xylene. Xylene was then deoxidized
   with gradient alcohol. The tissue was stained for 3min by Hematoxylin.
   The nuclear staining was observed under the microscope. The tissue was
   further stained for 90s by the Eosin solution, and the staining was
   examined under the microscope.

Immuno-histochemistry

   Quantitative analysis methods performed IHC and multi-color
   immunofluorescence. First, FFPE was dewaxed and hydrated, and the EDTA
   method repaired tissue antigen. H[2]O[2] was used to inactivate
   endogenous peroxidase in tissues. Second, the tissue was covered
   entirely with primary antibody solution and placed in a wet box at 4°C
   for 12 hours. Then, the tissue retemperature at room temperature for
   30min. The second antibody (horseradish peroxidase-labeled) was added
   and incubated for 60 min at room temperature for Tetramethylbenzidine
   color rendering. An optical microscope observed the brown-yellow
   particles as a positive color reaction.

   The primary antibody used in this study: CK (ab52625, Abcam, 1: 1000),
   CD8 (14-0081-82, Invitrogen Antibodies, 1:200), PD-L1(ab205921, Abcam
   PD-L1(ab205921, Abcam, 1: 1000), STAT1: (ab29045, Abcam, 1:2000), IFIT3
   (ab76818, Abcam, 1:000), TAPBM (ab13518:Abcam, 1: 400). Immune
   classifications of GC were defined by the CD8^+ T cells infiltrating
   the extent and number of tissues ([94]41, [95]42). The immune-inflamed
   subtype was characterized by CD8^+ T cells spread throughout the tumor
   parenchyma and surrounding stroma. The excluded-immune subtype was
   characterized by CD8^+ T cells infiltration in the peritumor stroma,
   not in the parenchyma. The deserted-immune subtype was characterized by
   the absence of CD8^+ T cells in tumor parenchyma and stroma.

Multi-color immunofluorescence

   After being dewaxed, the (FFPE section was repaired by EDTA antigen
   repair buffer (PH 9.0). The tissues were added with PBS (PH7.4) and
   primary antibody and incubated overnight at 4°C. The tissues were then
   covered with secondary antibodies and incubated at room temperature for
   50min. DAPI dye was dropped and incubated for 10min at room
   temperature, away from light. The slices were briefly shaken dry and
   sealed with anti-fluorescence quenching sealing tablets. The
   fluorescence microscope was used to observe the protein’s location,
   shape and quantity, and images were collected.

Statistical analysis

   The t-test or Wilcoxon signed-rank test was used to compare two groups
   of continuous variables. One-way analysis of variance or Kruskal-Wallis
   H test was used for comparison among multiple (n>2) groups. The
   chi-square test or nonparametric rank-sum test was used to compare
   categorical variables. Pearson or Spearman correlation analysis
   conducted correlation analysis of two continuous variables.
   Kaplan-Meiers curve was used to show the survival of the sample over
   time, and the log-rank test was used to test whether there were
   significant differences in the survival of multiple groups. The Cox
   regression was used to select factors affecting survival. Logistic
   regression was used to screen factors affecting immunotherapy response.
   All statistical analysis was conducted in R software (version 4.0.3).
   All analyses were two-sided and tested at the nominal 0.05 significance
   level.

Results

Selection of key modules associated with prognosis and response to ICIs

   We identified 2759 DEGs in patients with and without response to ICIs
   in the Kim cohort ([96] Figure 1A ; [97]Table S1 ). According to the
   optimal soft-thresholding power of 26, WGCNA of these DEGs was
   conducted ([98] Figures S1A, B ) and showed that these DEGs
   were divided into four different modules ([99] Figures 1B, C ). Of
   which the blue module (r = 0.4, p = 0.007), the brown module (r =0.56,
   p = 6e–05) and the grey module (r = −0.65, p = 1e–06) were
   significantly related with immunotherapy response ([100] Figure 1C ),
   respectively. Furthermore, the number of brown module members (n=85)
   significantly correlated with the gene signature (r=0.3, p=0.005,
   [101]Figure 1D ). Similar result was found in the grey module (n=1737,
   r=0.55, p=6.5e–138; [102]Figure 1E ). However, there were no
   significant correlations of the gene signature with the blue module and
   turquoise module (blue module: r= −0.13, p=0.21; turquoise module: r=
   −0.078, p=0.35; [103]Figures S1C, D ).

Figure 1.

   [104]Figure 1
   [105]Open in a new tab

   Identification of genes associated with immunotherapy response. (A) The
   volcano figure of differentially expressed genes (DEGs) associated with
   immunotherapy response in the Kim cohort. logFC: log-fold changes. (B)
   Weighted gene coexpression network analysis (WGCNA) of 2759 DEGs with a
   soft threshold β = 26. (C) Heat maps showing the gene modules
   associated with immunotherapy response. The relationship between the
   gene signature and brown module genes (D) or grey module genes (E).
   Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment
   analysis for genes in brown module (F) or grey module (G).

   GO analysis showed that brown module genes were enriched in several
   immune functions, such as immune receptor activity, major
   histocompatibility complex (MHC) protein binding, T cell activation and
   cytokine activity ([106] Figure S1E ; [107]Table S2 ). Those grey
   module genes were enriched in the extracellular organization and MHC
   protein complex functions ([108] Figure S1F ; [109]Table S3 ).
   Moreover, KEGG analysis showed that brown module genes were enriched in
   cytokine-cytokine receptor interaction, antigen processing and
   presentation, T cell receptor signalling pathway, NOD-like receptor
   signalling pathway, JAK-STAT signalling pathway, TNF signalling pathway
   and Toll-like receptor signalling pathway, which affected the
   occurrence, progression and response to ICIs in multiple types of
   tumors ([110]29–[111]31) ([112] Figure 1F ; [113]Table S4 ). Those grey
   module genes were enriched in the cGMP-PKG signaling pathway, antigen
   processing and presentation, Gastric acid secretion and Rap1 signaling
   pathway ([114] Figure 1G ; [115]Table S5 ). Notably, five brown module
   genes and 23 grey module genes were enriched in antigen processing and
   presentation ([116] Tables S4, 5 ), implying that antigen presentation
   status may significantly contribute to immunotherapy.

   Next, of 1822 genes (85 brown and 1737 grey genes), 637 genes
   (turquoise module) were significantly related to antigen processing and
   presentation by WGCNA (r=0.54, p=1e–04; [117]Figures S2A, B ).
   Furthermore, 637 genes significantly correlated with the gene signature
   (r=0.56, p=7.3e–54; [118]Figure S2C ).

The APscore predicts immunotherapy in the Kim cohort

   After intersecting 637 genes above with 10936 genes from FMAC
   ([119]GSE84437, [120]GSE57303, [121]GSE34942, [122]GSE29272,
   [123]GSE15459, ACRG/[124]GSE62254 and TCGA-STAD), altogether 366 genes
   were used to construct a scoring system based on the PCA algorithm,
   which we called APscore ([125] Figures 2A, B ). To estimate the
   relationships of the APscore with target genes, immunotherapy response,
   immune signature, mesenchymal subtype, TCGA subtype, number of SNVs,
   EBV subtype and MSI subtype in GC, a combined heat map was plotted
   ([126] Figure 2A ).

Figure 2.

   [127]Figure 2
   [128]Open in a new tab

   The characteristics of the APscore as an indicator of immunotherapy
   response in aGC patients. (A) The relationship of the APscore with
   other subtypes of GC. (B) Venn showing 366 genes associated with the
   APscore. (C) Receiver operating characteristics (ROC) for the APscore
   in predicting immunotherapy response. Comparison of the APscore in
   different groups of responses to immunotherapy (D), TCGA subtypes (E),
   MSI subtypes (F), EBV subtypes (G), Mesenchymal subtypes (H), Number of
   SNVs subtypes (I) and immune signature subtypes (J).

   Based on the cutoff thresholds derived by the Youden index for
   continuous variables—the APscores of all patients, each patient was
   regrouped into the high or low-APscore group ([129]43). It was found
   that most of the 28 genes enriched in antigen processing and
   presentation were expressed higher in the high-APscore group than in
   the low-APscore group. The APscore could distinguish between patients
   with vs. without response to ICIs [area under curve (AUC) =0.85, 95% CI
   (confidence interval): 0.69–1.00; [130]Figure 2C ]). Moreover, higher
   APscore was observed in patients with immunotherapy response
   (p=0.00019, [131]Figure 2D ).

   Next, the APscore was significantly different among four TCGA subtypes,
   two MSI subtypes, two EBV subtypes and three Mesenchymal subtypes
   ([132] Figures 2E–I , all p<0.05). High-APscore patients were likely to
   have positive EBV status, high MSI status and high mutational load,
   which were known as beneficial factors of immunotherapy, implying that
   the APscore may be a valuable indicator of immunotherapy. Similarly,
   higher APscore was observed in the high immune signature subtype (P=
   7.5e–05, [133]Figure 2J ). The immune signature involved 12 genes,
   CCL2, CCL3, CCL4, CCL5, CCL8, CXCL9, CXCL10, CXCL11, CXCL13, CL18,
   CCL19 and CCL21, of which expression levels reflected tumor
   microenvironment in GC ([134]44). So, it was worth further exploring
   microenvironment features in different APscore groups.

Immune microenvironment characteristics of different APscore groups

   To assess the immunotherapy response, standardized genes expression of
   all patients in FMAC were entered into TIDE algorithm platform. The
   APscore of each patient was calculated based on the standardized
   expression of 366 genes, which were associated with antigen processing
   and presentation.

   The APscore showed significant negative correlations with TIDE,
   Dysfunction, CD8, Merck18, IFNG and MSI Expr Sig, and positive
   correlations with CAF, TAM M2, Exclusion and MDSC (all p< 0.05;
   [135]Figures 3A , [136]S3A ). A high TIDE score represented poor
   efficacy of immune checkpoint blocking therapy (ICB) and short
   survival. Moreover, a higher APscore was observed in patients with
   response to ICIs predicted by the TIDE algorithm ([137] Figure 3A ,
   p=1.2e–10), consistent with [138]Figure 2D . Expected, the APscore
   distinguished patients with response to ICIs from patients without
   response to ICIs in FMAC (AUC=0.69, 95% CI: 0.63–0.74; [139]Figure S3C
   ). These results show that the APscore can predict immunotherapy
   response in aGC patients.

Figure 3.

   [140]Figure 3
   [141]Open in a new tab

   The associations of the APscore with immunotherapy response, immune
   microenvironment, and survival in aGC patients. (A) The relationship of
   the APscore with indicators of immunotherapy response predicted by the
   TIDE algorithm. (B) The relationship of the APscore with infiltration
   of the 22 immune cells calculated by the CIBERSORT algorithm. (C) The
   interrelations of the APscore with PCA genes scores of important
   biological pathways of GC. The white squares represent no statistically
   significant. (D) Kaplan-Meier survival plots in both groups of 467 aGC
   patients. (E) Comparison of the tumor purity, estimate, immune, and
   stromal scores between the high-APscore and low APscore-groups. (F)
   Comparison of the 22 immune cells proportions between the high-APscore
   and low APscore-groups. Asterisks denote significant differences (*p<
   0.05; **p< 0.01; ***p< 0.001).

   Next, we used the CIBERSORT algorithm to estimate the infiltration of
   the 22 immune cells. The APscore had significantly positive
   correlations with regulatory T cells (Tregs), memory resting CD4 T
   cells, M0 Macrophages, activated mast cells and activated NK cells, and
   significantly negative correlations with gamma delta T cells,
   follicular helper T cells, plasma cells, resting dendritic cells,
   memory B cells, eosinophils, activated dendritic cells, neutrophils and
   CD8 T cells (all p< 0.05, [142]Figure 3B ). Of note, the correlations
   of the APscore with PCA genes scores of crucial biological pathways
   showed that the APscore had significantly positive correlations with
   cell cycle, TGF beta signaling pathway, Wnt signaling pathway, antigen
   processing machinery, antigen processing and presentation, DNA damage
   repair, angiogenesis, EMT1 and nucleotide excision repair scores, and
   significantly negative correlations with mismatch repair, DNA
   replication, EMT2, MAPK signaling pathway, JAK/STAT signaling pathway,
   NF kappa B signaling pathway, ECM receptor interaction and base
   excision repair scores ([143] Figure 3C , all p< 0.05).

   Patients exceeding a certain APscore threshold, defined by the Youden
   index, were classed as a high-APscore group, the opposite as a
   low-APscore group. By log-rank test, patients in the high-APscore group
   had poorer survival than those in the low-APscore group (p=0.0022,
   HR=1.73, 95% CI: 1.21−2.46, [144]Figure 3D ). In addition, we used
   ESTIMATED algorithms to estimate immune and stromal scores for each
   patient. The estimate, immune, and stromal scores were significantly
   higher in the low APscore-group than that in the high APscore-group,
   while the opposite trend was observed for the tumor purity (all p<0.05,
   [145]Figure 3E ). Similarly, representative cells for anti-tumor
   immunity such as NK cells, memory resting CD4 T cells, Tregs, Monocytes
   and M0 Macrophages were more abundant in the high-APscore group (all
   p<0.05, [146]Figure 3F ). These results further verify that the APscore
   may influence the prognosis and response to ICIs through classifying
   tumor subtypes according to the tumor microenvironment.

SNVs of different APscore groups

   Previous studies suggest that accumulated somatic mutations induced the
   immune system to produce anticancer cells, and tumor mutation load
   affected immunotherapy response and the survival of GC patients
   ([147]10, [148]11, [149]44). To explore the genetic imprints of
   different APscore groups, we used SNV data of 101 aGC patients from
   TCGA-STAD to analyze the relationship between the TMB and the APscore.
   The APscore had a significantly negative correlation with TMB (r=−0.42,
   p=1.2e–05, [150]Figure 3SD ). The high-APscore group had lower TMB than
   the low−APscore group (p=0.001, [151]Figure 3SE ). Moreover, the
   high-TMB group had longer survival than the low-TMB group (log-rank
   test, p = 0.015, [152]Figure 3SF ). However, we could not find
   significantly different survival with a p-value of 0.05 among four
   subgroups formed by the APscore and TMB (p=0.05, [153]Figure 3SG ).
   This result may be largely due to limited samples (n=101).

   Next, the APscore as a binary variable entered into Cox regression
   model, and then was an independent prognostic predictor of aGC patients
   after adjusting other clinicopathologic characteristics, including
   Lauren subtype, age, gender, cancer grade, CD274 expression, TMB and
   the ratio of M1 Macrophages to M2 Macrophages (HR=4.42, 95%
   CI:1.36−14.28, [154]Figure 3SH ). Moreover, the efficacy of the APscore
   predicting survival of aGC patients was explored in 14 independently
   thermal TCGA cohorts including 6673 patients ([155] Table S6 ). The
   APscore was a significant factor in the prognosis of patients with
   breast invasive carcinoma (BRCA), head and neck squamous cell carcinoma
   (HNSC), brain lower grade glioma (LGG) or skin cutaneous melanoma
   (SKCM), which were generally considered thermal tumors with
   infiltration by a variety of T cells ([156] Figure S3I ). Therefore, it
   needs to explore further the interaction mechanism of the APscore and
   T-cell infiltration to influence the prognosis.

   To assess the driver mutations in different APscore groups, the mutated
   genes from TCGA-STAD were listed using maftools. In the high-APscore
   group, the top five mutated genes were TP53, TTN, MUC16, LRP1B, and
   DNAH5, respectively, whereas the top five were TTN, MUC16, ARID1A,
   PIK3CA and KMT2D, respectively, in the low-APscore group ([157] Figure
   S3J ). Of note, compared with the low-APscore group, the high-APscore
   group had lower frequencies of mutated genes ARID1A, PCLO, KMT2D, OBSCN
   and PIK3CA (all p<0.05, [158]Figure S3J ). A previous study used the
   TCGA-STAD dataset to estimate the correlation of PIK3CA with
   sensitivity or resistance ([159]8). Our results provide new insights
   into the role of the APscore in GC with special emphasis on its effect
   on individual mutations to offer a reference for the effect of tumor
   immunotherapy.

Selection of hub genes associated with prognosis

   To assess novel targets for GC immunotherapy, we selected the ten most
   important hub genes (STAT1, IRF1, ISG15, IRF9, BST2, PSMB8, STAT2,
   IFI35, IFIT3 and OAS2) from above 366 genes being used to generate the
   APscore. Next, we plotted the interaction map of hub genes and pathways
   using the Cytoscape software ([160] Figures 4A, B ). Of note, ten genes
   were all enriched in interferon-alpha (IFNα)/beta (IFNβ) signaling
   using the Metascape ([161] Table S7 ). Some studies suggest that IFNα
   and IFNβ are potential anti-tumor immune cytokines, resulting in
   improved outcomes for patients with malignancies of heterogeneous
   histologies ([162]45, [163]46). In the study, hub genes activated four
   signaling pathways: cell cycle, EMT, ER Hormone and apoptosis, and
   inhibited five signaling pathways: RTK, RAS/MAPK, TSC/mTOR, PI3K/AKT
   and AR Hormone pathway ([164] Figure 4B ). This result further
   highlights the potential value of hub genes to predict immunotherapy.
   It needs to explore the interaction between hub genes and T cells to
   influence immunotherapy and prognosis in GC.

Figure 4.

   [165]Figure 4
   [166]Open in a new tab

   Selection and characteristics of hub genes associated with
   immunotherapy response and prognosis. (A) A network of protein-protein
   interactions for 10 hub genes. (B) The profiles of hub genes affecting
   canonical signaling pathways. The Kaplan-Meier survival curves of STAT1
   (C) and IFIT3 (D) expression affecting prognosis. (E) ROC for the STAT1
   and IFIT3 predicting immunotherapy response in the Kim cohort. (F) The
   correlations of hub genes with abundance of seven immune and two
   stromal cells, calculated by the MCP-counter method. (G)
   Immunohistochemistry detected the expression of CD8, PD-L1, STAT1, and
   IFIT3 in the immune-inflamed subtype, immune-excluded subtype, and
   immune-deserted subtype of GC. The scale corresponds to 50μm. (H)
   Multi-color fluorescence staining detected the spatial distribution of
   STAT1, IFIT3 and CD8 in the immune-inflamed subtype, immune-excluded
   subtype, and immune-deserted subtype of GC. The scale corresponds to
   20μm.

   Next, it was found that ten genes affected the survival of patients in
   FMAC ([167] Figures 4C, D , [168]4SA ). The expression level of STAT1
   and IFIT3 was significantly associated with longer survival (STAT1:
   log-rank test, p = 0.021; IFIT3: log-rank test, p = 0.00064). In the
   Kim cohort, patients responding to ICIs had higher expression levels of
   ten genes than those (all P< 0.05, [169]Figure S4B ). Intriguingly,
   STAT1 and IFIT3 distinguished individuals with response to ICIs from
   individuals without response to ICIs in the Kim cohort (STAT1: AUC
   =0.84, 95% CI: 0.66−1.00; IFIT3: AUC =0.77, 95% CI: 0.59−0.95;
   [170]Figure 4E ). Survival analysis further showed that high expression
   level of STAT1 and IFIT3 predicted progression-free survival (PFS)
   benefit for aGC patients in the [171]GSE26253 cohort treated with
   chemoradiotherapy (all p<0.05; [172]Figures S4C, D ). In addition, the
   expression level of STAT1 and IFIT3 was positively related to seven
   types of immune cells and negatively associated with fibroblasts and
   endothelial cells calculated by the MCPcounter (p< 0.05, [173]Figure 4F
   ; [174]Table S8 ). We observed similar correlations of the expression
   level of STAT1 and IFIT3 with activated dendritic cells, activated NK
   cells, CD8 T cells and memory activated CD4 T cells, which were
   representative cells for anti-tumor immunity estimated by the
   CIBERSORT. (all p< 0.05; [175]Figure 4SE ).

   To further explore the relationship between the expression of STAT1 and
   IFIT3 and the immune grade of GC, we selected 25 FFPE samples to
   quantify the potential relationship between the above two genes and
   tumor immune microenvironment using HE staining, immunohistochemistry
   and multi-colour immunofluorescence. The GC was divided into three
   immune subtypes: immune-inflamed subtype, immune-excluded subtype,
   immune-deserted subtype according to CD8^+ T cell infiltration in the
   tissues ([176] Figure 4G ). At the same time, the relationship between
   the two genes and immune cell localization and PD-L1 expression was
   also shown in [177]Figure 4G . It was found that the highest expression
   of PD-L1, STAT1 and IFIT3 in the immune-inflamed subtype, followed by
   the immune-excluded subtype and immune-deserted subtype. Moreover, we
   further experimented with multi-color fluorescence staining to explore
   the spatial relationship between the two genes and CD8 in different
   immune subtypes of GC ([178] Figure 4H ). Expected, STAT1 and IFIT3
   were highly expressed in tumors with high CD8 infiltration levels but
   lowly expressed in tumors with low CD8 infiltration levels. These
   results suggest that two genes may influence the role of CD8^+ T cells
   in GC immunotherapy.

Association of hub genes variants with prognosis

   Many studies have shown that individual altered genes affect the
   survival and response to ICIs in multiple types of tumors ([179]12,
   [180]47). The distributions of SNVs from the TCGA-STAD cohort showed
   that altered frequencies of STAT1 and IFIT3 came first and third among
   48 gastric cancer patients ([181] Figure 4SF ). Similarly, the altered
   frequencies of STAT1 and IFIT3 came second and fourth among 525 samples
   from the pan-cancer TCGA cohort ([182] Figure 4SF ). TIMER web serve
   allowed us to assess the relationship between six immune cell
   infiltrates and hub gene somatic copy number variations (CNVs)
   according to the TCGA-STAD dataset ([183]48, [184]49). It was found
   that there were significant interactions between five various CNVs of
   each hub gene and the infiltration level of six immune cells (all
   p<0.05; [185]Figure S5A ). Moreover, heterozygous amplification was
   found to dominate in uterine carcinosarcoma (UCS), esophageal carcinoma
   (ESCA), adrenocortical carcinoma (ACC) and testicular germ cell tumors
   (TGCT); and heterozygous deletion was present in ovarian serous
   cystadenocarcinoma (OV) and breast invasive carcinoma (BRCA,
   [186]Figure S5B ). Heterozygous amplification and heterozygous deletion
   were the top two presents in the TCGA-STAD dataset.

   Next, survival analysis of pan-cancer showed that hub genes might be
   risk factors of survival for five TCGA cohorts, including LGG, kidney
   renal papillary cell carcinoma (KIRP), kidney renal clear cell
   carcinoma (KIRC), uveal melanoma (UVM) and thymoma (THYM; HR > 1, p<
   0.05; [187]Figure S6A ), while the opposite tendency was found for the
   SKCM cohort (HR<1, P<0.05). Hub genes mainly inhibited apoptosis, EMT,
   and ER hormone signaling pathways and activated RTK, AR Hormone,
   PI3KAKT and TSCmTOR signaling pathways in pan-cancer cohort ([188]
   Figure S6B ). Moreover, Hub genes expression had significantly positive
   correlations with the frequencies of CNV in most diverse cancer types,
   implying that CNV may positively regulate the hub genes expression in
   the process of invasion, metastasis and recurrence of a variety of
   tumors ([189] Figure S6C ).

A predictive model for prognosis and immunotherapy response

   By intersecting 28 genes being enriched in antigen processing and
   presentation ([190] Figures 1F, G ), with 10511 genes from 467 patients
   in FMAC, 20 candidate genes were obtained ([191] Figure 5A ). After
   stepwise backward Cox regression, the final predictive model consisting
   of nine genes (B2M, CTSB, HLA-A, HLA-C, HLA-DRA, HLA-F, HSPA2, KLRD1
   and TAPBP) was constructed. The formula of the predictive model
   calculated the risk score of each patient. Risk scores in dead patients
   were significantly higher than that in alive patients (p=2e−05,
   [192]Figure 5B ). Those patients with risk scores exceeding the median
   risk score were classified as high-risk patients. On the contrary,
   other patients were classified as low-risk patients. The survival of
   high-risk patients was poorer than low-risk patients (p< 0.0001,
   [193]Figure 5C ). In particular, the risk score could identify dead
   patients from alive patients (AUCs for one-year, three-year, and
   five-year survival were 0.628, 0.643 and 0.676, respectively;
   [194]Figures 5D, E ).

Figure 5.

   [195]Figure 5
   [196]Open in a new tab

   Construction and validation of the predictive model for prognosis and
   immunotherapy response. (A) Venn map showing hub genes used to
   construct the predictive model. (B) Comparison of the standardized risk
   score between dead and alive patients in FMAC. (C) The Kaplan-Meier
   survival curves between the high-risk and low-risk patients in FMAC.
   (D) Standardized risk score distribution plot in FMAC. (E) ROC for the
   risk score predicting prognosis in FMAC. (F) Comparison of the
   standardized risk score between tumor progression and tumor
   progression-free patients in the [197]GSE26253 dataset. (G) The
   Kaplan-Meier progression-free survival (PFS) curves between high-risk
   and low-risk patients in [198]GSE26253 dataset. (H) Standardized risk
   score distribution plot in [199]GSE26253 dataset. (I) ROC for the risk
   score predicting prognosis in [200]GSE26253 dataset. (J) Comparison of
   the standardized risk score between patients with immunotherapy
   response and patients without immunotherapy response in the Kim cohort.
   (K) Standardized risk score distribution plot in the Kim cohort. (L)
   ROC for the risk score predicting immunotherapy response in the Kim
   cohort. (M) Comparison of the standardized risk score between patients
   with immunotherapy response and patients without immunotherapy response
   in the IMvigor210 cohort. (N) Standardized risk score distribution plot
   in the IMvigor210 cohort. (O) ROC for the risk score predicting
   immunotherapy response in the IMvigor210 cohort. (P) The Kaplan-Meier
   survival curves between the high-risk and low-risk patients in the
   IMvigor210 cohort.

   Next, we used the [201]GSE26253 dataset to validate the predictive
   model for prognosis and immunotherapy response and found that risk
   scores in tumor progression samples were significantly higher than
   those of tumor progression-free samples (p= 0.0021, [202]Figure 5F ).
   The PFS of high-risk patients was poorer than low-risk patients
   (log-rank test, p=0.00087, [203]Figure 5G ). Additionally, the risk
   score identified tumor progression samples from tumor progression-free
   samples (AUCs for one-year, three-year, and five-year survival were
   0.782, 0.728 and 0.787, respectively; [204]Figures 5H, I ).

   To further explore whether the above nine genes can predict the
   response to ICIs in the Kim cohort, the risk score of each patient was
   calculated using the formula of logistic regression analyses.
   Expectedly, patients with response to ICIs had higher risk scores than
   those without response to ICIs (p=2.6e−08, [205]Figure 5J ). Of note,
   the risk score perfectly discriminated patients with vs. without
   response to ICIs as evidenced from the ROC curve and corresponding high
   AUC value (AUC=0.97, 95% CI: 0.92−1.00; [206]Figures 5K, L ). Moreover,
   we used the IMvigor210 cohort of BC to estimate the ability of the risk
   score to predict response to ICIs. Consistent with the results of the
   Kim cohort, risk scores in BC patients with response to ICIs were
   significantly higher than those without response to ICIs (p=5.2e−05,
   [207]Figure 5M ). The risk score also could discriminate BC patients
   with vs. without response to ICIs (AUC=0.63, 95% CI: 0.57−0.69;
   [208]Figures 5N, O ). Lastly, the survival of high-risk patients was
   poorer than low-risk patients (p=0.0036, [209]Figure 5P ) for the
   IMvigor210 cohort. In brief, the risk score consisted of nine genes
   associated with antigen processing and presentation is an effective
   indicator of prognosis and response to ICIs for aGC patients.

Predictive model genes for prognosis and immunotherapy response

   In the Kim cohort, patients with response to ICIs had lower HSPA2
   expression than that without response to ICIs, whereas the opposite was
   observed for the other eight genes (all p<0.05, [210]Figure 6A ). Of
   nine genes, eight could discriminate patients with vs. without response
   to ICIs except for HLA-A ([211] Figure 6B ).

Figure 6.

   [212]Figure 6
   [213]Open in a new tab

   TAPBP was related to the immunotherapy response and the tumor immune
   microenvironment. (A) Comparison of nine predictive model genes between
   CR/PR group and SD/PD group in the Kim cohort. (B) ROCs for nine genes
   predicting immunotherapy response in the Kim cohort. (C) The
   Kaplan-Meier survival curves between the Kim cohort’s high-TAPBP
   expression group and the low-TAPBP expression group. (D) HE and IHC
   staining results in tumor and normal tissues in GC. The expression of
   TAPBP in between tumor tissues and normal tissues (E) or adjacent
   normal tissues (F) from the TCGA-STAD cohort. (G) The multi-color
   fluorescence staining detected the spatial distribution of TAPBP and
   CD8 in the immune-inflamed subtype, immune-excluded subtype, and
   immune-deserted subtype of GC. The distributions of nine predictive
   model genes somatic alterations from TCGA-STAD cohort (H) and
   pan-cancer TCGA cohort (I). Asterisks denote significant differences
   (*p< 0.05; **p< 0.01; ***p< 0.001).

   We further assessed the correlations between nine predictive model
   genes and 22 immune cells ([214] Figure S7A ). Eight genes (TAPBP,
   KLRD1, HSPA2, HLA-DRA, CTSB, B2M, HLA-C and HLA-A) were significantly
   associated with more than nine types of immune cells in 467 patients
   (all p<0.05). This result implied that nine genes might regulate immune
   cell infiltration in tumor tissue. Notably, TAPBP was significantly
   correlated with most immune cells (n=16) and was positively correlated
   with CD8 T cells, activated memory CD4 T cells and activated NK cells,
   which could kill tumor cells. The MCP-counter was used to estimate the
   absolute abundance of eight immune and two stromal cell populations in
   GC tissues from TCGA-STAD transcriptomic data, showing that TAPBP had
   positive relationships with most immune cells except for neutrophils
   ([215] Figure S7B ). Additionally, the survival of patients with high
   TAPBP expression was better than low TAPBP expression, implying that
   this gene might be a protective factor (p = 0.019, [216]Figure 6C ).
   The hematoxylin-eosin staining (HE) showed that the infiltrating
   gastric carcinoma cells in tumor tissues were higher than in adjacent
   tissues ([217] Figure 6D ). Moreover, immunohistochemistry (IHC) was
   used to measure the expression of TAPBP protein in tumor tissues and
   adjacent normal tissues. The staining intensity score of TAPBP in tumor
   tissues was significantly higher than that in adjacent normal tissues
   ([218] Figure 6D ). According to the TCGA-STAD cohort, it was found
   that the expression of TAPBP mRNA in tumor tissues was higher than that
   in normal tissues (Mann-Whitney U test, p<0.0001; [219]Figure 6E ).
   Pair Wilcoxon signed-rank test also suggested that the expression of
   TAPBP mRNA in tumor tissues was higher than in adjacent normal tissues
   (p<0.0001; [220]Figure 6F ). The multi-color fluorescence staining
   revealed the highest expression of TAPBP and CD8 in the immune-inflamed
   subtype, followed by the immune-excluded subtype and immune-deserted
   subtype ([221] Figure 6G ).

   Moreover, patients with high HLA-C, HLA-DRA, HLA-F and KLRD1 expression
   had better survival, while the opposite trend was observed for high
   B2M, CTSB, HLA-A and HSPA2 expression ([222] Figure 7SC ). We used 1146
   individuals with more stage III from 16 independent TCGA cohorts to
   assess the prognostic value of TAPBP. Although TAPBP was not
   significantly associated with survival in 16 types of tumors, the
   meta-analysis showed that patients with high-expression TAPBP had
   better survival than low-expression TAPBP (p<0.0001, [223]Figure 8SA ).

   The distributions of somatic alterations from the TCGA-STAD cohort
   showed that altered frequencies of B2M and TAPBP came top two among 56
   GC patients ([224] Figure 6H ). Similarly, the altered frequencies of
   B2M and TAPBP came first and fourth among 484 samples from pan-cancer
   TCGA cohort ([225] Figure 6I ). We further analyzed the difference of
   nine predictive model genes between subtypes in the pan-cancer TCGA
   cohort, and found six types of hot cancers including BRCA, lung
   adenocarcinoma (LUAD), glioblastoma multiforme (GBM), KIRC, lung
   squamous cell carcinoma (LUSC) and HNSC were the most significant
   ([226] Figure 8SB ). KIRC has the largest fold difference of nine
   predictive model genes expression between tumor and normal tissue
   ([227] Figure S8C ). Similar changes in nine predictive model genes
   were observed for BRCA and HNSC, whereas the opposite was observed for
   LUSC. The heatmap showed that nine predictive model genes initiated
   apoptosis, EMT, hormones ER and RAS/MAPK signaling pathways, while the
   opposite function was observed in cell cycle, hormones AR and DNA
   damage response signaling pathways ([228] Figure S8D ). The above
   results revealed that nine predictive model genes might interact with
   immune cells in the tumor microenvironment to affect a variety of
   cancers invasion, metastasis and recurrence.

A nomogram for prognosis

   The univariate Cox analysis identified the risk score of nine
   prediction model genes and nine signaling pathways, including ECM
   receptor interaction, DNA damage repair, EMT2, Nucleotide excision
   repair, Base excision repair, Immune checkpoint, CD8 T effector and
   JAK-STAT signaling pathways, significantly associated with the
   prognosis of GC patients (all p<0.05; [229]Figure 7A ). The
   multivariate Cox analysis further showed that the risk score of nine
   prediction model genes, MAPK signaling pathway, Wnt signaling pathway,
   the ratio of M1 Macrophages to M2 macrophages were risk factors of the
   survival in GC, while ECM receptor was a predictive factor (all p<0.05;
   [230]Figure 7B ). Although CD8 T effector did not have a significant
   association with the survival (p =0.055), it was an essential indicator
   of immunotherapy and was enter into the multivariate Cox analysis.

Figure 7.

   [231]Figure 7
   [232]Open in a new tab

   Construction of a nomogram for prognosis in FMAC. (A) The univariate
   and (B) multivariable Cox regression for the risk score and essential
   signaling pathways in FMAC. (C) The nomogram constructed by six
   variables. The red dot represents the risk score corresponding to the
   value of the independent variable. According to the total scores of the
   nomogram, the three-year, five-year and eight-year death probability in
   randomly selected the 400th patient being 0.901, 0.958 and 0.977,
   respectively. (D) ROCs for three-year, five-year and eight-year death
   probability of the nomogram. (E) Calibration curves of the predicted
   five-year survival probability and actual five-year survival
   probability. The gray line indicates ideal fit; the circle indicates
   the predicted five-year survival probability of nomogram; the stars
   indicate bootstrap correction estimates, and the error bars indicate
   the 95% CI of these estimates. (F) Decision curves for the nomogram,
   the risk score of nine prediction model genes and other four factors.

   Next, we built a nomogram model of six significant variables in the
   multivariate Cox regression ([233] Figure 7C ). The AUCs for predicting
   three-year, five-year, and eight-year survival of GC patients were
   0.657, 0.689 and 0.709, respectively ([234] Figure 7D ). When compared
   to previously published prognostic biomarkers of various measurements,
   including mRNA expression, lncRNA expression, microRNA expression,
   protein, and DNA methylation level for predicting the prognosis of GC
   patients ([235]50–[236]55), the AUC of our nomogram model for
   predicting eight-year survival appeared to be higher. The capacity to
   predict outcomes for more patients is likely increased by the addition
   of more clinical characteristics to our model, or it may be facilitated
   by the inclusion of more samples and a longer period of follow-up in
   our study. Calibration curves showed that the predicted 5-year survival
   probability and actual 5-year survival probability were hovering near
   the 45-degree diagonal line, implying that the nomogram model has a
   superior ability to predict GC prognosis ([237] Figure 7E ). Moreover,
   decision curve analysis showed that the curves of the nomogram model
   and risk score of nine prediction model genes were isolated from angles
   of treatment for all and treatment for none, highlighting the clinical
   potential of the nomogram model in the prognosis of aGC ([238]
   Figure 7F ).

Discussion

   The antigen processing and presentation plays a crucial role in
   effector T cells recognition of tumor cells, influencing the
   identification of patients susceptible to immunotherapy ([239]11). This
   study identified a novel biomarker of genes related to antigen
   processing and presentation, predicting response to ICIs for aGC
   patients. Currently, most studies use TCGA or GEO datasets to build
   models for predicting prognosis rather than response to ICIs for aGC
   patients due to the lack of ICIs treatment results. Therefore, we
   explored the response to ICIs for aGC patients in the Kim cohort
   treated by ICIs to make the results have direct clinical translational
   significance.

   In this study, we built a scoring system to evaluate the efficacy of
   immunotherapy for aGC patients. We validated the results—the APscore
   distinguished patients with or without response to ICIs in the Kim
   cohort. Moreover, patients with response to ICIs had higher APscore
   than those without response to ICIs. Several subtypes of GC, such as
   high MSI subtype, positive EBV subtype, high TMB subtype and high
   immune signature subtype, benefited from ICIs treatment ([240]5). These
   subtypes also had high Apscore in the Kim cohort, further revealing
   that APscore might effectively predict immunotherapy efficacy. Thus,
   the critical biomarkers involved in the mechanism of immunotherapy need
   to be identified and assessed.

   Ten hub genes, STAT1, IRF1, ISG15, IRF9, BST2, PSMB8, STAT2, IFI35,
   IFIT3 and OAS2 from 366 antigen processing and presentation-related
   genes were further analyzed. The Metascape shows that ten hub genes are
   enriched in interferon-alpha/beta signaling, which has been reported to
   influence APM gene expression ([241]56, [242]57). This result suggests
   that we can enhance antigen presentation efficiency of initially
   unresponsive patients with ICIs by stimulating interferon signal
   transduction to improve immunotherapy efficacy, especially in aGC
   patients with low APscore, which has potential transformation
   significance. A large body of evidence has been provided about STAT1
   inhibiting proliferation and promoting apoptosis of tumor cells, and it
   is considered a potential tumor inhibitor ([243]58, [244]59). STAT can
   induce tumor cell apoptosis by interfering with the MARK signaling
   pathway or down-regulating interleukin (IL-6) expression and STAT3 of
   interferon-alpha/beta signaling ([245]60, [246]61). Furthermore, new
   studies show that down-regulated STAT1 expression in peripheral blood
   of GC patients results in immune escape from gastric epithelial cancer
   caused by decreased anti-tumor immune function ([247]62, [248]63).
   However, there are few experimental studies on the interaction between
   immune microenvironment and tumor antigenicity and antigen presentation
   in GC immunotherapy. This study used multi-color immunofluorescence
   staining to accurately locate positive spatial associations of IFIT3
   and STAT1 expression with PD-L1 and CD8 expression, representative
   immunotherapy targets in different immunotypes of aGC tissues. Hub
   genes selected to share a common pathway enrichment of interferon
   signaling pathway, and the relationship between specific genes and
   immunotherapy may provide new insight into the ICB therapy for GC.

   According to nine genes (B2M, CTSB, HLA-A, HLA-C, HLA-DRA, HLA-F,
   HSPA2, KLRD1 and TAPBP) enriched in antigen processing and
   presentation, we built a predictive survival model for aGC patients
   with high accuracy and sensitivity. In addition, a nomogram based on
   the risk score and several signatures of critical signaling pathways
   was plotted and showed that the risk score had robust survival
   prediction ability. Currently, the relationships between nine genes and
   the survival of various tumors may be easily explored using TCGA or GEO
   data. However, how they affect the immunotherapy efficiency of GC has
   rarely been studied. We found that the prediction model of the above
   nine genes predicted response to ICIs in aGC patients with a value of
   AUC 0.97. These results suggest that the nine genes are valuable for
   further study in the immunotherapy mechanism of GC. In addition,
   several other genes have already been linked to cancer-specific immune
   responses. For example, mutation of B2M gene resulted in defective
   expression of MHC-1 molecular antigen, which could not deliver antigen
   to CD8^+T cells through TCR and disturbed the positive selection of CD8
   cells in the thymus to affect the development of CD8 T cells ([249]64,
   [250]65). Several studies showed that HLA-I, including HLA-A, HLA-B and
   HLA-C positive tumors may be more susceptible to immune checkpoint
   inhibitors ([251]66, [252]67).

   In contrast, HLA-I negative tumors might be associated with acquired
   drug resistance to PD-1 blockade in cancer patients. It was found that
   the increased cell apoptosis in gastric cancer leads to the release and
   degradation of CTSB protein from the lysosome, resulting in gastric
   cancer cell death ([253]68). The cleavage of Caspase-1 by CTSB
   activates caspase-1, which promotes the secretion of interleukin-1β
   (IL-1β), leading to an inflammatory response ([254]69, [255]70).
   Compared with atezolizumab monotherapy for biliary tract cancer, TAPBP
   expression was higher in the combined treatment group ([256]71).
   Compared with attezolzumab monotherapy for biliary tract cancer, TAPBP
   expression was higher in the combined treatment group (atezolizumab +
   MEK inhibitor). As an important biomarker, TAPBP from TIGS is an
   effective intrinsic tumor biomarker and can predict ICIs response in
   pan-cancers ([257]10). The TAPBP effectively predicted immunotherapy
   response in aGC patients and stratified patients into groups with
   significantly different survival in this study. In addition, the
   positive correlation between TAPBP expression and tumor
   microenvironment was confirmed by multi-color immunofluorescence
   staining, providing a clue for further study on the effect of TAPBP on
   ICB therapy.

   There are some limitations to this study. First, we did not conduct
   external validation because it was challenging to gather clinical
   patient data for ICIs treatment in aGC patients. Nonetheless, 467
   patients from TCGA and GEO datasets were used to validate our results
   by estimating immunotherapy response by TIDE. Second, even though IHC
   and fluorescence experiments were carried out to validate the model
   results, large-scale protein sequencing analysis would be a better
   choice. Third, we analyzed the effects of APscore on the immune
   microenvironment, SNV, TMB and signaling pathways, but the underlying
   mechanism remained unclear. Therefore, to better understand the
   significance of APscore in predicting GC immunotherapy response,
   further in vivo and in vitro experimental research are required.

   In summary, our work presents for the first time a novel signature
   based on genes associated with antigen processing and presentation,
   that can help identify response to immunotherapy, allow identification
   of high-risk patients and predict prognosis in aGC patients. Further
   studies using gene expression profiles may represent a powerful
   approach to explore the biological mechanism of tumor immunotherapy
   escape, and may provide new targets for transforming anti-PD-1
   resistant tumors into a responsive state.

Data availability statement

   The raw data supporting the conclusions of this article will be made
   available by the authors, without undue reservation.

Ethics statement

   This study was reviewed and approved by the ethics committee of
   affiliated hospital of Jiangnan University (approval number:
   LS2020050). Written informed consent was obtained from all participants
   for their participation in this study.

Authors contributions

   KW and FY conceived the study. MW, BH, XW and QY participated in study
   selection, data extraction and statistical analysis. MW and ZL
   performed Hematoxylin-eosin staining (HE) and organized sample data. BH
   provided FFPE tumor samples. KW, ZY and JW made the figures. KW and FY
   wrote the first draft of the manuscript. All authors interpreted the
   results, and approved the final version for submission.

Funding

   This work was supported by the top Talent Support Program for young and
   middle-aged people of Wuxi Health Committee (No. HB2020040),
   Mega-project of Wuxi Commission of Health (No. Z202007).

Acknowledgments