Abstract
Background: Immunotherapies targeting immune checkpoint proteins
CTLA-4, PD-1, and PD-L1 have saved lives, but these therapies have only
been effective in some patients. Patients positive for expression of
immune checkpoint proteins in the tumor microenvironment show better
response to immune checkpoint inhibitors. Consequently, knowledge of
which genes are consistently expressed in lymphocytes within the tumor
microenvironment can convey potentially effective and complementary new
immunotherapy targets.
Results: We identified 54 genes that have higher co-expression with the
pan T-cell marker CD3E than CTLA4 or PDCD1. In a dataset of 26 patients
who received anti-PD-1 therapy, we observed that co-expression between
CD3E and PDCD1 was higher among responders than non-responders,
supporting our correlation-based approach.
Conclusions: The genes highlighted in these analyses, which include
CD6, TIGIT, CD96, and SLAMF6, warrant further investigation of their
therapeutic potential.
Methods: We analyzed and ranked genes that were co-expressed with the
pan T-cell marker CD3E in 9,601 human tumors, spanning 31 cancer types.
To further identify targets that may be complementary to existing PD-1
therapy, we examined and ranked genes with high CD3E co-expression and
relatively low PDCD1 co-expression.
Keywords: immunotherapy, tumor microenvironment, T lymphocyte, cancer
genomics, RNA-seq
INTRODUCTION
The long-term effectiveness of broad-spectrum chemotherapies [[34]1]
and molecularly-targeted therapies [[35]2] is mitigated by the
evolutionary dynamics of tumor cells, wherein natural selection favors
increasingly aggressive and drug-resistant clones. Therapeutic
resistance presents one of the universal challenges in cancer treatment
and can be attributed to the extensive genetic [[36]3] and phenotypic
[[37]4] diversity present within tumor cell populations. Recent
achievements in the field of immunotherapy [[38]5–[39]7] to treat
advanced cancers have renewed optimism about harnessing the power of
the adaptive immune system, and its ability to produce a
nearly-unlimited diversity of antigen-recognizing receptors, to achieve
lasting therapeutic results [[40]8–[41]10]. The promise of
immunomodulatory approaches lies in the potential to confront one
dynamic, diversity-rich system (the evolving tumor) with another (the
adaptive immune system) [[42]11–[43]14]. Nevertheless, there are
challenges to using the immune system to fight cancer: tumors appear to
have a diversity of immune-evasion mechanisms that modulate the immune
response [[44]15–[45]18]. As we learn more about the biomarkers that
identify which immunotherapy approaches have promise for each patient
[[46]19–[47]21], these challenges might be successfully addressed by
developing a variety of approaches for immunomodulation in diverse
patients and circumstances.
The goal of immunotherapy is typically to reverse tumor immune-evasion
mechanisms and restore local immune response against cancer cells
[[48]22–[49]25]. Cytotoxic T-lymphocyte associated protein 4 (CTLA-4),
which represses early T-cell activation, was the first immune
checkpoint receptor to be targeted therapeutically [[50]26, [51]27].
Most recent immunotherapy studies focus on antibodies to block the
programmed cell death protein 1 (PD-1) or its ligands (e.g. PD-L1),
which have enabled breakthroughs in the treatment of melanoma,
non-small cell lung cancer, and renal cell carcinoma [[52]28–[53]31].
These treatments can markedly improve patient survival, but only a
minority of patients respond to therapy [[54]32, [55]33]. Analysis of
PD-1 blockade response data reveals increased response in patients with
higher tumor PD-L1 expression, and higher PD-1 expression on the tumor
infiltrating lymphocytes (TILs) [[56]34]. These facts reaffirm the
biological basis for immunotherapy: by reprogramming the suppressed
TILs, it is possible to make tumors newly vulnerable to the immune
system.
The development of an arsenal of approaches to modulate immune response
is dependent on the identification and prioritization of targets that
are likely to modulate immune activity in tumors in complementary ways
to extant therapies [[57]11–[58]13]. To predict the best
immunomodulatory treatment targets that would reverse the suppression
of tumor-infiltrating T cells, an approach is needed that evaluates
potential immunomodulatory interactions with T cells in the tumor
microenvironment [[59]35–[60]37], analogous to approaches outside the
realm of immunotherapy that identify targets based on functional
associations of disease [[61]38]. We reasoned that immunomodulatory
targets would have to be relatively abundantly expressed in
tumor-infiltrating T cells. Accordingly, we used mRNA expression of the
pan T-cell marker CD3E as a metric for T-cell abundance within a tumor
[[62]12], and examined correlations between expression of CD3E and
putative targets across 9,601 human tumors spanning 31 cancer types. We
deemed genes whose expression is highly correlated with CD3E—and
therefore are likely to be expressed in T cells within the tumor
microenvironment—to be promising targets for therapy. We further
reasoned that the most complementary targets would be those that are
not co-expressed in common with PDCD1 (the gene coding for PD-1), as
they might function in individuals for whom anti-PD-1 therapy is
insufficient or inviable. Therefore, we also examined the correlation
of putative targets with PDCD1 expression. We identified targets with
high CD3E correlation and relatively low PDCD1 correlation, suggestive
of the presence of T cells with low PD-1 expression. These
immune-related genes could potentially be new targets for therapy
complementary to PD-1 blockade therapy. Lastly, we extended our
analysis to consider complementarity to therapies directed at both PD-1
and CTLA-4.
RESULTS
T-cell associated genes exhibit consistent expression in tumor
microenvironments across cancer types
We generated a heat map to indicate the strength of correlation in
expression between each gene in a 40-gene candidate panel and the pan
T-cell marker CD3E, across 31 cancer types. Candidate genes were
selected based on expert knowledge for their known functional relevance
in T-cell function and their known effects on CD8 T-cell function. CD3E
was selected as a T-cell marker because of its relatively reliable and
broad expression in T cells. Analyses using genes encoding other
components of CD3 gave similar results ([63]Supplementary Figure 1).
Differentiating between CD4^+ and CD8^+ T-cell markers did result in
different rankings, consistent with evidence that these T-cell types
play divergent roles in the tumor microenvironment [[64]39–[65]42]. We
chose not to use a combination of genes in a signature-based analysis
because we did not want our search for new targets to be biased by a
predefined gene signature. Four of the candidate genes we examined
showed more highly correlated expression with CD3E than was exhibited
by the sometimes highly effective immunotherapy target and checkpoint
inhibitor PDCD1. Another successful immunotherapy target, CTLA4, ranks
eighth on the list of candidate genes in terms of how strongly
correlated its co-expression is with the CD3E T-cell marker ([66]Figure
1A). Genes tend to show highly consistent CD3E co-expression patterns
across the 31 cancer types studied ([67]Figure 1A). Additional
potential gene targets are identified that have more highly correlated
expression with CD3E than PDCD1 (ranked #55) and CTLA4 (ranked #103)
when we expanded the list of studied genes from our 40-candidate list
to all genes known to be expressed in cancer ([68]Figure 1B,
[69]Supplementary Figure 2). Promisingly, these genes tend to have
known immune regulatory functions ([70]Table 1). The gene for the
co-stimulatory cell-adhesion molecule CD2 ranks at the top of both the
candidate and exome-wide lists.
Figure 1. CD3E co-expression across cancer types (TCGA abbreviations;
[71]Supplementary Table 2).
[72]Figure 1
[73]Open in a new tab
Among the (A) 40 candidate genes of the gene panel, several candidate
genes show more correlated co-expression with the pan T-cell marker
CD3E than successfully-targeted immune checkpoints like PDCD1 and CTLA4
(shown in bold). Candidate genes show remarkable consistency in their
co-expression patterns across a variety of cancer types. Correlation
coefficients in (B) a cancer exome-wide analysis reveal many genes that
are highly-correlated with gene expression of pan T-cell marker CD3E in
the tumor microenvironment. Higher correlation in expression than
exhibited by PDCD1 and CTLA4 is observed in 54 and 102 genes,
respectively. The gene CD2 tops both lists. Only two other candidate
genes (TIGIT and CD27, shown in bold) fall in the top-50 ranking in the
exome-wide analysis (#13 and #20; [74]Supplementary Figure 2). The
color of each tile indicates correlation with CD3E expression, (blue,
low to red, high; green: not assayed). The genes are ranked according
to the median of correlation coefficients across cancer types.
Candidate genes from the gene panel are shown in bold.
Table 1. Genes with highest CD3E co-expression in exome-wide analysis.
Rank^a Symbol Pearson’s r Gene description^b Panel?^c
1 CD2 0.955 T-cell surface antigen CD2 ✓
2 CD247 0.944 T-cell receptor T3 zeta chain
3 SIRPG 0.942 signal-regulatory protein gamma; CD172g antigen
4 CD3D 0.933 T-cell surface glycoprotein CD3 delta chain
5 LCK 0.918 lymphocyte-specific protein tyrosine kinase
6 SIT1 0.917 signaling threshold regulating transmembrane adaptor 1
7 CD6 0.912 T-cell differentiation antigen CD6
8 CXCR3 0.910 chemokine (C-X-C motif) receptor 3
9 SH2D1A 0.907 T-cell signal transduction molecule SAP
10 SLA2 0.904 Src-like-adaptor 2; modulator of antigen receptor
signaling
11 SLAMF6 0.902 SLAM family member 6; activating NK receptor
12 ITGAL 0.897 antigen CD11A (p180), lymphocyte function-associated
antigen 1
13 TIGIT 0.896 T cell immunoreceptor with Ig and ITIM domains ✓
14 CD96 0.896 T cell activation, increased late expression
15 TRAF3IP3 0.892 TRAF3-interacting JNK-activating modulator
16 ACAP1 0.882 ArfGAP with coiled-coil, ankyrin repeat and PH domains 1
17 RASAL3 0.882 RAS protein activator like 3
18 ITK 0.879 interleukin-2-inducible T cell kinase
19 TBC1D10C 0.879 carabin; TBC1 domain family, member 10C; RAS
signaling inhibitor
20 CD27 0.877 T-cell activation antigen CD27 ✓
21 CD48 0.872 CD48 antigen (B-cell membrane protein)
22 GZMK 0.870 granzyme K (granzyme 3; tryptase II)
23 SASH3 0.865 SAM and SH3 domain containing 3
24 LY9 0.865 lymphocyte antigen 9
25 SEPT1 0.865 septin 1; serologically defined breast cancer antigen
NY-BR-24
26 PTPRCAP 0.862 protein tyrosine phosphatase, receptor type,
C-associated protein
27 ARHGAP9 0.858 Rho GTPase activating protein 9
28 CD3G 0.857 T-cell receptor T3 gamma chain
29 TESPA1 0.856 thymocyte expressed, positive selection associated 1
30 MAP4K1 0.854 MAPK/ERK kinase kinase kinase 1
31 CORO1A 0.842 coronin, actin binding protein, 1A
32 SLAMF1 0.841 signaling lymphocytic activation molecule family member
1
33 NLRC3 0.839 NLR family, CARD domain containing 3
34 CST7 0.838 cystatin F; leukocystatin
35 IL2RG 0.838 interleukin 2 receptor, gamma
36 WAS 0.837 Wiskott-Aldrich syndrome
37 CD8A 0.834 T-cell surface glycoprotein CD8 alpha chain
38 CD5 0.829 CD5 antigen (p56-62)
39 GIMAP5 0.828 immunity-associated nucleotide 5 protein
40 IKZF1 0.828 IKAROS family zinc finger 1 (Ikaros)
41 ZAP70 0.828 zeta chain of T-cell receptor associated protein kinase
70
42 KLRK1 0.827 killer cell lectin like receptor K1
43 CCL5 0.827 C-C motif chemokine ligand 5
44 GPR171 0.826 G protein-coupled receptor 171
45 PVRIG 0.825 transmembrane protein PVRIG; CD112 receptor
46 PTPN7 0.824 protein tyrosine phosphatase, non-receptor type 7
47 CXCR6 0.823 C-X-C motif chemokine receptor 6
48 EVI2B 0.823 ecotropic viral integration site 2B
49 CCR5 0.819 C-C motif chemokine receptor 5
50 GZMA 0.815 granzyme A; Cytotoxic T-lymphocyte-associated serine
esterase-3
51 ICOS 0.815 inducible T-cell co-stimulator ✓
52 GRAP2 0.813 GRB2-related adaptor protein 2
53 PTPRC 0.808 CD45; protein tyrosine phosphatase, receptor type, C
54 GIMAP4 0.806 GTPase, immunity-associated protein 4
55 PDCD1 0.806 programmed cell death 1 ✓
103 CTLA4 0.727 cytotoxic T-lymphocyte associated protein 4 ✓
[75]Open in a new tab
^aGenes are ranked according to median CD3E co-expression across cancer
types.
^bGene descriptions are taken from the NCBI Gene database.
^cPresent in the 40-gene panel if indicated by a check mark.
Candidate gene targets to complement anti-PD1 therapies
We re-ranked the 40-candidate gene panel and the exome-wide gene lists,
using a “PDCD1-complementarity score” that simultaneously considers the
strength of correlation between the gene target and CD3E (as above),
and the strength of correlation in gene expression between the target
and PDCD1. This re-ranking provided a list of genes that could
potentially serve not only as effective therapeutic targets in general,
but as complementary therapies to anti-PD-1 therapy, in that they might
be especially effective for patients whose tumors do not respond to
anti-PD-1 therapy. Genes with the highest PDCD1-complementarity score
were highly correlated in expression with CD3E and less correlated with
PDCD1 expression ([76]Figure 2A, [77]Supplementary Figure 3). When
taking complementarity to PDCD1 into account, the C-C chemokine
receptor type 7 gene CCR7 tops the list, and CD2 ranks #24. Among genes
in our 40-candidate panel, CD40LG, CD2, and BTLA remain highly
complementary in our joint analysis of complementarity to therapies
directed at PDCD1 and to CTLA4 ([78]Figure 2B–[79]2D, [80]Supplementary
Figure 4).
Figure 2. PDCD1-complementarity and joint CTLA4-PDCD1 complementarity scores
of exome-wide gene list.
[81]Figure 2
[82]Open in a new tab
Scatter plots show correlations for each target gene, with results from
each cancer type superimposed and colored (blue, low to yellow, high)
according to (A) PDCD1-complementarity score, which is high when
expression of CD3E and PDCD1 are respectively correlated and
uncorrelated with the target gene—genes with the highest
PDCD1-complementarity scores are highly correlated with CD3E
(approaching one on the x-axis) and less correlated with PDCD1
(approaching zero on the y-axis); and (B) joint CTLA4-PDCD1
complementarity score (including only scores above zero), which is high
when expression of CD3E is correlated with the target gene and
expression of PDCD1 and CTLA4 are uncorrelated with the target gene.
Genes with the highest joint complementarity scores have close to zero
correlation in expression with both CTLA4 and PDCD1. Bar plots show the
top 50 genes ranked according to (C) their median PDCD1-complementarity
score and (D) their joint CTLA4-PDCD1 complementarity score across
cancer types. Top candidate genes (red bars) and top genes from the
cancer exome (blue bars) are shown. To the right of each bar is its
ranking based on complementarity score (CS) and CD3E co-expression
(CD3E). Ranks up to 50 are in boldface.
GSEA pathway enrichment analysis of the exome-wide gene list ranked by
CD3E co-expression revealed significant enrichment of immune-related
pathways ([83]Table 2; [84]Supplementary Table 1). The top 50 pathways
all have FDR Q value below 1 × 10^−5, ranked by normalized enrichment
scores in the range 2.06–2.38. The top-ranked pathway is TCR signaling
in naïve CD4^+ T cells (curated by the Pathway Interaction Database,
[85]Supplementary Figure 5). Other pathways of interest include the CD8
TCR pathway, numerous immunoregulatory interactions, natural killer
cell mediated cytotoxicity, and costimulation by the CD28 family; PD-1
has recently been demonstrated to exert its primary effect via
regulation of CD28 [[86]43–[87]45].
Table 2. The top ten enriched pathways from GSEA.
Rank Pathway name NES^a
1 TCR pathway (PID) 2.383
2 CD8 TCR pathway (PID) 2.366
3 Immunoregulatory interactions between lymphoid & non-lymphoid cells
(REACTOME) 2.360
4 Hematopoietic cell lineage (KEGG) 2.315
5 Class A1 rhodopsin like receptors (REACTOME) 2.302
6 Interferon gamma signaling (REACTOME) 2.302
7 Natural killer cell mediated cytotoxicity (KEGG) 2.300
8 Cell adhesion molecules cams (KEGG) 2.280
9 Cytokine cytokine receptor interaction (KEGG) 2.259
10 Costimulation by the CD28 family (REACTOME) 2.251
[88]Open in a new tab
^aNormalized Enrichment Score. All pathways listed have a false
discovery rate Q < 1 × 10^−5.
Patients with response to anti-PD1 therapy have higher correlation between
PDCD1 and CD3E expression than non-responders
To examine how the relationship between CD3E expression and PDCD1
expression influences response to anti-PD-1 therapy, we analyzed
RNA-Seq transcriptome data from pretreatment samples of 26 metastatic
melanoma patients who received anti-PD-1 therapy [[89]82]. Patients had
been categorized as ‘responders’ (n = 10 who experienced partial
response, n = 4 who experienced complete response) and ‘non-responders’
(n = 12 who experienced progressive disease). Comparing CD3E and PDCD1
expression among responders and non-responders, we find higher
correlation among responders than non-responders ([90]Figure 3A). Among
the ‘responder’ patients, the Pearson’s correlation coefficient for
transcript per million (TPM) expression values for CD3E and PDCD1 is
0.998, while ‘non-responders’ have a lower correlation coefficient of
0.688. A linear regression of these expression values for responders
yields an R^2 statistic of 0.997, compared to a value of 0.473 for
non-responders, indicating that a linear model has over twice the
explanatory power for responders than non-responders. Squared residuals
from the linear regressions are significantly lower for responders than
non-responders ([91]Figure 3B; P = 0.003, Mann–Whitney U test). One
patient in the cohort, a responder, was an outlier with very high PDCD1
and CD3E expression values ([92]Figure 3A). When this patient is
excluded from the analysis, the pattern holds. Analysis of the
responder cohort excluding the outlier yields a CD3E-PDCD1 correlation
coefficient of 0.931, R^2 of 0.868, and squared residuals from the
linear regression remain significantly lower than non-responders; P =
0.005. These findings support our hypothesis that the immune regulatory
genes we have discovered with highly correlated expression with CD3E
may be promising candidates for new targeted-therapies.
Figure 3. PDCD1 and CD3E expression in responders and non-responders to
anti-PD-1 therapy.
[93]Figure 3
[94]Open in a new tab
Pretreatment samples from metastatic melanoma patients who ultimately
responded to anti-PD-1 therapy show more highly correlated
co-expression between PDCD1 and the pan T-cell marker CD3E, as
indicated by (A) a linear regression of the expression values for
responders with a coefficient of determination R^2 = 0.997, compared to
R^2 = 0.473 for non-responders; and (B) squared residuals from the
linear regression of CD3E and PDCD1 that are significantly lower for
responders than non-responders (P = 0.003).
DISCUSSION
We examined a 40-gene panel of candidate immune regulators and an
unbiased list of 12,082 expressed genes in cancer to find genes
consistently co-expressed with the T-cell marker CD3E in the tumor
microenvironment. We found that expression patterns were remarkably
consistent across the 31 cancer types analyzed, and that the top genes
were highly enriched within immune-related pathways, indicating that
tumor-infiltrating T cells may have universal characteristics that
could be targeted effectively in multiple cancers. We also identified
T-cell associated genes whose expression patterns do not correlate
highly with PCDC1 that might be targeted for development of
complementary therapies to anti-PD-1, or for patients who do not
respond to anti-PD-1 therapy. When we examined anti-PD-1 responders and
non-responders in metastatic melanoma, we found that responders had a
significantly higher correlation between CD3E and PDCD1 than did
non-responders, which supports the hypothesis that correlation in
expression with CD3E in the tumor microenvironment can be a useful
criterion for identifying new therapeutic targets with potential for
therapeutic response. It has been shown that although PD-L1 expression
in tumor biopsies does appear to predict response to anti-PD-1
therapies, many tumors predicted as PD-L1 positive do not respond,
while some responses occur in PD-L1-negative tumors [[95]46–[96]51].
Our results are similar in that we find that some patients with high
CD3E-PDCD1 correlation are not responders, whereas others with lower
correlation do respond to anti-PD1 therapy. Also, when we performed a
similar analysis on response to immune checkpoint therapies among
patients with clear cell renal carcinoma, we did not find significant
differences in CD3E-PDCD1 correlation in responders and non-responders
[[97]52]. More detailed expression analyses of responders and
non-responders in additional cancer types will help to shed light on
additional genetic and tumor microenvironmental factors that influence
response to new and existing and therapies [[98]52, [99]53].
As a purely correlation-based approach, our exome-wide expression
analysis is very coarse-grained, and does not incorporate multi-omic
pharmacogenomic data [[100]54] or the specific molecular biology of the
genes identified. Extensive molecular biological research on their
functional and structural properties is required to assess their
viability as immunomodulatory targets [[101]55, [102]56]. However, the
guidance provided by our approach complements extant molecular
biological investigation, highlighting well-studied genes that deserve
continued attention as well as pointing out genes whose molecular
biology is less well known, but which could be important for future
research. While a subset of these genes will lack exploitable
properties or would produce undesirable outcomes if they were targeted,
it is also likely that novel and complementary targets that do have
high potential are highlighted by our analysis. Furthermore, the
consistency of the results of our analysis across cancer types inspires
confidence that successful targeting of the genes could yield a high
breadth of therapeutic applicability either alone or in combination
with other therapies.
Among the more well-studied genes with potential for targeting, some
have been previously identified by other approaches. CD6, a known
T-cell co-stimulatory molecule, was identified in the exome-wide
analysis as being co-expressed with CD3E. The binding of CD6 to CD166
(also called ALCAM: activated leukocyte-cell adhesion molecule) enables
formation of a functional immune synapse. Indeed, an antibody that
particularly antagonizes the function of CD6—Itolizumab—is under
investigation for use as an anti-inflammatory in psoriasis patients
[[103]57, [104]58]. This antibody presents a possibility of stabilizing
(i.e. agonizing) the CD6:CD166 interaction in the immune synapse so as
to stimulate effector T-cell function. Of course, the systemic
consequences of such an agent might outweigh the local benefits of a
productive anti-tumor immune response.
Other well-studied targets are encoded by TIGIT and CD96 [[105]59,
[106]60], which were similarly highly co-expressed with CD3E, and are
known to inhibit effector T-cell activation. TIGIT and CD96 compete
against the stimulatory receptor CD226 for shared ligands (i.e. CD155)
and thus suppress CD226 activation [[107]61]. TIGIT can suppress
effector T-cell function by directly suppressing CD226 in cis or
suppress APC function by signaling through CD155 in trans [[108]62].
TIGIT and CD96 both suppress natural killer (NK) cell function as well
[[109]63, [110]64], and animal models of CD96 knockout mice revealed
hyperinflammatory status with increased IFN-γ production in NK cells
[[111]64]. These mice are also resistant to a experimental lung
metastasis model, suggesting a potential therapeutic role of CD96
blockade in cancer treatment [[112]65]. TIGIT is known to additionally
result in immunosuppression mediated by T regulatory cells (via
secretion of IL-10 and TGF-β) [[113]66]. Immunoregulatory function of
TIGIT was shown to occur as a consequence of engagement with CD155 on
dendritic cells, which results in increase IL-10 production,
suppressing the effector T cells while promoting regulatory T cells
[[114]62]. Given the known functions of these receptors, the CD3E
co-expression data reported here provides additional rationale for the
development of selective antagonists and context for potential
therapeutic application.
Expression of SLAMF6 (also called Ly108)—a CD2 family member that plays
a critical role in NK-cell development and activation—is also
correlated with CD3E expression. SLAMF6 is known to be expressed on T
cells, and its co-stimulation was shown to drive naïve CD4^+ T cells
toward a Th1 phenotype, inducing IFN-γ production [[115]67]. SLAMF6
educates NK cells by forming homodimers at a synapse between cells.
This homodimerization reduces NK-cell activity toward hematopoietic
(i.e. SLAMF6^+) cells while enhancing activity toward non-hematopoietic
(SLAMF6^−) tumor cells [[116]68]. During development, SLAMF6 also
reduces NK-cell differentiation and proliferation [[117]69, [118]70].
Antibodies targeting SLAMF6 have demonstrated efficacy in mouse
oncology models [[119]71], underscoring the therapeutic potential of
targeting this receptor and highlighting the potential of NK cells to
play a critical role in anti-tumor immunity.
Additional immunotherapy-relevant patterns emerge when the genes that
are highly ranked in our analyses are considered in aggregate. For
instance, the high rank of the NK-cell mediated cytotoxicity pathway in
the GSEA analysis, as well as pathways that include NK-cell associated
proteins (e.g. SLAMF6), could reflect the prevalence of MHC loss or
reduction in T-cell rich tumors. Loss or reduction of MHC is an
emerging mechanism of immune evasion by tumor cells [[120]72–[121]74].
MHC loss or reduction would simultaneously be expected to reduce
presentation of tumor-associated antigens to the T cells and,
conversely, to make the tumor cells more susceptible to NK-cell
targeting. This implication of susceptibility indicates potential value
in stimulating NK-cell activation toward tumor cells that have lost MHC
expression. Our analysis does not partition gene expression
associations among the various T-cell populations (e.g. conventional ab
T cells, gd T cells or NK T cells), but future work could examine
correlations within partitions.
Our analysis informs immunomodulatory target selection for tumors with
infiltrating T cells but low PDCD1 expression. Other
target-identification efforts could be aimed at recruiting T cells to
the tumor or eliminating physical barriers (e.g. extracellular matrix)
that limit the ability of effector T cells to exert cytotoxic effects
on the tumor cell directly. It is possible that both approaches will be
required in concert to ultimately drive the desired response in
patients. For example, the remarkable success of chimeric-antigen
receptor T-cell therapy (CAR-T therapy) in leukemias has been
contrasted against more limited effects in solid tumors and some
lymphomas. Targets identified in this report could also be relevant to
the engineering of T cells to target solid tumors. Limited efficacy of
CAR-T therapy in solid tumors might arise due to local signals that
suppress cytotoxic effects upon arrival of the engineered T cells to
the tumor microenvironment. The targets and pathways identified in this
report might provide guidance regarding engineering approaches (i.e.
Cas9-mediated knockouts or expression of decoy receptors) that could be
applied where traditional antibody and small-molecule inhibitors are
not feasible.
Immunotherapy approaches have produced some remarkable therapeutic
successes, yet there is much uncharted territory left to explore. By
identifying genes expressed in common with T-cell markers within the
tumor microenvironments of 9,601 patients and 31 types of cancer, our
work helps to map the boundaries of this landscape, narrowing the list
of gene candidates for new therapeutic targets. Continued research will
map out the immune composition and dynamic nature of the tumor
microenvironment—providing opportunities to identify novel and
complementary targets, and expanding the efficacy of therapeutics and
the breadth of patients who benefit.
MATERIALS AND METHODS
Detecting T-cell associated gene expression in the tumor microenvironment
RNA-Seq V2 data from 31 tumor types (including 9601 tumor samples
total) were obtained from The Cancer Genome Atlas (TCGA) using the open
platform cBio Cancer Genomics Portal [[122]75, [123]76] in the form of
RSEM z-scores. Though available from this source, we excluded thymoma
from our analysis because tumors of the thymus (where T cells mature)
are known to directly alter the T-cell composition [[124]77]. To find
candidate genes expressed by T cells in the tumor microenvironment, we
generated lists of genes for each cancer type, ranked by the strength
of their correlation with CD3E expression, a pan T-cell marker
[[125]78]. Pearson product-moment correlation coefficients were
calculated using the corrcoef method of Python’s numpy library, applied
to all tumors with expression values for both CD3E and a given gene of
interest. A weak correlation between a candidate gene and the T-cell
marker could indicate that not all T cells in the tumor express the
gene, that there is variability in the amount of expression among T
cells in the tumor, and/or that cells other than T cells in the tumor
are expressing the gene. Each of these possibilities could have
implications for the effectiveness of T-cell-based targeted therapy,
and all support the conclusion that targets with weaker correlations
may be less successful than targets with stronger correlations.
We first performed this analysis on a candidate gene panel of 40 known
immune-associated genes, and then we extended the analysis to an
unbiased ranking of the cancer exome (12082 genes found to be expressed
in cancer) [[126]79]. The candidate gene panel included costimulatory
genes (CD2, CD27, CD28, TNFRSF8, CD40, CD40LG, CD70, ICOS, ICOSLG,
TNFRSF4, TNFSF4, TNFRSF9), putative T-cell inhibitory genes (CTLA4,
PDCD1, CD274, PDCD1LG2, VSIR, CD160, TNFRSF14, CD200, CD200R1, TIGIT,
CD276, VTCN1, BTLA, LAG3, HAVCR2), regulatory markers (FOXP3,
TNFRSF18), and metabolic checkpoints (ADORA1, ADORA2A, ADORA2B, HK2,
GLS, IDO1, TPI1). We also included several myeloid cell-related immune
checkpoints (CD14, CSF1R, KIR2DL1, PVR), based on their involvement in
the complex cell-cell interaction within the tumor microenvironment.
Identifying candidate gene targets to complement anti-PD1 therapies
We developed a scoring metric to rank candidate genes according to
their potential usefulness as alternative or complementary targets to
PDCD1, an immune checkpoint gene that has already been successfully
targeted in several cancer types. This metric, which we term ‘PDCD1
complementarity score’, takes into account each gene’s strength of
correlation with CD3E expression (to indicate consistent expression by
T cells in the tumor microenvironment), and also minimizes the overlap
in expression pattern with PDCD1. The goal of this analysis is to
identify genes that will potentially be useful as targets for combined
immunotherapy and/or for the subset of patients who do not respond to
anti-PD-1 therapy. The promise of a target gene can be quantified with
a function of the co-expressions of two pairs of genes: the target gene
and CD3E; and the target gene and PDCD1, by the score
[MATH:
|ρTC|(|ρTC<
/mi>|−|ρ<
mi>TP|), :MATH]
where co-expression is captured by the absolute value of their Pearson
correlation coefficients, denoted |ρ[TC]| and |ρ[TP]|, respectively.
While negative co-expression with CD3E could in principle give a high
complementarity score, no high-scoring genes (e.g. in the top 600 of
our ranked list) exhibited negative co-expression with CD3E.
Identifying candidate gene targets to complement anti-PD1 and anti-CTLA-4
therapies
To identify genes that could be complementary to CTLA4 as well as
PDCD1, we extended the complementarity score to include co-expression
with CTLA4. The joint complementarity score for a candidate gene was
quantified as
[MATH:
|ρTC| min{|ρ<
/mi>TC|−|
mo>ρTP|,|ρTC|−|ρTA<
/mi>|}, :MATH]
where |ρ[TP]| is the absolute value of the Pearson correlation
coefficient for the target expression and CTLA4 expression. Genes with
a high joint complementarity score have a high co-expression with CD3E
and relatively low co-expression with both PDCD1 and CTLA4.
Gene set enrichment analysis
To identify overrepresented pathways among genes with high CD3E
co-expression, we used the gene- set enrichment software GSEA (v3.0)
from the Broad Institute [[127]80, [128]81], using the ‘GSEAPreranked’
algorithm. Our input rank file consisted of the exome-wide gene set and
each gene’s corresponding median co-expression value across the 31
cancer types. We tested against the ‘Canonical Pathways’ gene set
(v6.1), selecting the weighted algorithm with default parameters.
Results were compared using Normalized Enrichment Score (NES) and false
discovery rate (FDR).
Linking anti-PD1 therapy response to strength of correlation between CD3E and
PD-1 expression in the tumor microenvironment
We accessed RNA-Seq transcriptome data from pretreatment samples of 26
metastatic melanoma patients who received anti-PD1 therapy, and that
had been categorized as ”responders” (n = 10 who experienced partial
response, plus n = 4 who experienced complete response) and
”non-responders” (n = 12 who experienced progressive disease) according
to irRECIST criteria [[129]82]. We looked for differences between
responders and non-responders in terms of the strength of correlation
between CD3E expression and PDCD1 expression. Because patients whose
tumor-infiltrating T cells consistently express PDCD1 are more likely
to respond to anti-PD1 therapy [[130]5, [131]83, [132]84], we
hypothesized a stronger correlation for responders than non-responders.
Following the same logic, we also hypothesized that genes that are
candidates for future tumor-infiltrating T-cell therapies will show
high correlation in expression with CD3E in the tumor microenvironment.
To assess differences in the CD3E-PDCD1 correlation between responders
and non-responders, we calculated Pearson’s correlation coefficients as
described above, and performed linear regressions using the ols
function of Python’s statsmodels library. We quantified linear model
fits by comparing their coefficients of determination (R^2), and
evaluated the statistical significance of the best fits using the
Mann–Whitney U test.
SUPPLEMENTARY MATERIALS
[133]oncotarget-10-4532-s001.pdf^ (3.2MB, pdf)
ACKNOWLEDGMENTS