Abstract
Background/Objectives: Multiple myeloma (MM) is a challenging, B cell
malignancy characterised by the uncontrolled proliferation of plasma
cells within the bone marrow. Despite significant advances in treatment
options nowadays, MM remains an incurable malignancy, with the majority
of patients succumbing to the disease. MM develops from a pre-malignant
state known as monoclonal gammopathy of unknown significance (MGUS),
which then has the potential to evolve either into smouldering
(asymptomatic) multiple myeloma (SMM) or into MM. Since novel drug
discovery takes years to reach the clinic, drug repurposing, which
concerns the detection of existing drugs for a novel disease, can be
applied. Methods: To address this critical and still unmet medical
need, we present a comprehensive signature-based drug-repurposing
approach using all the publicly available bulk transcriptomics datasets
on mGUS, sMM, and MM. Results: Our study included an in-house scoring
scheme approach enabling further filtering and prioritisation,
resulting in 25 candidate repurposed drugs for mGUS, 23 for sMM, and 66
for MM. The corresponding gene targets and the related functional terms
have been analysed, providing extra information for stage-specific
underlying mechanisms in myeloma. Lastly, enabled by a specific
computational workflow, we propose drug combinations between our top
candidate repurposed drugs and FDA-approved drugs for MM. Conclusions:
Together, these results deliver a stage-specific, transparent resource
for MM drug repurposing and combination design, intended to accelerate
translation toward earlier disease intervention and improved patient
outcomes.
1. Introduction
Multiple myeloma, the second-most common haematological malignancy
after non-Hodgkin lymphoma, originates from terminally differentiated
plasma cells primarily found in the bone marrow. The disease is
characterised by the secretion of a monoclonal immunoglobulin protein
(M protein) in most patients, driving clinical manifestations such as
hypercalcemia, renal insufficiency, anaemia, and bone disease [[32]1].
Monoclonal gammopathy of undetermined significance (MGUS) represents an
asymptomatic precursor to multiple myeloma, involving clonal plasma
cells and M protein secretion. Approximately 15% of MGUS cases progress
to multiple myeloma over 25 years, and early detection has shown
positive effects on overall survival [[33]2]. Smouldering multiple
myeloma (sMM) serves as an intermediate stage between MGUS and MM, with
recent updates considering ultra-high-risk sMM as part of multiple
myeloma and suggesting that early treatment for high-risk sMM may delay
progression to full-blown MM [[34]3]. More specifically, MGUS is an
asymptomatic clonal plasma cell disorder (M protein < 3 g/dL; marrow
plasma cells < 10%; no common symptoms of MM). Progression to MM
averages ~1%/year but varies with isotype, M protein level, and free
light chain ratio [[35]4]. sMM shows higher tumour burden (≥10% marrow
plasma cells and/or higher M protein) without end organ damage; risk
stratification identifies subsets for early intervention [[36]5].
Symptomatic MM is defined by clonal plasma cells (≥10%) plus
myeloma-defining events, reflecting genomic lesions and
microenvironmental cues driving proliferation, immune evasion, and bone
disease [[37]6]. Established diagnostic and prognostic biomarkers in MM
include detecting M protein, free immunoglobulin light chain,
β2-microglobulin and albumin, creatinine and calcium, and FISH analysis
for cytogenetic abnormalities. These biomarkers are used in the
disease’s staging system. Additionally, imaging is also used to detect
the degree of bone marrow infiltration. Emerging diagnostic biomarkers
include next-generation sequencing and next-generation flow cytometry,
extracellular vesicles, miRNAs, and circulating tumour cells [[38]7].
Current management includes regular monitoring for MGUS and low-risk
sMM, which avoids drug toxicity but carries a small ongoing risk of
progression [[39]4]. In selected high-risk sMM, early treatment (e.g.,
daratumumab- or lenalidomide-based) can delay progression and improve
outcomes, but increases infection risk and cost, and patient-selection
criteria are still evolving [[40]8]. In symptomatic MM, proteasome
inhibitors (bortezomib, carfilzomib, ixazomib) yield high response
rates, but neuropathy with bortezomib and cardiopulmonary events with
carfilzomib remain key limitations, alongside eventual resistance
[[41]9]. Immunomodulatory drugs (IMiDs; lenalidomide, pomalidomide)
provide durable benefit and underpin maintenance therapy yet cause
cytopenia and venous thromboembolism [[42]10]. Anti-CD38 monoclonal
antibodies (daratumumab, isatuximab) achieve rapid cytoreduction and
synergise with proteasome inhibitor and IMiD-based backbones, but are
associated with infusion-related reactions, infections,
hypogammaglobulinemia, and higher treatment cost [[43]11].
BCMA-directed therapies—including CAR-T cells, bispecific antibodies,
and antibody–drug conjugates—can induce deep remissions in
relapsed/refractory MM, yet are limited by cytokine-release syndrome,
immune-effector cell-associated neurotoxicity, infections with
prolonged hypogammaglobulinemia, manufacturing/access constraints, and
relapse driven by antigen escape [[44]11].
Early standard treatment for MM included alkylating agents such as
melphalan and/or cyclophosphamide, combined with corticosteroids in
most cases. This treatment was followed by autologous stem cell
transplantation, a standard treatment option still used today.
Compounds such as thalidomide, lenalidomide, and pomalidomide also
became available soon afterward. During the last two decades, the
therapeutic pool of MM expanded even further, with the discovery of
proteasome inhibitors (bortezomib), histone deacetylase inhibitors
(panobinostat), and more recently, nuclear export inhibitors
(selinexor). In addition, approval of immunotherapies for MM in 2015
starting with daratumumab and elotuzumab monoclonal antibodies and
later on with the antibody–drug conjugate belantamab mafodotin, the
bispecific teclistamab, and chimeric antibody receptor (CAR)-T cell
products, such as idecabtagene and vicleucel, are standard treatment
options nowadays. Despite the outstanding discoveries and improvements
that were made regarding the treatment for MM, almost all patients
become refractory to treatment and relapse. Because of that, there is a
high need to detect or develop drugs for the treatment of relapsed
and/or refractory MM or for halting the progression to MM [[45]12]. One
such approach is drug repurposing. Drug repurposing concerns the
detection of novel indications for drugs that are already approved for
another disease. Specifically, in silico drug repurposing offers the
ability to detect potential drugs using computational methodologies in
a cost and time-efficient way.
Motivated by the lack of a cure for MM or drugs that could halt the
progression to MM, we focused on applying the avenue of computational
drug-repurposing to highlight promising drug candidates and to study
the underlying mechanisms of the drugs and pathways related to the
disease and the stages that precede the disease. Since each MM stage
(MGUS, sMM, and MM) is characterised by different symptoms and
molecular mechanisms, we stratified the patient samples based on the
different stages to pinpoint stage-specific candidate repurposed drugs.
Here, we present a novel approach to use the transcriptomic signatures
of the molecular basis of the different stages of MM to detect
candidate repurposed drugs per severity stage. Initially, we performed
drug-repurposing analysis for different MM stages using publicly
available transcriptomic data. Moreover, we show the pathways that our
proposed repurposed drugs are involved in and their structural
similarities with current clinical trial drugs of MM.
2. Objectives
* To analyse publicly available transcriptomic datasets across the
full spectrum of multiple myeloma (MGUS, sMM, and MM) to identify
stage-specific differentially expressed genes (DEGs) and perform
pathway enrichment analysis.
* To apply a computational drug-repurposing pipeline aimed at
identifying candidate therapeutic compounds for each disease stage
and uncovering shared and distinct candidate drugs across the MM
progression.
* To investigate the molecular targets and associated pathways of the
repurposed drug candidates, offering insight into the biological
complexity and heterogeneity of myeloma at each stage.
* To propose rational drug combination strategies, integrating
FDA-approved treatments for MM with newly identified repurposed
candidates.
3. Materials and Methods
On 2 September 2024, we collected all the available bulk transcriptomic
data of the 3 stages of MM progression: MGUS, SMM, and MM. We performed
DE analysis per dataset and MM stage, using the Limma R package
(version 3.64.0). Enrichment analysis of the DE genes and in silico
drug repurposing using 3 different drug-repurposing tools then
followed. After collecting the candidate repurposed drugs, we used an
already established in-house scoring scheme to further filter the
drugs. A structural comparison of the highlighted proposed drugs with
the drugs that are currently in clinical trials was then followed. The
targets of the proposed candidate repurposed drugs were found and
enrichment analysis of those targets was then performed. Lastly,
suggestions of candidate drug combinations of the MM FDA-approved drugs
with our proposed candidate repurposed drugs were followed. The
detailed pipeline is shown in [46]Figure 1.
Figure 1.
[47]Figure 1
[48]Open in a new tab
General pipeline of the study. Step 1: Transcriptomics data collection
from Gene expression omnibus (GEO). Step 2: Pre-processing of data and
detection of differentially expressed genes in R using the limma R
package. Step 3: Transcriptomics-based drug repurposing—3 different
tools were used: Connectivity Map (CMap) CLUE, L1000CDS^2, and
SigComLINCS. The top 150 over-expressed and 150 under-expressed genes
based on log[2]FC from the gene list with an adjusted p value of <0.05
were used as an input. Gene ontology enrichment analysis of the
differentially expressed genes and drug targets was followed. Kept the
top pathways per dataset to keep a unique list for each stage using an
in-house scoring scheme. Step 4: Collection of clinical trials drugs
and structural similarity was performed for candidate repurposed drugs
and drugs in clinical trials. Step 5: Proposal of drug combinations
through 4 different models, using the FDA-approved drugs for MM and our
proposed candidate repurposed drugs.
3.1. Data
Six different microarray MM datasets were retrieved from the Gene
expression omnibus (GEO) [[49]13]—a transcriptional data repository.
Some of the datasets did not include all disease stages. The selection
of the datasets was based on the disease staging: MGUS, sMM, and MM
([50]Table 1). To our knowledge, these were the only publicly available
datasets on the 2 September 2024 that clearly used this progression
staging for the disease.
Table 1.
The experimental design of the transcriptomic datasets used in this
study. All stages were compared to controls.
No Ref. GEO Accession Number Stage
1 [[51]14] [52]GSE36474 MM
2 [[53]15] [54]GSE5900 MGUS
sMM
3 [[55]16] [56]GSE6477 MGUS
sMM
MM
4 [[57]17] [58]GSE13591 MGUS
MM
5 [[59]18] [60]GSE47552 MGUS
sMM
MM
6 [[61]19] [62]GSE80608 MGUS
MM
[63]Open in a new tab
3.2. Pre-Processing of Data
Each dataset selected was quantile-normalised and log[2]-transformed
where necessary. Subsequent analysis was performed in R statistical
environment ([64]http://www.R-project.org/, accessed 12 September 2024)
[[65]20]. Each of the six datasets and for each stage was processed
using the Limma R package [[66]21], a linear model that calculates a
moderated t-statistic from gene expression experiments.
3.3. Detection of Differentially Expressed Genes
After the dataset pre-processing, probe-set IDs were matched to gene
symbols according to each platform’s annotation file. We maintained the
most differentially expressed ones in cases of gene symbol
correspondence to multiple probe-sets ([67]Tables S1–S6). Explicitly
stage-labelled cohorts across the MGUS → sMM → MM spectrum are rare and
span different microarray platforms with unequal stage composition.
Thus, we adopted a conservative strategy: within-study normalisation
and limma DE, followed by rank-level aggregation across studies per
stage.
From the Limma analysis result, we kept the top 150 over-expressed and
150 under-expressed genes based on log[2]FC from the gene list with an
adjusted p value of <0.05. All comparisons were made using the disease
state (i.e., MGUS, sMM, or MM) vs. control samples. The selected number
of genes (150 over-expressed and 150 under-expressed) corresponds to
the input number limit of the drug-repurposing tools we used in the
sequel. Many widely used signature-reversal tools perform best with
balanced up/down lists of limited size. Using a fixed symmetric cutoff
ensures comparability across tools and stages and focuses on the
high-confidence perturbation while avoiding noise from long tails where
directionality is least stable across datasets. To keep the study
tractable and avoid tool-specific optimisation (which risks
overfitting), we retained 150/150 as a pre-specified setting.
3.4. Pathway Analysis of DEGs
The Gene ontology (GO) enrichment analysis was conducted using the
differentially expressed genes (DEGs) identified in the transcriptomics
study. The focus was on biological processes (BPs), and the
clusterProfiler R package (accessed on 18 April 2025) [[68]22] was
employed for the analysis. This procedure was applied individually to
each stage and dataset, and a scoring method was used to combine the
results, leading to a single list of pathways for each stage of the
disease ([69]Tables S7–S9).
To rank the pathways for each stage and dataset, leading to a unified
list per stage, we ranked the pathways within each dataset according to
their adjusted p values. Hence, the pathways from each dataset for each
stage were combined as a union of unique pathways and ranked by
calculating the weighted sum of normalised average rankings and the
normalised number of appearances according to Equation (1):
[MATH:
Score
mi> i= w1<
mtext> ∗ R<
mrow>i +
w2 ∗ Ai<
/mi>, i <
mo>= 1, …
, N pathways
mtext> :MATH]
(1)
where R[i] is the average ranking score from each of the three tools,
A[i] is the number of appearances of each pathway for the different
datasets per stage, and w[1] and w[2] are set to 0.7 and 0.3,
respectively. This scoring scheme was adapted from [[70]23].
3.5. Transcriptomics-Based Drug Repurposing
The transcriptomic-based drug repurposing was performed using three
different drug-repurposing tools: Connectivity Map (CMap) CLUE
[[71]24], L1000CDS^2 [[72]25], and SigComLINCS [[73]26]. The 300
differentially expressed genes (based on their log[2]FC value) from the
six different datasets were used as transcriptomic signatures. Next,
each set was used as an input to the aforementioned repurposing tools.
These tools use transcriptional expression data from multiple human
cell lines to probe relationships between diseases and therapeutic
agents. Drugs are sorted according to a score (inhibition score), which
characterises if a drug can reverse (drugs with a strong negative score
value) or mimic (drugs with a strong positive score value) the
expression levels of a disease based on a given set of genes. For each
stage and each dataset, we obtained a list of candidate repurposed
drugs predicted by each of the three tools, ranked based on their
inhibition score. Since the output of L1000CDS^2 is limited to 50
drugs, we applied the same cutoff for all the other repurposed drug
lists. Hence, the top 50 drugs from each of the three tools were
combined as a union of unique drugs and ranked by calculating the
weighted sum of normalised average rankings and the normalised number
of appearances according to Equation (2):
[MATH:
Score
mi> i = w1 ∗ Ri +
w2 ∗ Ai, i<
mtext> = 1,
mtext>… , N drugs
:MATH]
(2)
where R[i] is the average ranking score from each of the three tools,
A[i] is the number of appearances of each drug in the three DR tools,
and w[1] and w[2] are set to 0.7 and 0.3, respectively. The drug lists
obtained from all datasets were combined and re-ranked using Equation
(1) to conclude a single drug list from all the datasets. This scoring
scheme was adapted from [[74]23].
After the scoring was completed, we retained drugs with a score of 0.75
or higher for each disease stage.
3.6. Collection of the Currently Running Clinical Trials of MM and Its Stages
All listed clinical studies related to the three stages were collected
from [75]www.clinicaltrials.gov, accessed on 5 January 2025. The
downloaded file was filtered separately for MGUS, sMM, and MM and
entries that did not match these terms were removed. Additionally,
clinical trials that were either suspended, withdrawn, unknown, or
terminated were removed in order to keep the entries that are active,
completed, or will be recruiting soon. Specifically, only
small-molecule drugs and biologicals were obtained from the studies,
and everything else was removed.
3.7. Structural Similarity
The structures of the candidate repurposed drugs and the clinical trial
drugs we collected were downloaded in the form of the Simplified
Molecular Input Line Entry Systems (SMILES) through the PubChem
Identifier Exchange Service of the PubChem database
([76]https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi,
accessed on 21 March 2025) [[77]27]. We then converted the SMILES
format into a single 2D structure data file (SDF) using the OpenBabel
software [[78]28]. The Rcpi R package (accessed on 27 March 2025)
[[79]29] was then used to perform structural similarity across the
different drug groups collected, using an in-house script. We used an
80% Tanimoto similarity as a threshold. Additionally, we used a merged
SDF of the shortlisted repurposed drugs of the three stages (MGUS, sMM,
and MM) as input in the ChemBioServer 2.0
([80]https://chembioserver.vi-seem.eu/, accessed on 30 March 2025)
[[81]30], a publicly available tool that provides filtering,
clustering, comparison of drug structures, and networking of chemical
compounds to facilitate both drug discovery and repurposing. Drugs were
clustered using the Soergel distance ≤ 0.15 corresponding to a Tanimoto
similarity 87% (Tanimoto similarity = 1/(1 + Soergel distance))
[[82]31].
3.8. Drug Target Pathway Analysis
To further explore the candidate repurposed drugs, we extracted the
corresponding gene targets of each drug through mainly the Drug
repurposing hub database
([83]https://repo-hub.broadinstitute.org/repurposing, accessed on 30
March 2025) [[84]32] per stage. In cases where no target was found,
DrugBank ([85]https://go.drugbank.com/, accessed on 30 March 2025) and
PubChem ([86]https://pubchem.ncbi.nlm.nih.gov/, accessed on 30 March
2025) were also used. ([87]Tables S13–S15). The Gene ontology (GO)
enrichment analysis was conducted using the drug target genes of the
top candidate repurposed drugs, as chosen using the scoring scheme
mentioned above. The focus was on biological processes (BPs), and the
clusterProfiler R package was employed for the analysis. This procedure
was applied individually to each stage, and the top pathways were kept
using an adjusted p value of <0.05.
3.9. Drug Combination Synergies
The DrugComb database [[88]33] was used to extract and analyse
experimental drug combination data across multiple cancer cell lines.
DrugComb ([89]https://drugcomb.fimm.fi, accessed on 30 March 2025) is
an open access data portal containing drug combination studies, which
are standardised and harmonised. In total, 437,932 drug combinations
were tested on a variety of cancer cell lines. The data were downloaded
and filtered to include only MM-related cell lines and FDA-approved
drugs for MM. The analysis was performed in R. Four synergy models were
used:
* ZIP (Zero Interaction Potency): Measures interactions across
different doses [[90]34].
* Loewe Additivity: Compares observed combination effects to the
expected additive effects.
* HSA (Highest Single Agent): Evaluates whether the combination is
superior to the best-performing single agent.
* Bliss Independence: Assesses interactions based on independent
probabilities of drug effects.
We used four complementary reference models. HSA benchmarks against the
best single agent (conservative). Bliss assumes probabilistic
independence. Loewe assumes dose equivalence (appropriate for similar
mechanism pairs) and is typically most stringent when mechanisms
diverge. ZIP integrates potency and effect shifts across the response
surface. Therefore, model divergence, especially lower Loewe synergy
for mechanistically distinct pairs, is expected and informative.
4. Results
The pipeline adopted in this study and the main steps are illustrated
in [91]Figure 1. The overall process entails the analysis of
stage-specific MM-related transcriptomics datasets to identify
significant genes, with the subsequent identification and shortlisting
of candidate repurposed drugs and the pathways they target.
4.1. Differential Expression Analysis
The first part of this study included the collection and analysis of
publicly available transcriptomics datasets of MGUS, sMM, and MM
patients and controls. Following the pre-processing of these datasets,
we performed differential analysis to identify differentially expressed
genes (DEGs) between patients in each stage vs. controls. We used a
cutoff of adjusted p values < 0.05 and then sorted the differentially
expressed genes based on their log[2] fold-change (log[2]FC) value. All
differential expression analysis comparisons are presented in
[92]Tables S1–S6. The top five differentially expressed genes for each
comparison are listed in [93]Table 2.
Table 2.
The top 5 differentially expressed genes for each comparison.
MGUS sMM MM
[94]GSE36474 MAB21L1
XG
EMX2
HOXB-AS3
FAM3A
[95]GSE5900 KIT
IGLC1
PRR15
C7orf55
LOC100293211 KIT
CYAT1
LOC100293211
IGHV3-73
C7orf55
[96]GSE6477 CLC
PRG2
LOC100293211
RNASE2
PRG3 IGHD
IGLJ3
LOC100293211
CKAP2
IGHA1 IGHD
IGLJ3
LOC100293211
CKAP2
IGHA1
[97]GSE13591 IGLV1-44
IGLC1
LOC100293211
IGK
CKAP2 IGHD
LOC100293211
AbParts
IGLV1-44
IGHM
[98]GSE47552 IGKV2D-40
SNORD115-1
SNORD115-6
GPR15
SNORD115-44 IGKV2D-40
IGKV2D-26
IGKV1D-27
IGHV1OR15-1
IGKV1OR2-3 IGKV2D-40
IGKV2D-26
IGKV1OR2-3
IGKV6-21
IGKV1D-27
[99]GSE80608 SFRP2
H19
SLC14A1
F2R
SCIN FLG
H19
SLC14A1
F2R
SCIN
[100]Open in a new tab
4.2. Pathway Analysis of Differentially Expressed Genes
GO enrichment analysis was performed to detect the statistically
significant pathways involving the DEGs from the different datasets per
stage. In the MGUS stage, several biological processes (BPs) related to
the immune response were significantly detected ([101]Table S7). The
highest scores were associated with processes like the adaptive immune
response, particularly those involving immune receptor recombination-
and leukocyte-mediated immunity. This indicates heightened immune
activity at this early stage, where processes such as regulation of
leukocyte and lymphocyte proliferation, cell chemotaxis, and immune
responses to hydrogen peroxide are prominent. Notably, positive
regulation of cytokine production, reactive oxygen species (ROS)
metabolic process, and cell–cell adhesion suggest an active environment
where immune cells are mobilised and interact to control abnormal cell
growth.
In the sMM stage, immune system regulation intensifies, as seen in the
overrepresentation of GO terms related to mononuclear and lymphocyte
proliferation, positive regulation of cell activation, and T cell
differentiation ([102]Table S8). Nearly all immune-related pathways
show high scores (0.99), indicating robust immune modulation. Key
processes such as immune response-activating cell surface receptor
signalling and antigen receptor-mediated signalling pathways are highly
active, suggesting a pre-cancerous state where immune surveillance
attempts to combat disease progression. Additionally, terms involving
positive regulation of leukocyte and lymphocyte adhesion imply strong
immune cell communication and activation during this transitional
phase.
In the MM stage, there is still emphasis in immune-related processes,
with some unique changes ([103]Table S9). The B cell receptor
signalling pathway and B cell activation become more prominent,
suggesting the involvement of B cells in the disease’s progression.
Additionally, cytokine production, particularly interleukin-6 (IL-6),
and processes related to leukocyte migration and myeloid leukocyte
cytokine production highlight the advanced immune dysregulation. These
factors are likely contributing to the chronic inflammatory
environment.
Across the three stages (MGUS, sMM, and MM), there is a clear
progression in immune response activities. In the MGUS stage, the
immune system is highly active with processes focused on immune cell
proliferation and activation. As the disease progresses to the sMM
stage, the immune system continues to play a central role, with
heightened regulation of lymphocyte activation and immune signalling
pathways. By the MM stage, however, there is a shift towards immune
dysfunction, with an emphasis on B cell activation and cytokine
production, particularly IL-6, a known contributor to MM progression.
This progression reflects how the immune system, initially attempting
to control the abnormal cells, becomes increasingly compromised,
allowing for tumour growth and proliferation in later stages. Full
lists of pathways for each stage are presented in [104]Tables S7–S9.
4.3. Identification of the Shortlisted Candidate Repurposed Drugs for MM and
Its Stages
To perform in silico drug-repurposing analysis, we selected the top 150
over- and 150 under-expressed genes as they are required for most
repurposing tools. Using the DEG sets, we performed a series of in
silico drug-repurposing analyses with existing computational tools (see
Methods); Connectivity Map (CLUE), L1000CDS^2, and SigCom LINCS,
leading to three lists of candidate repurposed drugs, for each stage
and dataset, followed by a scoring process yielding to three lists of
proposed repurposed drugs for each stage (MGUS, sMM, and MM). The top
repurposed drugs selected are shown in [105]Figure 2. Our in silico
drug-repurposing analysis identified several candidate drugs with the
potential for repurposing in MM. These drugs were evaluated based on
their clinical trial status, preclinical evidence, mechanisms of
action, and gene targets ([106]Table S16). While some drugs have been
previously investigated in MM, others have only been tested in other
cancer types. Key mechanisms of action included cyclin-dependent kinase
(CDK) inhibition, histone deacetylase (HDAC) inhibition, and selective
estrogen receptor modulation.
Figure 2.
[107]Figure 2
[108]Open in a new tab
Top-scored candidate repurposed drugs for each disease state (MGUS,
sMM, and MM) according to our scoring scheme [[109]23]. (A). MGUS, (B).
sMM, and (C). MM. Bortezomib, which is detected in the sMM stage, is an
already FDA drug for MM.
For MGUS, the top candidate repurposed drugs proposed regarding the
in-house scoring scheme used were geldanamycin, roscovitine, PP-30,
mitomycin C, and collybolide. For sMM, the top candidate repurposed
drugs proposed regarding the scoring scheme used were olprinone,
lamotrigine, diprotin A, collybolide, and 112726-66-6 (BTCP). For MM,
the top candidate repurposed drugs proposed regarding the scoring
scheme used in this study were radicicol, piperlongumine, entinostat,
vecuronium, and terreic acid ([110]Tables S10–S12).
4.4. Investigation of Structural Similarity Concerning Ongoing Clinical
Trials
The shortlisted candidate repurposed drugs were screened for structural
similarity with the MGUS-, sMM-, and MM-related drugs in clinical
trials, and FDA-approved drugs of MM. Pairwise structural similarity
was calculated using a Tanimoto score threshold of 80%. As shown in
[111]Figure 3, heatmaps for each disease state suggest that the
majority of all examined drugs are not very similar, indicating a lack
of redundancy and a wide range of structural diversity in both our
shortlist and clinical trials. However, some similarities are present,
particularly for MM. For instance, ten candidate repurposed drugs are
also used in ongoing clinical trials, therefore showing a Tanimoto
similarity of 100%. Additionally, the candidate repurposed drug
exemestane, was found to have a similarity score of 82% against
dehydroepiandrosterone, a clinical trial drug. The repurposed drug
fluocinolone acetonide was also found to have a similarity score of 82%
against the clinical trial drug dexamethasone. Lastly, the repurposed
drug ivermectin b1a showed an 81% similarity with the clinical trial
drug bryostatin 1.
Figure 3.
[112]Figure 3
[113]Open in a new tab
Structural similarity heatmaps of shortlisted repurposed drugs along
with the ongoing drugs in clinical trials for (A) MGUS, (B) sMM, and
(C) MM. For MM, the maximum similarity between a repurposed drug and a
clinical trial drug was kept, since too many drugs were available for
this stage. X-axis shows the candidate repurposed drugs. Y-axis shows
the drugs in clinical trials for each disease state.
Additionally, the shortlisted candidate repurposed drugs were screened
for structural similarity among the three stages: MGUS, sMM, and MM.
Pairwise structural similarity was calculated using a Tanimoto score
threshold of 87%. As shown in [114]Figure 4, again in this comparison,
the hierarchical clustering suggests that all drugs are not very
similar, indicating a lack of redundancy and a wide range of structural
diversity in our shortlisted candidate repurposed drugs. Ten of these
drugs were detected in two disease stages as shown by the asterisk.
These drugs include terreic acid, calyculin A, radicicol, sofalcone,
piperlongumine, vorinostat, 849234-64-6
(4-acetamido-N-(2-amino-5-thiophen-2-ylphenyl)benzamide), bortezomib,
elesclomol, and linifanib.
Figure 4.
[115]Figure 4
[116]Open in a new tab
Hierarchical clustering of the shortlisted candidate repurposed drugs
for MGUS, sMM, and MM. The different groups in each box are thresholded
at Soergel distance value 0.15. Candidate repurposed drugs that are
detected in two disease stages are marked with an asterisk (*).
To further explore the candidate repurposed drugs, we extracted the
corresponding gene targets of each drug through mainly the Drug
repurposing hub database, and in cases where no target was found,
DrugBank and PubChem were also used ([117]Tables S13–S15).
We then performed Gene ontology (GO) analysis (biological processes)
per disease stage, using the gene targets of the candidate drugs
through the ClusterProfiler R package. Additionally, hierarchical
clustering of the enriched terms was performed again using the
ClusterProfiler R package. This relies on the pairwise similarities of
the enriched terms calculated by the use of Jaccard’s similarity index
(JC) ([118]Supplementary Figures S1 and S2).
To further elucidate the functional differences and commonalities
between the three disease stages, we generated an enrichment map plot
using the ClusterProfiler R package, organising enriched terms into a
network with edges connecting overlapping gene sets. In this way,
mutually overlapping gene sets tend to cluster together, making it easy
to identify functional modules. These gene targets of the proposed
candidate repurposed drugs from our analysis, provide valuable insights
into the fundamental transcriptional characteristics underlying the
functional properties of MGUS, sMM, and MM.
This analysis highlights BPs involved in “cell signalling pathways”,
such as vascular endothelial growth factor signalling pathway,
brain-derived neurotrophic factor receptor signalling pathway, response
to insulin and others ([119]Figure 5, yellow cluster). Moreover, the
purple cluster is involved in cellular response to insulin stimulus,
cellular response to peptide hormone stimulus, response to macrophage
colony-stimulating factor, and others. Notably, the yellow and purple
clusters overlapped spatially, indicating that these functionally
coherent groups share biologically related terms. Additionally, BPs
involved in membrane and action potentials and potassium ion transport
were also highlighted ([120]Figure 5, red cluster). GO terms associated
with this cluster were highly enriched in all disease stages, and
particularly in MGUS and sMM. Another important cluster generated by
this analysis highlights BPs involved in epigenetic regulation of gene
expression and protein modification ([121]Figure 5, green cluster).
Additionally, GO terms associated with this cluster were highly
enriched in all disease stages. Moreover, a cluster highlighting
Circadian rhythms, protein localisation, mitochondrial function, and
others, has also been generated through this analysis. Most BPs of this
cluster have been highlighted in MM (blue cluster, [122]Figure 5). The
blue-green cluster is focused on hormone-related signalling pathways,
with the GO terms associated with this cluster being enriched in all
disease stages. Lastly, a small cluster associated with response to
oxidative stress has also been highlighted (yellow cluster, [123]Figure
5).
Figure 5.
[124]Figure 5
[125]Open in a new tab
Biological theme comparison of MGUS, sMM, and MM. The enrichment map
shows the top enriched terms in MGUS, sMM, and MM, organised into a
network with edges connecting overlapping gene sets. Coloured-based
clustering was performed using the emapplot function, which reflects
functional similarity among enriched terms.
4.5. Drug Combination Synergies
The drug combination data were extracted through the DrugComb database
(see Methods). In this work, we kept four synergy models: ZIP (Zero
Interaction Potency), Loewe Additivity, HSA, and Bliss Independence.
Drug combinations are considered synergistic if they exceed specific
thresholds: moderate synergy (>+5) and strong synergy (>+10)
([126]Supplementary Figure S3).
4.6. Top Drug Combinations Identified
To find the top synergies regarding our disease of interest, MM, a
systematic analysis of MM’s FDA-approved drugs was carried out to
detect synergies in general and, more specifically, synergies with our
proposed candidate repurposed drugs. When looking for drug combinations
common across all synergy models, 17 drug combinations were detected
([127]Supplementary Figures S4 and S5), from which, however, none
include any of our proposed candidate repurposed drugs. Therefore, we
stick to the drug combinations for each synergy model separately, which
are then combined in a final list.
A total of 181 drug combinations exhibited synergy across multiple,
with some combinations including drugs from our proposed candidate
repurposed drug list. The strongest synergies observed per model are
shown in [128]Figure 6, in a subset of 20 top-ranking combinations.
Notably, an analogue of one of the proposed candidate repurposed drugs,
erlotinib, was also identified, further supporting its potential
application in combination therapy. Top drug combinations of FDA drugs
along with our proposed candidate repurposed drugs include lenalidomide
+ retinoic acid (from the ZIP and Bliss models), erlotinib
hydrochloride + bortezomib (from the ZIP model), and 23541-50-6
(daunorubicin HCL) + thalidomide (from the HAS model). The total
significant drug combinations can be found in [129]Table S19.
Figure 6.
[130]Figure 6
[131]Open in a new tab
Summary of the top 20 drug combinations per model. The figure presents
a comparative analysis of synergy scores across different models,
highlighting the most promising combinations for MM treatment. (A)
Synergy Zip Score model, (B) Synergy Loewe Score model, (C) Synergy HSA
Score model and (D) Synergy Bliss Score Model.
The results suggest that several repurposed drugs could be explored
further for their potential in MM treatment. Additionally, the
identification of highly synergistic drug combinations underscores the
importance of combination therapy approaches. Future studies should
focus on validating these findings in preclinical and clinical settings
to assess their therapeutic potential.
5. Discussion
The search for effective treatments for MM continues to be a
substantial problem, requiring ongoing investigation into new
therapeutic approaches. Drug repurposing, the process of using existing
drugs for a novel disease, offers a promising approach due to its cost-
and time-effective nature. Drug repurposing is essential in the
advancement of innovative anti-cancer therapies. This study explores
the capabilities of transcriptomic signature-based drug repurposing
using all the publicly available bulk transcriptomics datasets on MGUS,
sMM, and MM. We filter and prioritise candidate repurposed drugs to be
shortlisted for further analysis in the future. Our study included a
scoring scheme from a previous work of our group [[132]23], resulting
in 25 candidate repurposed drugs for MMGUS, 23 for sMM, and 66 for MM.
From these, 18 candidate repurposed drugs for MGUS, 16 for sMM, and 52
for MM had available structure information. These highlighted drugs
have generally been shown to be structurally distinct from each other,
meaning that most of the proposed drugs do not belong to the same
structural subgroup. Additionally, Gene ontology terms (biological
processes) were detected using the DEGs of each dataset and disease
state. The same scoring scheme used for the candidate repurposed drugs
was also used here to select a single list of pathways for each disease
state. Lastly, we detected the gene targets of the proposed repurposed
drugs along with the associated Gene ontology terms (biological
processes).
According to MeSH, when analysing the top 15 proposed candidate
repurposed drugs for each disease stage—MGUS, sMM, and MM—three drugs
were characterised as antineoplastic for MGUS, including geldanamycin,
mitomycin, and roscovitine. Additionally, for sMM, two drugs were
detected as antineoplastic: linifanib and vorinostat. Lastly, for MM,
five out of the fifteen top proposed candidate repurposed drugs were
antineoplastic agents, including radicicol, entinostat, salermide,
temozolomide, and daunorubicin. Other drug categories among the top
proposed candidate repurposed drugs include immunosuppressants (e.g.,
cyclosporin A detected for sMM).
To synthesise the pathway-level signals, we summarise key therapeutic
families implicated by our stage-specific targets and drug candidates
([133]Table 3). This table highlights where candidates align with known
mechanisms versus under-explored processes enriched in earlier stages.
Table 3.
Mechanistic families for shortlisted repurposing candidates across the
MGUS, sMM, and MM spectrum.
Pathway Family Representative Targets/Mechanisms Example Candidates
from Our Shortlist Key References