Abstract

   Background/Objectives: Multiple myeloma (MM) is a challenging, B cell
   malignancy characterised by the uncontrolled proliferation of plasma
   cells within the bone marrow. Despite significant advances in treatment
   options nowadays, MM remains an incurable malignancy, with the majority
   of patients succumbing to the disease. MM develops from a pre-malignant
   state known as monoclonal gammopathy of unknown significance (MGUS),
   which then has the potential to evolve either into smouldering
   (asymptomatic) multiple myeloma (SMM) or into MM. Since novel drug
   discovery takes years to reach the clinic, drug repurposing, which
   concerns the detection of existing drugs for a novel disease, can be
   applied. Methods: To address this critical and still unmet medical
   need, we present a comprehensive signature-based drug-repurposing
   approach using all the publicly available bulk transcriptomics datasets
   on mGUS, sMM, and MM. Results: Our study included an in-house scoring
   scheme approach enabling further filtering and prioritisation,
   resulting in 25 candidate repurposed drugs for mGUS, 23 for sMM, and 66
   for MM. The corresponding gene targets and the related functional terms
   have been analysed, providing extra information for stage-specific
   underlying mechanisms in myeloma. Lastly, enabled by a specific
   computational workflow, we propose drug combinations between our top
   candidate repurposed drugs and FDA-approved drugs for MM. Conclusions:
   Together, these results deliver a stage-specific, transparent resource
   for MM drug repurposing and combination design, intended to accelerate
   translation toward earlier disease intervention and improved patient
   outcomes.

1. Introduction

   Multiple myeloma, the second-most common haematological malignancy
   after non-Hodgkin lymphoma, originates from terminally differentiated
   plasma cells primarily found in the bone marrow. The disease is
   characterised by the secretion of a monoclonal immunoglobulin protein
   (M protein) in most patients, driving clinical manifestations such as
   hypercalcemia, renal insufficiency, anaemia, and bone disease [[32]1].
   Monoclonal gammopathy of undetermined significance (MGUS) represents an
   asymptomatic precursor to multiple myeloma, involving clonal plasma
   cells and M protein secretion. Approximately 15% of MGUS cases progress
   to multiple myeloma over 25 years, and early detection has shown
   positive effects on overall survival [[33]2]. Smouldering multiple
   myeloma (sMM) serves as an intermediate stage between MGUS and MM, with
   recent updates considering ultra-high-risk sMM as part of multiple
   myeloma and suggesting that early treatment for high-risk sMM may delay
   progression to full-blown MM [[34]3]. More specifically, MGUS is an
   asymptomatic clonal plasma cell disorder (M protein < 3 g/dL; marrow
   plasma cells < 10%; no common symptoms of MM). Progression to MM
   averages ~1%/year but varies with isotype, M protein level, and free
   light chain ratio [[35]4]. sMM shows higher tumour burden (≥10% marrow
   plasma cells and/or higher M protein) without end organ damage; risk
   stratification identifies subsets for early intervention [[36]5].
   Symptomatic MM is defined by clonal plasma cells (≥10%) plus
   myeloma-defining events, reflecting genomic lesions and
   microenvironmental cues driving proliferation, immune evasion, and bone
   disease [[37]6]. Established diagnostic and prognostic biomarkers in MM
   include detecting M protein, free immunoglobulin light chain,
   β2-microglobulin and albumin, creatinine and calcium, and FISH analysis
   for cytogenetic abnormalities. These biomarkers are used in the
   disease’s staging system. Additionally, imaging is also used to detect
   the degree of bone marrow infiltration. Emerging diagnostic biomarkers
   include next-generation sequencing and next-generation flow cytometry,
   extracellular vesicles, miRNAs, and circulating tumour cells [[38]7].

   Current management includes regular monitoring for MGUS and low-risk
   sMM, which avoids drug toxicity but carries a small ongoing risk of
   progression [[39]4]. In selected high-risk sMM, early treatment (e.g.,
   daratumumab- or lenalidomide-based) can delay progression and improve
   outcomes, but increases infection risk and cost, and patient-selection
   criteria are still evolving [[40]8]. In symptomatic MM, proteasome
   inhibitors (bortezomib, carfilzomib, ixazomib) yield high response
   rates, but neuropathy with bortezomib and cardiopulmonary events with
   carfilzomib remain key limitations, alongside eventual resistance
   [[41]9]. Immunomodulatory drugs (IMiDs; lenalidomide, pomalidomide)
   provide durable benefit and underpin maintenance therapy yet cause
   cytopenia and venous thromboembolism [[42]10]. Anti-CD38 monoclonal
   antibodies (daratumumab, isatuximab) achieve rapid cytoreduction and
   synergise with proteasome inhibitor and IMiD-based backbones, but are
   associated with infusion-related reactions, infections,
   hypogammaglobulinemia, and higher treatment cost [[43]11].
   BCMA-directed therapies—including CAR-T cells, bispecific antibodies,
   and antibody–drug conjugates—can induce deep remissions in
   relapsed/refractory MM, yet are limited by cytokine-release syndrome,
   immune-effector cell-associated neurotoxicity, infections with
   prolonged hypogammaglobulinemia, manufacturing/access constraints, and
   relapse driven by antigen escape [[44]11].

   Early standard treatment for MM included alkylating agents such as
   melphalan and/or cyclophosphamide, combined with corticosteroids in
   most cases. This treatment was followed by autologous stem cell
   transplantation, a standard treatment option still used today.
   Compounds such as thalidomide, lenalidomide, and pomalidomide also
   became available soon afterward. During the last two decades, the
   therapeutic pool of MM expanded even further, with the discovery of
   proteasome inhibitors (bortezomib), histone deacetylase inhibitors
   (panobinostat), and more recently, nuclear export inhibitors
   (selinexor). In addition, approval of immunotherapies for MM in 2015
   starting with daratumumab and elotuzumab monoclonal antibodies and
   later on with the antibody–drug conjugate belantamab mafodotin, the
   bispecific teclistamab, and chimeric antibody receptor (CAR)-T cell
   products, such as idecabtagene and vicleucel, are standard treatment
   options nowadays. Despite the outstanding discoveries and improvements
   that were made regarding the treatment for MM, almost all patients
   become refractory to treatment and relapse. Because of that, there is a
   high need to detect or develop drugs for the treatment of relapsed
   and/or refractory MM or for halting the progression to MM [[45]12]. One
   such approach is drug repurposing. Drug repurposing concerns the
   detection of novel indications for drugs that are already approved for
   another disease. Specifically, in silico drug repurposing offers the
   ability to detect potential drugs using computational methodologies in
   a cost and time-efficient way.

   Motivated by the lack of a cure for MM or drugs that could halt the
   progression to MM, we focused on applying the avenue of computational
   drug-repurposing to highlight promising drug candidates and to study
   the underlying mechanisms of the drugs and pathways related to the
   disease and the stages that precede the disease. Since each MM stage
   (MGUS, sMM, and MM) is characterised by different symptoms and
   molecular mechanisms, we stratified the patient samples based on the
   different stages to pinpoint stage-specific candidate repurposed drugs.

   Here, we present a novel approach to use the transcriptomic signatures
   of the molecular basis of the different stages of MM to detect
   candidate repurposed drugs per severity stage. Initially, we performed
   drug-repurposing analysis for different MM stages using publicly
   available transcriptomic data. Moreover, we show the pathways that our
   proposed repurposed drugs are involved in and their structural
   similarities with current clinical trial drugs of MM.

2. Objectives

     * To analyse publicly available transcriptomic datasets across the
       full spectrum of multiple myeloma (MGUS, sMM, and MM) to identify
       stage-specific differentially expressed genes (DEGs) and perform
       pathway enrichment analysis.
     * To apply a computational drug-repurposing pipeline aimed at
       identifying candidate therapeutic compounds for each disease stage
       and uncovering shared and distinct candidate drugs across the MM
       progression.
     * To investigate the molecular targets and associated pathways of the
       repurposed drug candidates, offering insight into the biological
       complexity and heterogeneity of myeloma at each stage.
     * To propose rational drug combination strategies, integrating
       FDA-approved treatments for MM with newly identified repurposed
       candidates.

3. Materials and Methods

   On 2 September 2024, we collected all the available bulk transcriptomic
   data of the 3 stages of MM progression: MGUS, SMM, and MM. We performed
   DE analysis per dataset and MM stage, using the Limma R package
   (version 3.64.0). Enrichment analysis of the DE genes and in silico
   drug repurposing using 3 different drug-repurposing tools then
   followed. After collecting the candidate repurposed drugs, we used an
   already established in-house scoring scheme to further filter the
   drugs. A structural comparison of the highlighted proposed drugs with
   the drugs that are currently in clinical trials was then followed. The
   targets of the proposed candidate repurposed drugs were found and
   enrichment analysis of those targets was then performed. Lastly,
   suggestions of candidate drug combinations of the MM FDA-approved drugs
   with our proposed candidate repurposed drugs were followed. The
   detailed pipeline is shown in [46]Figure 1.

Figure 1.

   [47]Figure 1
   [48]Open in a new tab

   General pipeline of the study. Step 1: Transcriptomics data collection
   from Gene expression omnibus (GEO). Step 2: Pre-processing of data and
   detection of differentially expressed genes in R using the limma R
   package. Step 3: Transcriptomics-based drug repurposing—3 different
   tools were used: Connectivity Map (CMap) CLUE, L1000CDS^2, and
   SigComLINCS. The top 150 over-expressed and 150 under-expressed genes
   based on log[2]FC from the gene list with an adjusted p value of <0.05
   were used as an input. Gene ontology enrichment analysis of the
   differentially expressed genes and drug targets was followed. Kept the
   top pathways per dataset to keep a unique list for each stage using an
   in-house scoring scheme. Step 4: Collection of clinical trials drugs
   and structural similarity was performed for candidate repurposed drugs
   and drugs in clinical trials. Step 5: Proposal of drug combinations
   through 4 different models, using the FDA-approved drugs for MM and our
   proposed candidate repurposed drugs.

3.1. Data

   Six different microarray MM datasets were retrieved from the Gene
   expression omnibus (GEO) [[49]13]—a transcriptional data repository.
   Some of the datasets did not include all disease stages. The selection
   of the datasets was based on the disease staging: MGUS, sMM, and MM
   ([50]Table 1). To our knowledge, these were the only publicly available
   datasets on the 2 September 2024 that clearly used this progression
   staging for the disease.

Table 1.

   The experimental design of the transcriptomic datasets used in this
   study. All stages were compared to controls.
   No Ref. GEO Accession Number Stage
   1 [[51]14] [52]GSE36474 MM
   2 [[53]15] [54]GSE5900 MGUS
   sMM
   3 [[55]16] [56]GSE6477 MGUS
   sMM
   MM
   4 [[57]17] [58]GSE13591 MGUS
   MM
   5 [[59]18] [60]GSE47552 MGUS
   sMM
   MM
   6 [[61]19] [62]GSE80608 MGUS
   MM
   [63]Open in a new tab

3.2. Pre-Processing of Data

   Each dataset selected was quantile-normalised and log[2]-transformed
   where necessary. Subsequent analysis was performed in R statistical
   environment ([64]http://www.R-project.org/, accessed 12 September 2024)
   [[65]20]. Each of the six datasets and for each stage was processed
   using the Limma R package [[66]21], a linear model that calculates a
   moderated t-statistic from gene expression experiments.

3.3. Detection of Differentially Expressed Genes

   After the dataset pre-processing, probe-set IDs were matched to gene
   symbols according to each platform’s annotation file. We maintained the
   most differentially expressed ones in cases of gene symbol
   correspondence to multiple probe-sets ([67]Tables S1–S6). Explicitly
   stage-labelled cohorts across the MGUS → sMM → MM spectrum are rare and
   span different microarray platforms with unequal stage composition.
   Thus, we adopted a conservative strategy: within-study normalisation
   and limma DE, followed by rank-level aggregation across studies per
   stage.

   From the Limma analysis result, we kept the top 150 over-expressed and
   150 under-expressed genes based on log[2]FC from the gene list with an
   adjusted p value of <0.05. All comparisons were made using the disease
   state (i.e., MGUS, sMM, or MM) vs. control samples. The selected number
   of genes (150 over-expressed and 150 under-expressed) corresponds to
   the input number limit of the drug-repurposing tools we used in the
   sequel. Many widely used signature-reversal tools perform best with
   balanced up/down lists of limited size. Using a fixed symmetric cutoff
   ensures comparability across tools and stages and focuses on the
   high-confidence perturbation while avoiding noise from long tails where
   directionality is least stable across datasets. To keep the study
   tractable and avoid tool-specific optimisation (which risks
   overfitting), we retained 150/150 as a pre-specified setting.

3.4. Pathway Analysis of DEGs

   The Gene ontology (GO) enrichment analysis was conducted using the
   differentially expressed genes (DEGs) identified in the transcriptomics
   study. The focus was on biological processes (BPs), and the
   clusterProfiler R package (accessed on 18 April 2025) [[68]22] was
   employed for the analysis. This procedure was applied individually to
   each stage and dataset, and a scoring method was used to combine the
   results, leading to a single list of pathways for each stage of the
   disease ([69]Tables S7–S9).

   To rank the pathways for each stage and dataset, leading to a unified
   list per stage, we ranked the pathways within each dataset according to
   their adjusted p values. Hence, the pathways from each dataset for each
   stage were combined as a union of unique pathways and ranked by
   calculating the weighted sum of normalised average rankings and the
   normalised number of appearances according to Equation (1):
   [MATH:
   <mrow><mrow><msub><mrow><mi>S</mi><mi>c</mi><mi>o</mi><mi>r</mi><mi>e</
   mi><mtext> </mtext></mrow><mrow><mi>i</mi></mrow></msub><mo>=</mo><mtex
   t> </mtext><msub><mrow><mi>w</mi></mrow><mrow><mn>1</mn></mrow></msub><
   mtext> </mtext><mo>∗</mo><mtext> </mtext><msub><mrow><mi>R</mi></mrow><
   mrow><mi>i</mi></mrow></msub><mtext> </mtext><mo>+</mo><mtext> </mtext>
   <msub><mrow><mi>w</mi></mrow><mrow><mn>2</mn></mrow></msub><mtext> </mt
   ext><mo>∗</mo><mtext> </mtext><msub><mrow><mi>A</mi></mrow><mrow><mi>i<
   /mi></mrow></msub><mo>,</mo><mtext> </mtext><mi>i</mi><mtext> </mtext><
   mo>=</mo><mtext> </mtext><mn>1</mn><mo>,</mo><mtext> </mtext><mo>…</mo>
   <mo>,</mo><mtext> </mtext><mi>N</mi><mtext> </mtext><mi>p</mi><mi>a</mi
   ><mi>t</mi><mi>h</mi><mi>w</mi><mi>a</mi><mi>y</mi><mi>s</mi><mtext> </
   mtext></mrow></mrow> :MATH]
   (1)

   where R[i] is the average ranking score from each of the three tools,
   A[i] is the number of appearances of each pathway for the different
   datasets per stage, and w[1] and w[2] are set to 0.7 and 0.3,
   respectively. This scoring scheme was adapted from [[70]23].

3.5. Transcriptomics-Based Drug Repurposing

   The transcriptomic-based drug repurposing was performed using three
   different drug-repurposing tools: Connectivity Map (CMap) CLUE
   [[71]24], L1000CDS^2 [[72]25], and SigComLINCS [[73]26]. The 300
   differentially expressed genes (based on their log[2]FC value) from the
   six different datasets were used as transcriptomic signatures. Next,
   each set was used as an input to the aforementioned repurposing tools.
   These tools use transcriptional expression data from multiple human
   cell lines to probe relationships between diseases and therapeutic
   agents. Drugs are sorted according to a score (inhibition score), which
   characterises if a drug can reverse (drugs with a strong negative score
   value) or mimic (drugs with a strong positive score value) the
   expression levels of a disease based on a given set of genes. For each
   stage and each dataset, we obtained a list of candidate repurposed
   drugs predicted by each of the three tools, ranked based on their
   inhibition score. Since the output of L1000CDS^2 is limited to 50
   drugs, we applied the same cutoff for all the other repurposed drug
   lists. Hence, the top 50 drugs from each of the three tools were
   combined as a union of unique drugs and ranked by calculating the
   weighted sum of normalised average rankings and the normalised number
   of appearances according to Equation (2):
   [MATH:
   <mrow><mrow><msub><mrow><mi>S</mi><mi>c</mi><mi>o</mi><mi>r</mi><mi>e</
   mi><mtext> </mtext></mrow><mrow><mi>i</mi></mrow></msub><mtext> </mtext
   ><mo>=</mo><mtext> </mtext><msub><mrow><mi>w</mi></mrow><mrow><mn>1</mn
   ></mrow></msub><mtext> </mtext><mo>∗</mo><mtext> </mtext><msub><mrow><m
   i>R</mi></mrow><mrow><mi>i</mi></mrow></msub><mtext> </mtext><mo>+</mo>
   <mtext> </mtext><msub><mrow><mi>w</mi></mrow><mrow><mn>2</mn></mrow></m
   sub><mtext> </mtext><mo>∗</mo><mtext> </mtext><msub><mrow><mi>A</mi></m
   row><mrow><mi>i</mi></mrow></msub><mo>,</mo><mtext> </mtext><mi>i</mi><
   mtext> </mtext><mo>=</mo><mtext> </mtext><mn>1</mn><mo>,</mo><mtext> </
   mtext><mo>…</mo><mtext> </mtext><mo>,</mo><mtext> </mtext><mi>N</mi><mt
   ext> </mtext><mi>d</mi><mi>r</mi><mi>u</mi><mi>g</mi><mi>s</mi><mtext> 
   </mtext></mrow></mrow> :MATH]
   (2)

   where R[i] is the average ranking score from each of the three tools,
   A[i] is the number of appearances of each drug in the three DR tools,
   and w[1] and w[2] are set to 0.7 and 0.3, respectively. The drug lists
   obtained from all datasets were combined and re-ranked using Equation
   (1) to conclude a single drug list from all the datasets. This scoring
   scheme was adapted from [[74]23].

   After the scoring was completed, we retained drugs with a score of 0.75
   or higher for each disease stage.

3.6. Collection of the Currently Running Clinical Trials of MM and Its Stages

   All listed clinical studies related to the three stages were collected
   from [75]www.clinicaltrials.gov, accessed on 5 January 2025. The
   downloaded file was filtered separately for MGUS, sMM, and MM and
   entries that did not match these terms were removed. Additionally,
   clinical trials that were either suspended, withdrawn, unknown, or
   terminated were removed in order to keep the entries that are active,
   completed, or will be recruiting soon. Specifically, only
   small-molecule drugs and biologicals were obtained from the studies,
   and everything else was removed.

3.7. Structural Similarity

   The structures of the candidate repurposed drugs and the clinical trial
   drugs we collected were downloaded in the form of the Simplified
   Molecular Input Line Entry Systems (SMILES) through the PubChem
   Identifier Exchange Service of the PubChem database
   ([76]https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi,
   accessed on 21 March 2025) [[77]27]. We then converted the SMILES
   format into a single 2D structure data file (SDF) using the OpenBabel
   software [[78]28]. The Rcpi R package (accessed on 27 March 2025)
   [[79]29] was then used to perform structural similarity across the
   different drug groups collected, using an in-house script. We used an
   80% Tanimoto similarity as a threshold. Additionally, we used a merged
   SDF of the shortlisted repurposed drugs of the three stages (MGUS, sMM,
   and MM) as input in the ChemBioServer 2.0
   ([80]https://chembioserver.vi-seem.eu/, accessed on 30 March 2025)
   [[81]30], a publicly available tool that provides filtering,
   clustering, comparison of drug structures, and networking of chemical
   compounds to facilitate both drug discovery and repurposing. Drugs were
   clustered using the Soergel distance ≤ 0.15 corresponding to a Tanimoto
   similarity 87% (Tanimoto similarity = 1/(1 + Soergel distance))
   [[82]31].

3.8. Drug Target Pathway Analysis

   To further explore the candidate repurposed drugs, we extracted the
   corresponding gene targets of each drug through mainly the Drug
   repurposing hub database
   ([83]https://repo-hub.broadinstitute.org/repurposing, accessed on 30
   March 2025) [[84]32] per stage. In cases where no target was found,
   DrugBank ([85]https://go.drugbank.com/, accessed on 30 March 2025) and
   PubChem ([86]https://pubchem.ncbi.nlm.nih.gov/, accessed on 30 March
   2025) were also used. ([87]Tables S13–S15). The Gene ontology (GO)
   enrichment analysis was conducted using the drug target genes of the
   top candidate repurposed drugs, as chosen using the scoring scheme
   mentioned above. The focus was on biological processes (BPs), and the
   clusterProfiler R package was employed for the analysis. This procedure
   was applied individually to each stage, and the top pathways were kept
   using an adjusted p value of <0.05.

3.9. Drug Combination Synergies

   The DrugComb database [[88]33] was used to extract and analyse
   experimental drug combination data across multiple cancer cell lines.
   DrugComb ([89]https://drugcomb.fimm.fi, accessed on 30 March 2025) is
   an open access data portal containing drug combination studies, which
   are standardised and harmonised. In total, 437,932 drug combinations
   were tested on a variety of cancer cell lines. The data were downloaded
   and filtered to include only MM-related cell lines and FDA-approved
   drugs for MM. The analysis was performed in R. Four synergy models were
   used:
     * ZIP (Zero Interaction Potency): Measures interactions across
       different doses [[90]34].
     * Loewe Additivity: Compares observed combination effects to the
       expected additive effects.
     * HSA (Highest Single Agent): Evaluates whether the combination is
       superior to the best-performing single agent.
     * Bliss Independence: Assesses interactions based on independent
       probabilities of drug effects.

   We used four complementary reference models. HSA benchmarks against the
   best single agent (conservative). Bliss assumes probabilistic
   independence. Loewe assumes dose equivalence (appropriate for similar
   mechanism pairs) and is typically most stringent when mechanisms
   diverge. ZIP integrates potency and effect shifts across the response
   surface. Therefore, model divergence, especially lower Loewe synergy
   for mechanistically distinct pairs, is expected and informative.

4. Results

   The pipeline adopted in this study and the main steps are illustrated
   in [91]Figure 1. The overall process entails the analysis of
   stage-specific MM-related transcriptomics datasets to identify
   significant genes, with the subsequent identification and shortlisting
   of candidate repurposed drugs and the pathways they target.

4.1. Differential Expression Analysis

   The first part of this study included the collection and analysis of
   publicly available transcriptomics datasets of MGUS, sMM, and MM
   patients and controls. Following the pre-processing of these datasets,
   we performed differential analysis to identify differentially expressed
   genes (DEGs) between patients in each stage vs. controls. We used a
   cutoff of adjusted p values < 0.05 and then sorted the differentially
   expressed genes based on their log[2] fold-change (log[2]FC) value. All
   differential expression analysis comparisons are presented in
   [92]Tables S1–S6. The top five differentially expressed genes for each
   comparison are listed in [93]Table 2.

Table 2.

   The top 5 differentially expressed genes for each comparison.
   MGUS sMM MM
   [94]GSE36474 MAB21L1
   XG
   EMX2
   HOXB-AS3
   FAM3A
   [95]GSE5900 KIT
   IGLC1
   PRR15
   C7orf55
   LOC100293211 KIT
   CYAT1
   LOC100293211
   IGHV3-73
   C7orf55
   [96]GSE6477 CLC
   PRG2
   LOC100293211
   RNASE2
   PRG3 IGHD
   IGLJ3
   LOC100293211
   CKAP2
   IGHA1 IGHD
   IGLJ3
   LOC100293211
   CKAP2
   IGHA1
   [97]GSE13591 IGLV1-44
   IGLC1
   LOC100293211
   IGK
   CKAP2 IGHD
   LOC100293211
   AbParts
   IGLV1-44
   IGHM
   [98]GSE47552 IGKV2D-40
   SNORD115-1
   SNORD115-6
   GPR15
   SNORD115-44 IGKV2D-40
   IGKV2D-26
   IGKV1D-27
   IGHV1OR15-1
   IGKV1OR2-3 IGKV2D-40
   IGKV2D-26
   IGKV1OR2-3
   IGKV6-21
   IGKV1D-27
   [99]GSE80608 SFRP2
   H19
   SLC14A1
   F2R
   SCIN FLG
   H19
   SLC14A1
   F2R
   SCIN
   [100]Open in a new tab

4.2. Pathway Analysis of Differentially Expressed Genes

   GO enrichment analysis was performed to detect the statistically
   significant pathways involving the DEGs from the different datasets per
   stage. In the MGUS stage, several biological processes (BPs) related to
   the immune response were significantly detected ([101]Table S7). The
   highest scores were associated with processes like the adaptive immune
   response, particularly those involving immune receptor recombination-
   and leukocyte-mediated immunity. This indicates heightened immune
   activity at this early stage, where processes such as regulation of
   leukocyte and lymphocyte proliferation, cell chemotaxis, and immune
   responses to hydrogen peroxide are prominent. Notably, positive
   regulation of cytokine production, reactive oxygen species (ROS)
   metabolic process, and cell–cell adhesion suggest an active environment
   where immune cells are mobilised and interact to control abnormal cell
   growth.

   In the sMM stage, immune system regulation intensifies, as seen in the
   overrepresentation of GO terms related to mononuclear and lymphocyte
   proliferation, positive regulation of cell activation, and T cell
   differentiation ([102]Table S8). Nearly all immune-related pathways
   show high scores (0.99), indicating robust immune modulation. Key
   processes such as immune response-activating cell surface receptor
   signalling and antigen receptor-mediated signalling pathways are highly
   active, suggesting a pre-cancerous state where immune surveillance
   attempts to combat disease progression. Additionally, terms involving
   positive regulation of leukocyte and lymphocyte adhesion imply strong
   immune cell communication and activation during this transitional
   phase.

   In the MM stage, there is still emphasis in immune-related processes,
   with some unique changes ([103]Table S9). The B cell receptor
   signalling pathway and B cell activation become more prominent,
   suggesting the involvement of B cells in the disease’s progression.
   Additionally, cytokine production, particularly interleukin-6 (IL-6),
   and processes related to leukocyte migration and myeloid leukocyte
   cytokine production highlight the advanced immune dysregulation. These
   factors are likely contributing to the chronic inflammatory
   environment.

   Across the three stages (MGUS, sMM, and MM), there is a clear
   progression in immune response activities. In the MGUS stage, the
   immune system is highly active with processes focused on immune cell
   proliferation and activation. As the disease progresses to the sMM
   stage, the immune system continues to play a central role, with
   heightened regulation of lymphocyte activation and immune signalling
   pathways. By the MM stage, however, there is a shift towards immune
   dysfunction, with an emphasis on B cell activation and cytokine
   production, particularly IL-6, a known contributor to MM progression.
   This progression reflects how the immune system, initially attempting
   to control the abnormal cells, becomes increasingly compromised,
   allowing for tumour growth and proliferation in later stages. Full
   lists of pathways for each stage are presented in [104]Tables S7–S9.

4.3. Identification of the Shortlisted Candidate Repurposed Drugs for MM and
Its Stages

   To perform in silico drug-repurposing analysis, we selected the top 150
   over- and 150 under-expressed genes as they are required for most
   repurposing tools. Using the DEG sets, we performed a series of in
   silico drug-repurposing analyses with existing computational tools (see
   Methods); Connectivity Map (CLUE), L1000CDS^2, and SigCom LINCS,
   leading to three lists of candidate repurposed drugs, for each stage
   and dataset, followed by a scoring process yielding to three lists of
   proposed repurposed drugs for each stage (MGUS, sMM, and MM). The top
   repurposed drugs selected are shown in [105]Figure 2. Our in silico
   drug-repurposing analysis identified several candidate drugs with the
   potential for repurposing in MM. These drugs were evaluated based on
   their clinical trial status, preclinical evidence, mechanisms of
   action, and gene targets ([106]Table S16). While some drugs have been
   previously investigated in MM, others have only been tested in other
   cancer types. Key mechanisms of action included cyclin-dependent kinase
   (CDK) inhibition, histone deacetylase (HDAC) inhibition, and selective
   estrogen receptor modulation.

Figure 2.

   [107]Figure 2
   [108]Open in a new tab

   Top-scored candidate repurposed drugs for each disease state (MGUS,
   sMM, and MM) according to our scoring scheme [[109]23]. (A). MGUS, (B).
   sMM, and (C). MM. Bortezomib, which is detected in the sMM stage, is an
   already FDA drug for MM.

   For MGUS, the top candidate repurposed drugs proposed regarding the
   in-house scoring scheme used were geldanamycin, roscovitine, PP-30,
   mitomycin C, and collybolide. For sMM, the top candidate repurposed
   drugs proposed regarding the scoring scheme used were olprinone,
   lamotrigine, diprotin A, collybolide, and 112726-66-6 (BTCP). For MM,
   the top candidate repurposed drugs proposed regarding the scoring
   scheme used in this study were radicicol, piperlongumine, entinostat,
   vecuronium, and terreic acid ([110]Tables S10–S12).

4.4. Investigation of Structural Similarity Concerning Ongoing Clinical
Trials

   The shortlisted candidate repurposed drugs were screened for structural
   similarity with the MGUS-, sMM-, and MM-related drugs in clinical
   trials, and FDA-approved drugs of MM. Pairwise structural similarity
   was calculated using a Tanimoto score threshold of 80%. As shown in
   [111]Figure 3, heatmaps for each disease state suggest that the
   majority of all examined drugs are not very similar, indicating a lack
   of redundancy and a wide range of structural diversity in both our
   shortlist and clinical trials. However, some similarities are present,
   particularly for MM. For instance, ten candidate repurposed drugs are
   also used in ongoing clinical trials, therefore showing a Tanimoto
   similarity of 100%. Additionally, the candidate repurposed drug
   exemestane, was found to have a similarity score of 82% against
   dehydroepiandrosterone, a clinical trial drug. The repurposed drug
   fluocinolone acetonide was also found to have a similarity score of 82%
   against the clinical trial drug dexamethasone. Lastly, the repurposed
   drug ivermectin b1a showed an 81% similarity with the clinical trial
   drug bryostatin 1.

Figure 3.

   [112]Figure 3
   [113]Open in a new tab

   Structural similarity heatmaps of shortlisted repurposed drugs along
   with the ongoing drugs in clinical trials for (A) MGUS, (B) sMM, and
   (C) MM. For MM, the maximum similarity between a repurposed drug and a
   clinical trial drug was kept, since too many drugs were available for
   this stage. X-axis shows the candidate repurposed drugs. Y-axis shows
   the drugs in clinical trials for each disease state.

   Additionally, the shortlisted candidate repurposed drugs were screened
   for structural similarity among the three stages: MGUS, sMM, and MM.
   Pairwise structural similarity was calculated using a Tanimoto score
   threshold of 87%. As shown in [114]Figure 4, again in this comparison,
   the hierarchical clustering suggests that all drugs are not very
   similar, indicating a lack of redundancy and a wide range of structural
   diversity in our shortlisted candidate repurposed drugs. Ten of these
   drugs were detected in two disease stages as shown by the asterisk.
   These drugs include terreic acid, calyculin A, radicicol, sofalcone,
   piperlongumine, vorinostat, 849234-64-6
   (4-acetamido-N-(2-amino-5-thiophen-2-ylphenyl)benzamide), bortezomib,
   elesclomol, and linifanib.

Figure 4.

   [115]Figure 4
   [116]Open in a new tab

   Hierarchical clustering of the shortlisted candidate repurposed drugs
   for MGUS, sMM, and MM. The different groups in each box are thresholded
   at Soergel distance value 0.15. Candidate repurposed drugs that are
   detected in two disease stages are marked with an asterisk (*).

   To further explore the candidate repurposed drugs, we extracted the
   corresponding gene targets of each drug through mainly the Drug
   repurposing hub database, and in cases where no target was found,
   DrugBank and PubChem were also used ([117]Tables S13–S15).

   We then performed Gene ontology (GO) analysis (biological processes)
   per disease stage, using the gene targets of the candidate drugs
   through the ClusterProfiler R package. Additionally, hierarchical
   clustering of the enriched terms was performed again using the
   ClusterProfiler R package. This relies on the pairwise similarities of
   the enriched terms calculated by the use of Jaccard’s similarity index
   (JC) ([118]Supplementary Figures S1 and S2).

   To further elucidate the functional differences and commonalities
   between the three disease stages, we generated an enrichment map plot
   using the ClusterProfiler R package, organising enriched terms into a
   network with edges connecting overlapping gene sets. In this way,
   mutually overlapping gene sets tend to cluster together, making it easy
   to identify functional modules. These gene targets of the proposed
   candidate repurposed drugs from our analysis, provide valuable insights
   into the fundamental transcriptional characteristics underlying the
   functional properties of MGUS, sMM, and MM.

   This analysis highlights BPs involved in “cell signalling pathways”,
   such as vascular endothelial growth factor signalling pathway,
   brain-derived neurotrophic factor receptor signalling pathway, response
   to insulin and others ([119]Figure 5, yellow cluster). Moreover, the
   purple cluster is involved in cellular response to insulin stimulus,
   cellular response to peptide hormone stimulus, response to macrophage
   colony-stimulating factor, and others. Notably, the yellow and purple
   clusters overlapped spatially, indicating that these functionally
   coherent groups share biologically related terms. Additionally, BPs
   involved in membrane and action potentials and potassium ion transport
   were also highlighted ([120]Figure 5, red cluster). GO terms associated
   with this cluster were highly enriched in all disease stages, and
   particularly in MGUS and sMM. Another important cluster generated by
   this analysis highlights BPs involved in epigenetic regulation of gene
   expression and protein modification ([121]Figure 5, green cluster).
   Additionally, GO terms associated with this cluster were highly
   enriched in all disease stages. Moreover, a cluster highlighting
   Circadian rhythms, protein localisation, mitochondrial function, and
   others, has also been generated through this analysis. Most BPs of this
   cluster have been highlighted in MM (blue cluster, [122]Figure 5). The
   blue-green cluster is focused on hormone-related signalling pathways,
   with the GO terms associated with this cluster being enriched in all
   disease stages. Lastly, a small cluster associated with response to
   oxidative stress has also been highlighted (yellow cluster, [123]Figure
   5).

Figure 5.

   [124]Figure 5
   [125]Open in a new tab

   Biological theme comparison of MGUS, sMM, and MM. The enrichment map
   shows the top enriched terms in MGUS, sMM, and MM, organised into a
   network with edges connecting overlapping gene sets. Coloured-based
   clustering was performed using the emapplot function, which reflects
   functional similarity among enriched terms.

4.5. Drug Combination Synergies

   The drug combination data were extracted through the DrugComb database
   (see Methods). In this work, we kept four synergy models: ZIP (Zero
   Interaction Potency), Loewe Additivity, HSA, and Bliss Independence.
   Drug combinations are considered synergistic if they exceed specific
   thresholds: moderate synergy (>+5) and strong synergy (>+10)
   ([126]Supplementary Figure S3).

4.6. Top Drug Combinations Identified

   To find the top synergies regarding our disease of interest, MM, a
   systematic analysis of MM’s FDA-approved drugs was carried out to
   detect synergies in general and, more specifically, synergies with our
   proposed candidate repurposed drugs. When looking for drug combinations
   common across all synergy models, 17 drug combinations were detected
   ([127]Supplementary Figures S4 and S5), from which, however, none
   include any of our proposed candidate repurposed drugs. Therefore, we
   stick to the drug combinations for each synergy model separately, which
   are then combined in a final list.

   A total of 181 drug combinations exhibited synergy across multiple,
   with some combinations including drugs from our proposed candidate
   repurposed drug list. The strongest synergies observed per model are
   shown in [128]Figure 6, in a subset of 20 top-ranking combinations.
   Notably, an analogue of one of the proposed candidate repurposed drugs,
   erlotinib, was also identified, further supporting its potential
   application in combination therapy. Top drug combinations of FDA drugs
   along with our proposed candidate repurposed drugs include lenalidomide
   + retinoic acid (from the ZIP and Bliss models), erlotinib
   hydrochloride + bortezomib (from the ZIP model), and 23541-50-6
   (daunorubicin HCL) + thalidomide (from the HAS model). The total
   significant drug combinations can be found in [129]Table S19.

Figure 6.

   [130]Figure 6
   [131]Open in a new tab

   Summary of the top 20 drug combinations per model. The figure presents
   a comparative analysis of synergy scores across different models,
   highlighting the most promising combinations for MM treatment. (A)
   Synergy Zip Score model, (B) Synergy Loewe Score model, (C) Synergy HSA
   Score model and (D) Synergy Bliss Score Model.

   The results suggest that several repurposed drugs could be explored
   further for their potential in MM treatment. Additionally, the
   identification of highly synergistic drug combinations underscores the
   importance of combination therapy approaches. Future studies should
   focus on validating these findings in preclinical and clinical settings
   to assess their therapeutic potential.

5. Discussion

   The search for effective treatments for MM continues to be a
   substantial problem, requiring ongoing investigation into new
   therapeutic approaches. Drug repurposing, the process of using existing
   drugs for a novel disease, offers a promising approach due to its cost-
   and time-effective nature. Drug repurposing is essential in the
   advancement of innovative anti-cancer therapies. This study explores
   the capabilities of transcriptomic signature-based drug repurposing
   using all the publicly available bulk transcriptomics datasets on MGUS,
   sMM, and MM. We filter and prioritise candidate repurposed drugs to be
   shortlisted for further analysis in the future. Our study included a
   scoring scheme from a previous work of our group [[132]23], resulting
   in 25 candidate repurposed drugs for MMGUS, 23 for sMM, and 66 for MM.
   From these, 18 candidate repurposed drugs for MGUS, 16 for sMM, and 52
   for MM had available structure information. These highlighted drugs
   have generally been shown to be structurally distinct from each other,
   meaning that most of the proposed drugs do not belong to the same
   structural subgroup. Additionally, Gene ontology terms (biological
   processes) were detected using the DEGs of each dataset and disease
   state. The same scoring scheme used for the candidate repurposed drugs
   was also used here to select a single list of pathways for each disease
   state. Lastly, we detected the gene targets of the proposed repurposed
   drugs along with the associated Gene ontology terms (biological
   processes).

   According to MeSH, when analysing the top 15 proposed candidate
   repurposed drugs for each disease stage—MGUS, sMM, and MM—three drugs
   were characterised as antineoplastic for MGUS, including geldanamycin,
   mitomycin, and roscovitine. Additionally, for sMM, two drugs were
   detected as antineoplastic: linifanib and vorinostat. Lastly, for MM,
   five out of the fifteen top proposed candidate repurposed drugs were
   antineoplastic agents, including radicicol, entinostat, salermide,
   temozolomide, and daunorubicin. Other drug categories among the top
   proposed candidate repurposed drugs include immunosuppressants (e.g.,
   cyclosporin A detected for sMM).

   To synthesise the pathway-level signals, we summarise key therapeutic
   families implicated by our stage-specific targets and drug candidates
   ([133]Table 3). This table highlights where candidates align with known
   mechanisms versus under-explored processes enriched in earlier stages.

Table 3.

   Mechanistic families for shortlisted repurposing candidates across the
   MGUS, sMM, and MM spectrum.
   Pathway Family Representative Targets/Mechanisms Example Candidates
   from Our Shortlist Key References