Abstract

   Explaining predictions for drug repositioning with biological knowledge
   graphs is a challenging problem. Graph completion methods using
   symbolic reasoning predict drug treatments and associated rules to
   generate evidence representing the therapeutic basis of the drug. Yet
   the vast amounts of generated paths that are biologically irrelevant or
   not mechanistically meaningful within the context of disease biology
   can limit utility. We use a reinforcement learning based knowledge
   graph completion model combined with an automatic filtering approach
   that produces the most relevant rules and biological paths explaining
   the predicted drug’s therapeutic connection to the disease. In this
   work we validate the approach against preclinical experimental data for
   Fragile X syndrome demonstrating strong correlation between
   automatically extracted paths and experimentally derived
   transcriptional changes of selected genes and pathways of drug
   predictions Sulindac and Ibudilast. Additionally, we show it reduces
   the number of generated paths in two case studies, 85% for Cystic
   fibrosis and 95% for Parkinson’s disease.

   Subject terms: Drug discovery, Machine learning, Computational models,
   Psychiatric disorders
     __________________________________________________________________

   Explaining predictions for drug repositioning with biological knowledge
   graphs is a challenging problem. Here, the authors present an approach
   for automated biological evidence generation and show strong
   correlation between extracted paths and derived transcriptional changes
   of genes and pathways for predictions of Sulindac and Ibudilast in
   FragileX.

Introduction

   Discovering safe and effective treatments for rare diseases presents a
   formidable challenge, starting with the sourcing, normalising, and
   integration of copious, diffuse and diverse data sources that inform
   drug discovery. When it comes to the more than 7000 rare and genetic
   disorders, valuable information is frequently dispersed across various
   databases, encompassing clinical symptoms, impacted pathways, animal
   models, and potential treatments. To address this issue, AI-driven
   computational tools and knowledge can be harnessed to interconnect this
   diverse data, enabling the prediction of innovative drug candidates.
   Typically, existing computational methods generate an overwhelming
   number of therapeutic hypotheses, necessitating labour-intensive manual
   curation by experts specializing in the respective disease. This
   process involves a significant amount of time dedicated to establishing
   the therapeutic linkage between the drug and the disease, given the
   identification of the mechanism of action is pivotal in establishing
   clinical tractability.

   Knowledge graphs (KG) have been used extensively in the recent past to
   solve complex problems in life sciences including drug discovery for
   rare diseases. Knowledge graphs are constructed with head
   entity-relation-tail entity (h, r, t) triples where entities correspond
   to nodes and relations correspond to links connecting the entities.
   Biological knowledge graphs are constructed with biological nodes such
   as drugs, diseases, genes, pathways, phenotypes, proteins etc and the
   links between these nodes. Knowledge base completion (KBC)^[40]1 is the
   task of predicting the tail entity t, given the head entity h, and the
   relation r, or head entity h given the tail entity t and relation r. It
   can also be used to predict unseen relations between the head and tail
   entities. Several approaches have been proposed in the past for KBC
   that learn a continuous vector space representation for entities and
   relations. These methods include translational models using
   distance-based scoring (TransE^[41]2, TransH^[42]3, RotatE^[43]4),
   semantic matching models (RESCAL^[44]5, DistMULT^[45]6, ComplEX^[46]7),
   Graph convolutional networks (GCN^[47]8, R-GCN^[48]9), Attention
   networks (GAT^[49]10) and context-based encoding approaches
   (KG-BERT^[50]11). Yet these models cannot produce a rationale for the
   predictions. Alternatively, there are comparatively few models such as
   AnyBURL^[51]12,[52]13, Minerva^[53]14 that target the symbolic space
   capable of also producing a set of learnt logical rules for each
   prediction. Many recent studies have focused on the application of
   existing drugs for new therapeutic indications using biological
   knowledge graphs, for example, by identifying potential candidates
   based on shared biological mechanisms or targets. One of those has
   built a billion-edge biomedical KG from millions of biomedical
   documents to identify potential drug repositioning candidates for
   diseases with unknown treatment^[54]15. Biological KGs have been built
   integrating preclinical, clinical, and literature evidence from public
   sources to predict and rank genes leading to potential mechanisms of
   EGFRi resistance^[55]16. To inform clinical decision-making, knowledge
   graphs have been built from experimental data, public databases, and
   literature to augment and enrich proteomics data^[56]17. Few other
   studies have used knowledge graph completion models for drug
   repositioning in cancer and auto-immune
   conditions^[57]16,[58]18,[59]19.

   The list of ranked predicted drugs by KBC models can differ across
   models even with the same drug and disease data due to differences in
   model logic. This is observed usually with different ranking evaluation
   scores for different models^[60]20. Notwithstanding, these algorithms
   generate prediction lists of hundreds of drugs with the potential to
   treat the query disease. Real world limitations including cost, assay
   availability and capacity, mean that only a limited number of drug
   predictions can be experimentally tested preclinically for each
   disease. This constraint necessitates drug discovery scientists to
   devise methods for filtering drug predictions, prioritizing those that
   have a higher likelihood of demonstrating efficacy and safety in
   preclinical experiments. There are various strategies to increase
   success, but it all comes down to understanding at a biological level
   how a proposed treatment might be therapeutically beneficial for the
   patient. At its simplest level, this reduces to establishing an overlap
   between mechanisms of disease biology with the biology that the drug
   targets. We call this process establishing “therapeutic rationale”, a
   comprehensive rationale providing a full picture of how a drug may be
   useful for treating the patient. It considers the current complete
   understanding of disease biology, including known causal genes and
   perturbed biological pathways and is established by combining several
   techniques including extraction of “evidence chains”, a set of paths in
   the knowledge graph that explain the relationships between a disease
   and a target drug, connected via biological entities.

   The explainability of predictions made by a model is addressed by
   relatively few methods, generally based on logical and path-based
   approaches that are capable of providing the user with an explicit
   explanation path that may serve as a justification for experimentally
   testing the prediction. One of the pioneers is the Path Ranking
   Algorithm (PRA)^[61]21,[62]22. KG embeddings learnt from a KBC model
   have been used to create novel ranked paths between drugs and diseases,
   but the approach was limited to one-hop explainable paths^[63]23.
   Reasoning with logical rules has been addressed in areas such as Markov
   logic networks (MLNs)^[64]24, however, such techniques typically do not
   scale well to modern, large-scale KGs. Recently, symbolic models such
   as RuleN^[65]25, its successors AnyBURL^[66]12,[67]13, MINERVA^[68]14
   and PoLo^[69]26 mine logical rules using a reinforced search-based path
   sampling for knowledge graph reasoning considering it as a
   neural-driven multi-hop problem to make predictions. Rules learnt
   during the learning phase are probabilistically annotated with
   confidence scores that represent the probability of predicting a
   correct fact with the rule. More details are provided in “Methods”. The
   following example shows a rule explaining how compound X can treat
   disease Y.
   [MATH: <mi mathvariant="normal">compound</mi><mo>_</mo><mi
   mathvariant="normal">disease</mi><mo>_</mo><mi
   mathvariant="normal">treats</mi><mrow><mo>(</mo><mrow><mi>X</mi><mo>,</
   mo><mi>Y</mi></mrow><mo>)</mo></mrow><mo>⇐</mo><mi
   mathvariant="normal">compound</mi><mo>_</mo><mi
   mathvariant="normal">gene</mi><mo>_</mo><mi
   mathvariant="normal">binds</mi><mrow><mo>(</mo><mrow><mi>X</mi><mo>,</m
   o><mi>A</mi></mrow><mo>)</mo></mrow><mo>,</mo><mi
   mathvariant="normal">compound</mi><mo>_</mo><mi
   mathvariant="normal">gene</mi><mo>_</mo><mi
   mathvariant="normal">activates</mi><mrow><mo>(</mo><mrow><mi>B</mi><mo>
   ,</mo><mi>A</mi></mrow><mo>)</mo></mrow><mo>,</mo><mspace
   width="0.25em"></mspace><mi
   mathvariant="normal">compound</mi><mo>_</mo><mi
   mathvariant="normal">in</mi><mo>_</mo><mi
   mathvariant="normal">trial</mi><mo>_</mo><mi
   mathvariant="normal">for</mi><mrow><mo>(</mo><mrow><mi>B</mi><mo>,</mo>
   <mi>Y</mi></mrow><mo>)</mo></mrow> :MATH]

   Compound X binds to gene A, which is activated by compound B, which is
   in trial for the disease Y.

   The rule can be written in a simplified form as follows.
   [MATH: <mspace width="0.3em"></mspace><mspace
   width="0.3em"></mspace><mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>b</mi><mi>i</mi><mi>n</
   mi><mi>d</mi><mi>s</mi><mo>→</mo><mi>G</mi><mi>e</mi><mi>n</mi><mi>e</m
   i><mspace
   width="0.25em"></mspace><mo>←</mo><mi>a</mi><mi>c</mi><mi>t</mi><mi>i</
   mi><mi>v</mi><mi>a</mi><mi>t</mi><mi>e</mi><mi>s</mi><mo>−</mo><mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]

   Given a compound prediction Lumacaftor for the disease cystic fibrosis
   and the above rule supporting the prediction we generate a path or an
   evidence chain from the Healx KG as shown in Fig. [70]1a. The Healx KG
   is built from several publicly available and proprietary data sources
   and internally curated data. Details of nodes and edge types in this KG
   and their sources are provided in “Methods” under “Data”.

Fig. 1. Example rules and corresponding evidence chains generated from Healx
KG.

   [71]Fig. 1
   [72]Open in a new tab

   a An example path from the graph satisfying the rule (Compound – binds
   −> Gene <− activates – Compound – in trial for −> Disease) for a given
   candidate drug prediction “Lumacaftor” in cystic fibrosis. b Example
   rules learnt by the model and paths generated for each rule. c Example
   of an uninformative evidence chain containing two ancestor
   relationships. Yellow symbol indicates the type compound, white
   indicates disease, green indicates gene, red indicates phenotype and
   blue indicates pathway.

   Several other examples of rules and the corresponding paths that could
   be generated from the graph are shown in Fig. [73]1b. They all show the
   therapeutic basis of how a drug is connected to the disease. A
   limitation of this approach is the vast amount of evidence chains that
   can be generated for a single drug prediction that is infeasible to
   review by human experts in a reasonable time. The number of evidence
   chains grow with the number of rules associated with each prediction.
   Notwithstanding, in the biological context of the disease of interest
   many of the rules associated with predictions are irrelevant,
   redundant, or not beneficial to establishing a molecular understanding
   of efficacy or safety (hereafter, uninformative). Evidence chains of
   this class do not inform on therapeutic action, efficacy, or safety.
   Neither do they enable decision-making in a drug discovery programme
   and can be safely discarded. For example, the evidence generated for
   the drug Tobramycin as shown in Fig. [74]1c contains two ancestor
   relationships related to cystic fibrosis, one of them being “Rare
   genetic disease” and a “treats” relation with a very common condition
   “Cataract” which makes this chain uninformative when attempting to
   understand a potential cystic fibrosis treatment. The disease-disease
   ancestor relationships are derived from MONDO^[75]27 and
   Orphanet^[76]28 ontologies.

   Rules generated for each prediction may also differ according to the
   disease of interest. Therefore, it’s not possible to construct a
   universal set of rules to be used in all disease case studies since a
   new disease study might involve a new rule that has not been previously
   observed, yet still be biologically relevant. Manual curation is one
   solution to choosing the most appropriate rules required prior to
   generating evidence chains but would limit the usefulness of the
   approach for routine drug discovery given the inherent lack of
   scalability of expert human curation. In addition, manual curation
   poses a risk of personal bias that may skew the analysis and limit its
   reproducibility. To make evidence chains more relevant to the disease
   of interest, the disease landscape should be defined, and an automated
   way of generating biologically meaningful evidence should be
   implemented as we suggest here.

   We start from a symbolic KBC model with a reinforcement learning-based
   reasoning approach AnyBURL^[77]12,[78]13 to make predictions and
   generate rules that support those predictions. We introduce a
   multi-stage pipeline that incorporates an automatic filtering model.
   This pipeline works in conjunction with AnyBURL’s rules to filter and
   generate biologically relevant and meaningful explanations for the
   predictions it produces. A similar approach to ours has been used in
   previous work^[79]29 where neural-driven RL based symbolic reasoning
   methods MINERVA^[80]14 and PoLo^[81]26 have been used to make
   biomedical KG predictions and reasoning chains applying pruning to the
   correct target type. The high-level ranked rules as given by MINERVA in
   the method appeared to be relevant but the low-level reasoning chains
   had to be manually reviewed by subject matter experts to confirm
   validity. Our primary contribution in this study is the automated
   extraction of pertinent high-level rules and biologically meaningful
   low-level reasoning chains feasible enough to be reviewed by experts.
   The inputs to the method include the knowledge graph and a list of
   genes and pathways of importance specific to the disease produced by
   curators and bioinformaticians in the disease landscape analysis (see
   “Methods”). The auto-filtering model includes a rule filter,
   significant path filter and a gene/pathway filter to subset only the
   biologically significant chains, reducing the total amount of evidence
   generated that would require human expert review. It also includes a
   deductive path builder that builds additional evidence chains using
   prioritized edge types when we have multiple chains involving the same
   nodes. More details are provided in “Methods”, and a workflow diagram
   of the entire pipeline is presented in Fig. [82]2 in “Results”.

Fig. 2. Evidence chains generation workflow.

   [83]Fig. 2
   [84]Open in a new tab

   a Generating predictions and the corresponding rules from prediction
   models and AnyBURL b Generating evidence chains without the
   auto-filtering model. c Gathering biologically relevant gene-disease
   associations (GDAs) and pathways as input to the gene/pathway filter. d
   Generating evidence chains with application of the auto-filtering model
   consisting of the rule-based, significant path, gene/pathway filters
   and the deductive path builder.

   We demonstrate our approach in Fragile X Syndrome (FXS), a rare genetic
   disorder where we validate the method against preclinical experimental
   data that showed a strong correlation between our evidence chains
   analysis results and experimentally derived transcriptional changes of
   selected genes and hallmark pathways of the predicted drug treatments
   Sulindac and Ibudilast. We also systematically tested the approach on
   two disease case studies cystic fibrosis and Parkinson’s disease in
   order to validate the method against known treatments, assessing the
   general applicability of the approach while also covering different
   therapeutic areas. We chose these two diseases by, firstly, selecting
   for diseases that had at least one known approved treatment, diseases
   which were not excessively complex and had a genetic cause. We avoided
   complex and common diseases such as cancers and auto-immune conditions
   since validating the output against known information for these
   diseases would involve too many genes and pathways and likely form an
   overly complex case study of our automatic evidence chains filtering
   approach. Cystic fibrosis met our selection criteria of having a
   causative gene (CFTR) with known treatments and is well-suited for
   validation purposes as we can verify whether the gene and treatment
   links are extracted by our pipeline. To further evaluate our approach,
   we chose Parkinson’s disease, which is a more common disease with
   approved treatments for some of its phenotypes. Knowledge graph
   approaches have been applied to Parkinson’s disease in recent studies
   using it for drug repositioning^[85]30 and validating predicted targets
   against a gold standard dataset of genes associated with the
   disease^[86]31.

   This study represents a systematic evaluation of validating the
   evidence chains generated for diseases for therapeutic rationale using
   a knowledge graph completion model, path generation, and the
   application of an automatic filtering model. Consequently, the primary
   challenge in this work stemmed from the absence of any pre-existing
   benchmark data for quantitative validation of the approach. Creating a
   gold standard validation set of evidence chains for drug-disease pairs
   is prohibitively infeasible. It would require decision-making on what
   is the rationale for choosing the top-ranking evidence chains for any
   given drug-disease pair, differs for each disease of interest, and
   therefore would require a prohibitively large amount of time to conduct
   expert review from curators, pharmacologists and bioinformaticians.
   Therefore, we adopted a qualitative approach to address this challenge
   until such a gold standard validation set becomes available. We reached
   out to experts to evaluate whether the evidence chains generated by our
   case study diseases are relevant to the disease of interest and
   analysed if the disease related genes and pathways provided by curators
   are automatically extracted by our pipeline. In addition, and as an
   alternative to qualitative analysis, we explored ways to evaluate the
   efficiency of the auto-filtering method applied to rules and evidence
   chains. From the complete list of initial rules generated by AnyBURL
   for both cystic fibrosis and Parkinson’s disease predictions, we
   manually curated a set of rules with the help of drug discovery
   scientists, which they thought could produce biologically relevant
   evidence chains. We made the entire set of rules from AnyBURL available
   for review and requested curators to assess the biological relevance of
   these rules in the context of any disease in general. This approach
   eliminated bias toward a specific disease and ensured comparability in
   the curation process. We checked whether the automated filtering method
   can retain all the curated rules before generating evidence. Lastly, we
   performed a case study on Fragile X syndrome, where we validated our
   approach against preclinical experimental data that demonstrated a
   strong correlation between our evidence chains analysis results and
   experimentally derived transcriptional changes of selected genes and
   hallmark pathways of the predicted drug treatments Sulindac and
   Ibudilast.

Results

Evidence chain generation workflow

   Figure [87]2 shows a complete workflow diagram of the entire pipeline.
   It begins with predictions and proceeds through therapeutic hypothesis
   generation, gathering biological relevance, and generating evidence
   chains. The figure provides a comparison between the results obtained
   with and without automatic filtering, demonstrating the impact of this
   filtering process on the evidence chains.

Predictions and rule generation

   We start with the input data, coming from the Healx KG as shown in
   Fig. [88]2a. AnyBURL and a set of other drug prediction models are
   trained on this data and predictions are generated. AnyBURL produces a
   set of rules for each prediction. Either the top n (n = 100 in our
   case) predictions from the prediction models or specific predictions of
   interest from the top n are chosen for hypothesis generation. The
   criteria for choosing query predictions will differ according to
   therapeutic programme needs and are not discussed further here since
   our primary focus is on evidence chain generation. After selecting the
   predictions to be used for therapeutic hypothesis generation, we
   acquire the associated rules generated by AnyBURL for those
   predictions. Subsequently, the process of automatic filtering can be
   initiated.

Hypothesis generation without filtering

   Once a subset of predictions and their corresponding rules are
   established from the previous step, we generate evidence chains as
   shown in Fig. [89]1b, by querying the graph for all possible paths
   following the rules that suggested the prediction. Here we show how
   paths could be generated if no filters were applied. We would only
   search paths with the types of nodes mentioned in the rule. In
   Fig. [90]1b, black box, we call this, “evidence chains without any
   filters applied” to the AnyBURL output.

Hypothesis generation with filtering

   Here the auto-filtering model is applied to the subset of predictions
   and their corresponding rules as shown in Fig. [91]1d. The Rule-based
   filter is first applied. As previously described, uninformative rules
   do not provide a useful biologically relevant therapeutic rationale,
   and this automatically eliminates drug predictions that were suggested
   by less relevant rules. Only the resulting predictions and rules go
   through the phases of significant path filter, deductive path building
   and the gene/pathway filters as shown in Fig. [92]1d. More details on
   filters are provided in Methods. Gene and pathways for filtering are
   gathered from the biological relevance gathering phase as described
   below.

Biological relevance gathering

   We perform an extensive landscape study for the given disease and
   identify gene-disease associations (GDAs) and biological pathways
   specific to the diseases of interest as shown in Fig. [93]1c for cystic
   fibrosis, Parkinson’s disease and FXS. GDAs and pathways are sourced
   from Genomics England PanelApp^[94]32, Open Targets^[95]33,
   Pharos^[96]34, Geneshot^[97]35 and Healx KG. GDAs that were found in at
   least four of the five resources are selected for manual validation.
   Manual curation assessed the validity of the associated evidence
   provided by the original resource, and further scientific literature
   searches were performed when this evidence was insufficient. In order
   to obtain a list of disease-relevant pathways, we used GDAs curated in
   the previous step to perform pathway enrichment analysis using Fisher’s
   Exact Test method to map sets of genes to pathway terms^[98]36.
   Furthermore, we have applied Benjamini–Hochberg correction^[99]37 to
   account for multiple hypotheses testing and then used an adjusted P
   value threshold of 0.01 to extract the most statistically significant
   pathways. Pathways were sourced from KEGG^[100]38, Reactome^[101]39 and
   Wikipathways^[102]40 and are defined as a series of interconnected
   biochemical reactions that occur within a cell or organism, leading to
   a specific biological outcome. As described in Fig. [103]1c, the
   resulting genes and pathways are fed into the gene and pathway filter
   in the pipeline, providing biological context for the automated
   evidence chains filtering process. Paths that contain at least one of
   the genes or pathways are retained.

Rule-based filtering in practice

   Finally, as described in Fig. [104]1d, we arrive at a reduced number of
   evidence chains after the application of all filters. To summarize, we
   compute the reduction achieved via auto-filtering, by computing all
   possible 2-hop paths between the predictions and the disease, paths
   from all rules produced by AnyBURL for the predictions and deducting
   the final paths produced by the pipeline. We present and discuss the
   percentage of reduction achieved for cystic fibrosis and Parkinson’s
   disease case studies in “Results” further below in Table 3.

Fragile X syndrome case study

   We validated the utility of our automatic filtering pipeline against
   FXS preclinical experimental data. Briefly, Sulindac and Ibudilast
   emerged as promising compounds in several non-Knowledge Graph
   (KG)-based drug prediction models, ranking highly, and with high
   confidence as potential treatments for FXS. Furthermore, these drugs
   were found to improve behaviour and cognition in FXS mouse models (Fmr1
   KO1 and Fmr1 KO2) during in-house efficacy studies. However, the non-KG
   predictive models lacked the ability to provide a therapeutic rationale
   for their predictions. In this case study we show how our unique
   pipeline excels at generating biologically useful evidence for these
   drug predictions, even when they originated from different models. The
   only requirement is that query drugs must also be predicted by the
   AnyBURL model.

   We compared the evidence chains extracted for FXS against available
   preclinical experimental data to confirm that the pipeline
   automatically extracted mechanistically significant and meaningful
   information with the potential to inform preclinical decision-making.
   We analysed the gene expression levels and pathways inferred from FXS
   mouse models treated with Sulindac and Ibudilast and checked whether
   gene expression levels were changing in line with expectations of the
   evidence chains. The preclinical experimental details are provided in
   “Methods”. Results are presented in Tables [105]1 and [106]2, which
   detail the significantly regulated genes and associated pathways for
   each treatment. For Ibudilast, there is 622 times upregulation and 713
   times downregulation in P values. In Sulidac, its 1637 times
   upregulation and 1578 times downregulation. Here, we are demonstrating
   the observed molecular effects of predicted reasoning from evidence
   chains observed for FXS and not the entire list of genes altered in the
   preclinical testing. It is an experimental confirmation of the in
   silico hypothesis of the mechanism. In addition, Fig. [107]3a, b
   presents the automatically generated evidence chains, showing the
   pathways and significant genes for each drug which were reviewed by
   experts and considered significant.

Table 1.

   Genes and pathways found in evidence chains that have an effect in
   preclinical experiments in the Fmr1 KO mouse model for Ibudilast
   Gene P value Pathway associated FDR-adjusted pathway P value
   PDE4D 0.0276 ↑ cAMP signalling pathway 1.02 E-4 ↑
   PENK 0.0434 ↓ Neuroactive ligand–receptor interaction 0.0123 ↓
   FOS 0.0602 ↑(significant logFC = 0.854) cAMP signalling pathway 1.02
   E-4 ↑
   AKT1 0.029 ↓ cAMP signalling pathway 1.02 E-4 ↑
   MEF2C 0.0287 ↑ cGMP-PKG signalling pathway 0.0184 ↑
   NPY2R 0.0094 ↓ Neuroactive ligand–receptor interaction 0.0123 ↓
   [108]Open in a new tab

   Columns are Gene; HGNC gene symbol, P value; statistical significance
   of the gene being differentially expressed based on an estimate of log
   fold change following drug treatment. Values computed by DESeq2, P
   values are two-sided. Pathway associated; pathway term describing gene
   function following analysis by gene set enrichment analysis,
   FDR-adjusted pathway P value; statistical significance of pathway
   enrichment score corrected for multiple hypothesis testing using
   Benjamini–Hochberg procedure.

Table 2.

   Genes and pathways found in evidence chains that have an effect in
   preclinical experiments in the Fmr1 KO mouse model for Sulindac
   Gene P value Pathway associated FDR-adjusted pathway P value
   PTGS1 0.0269 ↓ Serotonergic synapse Not significant
   MAPK3 0.0028 ↓ Pathways of neurodegeneration 4.55 E-24 ↓(Most
   significant “Alzheimer’s disease”)
   PTGS2 0.0042 ↓ Alzheimer disease 4.55 E-24 ↓
   [109]Open in a new tab

   Columns are Gene; HGNC gene symbol, P value; statistical significance
   of the gene being differentially expressed based on an estimate of log
   fold change following drug treatment. Values computed by DESeq2, P
   values are two-sided. Pathway associated; pathway term describing gene
   function following analysis by gene set enrichment analysis,
   FDR-adjusted pathway P value; statistical significance of pathway
   enrichment score corrected for multiple hypothesis testing using
   Benjamini–Hochberg procedure.

Fig. 3. Evidence chains produced for Ibudilast and Sulindac and validated
against preclinical experimental data.

   Fig. 3
   [110]Open in a new tab

   a Evidence chains extracted for Ibudilast by the evidence generation
   and auto-filtering pipeline. b Evidence chains extracted for Sulindac
   by the evidence generation and auto-filtering pipeline. Yellow symbol
   indicates the type compound, white indicates disease, green indicates
   gene, red indicates phenotype and blue indicates pathway.

   In Fig. [111]3a, we observe that the cAMP (cyclic adenosine
   monophosphate) signalling and cGMP (cyclic guanosine
   monophosphate)-protein kinase G (PKG) signalling pathways have been
   extracted by our pipeline in the paths connecting Ibudilast to FXS. The
   phosphodiesterase (PDE) inhibitor Ibudilast has been shown to inhibit
   PDE3A (cAMP), PDE10 (cAMP & cGMP), PDE11 (cGMP), and PDE4 (cAMP) with
   preferential potency against PDE4^[112]41 and has shown to have several
   beneficial effects in the brain^[113]42–[114]44. FXS patients have
   reduced cAMP levels^[115]45 and several preclinical and clinical
   studies have supported PDE4 inhibition as a viable target in
   FXS^[116]46–[117]49. Inhibition of PDE10, levels which are elevated in
   FXS^[118]50, has also demonstrated efficacy in a preclinical model of
   FXS by normalising EEG brain activity^[119]51. The selectivity of
   ibudilast against both PDE4 and PDE10 make it an attractive therapeutic
   candidate for FXS.

   BPN-14770, a specific PDE4D inhibitor, improved behaviour and cognition
   in a mouse model of FXS^[120]52. In addition to this, BPN-14770 has
   recently been shown to improve cognition in FXS patients in a phase 2
   trial^[121]49. Ibudilast displayed a similar improvement in cognition
   and behaviour in an FXS mouse model, likely due to the significant
   elevation in the cAMP signalling pathway (P < 0.0001) reported in
   Table [122]1. Evidence chains also predict Ibudilast as a treatment via
   c-FOS gene and a non-significant elevation in c-FOS was also observed
   following Ibudilast treatment in the disease models as shown in
   Table [123]1 (P = 0.062). c-FOS gene is an early response gene that
   acts as a transcription factor and is upregulated in response to
   cAMP-dependent CREB activation following an increase in neuronal
   activity.

   Upregulation of c-FOS leads to an elevation in the expression of
   downstream proteins necessary to strengthen neuronal connectivity to
   allow for memory formation. c-FOS levels are reduced in FXS mouse
   models as a result of reduced cAMP signalling^[124]53. In contrast, our
   gene expression data demonstrated that PDE4D expression was
   significantly elevated (P = 0.0276) in the cortex following 3 mg/kg
   Ibudilast treatment (Table [125]1). This could simply be a compensatory
   mechanism, whereby chronic inhibition of the enzyme results in an
   elevation in its expression. This has been reported for rolipram,
   another PDE4 inhibitor, whereby rolipram treatment significantly
   elevated cAMP levels while paradoxically significantly elevating PDE4
   levels^[126]54.

   The evidence chains also predict Ibudilast as a possible treatment for
   FXS through proenkephalin (PENK). PENK is an endogenous opioid
   polypeptide hormone that has been implicated in neuroinflammation and
   identified as an early indicator of vascular dementia^[127]55. Although
   neuroinflammation in FXS is still a contentious topic, microglia and
   astrocytes from FXS mice have been shown to produce an elevated
   proinflammatory cytokine response when activated^[128]56,[129]57. We
   found that Ibudilast treatment significantly reduced PENK expression,
   suggesting a reduction in neuroinflammatory pathways. Ibudilast has
   been reported to have anti-inflammatory properties through cAMP
   signalling, TLR4 inhibition, and has been shown to protect against
   reactive oxygen species, a common precursor to
   inflammation^[130]58–[131]60.

   From the evidence chains we see that Ibudilast, through its association
   with modulating both the cAMP and cGMP pathways, is linked to FXS
   through the proteins AKT1 and MEF2C, and their association with
   epilepsy (Fig. [132]3a). Although not all FXS patients develop
   seizures, preclinical and clinical studies have demonstrated that by
   targeting one particular pathophysiological pathway, such as seizures,
   can often alleviate multiple symptoms in FXS^[133]61–[134]66. It
   therefore stands to reason that a small molecule which is able to
   alleviate seizure incidence could improve other symptoms in FXS and
   therefore warrants further investigation.

   From literature it is evident that inhibiting AKT1 prevents epilepsy in
   a rat model of temporal lobe epilepsy^[135]67, it therefore seems
   rational that modulating AKT signalling through Ibudilast treatment
   could reduce seizures in FXS patients. We see that 3 mg/kg Ibudilast
   treatment significantly reduced AKT1 expression (P = 0.029) in the
   cortex of Fmr1 KO2 mice (Table [136]1). MEF2C haploinsufficiency has
   been linked to an increase in seizure frequency and epilepsy^[137]68.
   From our internal data we found that cortical MEF2C expression was
   significantly elevated (P = 0.0287) in Fmr1 KO2 mice treated with
   3 mg/kg Ibudilast (Table [138]1). This data suggests that elevated
   MEF2C expression following Ibudilast treatment could reduce seizure
   frequency in FXS.

   The evidence chains also indicate a connection between Ibudilast and
   FXS through epilepsy, mediated by the neuropeptide Y receptor protein
   NPY2R (as shown in Table [139]1). NPY2R gene, which regulates anxiety,
   sleep, appetite, and neuronal excitability, has been found to have
   elevated expression levels in brain biopsies from patients with
   temporal lobe epilepsy^[140]69. Administration of 6 mg/kg of Ibudilast
   significantly decreased the cortical expression of NPY2R (P = 0.0094)
   in Fmr1 KO2 mice, as indicated in Table [141]1.

   Diving deeper into to the evidence chains, Sulindac, an inhibitor of
   PTGS1 and PTGS2, has been predicted as a potential therapy for FXS by
   linking these two proteins to their involvement in Alzheimer’s disease
   and amyloid precursor protein (APP) processing (Fig. [142]3b). APP is a
   precursor to the toxic amyloid beta protein which eventually forms the
   amyloid plaques associated with Alzheimer’s disease. The expression of
   PTGS1 and PTGS2 changes throughout the progression of Alzheimer’s
   disease pathology and is believed to contribute to the
   neuroinflammatory aspect of the disease^[143]70. To this end, PTGS2
   inhibition has demonstrated therapeutic benefit in preclinical models
   of Alzheimer’s disease. FXS patients also have elevated amyloid beta
   levels as a result of an increased expression of APP. Modulating
   PTGS1/2 levels with Sulindac could therefore be a beneficial treatment
   for FXS. As conformation of this, Sulindac was found to normalize
   behaviour and improve cognition in a mouse model of FXS and
   significantly reduce the expression of both PTGS1 (P = 0.0269) and
   PTGS2 (P = 0.0042) in the cortex of Fmr1 KO1 mice (Table [144]2).

   Evidence chain analysis also shows that Sulindac can potentially affect
   the MAPK3 signalling pathway, which has been implicated in the symptoms
   and pathophysiology of FXS. Minocycline in particular, has shown to
   effectively improve social and cognitive deficits, which are thought to
   be driven by improved spine maturation, in preclinical models of
   FXS^[145]71,[146]72. In Fig. [147]3b, minocycline and Sulindac have
   shared profile of both inhibiting ERK1 (MAPK3) activity and its
   subsequent downstream signalling cascade, further supported by
   previously published works^[148]73,[149]74.

   Clonidine, an alpha2 adrenergic receptor agonist, has been demonstrated
   to effectively treat symptoms of ADHD in individuals with FXS^[150]75.
   It has also been used effectively to manage substance
   withdrawal^[151]76. Clonidine works, in part, by modulating downstream
   signalling via MAPK3^[152]77. In addition, MAPK3 signalling is also
   involved in the physiology of substance withdrawal^[153]78. The link
   between clonidine and MAPK3 through substance withdrawal and FXS
   supports the rationale behind why Sulindac can be a possible treatment
   for FXS (Fig. [154]3b).

   Baclofen, a GABAB receptor agonist, has shown efficacy in FXS mouse
   models and mixed results in patients^[155]79, where it failed primary
   outcome measures but met certain secondary endpoints. Reasons for this
   are thought to be due to small cohorts and lack of patient
   stratification, rather than poor efficacy of the drug^[156]80. Baclofen
   has also been found to be effective in treating trigeminal neuralgia
   through COX2 inhibition^[157]81,[158]82. Trigeminal neuralgia has been
   associated with MAPK3 signalling^[159]83, which is the same signalling
   pathway linking Sulindac as a possible treatment for FXS
   (Fig. [160]3b). In addition, Aripiprazole, has been effective in
   treating both FXS and idiopathic autism^[161]84 and has been shown to
   modulate MAPK3 signalling^[162]85,[163]86, which is what links this
   drug to Sulindac.

   Overall, our evidence chains imply that Sulindac might be a good
   treatment for FXS due to its link through PTGS1, PTGS2 and MAPK3
   signalling. These suggestions from our evidence chains analysis are
   further validated using internal in vivo data, as we observed 5 mg/kg
   Sulindac significantly reducing the expression of MAPK3, PTGS1 and
   PTGS2 in the cortex of Fmr1 KO1 mice (Table [164]2). Further to the
   previous discussion, additional literature evidence suggests that
   reducing MAPK3 signalling has benefits in alleviating the symptoms and
   pathophysiology in preclinical models of FXS^[165]87–[166]90, which
   further strengthens the use of Sulindac as a potential treatment for
   FXS.

Parkinson’s disease and cystic fibrosis case study

   Typically, drug discovery projects focus on uncovering novel
   drug-disease relationships and forming hypotheses regarding the
   potential effectiveness of modulating that link, for example with a
   small molecule. However, here we intend to establish the performance
   and validate the utility of our evidence chains filtering approach by
   rediscovering a set of known treatment relationships along with
   evidence chains that may explain the therapeutic basis of the
   treatments. We are presenting these results as a means of validating
   the method’s output as baseline justification, demonstrating its
   capability to reconstruct information that is already well-known in the
   scientific literature. To achieve this, we collaborated with curators
   who performed rigorous manual curation of the literature to compile a
   comprehensive list of approved treatments for cystic fibrosis and
   Parkinson’s diseases. These treatments were curated based on their
   relevance to the disease itself or its associated symptoms
   (phenotypes). In total, we identified 45 approved treatments for
   Parkinson’s disease and 17 for cystic fibrosis. Prior to running
   experiments, we removed all direct links between the approved
   treatments and the disease in the knowledge graph. Our pipeline was
   able to predict these approved treatments (44 out of 45 approved drugs
   for Parkinson’s disease and all 17 treatments for cystic fibrosis) in
   the absence of known links and successfully generated the corresponding
   evidence chains for those treatments.

   From the extensive list of initial rules generated by AnyBURL for both
   case study disease predictions, we undertook a manual curation effort
   involving collaboration with drug discovery scientists, who identified
   rules with the potential to yield biologically relevant evidence
   chains, as previously described. Our automated filtering model retained
   all the curated rules for both diseases. This outcome affirms that the
   method effectively preserved valuable information, ensuring that it was
   not overlooked prior to the path generation process.

   Figure [167]4 illustrates evidence chains generated for cystic fibrosis
   (Fig. [168]4a) and Parkinson’s disease (Fig. [169]4b) for few of the
   approved treatments predicted. Cystic fibrosis (CF) is a rare genetic
   autosomal recessive disorder caused by mutations in the cystic fibrosis
   transmembrane conductance regulator (CFTR) gene. Notably, the
   importance of CFTR for cystic fibrosis is also observable in
   Fig. [170]4a, which clearly demonstrates the ability of our evidence
   chains methodology to identify the most pertinent information about the
   disease.

Fig. 4. Evidence chains linking approved treatments to the case study
diseases.

   [171]Fig. 4
   [172]Open in a new tab

   a Most significant evidence chains extracted for cystic fibrosis b Most
   significant evidence chains extracted for Parkinson’s disease c
   Evidence chains linking treatment drugs to cystic fibrosis that are
   less significant. Yellow symbol indicates the type compound, white
   indicates disease, green indicates gene, red indicates phenotype and
   blue indicates pathway.

   Our evidence chains show both Mannitol and Tobramycin as potential
   treatments for cystic fibrosis, as they have been traditionally used to
   treat bronchiectasis, which can be caused by chronic infections or
   cystic fibrosis^[173]91. QBW251 (Icenticaftor), which functions as a
   potentiator of the CFTR protein by binding and unlocking the CFTR
   channel to facilitate chloride ion transport^[174]92, has also shown
   clinical efficacy in treating cystic fibrosis and chronic obstructive
   pulmonary disease^[175]92–[176]94. Similarly, our predictions
   identified Ivacaftor, Tezacaftor, and Lumacaftor as potential
   treatments due to their similar mechanism of action with QBW251.
   Lumacaftor’s link to cystic fibrosis in our evidence chains was through
   improving pancreatic functioning in acute pancreatitis and this is a
   symptom associated with the disease^[177]95,[178]96.

   Interestingly, we also found that our unique auto-filtering approach
   can generate potentially valuable rules and evidence that, while not
   originally biologically significant to the disease of interest, might
   still be useful for further experimentation in some cases and even
   uncover novel rationale that is not known. We showcase this with cystic
   fibrosis. Specifically, Acetylcysteine, an approved cystic fibrosis
   treatment, was found to have a significant number of evidence chains
   linking it to the disease. Figure [179]4a displays examples of strong
   evidence chains indicating that Acetylcysteine could be beneficial in
   addressing pulmonary fibrosis symptoms, which is a common complication
   in cystic fibrosis patients suggesting it may be a potential treatment.
   However, in Fig. [180]4c it becomes difficult to determine whether
   Acetylcysteine is a viable candidate for treating cystic fibrosis as
   the evidence chains shown in Fig. [181]4c are more indirect. The first
   evidence chain in Fig. [182]4c suggests that both Acetylcysteine and
   Tenapanor are being tested as treatments for end-stage renal failure.
   Kidney failure is a complex health issue that can arise due to a
   variety of factors, so the fact that two compounds are being tested as
   treatments for this condition does not necessarily mean that they will
   be effective treatments. Secondly, there are 4367 interventional
   clinical trials for kidney failure listed in Clinicaltrials.gov, so the
   fact that a compound is being tested for this condition does not
   necessarily mean that it is an approved and reliable treatment.
   Therefore, compared to the evidence presented in Fig. [183]4a, the
   evidence presented in Fig. [184]4c is not as strong or biologically
   relevant. However, for compounds and diseases for which there is not
   any strong direct evidence like the ones presented in Fig. [185]4a, the
   less biologically relevant evidence presented in Fig. [186]4c may still
   provide useful insight for further experimentation.

   Parkinson’s disease (PD) is a neurodegenerative disorder characterized
   by inflammation and oxidative stress, which play critical roles in its
   pathogenesis. Looking at the predictions and corresponding evidence
   chains in Fig. [187]4b, Cabergoline is suggested as a potential
   treatment for the disease, mainly due to its link with
   hyperprolactinemia and the JAK/STAT signalling pathway. Cabergoline,
   with its potent D2 selectivity, is currently an effective treatment for
   hyperprolactinemia^[188]97. The pathophysiology of Parkinson’s disease
   is associated with increased neuroinflammation, such as IL6 signalling
   through the JAK/STAT pathway.

   Rivastigmine, a cholinesterase inhibitor, used for the treatment of
   Alzheimer’s disease was also predicted as a treatment for Parkinson’s
   disease according to Fig. [189]4b. In the evidence chain analysis,
   Alzheimer’s disease is linked to it through the inflammatory cytokine,
   IL6. Commonly seen in various neurodegenerative diseases,
   neuroinflammation and inflammatory cytokines are important drivers of
   pathophysiology, and there is strong evidence that inflammatory
   cytokines, such as IL6 are involved in Parkinson’s disease and
   Alzheimer’s disease progression^[190]98,[191]99. The evidence chains
   show Entacapone as a treatment through a link with Huntington disease,
   another neurodegenerative condition linked with ataxia.

   Although the evidence shown here are for already approved treatments,
   we have included them as means to validate the approach and to sanity
   check the extracted evidence chains with already known information.
   Overall, all these evidence chains and their corresponding supporting
   information demonstrate the capability of our methodology to capture
   essential information for identifying potential treatments for diseases
   and to provide critical insights into how those drugs may treat the
   diseases.

Reduction in the number of paths generated

   To assess the overall efficiency of the automated pipeline, we compute
   all possible paths in the Healx KG up to 2-hop length between the
   approved treatment drugs and the disease for Parkinson’s disease and
   cystic fibrosis. In total 719,203 paths for Parkinson’s disease and
   310,901 paths for cystic fibrosis were computed as shown in
   Table [192]3. We also compute the total number of paths for each
   approved drug by using all rules given by AnyBURL for the drug without
   any filtering applied. Table [193]3 shows the number of all possible
   2-hop paths that could be generated, the number of paths extracted from
   rules given by AnyBURL, the number of paths after application of the
   auto-filtering model and the total reduction achieved for all the
   approved drugs listed for both diseases. In Fig. [194]1, we showed a
   complete workflow of how these evidence chains are generated with and
   without the auto-filtering method. With the automated filtering
   approach, we achieved a 77% reduction in evidence chains compared to
   the ones generated by AnyBURL for Parkinson’s disease resulting in a
   total of 95% reduction compared to all the possible paths that can be
   generated. Similarly, for cystic fibrosis, we achieved a 60% reduction
   compared to AnyBURL and in total 85% compared to the full list of
   paths. However, if this were a novel drug discovery programme, then
   there would still be a prohibitively large number of paths to explore
   by human experts.

Table 3.

   Reduction in evidence chains space with auto-filtering pipeline
   3-hop possible paths No. of AnyBURL paths No. of filtered paths Total
   reduction
   Parkinson's 719,203 156,254 (562k reduction) 34,738 (121k reduction)
   684 K
   Cystic fibrosis 310,901 112,276 (198k reduction) 44,025 (68k reduction)
   266 K
   [195]Open in a new tab

   In fact, the case studies presented here already have approved
   treatments and these nodes are highly connected in the graph. Even
   though we removed the direct links between the treatments and the
   disease in the input graph, it still produces a significant number of
   paths. In disease case studies without an approved treatment, we find
   that these numbers are lower compared to what is shown in Table [196]3.
   For example, in the FXS case study the method extracted 249 evidence
   chains for Sulindac and 324 for Ibudilast which is feasible to review
   by experts and demonstrates the applicability of the approach in
   diseases without an approved treatment.

Discussion

   We have demonstrated the usefulness of our automated methodology for
   distilling informative evidence chains useful in drug discovery through
   the study of Fragile X Syndrome, a rare genetic disorder and two other
   diseases, Parkinson’s disease and cystic fibrosis. Our contribution
   enables KBC models such as AnyBURL to be applied in establishing
   pharmacological rationale through the reduction of uninformative paths
   without loss of biological signal. We started with a symbolic model
   AnyBURL applied to the Healx KG to make treatment predictions and
   generate useful evidence chains explaining indicated treatments with an
   automatic filtering approach (explained in “Methods”). Our results from
   the FXS study demonstrate a strong correlation between the evidence
   chains and experimentally derived transcriptional changes in essential
   genes and hallmark pathways of the drug treatments indicating that the
   drugs may be causally linked with therapeutic benefit in the disease
   model. In both cystic fibrosis and Parkinson’s disease, we found the
   most relevant genes and pathways extracted automatically by the
   evidence chains. The approach can be applied consistently across
   multiple diseases, eliminating the need for manual curation of rules or
   evidence in the process. This clearly highlights the potential of
   automating the process of generating therapeutic rationale for drug
   predictions. We suggest this approach enables the power of tailored
   reinforcement learning to be applied more frequently as a tool for
   rapidly deriving mechanistic insights, reducing the time and cost
   investment required for laboratory experiments, and guiding further
   clinical development.

   As shown in Table [197]3, we achieved a 77% reduction in the paths
   generated for Parkinson’s disease and 60% for cystic fibrosis with the
   automatic filtering approach alone. While this still leaves many paths
   to explore for highly connected diseases in the graph, such as those
   with approved treatment, for rare disease case studies like FXS we find
   that these numbers are amenable to human triage. As future work we
   intend to reduce this space more by using additional rules that make
   use of confidence scores. Although rule confidence scores are produced
   by AnyBURL they do not translate to biological relevance. Rules with
   the highest confidence scores are not always the most significant or
   are too vague or generic from a drug discovery perspective. It is also
   important to note that these rules involve many independent degrees of
   freedom with their own assumptions. High confidence relations between
   any two given entities A and B and B and C does not imply a high
   confidence relationship between A and C. However, for future work
   assigning a rule or path confidence score is vital and could be done
   using better path ranking methodologies or even embedding evidence
   weights in the training process of the model, so that irrelevant chains
   are penalised pushing the model towards generating even more
   biologically relevant rules and paths.

   As discussed briefly in the Introduction section there are no benchmark
   datasets available to evaluate evidence chains generated for a given
   drug prediction in a specific disease. Creating a gold standard
   validation set of evidence chains for a list of drug-disease pairs
   would require decision-making on what is the rationale for choosing the
   top-ranking evidence chains, differs for each disease of interest, and
   therefore would require an inordinate amount of time to conduct expert
   review from curators, pharmacologists and bioinformaticians. As future
   work we plan to create such a benchmark dataset of highly relevant
   paths for drug-disease pairs to quantitatively validate the approach.

   We intend to also integrate data on protein-protein interactions,
   proteins and biological functions interactions and interactions between
   biological functions in the knowledge graph to better explain the
   mechanism of action of the drug in the disease of interest. Another
   limitation in this work is the use of publicly available, proprietary,
   and curated data sources alone for the input knowledge graph. Details
   of the data used in this study is shown in section Data in “Methods”.
   Enormous amounts of data are available from biological text sources
   processed by our natural language processing (NLP) pipelines, and they
   extract biological relations between entities. Yet using it here causes
   bottlenecks in the evidence chain generation and interpretation process
   due to the large number of rules/evidence that can be generated from
   this data. As part of our future work, we look for better ways to
   incorporate data processed by our NLP pipeline.

Methods

Data

   The data described in this paper producing all experimental results
   shown in “Results” were sourced from Healx’s proprietary Knowledge
   graph (Healx KG) which includes internally curated edges for rare
   diseases, compounds, and other entities. It is comprised of various
   public and commercial data sources including ChEMBL^[198]100,
   CTD^[199]101, Drugbank^[200]102, HGNC^[201]103, Human Phenotype
   Ontology^[202]104, KEGG Pathway^[203]38, NCBI Gene^[204]105,
   Orphanet^[205]29, PanelApp^[206]33, Open Targets^[207]33,
   Pharos^[208]34, SIDER^[209]106, Geneshot^[210]35, UniProt^[211]107,
   NDF-RT^[212]108, OMIM^[213]109, Reactome^[214]39, Wikipathways^[215]40,
   MONDO^[216]28, MeSH^[217]110, Pharmaprojects and together with
   internally curated data. The internally curated data added to the
   knowledge graph comprises around 2000 associations relating to
   diseases, phenotypes, compounds, and targets. All information was
   curated based on published sources of information and does not require
   ethical approval. Table 1 in the supplementary file provides a complete
   list of publicly available and commercial or licensed sources and the
   versions used in building the Healx KG. Additional information is
   extracted from public documents and package inserts. All data is stored
   as a set of triples consisting of head and tail entities connected via
   a relation type. The Healx KG consists of 8 node types including
   Compound, Disease, Gene, Protein, Pathway, Mechanism, ATC and Phenotype
   and 29 edge types. Supplementary Table [218]2 provides information on
   the types of edges in the graph. In Supplementary Fig. [219]1, we also
   provide a meta-graph showing the different node and edge types and how
   they are connected in the Healx KG.

   We share a non-confidential version of the Healx KG, a subgraph of the
   full KG which contains 129,501 edges and 37,331 nodes. This graph does
   not contain nodes and edges coming from proprietary data sources and
   internal curation. To enable the reproducibility of the filtering
   method, we have repeated all experiments with this subgraph for
   Parkinson’s disease. In Supplementary Fig. [220]2, we have presented a
   few interesting evidence chains observed for predictions, Rivastigmine,
   Entacapone and Cabergoline with their connections to Parkinson’s
   disease using this Healx subgraph. We also share the percentage of
   reduction achieved in Supplementary Table [221]5 showing the total
   possible number of 2-hop paths that could be generated for 34 predicted
   approved treatments for Parkinson’s in this graph and the number of
   paths extracted for those predictions with AnyBURL and the
   auto-filtering model. On average we observe 97% reduction in the number
   of paths for the predictions. The main results of the manuscript are
   produced using the source code we are sharing with the manuscript and
   the full Healx KG. This accounts for any minor differences in specific
   evidence chain composition shown in the Supplementary results for
   Parkinson’s disease, yet this does not limit or distract from the main
   learning of our manuscript regarding reduction in uninformative paths
   without loss of biological signal and the general applicability of our
   workflow in drug discovery.

Anytime bottom-up rule learning (AnyBURL)

   The reinforcement learning-based approach in AnyBURL yields an
   explanation in terms of the rules that make a prediction. The learning
   process is conducted in a sequence of time spans of length t[s]. Within
   a time span the algorithm learns as many rules as possible by
   iteratively sampling random paths with the goal of covering and
   satisfying as many training facts as possible within a given time span.
   A rule takes the form, h(c[0], c[1]) <- b[1](c[1], c[2]),…, b[n](c[n],
   c[n+1]) with a ground path rule of length n where the head of the rule
   is h(…) and b[1](…) through b[n](…) is its body and c[0] to c[n] are
   variables that can represent entities in the knowledge graph. Each rule
   has a confidence score usually defined as the number of body
   groundings, divided by the number of those body groundings that make
   the head true. Variables in rules are instantiated with specific
   entities to obtain rule groundings^[222]111. This is usually done by
   instantiating variables in the rule body by filling in the entities
   whose relational triples match the rule body. A corresponding new
   triple is obtained by instantiating the rule head. The instantiated
   rule is called a grounding. The learnt rules are then applied to make
   predictions given the fact that the predicted entity is supported by at
   least one rule. This generates a set of predicted candidates for a
   given query along with the set of rules that suggested it.

   Candidate predictions are ordered via the maximum confidence of all
   rules that have generated the candidates. If the maximum score of
   several candidates is the same, the candidates are ordered via the
   second-best rule that generates them, and so on, until a rule is found
   that makes a difference. AnyBURL can learn rules in a short time,
   defined usually in seconds as a parameter t[s] which is quite
   competitive compared to many other approaches since unlike embedding
   models it does not require time-consuming hyperparameter tuning to
   achieve good performance. The learning and prediction yield a set of
   ranked candidate predictions and associated rules. In Fig. [223]1, we
   have shown examples of rules learnt by AnyBURL. The body of the rule
   contains relationships in the graph between entities represented by
   variables forming paths. Each rule shown has a confidence score
   indicative of the significance to the candidate prediction. However,
   following review, we find these scores do not translate to a simple
   measure of biological importance. We note, the highest-ranking rules
   are not always the most significant, or are too vague or generic from a
   drug discovery perspective.
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</
   mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi><mo>←</mo><mi>d</mi><mi>e</mi><mi>s</m
   i><mi>c</mi><mi>e</mi><mi>n</mi><mi>d</mi><mi>a</mi><mi>n</mi><mi>t</mi
   ><mo>−</mo><mi mathvariant="normal">Disease</mi> :MATH]
   1
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</
   mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi><mo>−</mo><mi>a</mi><mi>n</mi><mi>c</m
   i><mi>e</mi><mi>s</mi><mi>t</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]
   2
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>b</mi><mi>i</mi><mi>n</
   mi><mi>d</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Gene</mi><mo>−</mo><mi>p</mi><mi>a</mi><mi>r</mi><
   mi>t</mi><mi>i</mi><mi>c</mi><mi>i</mi><mi>p</mi><mi>a</mi><mi>t</mi><m
   i>e</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">pathway</mi><mo>−</mo><mi>i</mi><mi>n</mi><mi>v</m
   i><mi>o</mi><mi>l</mi><mi>v</mi><mi>e</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]
   3
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>i</mi><mi>n</mi><mi>v</
   mi><mi>o</mi><mi>l</mi><mi>v</mi><mi>e</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Pathway</mi><mo>−</mo><mi>i</mi><mi>n</mi><mi>v</m
   i><mi>o</mi><mi>l</mi><mi>v</mi><mi>e</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi><mo>−</mo><mi>a</mi><mi>n</mi><mi>c</m
   i><mi>e</mi><mi>s</mi><mi>t</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]
   4
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</
   mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi><mo>−</mo><mi>a</mi><mi>s</mi><mi>s</m
   i><mi>o</mi><mi>c</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>e</mi><mi>s</mi
   ><mo>→</mo><mi
   mathvariant="normal">Gene</mi><mo>←</mo><mi>a</mi><mi>s</mi><mi>s</mi><
   mi>o</mi><mi>c</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>e</mi><mi>s</mi><m
   o>−</mo><mi
   mathvariant="normal">Disease</mi><mo>−</mo><mi>a</mi><mi>n</mi><mi>c</m
   i><mi>e</mi><mi>s</mi><mi>t</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]
   5

   For example, rules (1) and (2) shown above frequently receive high
   confidence scores compared to other rules but are less significant in
   the drug discovery context since they only have ancestor and descendant
   relationships that do not inform any mechanistic understanding of how
   the drug treats a disease. Although rules (3), (4) and (5) scored low
   they contain more biologically informative relationships such as
   disease–gene, gene-pathway associations and so generate more compelling
   evidence with greater interpretability in a drug discovery context.
   Therefore, we considered all rules associated with a single prediction
   to generate evidence for the predictions, not applying any confidence
   score filtering in terms of rule composition. In all experiments, known
   treatment relationships between the disease of interest and the drug
   were removed from the Knowledge graph before training the AnyBURL model
   on them.

Evidence generation with automatic filtering

   Our proposed methodology utilizes automatic filtering to extract
   biologically meaningful evidence from a list of predictions and their
   corresponding rules, with a focus on identifying paths in the graph
   that are most relevant to the disease of interest which we call the
   evidence chains. The complete workflow is shown in Fig. [224]2. The
   method is flexible and allows for the disabling of certain filters as
   needed to meet the requirements of the project, making the approach
   flexible and consistently applied across multiple disease projects. A
   path, in this context, refers to a sequence of nodes and relations that
   starts at a compound entity and ends at a disease entity, providing
   evidence that the compound can potentially be useful in treating the
   disease. There are four stages to the automated filtering process
   starting with rule filtering, significant path filtering, deductive
   reasoning-based path building and gene/pathway filtering as shown in
   Fig. [225]2. Each stage is described below.

Rules-based filtering

   From the initial set of rules suggested for candidate predictions, we
   filter out rules containing less relevant information in the biological
   context. Rules containing more than one “disease_disease_ancestor “or
   “disease_disease_descendant” relationship will be ignored. Rules
   containing more than two “in trial for”, “in vivo preclinical trial
   for” and “disease _compound has orphan designation for” relationships
   will be ignored. Examples of such rules are shown below.
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</
   mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Phenotype</mi><mo>←</mo><mi>p</mi><mi>r</mi><mi>e<
   /mi><mi>s</mi><mi>e</mi><mi>n</mi><mi>t</mi><mi>s</mi><mo>−</mo><mi
   mathvariant="normal">Disease</mi><mo>←</mo><mi>a</mi><mi>n</mi><mi>c</m
   i><mi>e</mi><mi>s</mi><mi>t</mi><mi>o</mi><mi>r</mi><mo>−</mo><mi
   mathvariant="normal">Diseas</mi><mi>e</mi><mo>−</mo><mi>a</mi><mi>n</mi
   ><mi>c</mi><mi>e</mi><mi>s</mi><mi>t</mi><mi>o</mi><mi>r</mi><mo>→</mo>
   <mi mathvariant="normal">Disease</mi> :MATH]
   [MATH: <mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi><mo>←</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>−</mo><mi
   mathvariant="normal">Compound</mi><mo>−</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Disease</mi> :MATH]

   The underlying rationale for this filter is firstly an evidence chain
   with more than one ancestor or descendant relationship becomes
   uninformative when trying to understand a potential treatment like
   explained in Fig. [226]1c. Depending on how the graph was constructed
   and how disease sub-types are categorised in the graph this filter
   could be altered. In our case with Healx KG, we expected one link of
   the sub-type with the actual rare disease, so the essential information
   is still captured in the evidence chains. Secondly, there are typically
   a wide range of trials, orphan designations or in vivo preclinical
   trials associated with a given disease, many of which fail at various
   stages and do not ultimately lead to a treatment for the disease. In
   contrast to the “compound treats disease” relationship, which is
   restricted to the “approved” drug space, other compound-disease
   relationships such as in trial for or in vivo preclinical trial for may
   be considered more “relaxed”. Including more than one “relaxed” edge in
   the evidence chains may lead to broader and noisier explanations being
   present in the evidence set. Therefore, paths for each prediction are
   generated only for the remaining set of rules.

Significant path filtering

   This filter focuses on retaining only the significant paths. For
   example, in the case of Levofloxacin treating cystic fibrosis, two
   paths may be generated as shown in Fig. [227]5, one with an in trial
   for relationship between the disease Bronchitis and the compound, and
   another with a treats relationship between the same entities. However,
   the treats relationship is considered more significant than the in
   trial for relationship, and only the former is retained during the
   filtering process. This same approach is applied for in vivo
   preclinical trial for and has orphan designation for relationships too.
   They are ignored if a treats relation exists between the same entities
   in the evidence chain.

Fig. 5. Significant path-based filtering in evidence chains.

   [228]Fig. 5
   [229]Open in a new tab

   The paths between Levofloxacin and cystic fibrosis showing a treats and
   an in trial for link between the drug and the disease Bronchitis. The
   yellow symbol indicates the type of compound, white indicates disease.

Deductive reasoning-based path building

   In this stage paths which were not already extracted by the pipeline
   using AnyBURL are deduced using significant paths obtained in the
   previous stage. These paths still exist in the graph but are not
   directly extracted by the pipeline, most likely due to not learning a
   specific rule during the training or the rule not being applied during
   the prediction stage of the AnyBURL model. In the Levofloxacin example
   below there are four paths generated by the pipeline as shown. Each
   path corresponds to a different rule, between the entities Levofloxacin
   and cystic fibrosis.
   [MATH: <mi
   mathvariant="normal">Levofloxacin</mi><mo>−</mo><mi>i</mi><mi>n</mi><ms
   pace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Bronchitis</mi><mo>←</mo><mi
   mathvariant="normal">treats</mi><mo>−</mo><mi
   mathvariant="normal">Dornase</mi><mspace
   width="0.25em"></mspace><mspace width="1.0em"></mspace><mi
   mathvariant="normal">alfa</mi><mo>−</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi>C</
   mi><mi mathvariant="normal">ystic</mi><mspace
   width="0.25em"></mspace><mi mathvariant="normal">fibrosis</mi> :MATH]
   [MATH: <mi
   mathvariant="normal">Levofloxacin</mi><mo>−</mo><mi>i</mi><mi>n</mi><ms
   pace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Bronchitis</mi><mo>←</mo><mi>t</mi><mi>r</mi><mi>e
   </mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>−</mo><mi>D</mi><mi
   mathvariant="normal">ornase</mi><mspace width="0.25em"></mspace><mspace
   width="1.0em"></mspace><mi
   mathvariant="normal">alfa</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</mi><
   mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Cystic</mi><mspace width="0.25em"></mspace><mi
   mathvariant="normal">fibrosis</mi> :MATH]
   [MATH: <mi
   mathvariant="normal">Levofloxacin</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi
   >e</mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Bronchitis</mi><mo>←</mo><mi>t</mi><mi>r</mi><mi>e
   </mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>−</mo><mi
   mathvariant="normal">Dornase</mi><mspace
   width="0.25em"></mspace><mspace width="1.0em"></mspace><mi
   mathvariant="normal">alfa</mi><mo>−</mo><mi>i</mi><mi>n</mi><mspace
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>→</mo><mi
   mathvariant="normal">Cystic</mi><mspace width="0.25em"></mspace><mi
   mathvariant="normal">fibrosis</mi> :MATH]
   [MATH: <mi
   mathvariant="normal">Levofloxacin</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi
   >e</mi><mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Bronchitis</mi><mo>←</mo><mi>i</mi><mi>n</mi><mspa
   ce
   width="0.25em"></mspace><mi>t</mi><mi>r</mi><mi>i</mi><mi>a</mi><mi>l</
   mi><mspace
   width="0.25em"></mspace><mi>f</mi><mi>o</mi><mi>r</mi><mo>−</mo><mi
   mathvariant="normal">Dornase</mi><mspace
   width="0.25em"></mspace><mspace width="1.0em"></mspace><mi
   mathvariant="normal">alfa</mi><mo>−</mo><mi>t</mi><mi>r</mi><mi>e</mi><
   mi>a</mi><mi>t</mi><mi>s</mi><mo>→</mo><mi
   mathvariant="normal">Cystic</mi><mspace width="0.25em"></mspace><mi
   mathvariant="normal">fibrosis</mi> :MATH]

   A graphical representation of the above paths is shown in Fig. [230]6a.
   Given that treats relationship is more specific than in trial for and
   and it exists between all entities but in different paths, we can
   deduce the path highlighted in red in the figure. The deduced path is
   more specific compared to all five paths extracted as shown in
   Fig. [231]6b. This path builder helps to further check the output from
   previous filters and remove other redundant paths that still exist in
   the generated paths and to deduce paths that have not been directly
   extracted by the pipeline.

Fig. 6. Deductive reasoning applied to evidence chains output.

   [232]Fig. 6
   [233]Open in a new tab

   a Paths existing between the entities before deducing significant paths
   b The most significant path deduced after application of the deductive
   reasoning-based path builder. Yellow symbol indicates the type
   compound, white indicates disease.

Gene- and pathway-based filtering

   Once the paths are generated, we filter for paths that contain at least
   one of the genes or pathways that are considered mechanistically
   important to the disease given by bioinformaticians and curators after
   an extensive disease landscape study for the given disease. How we
   obtain these Gene-Disease associations is explained under Fig. [234]2
   in “Biological relevance gathering”. The rationale for including a gene
   and pathways-based filter is to reduce the set of evidence chains to
   those that are more disease-informative and thereby the most likely
   candidates for targeting with a suitable therapeutic hypothesis. The
   result is to focus on key genes and pathways, over the list of all
   possible genes and pathways that can link a drug to the disease. The
   pipeline is flexible so individual filters could be turned off. If a
   more comprehensive set of paths are required, the gene/pathway filter
   could be disabled.

Preclinical experimental details

   Following are the details of the preclinical experimentation on mice
   dosing and RNA-seq analysis in FXS case studies. All work in the study
   involving animals was carried out by a third-party CRO in line with the
   requirements of the United Kingdom Animals (Scientific Procedures) Act,
   1986. Due to Fragile X Syndrome being an X-linked inherited disorder,
   with the most severe phenotype persisting in males, we only tested male
   mice. In vivo proof-of-concept studies with Ibudilast (Merck, I0157)
   and Sulindac (Merck, S8139) confirmed that the drug was efficacious in
   both the Fmr1 KO1 and KO2 mouse models. The Fmr1 KO1 mouse model was
   developed by insertional inactivation of exon 5 of the Fmr1 gene,
   leading to the loss of FMRP expression as described at the
   Dutch–Belgian Fragile X Consortium^[235]112. Despite the loss of exon
   5, the Fmr1 gene still has an intact promoter, resulting in an abnormal
   residual Fmr1 RNA transcript being expressed. The new Fmr1 KO2 mutant
   was developed as an alternate model in which the Fmr1 RNA transcript
   expression is abolished, thereby preventing FMRP expression. Both mouse
   models were extensively validated by others, and to date, no major
   difference has been reported^[236]113–[237]118. Healx used the Fmr1 KO1
   mouse model in earlier studies, but it was later superseded by the Fmr1
   KO2 model, which lacks expression of FMRP and any Fmr1
   transcript^[238]119,[239]120.

   Fmr1 KO2 model was inbred at GeN DDI, Fmr1 KO1 model was inbred at
   Jackson laboratories. Fmr1 KO2 mice were backcrossed for at least eight
   generations to C57BL/6J, and WT C57BL/6 J littermates were used as
   controls. Fmr1 KO1 mice were backcrossed for at least eight generations
   to FVB/n, and WT FVB/n littermates were used as controls. Heterozygous
   breeding pairs were used to generate WT and KO littermates for all
   studies. Ten male mice (aged 2 months) were used for each treatment
   group across all behavioural experiments. Only male mice were used for
   all experiments because FXS is an X-linked inherited disorder with the
   most severe phenotype persisting in males. Fmr1 KO1, Fmr1 KO2 and WT
   littermate mice were injected intraperitoneal (i.p.) with vehicle [10%
   DMSO (Merck, 276855) in 90% (20% Captisol (Selleckchem, S4592) in
   Saline)] or ibudilast or sulindac for 2 weeks. All drugs were
   formulated and dosed in the vehicle solution. Administration volumes
   were 3.85 mL/kg, such that an adult mouse weighing 26 g received a
   0.1 mL injection volume. For all test articles, the volume to be
   administered was based on each mouse’s body weight. Animals were housed
   in groups of 4 animals per cage, of the same genotype in a temperature-
   and humidity-controlled room with a 12-h light–dark cycle (lights on 7
   a.m.–7 p.m.). Mice were housed in commercial plastic cages
   (40 × 23 × 12 cm) with Aspen bedding and without environmental
   enrichment on a ventilated rack system. Food and water were available
   ad libitum. Experiments were conducted in line with the requirements of
   the United Kingdom Animals (Scientific Procedures) Act, 1986. To
   identify transcriptomic changes as a result of drug treatment, Fmr1 KO2
   mice were dosed with Sulindac (2.5 mg/kg or 5 mg/kg) and Fmr1 KO1 mice
   were dosed with Ibudilast (3 mg/kg or 6 mg/kg) for 2 weeks. Following
   dosing mice were cervically dislocated before dissecting out the
   hippocampus and cortex for storage at −80 °C. Brains were transported
   to Eurofins genomics for RNA extraction and RNA-seq generation. RNA-seq
   reads were aligned to the Mouse B38 genome (mm10) using OmicsoftGenCode
   V24. Low read count genes were filtered out (1 CPM in at least one
   sample, median <10 across all samples). Gene count matrices were loaded
   into R (V4.2.0). Differential gene expression analysis was performed
   using DESeq2^[240]121. Gene set enrichment analysis (GSEA) was
   performed using a ranked gene list from the DESeq2 results using
   fgsea^[241]122. Genes were ranked by logFC and raw P value.
   Statistically significant thresholds were abs(logFC) >0.58 and P value
   <0.05. Pathway information was taken from KEGG.

Reporting summary

   Further information on research design is available in the [242]Nature
   Portfolio Reporting Summary linked to this article.

Supplementary information

   [243]Supplementary Information^ (348.3KB, pdf)
   [244]Reporting Summary^ (290.1KB, pdf)
   [245]Peer Review File^ (3.4MB, pdf)

Source data

   [246]Source Data^ (1.1MB, xlsx)

Author contributions

   S.S. designed and performed research including analysis of the data
   with the method; B.O. performed critical review and interpretation of
   the results, designed the automatic filtering pipeline, curated ground
   truth rules for disease case studies; S.S. and J.C. contributed to the
   creation of the software pipeline including evidence chains generation
   and automatic filtering; D.O’D contributed to the subgraph creation and
   the code repository enabling reproducibility of the results; W.C., A.C.
   and E.T. analysed evidence chains and preclinical experimental data for
   FXS and interpreted results; S.S., B.O., W.C. and I.R. wrote the paper;
   N.T. and I.R. reviewed all experimental results for publication.

Peer review

Peer review information

   Nature Communications thanks Krishna Bulusu, and Frank Kooy for their
   contribution to the peer review of this work. A peer review file is
   available.

Data availability

   In Supplementary Table [247]1, we provide details of both public and
   commercial data sources used in Healx KG. CTD^[248]101, SIDER^[249]106,
   DrugBank^[250]102, KEGG^[251]38, OMIM^[252]109 and Pharmaprojects are
   commercial data sources. CTD^[253]101 data can be used only for
   research and educational purposes, and any Commercial users are
   required to purchase a license to access data from the CTD website.
   SIDER^[254]106 data is licenced under a creative commons
   Attribution-Noncommercial-Share Alike 4.0 License. For commercial use
   or customized versions, license should be obtained from biobyte
   solutions GmbH. Use and re-distribution of the content of
   DrugBank^[255]102 for any purpose requires a license. Academic users
   may apply for a free license for certain use cases and all other users
   require a paid license. KEGG^[256]38 database is available for academic
   use but any commercial use requires a license. Use of OMIM^[257]109 is
   provided free of charge to any individual for personal use, for
   educational or scholarly use, or for research purposes through the
   front end of the database. Commercial users who want to download all or
   part of OMIM must obtain a license by paying applicable licensing fees
   to and entering into a license agreement with Johns Hopkins University
   (JHU). Pharmaprojects comes with a commercial license granting full
   access to their APIs. We have shared a subgraph of the Healx KG data
   created for reproducibility purposes and is available in github,
   [258]https://github.com/healx/automated-biological-evidence-generation-
   in-drug-discovery^[259]123. We have shared the experimental results
   from this subgraph showing a few interesting evidence chains generated
   for Parkinson’s disease in Supplementary Fig. [260]2 and the percentage
   of reduction achieved in Supplementary Table [261]5. The source data
   for this is provided as a Source Data file. The raw sequence (RNA-seq)
   data used in the Fragile X study has been deposited in the NCBI
   Sequence Read Archive (SRA)) under BioProject PRJNA1096445 titled
   ‘Brain-specific gene expression changes in FXS mouse model after
   Sulindac or Ibudilast treatment’. The list of curated rules from the
   complete set given by AnyBURL as given by drug discovery scientists for
   both Parkinson’s disease and cystic fibrosis are included in
   Supplementary Table [262]3. The rules set produced for predictions in
   both diseases have overlaps, hence they are presented in a single
   table. We also present FXS rules separately in Supplementary
   Table [263]4 which were automatically generated by the pipeline before
   evidence chain generation. The curated list was only used in
   Parkinson’s and cystic fibrosis to check if the automatic filtering
   retained these useful rules. All rules shared in the supplementary
   files are from the full Healx KG. [264]Source data are provided with
   this paper.

Code availability

   The Python implementation of our methodology is available at
   [265]https://github.com/healx/automated-biological-evidence-generation-
   in-drug-discovery^[266]123. The AnyBURL v21 Java model was trained
   using OpenJDK v14 and small bug fixes to this model have been made and
   are available in the Github repository. All downstream analyses were
   performed using Python 3.11, with additional packages used are ‘attrs‘,
   ‘click‘ and ‘numpy‘. Please read the README document for information on
   downloading and running the code.

Competing interests

   The authors declare no competing interests.

Footnotes

   Publisher’s note Springer Nature remains neutral with regard to
   jurisdictional claims in published maps and institutional affiliations.

Supplementary information

   The online version contains supplementary material available at
   10.1038/s41467-024-50024-6.

References