Abstract Accumulating evidence links numerous abnormalities in cerebral metabolism with the progression of Alzheimer's disease (AD), beginning in its early stages. Here, we integrate transcriptomic data from AD patients with a genome-scale computational human metabolic model to characterize the altered metabolism in AD, and employ state-of-the-art metabolic modelling methods to predict metabolic biomarkers and drug targets in AD. The metabolic descriptions derived are first tested and validated on a large scale versus existing AD proteomics and metabolomics data. Our analysis shows a significant decrease in the activity of several key metabolic pathways, including the carnitine shuttle, folate metabolism and mitochondrial transport. We predict several metabolic biomarkers of AD progression in the blood and the CSF, including succinate and prostaglandin D2. Vitamin D and steroid metabolism pathways are enriched with predicted drug targets that could mitigate the metabolic alterations observed. Taken together, this study provides the first network wide view of the metabolic alterations associated with AD progression. Most importantly, it offers a cohort of new metabolic leads for the diagnosis of AD and its treatment. Introduction Alzheimer's disease (AD) is the most common form of dementia. It is estimated that AD affects more than 35 million patients worldwide and its incidence is expected to increase with the aging of the population. Although extensive investigations of AD have taken place over the past few decades, its pathogenesis has yet to be elucidated. Currently no treatment is available to prevent or halt the progression of AD. Moreover, the clinical diagnosis of AD is not possible until a patient reaches the dementia phase of the disease [27][1]. A more accurate and earlier diagnosis of AD could enable the use of potential disease-modifying drugs and thus, there is a need for biological markers for the early stages of AD [28][2]. Metabolic alterations have been proposed to be involved in AD from the early stages of the disease [29][3]. Increasing evidence indicates an antecedent and potentially causal role of brain hypometabolism in AD pathogenesis [30][4]. Perturbations in mitochondrial function have long been observed in AD patients, including decreased activity of key mitochondrial enzymes [31][4], [32][5]. Consequently, ATP production and oxygen consumption become impaired [33][6]. Impaired glucose transport has also been reported in AD brains. Moreover, there is a link between cholesterol turnover and neurodegenerative diseases and hypercholesterolemia has been proposed as a risk factor for AD [34][7]. However, the relationship between cholesterol levels and the clinical manifestation of dementia remains unclear [35][8]. There is also a debate regarding the role of certain vitamins such as vitamin D and folic acid in the pathogenesis of AD [36][9], [37][10]_ENREF_14. Clearly from all of this mounting evidence, multiple metabolic pathways may play a key role in AD's progression. Recent studies of gene expression from brains of AD patients further point to the strong association between metabolic alterations and AD, already from the early stages of the disease [38][11], [39][12]. However, such gene expression analyses have been limited to transcriptional alterations and therefore cannot encompass the effects of putative post-transcriptional modifications that are known to play an important role in metabolism [40][13]. Furthermore, they do not allow the identification of biomarkers and drug targets in any direct manner. Our aim here is to go beyond these gene expression results and to elucidate the metabolic changes in AD by employing the increasingly prevalent toolkit of analysis methods provided by the emerging field of Genome-Scale Metabolic Modeling (GSMM). GSMMs have become trusted tools in the study of metabolic networks [41][14], and provide a platform for interpreting omics data in a biochemically meaningful manner [42][15]. GSMM analysis mostly relies on constraint-based modeling (CBM), in which constraints are systematically imposed on the GSMM solution space, and the outcomes of the model are limited to physically realizable phenotypes. GSMMs have been extensively used for the study of metabolism in microorganisms and in humans both in health and disease, enabling the prediction of various metabolic phenotypes such as enzyme activities and metabolite uptake and secretion fluxes, as well as interpretation of various types of high throughput data, often yielding clinically relevant results [43][16]–[44][21]. In a recent GSMM paper studying brain metabolism, three different neuronal sub-types were reconstructed in a GSMM of brain energy metabolism [45][22]. Focused on the core of cerebral energy metabolism, this reconstruction has suggested that glutamate decarboxylase provides a neuroprotective effect which is correlated with the brain regional specificity of AD [46][22]. Our investigation begins with an effort to harness GSMM to systematically describe the metabolic state in AD on a global, network level. We do this by employing a method termed integrative Metabolic Analysis Tool (iMAT), which incorporates gene expression into a GSMM to predict metabolic flux activity [47][18]. This method has already been shown to successfully predict tissue specific metabolic activity in several healthy human tissues, including the brain [48][18]. iMAT incorporates gene expression to predict global metabolic flux activity that is the most consistent with known constraints across the entire metabolic network, and reflects post transcriptional modifications that are not evident in the raw expression data ([49]Figure 1). We utilized a relatively large dataset of gene expression microarrays from the cortex of AD patients and elderly controls [50][23] (including 363 samples), which we integrated with the human metabolic model to study the metabolic changes in AD. This model-based genome-scale view of AD metabolism leads to the identification of various pathways whose activities are altered significantly in AD, and importantly, are not revealed by standard pathway enrichment analysis of the raw gene expression solely, in a model-free manner. We next predict novel biomarkers for AD by comparing predicted uptake and secretion fluxes of various metabolites as the disease progresses. Finally, we predict perturbations in the metabolic network that can transform the metabolic state of AD back closer to a healthy state, highlighting new potential metabolic drug targets for AD that may work on a global, network level. Figure 1. The workflow of iMAT analysis. [51]Figure 1 [52]Open in a new tab A. First, we discretize the expression of each metabolic gene measured into 3 levels: high, moderate and low. Next, iMAT integrates these expression levels into the human metabolic model by maximizing the number of enzymes whose predicted flux activity is consistent with their expression level, yielding a prediction of the overall network flux distribution that is most consistent with the model's constraints under steady state. This analysis is done separately for the control and the AD states. B. A Toy example of the integration of the metabolic network and gene-expression by iMAT and the prediction of enzyme flux-activities (taken and modified from [53][18]). Circular nodes represent metabolites, solid edges represent reactions, and diamond nodes represent enzymes associated by arrows to the reactions they catalyze. Grey, red and green represent moderate, significantly low and significantly high expression of the enzyme-encoding genes, respectively. The predicted flux involving the activation of reactions is shown as green edges. Enzymes E4 and E7 are predicted to be post-transcriptionally up-regulated and down-regulated respectively. Methods Datasets The microarrays data used in this study were obtained from the Gene Expression Omnibus (GEO) site ([54]www.ncbi.nlm.nih.gov). The first dataset contains expression data from 363 cortical samples of controls and AD patients' post-mortem brains ([55]GSE15222) [56][23]. We additionally analyzed blood leukocytes gene expression that includes 3 controls, 3 AD and 3 MCI samples ([57]GSE18309) [58][24]. All datasets were filtered for metabolic genes included in the human metabolic model [59][16]. iMAT analysis We first employed a discrete representation of significantly high or low enzyme-expression levels across tissues. Gene expression levels from the microarray analysis were discretized to highly (1), lowly (−1), or moderately (0) expressed, for each sample. This discretization was based on a threshold of the mean expression +0.3 SD for highly expressed genes, the mean −0.3 SD for lowly expressed genes. Genes between these thresholds were defined as 0, and the entire process is applied for each sample separately. As iMAT requires only a single such discrete representation, the final input includes only those reactions that were classified as highly/lowly expressed in at least 2/3 of the samples. The list of genes that were defined as highly and lowly expressed as input for iMAT is detailed in [60]Table S1. In the iMAT [61][18] analysis, the discretized gene expression levels were incorporated into the metabolic model to predict a set of high and low activity reactions. Network integration is done by mapping the genes to the reactions according to the metabolic model, and by solving a constraint-based modeling optimization problem to find a steady-state metabolic flux distribution, following [62][18]. By using this CBM approach we assign permissible flux ranges to all the reactions in the network, in a way that satisfies the stoichiometric and thermodynamic constraints embedded in the model and maximizes the number of reactions whose activity is consistent with their expression state. The simulation conditions that were used were the default ones, i.e. the boundaries of the model reaction fluxes are between −1000 to 1000. Enrichment of metabolic pathways (gene expression and iMAT) Based on iMAT results, which predict the activity of the reactions in the metabolic model, a hypergeometric p-value was computed for each pathway in the model for being enriched with active or inactive reactions in AD. Subsequently, for comparison of the iMAT results to the gene expression, gene expression measurements were forst translated to the reaction level using the model's gene-protein-reactions mapping, and subsequently the list of altered reactions was again analyzed for pathway enrichment in a standard manner as above. In both analyses, a correction for multiple hypotheses was done using false discovery rate (FDR) method of 0.05. Flux Variability Analysis (FVA)_ENREF_26 [63][25] Metabolic biomarkers are predicted based on a comparison of exchange reaction intervals between the healthy case and each of the disease states. For exchange intervals A = [minA, maxA] and B = [minB, maxB] (where A and B represent the flux intervals in the control and AD stages), we define: A