Abstract
   HIV infection affects up to 30% of children presenting with severe
   acute malnutrition (SAM) in Africa and is associated with increased
   mortality. Children with SAM are treated similarly regardless of HIV
   status, although mechanisms of nutritional recovery in HIV and/or SAM
   are not well understood. We performed a secondary analysis of a
   clinical trial and plasma proteomics data among children with
   complicated SAM in Kenya and Malawi. Compared to children with SAM
   without HIV (n = 113), HIV-infected children (n = 54) had evidence
   (false discovery rate (FDR) corrected p < 0.05) of metabolic stress,
   including enriched pathways related to inflammation and lipid
   metabolism. Moreover, we observed reduced plasma levels of
   zinc-α-2-glycoprotein, butyrylcholinesterase, and increased levels of
   complement C2 resembling findings in metabolic syndrome, diabetes and
   other non-communicable diseases. HIV was also associated (FDR corrected
   p < 0.05) with higher plasma levels of inflammatory chemokines.
   Considering evidence of biomarkers of metabolic stress, it is of
   potential concern that our current treatment strategy for SAM
   regardless of HIV status involves a high-fat therapeutic diet. The
   results of this study suggest a need for clinical trials of therapeutic
   foods that meet the specific metabolic needs of children with HIV and
   SAM.
   Subject terms: Molecular medicine, Risk factors, HIV infections,
   Dyslipidaemias, Metabolic syndrome, Proteomics
Introduction
   Malnutrition, specifically undernutrition in all its forms, remains a
   global public health burden that accounts for 45% of all death among
   children under 5 years old^[38]1. Despite careful monitoring and
   adherence to guidelines set by the World Health Organization, whilst in
   general, uncomplicated SAM cases treated in the community do well, up
   to 25% of children with complicated severe acute malnutrition (SAM)
   treated in a hospital environment do not survive^[39]2–[40]5.
   Furthermore, about one in five children treated for complicated SAM and
   discharged alive, die in the first year after discharge in low-resource
   settings^[41]6–[42]8. However, our understanding of the pathophysiology
   underlying the poor prognosis for these children is surprisingly
   limited.
   Infection with the human immunodeficiency virus (HIV) is a common
   co-morbidity of SAM in sub-Saharan Africa affecting up to 30% of
   admissions among SAM cases^[43]9. HIV-infected or exposed children are
   significantly more likely to be stunted, wasted, and
   underweight^[44]10. They also more often present with other clinical
   complications and greater susceptibility to infections, thus further
   complicating their clinical management, which may include providing
   more aggressive antimicrobial therapy and higher caloric nutritional
   intervention^[45]11. Moreover, response to clinical management is also
   less predictable and less well-understood in HIV-infected children
   compared to their uninfected counterparts^[46]12. Although acute
   opportunistic infections play a key role in the outcome of these
   children, intestinal pathology including inflammation and
   malabsorption, and metabolic perturbations may also be present.
   However, mechanisms driving poor nutritional recovery of children with
   HIV even when detected co-morbidities are treated remain poorly
   understood^[47]12.
   We hypothesised that inflammatory, metabolic and other pathways which
   are likely to be involved in the response to infection, survival and
   nutritional recovery differ between children with SAM with and without
   HIV. We conducted a secondary analysis of clinical data and biological
   samples from a randomised clinical trial in Kenya and Malawi^[48]13.
Results
Patient characteristics
   Table [49]1 presents the baseline characteristics of the children in
   the randomised trial. A total of 843 complicated SAM children were
   recruited for the randomised trial, of which 179 (22%) patients were
   HIV(+). Age was higher and MUAC was lower in HIV(+) children than
   HIV(−) counterparts. Most HIV cases were found in Malawi. Sex and the
   presence of oedema were not associated with HIV status. Mortality was
   more than two times higher among in HIV(+) compared to HIV(−)
   (p < 0.001). Children whose HIV status were unknown had the highest
   mortality of 34%, which indicates bias due to frequent death before
   testing could be undertaken or refusal of testing when a child was more
   severely ill.
Table 1.
   Descriptive characteristics of the study participants.
   All HIV (+) HIV (−) Unknown HIV status p*
   n (%) 843 179 (21%) 618 (73%) 46 (5%)
   Median age in months [IQR] 16 [10–25] 21 [12–31] 16 [10–25] 10 [8–17]
   < 0.001
   % girls (n) 45% (359) 45% (81) 45% (278) 56% (26) 0.95
   Mean MUAC in cm [95% CI] 11.2 [11.1–11.3] 10.5 [10.36–10.7] 11.4
   [11.3–11.5] 11.2 [10.9–11.5] < 0.001
   Mean weight-for-age z-score [95% CI] − 4.01 [− 4.11 to − 3.92] − 4.51
   [− 4.72 to − 4.31] − 3.92 [− 4.03 to − 3.80] − 3.56 [− 3.94 to − 3.92]
   < 0.001
   % mortality (n) 15% (127) 26% (47) 10% (64) 34% (16) < 0.001°
   % oedematous (n) 31% (264) 30% (54) 33% (203) 15% (7) 0.50
   Site
   Coast Provincial General Hospital, Kenya 39% (329) 25% (45) 40% (247)
   80% (37) Reference
   Kilifi County Hospital, Kenya 22% (187) 22% (40) 23% (145) 4% (2) 0.08
   Queen Elizabeth Central Hospital, Malawi 39% (327) 52% (94) 36% (226)
   15% (7) < 0.001
   [50]Open in a new tab
   *Comparison between HIV(+) and HIV(−).
   °Adjusted for age, sex and site of recruitment.
   Among HIV(+), 33% were already receiving an anti-retroviral treatment
   (ART) regime: 53/179 (30%) on highly active antiretroviral therapy
   (HAART), and 7/179 (4%) on Nevirapine only. About half of the children
   (90/179) were naïve for ART whereas HIV treatment status was unknown
   for 16% (29/179). Mortality was not significantly different among
   children on HAART, ART naïve and children with unknown HIV treatment
   status (Supplementary Table [51]1).
HIV is associated with increased inflammation, immune activation,
dysregulated lipid metabolism, and increased proteolysis in children with SAM
   Among the children included in the proteomics study, 54 were HIV (+)
   and 113 were HIV(−) (Table [52]2). In this sub-population, age, sex and
   the presence of oedema were not significantly associated with HIV.
   HIV(+) children also had significantly lower MUAC and higher mortality
   than HIV(−) children.
Table 2.
   Patient characteristics of those subjected to proteomics analysis.
   All HIV (+) HIV (−) p*
   n 167 54 113
   Median age in months [IQR] 15 [10–26] 15 [10–26] 15 [10–24] 0.433
   n girls (%) 76 (45%) 27 (50%) 49 (43%) 0.506
   Mean MUAC at admission (cm) [95% CI] 10.9 [10.7–11.1] 10.2 [9.8–10.5]
   11.3 [11.0–11.5] < 0.001
   n oedematous (%) 49 (29%) 15 (28%) 34 (30%) 0.856
   n mortality (%) 79 (47%) 36 (67%) 43 (38%) < 0.001°
   Use of antiretroviral medication
   Naïve 27 (50%)
   Highly active antiretroviral therapy (HAART) 14 (26%)
   Nevirapine only 3 (6%)
   Unknown 10 (18%)
   [53]Open in a new tab
   *Comparison between HIV(+) and HIV(−).
   °Adjusted for age, sex, site of recruitment, oedema.
   A total of 204 circulating proteins were annotated and compared between
   children with and without HIV infection. Of these, levels of 42
   proteins were found to be significantly associated with HIV status in
   the initial univariate analysis (Fig. [54]1A) (Supplementary Table
   [55]2). Specifically, HIV(+) was associated with higher circulating
   levels of immunoglobulins, inflammatory proteins such as calprotectin
   (S100 calcium binding protein A8 and S100 calcium binding protein A9),
   complement proteins, and proteins related to host response to infection
   (i.e. lipopolysaccharide binding protein, galectin 3 binding protein
   and CD5 molecule-like protein). Enrichment analysis suggested that
   HIV(+) children had higher levels of proteins associated with classical
   complement pathway activation, immune activation and inflammation than
   HIV(−) children. Neutrophil aggregation and chemokine production
   appeared to be the pathways most highly enriched in HIV(+) compared to
   HIV(−) SAM children. To substantiate these results, we quantified
   chemokine and cytokine levels in plasma. As shown, most chemokines had
   the tendency to be associated with HIV infection, where elevated plasma
   concentration of 12 were significantly associated with HIV status in
   SAM children (Fig. [56]1B), namely: monocyte chemoattractant protein 1
   (MCP1), macrophage inflammatory protein 1 beta (MIP1b, CCL4),
   granulocyte colony-stimulating factor (GCSF), interleukin 1 beta
   (IL1b), tumour necrosis factor alpha (TNFa), interleukins 2,5,7, 8 and
   15 (IL2 , 5, 7, 8, 15), interleukin 12 subunit beta (IL12p40),
   interferon gamma-induced protein 10 (IP-10), and interleukin-1 receptor
   antagonist (IL-1RA).
Figure 1.
   [57]Figure 1
   [58]Open in a new tab
   Univariate analysis of plasma proteome and individual plasma cytokines
   associated with HIV. (A) Volcano plot showing several significantly
   different (FDR adjusted p value < 0.05) proteins and their log2 HIV(+)
   versus HIV(−) fold change. Red points represent those significantly
   higher in plasma of HIV(−), blue points significantly enriched in
   plasma of HIV(+) and orange points significantly higher than 1.5 folds
   in HIV(+) compared to HIV(−) SAM children. Vertical lines indicate
   significance level at p = 0.05 and 0.01; horizontal lines indicate more
   than 1.5 folds enrichment. (B) Log odds plots showing association of
   chemokine markers analysed using Luminex platform and HIV status.
   Points indicate log odds ratio for every log increase in plasma protein
   concentration; bars indicate 95% confidence interval.
   Out of the 43 differentially expressed proteins, three proteins were
   found to be negatively associated with HIV status on initial univariate
   analysis, namely: adiponectin, kininogen-1 and peptidase inhibitor 16.
   Among HIV(+) children, there were no statistically significant
   associations with receiving HAART (n = 14) compared to ART naïve
   (n = 27) children (Supplementary Fig. [59]1), recognising our study was
   not powered for this comparison. Furthermore, sensitivity analysis to
   address the possibility of HIV maternal antibodies in younger children,
   showed no significant interaction of age above or below 18 months and
   individual proteins plasma levels towards HIV status, although power to
   detect was limited.
   The weighted EN model extracted 73 circulating proteins (Fig. [60]2A)
   that are associated with HIV status with AUROC = 0.97 [95% CI
   0.95–0.99] (Fig. [61]2B) and misclassification error rate of 2.4%.
   Optimism-adjusted validated AUROC after bootstrapping was 0.90 [95% CI
   0.90–0.902], indicating a robust model. Pathway enrichment analysis
   highlighted that apart from immune activation, HIV(+) children with SAM
   had increased levels of proteins involved in proteolysis and lipid
   mobilisation pathways, specifically increased very low-density
   lipoprotein assembly, indicating metabolic dysregulation related to
   cholesterol and triglyceride metabolism among HIV(+) patients
   (Fig. [62]2D).
Figure 2.
   [63]Figure 2
   [64]Open in a new tab
   Multivariate analysis of plasma proteome associated with HIV. (A)
   Elastic net (EN) regularized regression lambda parameter optimization
   curve, optimal lambda parameter was chosen based on the highest area
   under the receiver operator curve (AUROC); (B) AUROC (0.97 [95% CI
   0.95–0.99]) of the EN model generated using the lambda parameter, alpha
   parameter was set to 0.75, final model extracted 34 protein features,
   optimism-adjusted bootstrap validation of the generated EN model,
   validated AUROC = 0.90 [95% CI 0.90–0.90] using 2000 iterations; (C)
   Gene entology (GO-terms) enrichment analysis of proteins extracted by
   the EN model. X-axis represents z-scores; y-axis, fold enrichment, and
   size of the spheres represent the number of proteins involved in the
   particular pathway. Gold circles represent pathways enriched in HIV(+)
   whereas blue circles are pathways more associated with HIV(−). The grey
   circle indicate that there are as much proteins in this pathway that
   are significantly upregulated and downregulated in HIV. Only
   significantly enriched pathways (p < 0.05 after FDR adjustment) are
   plotted. See main text for explanation of the plots. Pathways enriched
   are identified in the table. (D) Log odds ratio plot of the three
   proteins extracted after bootstrap validation with log odds on the
   x-axis and bars indicating 95% confidence interval obtained using
   weighted logistic regression with HIV as outcome variable and the three
   proteins as covariates. Weights used were obtained by inverse
   probability of treatment weights; (E) predictive ability of the
   weighted logistic regression model using the three bootstrap validated
   proteins with HIV as outcome variable, AUROC = 0.80 [95% CI 0.73–0.87].
   After 2000 bootstrap iterations during bootstrap validation, 3 proteins
   were consistently extracted by the EN model > 80% of the time
   (Fig. [65]2D), namely: butyrylcholinesterase (BChE), complement C2 and
   zinc-α-2-glycoprotein (ZAG), indicating that these three proteins are
   likely to be the most important features associated with HIV in
   children with complicated SAM. Weighted logistic regression model of
   these 3 proteins showed good discrimination of HIV status (AUROC = 0.80
   [95% CI 0.74–0.87]) (Fig. [66]2E).
Discussion
   In this study, we report plasma proteomic differences associated with
   HIV status, suggesting that HIV imposes additional metabolic and
   inflammatory insults among HIV(+) children with SAM. Our results show
   that pathways involved in inflammatory response, complement cascade
   activation and lipid metabolism dysregulation are associated with HIV
   status. Circulating levels of several plasma chemokines were also found
   to be higher in HIV(+) among children with SAM. Greater inflammatory
   responses in these children could be related to the higher inpatient
   mortality of HIV(+) compared to HIV(−) children with SAM.
   An earlier metabolomics study in Uganda reported reduced serum levels
   of adiponectin and leptin, whereas serum triglycerides, ketones and
   even-chain acylcarnitines were higher in HIV(+) children with SAM
   indicating perturbed lipid metabolism^[67]14. Our current study
   therefore concurs with this finding, as we also found reduced plasma
   levels of adiponectin in HIV(+) SAM children compared to HIV(−) SAM
   children, along with upregulation of pathways involved in lipid
   transport and metabolism, specifically very low-density lipoprotein
   assembly.
   Using optimism-adjusted bootstrap validation of the EN model, we found
   three proteins: complement c2, BChE and ZAG robustly distinguished
   HIV(+) from HIV(−) in children with SAM, demonstrating the ability of
   multivariate analysis techniques, such as EN, to uncover underlying
   relationships between protein markers which would be difficult to
   identify when analysed individually. The activation of the complement
   system during HIV infection has been previously discussed at length,
   which is associated with the increased cellular invasion of HIV in
   cells^[68]15–[69]17.
   On the other hand, BChE is a protein synthesized in the liver and
   abundant in plasma, which hydrolyses acetylcholine. Although very
   similar to its sister protein, acetylcholinesterase, biological
   functions of BChE appear to be more varied but less understood^[70]18.
   In a recent study in China, low circulating BChE was found to be highly
   associated with HIV severity, was predictive of mortality in adults,
   and was proposed as a plausible strategy for severity classification
   among adults with HIV^[71]19. BChE is also reported to be reduced in
   SAM, stress and inflammation^[72]20. In animal studies, BChE deficiency
   was found to strongly affect fat metabolism and promotes hepatic lipid
   accumulation^[73]21. Serum BChE levels have been found to have a
   significant negative correlation with serum total cholesterol and serum
   low-density-lipoprotein cholesterol among people with diabetes
   mellitus^[74]22.
   ZAG is a newly characterized adipokine that is involved in lipolysis,
   body weight regulation and may also be involved in the development of
   insulin resistance^[75]23. Reduction in plasma levels of ZAG was
   previously reported to be implicated in dyslipidaemia in HIV(+) adults
   under ART treatment^[76]23. Reduced circulating levels of ZAG has also
   been found among adults with clinically diagnosed metabolic syndrome,
   based on guidelines of the United States National Cholesterol Education
   Program (NCEP) Expert Panel Adult Treatment Panel (ATP) III
   criteria^[77]24. Serum ZAG levels have been reported lower among adults
   with impaired glucose tolerance and type 2 diabetes mellitus^[78]25.
   Taken together, our results therefore suggest that children with both
   HIV and SAM manifest hallmarks of metabolic stress similar to those
   occurring in metabolic syndrome and other non-communicable diseases
   (NCD).
   This study is the first proteomics investigation on the interaction
   between HIV and SAM. In summary, our results, which together with the
   previously published metabolomics study^[79]14, strengthens evidence on
   the increased metabolic stress and altered metabolic response among
   children living with both HIV and SAM. Our results also concur with
   previous studies that reported elevated metabolic stress among
   non-malnourished adults living with HIV leading to increased prevalence
   or risk for metabolic syndrome, cardiovascular diseases, diabetes and
   other non-communicable diseases^[80]26–[81]32.
   Metabolic abnormalities have previously been reported to be attributed
   HAART use among HIV(+) patients^[82]33. In a recent systematic review,
   use of two classes of HAART, protease inhibitors and nonnucleoside
   reverse transcriptase inhibitors, has been found to be associated with
   abnormalities in plasma lipid profiles^[83]34. However, dysregulation
   in lipid metabolism has also been reported in HAART-naïve patients,
   which indicates that HIV infection alone cause lipid metabolism
   perturbations. An earlier longitudinal study of 50 men in the USA
   reported notable declines in serum total cholesterol after HIV
   infection compared to results of blood analysis from last seronegative
   visit. Large increases in total cholesterol and low-density
   lipoproteins (LDL) were detected after HAART initiation^[84]35.
   However, many other studies reported increases in total cholesterol
   among HIV-infected patients naïve to HAART. For instance, in a study of
   ART-naïve HIV-infected adults in Ethiopia, malnutrition and lipid
   abnormalities (specifically total cholesterol) were associated with
   CD4 + T cell counts^[85]36. In in vitro studies, transfection of a
   T-cell (RH9) with HIV led to the enhanced production of free fatty
   acids and LDL^[86]37. Furthermore, monocytes isolated from HIV-infected
   patients both taking HAART and HAART-naïve, were found to have altered
   expression patters of receptors linked with lipid metabolism (i.e. FXR,
   PXR, PPARα, GR, RARα and RXR) compared to monocytes of HIV-uninfected
   controls^[87]38. For our study however, we are unable to ascertain
   whether the lipid metabolism dysregulation we observed is due primarily
   on the viral load itself or the use of HAART due to lack of power for
   this sub-analysis. Majority of the participants subjected to proteomics
   analysis were HAART-naïve (50%), where 26% were on HAART, 6% were on
   Nevirapine alone and we had no data on treatment of 18% of the patients
   (Table [88]2). In all these studies cited, authors argue to need for
   monitoring of lipid profiles in HIV-infected populations. Hence, lipid
   monitoring may also inform nutritional and clinical recovery of
   children with SAM and HIV and could be implemented to improve clinical
   care for these children.
   However, despite our knowledge that HIV-infected populations have
   altered metabolic requirements compared to HIV-uninfected counterparts,
   WHO guidelines for the nutritional management for SAM are globally the
   same regardless of HIV status, which is summarized in Table
   [89]3^[90]39. Nutritional management for in-patient children with SAM
   involves provision of a low-protein, low-fat milk-based food, F75,
   every three hours. F75 is used during clinical stabilization occurring
   during the first few days after admission and is not intended for
   weight gain. Once the children are clinically stabilized and are able
   to tolerate the milk/solute load, children are transitioned to F100, a
   higher-calorie, high-fat milk intended to boost weight gain or to
   Ready-to-Use Therapeutic Food (RUTF), a peanut-based calorie-dense
   diet. Upon discharge from in-patient care, children are referred to
   community based nutritional therapeutic centres where they are provided
   with RUTF on a 2 weekly basis.
Table 3.
   Nutritional management protocol for children with severe acute
   malnutrition^[91]39.
   Stabilization phase In-patient rehabilitation phase Out-patient
   rehabilitation phase
   Days 1—7 Weeks 2—6 Lengths vary depending on site
   Complicated SAM F75 F100 RUTF
   Uncomplicated SAM – – RUTF
   Composition
   Energy (kcal per 100 mL F75/F100 or 100 g RUTF) 75 100 5.2–5.5
   Protein (% total energy) 5 12 10–12
   Fat (% total energy) 32 53 45–60
   [92]Open in a new tab
   Considering evidence of biomarkers of metabolic syndrome and NCD in
   HIV(+) children with SAM, it is of potential concern that our current
   treatment strategy involves a high-fat therapeutic diet. About 50% of
   much needed calories during the growth catch-up phase are supplied as
   lipids, which HIV(+) children may not be able to efficiently
   assimilate. Alterations in lipid metabolism in HIV(+) children with SAM
   may also mean that the high amounts of dietary lipids could be
   deposited as ectopic fat in the liver and muscle, predisposing to
   insulin resistance, diabetes, cardiovascular problems and other NCDs
   later in life. Although long-term metabolic follow-up studies could be
   done for HIV(+) children previously treated for either complicated and
   uncomplicated SAM, significant barriers are the high mortality rate in
   earlier studies of HIV(+) children with SAM, cost and difficulty
   tracing them years later. The results of this study indicate a need for
   clinical trials of F100 or RUTF modified to meet the expected metabolic
   needs of HIV(+) children with SAM. This could initially be done in
   relatively small groups with outcomes that include measuring metabolic
   stress.
   Several studies on nutritional intervention strategies among
   HIV-infected adults have been reported. For instance, a study in the
   USA showed that dietary fat intake, specifically saturated fats, was
   significantly associated with hypertriglyceridemia among HIV-infected
   adults (18–60 years)^[93]40. Moreover, in a preclinical model, high
   saturated fat consumption was found to accelerate immunodeficiency
   virus disease progression in macaques, specifically increased mortality
   hazard and circulating levels of pro-inflammatory cytokines, especially
   IL8^[94]41, which has been previously reported to be associated with
   lipodystrophy among HIV patients^[95]42. In our study, we also found a
   significant association between high plasma IL8 concentration and HIV
   in SAM children. Hence, modifying the saturated fat composition of the
   milk-based F75 and F100 could potentially lower metabolic stress.
   The European Society for Parenteral and Enteral Nutrition (ESPEN) have
   given a grade A recommendation for the use of medium-chain triglyceride
   (MCT)-based diet on HIV(+) patients with diarrhoea and severe
   undernutrition in its 2006 ESPEN Guidelines on Enteral
   Nutrition^[96]43. Grade A recommendations are given to strategies based
   on meta-analysis or at least one randomised control trial. In this
   case, the recommendation was based on a prospective, randomized
   double-blind comparative trial on 24 adult patients with HIV and
   diarrhoea of more than 4-week duration, fat malabsorption, and loss of
   10–20% of ideal body weight^[97]44. In this study, the authors found
   improved outcomes from diarrhoea and fat malabsorption from MCT than
   long-chain triglyceride-based diet among HIV(+) adults.
   HIV infection has been reported to be accompanied by substantial damage
   to gut integrity and changes in gut microbiome composition^[98]45. In
   this study, we observed increased circulating levels of LPS binding
   protein, which is a marker of bacterial translocation from the gut into
   the bloodstream. Therefore, understanding the interaction between HIV
   and gut microbiota could provide insights into aetiology and
   interventional points of view. As more evidence on the role of gut
   microbiota and gut integrity on health outcomes emerge, we must also be
   aware of the potential impact of antibiotics and nutritional
   therapeutic strategies on the microbiome. Markers of gut health and
   microbiome restoration among children with HIV and SAM therefore need
   to be studied in parallel with improved/modified RUTF formulations to
   fully elucidate the mechanisms of their efficacy.
   Lastly, the long-term metabolic effect of nutritional intervention
   strategies for SAM still remains unresolved. Most specifically, the
   potential metabolic stress associated with the rapid weight gain during
   the nutritional rehabilitation phase after SAM and its implications on
   nutritional outcomes during adulthood demands urgent research
   attention, especially for HIV(+) children with SAM.
   Limitations of this study include absence of data on viral load and
   CD4+ counts of the patients, which could provide a deeper understanding
   of the results. Furthermore, in this study, we did not find association
   between oedematous malnutrition and HIV status, although several
   studies have a found higher HIV prevalence among non-oedematous
   children with SAM^[99]46–[100]48. In our study however, we found high
   in-patient mortality rate (16/46, 34%) among children with unknown HIV
   status, where 39/46 (85%) had non-oedematous SAM. Considering the high
   rate of mortality, these children may have been HIV(+). This highlights
   the need for earlier HIV screening among children with SAM. Finally, a
   deeper understanding of the comorbidity of HIV and SAM would require
   studies also involving non-malnourished HIV+ and HIV− children
   preferably in various geographical and social contexts. Hence, further
   studies are needed fully characterize the interplay between HIV
   infection and malnutrition.
Conclusion
   Plasma proteomics reveals that HIV(+) children with SAM manifest
   hallmarks of metabolic stress similar to those observed in
   non-communicable diseases. This could be related to the poor
   nutritional recovery and high mortality of HIV(+) children with SAM
   despite clinical and nutritional intervention. The results of this
   study indicate a need for clinical trials modifying the composition of
   F100 or RUTF to meet the specific metabolic needs of HIV(+) children
   with SAM during rehabilitation phase. This could initially be done in
   relatively small groups with outcomes that include measuring metabolic
   stress.
Methods
Patient recruitment and study design
   This is a secondary analysis of a nested case control study from a
   randomised controlled trial ([101]NCT02246296), which tested the effect
   of a lactose-free, low-carbohydrate F75 milk to limit carbohydrate
   malabsorption, diarrhoea and refeeding syndrome among children
   hospitalized for complicated SAM at Queen Elizabeth Central Hospital in
   Blantyre, Malawi, Kilifi County Hospital and Coast General Hospital,
   Mombasa, Kenya^[102]13. Children aged 6 months to 13 years were
   eligible for enrolment into the trial at admission to hospital if they
   had SAM, defined as: mid-upper arm circumference (MUAC) < 11.5 cm or
   weight-for-height Z score <  − 3 if younger than 5 years of age, BMI Z
   score <  − 3 if older than 5 years, or oedematous malnutrition at any
   age and had medical complications or failing an appetite test, as
   defined by WHO guidelines^[103]49. Children were excluded if they had a
   known allergy to milk products and did not provide consent. Biological
   samples were obtained before the children received the randomised
   treatment irrespective of HIV status. Unless a child’s HIV positive
   status was documented, HIV status was assessed by offering an antibody
   test at admission plus appropriate counselling. For this analysis,
   patients that tested positive on an HIV antibody test were considered
   HIV(+) and children with missing or declined HIV test were excluded.
   To compare the proteomic profiles between HIV infected and non-infected
   children with SAM, we used data from a nested case–control study to
   investigate inpatient mortality. Of 127 children who died, 92 had
   sufficient samples available for proteomics analysis. Since the main
   outcome of our current study is HIV, we excluded deaths with unknown
   HIV status (n = 13), resulting to 79 cases included in this analysis.
   Among children who survived, 92 had been randomly selected in the
   nested case–control study matched on site of recruitment. After
   excluding children with unknown HIV status (n = 4), 88 controls from
   the nested case–control study were used for this analysis. Proteomic,
   cytokine, and chemokine data was generated using plasma samples
   collected at admission during enrolment to the trial. A weighted
   analysis was designed to help overcome selection bias, as described in
   the data analysis section below.
Proteomics, cytokine and chemokine analysis
   Untargeted proteomics and targeted cytokines and chemokines analysis of
   plasma samples were performed following methods described
   previously^[104]50. The targeted protein panel included: epidermal
   growth factor (EGF); eotaxin; granulocyte-colony stimulating factor
   (GCSF); granulocyte–macrophage colony-stimulating factor (GMCSF);
   interferon alpha-2 (IFNa2); interferon gamma (IFNg); interleukins 10,
   12p40, 12p70, 13, 15, 17A, 1a, 1b, 1RA, 2 to 8; interferon
   gamma-induced protein 10 (IP10); monocyte chemoattractant protein 1
   (MCP1), macrophage inflammatory protein 1 alpha and beta (MIP1a & b);
   tumour necrosis factor alpha (TNFa) and beta (TNFb); and vascular
   endothelial growth factor (VEGF).
Data analysis
   Data analyses were performed using R v3.5^[105]51. Analysis of the
   prevalence of HIV(+), nutritional status and their associations with
   inpatient mortality utilised the entire trial dataset (N = 843).
   Analysis of categorical data was performed using Fisher’s test and
   generalised linear models for continuous outcomes. Logistic regression
   was used to analyse binary outcomes adjusting for age, sex, presence of
   oedema, and site of recruitment. These associations were also adjusted
   for MUAC. As a sensitivity analysis to address the possibility of
   confounding due to HIV maternal antibodies in younger children, a test
   of interaction between age above or below 18 months and individual
   proteins towards HIV status was performed.
   The proteomics, cytokines and chemokines analyses were secondary
   analyses of data collected from a nested case–control study with
   inpatient mortality as its primary outcome, hence with strong selection
   bias. The analysis for the association between HIV status and
   individual proteins was therefore performed using logistic regression
   analysis with inverse probability weighting (IPW) to correct for
   selection bias^[106]52–[107]55. Weights (w) were calculated as
   suggested by Samuelsen^[108]53 wherein the weight for each observation
   selected into the nested case–control study was computed as the inverse
   of the probability of being selected for the nested study from the main
   clinical trial. The probability of inclusion was therefore calculated
   as:
   [MATH: pi=11+e-β0+β1<
   mo>×1+β2×2⋯+βn×n; :MATH]
   where p(i) is the probability of inclusion in the nested case–control
   study and × 1, × 2, …, × n are HIV status, sex, age, presence of
   oedema, mid-upper arm circumference, and site of recruitment of the ith
   observation (child) based on the entire trial population. Inverse
   probability weight is therefore:
   [MATH: wi=1pi
   :MATH]
   Differences in individual proteins abundances were considered
   statistically significant when p < 0.05 after adjustment for multiple
   comparisons using Benjamini–Hochberg false discovery rate
   (FDR)^[109]56.
   Multivariate analysis was undertaken in order to determine several
   proteins that are collectively associated with HIV status, some of
   which may not be significantly associated to HIV independently. This
   was performed using a weighted elastic net (EN) model implemented using
   the “glmnet” package in R^[110]57. EN is a penalized regression
   approach that was developed to help overcome problems caused by high
   dimensional data. It is an integration of two regularized approaches,
   ridge regression and least absolute shrinkage and selection operator
   (LASSO), wherein the contribution of each of these models to the final
   EN model is controlled by the α parameter^[111]57,[112]58. The strong
   penalization imposed by LASSO draws coefficients to zero thereby
   eliminating non-predictive proteins features, whereas ridge regression
   addresses potential multi-collinearity problems in high-dimensional
   data^[113]57,[114]58.
   Weighted EN model generation was performed with HIV status as outcome,
   protein profile as predictors, and w as observation weights. The
   penalization parameter lambda, which influences the shrinkage of
   variable coefficients to zero thus eliminating some non-contributing
   variables, was determined by estimating the area under the receiver
   operator curve (ROC) of the population using ten-fold cross validation.
   Several alpha parameter values were assessed and a final value of 0.85
   was taken to achieve a compromise between predictive ability and fewer
   number of features extracted. The final lambda parameter was based on
   the value which gave the highest area under the ROC (AUROC) value.
   Proteins with significant association with HIV status after correction
   for false discovery and those extracted by the EN model were then
   uploaded to The Database for Annotation, Visualization and Integrated
   Discovery (DAVID) v6.8 Bioinformatics Resource^[115]59 to assess the
   Gene ontology (GO) enriched pathways of the differentially expressed
   proteins.
   EN model validity was judged based on the AUROC and misclassification
   error rate. The fitted EN model performance measured as
   optimism-corrected AUC was validated using bootstrap, following the
   procedure of Smith et al.^[116]60. Bootstrapping was performed on 2000
   iterations using the “BootValidation” package in R. Protein features
   extracted at least 80% of all iterations by the bootstrap EN model were
   then considered to be the most relevant protein biomarkers. To test how
   well these proteins can discriminate HIV status, they were then fitted
   on a weighted logistic regression with HIV as outcome.
Visualisation of significantly enriched GO terms
   Bubble plots were used to visualise the significantly enriched pathways
   (p < 0.05 after adjustment for FDR) obtained from DAVID. The p-values
   in DAVID were obtained using a modified Fisher’s exact test^[117]61.
   The y-axis represents the fold enrichment which indicates the magnitude
   of the enrichment, as calculated in DAVID. Fold enrichment is defined
   as:
   [MATH: foldenrichment=(m/n)M/N,
   :MATH]
   where m is the number of proteins significantly associated with HIV
   status or proteins extracted by the EN model that belong to a
   particular pathway, while M is the total number of proteins belonging
   to the same pathway. Variable n is the number of all proteins
   significantly associated with HIV status or extracted by the EN model
   and N is the total number of all proteins in the human background.
   Therefore, a fold enrichment of ten indicates that 10% of the proteins
   significantly associated with HIV status belong to a particular
   pathway, and 1% of all annotated proteins in the human background
   belongs to the same pathway^[118]61. However, the proponents of this
   metric warn that big fold enrichments could be obtained from a small
   number of proteins, which could be due to small n or pathways with
   fewer members.
   The x-axis on the hand represents the enrichment z-score for a
   particular pathway^[119]62, which is calculated as follows:
   [MATH:
   z-score
   =up-dow
   mi>ncount;
   :MATH]
   where up is the total number of proteins upregulated, down is the total
   number of proteins downregulated, and count is the total number of
   proteins in the input which belongs to a particular pathway. Variables
   up and down were based on the weighted logistic regression for each
   individual protein. Hence, if five proteins belonging to pathway x were
   upregulated and two were downregulated, the z-score for pathway x would
   be: (5–2)/√7 = 1.13. A positive z-score indicates that the particular
   pathway is overall upregulated in HIV(+), whereas a negative z-score
   indicates an overall downregulation^[120]62.
Ethics approval
   The secondary analyses of the trial were approved by the Kenyan
   National Ethics Committee, KEMRI-SERU (KEMRI/RES/7/3/1). The trial was
   registered at clinicaltrials.gov ([121]NCT02246296).
Supplementary information
   [122]41598_2020_68143_MOESM1_ESM.docx^ (112.5KB, docx)
   Supplementary Information 1 (DOCX 112 kb)
Acknowledgements