Abstract Spectral Counts approaches (SpCs) are largely employed for the comparison of protein expression profiles in label-free (LF) differential proteomics applications. Similarly, to other comparative methods, also SpCs based approaches require a normalization procedure before Fold Changes (FC) calculation. Here, we propose new Complexity Based Normalization (CBN) methods that introduced a variable adjustment factor (f), related to the complexity of the sample, both in terms of total number of identified proteins (CBN(P)) and as total number of spectral counts (CBN(S)). Both these new methods were compared with the Normalized Spectral Abundance Factor (NSAF) and the Spectral Counts log Ratio (Rsc), by using standard protein mixtures. Finally, to test the robustness and the effectiveness of the CBNs methods, they were employed for the comparative analysis of cortical protein extract from zQ175 mouse brains, model of Huntington Disease (HD), and control animals (raw data available via ProteomeXchange with identifier PXD017471). LF data were also validated by western blot and MRM based experiments. On standard mixtures, both CBN methods showed an excellent behavior in terms of reproducibility and coefficients of variation (CVs) in comparison to the other SpCs approaches. Overall, the CBN(P) method was demonstrated to be the most reliable and sensitive in detecting small differences in protein amounts when applied to biological samples. Introduction In recent years, Proteomics has gained centrality in Omics studies for basic and translational applications, especially for diagnostic purposes and for targeted and/or personalized medicine [[40]1–[41]3]. Differential proteomics approaches are attracting particular attention due to the possibility to compare protein expression profiles from multiple biological conditions, e. g. wild type vs mutant or vs pharmacologically treated, etc [[42]4,[43]5]. Different methodologies have been developed to carry out the qualitative-quantitative analysis of the protein content in samples using both labeled and label-free approaches. The labeling based methods employed fluorescent (e. g. Difference in Gel Electrophoresis (DIGE)) [[44]6,[45]7] or isotopic reagents (as for Isobaric Tag for Relative and Absolute Quantitation (iTRAQ), Stable Isotope Labeling with Amino acids in Cell culture (SILAC) and Tandem Mass Tagging (TMT) [[46]8–[47]12]) to differently label the two or more proteomes under investigation. These strategies provide higher levels of reproducibility because are based on the contemporaneous electrophoretic separation and/or tandem mass spectrometry (MS/MS) analysis of the samples. Despite these advantages, labeling procedures are time-consuming and very expensive for the high cost of the labeling reagents [[48]13]. More recently, Label-Free (LF) approaches have been introduced for quantification of proteomic profiles by exploiting liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) analyses [[49]14–[50]16]. These procedures [[51]17–[52]19] represented an effective solution to overcome these drawbacks due to their reduced costs and simplified sample preparation. Nevertheless, longer analysis time and high-performance MS/MS instruments are needed. Label-Free quantification has emerged as a consequence of the great technical advances in the development and design of high-resolution LC-MS/MS instruments [[53]20], such as the Orbitrap mass analyzer [[54]21]. This analytical approach relies on the measurement of two main parameters, Extracted Ion Chromatogram (XIC) and Spectral Counts (SpCs) [[55]19,[56]22]. Measurement of XIC is a MS1-based strategy in which the ion current associated with each peptide ion is individually considered. The relative quantification of proteins can then be obtained by evaluating the total current associated with all the peptides belonging to a specific protein in the two conditions [[57]23]. SpCs quantification is an MS/MS-based procedure. The number of fragmentation events, (i.e. the spectral counts), measured for all the peptides belonging to the same protein in each sample are summed up resulting in the relative quantification of the protein in the different conditions [[58]24,[59]25]. Both XIC and SpCs methods need massive use of tandem mass spectrometry, producing a large amount of data that needs to be normalized, i.e. corrected for instrumental or uncontrollable variations before becoming suitable for quantitative purposes. In particular, specifically, designed bioinformatic tools are required to manage and process data, while appropriate normalization methods have to be used [[60]26,[61]27]. Normalization procedure has been assessed in multiple ways by several authors [[62]28,[63]29]. In general, XIC methods require steps of higher complexity such as isotopic patterns resolution, feature detection, retention time alignment, etc. Therefore, well defined and automated processing and normalization algorithms have been integrated into specifically designed software packages, as for MaxQuant [[64]30–[65]32]. In SpCs methods, instead, fragmentation events are crucial for both protein identification and quantification. Commonly, software workflows for SpCs quantification result in discrete, unnormalized data with normalization left to the user [[66]33,[67]34]. Although XIC-based methods show a greater accuracy and linear range, the access to the last generation high-resolution instruments and/or computational processing for XIC data are often prohibitive, with the consequence that SpC methods are still a valuable alternative for a large number of applications, as demonstrated by recent literature [[68]35–[69]38]. Here we propose a new Complexity Based Normalization (CBN) method introducing new adjustment factors in the SpC normalization formula related to the complexity of the analyzed samples in terms of total identified proteins (CBN(P)) or spectral counts (CBN(S)). Both CBN methods are compared with the multiple label-free quantification modes based on the SpC approaches, namely Ratio of Spectral Count (R[SC]) [[70]33] and Normalized Spectra Abundance Factor (NSAF) [[71]39]. The new proposed normalization approach was optimized on standard protein mixtures with different complexity, and then applied to the differential analysis of the proteome profiles extracted from the brain cortex of zQ175 knock-in mice [[72]40], an animal model of Huntington Disease (HD), in comparison with the wild-type counterpart. The CBN(P) method was demonstrated to be the most reliable and sensitive in detecting small differences in protein amounts when applied to biological samples. Materials and methods Preparation of standard mixtures Six standard tryptic peptide mixtures (Waters, Milford, Massachusetts, US) including Bovine Hemoglobin Chain A (HBA, Uniprot ID [73]P01966, B. Taurus), Bovine Hemoglobin Chain B (HBB, [74]P02070, B. Taurus), Bovine Serum Albumin (BSA, [75]P02769, B. Taurus), yeast Enolase (ENO, [76]P00924, S. Cerevisiae), yeast Alcohol Dehydrogenase (ADH, [77]P00330, S. Cerevisiae) and rabbit Glycogen Phosphorylase (PYG, [78]P00489, O. Cuniculus) were dissolved in 0.2% formic acid (FA) and spiked into an E. Coli total tryptic digest (Waters, Milford, Massachusetts, US) solution, so that final concentration of the digest reaches 67 ng/μL. In all mixtures obtained and indicated with A-E letters, HBA, HBB, BSA and PYG were always added in a fixed amount, while only ENO and ADH were added in variable amounts ranging from 1/25 to 5 times the fixed standards (i.e. 0.94–23.5ng/μl for ENO and 0.76–19 ng/μl for ADH). The obtained standard mixtures have been employed for the optimization of mass spectrometry analyses and for setting quantification methods. The zQ175 mouse model All protocols involving animals were carried out in accordance with institutional guidelines in compliance with Italian law (D. Lgs no. 2014/26, implementation of the 2010/63/UE) and authorization n.324/2015-PR issued May 6, 2015, by Ministry of Health. The Ethics Committee of the University of Milano approved studies in mice (Ethical Approval 74/13; Ethical Approval 74/14). JAX stock number was B6J.129S1-Htt tm1Mfc /190ChdiJ. For biochemical analyses, animals were euthanized by dislocation and all efforts were made to minimize suffering. Genotyping of the zQ175 (C57BL/6J) mouse colony was performed by PCR of genomic DNA obtained from tail samples (Nucleo Spin Tissue, Macherey-Nagel, catalog 740952.250) at weaning. CAG repeats of zQ175 mice were sized by using the following PCR primers: forward: CATTCATTGCCTTGCTGCTAAG; reverse: CTGAAACGACTTGAGCGACTC. Cycling conditions were 94°C for 10 minutes, 30 cycles × (96°C for 30 seconds, 57°C for 30 seconds, 72°C for 30 seconds), 72°C for 7 minutes. Preparation of total protein lysates from mouse cortical tissues For proteomics and biochemical analyses, animals were euthanized by dislocation and the brains were rapidly removed and the cerebral cortices were immediately excised from the brain, frozen in liquid nitrogen and smashed. Specifically, cortical tissues from three symptomatic homozygous zQ175 mutant and three wild type (WT) mice at 50 weeks of age were used. The samples were then lysed by combining chemical and mechanical methods. Cortical tissues were crushed and repeatedly pipetted in the lysis buffer (50 mM Tris-HCl pH 7.4, 150 mM NaCl, 0.1% v/v Sodium Dodecyl Sulphate (SDS), 1 mM Phenylmethylsulfonyl fluoride (PMSF), 1% v/v Nonidet P-40 (NP40), protease inhibitors EDTA-free), then passed ten times each through a syringe. Following these mechanical processes, they were put on a stirring wheel, (20 minutes at 4°C), and then centrifuged, (16,100 x g, 30 minutes, 4°C). After the lysis procedure, supernatants from all samples were collected and quantified according to Bradford protein assay and 50 μg from each were used for the Differential Proteomics experiment and Western Blot assays. SDS-PAGE and in situ hydrolysis 50μg from each sample were suspended in Laemmli buffer containing 100 mM dithiothreitol (DTT), boiled for 10 minutes and loaded on a 20x20 cm 4% - 15% bis-acrylamide gradient gel. After SDS-PAGE separation, the gel was stained with Coomassie Blue Brilliant and each lane was excised in 29 slices by using the same cutting scheme. All gel slices were in situ hydrolyzed as previously reported [[79]41]. Finally, the peptide mixtures were vacuum-dried and resuspended in 0.2% formic acid (FA) solution to be analyzed by LC-MS/MS. LC-MS/MS analyses Quantitative data were collected on an LTQ Orbitrap XL (ThermoScientific, Waltham, MA) coupled to a nanoLC system (nanoEasy II). All peptide mixtures were analyzed using the same chromatographic conditions, i.e. 3 microliters of each sample were fractionated onto a C18 capillary reverse-phase column (100 mm, 75 μm, 5 μm) working at 250 nl/min flow rate, using a linear gradient of eluent B (0.2% formic acid in 95% acetonitrile) in A (0.2% formic acid and 2%acetonitrile in MilliQ water) from 5% to 40% in 80 minutes was run. MS/MS analyses were performed using Data-Dependent Acquisition (DDA) mode: one MS scan (mass range from 400 to 1800 m/z) was followed by MS/MS scans of the five most abundant ions in each MS scan, applying a dynamic exclusion window of 40 seconds. All samples were run in duplicates. Protein identification and quantification Raw data obtained from nanoLC-MS/MS were analyzed with MaxQuant 1.5.2 integrated with the Andromeda search engine [[80]42]. To that end, an appropriate Fasta file was generated by downloading from UniProt and subsequently merging amino acid sequences of both standard proteins and whole E. Coli proteome. The selected parameters for protein identification were the following: minimum 2 peptides, at least 1 unique; variable modifications allowed were methionine oxidation and pyroglutamate formation on N-terminal glutamine; accuracy for the first search was set to 10 ppm, then lowered to 5 ppm in the main search; 0.01 FDR was used, with a reverse database for decoy; retention time alignment and second peptides search functions were allowed. Protein quantification has been performed only using unique unmodified peptides. As output, MaxQuant resulted in discrete spectral counts, which were then normalized by using NSAF, R[SC], or CBNs approaches. As concerns data collected for the Differential Proteomics analysis on zQ175 mice, following MaxQuant analysis data, each couple of technical duplicate was mediated for every protein, obtaining a set of 6 data (3 for WT and 3 for zQ175). These values were employed for manual normalization according to the above mentioned SpCs methods. Western blot and densitometric analyses Mice brains total protein extracts were separated by SDS-PAGE and then transferred onto nitrocellulose membranes (Bio-rad, Hercules, California, US). Membranes were blocked with 5% non-fat milk and then incubated with the following antibodies: rabbit polyclonal anti-hnRNPH (Abcam, ab154894), rabbit polyclonal anti-SERPIN B6 (Abcam, ab233229; Proteintech, 14962-1-AP), rabbit polyclonal anti-IRGM (Abcam, ab118569), rabbit polyclonal anti-UQCRQ (Proteintech, 14975-1-AP), rabbit monoclonal anti-HOMER1 (Abcam, ab184955), rabbit monoclonal anti-HTT (Cell Signalling Technology, mAb#5656), rabbit polyclonal anti-OSBPL2 (Proteintech, 14751-1-AP), mouse monoclonal anti-β-Actin (Origene, TA811000) and mouse monoclonal anti-Vinculin (Sigma-Aldrich, V9131). Membranes were incubated with horseradish peroxidase-conjugated secondary antibody (1:5000 for mouse host antibodies and 1:10000 for rabbit host antibodies) for 45 minutes at room temperature and the signals were detected by enhanced chemiluminescence (ECL) detection system (Thermo Fisher Scientific, Inc., Waltham, MA). For densitometric analyses, the software Quantity One 4.6.8 (Bio-Rad, Hercules, California, US) has been employed; all protein band intensities were normalized with the corresponding β–actin signal, except HTT, whose intensities were normalized on vinculin signal. Multiple-Reaction Monitoring (MRM) analyses As additional validation method multiple-reaction monitoring (MRM) approach was employed. 50 μg of cell lysates obtained as described above were digested by trypsin onto S-Trap filters, according to the manufacturer protocol (Protifi, Huntington, NY). The three biological replicates of zQ175 and wild-type mice were pooled respectively, and HTT, SERPIN B6, HOMER1, OSPBL2, SAMM50, APO-A4, CAMK2A, STIP1, RIC8A, ATP2A2 peptide transitions were monitored with a Xevo-TQS mass spectrometer coupled to a nanoAcquity UHPLC (Waters, Milford, MA, US) equipped with IonKey CHIP interface. Peptide mixtures were separated on peptide BEH C18 130Ǻ, 1.7μm, 150 μm x 50mm, iKey by using a linear gradient of eluent B (95% acetonitrile LC-MS grade (Sigma Aldrich, St. Louis, Missouri, US), 0.2% formic acid (Sigma Aldrich, St. Louis, Missouri, US)) from 7% to 95% over 115 minutes working at a flow rate of 3μl/min. Raw data were processed with Skyline v20.1.0.155 (MacCoss Lab Software, Dept of Genome Sciences, UW) and the total area of each peptide transition was used for the relative quantification of the specific proteins. For each protein, at least two prototypic peptides were selected and at least two transitions for each parent ion were monitored, as in silico predicted by using Skyline. Proteins were not monitored all together but in three different runs, both for wild type and zQ175: run A, included HTT, HOMER1, RIC8A; run B, ATP2A2, APO-A4, OSBPL2; run C, SAMM50, SERPIN B6, CAMK2A, STIP1 and in each run ACTIN was monitored as the internal standard. Each run was analyzed in duplicate and the total area of each peptide transitions was used for the relative quantification of the specific proteins, normalized for actin FC. Statistical analyses Multiexperiment Viewer v4.9.0 [[81]43], (MeV) was employed to perform statistical analysis of MaxQuant output. Western blots results were evaluated by univariate statistical analysis using GraphPad Prism 8.0, and the results are presented as the mean ± standard deviation (SD) by three biological replicates. The statistical significance of the observed difference in western blot analysis was determined by parametric (Welch’s t) or non-parametric (Mann–Whitney test) tests when data failed the Shapiro–Wilk normality test. Mass spectrometry data (LF) statistical significance was determined by the unpaired Student’s t-test and differences were considered statistically significant at Benjamini-Hochberg corrected p-value (FDR)<0.05 [[82]44]. Functional over-representation analysis STRING database ([83]https://string-db.org/) [[84]45] (STRING v11: Protein-Protein Association Networks With Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets) was used to perform a pathways enrichment analysis on differentially expressed proteins in mouse brains identified in each method (NSAF, Rsc, CBN(P), CBN(S)). Results Development of a new method for Spectral Counts normalization A new mathematic method, called Complexity Based Normalization (CBN), was developed for the normalization of Spectral Counts data in differential proteomics experiments to calculate the relative Fold Change for each protein between two experimental conditions. The new formula ([85]1) employed for SpCs normalization is reported as follows in comparison with the Normalized Spectra Abundance Factor (NSAF) ([86]2) and the Spectral Counts log Ratio (R[SC]) ([87]3) methods. [MATH: CBNx,< /mo>M=Sx,MtM+f :MATH] (1) [MATH: NSAFx< /mi>,M=Sx,MLxi=1PSi,MLi :MATH] (2) [MATH: RSCx,M =log2 (Sx,M+f tMSx,M +f) :MATH] (3) In CBN methods ([88]1), S[x,M] are the Spectral Counts of protein x in mixture M, t[M] are the total spectral counts of mixture M, f is the complexity-based adjustment factor. In addition to common variables, in NSAF method ([89]2) L[i] represents the amino acid length of "i" protein, while P is the total number of proteins identified in the analysis. Finally, in the R[SC] method ([90]3) f is a fixed adjustment factor = 0.5. The output of these formulas consists of a numeric value representing the quantity of a specific protein in each mixture that is used to calculate the relative Fold Change (FC). Both R[SC] and CBN require the use of an adjustment factor "f" to avoid the presence of missing values. In the R[SC] method, Beissbarth et al. [[91]46] have fixed the f value constant to 0.5. For both CBN methods we propose the employment of a variable adjustment factor according to sample complexity. Specifically, the CBN formula has been developed with f = 1/P (CNB(P)), where P is the number of proteins occurring in the MaxQuant output data or with f = 1/t (CBN(S)), where t is the sum of all spectral counts associated to all proteins in all mixtures. As a consequence of these definitions, the adjustment factor becomes complexity-based, since the P and t values are both related to the total number of proteins in the sample. Setting up the normalization methods on E. Coli proteome standard sample The newly developed CBN formulas were evaluated in comparison with existing normalization methods in differential proteomics experiments using the label-free procedure applied to the E. Coli proteome. Commercial tryptic digests from six different proteins (HBA, HBB, and BSA from B. Taurus, ENO and ADH from S. Cerevisiae and PYG, from O. Cuniculus) were spiked into a fixed matrix consisting of a constant amount of E. Coli total proteins tryptic digest, to mimic a real sample environment. Five mixtures (A, B, C, D, E) were prepared as described above, in which only ADH and ENO tryptic peptides were added in different amounts. Each mixture was analyzed by nanoLC-MS/MS in duplicates and the raw data processed by using MaxQuant. Following MaxQuant analysis, a set of unnormalized SpCs data was obtained and used to test the different normalization methodologies, NSAF, Rsc, and the newly developed CBNs, either in terms of total identified proteins (CBN(P)) or spectral counts (CBN(S)). A total of 409 proteins (P), including the six spiked standards were identified in all samples by MaxQuant analysis, together with 17837 total spectral counts (t) ([92]S1 Table). The P and t experimental values were then introduced in the CBN formulas for the computation of the f factor. As a first step, the reproducibility of the different approaches was tested making use of the two technical replicates prepared and analyzed for each mixture. As an example, [93]Fig 1A–1D shows the corresponding scatter plots concerning the quantitative measurements of all proteins identified in the two replicates of mixture A by the four different normalization methods used, i.e. NSAF, R[SC], CBN(P) with f = 1/P and CBN(S) with f = 1/t. Fig 1. Scatter plots of the quantitative measurements of all the E. Coli proteins. [94]Fig 1 [95]Open in a new tab Scatter plots of the quantitative measurements of all the E. Coli proteins identified in the two replicates of mixture A by the four different normalization methods used, i.e. NSAF (panel A), R[SC] (panel B), CBN(P) with f = 1/P (panel C) and CBN(S) with f = 1/t (panel D). On the X-axis is reported the normalized Spectral Counts of replicate 1, and on the Y-axis the normalized Spectral Counts of replicate 2. When the data produced by the applied method were perfectly reproducible between the two technical replicates, the scatter plot should result in a perfectly linear behavior (R^2 = 1) and, most importantly, should have a unitary slope. As showed in [96]Fig 1A–1D, a very poor correlation occurred when data were elaborated by NSAF and Rsc in comparison to CBN methods, with a high number of points largely scattered, therefore suggesting a lower level of reproducibility. Linear regression best-fit values are summarized in [97]S2 Table. It should be underlined that the scatter plots associated with the two CBN methods showed quite similar shape although the value of the adjustment factor was greatly different. Furthermore, as shown in panel 1B, a large scattering of data that could not be accommodated in a linear behavior was observed in the R[SC]normalization method. [98]Fig 2A reports the slopes calculated for the best fitting lines in each pair of technical replicates analyzed with the four methods for all samples except mixture B, since it showed a very low reproducibility in all methods ([99]S1 Fig), suggesting the occurrence of technical problems in the LC-MS/MS analysis. Findings reported in [100]Fig 2A showed that all the SpCs methods displayed an acceptable behavior except for R[SC,] whose median value was much lower than the expected value (i.e. 1) suggesting that a fixed adjustment factor "f" strongly affects the median value. The reproducibility of each method was also evaluated by calculating the coefficient of variation (CV) in the quantification of E. Coli proteins throughout the ten LC-MS/MS runs in which the amount of E. Coli tryptic peptides was constant ([101]Fig 2B). The R[SC] method showed a very good behavior with a low coefficient of variation whereas the NSAF displayed a greater dispersion of data and a very high CV value. This is very likely due to the absence of any adjustment factor in this method making NSAF very susceptible to quantitative biases caused by the absence of data. As showed, CBN(S) performed well, showing a CV dispersion intermediate between NSAF and Rsc. The best result occurred in CBN(P), which presented the lowest CV values contained in the narrowest dispersion window, suggesting it as the most precise among the investigated methods. Fig 2. Comparative analysis of all spectral counts normalization methods applied to E. Coli proteome. [102]Fig 2 [103]Open in a new tab (A) Median values for the best fitting slopes calculated for each pair of technical replicates for all samples analyzed (except mixture B) with each normalization method. (B) Coefficient of variation (CV) for the evaluation of data dispersion for all the normalization methods used in the analysis of E. Coli proteome. (C) Representation of the logarithmic Fold Change distribution around the theoretical value indicated by the point line for all normalization methods. (D) Box plots of the CV for the fold change in the mix E / mix D pair, calculated on all the four possible pairs of the technical replicates. In panels B and D the box and whisker extremes represent 25–75%. These preliminary data strengthened our hypothesis that differences in Fold Change values are due to the different sizes of the adjustment factor, since the shape of the data is identical. The normalization methods were then tested for their ability to correctly assess protein quantification when two different sets of data were compared. Therefore, the five mixtures and the corresponding replicates were compared by two according to the scheme: A1/B1, A2/B1, A1/B2, A2/B2, etc., and the Fold Change (FC) values of all proteins in each pair were calculated. As the amount of E. Coli proteome was constant in all samples, the expecting Folding Change value for each protein should have been equal to 1 (Log[2] (FC) = 0, [104]Fig 2C). For each pair, the mean value of the Fold Change and its associated CV was calculated. As an example, in [105]Fig 2D the data related to the four possible combinations of mixture E/mixture D pair were reported. Inspection of the results showed a very regular distribution of data points for the CBN(P) method, with more than 75% of values having an associated CV of 25% or less. NSAF and Rsc displayed a slightly more dispersed distribution of Fold Change values, while CBN(S) showed the highest dispersion of data. Then, we moved to evaluate the statistical relevance of proposed methods, through the calculation of False Discovery Rate (FDR). An unpaired Student's t-test was performed to extrapolate the E. Coli statistically significant Differentially Expressed Proteins (DEPs): the latter represented the pool of false positives, since no variations were expected in E. Coli proteins. The FDRs, calculated as the percentage of false positives of total identified proteins, resulted in 5.1% for Rsc, 6.1% for both CBNs, and 7.6% for NSAF. These values are very close to the reference one (5%), indicating a high reliability of all methods, excepted NSAF, which showed the highest value of false positives. Furthermore, for each method the values of Fold Change cutoffs were estimated, treating the standard mixtures A—C and D—E as a couple of samples to be compared (A and C vs D and E). The FC cutoffs were calculated for each method by evaluating the dispersion of FCs of the unchangeable proteins of E. Coli proteome, considered not statistically significant according to unpaired Student's t-test (p<0.05). The FC thresholds were defined by the lower Q1-(IQR*1.5) and upper Q3+(IQR*1.5) extremes, where Q1, Q3 represented 25 and 75 percentiles respectively, and IQR is defined as the inter-quartile distance. The obtained FC cutoffs were summarized in [106]Table 1: Table 1. Methods fold changes cutoffs. METHOD Lower FC value Upper FC value CBN(P) 0.79 1.20 CBN(S) 0.49 1.53 NSAF 0.45 1.60 RSC 0.51 1.51 [107]Open in a new tab Fold Changes cutoffs calculated for each method are reported, indicating upper and lower significant values. These findings showed very similar results for CBN(S), Rsc, and NSAF, while CBN(P) performed quite differently. In particular, its narrowest cutoffs again confirmed that CBN(P) is the most reliable method in the detection of slight differences in protein expression levels (low FCs), which in biological samples might be relevant. Comparison of normalization methods for the quantification of the six standard proteins spiked within the E. Coli proteome The different normalization methods were then evaluated in the quantification of the Fold Changes of the six standard proteins spiked within the E. Coli proteome. The Fold Change values were determined by all methods for the six standard proteins spiked within the E. Coli proteome in all mixtures ([108]S3 Table). The performances of the different normalization methods were evaluated for the quantification of BSA, HBA, HBB, and PYG proteins, whose concentration was constant in all mixtures, and for quantification of ENO and ADH, whose amount changed in the samples. [109]Fig 3A reports plots showing the distribution of the logarithmic Fold Change for unchangeable proteins obtained with the four normalization methods. All methods displayed similar results indicating a general high accuracy in the FC measurements, as mainly demonstrated in BSA and HBB, where the mean values agree to the true value (FC = 0). In HBA the best fit occurred in CBN(P) method, confirming very good performances in terms of accuracy and precision. However, almost all measured values scattered within 0.3 logarithmic units from the theoretical values, corresponding to an absolute Fold Change of 1.2, which is reasonably considered as an unchanged amount of protein. Fig 3. Fold change analysis comparing all spectral counts normalization methods. [110]Fig 3 [111]Open in a new tab (A) The mean Fold Change values for unchangeable proteins (BSA, HBA, HBB, and PYG) within the ten different pairs of mixtures calculated by the four normalization methods. The theoretical values are 0 and it is indicated by a point line. (B) Comparison of the experimental and theoretical Fold Change values for the relative quantification of ENO and ADH obtained by using the SpC-based methods and reported in the logarithmic scale. (C) Table reporting linear regression best-fit values for all ENO and ADH linear regressions. The linear correlations between experimental and theoretical values obtained for all different normalization methods were investigated for ENO and ADH, whose concentrations were changeable ([112]Fig 3B). Very similar correlation values were obtained for both proteins in the calculation of relative abundance determined by NSAF, Rsc and CBNs, suggesting that all methods are equally able to correctly evaluate differences in the amount of proteins. For ENO, linear correlations were confirmed also decreasing 10 times the concentration of the lowest point reported in [113]Fig 3B (data not shown), confirming a good behavior for all methods. In conclusion, the quantitative measurement using standard mixtures demonstrated that both the newly proposed CBN methods resulted reliable in the estimation of all expected FCs, either when the protein amount changed or when it was constant, for which they showed a very good adherence to theoretical data even in a complex mixture. Application of CBN methods to the differential proteomics analysis of samples from cerebral cortex from Huntington’s Disease mice Once the effectiveness of the newly developed SpC-based normalization methods was defined, the two CBN approaches were challenged with the analysis of more complex samples, in which the expression level of a large number of proteins is expected to significantly change. A label-free differential proteomics experiment was performed on the brain cortex proteome from mice affected by Huntington’s Disease (HD) (homozygous mutant HTT knock in line zQ175) [[114]40] and corresponding control samples (WT). Equal amounts of total protein extracts from three wild type and three mutant mice cortices were separated by SDS-PAGE ([115]S2 Fig) and each of the six lanes was cut in 29 slices that were in situ hydrolyzed with trypsin. The corresponding 174 peptide mixtures were then analyzed by nanoLC-MS/MS in duplicates. The resulting set of 348 LC-MS/MS raw data (available via ProteomeXchange with identifier PXD017471) was processed by MaxQuant for protein identification and quantification ([116]S4 Table). The set of unnormalized SpCs data was then elaborated by the newly developed CBN methods (CBN(P) and CBN(S)). The other methods (NSAF and Rsc) were used for comparison. The lists of proteins were then filtered according to their differential expression and statistical significance (FDR<5%) by using the Multi experiment Viewer, as described above. Statistically significant up- and down-regulated proteins, identified by each method according to FC cut off values previously calculated, were reported in the histogram in [117]Fig 4A. It is surprising to note that the two CBN methods led to the identification of a quite different number of statistically significant proteins. In CBN(S) as well as Rsc, it was nearly 140, whereas the CBN(P) as well as the NSAF approach led to the identification of only about a hundred of significant proteins. Fig 4. Visualization of identified proteins with all methods in the WT and HD mouse model. [118]Fig 4 [119]Open in a new tab (A) The histogram shows for each method the number of proteins that appeared to be both differentially expressed and statistically significant. (B) Venn diagram referred to all proteins. The central area represents the proteins common to all methods. In particular, the specific and shared proteins were highlighted in a Venn diagram ([120]Fig 4B). Details of statistically significant identified proteins were summarized in [121]S5 Table. The reliability of the proposed normalization methods was further confirmed by quantification of some selected proteins by two independent methodologies, i.e. western blot analyses and Multiple Reaction Monitoring (MRM) mass spectrometry analysis. In western blot assays ([122]Fig 5A), the expression levels of IRGM1, OSBPL2 and SERPIN B6, which were up-regulated in HD mice and hnRNP H, HTT, SAMM50, UQCRQ and HOMER1, whose expression was instead decreased in HD mice were evaluated on total protein extracts of three WT and three HD mice (the same samples employed for the proteomic experiment), in duplicate. Fig 5. Validation of a selected group of proteins differentially expressed in WT and HD mouse model. [123]Fig 5 [124]Open in a new tab (A) Western blot assays performed on total protein extracts from three mutant (zQ175(1), zQ175(2), zQ175(3)), and three wild-type (wt(1), wt(2), wt(3)) mice with antibodies against the selected proteins. β-actin was used for normalization. (B) MRM superimposed traces of transitions of one of CAMK2A proteotypic peptide reported for zQ175 (left panel) and WT (right panel) together with one ACTIN peptide transition. (C) Densitometric analysis of data from the western blot of panel A. The indicated values in the graph represent the percentage of arbitrary units compared to WT to which 100% was assigned. Results are represented as the as mean ± SD (standard deviation). The statistical significance was evaluated by parametric (Welch’s) or non-parametric (Mann-Whitney) tests when data failed the Shapiro–Wilk normality test. * p < 0.05, ** p < 0.01 *** p < 0.001, **** p < 0.0001. (D) Fold Change measured by MRM analysis of pooled HD and WT samples, respectively. Upon the densitometric analysis, each sample was normalized with the intensity of β-actin developed on the same membrane, except HTT normalized with the intensity of vinculin. The normalized intensities were then evaluated by univariate statistical analysis and the results summarized in [125]Table 2 and graphically reported in [126]Fig 5C. Table 2. Summary of proteins FCs validated by western blot and/or MRM. Protein FC MRM FC WB FC CBN(S) FC CBN(P) FC RSC FC NSAF HTT 0.59 0.42 0.08 0.59 0.14 / CAMK2A 0.44 / / 0.67 / / SAMM50 0.80 0.66 0.44 0.69 0.47 / ATP2A2 0.69 / / 0.69 / / HOMER1 0.60 0.72 0.45 0.71 0.45 / IRGM1 / 1.93 19.73 1.22 4.09 UP APO-A4 1.27 / / 1.29 / / RIC8A 1.30 / 4.46 1.30 / / STIP1 1.35 / 2.38 1.32 2.11 2.50 OSBPL2 1.32 1.18 6.37 / 2.92 / SERPIN B6 1.33 1.99 2.55 / 2.02 2.75 HNRNP H / 0.46 0.30 / / / UQCRQ / 0.71 0.36 / / / [127]Open in a new tab In the table proteins selected for validation and their FC values calculated by LF, MRM, and WB methods are reported. Since in several cases the differences in expression levels were about 20–30%, the western blot technique might not be enough reliable in detections on a restricted number of replicates. Therefore, we integrated validation experiments by quantifying some proteins by using a tandem mass spectrometry methodology: the Multiple Reaction Monitoring (MRM). HTT, SERPIN B6, HOMER1, OSBPL2, and SAMM50 were also monitored by MRM, together with other proteins, APO-A4, CAMK2A, STIP1, RIC8A and ATP2A2, whose antibodies were not available or not efficient in western blot analyses. ACTIN was monitored too and employed as an internal standard for normalization procedure. In detail, three zQ175 and WT cortex brain samples (the same samples employed for the proteomic experiment) were pooled respectively, and few micrograms were employed for a shotgun proteomics experiment by using the MRM targeted approach. Peptide mixtures were separated by nanoLC-MS/MS in duplicate and two or more transitions of at least two proteotypic peptides were monitored for each of the above proteins and employed for relative protein quantification. Areas of monitored transitions ([128]Fig 5B, [129]S6 Table) were measured, mediated among transitions, and all peptides belonging to the same protein, employed to calculate FC, which were normalized with FC measured for ACTIN in each couple of runs. FC results are summarized in [130]Table 2 and graphically reported in [131]Fig 5D. A strong agreement among all measured FCs emerged by comparing the results obtained by western blot and MRM, confirming the reliability in the validation of both methods. Moreover, for all proteins, the variation trends detected by LF approaches were confirmed by both methods ([132]Table 2) also in the detection of slight FCs. In detail, levels of HTT protein are significantly reduced in HD mice compared to controls, according to previously observed [[133]47]. This protein was identified by Rsc and CBN approaches as statistically significant, although the three methods measured very different FC values: CBN(S) FC = 0.08, CBN(P) FC = 0.59, Rsc FC = 0.14. By comparing these values with those obtained by western blot (0.42) and MRM (0.59), the FC measured by CBN(P) was the only perfectly in agreement with both values. This finding was not a random or a sporadic event, but it was recurrent in all proteins, from those quantified also in other methods (HTT, SAMM50, HOMER1, IRGM1, RIC8A, STIP1, OSBPL2, UQCRQ) to those statistically significant only for CBN(P) (CAMK2A, APO-A4, ATP2A2) ([134]Table 2). hnRNP H and UQCRQ are proteins identified as statistically significant only by CBN(S), while SERPIN B6 varied significantly for all methods excepted CBN(P). hnRNP H and SERPIN B6 were endorsed by western blot assays, founding their densitometric mean values statistically significant. Surprisingly, FC values calculated by CBN(S) for these proteins strongly agreed with those obtained from the other methods. Altogether these data confirm that the quantification of results obtained by normalization of the differential proteomic data performed by the CBN methods is reliable, although CBN(P) was confirmed to be the best in terms of precision and reliability in comparison with CBN(S) and the other investigated methods. The results of the differential proteomics experiments were then evaluated on a functional basis by gathering the differentially expressed proteins within biological functional networks. A bioinformatic analysis involving all the differentially expressed proteins identified by all methods were carried out using KEGG pathways and the STRING databases. The results are summarized in [135]Table 3 where the main pathways showing FDR<0.05 are reported. Functional classes showing the lower FDR values include metabolic pathways, oxidative phosphorylation, and neurodegenerative diseases (Parkinson's Disease, Huntington's Disease, Alzheimer's Disease), confirming the effectiveness of the newly developed methods not only in terms of sensitivity in the detection of single up- or down-regulated proteins, but also looking at the biological meaning of aggregated results. Table 3. List of functional pathways identified by STRING analysis. Pathways Description Observed Gene Count (NSAF) FDR Observed Gene Count (RSC) FDR Observed Gene Count (CBN(P)) FDR Observed Gene Count (CBN(S)) FDR Metabolic pathways 19 2.62E-09 42 1.10E-09 22 2.30E-06 42 1.51E-10 Oxidative phosphorylation 7 0.0037 14 9.68E-09 7 0.000105 12 7.09E-07 Parkinson's disease 9 0.00049 14 1.77E-08 8 1.53E-05 12 1.11E-06 Huntington's disease 9 0.00156 14 2.78E-07 9 9.82E-06 13 1.24E-06 Alzheimer's disease 9 0.00111 13 8.40E-07 7 0.000397 12 3.94E-06 Calcium signaling pathway 8 0.00412 7 0.0328 7 0.000566 8 0.0109 Val, Leu and Ile degradation 6 0.00049 5 0.00648 3 0.0269 5 0.00578 cGMP-PKG signaling pathway 8 0.0037 8 0.0094 5 0.02 7 0.0247 Fatty acid degradation 5 0.00194 4 0.0254 3 0.0213 4 0.0223 [136]Open in a new tab Statistically significant proteins identified by each method were analyzed by STRING, a bioinformatic tool for the functional clusterization in protein networks; all pathways are reported in descending order of FDR. Discussion Label-Free methods for differential proteomics investigations generate a large amount of data that needs to be normalized, i.e. corrected for instrumental or uncontrollable variations before becoming employable for quantitative purposes. The choice of suitable normalization formula is a not trivial aspect in these approaches, because it might strongly affect the experimental results. For Spectral Counts based Label-Free approaches, normalization procedure has been assessed in different ways, resorting in some cases to the use of fixed and arbitrary corrective factors, such as in Rsc formula [[137]33], to overcome biases associated with the presence of missing values in the analysis, as occurs in NSAF method [[138]39] lacking of adjustment elements. In light of these considerations, we decide to test new formulas for SpCs data normalization, in which the corrective factor was not an arbitrary choice of the operator, but strictly and directly defined by an intrinsic property of the system under investigation, such as its complexity. A new approach defined Complexity Based Normalization (CBN) was introduced for the normalization of Spectral Counts data in differential proteomics experiments to calculate the relative protein Fold Changes. The new formula employed a complexity-based adjustment factor generating two different methods, CBN(P) where f = 1/P and CBN(S) in which f = 1/t with P and t referring to the total number of identified proteins (P) or the total spectral counts in the LC-MS/MS analysis (t), respectively. The performances in terms of reliability, precision, accuracy, linearity, and sensitivity of the two CBN methods in the analysis of raw data from differential proteomics experiments carried out according to the label-free strategies were evaluated in comparison with existing SpC-based normalization approaches, NSAF and Rsc. Methods evaluation was carried out by analyzing a set of samples consisting of six different standard proteins spiked in an E. Coli total tryptic digest matrix and the capability of the methods to quantify both the total matrix and the spiked proteins were monitored. Both CBN(P) and CBN(S) showed very good reproducibility in the analysis of the technical replicates and were able to correctly quantify the unchanged amount of E. Coli proteins. When the accuracy of all methods was compared, CBN(P) showed the best behavior, while the CBN(S) performance was comparable with the one showed by NSAF and RSC methods; a similar trend has been detected also for unchangeable spiked proteins. In addition, other differences could be observed between all methods when the coefficient of variation in the quantification of E. Coli proteins was examined. Although both CBN methods displayed a low dispersion of data and low CV, CBN(P) displayed a better performance than CBN(S) suggesting that the precision is affected by the size of the adjustment factor. The evidence that the choice of the adjustment factor value might affect performances of SpC-based normalization approaches is confirmed also for NSAF and R[SC] methods. In fact, NSAF showed low reproducibility greater dispersion of data and very high CV value especially in the quantification of the lowest abundant proteins. This is likely due to the absence of any adjustment factor in this method making NSAF very susceptible to quantitative biases caused by the absence of data. The Rsc method showed low reproducibility in the quantification of lowest abundance proteins in the quantitative analysis of E. Coli proteome but low coefficient of variation in the quantitation of spiked proteins suggesting again that a fixed adjustment factor "f" might affect the analysis of a large set of data. Otherwise, all quantitative methods were characterized by a wide linearity concentration-range when changeable spiked proteins were quantified, proving a response directly proportional to the protein amount in a wide concentration range. When CBNs methods were tested on biological complex samples (protein extracts from the brain of HD mice vs WT), both methods generated reproducible, accurate, sensible, and reliable quantitative data. Their FC trends were always confirmed by western blot and/or MRM validations, suggesting that a complexity-based adjustment factor properly works in the correction of output data, leading to suitable final quantitative measurements of protein expression variations. However, the two CBN methods performed differently in the Fold Change calculation. Although CBN(P) recognized a lower number of changeable significant proteins, it has proven to be the most reliable method to appreciate minimal statistically significant changes in protein expression levels, since its FC cutoff was the narrowest among those calculated for other methods. Moreover, when the FC values calculated by all methods for selected proteins were compared with those calculated by western blot and/or MRM, a surprisingly perfect accordance was found solely with FCs derived from CBN(P), suggesting that this method performs as the highest reliable and accurate among all. CBN(S), in most cases, performed more like Rsc and NSAF than CBN(P). Nevertheless, CBN(S) method showed a higher sensitivity in the detection of statistically relevant FC of low abundant proteins that give rise to a low number of SpCs, such as UQCRQ and hnRNP H. These proteins were statistically significant solely for CBN(S), but their FCs were confirmed by western blot assays with a good accordance. Differences in performances between CBN(P) and CBN(S) might be due to the dimension of the adjustment factor, which is not negligible, considering that the difference between total SpCs and total proteins is about two orders of magnitude (in mice analysis, total SpCs = 2.45x10^5 and P = 2860). Moreover, over-representation analysis of CBNs data carried out by STRING reveals that the most of proteins flows in molecular pathways affected in HD brain tissue and other neurodegenerative model systems, thus confirming the robustness and the biological coherence of data provided by these new methods. Indeed, processes such as energy metabolism [[139]5,[140]48–[141]51], oxidative phosphorylation and mitochondria functionality [[142]52–[143]54], calcium homeostasis [[144]55,[145]56], already known to be deregulated in cortical samples from HD mice, were identified, thus indicating the highest correlation level of functional data. In conclusion, in this study, we demonstrated how all SpCs normalization methods are strongly affected by the presence/absence or by the value of adjustment factors “f”. The presence of a correction factor allows overcoming the effect of the absence of data in FC calculation and leads to methods with the lowest coefficient of variation. Moreover, the association of the "f" with sample complexity makes the operator free from the choice of the best value for the correction factor. Our data have shown that CBN(S) and CBN(P) are both a viable alternative to other existing SpC-based quantification methods. Furthermore, both CBN(S) and CBN(P) are two sensitive methods, although each shows a different reactivity: CBN(P) is capable of appreciating small but statistically and biologically significant FC variations in proteins well represented in the proteome; on the contrary, the CBN(S) enhances the differences in levels of protein expression in lower abundant proteins. Finally, if someone would ask which methods we prefer, our choice falls on CBN(P), because it performs as the most reliable and sensitive “sensor” for FCs. In fact, from our point of view, it is preferable to work on a lower number of data which, however, are the result of more stringent and reliable selection standards, rather than getting lost in a sea of data often difficult to interpret from a biological point of view. Supporting information S1 Fig. Median values for the best fitting slopes calculated for each pair of technical replicates including mixture B for all samples analyzed with each normalization method. (PDF) [146]Click here for additional data file.^ (134.1KB, pdf) S2 Fig. Images of the SDS-PAGE of mice samples. Three biological replicates cortices of zQ175 and WT mice were loaded and gel slices were cut following the scheme reported. (PDF) [147]Click here for additional data file.^ (181.3KB, pdf) S3 Fig. Images of the entire membranes whose inserts are reported in [148]Fig 5A. Full-length western blots. Technical replicates of HTT, IRGM1, Homer1, OSBPL2, hnRNP H, UQCRQ and Samm50 in zQ175 and WT mice were developed with same antibodies. Technical replicates for SerpinB6 were developed with two different antibodies, as reported below the images. (PDF) [149]Click here for additional data file.^ (697.3KB, pdf) S1 Table. Details of the qualitative and quantitative mass spectrometric analysis of standard mixtures spiked in standard E. Coli proteome. In sheet 1, raw data generated by MaxQuant following the bioinformatic analysis for protein identification and quantification are reported. The processed protein groups applying each SpCs method normalization formulas are disclosed in sheets 2, 3, 4, and 5. (XLS) [150]Click here for additional data file.^ (1.2MB, xls) S2 Table. Linear regression best-fit value referred to MixA. Included is Slope, Y-intercept, and X-intercept calculated by NSAF, RSC, CBN(P), and CBN(S) methods. (XLS) [151]Click here for additional data file.^ (25KB, xls) S3 Table. Fold Change value (FC) and expected Fold Change (exp FC) value by NSAF, RSC, CBN(P) and CBN(S) methods. Included is Fold Change value (FC) and expected Fold Change (exp FC) value of standard proteins (HBA, HBB, BSA, ADH, PYG, and ENO) measured and/or calculated in B/A, C/A, D/A, E/A, C/B, D/B, E/B, D/C, E/C and E/C mixture by using NSAF, RSC, CBN(P), CBN(S). (XLS) [152]Click here for additional data file.^ (44.5KB, xls) S4 Table. Details of the qualitative and quantitative mass spectrometric analysis of zQ175 and Wild-Type mouse HD model. In sheet 1, raw data generated by MaxQuant following the bioinformatic analysis for protein identification and quantification are reported. The processed protein groups applying each SpCs method normalization formulas are disclosed in sheets 2, 3, 4, and 5. (XLS) [153]Click here for additional data file.^ (22.6MB, xls) S5 Table. Statistically significant identified proteins. In the table are reported the FCs and p-values of statistically significant identified proteins (p value<0.05) and FDR (<5%). Protein names, gene names, Uniprot codes and peptides are also indicated as headers. (XLSX) [154]Click here for additional data file.^ (58.6KB, xlsx) S6 Table. Summary of MRM results. In the table are reported the FCs values of proteins validated by Multiple Reaction Monitoring (MRM). Run, protein names, peptide sequences, Collision Energy, m/z of precursor ions, m/z of fragment ions, total transition area, peptide FC, and averaged and normalized FC for each protein. (XLSX) [155]Click here for additional data file.^ (37.5KB, xlsx) Acknowledgments