ABSTRACT In this study, we evaluated the diagnostic value and molecular characteristics of plasma extracellular vesicles (EVs)-derived miRNAs for patients with solitary pulmonary nodules (SPNs), particularly ground-glass nodules (GGNs). This study was registered at [64]www.clinicaltrials.gov under registration number [65]NCT03230019. Small RNA sequencing was performed to assess plasma EVs miRNAs in 59 patients, including 12 patients with benign nodules (2017, training set). MiRNA profiles of 40 an additional individuals were sequenced (2018, validation set). Overall, 16 pure GGNs, 21 mixed GGNs, and 42 solid nodules were included, with paired post-operative plasma samples available for 20 patients. The target miRNA/reference miRNA ratio was used to construct a support vector machine (SVM) model. The SVM model with the best specificity showed 100% specificity in both the training and validation sets independently. The model with the best sensitivity showed 100% and 96.9% sensitivity in the training and validation sets, respectively. Principal component analysis revealed that pure GGN distributions were distinct from those of solid nodules, and mixed GGNs had a diffuse distribution. Among differentially expressed miRNAs, miR-500a-3p, miR-501-3p, and miR-502-3p were upregulated in tumor tissues and enhanced overall survival. The SVM classifier accurately distinguished malignant GGNs and benign nodules. The distinct profile characteristics of miRNAs provided insights into the feasibility of EVs miRNAs as prognostic factors in lung cancer. KEYWORDS: Ground-glass nodule, extracellular vesicles, microRNA, biomarker, support vector machine Introduction Implementation of lung cancer screening using low-dose computed tomography (CT) has increased the rate of detection of small pulmonary nodules [[66]1–[67]3]. Nodules that demonstrate ground-glass opacity on CT, including pure ground-glass nodules (pGGNs) and mixed ground-glass nodules (mGGNs), are particularly challenging to diagnose owing to their malignant potential and heterogeneous characteristics [[68]4,[69]5]. Hence, noninvasive diagnostic methods and risk stratification approaches for pulmonary GGNs need to be developed. Extracellular vesicles (EVs), comprising nucleotides and proteins, are present in various body fluids [[70]6–[71]9]. EVs contents in body fluids may have applications as diagnostic biomarkers; however, their sensitivity is still unclear [[72]10,[73]11]. Exosome-associated glypican-1 is a diagnostic biomarker for early pancreatic ductal adenocarcinoma, with absolute specificity and sensitivity[[74]12]. Moreover, glypican-1-positive circulating exosomal levels are correlated with tumor burden and patient survival before and after surgery. Circulating exosomal microRNAs (miRNAs) may serve as potential diagnostic biomarkers for non-small cell lung cancer (NSCLC) [[75]13–[76]17]. Moreover, Jim et al. established a high-throughput sequencing method for exosomal miRNAs in patients with stage I NSCLC and healthy volunteers; a set of adenocarcinoma-specific and squamous cell carcinoma-specific candidate miRNAs was identified[[77]18]. However, these studies have several limitations, including the selective bias inherent to retrospective studies and less rigorous identification methods[[78]19]. Additionally, no controlled studies have compared the differences between malignant solitary pulmonary nodules (SPNs) and benign diseases. Accordingly, in this prospective study, we aimed to determine the diagnostic value and molecular characteristics of EVs miRNA profiles among malignant SPNs (particularly GGNs), benign nodules, and paired post-operative plasma samples. Materials and methods Study design Consecutive patients with SPNs were included in this prospective study between July 2017 and March 2018 at Guangdong Provincial People’s Hospital. Blood samples were collected from patients in vacutainer tubes containing an anticoagulant (REF367863; Becton Dickinson, Franklin Lakes, NJ, USA) before surgery. Furthermore, the interval for postoperative blood sample collection ranged from 1 week to 1 month. Patients enrolled in 2017 were grouped into the training set, whereas patients enrolled in 2018 were grouped into the validation set. This protocol was approved by the ethics committee of Guangdong Provincial People’s Hospital and registered at [79]www.clinicaltrials.gov on 26 June 2017 (registration number: [80]NCT03230019). All patients provided written informed consent for the collection of plasma samples and for their pathological data to be used in this study. Plasma isolation Blood samples included in this study were collected in 10-mL vacutainer tubes with K2EDTA anticoagulant (REF367525; BD, USA), gently inverted to mix eight times, stored with the tubes remaining upright, and then shipped at 4°C within 1 h after collection. After analysis, plasma samples with hemolysis levels of less than 4 were used in this study. To harvest plasma, blood samples were centrifuged at 1500 × g for 15 min at 4°C, and each 1-mL fraction of the supernatant was transferred into a fresh 1.5-mL tube and stored at −80°C before use. Plasma EV isolation EVs were isolated using 3D Medicine isolation reagent (L3525; 3DMed, Shanghai, China), a polyethylene glycol-based method. This EV isolation reagent has been modified and improved from the work of Rider et al[[81]20]. and has been registered to the National Medical Products Administration as a Class I medical device (#HMXB20190091) specifically for isolation of EVs in the clinical setting. Briefly, plasma samples were centrifuged at 12,000 × g for 10 min at 4°C after incubation in a water bath. Supernatants were equilibrated to ambient temperature, filtered with a 0.45-μm filter, and then filtered with a 0.22-μm filter. The filtered supernatant was then transferred to fresh 1.5-mL tubes, and one-fourth volume of isolation reagent (L3525; 3DMed, Shanghai, China) was added and mixed by inverting the tubes several times. The mixture was incubated overnight at 4°C and centrifuged at 4700 × g for 30 min at 4°C to obtain EVs precipitates. The isolated EVs were resuspended in 200 μL phosphate-buffered saline (PBS). Western blotting Protein extraction was performed using EV isolation reagent (N3525; 3DMed, Shanghai, China) from 120 μL plasma, and EVs were homogenized in 100 μL RIPA lysis buffer with proteinase inhibitors (P0013B; Beyotime, Shanghai, China) on ice for 30 min. Samples were then centrifuged at 12,000 × g for 10 min at 4°C, and 80 µL supernatant was combined with 20 µL SDS-PAGE Sample Loading Buffer 5× (P0015; Beyotime). The mixtures were then incubated for 10 min at 100°C. Protein samples were separated via sodium dodecyl sulphate-polyacrylamide gel electrophoresis on 4–20% gels (Bio-Rad, Redmond, WA, USA), electroblotted onto polyvinylidene difluoride membranes (Millipore, Billerica, MA, USA), and then incubated with primary anti-CD9 antibody (diluted 1:500; cat. no. 13,174; Cell Signaling Technology, Danvers, MA, USA), anti-Alix (3A9) mouse monoclonal antibody (diluted 1:500; cat. no. 2171; Cell Signaling Technology, Danvers, MA, USA), anti-Syntenin antibody (diluted 1:500; cat. no. ab19903; Abcam, Cambridge, UK), anti-TSG101 polyclonal antibody (diluted 1:500; cat. no. abs115706; Absin Bioscience Inc., Shanghai, China), and anti-Calnexin antibody (diluted 1:1000; cat. no. 2679; Cell Signaling Technology, Danvers, MA, USA) at ambient temperature for 2 h. Horseradish peroxidase-conjugated anti-rabbit IgG and anti-mouse IgG were used as the secondary antibodies (diluted 1:5000; cat. no. 7074 and 7076; Cell Signaling Technology, Danvers, MA, USA). Antibody binding was detected using an enhanced chemiluminescence system in accordance with the manufacturer’s protocol (Tanon-5200Multi; Shanghai, China). Proteins extracted from plasma were used as a negative control. Protein levels were calculated from three independent experiments using western blotting. EVs samples from approximately 10 µL plasma were analyzed for Alix, TSG101, syntenin, CD9, and Calnexin levels. Nanoparticle tracking analysis (NTA) To track the number and size of EVs, a Nanosight NS 300 system (NanoSight Technology, Malvern, UK) was used [[82]21,[83]22]. Isolated EVs were resuspended in PBS at a concentration of 5 μg/mL and were further diluted 100- to 1000-fold, to achieve between 20 and 100 objects/frame. Samples were manually injected into the sample chamber at ambient temperature. Each sample was configured with a 488-nm laser and a high-sensitivity scientific complementary metal-oxide semiconductor camera, and measurement were performed in triplicate at camera setting 13 with an acquisition time of 30 s and a detection threshold setting of 7. At least 200 completed tracks were analyzed and obtained per video. Finally, nanoparticle tracking data were analyzed for EVs samples using the NTA analytical software (version 2.3). Scanning electron microscopy (SEM) For SEM analysis, EVs were resuspended in PBS and fixed in 5% glutaraldehyde. After washing with PBS for 5 min, the EVs were immobilized in 1% OsO[4] in PBS and dehydrated with a series of ethanol concentrations (40%, 60%, 80%, and 96–98%). After the ethanol was evaporated, the samples were allowed to dry at ambient temperature for 24 h on Si substrate and then analyzed via SEM (Hitachi High-Technologies, Tokyo, Japan) after gold-palladium sputtering. RNA isolation from plasma evs RNA was isolated from EVs using a miRNeasy Serum/Plasma Kit (Qiagen, Valencia, CA, USA) in accordance with the manufacturer’s protocol. Briefly, 700 μL Qiazol was added to the isolated EVs in 200 μL PBS, vortexed, and incubated. Thereafter, 90 μL chloroform was added to the mixture, and the samples were vortexed again, incubated, and centrifuged at 12,000 × g, for 15 min at 4°C. The upper aqueous phase was then transferred to a new tube and supplemented with 100% ethanol at twice the original volume. Thereafter, the samples were loaded onto a spin column with a collection tube and centrifuged at 8000 × g for 15 s. The flow-through was discarded, and the spin column was rinsed twice with a wash buffer and air-dried. The recovered EVs RNA was eluted in 15 μL RNase-free water. To control the quality of miRNAs, the yield and distribution were analyzed using an Agilent 2100 bioanalyzer with Small RNA Chips (5067–1548; Agilent, Technologies, Inc., Santa Clara, CA, USA). Preparation and sequencing of small RNA libraries Small RNA library preparation for high-throughput sequencing was carried out using a NEBNext Small RNA Library Prep Set for Illumina (New England BioLabs, Inc., Ipswich, MA, USA).Thereafter, the small RNA library was purified using a NucleoSpin Gel and PCR Clean-up kit (Qiagen, Shanghai, China). The resulting purified library was analyzed for purity on an Agilent 2100 Bioanalyzer (Agilent Technologies, Inc., Santa Clara, CA, USA) and then subjected to single-ended, strand-specific sequencing on an Illumina HiSeq X10 platform. Bioinformatics analysis of mirna-seq data The 3′ adaptors of reads were cleaved using a custom programme. Subsequently, the reads were aligned to human genome hg19 assembly ([84]http://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/) by BWA 0.7.12-r1039[[85]23]. The annotations were generated from Gencode[[86]24] v25 and miRBase[[87]25] v21 for statistical analysis and to determine expression levels. miRNA expression was calculated by counting the number of reads mapped to regions annotated by mature miRNAs. To enable cross-sample comparisons, miRNAs expressed (mapped by at least one read) in all samples were considered housekeeping genes. The size factors were calculated on the basis of the 75th percentile of the housekeeping genes described by DESeq2[[88]26], and expression levels were normalized by dividing the raw read counts by size factors. Statistical analysis To scale for miRNA expression, principal component analysis (PCA) was performed, wherein the normalized expression levels of each miRNA were scaled by dividing the expression levels by the maximum expression levels of the miRNAs. PCA was then performed on the matrix constructed by the scaled expression levels, using the prcomp function in R 3.3.3. Pathway enrichment of the experimentally validated targets of miRNAs was performed using mirPath[[89]27] v.3, which provided EASE score and false discovery rates using Fisher’s exact tests and unbiased empirical distributions. Kaplan-Meier plot analysis of The Cancer Genome Atlas (TCGA) data was performed using OncoLnc[[90]28]. Support vector machine (SVM) analysis was performed with default parameters of svm.LinearSVC in the sklearn package. The model kernel was “linear”, penalization was ‘L2ʹ, and loss was “squared_hinge”. Differentially expressed miRNAs between the two groups were defined with two-tailed p values (t-tests) less than 0.001 and having log[2] fold-changes (group means) greater than 1 or less than – 1. Differentially expressed miRNAs between the before and after surgery groups were analyzed using pairwise t tests, whereas differentially expressed miRNAs between solid groups 1 and 2 were calculated by Mann-Whitney tests. Results with p values of less than or equal to 0.05 were considered statistically significant. Results Patients and clinicopathological data In total, 99 patients were enrolled in this study, including 20 patients (20.2%) with benign nodules. The other samples were diagnosed as adenocarcinoma, ranging from adenocarcinoma in situ (AIS) to invasive adenocarcinoma. Detailed clinical data are summarized in [91]Table 1. Ten patients were diagnosed with AIS, which is technically not a malignant disease. However, from the perspective of the pathological progression of lung adenocarcinoma, we still classified AIS as a type of malignant nodule. Table 1. Clinicopathological characteristics of samples in the training and validation sets. Characteristics Training set (n = 59) Validation set (n = 40) p value Sex 0.413  Male 24 (40.7%) 20 (50.0%)  Female 35 (59.3%) 20 (50.0%) Age (years), median (average) 60 (31–82) 60.5 (29–76) 0.646 Nodule classification 0.673  Pure GGO 9 (15.3%) 7 (17.5%)  Mixed GGO 11 (18.6%) 10 (25.0%)  Solid 39 (66.1%) 23 (57.5%) Nodule location 0.713  RUL 22 (37.3%) 12 (30.0%)  RML 7 (11.8%) 4 (10.0%)  RLL 9 (15.3%) 8 (20.0%)  LUL 12 (20.3%) 12 (30.0%)  LLL 9 (15.3%) 4 (10.0%) Nodule diameter (cm) 1.64 ± 0.66 1.61 ± 0.62 0.822 Solid component diameter (cm) 0.99 ± 0.60 1.19 ± 0.38 0.382 Pathology 0.775  AIS 7 (11.9%) 3 (7.5%)  MIA 2 (3.4%) 3 (7.5%)  IA 38 (64.4%) 26 (65.0%)  Hamartoma 3 (5.1%) 0 (0%)  Nonspecific inflammation 5 (8.5%) 5 (12.5%)  Tuberculosis 2 (3.4%) 2 (5.0%)  Fungus 1 (1.7%) 1 (2.5%)  Sclerosing pneumocytoma 1 (1.7%) 0 (0%) EGFR status 1.000  Negative 12 (25.5%) 8 (25.0%)  19 del 15 (31.9%) 9 (28.1%)  L858R 16 (34.0%) 11 (34.4%)  Uncommon mutation 2 (4.3%) 2 (6.3%)  Unknown 2 (4.3%) 2 (6.3%) ALK status 0.734  Negative 43 (91.5%) 28 (87.5%)  Positive 3 (6.4%) 2 (6.3%) [92]Open in a new tab Abbreviations: RUL, right upper lobe; RML, right middle lobe; RLL, right lower lobe; LUL, left upper lobe; LLL, left lower lobe; AIS, adenocarcinoma in situ; MIA, minimally invasive adenocarcinoma; IA, invasive adenocarcinoma. Overall, the 79 malignant samples included 16 pGGNs, 21 mGGNs, and 42 solid nodules, of which 20 were paired post-operative plasma samples. Representative imaging features and pathological information are shown in [93]Figure 1 Figure 1. Figure 1. [94]Open in a new tab Representative imaging and pathological information in different groups. Abbreviations: AIS, adenocarcinoma in situ; MIA, minimally invasive adenocarcinoma; IA, invasive adenocarcinoma; SP, sclerosing pneumocytoma. Evs characterization To characterize the isolated EVs in this study, three experiments suggested by the ISEV were performed. First, according to MISEV 2018[[95]29], we evaluated several vesicle markers, including Alix, TSG101, Syntenin, CD9, and Calnexin, in the seven representative EV samples using Western blotting ([96]Figure 2(a)). Furthermore, to characterize the concentrations and sizes of EVs in the plasma, a Nanosight NS300 system was used for nanoparticle tracking analysis. The results showed that most isolated vesicles were around 100 nm, which is the typical size of exosomes ([97]Figure 2(b)). Additionally, SEM images from two representative EV samples are shown in [98]Figure 2(c). Figure 2. Figure 2. [99]Open in a new tab Characteristic proteins and morphology of extracellular vesicles (EVs). (a) The protein levels of Alix, TSG101, syntenin, CD9, and Calnexin in the EVs of seven representative samples were assessed using western blotting. (b) Nanoparticle tracking analysis results from representative EVs samples are shown. (c) Images of EVs from two representative samples were taken by scanning electron microscopic analysis. The representative EVs morphology is highlighted by a red box. Normalization, stabilization, and combination to construct the SVM classifier Samples with mapping rates of greater than 90% were used. The mean percentages of miRNAs of total reads in the training and validation sets were both over 60%, and the reads distribution of each sample is shown in Supplementary Fig. 1A and 1B. We set 5 mapped and expressed reads as the cut-off. Because EVs miRNA may have originated from different cell types, similar to the concept of normalization during quantitative polymerase chain reaction, the object was analyzed as a ratio of the two miRNAs, wherein one miRNA was considered the “target gene” and the other miRNA was considered the “reference gene.” The prediction value was calculated by dividing the target gene expression levels by those of the reference gene for each sample. The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was calculated independently in the training set and the validation set for each miRNA ratio. To achieve better stability, miRNA ratios with the same directions in both sets and similar AUCs (difference in AUCs ≤ 0.05) were selected. To identify biomarkers with the best cut-off to differentiate benign nodules from malignant nodules across two sets of samples, two pairs of miRNA ratios were used to construct a two-dimensional space, and the SVM could be applied to determine the best cut-off to distinguish benign from malignant nodules. The sensitivity and specificity of each pair of miRNA ratio combinations are highlighted in [100]Table 2 in the order of sum diagnostic value. Table 2. Separate sensitivity and specificity of SVM models in the training and validation sets in the order of sum area under the curve (AUC; top 10). Pair 1 __________________________________________________________________ Pair 2 __________________________________________________________________ Training set __________________________________________________________________ Validation set __________________________________________________________________ Target miRNA Ref miRNA Target miRNA Ref miRNA Sensitivity Specificity Sensitivity Specificity miR-185-5p miR-32-5p miR-140-3p let-7f-5p 0.85 0.75 0.59 1.00 let-7c-5p miR-425-3p miR-26a-5p miR-23a-3p 0.79 0.83 0.56 1.00 let-7c-5p miR-185-5p miR-4433b-3p miR-1468-5p 0.79 0.92 0.59 0.88 miR-191-5p miR-32-5p let-7a-5p miR-103a-3p 0.77 0.83 0.53 1.00 miR-142-5p let-7a-3p let-7g-5p miR-425-3p 0.91 0.67 0.53 1.00 let-7a-5p miR-505-5p miR-199a-3p miR-99b-3p 0.98 0.75 0.63 0.75 miR-185-5p miR-378i miR-495-3p miR-99b-3p 0.66 1.00 0.44 1.00 let-7a-5p miR-550a-5p miR-199a-3p miR-760 0.96 0.67 0.59 0.88 let-7a-5p miR-505-5p miR-139-3p miR-185-3p 0.77 0.92 0.66 0.75 let-7c-5p miR-7976 miR-15b-5p miR-486-3p 0.72 0.92 0.44 1.00 [101]Open in a new tab As shown in [102]Figure 3, miR-185-5p/miR-32-5p and miR-140-3p/let-7f-5p displayed the highest diagnostic value in the SVM model, with a sensitivity and specificity of 85.1% and 75%, respectively, in the training set and 59.3% and 100%, respectively, in the validation set. Furthermore, the combination of miRNAs with the highest sensitivity was miR-27a-5p/miR-550a-5p and miR-1908-5p/miR-98-5p. The sensitivities of this combination in the training and validation sets were 100% and 96.9%, respectively. Moreover, the combination of miRNAs with the highest specificity was miR-185-5p/miR-378i and miR-495-3p/miR-99b-3p. The specificity of this combination was 100% in both the training and validation sets. Figure 3. Figure 3. [103]Open in a new tab Visualized distribution of several support vector machine (SVM) classifiers with different interests. (a) The SVM model with the highest diagnostic value. (b) The SVM model with the highest sensitivity. (c) The SVM model with the highest specificity. Malignant nodule: orange; benign nodule: blue. Characteristics of evs mirna profiles in three subsets of malignant nodules To further evaluate the potential classifications of blood EVs miRNA profiles, PCA was performed before surgery for all samples obtained from patients with malignant nodules. The pGGN samples were distributed distinctly from solid samples ([104]Figure 4(a)). pGGN samples were primarily localized in the central region, whereas solid samples were categorized into two groups. Interestingly, 11 of 12 solid nodules with small deletions on exon 19 of the epidermal growth factor receptor gene were categorized as group 1; this was significantly higher than in group 2. mGGN samples could also be categorized into the same groups as solid samples ([105]Figure 4(b)). These results revealed the complexity among malignant nodules from the profiles of blood EVs miRNAs. Figure 4. Figure 4. [106]Open in a new tab Principal component analysis (PCA). (a) PCA of pure ground-glass nodule (pGGN) samples and solid samples. (b) PCA of all malignant nodules, including pGGNs, mixed ground-glass nodules (mGGNs), and solid samples. Biological analysis of the stratification of malignant nodules Because the major contributor to the classification in PCA is PC 1, the miRNAs were ranked in accordance with their weight in PC 1. MiRNAs showing the largest contributions in PC 1 had either large positive or negative weights. Differentially expressed miRNAs between groups 1 and 2 with absolute weights larger than 0.05 were selected ([107]Figure 5(a)). Pathway enrichment analysis revealed that these miRNAs primarily targeted cancer-related pathways, such as the cell cycle and adherens junctions ([108]Figure 5(b)). Interestingly, three differentially expressed miRNAs with positive weights, namely miR-500a-3p, miR-501-3p, and miR-502-3p, showed strong hazard ratios for the survival of patients with lung adenocarcinoma in tumor tissues (Supplementary Figure 2A). In contrast, miRNAs with high negative weights did not show significant hazard ratios (Supplementary Figure 2B). Figure 5. Figure 5. [109]Open in a new tab Biological analysis of the stratification of malignant nodules. (a) Differentially expressed miRNAs between solid groups 1 and 2 according to PCA (with absolute weights larger than 0.05 were selected). (b) Pathway enrichment analysis of the differentially expressed miRNAs between solid groups 1 and 2. (c) Differences in the expression levels of miR-500a-3p, miR-501-3p, and miR-502-3p between groups 1 and 2. (d) Survival curves of samples with different expression levels of miR-500a-3p, miR-501-3p, and miR-502-3p using data from TCGA database (top 15% versus bottom 15%). miR-500a-3p, miR-501-3p, and miR-502-3p are proximally located in the intron of CLCN5 and are probably coregulated (Supplementary Figure 2C). These three miRNAs were upregulated in group 1 compared with those in group 2, with significant p values (p < 0.001, p < 0.001, and p < 0.001, respectively; [110]Figure 5(c)). Intriguingly, the upregulation of these three miRNAs in tumor tissues from TCGA was associated with better overall survival than that observed in tumor tissues with the downregulation of these miRNAs ([111]Figure 5(d)). Overall, the upregulation of these three miRNAs in the blood EVs of group 1 solid nodules was associated with improved prognosis in patients. Comparison of mirna profiles between pre- and postsurgical plasma samples A comparison of the miRNA profiles of blood EVs before and after surgery revealed nine miRNAs with significant differential expression profiles before and after surgery ([112]Table 3). Among these miRNAs, miR-320b and miR-128-3p also showed differential expression between group 1 and group 2 solid nodules ([113]Figure 6(a,[114]b)). The targets of miR-320b were enriched in the Hippo signaling pathway, which is related to cell proliferation ([115]Figure 6(c)). However, no pathways were enriched for targets of miR-128-3p. These data suggested that group 1 and group 2 solid nodules may be developed from distinct carcinogenic pathways. Furthermore, miR-500a-3p, miR-501-3p, and miR-502-3p were upregulated after surgery, with marginal p values (p = 0.058, p = 0.011, and p = 0.020, respectively), suggesting that abnormal levels of miR-500a-3p, miR-501-3p, and miR-502-3p may be associated with tumor progression ([116]Figure 6(d)). Table 3. Significant differential expression of miRNAs before and after surgery. miRNA After surgery mean Before surgery mean p value Fold change miR-340-5p 1472.9 2740.8 0.00016 0.54 miR-128-3p 15,605.1 20,923.2 0.00056 0.75 miR-2110 938.3 717.0 0.00015 1.31 miR-486-3p 1098.8 745.0 0.00051 1.47 miR-320c 879.9 590.6 0.00085 1.49 miR-320d 195.7 127.4 0.00074 1.54 miR-320b 2872.0 1610.9 0.00004 1.78 miR-320a 58,350.9 31,927.5 0.00016 1.83 miR-4732-5p 704.5 379.4 0.00003 1.86 [117]Open in a new tab Figure 6. Figure 6. [118]Open in a new tab Biological analysis of exosomal miRNAs between pre- and postsurgical plasma samples. (a) Expression levels of miR-320b and miR-128-3p before and after surgery. (b) Differences in the expression levels of miR-320b and miR-128-3p between group 1 and group 2. (c) The targets of miR-320b were enriched in the Hippo signaling pathway and adherens junctions. (d) Expression levels of miR-500a-3p, miR-501-3p, and miR-502-3p before and after surgery. Differentially expressed miRNAs between the before and after surgery groups were analyzed using pairwise t tests, whereas differentially expressed miRNAs between solid group 1 and 2 were calculated by Mann-Whitney tests. Results with p values of less than or equal to 0.05 were considered statistically significant. Discussion EVs contain numerous bioactive molecules, such as proteins and nucleic acids, from parent cells, thereby playing vital roles in intercellular communication [[119]6,[120]9,[121]30,[122]31]. Previous studies have reported the promising diagnostic value of circulating EV contents owing to their unique compositions and functions [[123]12,[124]18,[125]32]. However, these studies are inadequate because of their selection bias and small sample size. In contrast, this prospective study involved the analysis of consecutive blood samples from patients with SPNs, and the plasma EVs miRNA profiles were analyzed via high-throughput sequencing. The diagnostic candidates were further confirmed via SVM, and the potential biological characteristics of EVs miRNAs were analyzed via bioinformatics analysis. Cell-free DNA is another important circulating material that is often analyzed in studies involving liquid biopsies. An interim analysis of TCGA data with regard to liquid biopsy for early stage NSCLC was published in 2018[[126]33]. Overall, in 63 patients with stage I–IIIA NSCLC, with the precondition of 98% specificity, the sensitivities of whole-genome bisulphite assays, whole genome assays, and targeted assays were 41%, 38%, and 51%, respectively. Moreover, plasma proteins can serve as diagnostic markers for early-stage lung cancer. In the PANOPTIC trial[[127]34], the proteomics classifier attained 97% sensitivity, 44% specificity, and a 98% negative predictive value. Similarly, Li established a 13-protein classifier for pulmonary nodules, which also approached a closed negative predictive value of 90%[[128]35]. Thus, comparable diagnostic value was observed for the SVM classifier of EVs miRNA. We believed that the current results are promising for clinical practice based on the following findings. First, in this study, of all 79 malignant nodules, 17 nodules (21.5%) were smaller than 1 cm, and 43 nodules (54.4%) ranged from 1 to 2 cm. Moreover, these nodules included 16 pure GGNs and 21 mixed GGNs. In other words, the cases included in this study were stage IA lung cancer, which is the earliest stage in TNM lung cancer staging. Accordingly, most of these tumors would have been difficult to biopsy in clinical practice, representing a major challenge to be overcome in thoracic surgery. Second, we could not maintain high specificity and sensitivity at the same time for biological markers. For SPN noninvasive diagnosis, specificity is the primary consideration. As we found in this study, miRNA pairs with high specificity could achieve 100% specificity in both the training set and validation set. Additionally, miRNA pairs with high sensitivity could achieve 100% and 96.9% sensitivity in the training set and validation set, respectively. Hence, we believe that these miRNA pairs had diagnostic value and were stable. miR-500a-3p, miR-501-3p, and miR-502-3p were upregulated in group 1 upon PCA and after surgery, indicating that miR-500a-3p, miR-501-3p, and miR-502-3p were associated with tumor progression. Moreover, analysis of TCGA data revealed that upregulation of miR-500a-3p, miR-501-3p, and miR-502-3p was associated with enhanced overall patient survival. Although the occurrence of these three miRNAs in lung cancer is unclear, the tumor-promoting abilities of these miRNAs have been observed in other cancer types. Indeed, miR-500a-3p is upregulated in various human cancers and non-neoplastic diseases [[129]36–[130]39]. In liver cancer, miR-500a-3p is upregulated in liver cancer stem cells, thereby promoting cancer stem cell-like properties via activation of signal transducer and activator of transcription 3; additionally, miR-500a-3p overexpression is correlated with poor survival [[131]40,[132]41]. In gastric cancer, overexpression of the same miRNA was correlated with cancer progression and resistance to apoptosis, leading to sustained activation of nuclear factor-κB in vitro and in vivo[[133]42]. The tissue factor pathway inhibitor is repressed by miR-500 in prostate cancer, in which upregulation of this miRNA is associated with poor prognosis and overall survival[[134]43]. Results of functional analyses revealed that overexpression of miR-501-3p in HCCLM3 cancer cells inhibits cell proliferation, migration, invasion, and the epithelial-mesenchymal transition, whereas loss of miR-501-3p in PLC/PRF/5 cancer cells facilitates all of these cellular activities[[135]44]. M2 macrophage-derived exosomal miR-501-3p inhibits the tumor-suppressor TGFBR3 gene and facilitates the development of PDAC by activating the TGF-β signaling pathway[[136]45]. Importantly, miR-501-3p is also significantly downregulated in exosomes from patients with prostate cancer[[137]46]. In addition, overexpression of miR-502-3p dramatically inhibits hepatocellular carcinoma cell proliferation, metastasis, invasion, and cell adhesion[[138]47]. The contrasting association with tumorigenesis and disease outcomes may be explained on the basis of the diverse source of EVs miRNAs detected in patient plasma. EVs play important roles in cell-cell communication by transporting lipids, proteins, and nucleic acids across the extracellular space. In addition to promoting progression in tumor cells, exchange of EVs is a crucial mode of communication among dendritic cells, the most potent antigen-presenting cells[[139]48]. Among others, miR-500 and miR-501-3p are exclusively present in EVs originating from mature dendritic cells, indicating that these EVs miRNAs present in patient plasma may originate from different sources. For example, high levels of miR-500a-3p and miR-501-3p in postoperative samples may indicate activated immunity in response to tissue damage after surgery. In contrast, these changes may also be associated with sustained inflammatory responses during tumor growth[[140]49]. Several limitations in this study warrant mention. First, as a single-center study, it is uncertain whether these predictive tools are applicable to other populations with different cultures and ethnic groups. Future work with larger cohorts from multiple centers is still needed to externally validate our results. Second, we lacked prognostic information because all cases were newly diagnosed. In summary, the current study provided a stable and valuable SVM classifier model based on plasma EVs-derived miRNAs to distinguish malignant GGNs and benign nodules (highest specificity: 100%; highest sensitivity: 98.9%). Additionally, distinct characteristics of miRNA profiles among pure GGNs, mixed GGNs, and solid samples were newly discovered. We believe that the development of an effective, noninvasive diagnostic and risk stratification model could be helpful for clinicians. Funding Statement This work was supported by the National Natural Science Foundation of China [grant nos. 81673031 and 81872510]; the Health Collaborative Innovation Major Project from Guangzhou Science and Technology Bureau [grant no. 201704020161]; the Natural Science Foundation of Guangdong [grant no. S2013010016354]; the Guangdong Provincial Key Laboratory of Lung Cancer Translational Medicine [grant no. 2012A061400006]; the Special Fund for Research in the Public Interest from National Health and Family Planning Commission of PRC [grant no. 201402031]; and the Research Fund from Guangzhou Science and Technology Bureau [grant no. 2014Y2-00050]. Acknowledgments