Abstract Cnaphalocrocis medinalis (Guenée) is one of the important insect pests in rice field. Bt agents were recommended in the C. medinalis control and Bt rice is bred as a tactic to control this insect. However, the tolerance or resistance of insect to Bt protein is a main threat to the application of Bt protein. In order to investigate the response of C. medinalis transcriptome in defending a Cry1C toxin, high-through RNA-sequencing was carried in the C. medinalis larvae treated with and without Cry1C toxin. A total of 35,586 high-quality unigenes was annotated in the transcriptome of C. medinalis midgut. The comparative analysis identified 6,966 differently expressed unigenes (DEGs) between the two treatments. GO analysis showed that these genes involved in proteolysis and extracellular region. Among these DEGs, carboxylesterase, glutathione S-transferase and P450 were differently expressed in the treated C. medinalis midgut. Furthermore, trypsin, chymotrypsin, and carboxypeptidase were identified in DEGs, and most of them up-regulated. In addition, thirteen ABC transporters were downregulated and three upregulated in Cry1C-treated C. medinalis midgut. Based on the pathway analysis, antigen processing and presentation pathway, and chronic myeloid leukemia pathway were significant in C. medinalis treated with Cry1C toxin. These results indicated that serine protease, detoxification enzymes and ABC transporter, antigen processing and presentation pathway, and chronic myeloid leukemia pathway may involved in the response of C. medinalis to Cry1C toxin. This study provides a transcriptomal foundation for the identification and functional characterization of genes involved in the toxicity of Bt Cry protein against C. medinalis, and provides potential clues to the studies on the tolerance or resistance of an agriculturally important insect pest C. medinalis to Cry1C toxin. Introduction The rice leaffolder, Cnaphalocrocis medinalis (Guenée), is one of the important migratory insect pests of rice in China [[32]1–[33]2]. It widely distributes in humid tropical and temperate regions of Oceania, Africa, and Asia [[34]3–[35]4]. It scrapes folds and feeds the rice leaves causing the reduction of photosynthetic activity [[36]1,[37]5]. Heavy occurrence often happens in main rice-growing areas like China, Japan, Korea, Vietnam, Philippines, and so on [[38]3], and frequently results in great rice yield losses [[39]2]. The damage area by C. medinalis were more than 20 million hm^2 in China in the most of years ranged from 2003 to 2010 [[40]2]. A report from the Ministry of Agriculture, China showed that the damage area by C. medinalis was up to 15.5 million hm^2 in 2015 [[41]4]. So far, the control of C. medinalis mainly relied on the chemicals. However, the misuse and overuse of chemicals resulted in many negative issue such as insect resistance, resurgence, environmental pollution, human health concerned by public [[42]6]. Environment-friendly methods are recommended in the pest control. Bacillus thuringiensis, as one of the important biological agents has been applied in the pest control for more than 60 years due to its crystal protein with insecticidal activity against insect pests [[43]7–[44]9]. Cry genes derived from B. thuringiensis were successfully transferred into plants like cotton, rice and these plants harbored resistance to Lepidopteran insect pests [[45]10–[46]13]. In China, Bt agent was recommended as a friendly bio-agent to use in the control of C. medinalis and Bt rice in China was approved to release for production test in limited area in 2009, but not commercially planted[[47]11]. The application of Bt protein and Bt crop could greatly reduce the use of chemical insecticides. However, insect resistance is a big threat to the application of Bt protein and Bt crop. So far, at least eight insect pests evolved resistance against Bt crop or Bt protein [[48]14–[49]15]. Coincidentally, many insect pests evolved resistance to Bt proteins under selection with Bt protein in the laboratory condition [[50]8,[51]16]. Considering the sustainable use of the Bt protein or Bt rice, the potential of resistance evolution of C. medinalis against Bt protein will become an inescapable issue [[52]17]. Midgut, one of the important organs in the insect body, not only is a place for food ingestion and utilization, but also a place for detoxification to xenobiotics [[53]18]. Interaction of Bt protein and insect mainly occurred in midgut. Several reports revealed that resistance of insect against Bt protein were correlated with the variation of genes in midgut [[54]14,[55]19–[56]21]. Next generation sequencing (NGS) is a strong technology which could generate large volumes of sequence information [[57]22]. NGS provides us with unprecedented high-throughput and low-cost sequencing platforms applied in a variety of manners, including de novo whole-genome sequencing, resequencing of genomes to identify variations, de novo transcriptome and gene expression profiling, and detecting methylation patterns [[58]23]. Now, the Illumina/Solexa sequencing technology is dominated in the NGS market, featuring high data accuracy and a broad range of applications. In this study, transcriptome of C. medinalis midgut was sequenced and de novo assembled using Illumina sequencing technology. The gene expression patterns in the midgut were compared in C. medinalis larvae treated with and without Bt Cry1C protein. Moreover, the genes of digestion and detoxification system were given greater attention. The results provide clues on the roles of midgut in the response to Bt proteins, information for further study candidate genes involved in interaction of Bt protein and C. medinalis and promote the understanding of potential resistance or tolerance of C. medinalis to Bt proteins. Materials and methods Insect rearing Adults of C. medinalis were collected with sweep net from paddy fields in suburb of Hangzhou, Zhejiang, China in August, 2015. The moths were placed in plastic cup covered with nylon mesh and fed with 10% honey solution. Eggs laid on the mesh were removed and transferred to a box with detached leaf of 45 d-old Taichung Native 1 (TN1). The larvae that hatched from eggs were cultured in the detached leaf then used in the experiment. All insect cultures were kept at 27±1°C with 70–80% RH and a photoperiod of 14: 10 (L: D) h. Insect collection is from our experimental field, and C. medinalis is not endangered or protected species. Treatment of C. medinalis larvae with Cry1C toxin Fifth instar larvae of C. medinalis were selected for the feeding experiments. The larvae were reared on rice leaves dipped with Cry1C activated toxin (MP, Cavey, CWRU, US) solution which resulted in a growth inhibition but did not cause visible death of the larvae for 48 hours. Larvae treated with PBS buffer were used as negative control. All treated larvae were incubated for 48 h at 27±1°C with 70–80% RH and a photoperiod of 14: 10 (L: D) h. Approximately 150 larvae were used for each treatment. RNA isolation, cDNA library construction, transcriptomic sequencing Total RNA was isolated from the midgut of C. medinalis larvae using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) following the manufacturer's protocol. The total RNA quantity and purity were analyzed with Bioanalyzer 2100 and RNA 6000 Nano Lab Chip Kit (Agi-lent, CA, USA) with RIN number >7.0. Approximately 10 μg of total RNA was subjected to isolate Poly (A) mRNA with poly-T oligo attached magnetic beads (Invitrogen). Following purification, the mRNA is fragmented into small pieces using divalent cations under elevated temperature, and the cleaved RNA fragments were reverse-transcribed to create the final cDNA library in accordance with the protocol for the mRNA-Seq sample preparation kit (Illumina, San Diego, USA). The cDNA library was sequenced run with Illumina HiSeqTM2000 sequence platform at LC Sceiences (USA) following the vendor's recommended protocols. Two or three biological replicates for each treatment were prepared. De novo RNA-seq assembly and annotation After sequencing, the raw data were filtered against low-quantity reads and adaptor contamination using Trinity. Clean reads were de novo assembled using Trinity with K-mer = 25 [[59]24]. Annotations of all the unigenes were performed by a BLASTx search against the Nr, Swiss-Prot, KOG, KEGG databases and Pfam with a cut-off E-value of ≤1e-5. We obtained the Gene Ontology terms ([60]http://www.geneontology.org) of each C. medinalis unigene with the software Blast2GO ([61]http://www.blast2go.org) using the default parameters [[62]25]. Gene expression level was normalized by RSEM-based lgorithm to get RPKM value [[63]26–[64]27]. Based on the expression levels, thresholds of FDR<0.05 and log2 fold-change (log2FC) ≥1 were set for identifying significant differential expressed unigenes (DEGs) between control and Cry1C treated larvae. Enrichment analysis All the DEGs were mapped to terms in KEGG databases to identify significantly enriched metabolic pathway or signal transduction pathways in DEGs. Pathway enrichment analysis provides all terms that significantly enriched in differentially expressed genes in comparison to the control C. medinalis. The P-value was calculated for each pathway by Fisher's exact test with a Q value threshold<0.05. Quantitative real-time PCR (qRT-PCR) validation To conform the data, a subset of DEGs was validated by qRT-PCR. qRT-PCR was performed in 20 μL reaction mixtures composed of 2.5 μL of template cDNA,10 μL of 2×SYBR Green PCR Master Mix (Fermentas,Waltham, MA, USA), and 0.25 mM each primer on the CFX96 Real-Time System (Bio-Rad, Hercules, CA, USA). The selected genes were verified with the following cycling conditions: 95°C for 30 s, followed by 40 cycles of 95°C for 30 s, 60°C for 35 s. The melting curve analysis was used to analyze the specificity of the qPCR product. The sequences of the primers used are listed in [65]S1 Table. A ubiquinol-cytochrome c reductase served as an internal control. The relative gene expression values were calculated using the 2^−ΔΔCt method [[66]28]. Results Assembly and annotation of the C. medinalis transcriptome The transcriptomes of the midgut of C. medinalis larvae treated with and without Cry1C protein were sequenced and compared for each treatment. A total of 37.33G bases of clean reads were obtained from the cDNA libraries, and 35,586 high-quality unigenes and 65,016 transcripts were assembled ([67]Table 1). The average gene and transcript length were 784 and 895 base pair (bp), respectively ([68]Table 1). Mean GC% of gene and transcript were 40.66% and 39.18%, respectively. N50 (the shortest sequence length at 50% of the transcriptome) of gene and transcript were 1,211 and 1,378 bp, respectively. The size distributions of all the unigenes studied were as follows: 18,720 unigenes (17.8%) exhibited lengths of more than 2,000 bp; 17,474 unigenes (16.7%) showed lengths between 500 and 1000 bp; 26,983 unigenes (25.7%) demonstrated lengths between 300 and 500 bp; and 41,789 unigenes (39.8%) were shorter than 300 bp ([69]S1 Fig). The size distributions of all the transcripts studied were as follows: 18,720 unigenes (17.8%) exhibited lengths of more than 2,000 bp; 17,474 unigenes (16.7%) showed lengths between 500 and 1,000 bp; 26,983 unigenes (25.7%) demonstrated lengths between 300 and 500 bp; and 41,789 unigenes (39.8%) were shorter than 300 bp ([70]S2 Fig). Table 1. Summary statistics for the transcriptome of larval midgut of Cnaphalocrocis medinalis. Statistics gene transcript Total number 35,586 65,016 Total assembled bases 27,902,176 58,253,881 Min length (bp) 201 201 Max length (bp) 17,428 17,428 length > 2 kb 2,725 6,530 Average length (bp) 784 895 Median length (bp) 471 572 N50 length (bp) 1,211 1,378 Mean (G + C)s (%) 40.66 39.18 Median (G + C)s (%) 38.60 36.60 [71]Open in a new tab To annotate these unigenes, All unigene sequences were against the protein databases (Nr, SwissProt, KEGG, COG, Pfam, GO) using BLASTx (E-value