Abstract Transcriptome analysis has been used to investigate many economically traits in chickens; however, alternative splicing still lacks a systematic method of study that is able to promote proteome diversity, and fine-tune expression dynamics. Hybridization has been widely utilized in chicken breeding due to the resulting heterosis, but the dynamic changes in alternative splicing during this process are significant yet unclear. In this study, we performed a reciprocal crossing experiment involving the White Leghorn and Cornish Game chicken breeds which exhibit major differences in body size and reproductive traits, and conducted RNA sequencing of the brain, muscle, and liver tissues to identify the inheritance patterns. A total of 40 515 and 42 612 events were respectively detected in the brain and muscle tissues, with 39 843 observed in the liver; 2807, 4242, and 4538 events significantly different between two breeds were identified in the brain, muscle, and liver tissues, respectively. The hierarchical cluster of tissues from different tissues from all crosses, based on the alternative splicing profiles, suggests high tissue and strain specificity. Furthermore, a comparison between parental strains and hybrid crosses indicated that over one third of alternative splicing genes showed conserved patterns in all three tissues, while the second prevalent pattern was non-additive, which included both dominant and transgressive patterns; this meant that the dominant pattern plays a more important role than suppression. Our study provides an overview of the inheritance patterns of alternative splicing in layer and broiler chickens, to better understand post-transcriptional regulation during hybridization. Keywords: chicken, hybridization, RNA-seq, dominant, alternative splicing, inheritance pattern Introduction Splicing of pre-mRNA is a crucial post-transcriptional process that increases proteome diversity in eukaryotes. Alternative splicing (AS) generates multiple isoforms from a single gene using different combinations of exons. AS is a widespread and complex component of gene regulation in humans and domestic animals, and increasing evidence suggests that aberrant AS functionality can be the cause or consequence of many diseases, and may also associating with economically important traits in domestic animals ([30]Pan et al., 2008; [31]Merkin et al., 2012; [32]Gao et al., 2018; [33]Dlamini et al., 2021). Therefore, splice-altering therapies using animal models have been extensively studied for many diseases such as neurodegeneration and muscular dystrophies, and AS events have also emerged as new biomarkers in some circumstances ([34]Montes et al., 2019; [35]Zhao, 2019). Changes in AS are regulated by the interactions between cis- and trans-acting elements, and studies in Camellia and Drosophila suggest that parental genetic divergence may affect the regulation patterns in hybrids due to these interactions ([36]Coolon et al., 2014; [37]Zhang et al., 2019). Therefore, it is important to study the AS regulatory mechanisms in birds. Based on different combinations of the constitutive and alternative exons, AS is divided into seven types: exon skipping (SE), intron retention (RI), alternative 5′ splice sites (A5SS), alternative 3′ splice sites (A3SS), mutually exclusive exons (MXE), alternative promoters (AFEs and ALEs), and alternative polyadenylation (tandem 3′UTRs). SE is the most prevalent AS event in approximately 40% of higher eukaryotes, and commonly generates functional isoforms, while RI is the dominant type in plants ([38]Barbazuk et al., 2008; [39]Weatheritt et al., 2016; [40]Cardoso et al., 2018; [41]Chen S.-Y. et al., 2019). With the rapid development of sequencing technology, a broad range of bioinformatics approaches can identify and classify AS events using isoform-based and count-based methods, of which several tools perform robustly and exhibit excellent overall performance ([42]Mehmood et al., 2019). However, there is a lack of systematic analysis of AS in chickens, and it is necessary to study the components and divergence patterns of splicing events, providing an alternate view of transcriptome plasticity. Hybridization is ubiquitous in nature―involving more than 25 and 10% of plants and animals, respectively―and widely utilized in breeding programs ([43]Whitney et al., 2010). Some hybrids show enhanced environmental adaption and growth rate, whereas others are infertile or exhibit negative economic traits ([44]Abasht and Lamont, 2007; [45]Chen, 2010; [46]Chen et al., 2013; [47]Zhang et al., 2015; [48]Clasen et al., 2017). Hybridizing two different strains can remodel the parental gene patterns, with the genes in hybrids diverging from the mid-parental value, leading to “transcriptome shock” ([49]Hegarty et al., 2006; [50]Han et al., 2014; Cui et al., 2015). These genes mainly contribute to some transgressive phenotypes called over- and under-dominant genetic patterns ([51]Zhuang and Tripp, 2017). Additive and dominant patterns also represent phenotypical variations, while conserved patterns show parental similarity. To take advantage of genome-wide methods for expression analysis, the classic hypotheses of inheritance, dominance, over-dominance, and epistasis should have more contributions at the molecular level to explore the mechanism of heredity ([52]Shull, 1908; [53]Bateson, 1910; [54]Jones, 1917; [55]Mcmanus et al., 2010). However, the identified predominant genetic patterns regulating phenotype divergence are not always consistent among studies because of different genetic backgrounds, species, and tissues employed in those studies. Studies on Camellia and coffee have indicated that the non-additive expression prevailed over other patterns ([56]Marie-Christine et al., 2015; [57]Zhang et al., 2019). On the other hand, several studies have suggested that additivity is the predominant genetic pattern in maize, rice, and cotton ([58]Swanson-Wagner et al., 2006; [59]Li et al., 2008; [60]Rapp et al., 2009), while divergent outcomes suggest that additive or high parent-dominance is the major pattern in chickens ([61]Mai et al., 2019; [62]Zhuo et al., 2019). Most previous studies have identified different inheritance patterns based on gene expression, and there is rarely a transcriptomic study that investigates AS event patterns, considering the association between AS and gene regulation at the post-transcriptional level. In this study, Cornish Game (CG) and White Leghorn (WL), representing broilers and layers, respectively, were used as the parental strains to produce purebred and reciprocal crossed progenies. Taking advantage of RNA-sequencing (RNA-seq), tissue samples from the brain, liver, and breast muscle were collected and sequenced. Splicing events from each sample were identified, classified, and quantified, and events significantly different between purebreds were detected for further study. Finally, five main types of inheritance patterns―conserved, additive, parental-enhancing/suppressing, dominant, and transgressive―were categorized, and compared between purebred strains and hybrid crosses. Changes in alternatively spliced genes indicated that the diverse AS inheritance patterns have different influences on heredity, and variation during hybridization. AS is an effective and novel approach for investigating genetic patterns, and understanding the molecular mechanisms of heterosis. Materials and Methods Sample Collection and RNA Sequencing We used CG, a broiler breed with superior growth and muscle development, and WL, a layer breed with high egg production, acquired from the National Engineering Laboratory for Animal Breeding of the China Agricultural University. The four breeding patterns used in this study resulted in pure-bred progeny, CG, and WL, representing the first generation (F1) with parents of the same type, and reciprocal cross progeny WL ♂ × CG ♀ (LC), and CG ♂ × WL ♀ (CL) representing F1 hybrids ([63]Supplementary Figure S1). Each group had six offspring (three female and three male), except CL where only two females were obtained. We collected the brain, breast muscle, and liver from 23 one-day-old chicks, and extracted total RNA from the tissue samples using Trizol reagent (Invitrogen, Carlsbad, CA, United States). The DNA and RNA quality was assessed using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific Inc., United States) and agarose gel electrophoresis. After synthesizing cDNA, PCR amplifications, and library construction, total RNA was sequenced, using paired-end 100-bp reads with a 300-bp insert, on an Illumina HiSeq 2500 platform (Illumina Inc., San Diego, CA, United States) using standard Illumina RNA-seq protocols. RNA-Sequencing Data Alignment and Alternative Splicing Analysis The RNA-seq data were aligned to the chicken reference genome (Gallus_gallus-6.0) using STAR v2.7.5 ([64]Dobin et al., 2012). Duplicate reads were removed to eliminate potential bias, using SAMtools ([65]Li, 2009; [66]Dozmorov et al., 2015). Putative AS events were detected and annotated from aligned RNA-seq data using rMATS v4.1.0 ([67]Wang et al., 2017). Five major types of AS events were identified: A3SS, A5SS, RI, MXE, and SE. To quantify and compare event variation, the percent spliced-in (PSI) value of each AS was calculated for each sample using reads on target and junction counts, where PSI was equal to, the number of reads specific to exon inclusion isoform divided by the sum of reads specific to exon inclusion and exclusion isoforms. Moreover, a hierarchical model for paired replicates and false discovery rate (FDR) correction (FDR < 0.05) was used to determine the statistical differences between the parental strains ([68]Shen et al., 2012). Only the events occurring on autosomes were considered in this study because of incomplete dosage compensation in the chicken sex chromosome. Additionally, if the sum of inclusion and skipping read counts is less than 10 (average in replicate samples), AS was considered low quality and then filtered. Liver samples from the hybrid females were removed because the detection process for putative AS events provided abnormal results. Classification of Alternative Splicing Inheritance Patterns To measure differences between parents and hybrids in order to identify inheritance patterns of AS, samples were recombined as male cross (MC: CG ♂, WL ♂, CL ♂), female cross (FC: CG ♀, WL♀, CL♀), male reciprocal cross (MR: CG ♂, WL ♂, LC ♂), and female reciprocal cross (FR: CG♀, WL♀, LC♀). First, shared AS was determined by merging expressed events in hybrids with significantly different (FDR < 0.05) events in pure-bred, and calculating the average PSI value for biological replicates. A 1.25-fold threshold was set as the criterion for classification as conserved or non-conserved splicing ([69]Gu et al., 2020). Non-conserved AS was classified into eight types: events for which quantification in the hybrid was less than in CG and greater than in WL (or vice versa) was defined as additive (2 types); splicing for which quantification in the hybrid was remarkably higher or lower than in one of the parents and similar to that in another was categorized as dominant (4 types); and splicing for which quantification in the hybrid was significantly greater or lesser than in both parents was classified as transgressive inheritance (two types). Non-conserved genes which were identified in the same mode in more than two groups in a tissue, would be considered as exhibiting that inheritance mode in the corresponding tissue. All eight AS types are listed in [70]Supplementary Figure S1. Exploring the biological function of these genes further, Gene Ontology (GO) classification and enrichment analysis were performed using PANTHER v14 ([71]Thomas et al., 2003), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was carried out using the KOBAS v3.0 ([72]Bu et al., 2021). Statistical Test of Inheritance Patterns R (v4.0.2) was used for most of the statistical analyses in this study. Principal component analysis was performed using Prcomp for statistically different AS between the parental lines. Besides, the Venn diagram was visualized by the “VennDiagram” package, and heat-map was prepared using “pheatmap,” with hierarchical clustering among samples performed by “hclust.” After classifying inheritance patterns, the Kruskal–Wallis test was carried out to identify differences among the three tissues, while the Mann–Whitney test was used to compare divergence between CG/WL-like dominant, up/down-regulation dominant, and over/under-dominant cases in each tissue. Results Alternative Splicing Divergence Between Cornish Game and White Leghorn We collected RNA-seq data from brain, liver, and breast muscle tissues of two inbred chicken strains, CG, and WL, representing parental lines, as well as their reciprocal crossed progenies. There was 246.3 Gb of RNA-seq data and 3.6 million mapped reads for each sample. After filtering low-quality reads using the NGS QC Toolkit v2.3 ([73]Patel et al., 2012) and mapping them onto the reference genome, we obtained, on average, 22.8, 21.3, and 17.8 million mapped reads per individual for the brain, muscle, and liver tissues, respectively. AS events were quantified as PSI and classified into five types―A3SS, A5SS, MXE, RI, and SE―while statistically significant differences between WL and CG were identified. The number of putative splicing events was 45668, 47983, and 46313 in the brain, muscle, and liver tissues, respectively, most of which were expressed (86%), and related to more than 7000 genes ([74]Figure 1). SE formed a large proportion of splicing in the brain and muscle tissues (approximately 36%), A3SS was slightly higher than SE in the liver tissue, and RI only accounted for 3% of the tissues. 28, 38, and 44% of genes only underwent one splicing (simple event), and over 56% of events among the tissues were complex events ([75]Figure 2A). Most of the events were primarily distributed on chromosome-1 and numbered over 5200, while those located on chromosome-30 were less than 50 in number. In addition, over 6000 event locations were positioned on chromosome-4 in the liver and over 2700 in chromosome-7 in muscle tissues ([76]Figure 2B). On the other hand, several genes participated in more than two types of events; 100 genes covered five types, with approximately 60% related to multiple types of events, among the three tissues ([77]Figure 3). To summarize, locations of alternatively spliced events widely exist in tissues, and have a complex correlation with genes. FIGURE 1. [78]FIGURE 1 [79]Open in a new tab Overview of alternative splicing in Cornish and White Leghorn. FIGURE 2. [80]FIGURE 2 [81]Open in a new tab Basic information of alternative splicing events in each tissue. (A) Complexity of AS events per gene. (B) The distribution of AS events in the chicken genome. The distribution of 45668, 47983, and 46313 putative AS events identified from brain, muscle and liver on the chicken autosomes are shown. Most of the events distributed primarily on chromosomes 1 and 2. FIGURE 3. [82]FIGURE 3 [83]Open in a new tab Venn diagrams illustrating statistical results of different types AS in tissue and divergence AS among tissues. From among 40000 expressed AS, we used a hierarchical model to detect 2807, 4242, and 4538 (FDR < 0.05) events as significantly different between the two strains, in the brain, muscle, and liver tissues, respectively. These spliced events related to 1823, 2020, and 1568 genes, approximately half of which could be tissue-specific. In addition, the number of up-regulated splicing events was 3.4–5.5% (1387, 2038, and 2188), while down-regulated events accounted for 3.5–5.9% (1420, 2204, and 2350), between layers and broilers. Based on PSI values of divergence events in each sample, principal component analysis was performed ([84]Figure 4); the tissue was significantly influenced by AS, and the strain probably played an important role in the liver and brain. The parent-of-origin effect might have influenced muscle formation because hybrids were observed to be slightly closer to the maternal group, while it showed insignificant influence on sex determination of the offspring. Further, hierarchical clustering using Spearman correlation―where the samples were classified into three tissue-based clusters and purebred were always categorized in the same group―showed that tissue- and strain-based clustering tended to be stronger than sex-based clustering ([85]Figure 5). FIGURE 4. [86]FIGURE 4 [87]Open in a new tab Principal Component Analysis of alternative splicing. PCA analysis is performed using PSI value for significantly different (FDR < 0.05) AS events between purebreds, which events located on the sex chromosomes has been excluded. FIGURE 5. [88]FIGURE 5 [89]Open in a new tab Correlations and hierarchical clustering of alternative splicing. (A) Spearman correlation based on alternative splicing (PSI) expressed in brain, liver, and muscle. (B) Hierarchical cluster for AS events in each group. Classification of Alternative Splicing Inheritance Patterns After filtering and removing sex chromosome splicing, AS between parental strains was merged with expressed splicing in hybrids; 754 splicing events in the liver were used for further analysis as against 1030 and 1950 AS events in the brain and muscle because only liver samples from male hybrids were available. Based on the difference in expression between parental and hybrid splicing, events were categorized into nine types in all four groups (MC, MR, FC, and FR) ([90]Figure 6). FIGURE 6. [91]FIGURE 6 [92]Open in a new tab Scatter-plot and pie representing different inheritance patterns. inheritance patterns identified from reciprocal crosses and parental lines brain, muscle, and liver tissues were classified into nine clusters listed at the bottom of the images, respectively. Scatterplots represent relationships between F1 hybrids and their two parents, and pie charts show the relative proportions of these patterns for each cluster using different colors. Based on statistical results ([93]Table 1), we detected a significant difference in the sum of conserved, additive, WL-like dominant, and enhancing/suppressing dominant patterns among the three tissues (Kruskal–Wallis test, p-value ≈ 0.02 < 0.05), whereas there was no divergence in the number of CG-like dominant (p-value = 0.51) and transgressive patterns (p = 0.11). Conserved splicing was predominant, with above 34% in all three tissue types, while additive splicing accounted for a small proportion of non-conserved patterns, with an average of 8.7, 9.3, and 8.0% in the brain, muscle, and liver tissues, respectively. A greater proportion of events between the hybrids and their parents exhibited a non-additively expressed pattern, in which the sum of parental dominance was approximately 30%, and transgressivity accounted for approximately 22.7, 11.6, and 22.5% in the brain, muscle, and liver tissues, respectively. Therefore, up-regulated dominance was higher than down-regulated, in muscle and brain tissues (Mann–Whitney test, p-value = 0.03 < 0.05). Interestingly, the relative proportion of CG-like dominance was larger in brain tissues of the female groups and liver tissues of the male groups, while WL-like dominance was greater in the brain and muscle tissues of the male groups. No differences were detected between over-dominance and under-dominance in each tissue type. To check whether our conclusion was sensitive to specific statistical methods, we used Fisher’s exact test to identify events with significant divergence between hybrids and parents ([94]Supplementary Table S1), with the resulting p-values controlled for an FDR (FDR < 0.05). Overall, we drew the same conclusion that most non-conserved events were dominant, with transgressive modes during hybridization, while a few events displayed additive patterns among the tissues. TABLE 1. Summary statistics for splicing patterns in reciprocal crosses among tissues. Tissue Group Conser vative Additivity CG-like dominant WL-like dominant Transgressivity CG>WL CG