Abstract

   Genome wide association analyses in diverse populations can identify
   complex trait loci that are specifically present in one population or
   shared across multiple populations, which help to better understand the
   genetic architecture of complex traits in a broader genetic context. In
   this study, we conducted genome-wide association studies and
   meta-analysis for 38 fatty acid composition traits with 12–19 million
   imputed genome sequence SNPs in 2446 pigs from six populations,
   encompassing White Duroc × Erhualian F[2], Sutai,
   Duroc-Landrace-Yorkshire (DLY) three-way cross, Laiwu, Erhualian, and
   Bamaxiang pigs that were originally genotyped with 60 K or 1.4 million
   single nucleotide polymorphism (SNP) chips. The analyses uncovered 285
   lead SNPs (P < 5 × 10^-8), among which 78 locate more than 1 Mb to the
   lead chip SNPs were considered as novel, largely augmented the
   landscape of loci for porcine muscle fatty acid composition.
   Meta-analysis enhanced the association significance at loci near FADS2,
   ABCD2, ELOVL5, ELOVL6, ELOVL7, SCD, and THRSP genes, suggesting
   possible existence of population shared mutations underlying these
   loci. Further haplotype analysis at SCD loci identified a shared 3.7 kb
   haplotype in F[2], Sutai and DLY pigs showing consistent effects of
   decreasing C18:0 contents in the three populations. In contrast, at
   FASN loci, we found an Erhualian specific haplotype explaining the
   population specific association signals in Erhualian pigs. This study
   refines our understanding on landscape of loci and candidate genes for
   fatty acid composition traits of pigs.

   Keywords: genome-wide association studies, sequence imputation,
   meta-analysis, fatty acid composition, pig

Introduction

   Fatty acid composition affects nutritional value and palatability of
   meat ([41]Wood et al., 2004). The predominant saturated fatty acids
   (SFA) in pork are C14:0, C16:0 and C18:0, which together account for
   about 38% of total fatty acids. From nutritional quality, replacement
   of the dietary saturated fatty acids (SFA) with monounsaturated fatty
   acids (MUFA) and polyunsaturated fatty acids (PUFA) are beneficial for
   human health by lowering low-density lipoprotein cholesterol ([42]FAO,
   2010; [43]Sacks et al., 2017), which are risk factors for
   cardiovascular disease ([44]Katan et al., 1994). Moreover, in term of
   palatability of meat, the saturated fatty acids are associated with
   firmness of meat fat, while content of monounsaturated fatty acids,
   mainly C18:1n-9 and C16:1n-7 is positively correlated with overall
   acceptability of meat ([45]Cameron et al., 2000). Therefore, a
   reasonable goal to improve fat quality of pork is to reduce the content
   of C14:0 and C16:0, and simultaneously increase the percentage of
   monounsaturated fatty acids, which would produce pork with better
   nutritional and palatable quality.

   Genome-wide association studies (GWAS) based on SNP arrays have
   successfully identified numerous genomic regions associated with fatty
   acid composition traits in different pig populations ([46]Ramayo-Caldas
   et al., 2012; [47]Yang et al., 2013; [48]Ros-Freixedes et al., 2016;
   [49]Zhang et al., 2016; [50]Zhang et al., 2017). Due to the low density
   of porcine SNP chips and long range linkage disequilibrium among
   markers, most of the identified loci have large confidence intervals,
   which require further fine mapping studies to pinpoint the causal
   variants. Increasing the marker density and combined analyses on
   multiple populations can potentially provide a much higher mapping
   resolution for the trait associated loci ([51]Pausch et al., 2017;
   [52]Bouwman et al., 2018). Sequencing thousands of individuals can be
   expensive, a cost-effective approach of increasing marker density is
   through genotype imputation from a reference population with
   whole-genome sequences ([53]Druet et al., 2014).

   In this study, by referring genome sequences from a panel of 396
   individuals, we imputed 34 million SNPs to 2446 pigs from six pig
   populations that were previously genotyped with 60 K or 1.4 million SNP
   chips, and conducted genome-wide association and meta-analysis on 38
   fatty acid composition traits. The analyses helped to identify new loci
   and enhanced the association significance of loci detected by chip
   SNPs. We further investigated the population shared and specific
   association signatures at SCD and FASN gene loci, respectively, through
   haplotype effect and phylogenetic tree analyses. Finally, we performed
   functional annotation of the lead SNPs with Ensembl tool Variant Effect
   Predictor (VEP) ([54]McLaren et al., 2010), and based on ChIP-seq peak
   data of trimethylation at lysine 4 of histone 3 (H3K4me3) and
   acetylation at lysine 27 of histone 3 (H3K27ac) from 3 porcine liver
   samples ([55]Villar et al., 2015). In addition, we investigated
   functional protein–protein interactions (PPI) encoded by candidate
   genes using the STRING Genomics 10.5 database of PPI network
   ([56]Szklarczyk et al., 2017). These analyses provided useful
   information on the genetic architecture and biological pathway
   underlying the variation in muscle fatty acid composition in pigs.

Materials and Methods

Ethics Statements

   All the experiments that involved animals were carried out in
   accordance with the approved guidelines by the Ministry of Agriculture
   and Rural Affairs of China. The ethics committee of Jiangxi
   Agricultural University approved the animal experiments in this study.

Animals and Phenotypes

   In this study, six pig populations including the White Duroc ×
   Erhualian F[2], Sutai, Duroc-Landrace-Yorkshire (DLY) three-way cross,
   Laiwu, Erhualian, and Bamaxiang pigs were investigated. The detailed
   information on the six populations has been described ([57]Yang et al.,
   2013; [58]Liu et al., 2015; [59]Xiong et al., 2015; [60]Zhang et al.,
   2016; [61]Zhang et al., 2017). Briefly, the F[2] cross was generated by
   mating two white Duroc boars and 17 Erhualian sows to produce F[1]
   animals, nine F[1] boars, and 59 F[1] sows were intercrossed to produce
   976 F[2] males and 945 F[2] females in six batches. The Sutai pig is a
   Chinese synthetic breed that is generated by crossing Duroc boars and
   Taihu sows, and has been selected for prolificacy and growth for more
   than 18 generations. The Erhualian, Laiwu and Bamaxiang pigs are
   Chinese indigenous breeds. Erhualian is well known for its high
   prolificacy. Laiwu is famous for its high intramuscular fat content
   (average 9∼12%). Bamaxiang has features of two end black coat color and
   small body size. A total of 385 Erhualian and 390 Laiwu pigs at ages of
   about 90 days were purchased from Changzhou city in Jiangsu Province
   and Laiwu city in Shandong Province, respectively. Bamaxiang pigs were
   purchased from Bama County in Guangxi Province at ages of about 60
   days. A total of 698 DLY pigs at ages of 180 ± 3 days were bought from
   a commercial pig farm from Xiushui city in Jiangxi Province. DLY boars
   were castrated at ages of about 25 days, and all DLY pigs were fed a
   corn-soybean diet containing 16% crude protein, 3132 digestible energy
   and 0.85% lysine. In the F[2], Sutai, Erhualian, Laiwu and Bamaxiang
   populations, all piglets were weaned at day 46, males were castrated at
   day 90. All fattening pigs were raised in consistent indoor condition
   and fed with corn-soybean based diet containing 16% crude protein, 3100
   kJ digestible energy and 0.78 % lysine under standard management. F[2]
   and Sutai pigs at 240 ± 5 days, DLY pigs at 180 ± 5 days, and
   Erhualian, Laiwu and Bamaxiang pigs at 300 ± 5 days were slaughtered in
   the commercial abattoir.

   Fatty acid compositions in longissimus dorsi muscle were measured in
   591 F[2], 296 Sutai, 608 DLY, 305 Laiwu, 331 Erhualian and 315
   Bamaxiang pigs according to methods as described previously ([62]Folch
   et al., 1957). We also calculated the ratios of different pairs of
   fatty acids that reflect the activity of corresponding fatty acids
   elongase and desaturase using the equations described before
   ([63]Ulbricht and Southgate, 1991; [64]Pamplona et al., 1998). A total
   of 38 fatty acid composition traits were obtained for subsequent
   analysis ([65]Supplementary Table 1).

Genotypes

   Genomic DNA of the animals was extracted from ear biopsies using a
   classic phenol/chloroform method. A total of 1020 F[2] and 524 Sutai
   individuals were genotyped with 62,163 SNPs using PorcineSNP60 v1
   BeadChip, 610 DLY, 331 Erhualian, and 319 Laiwu pigs were genotyped for
   61,565 SNPs using PorcineSNP60 v2 BeadChip ([66]Ramos et al., 2009),
   and 307 Erhualian and 318 Bamaxiang pigs were genotyped for 1,348,804
   SNPs with 1.4 M Affymetrix Axiom SNP chip. In each population,
   individuals with genotype call rate higher than 90%, SNPs with call
   rate greater than 90% and the minor allele frequency (MAF) higher than
   5% were kept for further analyses. All quality control procedures were
   performed using PLINK program ([67]Purcell et al., 2007).

   A number of 396 sequenced individuals were used as the reference
   population for genotype imputation. The 396 pigs consist of 221 pigs
   from 20 Chinese indigenous breeds/populations, 17 Asian wild boars, 140
   international commercial pigs, majority of which are Duroc, Large
   White, Landrace and Pietrain pigs, and 18 European wild boars
   ([68]Supplementary Table 2). Most of the sequencing was conducted by
   Illumina HiSeq or Xten platforms using 100-150 bp paired-end libraries.
   The average depth of sequence data is 20.8×, ranged from 4.2× to 45.5×.
   The sequence data of 232 out of the 396 individuals was generated by
   our lab, the rest was downloaded from public domain ([69]Groenen et
   al., 2012; [70]Li et al., 2013; [71]Moon et al., 2015). Among the 396
   pigs, two White Duroc and 17 Erhualian pigs are founders of the white
   Duroc × Erhualian F[2] population, and 10 Meishan, 12 Bamaxiang, 15
   Laiwu, 32 Duroc, 24 Landrace, and 71 Large White have similar or the
   same ancestry to individuals from six populations investigated in this
   study.

   For most of the sequenced individuals, the raw sequence reads which
   contains > 50% of base-pairs with base quality score <20 or have >10%
   of base-pairs are no calls (Ns) were removed. The clean reads of each
   individual were aligned to Sus scrofa reference genome assembly 10.2
   using BWA ([72]Li and Durbin, 2009). The mapped reads were sorted using
   SAMtools ([73]Li et al., 2009), and processed with Indel realignment
   and duplicate marking in Picard
   ([74]http://broadinstitute.github.io/picard/). Then, GVCF files were
   generated from each BAM file using Haplotypecaller in GATK by filtering
   BadCigar and BadMate reads, and combined using CombineGVCFs in GATK.
   The SNPs were called from combined GVCF files using GenotypeGVCFs
   function in GATK with options of stand_call_conf 30.0 and
   stand_emit_conf 30.0, and filtered using VariantFiltration with option
   of QD < 2.0 || FS > 60.0 || MQ < 40.0 || HaplotypeScore > 13.0 ||
   MappingQualityRankSum < -12.5 || ReadPosRankSum < -8.0 according to the
   best practice guidelines of GATK ([75]McKenna et al., 2010). Finally, a
   total of 34 million SNPs were called. Haplotypes of the 396 reference
   animals were constructed using Beagle version 4.0 ([76]Browning and
   Browning, 2007).

Genotype Imputation

   The 60 K and 1.4 M chip SNPs that have reverse strand compared to
   genome sequencing SNPs were flipped using PLINK program ([77]Purcell et
   al., 2007). Then, 34 million SNPs in the 396 sequenced individuals were
   imputed to the 2446 pigs that were genotyped for 60 K or 1.4 million
   SNPs using the Beagle v4. After quality control, 14,009,920,
   13,686,661, 12,655,741, 13,176,546, 18,282,745 and 18,824,020 SNPs with
   MAF > 0.05 in 591 F[2], 296 Sutai, 608 DLY, 305 Laiwu, 331 Erhualian
   and 315 Bamaxiang pigs, respectively, were retained for subsequent
   analysis. The accuracy of imputed SNPs were evaluated by Beagle R^2
   values ([78]Browning and Browning, 2007).

Genome-Wide Association Studies and Meta-Analysis

   The association tests were conducted within each population using
   following model implemented in GEMMA program ([79]Zhou and Stephens,
   2012), the same model was also employed to estimate the genomic
   heritability based on the SNP data:
   [MATH: <mrow><mi mathvariant="bold">y</mi><mo>=</mo><mi
   mathvariant="bold">x</mi><mi>β</mi><mo>+</mo><mi
   mathvariant="bold">Zu</mi><mo>+</mo><mi
   mathvariant="bold">e</mi></mrow> :MATH]

   where y is the vector of residual phenotypic values corrected for sex
   and slaughter batch as fixed effects; x is the incidence vector of
   genotypes of a marker, in which three genotypes AA, AB, and BB of a
   locus were coded as 0, 1 and 2, respectively, where B is the minor
   allele. β is the additive allelic effect of the marker; Z is identity
   matrix, u is a vector of random polygenic effects that assumed to
   follow MVN(0, G
   [MATH: <mrow><msubsup><mi>σ</mi><mi>a</mi><mn>2</mn></msubsup></mrow>
   :MATH]
   ), where G represents the genomic relationship matrix calculated from
   whole-genome SNP markers and
   [MATH: <mrow><msubsup><mi>σ</mi><mi>a</mi><mn>2</mn></msubsup></mrow>
   :MATH]
   is the additive genetic variance; and e is the vector of random
   residuals, e ∼ MVN (0, I
   [MATH: <mrow><msubsup><mi>σ</mi><mi>e</mi><mn>2</mn></msubsup></mrow>
   :MATH]
   ), where I is the identity matrix and
   [MATH: <mrow><msubsup><mi>σ</mi><mi>e</mi><mn>2</mn></msubsup></mrow>
   :MATH]
   is the residual variance component. The SNPs that reached a P value
   threshold of 5 × 10^-8 were empirically considered as significant
   ([80]Pe'er et al., 2008; [81]Johnson et al., 2010). We defined the most
   significant SNPs on each chromosome that satisfied P value threshold of
   5 × 10^-8 as lead SNPs for a given trait. The phenotypic variation of a
   trait explained by the lead SNP was calculated by
   (V[reduce]−V[full])/V[reduce], where V[reduce] and V[full] are the
   variance of residuals of ordinary linear model without and with
   genotypes of the lead SNPs in the models. This estimator was very
   similar to those calculated using 2p(1-p)a^2/σ[p]^2, where p is the
   allele frequency, and a is additive allelic effect of a given SNP,
   σ[p]^2 is phenotypic variance of a trait under study. Meta-analysis of
   GWAS results on the six populations was performed using a z-score
   method implemented in METAL software ([82]Willer et al., 2010), which
   calculated and tested a statistics that combined the effects and
   standard errors of each SNPs on a given trait in six populations by
   taking into account sample size and direction of genotype effects in
   each population. As some of 38 traits investigated in this study are
   highly correlated to each other, the lead SNPs for different traits can
   locate closely and in high linkage disequilibrium, we therefore
   combined the lead SNPs that were within 1 Mb to a same genomic region,
   and use the lead SNP with strongest association (measured by –log[10]P
   value) as a sentinel SNP for that region ([83]Table 1).

Table 1.

   Summary of a selection of sentinel lead SNPs for fatty acid composition
   and metabolism traits in six pig populations by chromosome regions.
   Chromosome:
   position Trait Pop Candidate genes p-value MAF
   (minor/major) Var (%)^1 Variant annotation^2
   1:8873447 C18:0 Bamaxiang – 3.02E-08 0.30 (A/G) 11.24 Intron
   2:746298 ACL Erhualian – 2.31E-08 0.07 (G/A) 17.70 Upstream
   2:8929954 C20:3n-6/C18:2n-6 Erhualian FADS2, FADS1 2.90E-22 0.43 (C/T)
   25.05 Intergenic
   2:74664653 C20:3n-6 F[2] PLIN5 1.89E-08 0.06 (G/A) 4.66 Intergenic
   2:115831690 C20:2n-6 DLY – 5.22E-09 0.06 (C/A) 5.20 Intergenic
   2:145804018 FattyAI Bamaxiang – 5.09E-09 0.38 (T/C) 12.50 Intergenic
   3:92010559 C20:1n-9 Erhualian – 9.96E-09 0.33 (T/C) 11.77 Intergenic
   4:63910717 C18:1n-9/C18:0 F[2] – 1.69E-10 0.14 (A/G) 9.49 Intron
   4:86701762 C20:2n-6 F[2] – 6.22E-09 0.42 (G/T) 9.30 Intergenic
   5:73950290 C20:0/C18:0 Sutai ABCD2 2.24E-11 0.25 (A/G) 10.75 Intron
   6:67864540 C20:3n-6 Sutai – 2.52E-08 0.27 (T/G) 7.54 Intergenic
   6:71008471 C16:1n-7/C16:0 Laiwu – 2.96E-08 0.45 (G/T) 5.61 Intergenic
   7:27337970 C18:1n-9/C16:1n-7 Laiwu APOM 4.15E-09 0.35 (C/T) 9.13
   Intergenic
   7:30614484 C18:1n-9/C16:1n-7 Bamaxiang – 3.40E-09 0.12 (C/T) 11.85
   Upstream
   7:34919339 n-3 F[2] – 4.93E-15 0.48 (A/G) 15.04 Intron
   7:52837555 C20:1n-9 Bamaxiang ACSBG1 8.21E-10 0.24 (G/A) 11.69
   Intergenic
   7:58228023 C20:2n-6/C18:2n-6 Laiwu – 1.52E-10 0.35 (C/A) 19.97
   Intergenic
   7:134678195 C20:1n-9/C18:1n-9 Erhualian ELOVL5 1.07E-23 0.22 (G/T)
   34.48 Intergenic
   8:119726275 C18:1n-9/C16:1n-7 DLY ELOVL6 1.90E-18 0.06 (T/C) 13.06
   Intergenic
   8:129588470 ACL DLY MTTP 1.64E-08 0.12 (G/A) 6.35 Intergenic
   8:138708016 C20:4n-6/C20:2n-6 Bamaxiang SNCA 3.14E-08 0.06 (A/G) 9.13
   Intron
   9:11302313 C20:4n-6/C20:3n-6 Erhualian DGAT2 3.13E-08 0.07 (G/A) 7.30
   Intron
   9:13950534 C16:0/C14:0 Laiwu THRSP 2.91E-08 0.10 (T/C) 17.00 Intron
   12:1176481 C18:1n-9/C16:1n-7 Erhualian FASN 2.38E-30 0.07 (T/C) 40.86
   Intergenic
   12:59707351 SFA Laiwu – 1.45E-10 0.33 (T/C) 11.23 Intergenic
   13:24928872 C20:1n-9/C20:0 Sutai ACAA1 1.14E-09 0.16 (C/T) 15.95 Intron
   13:40365857 ACL Bamaxiang ACOX2 4.00E-09 0.38 (T/G) 11.35 Intergenic
   13:165434306 C20:1n-9 Sutai – 3.62E-08 0.07 (A/G) 11.79 Intergenic
   14:121454019 C18:0 DLY SCD 3.14E-33 0.33 (T/G) 28.20 Intergenic
   15:146109641 C20:0 Laiwu – 3.26E-09 0.42 (C/T) 10.01 Intergenic
   16:34617582 C20:1n-9/C20:0 Sutai – 2.45E-08 0.30 (G/A) 16.65 Intergenic
   16:36821647 C20:0/C18:0 Laiwu – 1.80E-22 0.09 (T/G) 35.98 Upstream
   16:41393886 C20:0/C18:0 F[2] ELOVL7 3.94E-44 0.29 (A/G) 36.26
   Intergenic
   16:43497948 C20:0/C18:0 DLY ELOVL7 3.66E-62 0.12 (A/T) 43.50 Intergenic
   16:48557856 C20:1n-9/C20:0 Bamaxiang – 3.10E-11 0.38 (G/A) 17.33
   Intergenic
   X:96131734 C14:0 F[2] – 8.49E-09 0.41 (T/A) 6.91 Intergenic
   X:103467488 FattyAI F[2] – 1.63E-10 0.32 (A/C) 8.04 Intergenic
   [84]Open in a new tab

   Pop, population; MAF, minor allele frequency. ^1Var (%), percentage of
   phenotypic variation explained by the lead SNP for the most significant
   trait. ^2Variant annotation, variant annotation by Variant Effect
   Predictor (VEP) supported by Ensemble; “–”, no candidate gene found.

Haplotype Analysis

   For a given locus, we first extracted the haplotypes of 41 SNPs
   centered on the lead SNP (20 SNP on each side of the lead SNP), and
   then estimated the effects of the haplotypes on respective traits using
   a mixed model similar to that implemented in [85]Nicod et al. (2016)
   ([86]Nicod et al., 2016):
   [MATH: <mrow><mi mathvariant="bold">y</mi><mo>=</mo><mi
   mathvariant="bold">H</mi><mi>β</mi><mo>+</mo><mi
   mathvariant="bold">Zu</mi><mo>+</mo><mi
   mathvariant="bold">e</mi></mrow> :MATH]

   Where H is a n×p incidence matrix for the genotype of haplotypes in the
   populations, n is the sample size and p is number of haplotypes under
   study. β is a vector of haplotype effects with p elements. The y, Z, u
   and e are the same as those in the single SNP GWAS model. We multiply
   both side of the mixed model with inverse matrix of W:
   [MATH: <mrow><msup><mi
   mathvariant="bold">W</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup><mi
   mathvariant="bold">y</mi><mo>=</mo><mrow><mo>(</mo><mrow><msup><mi
   mathvariant="bold">W</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup><mi
   mathvariant="bold">H</mi></mrow><mo>)</mo></mrow><mi
   mathvariant="bold">β</mi><mo>+</mo><msup><mi
   mathvariant="bold">W</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup><mrow>
   <mo>(</mo><mrow><mi mathvariant="bold">Zu</mi><mo>+</mo><mi
   mathvariant="bold">e</mi></mrow><mo>)</mo></mrow></mrow> :MATH]

   Where W is the square root of variance and covariance matrix of the
   phenotypes V, which is calculated by
   [MATH: <mrow><msubsup><mi>σ</mi><mtext>a</mtext><mn>2</mn></msubsup><mi
   mathvariant="bold">G</mi><mo>+</mo><msubsup><mi>σ</mi><mtext>e</mtext><
   mn>2</mn></msubsup><mi mathvariant="bold">I</mi></mrow> :MATH]
   ,
   [MATH:
   <mrow><msubsup><mi>σ</mi><mtext>a</mtext><mn>2</mn></msubsup></mrow>
   :MATH]
   and
   [MATH:
   <mrow><msubsup><mi>σ</mi><mtext>e</mtext><mn>2</mn></msubsup></mrow>
   :MATH]
   were estimated in EMMA ([87]Kang et al., 2008). W is calculated through
   eigenvalue decomposition of V in R program. After the transformation,
   the haplotype effects can be estimated by an ordinary linear model
   implemented in R program. We generated a phylogenetic tree of
   haplotypes using neighbor joining method implemented in MEGA7 program
   ([88]Kumar et al., 2016).

Variants Annotation and Gene Functional Analysis

   The genomic position of all SNP used in GWAS were annotated based on
   Sus scrofa 10.2 assembly ([89]Groenen et al., 2012). To facilitate the
   comparison of our results with those based on Sus scrofa 11.1 assembly,
   we also converted the coordinates of lead SNPs to Sus scrofa 11.1
   assembly using Liftover
   ([90]https://genome.ucsc.edu/cgi-bin/hgLiftOver) ([91]Supplementary
   Table 3). All lead SNPs were annotated using the Variant effect
   predictor (VEP) (Ensembl release 89) ([92]McLaren et al., 2010). We
   searched for functionally plausible candidate genes in regions within
   500 kb of the lead SNPs using UCSC web browser
   ([93]https://genome.ucsc.edu), which provide homologous genes of
   different species in the respective QTL region identified in this
   study. We chose the genes with function relevant to fatty acid or lipid
   metabolism as candidate genes through literature searching. STRING
   v10.5 was utilized to examine the candidate genes in context of
   protein–protein interactions (PPI) network ([94]Szklarczyk et al.,
   2017). Only interactions with a high level of confidence (score >0.4)
   were retained in the global PPI network. ClueGO in Cytoscape was
   employed to implement gene ontology (GO) and the KEGG pathway
   enrichment analysis ([95]Bindea et al., 2009). The enrichment analysis
   was carried out using right-sided hypergeometric test, corrected for
   multiple testing using Benjamini-Hochberg approach. Kappa statistics
   were used to group the enriched terms ([96]Bindea et al., 2009). The
   minimum connectivity of the pathway network (kappa score) was set to
   0.4.

Results

Accuracy of Genotype Imputation

   The average Beagle R^2 values of the SNPs used GWAS are 0.754, 0.756,
   0.741, 0.759, 0.892 and 0.893 for F[2], Sutai, DLY, Laiwu, Erhualian
   and Bamaxiang pigs, respectively. The Beagle R^2 is positively
   associated with minor allele frequency of the SNPs ([97]Supplementary
   Figure 1). Much higher average imputation accuracy was observed in
   Erhualian and Bamaxiang pigs, because these two populations were
   genotyped with 1.4 million SNPs, which have much higher marker density
   compared to the 60 K SNP genotyped for the rest of four populations.

Genome Wide Association Studies Within Each Population

   We estimated genomic heritability (h[g]^2) for the 38 fatty acid
   composition traits in each of the six populations based on the whole
   genome SNPs, the average of h[g]^2 estimates for the 38 traits in the
   six populations is 0.46. A total of 125(54.8%) h[g]^2 estimates were
   between 0.3 and 0.6, and 48 (21.1%) were greater than 0.6. The h[g]^2
   estimated from imputed SNPs were highly correlated with those estimated
   from chip SNPs ([98]Zhang et al., 2019) ([99]Supplementary Figure 2).

   We performed genome wide association studies for 38 fatty acid
   composition traits using the imputed SNPs in each of the six
   populations. We defined the most significant SNPs on each chromosome
   that satisfied P value threshold of 5 × 10^-8 as lead SNPs or QTL for a
   given trait. In total, we identified 131 lead SNPs on 15 porcine
   chromosomes ([100]Supplementary Table 3). The number of lead SNPs for
   each trait range from 0 to 3, and is positively associated with h[g]^2
   (P[spearmancorrelation] = 2 × 10^-12) ([101]Supplementary Figure 3). A
   total of 29 lead SNPs that located at more than 1 Mb from the lead SNPs
   identified by SNP arrays were considered as novel. Additionally, 27
   lead SNPs have at least 2 units of increase (range from 2 to 10) in
   association strength (–log[10]P value) compared to those identified
   based on 60K or 1.4M chips were regarded as enhanced
   ([102]Supplementary Table 3).

   We next show four examples that GWAS based on imputed SNPs improved
   upon the results obtained from chip SNPs. In the F[2] population, we
   previously identified a significant lead SNP (7:134683639, P = 6.80 ×
   10^-14) for C20:1n-9 on SSC7 that explain 21.0% of phenotypic variance
   based chip SNPs. By contrast, association of the imputed SNP
   (7:134527363) achieve a P value of 1.58 × 10^-23 and explain 31.2% of
   phenotypic variance for C20:1n-9 ([103]Figure 1A). Another example is
   that the top associated variant (8:126831850, P = 1.11 × 10^-9) for
   C18:1n-9/C16:1n-7 on chromosome 8 found in 60K chips is less
   significant than, and locate approximately 6 Mb from the top SNP
   (8:120599982, P = 3.99 × 10^-12) detected hereby ([104]Figure 1B).
   Notably, the newly identified SNP locates closer to ELOVL6, the most
   functionally plausible gene in this region, and those SNPs identified
   in DLY and Erhualian populations ([105]Zhang et al., 2016; [106]Zhang
   et al., 2017) ([107]Supplementary Table 3). In Sutai pigs, 5:73950290
   for C20:0/C18:0 (P = 2.24 × 10^-11) on SSC5 identified here shows
   greater association significance than the one (5:70181543, 3.20 ×
   10^-7) identified based on 60 K SNP data ([108]Figure 1C). In Erhualian
   pigs, the lead SNP (2:8929954, P = 2.90 × 10^-22) for C20:3n-6/C18:2n-6
   currently identified was more significant than the loci (2:8929236, P =
   3.48 × 10^-18) previously detected using 1.4 M SNP chip ([109]Figure
   1D). These examples demonstrate GWAS based on the imputed genome
   sequence SNPs helped to refine the signals revealed by 60K SNP data
   ([110]Figure 2, [111]Table 1 and [112]Supplementary Table 3).

Figure 1.

   [113]Figure 1
   [114]Open in a new tab

   Comparison of association strength between imputed and chip SNPs. (A)
   region for C20:1n-9 on SSC7 in F[2] population; (B) region for
   C18:1n-9/C16:1n-7 on SSC8 in F[2] population. (C) region for
   C20:0/C18:0 on SSC5 in Sutai population. (D) region for
   C20:3n-6/C18:2n-6 on SSC2 in Erhualian population. Imputed and chip
   SNPs were denoted in blue and red color, respectively. The lead SNPs
   are marked by triangles.

Figure 2.

   [115]Figure 2
   [116]Open in a new tab

   A combined Manhattan plot for GWAS on fatty acid composition traits
   across six populations. Genome-wide representation of all lead SNPs
   identified single population GWAS, which were marked by a colored dot.
   Results from different populations were represented by different
   colors. The y axis shows the -log[10]p-values for association with
   corresponding fatty acid composition traits and the x axis shows the
   genomic position of genetic variants. Candidate genes are denoted with
   different colors, blue for candidate gene previously identified, and
   red for candidate gene newly found in current study.

Meta-Analysis of Genome-Wide Association Studies

   We performed a meta-analysis on the genome wide association statistics
   from the six populations. A total of 154 lead SNPs (P < 5× 10^-8) were
   identified ([117]Supplementary Table 4). Among these, 49 locates >1 Mb
   from the lead SNPs identified in single population GWAS. Moreover, we
   observed 5 lead SNPs displaying more than 5 units of –log[10]P values
   enhancement upon those of lead SNPs identified in a single population
   ([118]Figure 3, [119]Supplementary Table 3, and [120]Supplementary
   Table 4). These include 7:134556509 for C20:1n-9/C18:1n-9 near ELOVL5
   gene (P = 9.62 × 10^-46 vs. P = 1.07 × 10^-23 at 7: 134678195 in
   Erhualian pigs) ([121]Figure 3A), 8:119942096 for C18:1n-9/C16:1n-7
   near ELOVL6 gene (P = 8.51 × 10^-24 vs. P = 1.90 × 10^-18 at
   8:119726275 in DLY pigs) ([122]Figure 3B), 14:121398370 for C18:0 near
   SCD gene (P = 7.34 × 10^-50 vs. P = 3.14 × 10^-33 at 14:121454019 in
   DLY pigs) ([123]Figure 3C) and 16:43507850 for C20:0/C18:0 near ELOVL7
   gene (P = 3.80 × 10^-95 vs. P = 3.66 × 10^-62 at 16:43497948 in DLY
   pigs) ([124]Figure 3D). The other loci that were enhanced in
   meta-analyses located near FADS2, ABCD2, and THRSP genes
   ([125]Supplementary Table 4).

Figure 3.

   [126]Figure 3
   [127]Open in a new tab

   Comparison of association strength between signal-population GWAS and
   Meta-analysis. (A) Associations of SNP in a 4 Mb region on SSC7 for
   C20:1n-9/C18:1n-9 in Erhualian pigs (blue dots) versus GWAS
   meta-analysis (red dots). (B) Associations of SNP in an 8 Mb region on
   SSC8 for C18:1n-9/C16:1n-7 in DLY pigs (blue dots) versus GWAS
   meta-analysis (red dots). (C) Associations of SNP in an 8 Mb region on
   SSC14 for C18:0 in DLY pigs (blue dots) versus GWAS meta-analysis (red
   dots). (D) Associations of SNP in an 8 Mb region on SSC16 for
   C20:0/C18:0 in DLY pigs (blue dots) versus GWAS meta-analysis (red
   dots). The lead SNPs were marked with gray diamond.

Haplotype Analyses of Population-Shared and Specific Loci

   Next, we carried out haplotype analyses on two QTL regions near SCD and
   FASN genes based on following considerations: (1) both regions are
   strongly associated with multiple primary saturated and
   mono-unsaturated fatty acids that are closely related to fat quality of
   meat; (2) both SCD and FASN genes have direct functional relevance to
   the fatty acid composition traits. (3) Meta-analysis largely enhanced
   association significance of SCD loci but not FASN loci, it is of
   interest to investigate the effects and phylogeny of haplotypes in
   these two regions in the six populations to further clarify the
   phenomenon.

   Near SCD gene, we identified lead SNPs within a 1.83 Mb region (between
   120.10 and 121.93 Mb) that had significant effects on multiple fatty
   acid traits (C16:0, C16:1n-7, C18:0 and C18:1n-9) in three populations
   (F[2], Sutai, and DLY). Within this region, the most significant
   association was found between 14:121454019 and C18:0 in DLY pigs
   ([128]Figure 4A). We empirically investigated the effects of 41 SNP
   haplotypes centered on 14:121454019 in F[2], Sutai, and DLY pigs
   (Materials and Methods). The [129]Figure 4B showed estimated effects of
   haplotypes with frequency >0.05 in each of the three populations.
   Notably, we observed that Hap3 in F[2] pigs (P = 1.5×10^-4), Hap4 in
   Sutai pigs (P = 7.6×10^-8), and Hap2, Hap3 and Hap4 in DLY pigs (P =
   2.2×10^-16) with significant effect of decreasing C18:0 content in
   three populations share an 3.7-kb segment (121,450,788-121,454,457)
   ([130]Figures 4B–D), these analyses suggested that the coincidence of
   the association signals in the three populations could be brought about
   by a same causal mutation located in or linked to the shared 3.7 kb
   haplotype. One exception is for Hap2 in Sutai pigs, this haplotype also
   has significant effect of reducing C18:0 content (P = 7.6×10^-8), but
   do not contain the shared 3.7 kb segment.

Figure 4.

   [131]Figure 4
   [132]Open in a new tab

   Haplotype analysis of the major QTL for C18:0 on SSC14 in the F[2],
   Sutai and DLY populations. (A) The significant regional plots for the
   SNP that affects the C18:0 content on SSC14 across three populations.
   (B) Distribution of effects of 41 SNP haplotype centered on the lead
   SNP for C18:0 identified in DLY pigs. The points represent estimates of
   the haplotype effects. Vertical bars represented the standard errors of
   the haplotype effect estimates. Haplotypes that shared the 3.7 kb
   segment (121,450,788-121,454,457 bp) were highlighted by orange color.
   (C) Heatmap of haplotypes spans 20kb upstream and downstream
   (121,434,379-121,473,888 bp) of the lead SNP 14:121,454,019 in
   individuals that carry Hap3 in F2, Hap4 in Sutai, Hap2, Hap3 and Hap4
   in DLY pigs. (D) The detail base pair information of a 7-kb segment
   than encompassed the 3.7 kb (121,450,788-121,454,457 bp) chromosome
   segment shared by F[2]-Hap3, Sutai-Hap4, DLY-Hap2, DLY-Hap3 and
   DLY-Hap4 haplotypes.

   Near the FASN gene on SSC12, we identified lead SNPs for C14:0 in both
   Erhualian and Laiwu pigs ([133]Supplementary Table 3). The lead SNP
   (12:1482194, P = 3.41 × 10^-22) identified in Erhualian pigs located at
   1.21 Mb from that identified in Laiwu pigs (12:273754, P = 4.42 ×
   10^-11) ([134]Supplementary Table 3). It is of interest to investigate
   whether a same causative mutation is underlying the association
   signatures in the two populations. We estimated the effects of
   haplotypes of 41 SNPs centered on Erhualian lead SNP (12:1482194) in
   the six populations. Interestingly, we identified a haplotype (Hap4) in
   Erhualian pigs displaying significant effect (P = 4.2 × 10^-19) on
   reducing C14:0 content ([135]Figure 5A). Phylogenetic analysis on 28
   major haplotypes with frequency >0.05 in corresponding population
   suggested that the Hap4 in Erhualian pigs was not clustered with any
   haplotypes from the other populations ([136]Figure 5B). These analyses
   suggested that Hap4 that uniquely found in Erhualian pigs underlying
   the population-specific GWAS signal in Erhualian pigs.

Figure 5.

   [137]Figure 5
   [138]Open in a new tab

   Haplotype analysis of the major QTL for C14:0 on SSC12 in the EHL
   populations. (A) Distribution of effects of 41 SNP haplotype centered
   on the lead SNP for C14:0 (1,482,028-1,483,956 bp) identified in
   Erhualian pigs in the six populations. The points represent the
   estimates of the haplotype effects. Vertical bars represented the
   standard errors of the haplotype effect estimates. Haplotype (Hap4)
   with significant effect of decreasing C14:0 in Erhualian pigs was
   highlighted by orange color. (B) Neighbor-joining tree for major
   haplotypes of the QTL on SSC12 across six populations. Haplotype
   (Erhualian-Hap4) specific to Erhualian pigs is highlighted by red
   colors.

Annotation and Biological Insights

   The lead SNPs identified through GWAS in each of the populations and
   GWAS meta-analysis correspond to 205 unique lead SNPs. Among these
   SNPs, 63% locate in intergenic region, 26% locate in introns of genes,
   10% located in upstream or downstream regions of genes, and 1% are
   synonymous variants ([139]Supplementary Table 3 and [140]Supplementary
   Table 4). Majority of these SNPs are not coding variants. We therefore
   further annotated these SNPs using published H3K27ac and H3K4me3 peaks
   ([141]Villar et al., 2015), and found that a total of 15% of the 205
   variants locate within the ChIP–seq peaks (H3K27ac: 12.7% and H3K4me3:
   2.5%), representing a 2.5 fold enrichment compared with the whole
   genome SNPs under investigations (H3K27ac: 5.4% and H3K4me3: 0.75%)
   ([142]Supplementary Table 3 and [143]Supplementary Table 4).

   To gain insight into biological pathways underlying the variations in
   muscle fatty acid composition in pigs, we investigated the 32 candidate
   genes that were found within 500 Kb region of the lead SNPs with
   function relevant to fatty acid or lipid metabolism in context of
   protein-protein interaction network in STRING database ([144]Szklarczyk
   et al., 2017). Interestingly, the 32 candidate genes appeared to be
   highly connected among each other, and several of newly identified
   candidate genes were evidenced to link to previously identified genes
   such as FASN and SCD ([145]Figure 6A) ([146]Zhang et al., 2016;
   [147]Zhang et al., 2017). Gene ontology enrichment analysis highlighted
   metabolic processes of fatty acids (P = 2.2 × 10^-32), neutral lipid (P
   = 1.6 × 10^-17) and long-chain fatty acid (7.6 × 10^-17) as the most
   enriched terms ([148]Figure 6B and [149]Supplementary Table 5), this is
   expected as the 32 genes were chosen according to their functional
   relevance to fatty acids. Nevertheless, it is still of interest to
   observe that many genes related to fatty acid catabolism or synthesis
   located in the vicinity of the fatty acid associated loci, suggesting
   fatty acid catabolism and synthesis are the primary biological
   mechanisms that affect the muscle fatty acid composition in pigs.

Figure 6.

   [150]Figure 6
   [151]Open in a new tab

   Protein-protein interaction network and gene ontology enrichment
   analysis of candidate genes. (A) Protein-protein interaction network of
   the 32 most plausible candidate genes of the lead SNPs detected by GWAS
   and meta-analysis in STRING v10.5 database. (B) Gene ontology
   enrichment analysis of candidate genes. Over-represented GO/pathway
   terms were grouped based on kappa statistics. GO/pathway terms are
   represented as nodes, and the node size represents the term enrichment
   significance, while the edges represent significant similarity between
   categories.

Discussion

   Identification of genetic variants for fatty acid compositions would
   provide a cost effective way to improve pork fat quality. Previously,
   we have performed GWAS for fatty acid composition traits in the six
   populations based on 60 K or 1.4 M SNP chips ([152]Yang et al., 2013;
   [153]Zhang et al., 2016; [154]Zhang et al., 2017). Imputation based
   GWAS had helped to identify missing QTL for lumbar number in Sutai pigs
   ([155]Yan et al., 2017) and hematological traits in the F[2] pigs
   ([156]Yan et al., 2018). In this study, by performing GWAS and
   meta-analyses based on imputed genome sequence variants, we refined
   previously identified loci and reveal a number of new loci for fatty
   acid composition traits. Moreover, the imputed data allowed us
   investigate the population shared and specific QTL at higher
   resolution. Especially, integrating the GWAS signals that shared across
   populations e.g., at SCD loci, helped to greatly refine the QTL.

   The genotype accuracy of the imputed SNPs is critical for the success
   of the association study. Previously, same imputation strategy was
   employed to reveal solid new loci for vertebral number and
   hematological traits in the same populations to those investigated in
   current study ([157]Yan et al., 2017; [158]Yan et al., 2018), which
   demonstrated that imputing from relatively low density SNP chip data
   e.g., 60K SNP genotypes, to whole sequence SNPs provide a valuable
   strategy to improve the GWAS detection precision and power
   ([159]Supplementary Figure 1). A total of 78 out of the 285 lead SNPs
   detected in this study were considered as new loci. Although most of
   the newly identified loci showed moderate association significance,
   with P values ranged from 1.10 × 10^-10 to 4.54 × 10^-8, we found a
   number of functional plausible candidate genes near these loci. For
   instance, in F[2] pigs, 2:74664653 for C20:3n-6 (P = 1.89 × 10^-8)
   located 133 Kb from PLIN5 gene, which plays a crucial role in the
   regulation of intracellular fatty acid fluxes and oxidation
   ([160]Laurens et al., 2016). In Sutai pigs 5:73950290 for C20:0/C18:0
   (P = 2.24 × 10^-11) on SSC5 located about 200 kb from ABCD2 (73.68 –
   73.74Mb) gene that involved in very long chain fatty acid catabolic
   process ([161]van Roermund et al., 2011). The lead SNP (13:24928872)
   for C20:1n-9/C20:0 was approximately 240 Kb from ACAA1 (Acetyl CoA-Acyl
   Transferase), this gene encode an enzyme important for β-oxidation of
   fatty acids in peroxisomes ([162]Zha et al., 2005). The locus for
   C18:1n-9/C16:1n-7 on SSC7 at 27.3 Mb in Laiwu pigs was adjacent to APOM
   (27.48 Mb), with function related to lipid and lipoprotein metabolism.
   APOM gene was reported to be significantly associated with fat
   deposition traits in pigs ([163]Pan et al., 2010). In human, it was
   showed that down-regulation of APOM expression can be induced by
   palmitic acid ([164]Luo et al., 2014). In Erhualian pigs, the lead SNP
   at 11.30 Mb for C20:4n-6/C20:3n-6 is near to DGAT2, a gene associated
   with long chain fatty acid metabolism ([165]Cases et al., 2001). In
   Bamaxiang pigs, the novel lead SNP for C20:4n-6/C20:2n-6 at 138.71 Mb
   on SSC8 locates in intron of SNCA gene. In mice, previous studies
   demonstrated that ablation of SNCA reduces C20:4n-6 turnover and
   increases C22:6n-6 incorporation in brain phospholipids ([166]Golovko
   et al., 2006; [167]Golovko et al., 2007).

   Moreover, the meta-analyses revealed a number of new candidate genes
   involved in metabolism/transport of fatty acid or lipid. These included
   NAA40 (lipid metabolic process), SLC27A6 (Transport of long-chain fatty
   acids), ANKRD23 (A nuclear protein involved in energy metabolism),
   ABCD3 (Peroxisome biogenesis, oxidation of dicarboxylic acids), ACSF3
   (fatty acid biosynthetic process), ACOT7 (acyl-CoA metabolic process),
   ACAT1 (catalyzes the reversible formation of acetoacetyl-CoA), ABCG4
   (cellular cholesterol homeostasis), PRKAR2B (regulating energy balance
   and adiposity), PCCA (alpha subunit of the heterodimeric mitochondrial
   enzyme Propionyl-CoA carboxylase), ACAD9 (catalyze the rate-limiting
   step in the beta-oxidation of fatty acyl-CoA), ABCG1 (phospholipids
   transport), PTGIS (monooxygenases which catalyze and synthesis of
   cholesterol, steroids and other lipids), AWAT1-AWAT2 (diacylglycerol
   acyltransferase) ([168]Supplementary Table 4).

   Furthermore, in additional to functionally plausible candidate genes,
   we also found several new lead SNPs overlapping with those loci
   identified in other populations. For instance, the lead SNP for C14:0
   at 134.70 Mb on SSC4 is close to the region (136.10–136.33 Mb on SSC4)
   for C16:1n-7 evidenced in Iberian × Landrace F[2] cross ([169]Munoz et
   al., 2013). The lead SNP for C20:3n-6 at 135.72 Mb on SSC6 near LEPR
   gene is close to the loci for longissmus muscle SFA, PUFA, and PUFA/SFA
   contents in Duroc pigs ([170]Ros-Freixedes et al., 2016). The lead SNP
   for C20:1n-9/C20:0 at 24.93 Mb on SSC13 in Sutai pigs coincides with
   the 24.49 - 25.37 Mb region for back fat C16:0 in the IBMAP population
   ([171]Munoz et al., 2013). The lead SNP for C20:0 at 146.11 Mb on SSC15
   in Laiwu pigs was close to loci at 145.53 Mb for C17:0 in a Duroc pig
   multigenerational population ([172]van Son et al., 2017). The loci at
   148.47 Mb on SSC9 and 57.71 Mb on SSC10 identified in current study are
   overlapped with 146 – 148 Mb on SSC9 for PUFA in Duroc pigs
   ([173]Ros-Freixedes et al., 2016) and 55 - 56 Mb for C18:2n-6 and PUFA
   on SSC10 in Large White pigs ([174]Zappaterra et al., 2018),
   respectively. Therefore, it is reasonable to believe most of the newly
   identified loci are not artifacts.

   We observed significant associations at SCD, ELOVL5, ELOVL6, ELOVL7,
   and THRSP in multiple populations. Therefore, we would have chance to
   leverage GWAS signals from multiple populations to refine respective
   QTL at these loci. Meta-analyses in cattle have demonstrate the power
   of integrating multiple population data to identify a small number of
   candidate causal variants ([175]Bouwman et al., 2018). Correspondingly,
   in this study, the meta-analyses largely enhanced the association
   signals at SCD, ELOVL5, ELOVL6, and ELOVL7, the most significant SNPs
   identified in meta-analyses at these loci are potential candidates for
   follow up functional studies.

   Through haplotype analyses, we identified a 3.7 kb haplotype
   (121,450,788-121,454,457) shared across F[2], Sutai and DLY showing a
   consistent effect of decreasing C18:0 content. Based on a 660 K SNP
   chip data, an independent study identified a top SNP for back fat C18:0
   content at 121,401,766 bp on SSC14 ([176]van Son et al., 2017), which
   is about 50 kb from the 3.7 kb region identified in our study. Both
   regions located about 400 kb downstream of SCD gene, therefore, we
   speculate that the underlying mutation could affect the expression of
   SCD through distant regulation mechanism, and hence affect the C18:0
   content in longissimus muscle. Further study is required to identify
   the underlying causative mutation.

   We did not find any missense mutations among the lead SNPs identified
   from single population GWAS or meta-analysis, suggesting the variations
   in the fatty acid composition in pigs is primarily affected by
   regulatory variants. This is further supported by the enrichment of
   lead SNPs in the H3K27ac and H3K4me3 peaks that are representatives of
   active promoters and enhancers ([177]Villar et al., 2015).

   Despite that fatty acid composition traits have been investigated for
   decades, the underlying molecular pathways that influence the fatty
   acid composition of longissimus muscle remains elusive. Functional
   annotation and enrichment of candidate genes at identified loci,
   supported that genes pathways related to metabolism rather than to
   transport of fatty acids would be primary biological process that
   affect the fatty acid composition in longissmus muscle.

Conclusion

   We performed GWAS and GWAS meta-analysis for 38 fatty acid composition
   traits in 2446 individuals from six different pig populations based on
   imputed genome sequencing variants. The analyses identified 78 new
   associations and refined a number of previously identified loci for
   fatty acid composition traits. This study demonstrated that genotype
   imputation from sequencing data help to improve power and precision of
   GWAS. Leveraging data from multiple populations with diverse genetic
   background hold great promise to fine map QTL that shared across
   populations. Evidences of candidate genes with function directly
   related to fatty acid metabolism and transport near respective QTL
   deepen our understanding of biological mechanism underlying the porcine
   fatty acid composition traits. The results generated in this study
   provide beneficial information for pig breeding program to genetically
   improve fatty acid profile in pork.

Data Availability Statement

   Publicly available data were analyzed for this study. it can be found
   at the following links:
   [178]https://www.ncbi.nlm.nih.gov/sra/ERP001813/
   [179]https://www.ncbi.nlm.nih.gov/sra/SRA065461/
   [180]https://www.ncbi.nlm.nih.gov/sra/SRP047260/
   [181]https://www.ncbi.nlm.nih.gov/sra/SRA096093/
   [182]https://www.ncbi.nlm.nih.gov/bioproject/PRJNA238851/
   [183]https://dataview.ncbi.nlm.nih.gov/object/PRJNA550237?reviewer=muq3
   5cdjpr0nv5ivec9r1civl2 [184]https://doi.org/10.5061/dryad.7kn7r

Ethics Statement

   The ethics committee of Jiangxi Agricultural University approved the
   animal experiments in this study.

Author Contributions

   LH conceived and designed the experiment, and revised the manuscript.
   BY supervised the experiment and data analyses, and wrote part of the
   manuscript. JZ measured the phenotype, analyzed the data and wrote the
   manuscript. YZ, HG, LC, JM, CC, HA and SX contributed to experimental
   materials. All authors read and approved the final manuscript.

Funding

   The work was supported by funds from National Natural Science
   Foundation of China (31572379 and 31230069).

Conflict of Interest

   The authors declare that the research was conducted in the absence of
   any commercial or financial relationships that could be construed as a
   potential conflict of interest.

   The reviewer JM declared a past co-authorship with one of the authors
   SX to the handling editor.

Supplementary Material

   The Supplementary Material for this article can be found online at:
   [185]https://www.frontiersin.org/articles/10.3389/fgene.2019.01067/full
   #supplementary-material
   [186]Click here for additional data file.^ (852.9KB, docx)

References