Abstract
The size of the gluteus medius muscle (GM) in swine significantly
impacts both hindlimb conformation and carcass yield, while little is
known about the genetic architecture of this trait. This study aims to
estimate genetic parameters and identify candidate genes associated
with this trait through a genome-wide association study (GWAS). A total
of 439 commercial crossbred pigs, possessing both Landrace and
Yorkshire ancestry, were genotyped using the Porcine 50K chip. The
length and width of the GM were directly measured, and the area was
then calculated from these values. The heritabilities were estimated by
HIBLUP (V1.5.0) software, and the GWAS was conducted employing the
BLINK model implemented in GAPIT3. The heritability estimates for the
length, width, and area of the GM were 0.43, 0.40, and 0.46,
respectively. The GWAS identified four genome-wide significant SNPs
(rs81381267, rs697734475, rs81298447, and rs81458910) associated with
the gluteus medius muscle area. The PDE4D gene was identified as a
promising candidate gene potentially involved in the regulation of
gluteus medius muscle development. Our analysis revealed moderate
heritability estimates for gluteus medius muscle size traits. These
findings enhance our understanding of the genetic architecture
underlying porcine muscle development.
1. Introduction
Pork constitutes a major portion of global meat consumption [[44]1]. In
response to increasing demand for animal protein [[45]2], improving
pork yield and meat quality has become a pivotal breeding goal for the
pig industry [[46]3]. The gluteus medius muscle (GM) is anatomically
positioned in the proximal hindlimb region of swine and is one of the
major muscles of the porcine hindlimb. The development of the gluteus
medius plays a crucial role in determining the meat yield of the
hindlimb and affects the quality by regulating the composition of
muscle fiber types [[47]4]. These physiological characteristics
position the gluteus medius as a significant quality indicator in the
processing of pork hams [[48]5,[49]6], thus enhancing the commercial
value of the pig hindlimb. Consequently, the gluteus medius size
exhibits both a direct impact on carcass yield and a significant
correlation with the high-value cut percentage in swine.
Genome-wide association studies (GWAS) have emerged as a powerful
strategy for elucidating the genetic basis underlying complex traits
[[50]7,[51]8]. Currently, although an increasing number of GWAS have
examined carcass quality traits, the predominant focus has remained on
readily quantifiable phenotypic measures, including backfat thickness
(BFT) and loin muscle area (LMA). For example, Yang et al. [[52]9]
identified the DOCK7 gene associated with body weight and BFT utilizing
GWAS in 1200 Landrace and Yorkshire pigs, and Zhuang et al. [[53]10]
localized multiple quantitative trait loci (QTLs) and genes affecting
LMA and lumbar muscle depth (LMD) through GWAS in 6043 Duroc pigs.
Despite these advances, the analysis of the genetic basis of deeper
muscular structures such as gluteus medius is still significantly
deficient. Currently, the pig QTL database
([54]https://www.animalgenome.org/cgi-bin/QTLdb/SS/index, accessed on
26 April 2025) catalogs 3,977 gluteus medius-associated QTLs, mainly
based on fat content measurements. Notably, significant gaps persist in
elucidating the genetic determinants underlying gluteus medius
developmental characteristics, such as gluteus medius cross-sectional
area (GMA). Given the gluteus medius’s contribution to both carcass
leanness and meat quality, this gap impedes our comprehensive
understanding of porcine carcass quality assessment.
In this study, we performed a GWAS of GMA utilizing a population of 439
crossbred pigs which possessing both Landrace and Yorkshire ancestry.
We aim to explore genetic markers governing gluteus medius development,
thereby advancing marker-assisted selection strategies for lean meat
production.
2. Materials and Methods
2.1. Phenotypic Data
In this study, phenotypic data were obtained from 439 crossbred pigs.
These pigs were derived from a Landrace × Yorkshire rotational
crossbred commercial population. All animals were raised under
standardized environmental and management conditions. Following
industry-standard slaughter protocols, carcass weights were
systematically recorded during processing.
For morphological characterization, the gluteus medius muscle (GM) was
measured on the left side of each carcass post-evisceration ([55]Figure
1). Two primary parameters were collected, (1) gluteus medius length
(GML): Maximum longitudinal dimension (mm); (2) gluteus medius width
(GMW): Perpendicular width at the muscle’s broadest cross-section (mm).
The cross-sectional area of the gluteus medius (GMA) was subsequently
estimated, using the following formula:
[MATH: GMA=(GML×GMW)/2(mm2)
mo> :MATH]
(1)
Figure 1.
Figure 1
[56]Open in a new tab
Schematic representation of the gluteus medius anatomical location and
measurement protocol. L represents gluteus medius length and W
represents gluteus medius width.
2.2. Genotypic Data
Muscle tissue samples were collected from each pig and stored in 75%
ethanol at -20 °C before genomic analysis. Genomic DNA was extracted
from each sample using standard phenol-chloroform extraction protocols,
with nucleic acid purity assessed spectrophotometrically through
absorbance ratios (A260/280 and A260/230). High-quality DNA samples
were subsequently genotyped using a 50K SNP chip, yielding a total of
52,000 SNPs. Following initial genotyping, genotype imputation was
performed using BEAGLE v5.4 [[57]11] to address missing data points.
Quality control (QC) was implemented through VCFtools [[58]12] with the
following exclusion criteria: (1) SNPs with minor allele frequency
(MAF) < 0.05 %; (2) SNPs with call rates < 90%; (3) All non-autosomal
SNPs. After QC, a total of 45,487 SNPs were retained for subsequent
analysis.
2.3. Estimation of Genetic Parameters
The genetic parameters of GML, GMW, and GWA were estimated using a
mixed linear model using HIBLUP (V1.5.0) software [[59]13], and the
analysis model was as follows:
[MATH: y=Xb+Za+e :MATH]
(2)
where y is the vector of observed phenotypic values, b is a vector of
fixed effects (carcass weight), a is a vector of additive genetic
effects, assumed to follow a~N (0, G
[MATH: σa2 :MATH]
), where G is the genomic relationship matrix and
[MATH: σa2 :MATH]
is the additive genetic variance; X and Z are the incidence matrices
for fixed and random effects, and e is the vector of residual errors,
with e~N(0, I
[MATH: σe2 :MATH]
), where I is the identity matrix and
[MATH: σe2 :MATH]
is the residual variance.
2.4. Genome-Wide Association Study
We conducted GWAS using the BLINK model (Bayesian-information and
Linkage-disequilibrium Iteratively Nested Keyway), implemented in GAPIT
(version 3.0) [[60]14], due to its high computational efficiency and
strong statistical performance. The BLINK method conducts two fixed
effect models and one filtering process [[61]15]. The two fixed effect
models can be written as follows:
[MATH: yi=
Si1*b1+Si2*b2+…+Sik*bk+Sijdj+ei :MATH]
(3)
[MATH: yi=
Si1*b1+Si2*b2+…+Sik*bk+ei :MATH]
(4)
where
[MATH: yi :MATH]
represents the observation on the ith individual;
[MATH: Sik* :MATH]
is the genotype of the k pseudo QTN;
[MATH: bk :MATH]
is the effect of the k pseudo QTN;
[MATH: Sij :MATH]
is the genotype of the ith individual and jth genetic marker;
[MATH: dj :MATH]
is the effect of the jth marker. The optimization is performed using
Bayesian information criteria (BIC). In addition, the first three
principal components (PCs) were used to account for population
stratification. The genome-wide significance level was defined as p =
0.05/N [[62]16], where N represents the number of SNPs analyzed.
2.5. Functional Annotation of Significant Loci
Significant SNPs and candidate genes were mapped and annotated using
the following bioinformatics resources: SNP positions were determined
based on the Sus scrofa reference genome assembly 11.1
([63]http://ensembl.org/Sus_scrofa/Info/Index, accessed on 7 May 2025).
Candidate genes within 1Mb genomic window (±500 kb) of significantly
associated SNPs were identified using the BioMart data
([64]http://asia.ensembl.org/biomart/martview/, accessed on 7 May
2025). Functional annotation analysis of candidate genes was then
performed using the DAVID database ([65]https://david.ncifcrf.gov/,
accessed on 8 May 2025), including Gene Ontology (GO) term enrichment
analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway
enrichment analysis. SNPs and candidate genes were queried for
correlated traits through the PigBiobank database [[66]17], which
integrates multi-omics data including expression quantitative trait
loci (eQTLs) and phenotypic associations from the PigGTEx project.
Complementary gene functional analyses were performed by GeneCards
([67]https://www.genecards.org/, accessed on 8 May 2025).
3. Results
3.1. Phenotypic Variation and Heritability Estimates
[68]Table 1 summarizes the descriptive statistics and genetic parameter
estimates for gluteus medius length (GML), width (GMW), and area (GMA).
The GMA demonstrated the greatest phenotypic variability among the
measured traits, exhibiting a coefficient of variation (CV) of 36.86%.
The high CV of GMA suggests potential genotype-by-environment (G×E)
interactions. For the genotypic data, after quality control, a total of
45,487 high-quality SNPs were retained for downstream genetic analyses.
Moderate heritability estimates (0.40~0.46) were observed for all three
morphometric traits, with the GMA showing the strongest genetic
component (h^2 = 0.46 ± 0.10, p = 6.26 × 10^−6). All heritability
estimates reached highly significant levels (p < 0.001), indicating
substantial genetic contributions to trait variation. Population
stratification was assessed using principal component analysis (PCA),
with the first three PCs incorporated in subsequent association
analyses to control for potential population structure effects. The 3D
PCA plot is shown in [69]Figure S1.
Table 1.
Descriptive statistics of phenotype and heritability of gluteus medius
size.
Trait Mean ± (SD) Min Max CV (%) h^2 ± (SE) p
GML (mm) 103.99 ± 17.74 26.86 153.83 17.06 0.43 ± 0.10 1.22 × 10^−5
GMW (mm) 20.45 ± 5.37 4.04 35.84 26.29 0.40 ± 0.10 1.13 × 10^−4
GMA (mm^2) 1086.60 ± 400.56 127.10 2364.20 36.86 0.46 ± 0.10 6.26×
10^−6
[70]Open in a new tab
GML, gluteus medius length; GMW, gluteus medius width; GMA, gluteus
medius area; SD, standard deviation; CV, coefficient of variance; h^2,
heritability; SE, standard error.
3.2. Genome-Wide Association Study
Given that the GMA trait exhibited the highest heritability, we
subsequently conducted a genome-wide association study using the BLINK
model in GAPIT3 to identify potential causal variants. [71]Figure 2
presents the Manhattan plot and QQ plot, with the genome-wide
significance threshold set at −log10(0.05/45487) = 5.96. The genomic
inflation factor (λ) was calculated as 1.185. Four genome-wide
significant SNPs (rs81381267, rs697734475, rs81298447, and rs81458910)
were identified ([72]Table 2). Notably, the most significant
association signal was observed for rs69773447 (p = 4.84 × 10^−9),
located on chromosome 10, which accounted for 6.21% of the phenotypic
variance. The QQ plot demonstrates the distribution of observed versus
expected −log10(p) values, where the diagonal reference line (red)
represents the null hypothesis of no association. The minimal deviation
of most data points from this line indicates appropriate control of
population stratification and test statistics, while the extreme
outliers represent true association signals.
Figure 2.
[73]Figure 2
[74]Open in a new tab
Manhattan and Q–Q plots for GWAS of the GMA trait. The solid line
represents the genome-wide significant threshold of 5.96.
Table 2.
Significant SNPs from the GWAS for the GMA trait.
SSC Position SNP ID p EPV (%) Distance (kb) Nearest Gene
4 14,866,061 rs81381267 2.98 × 10^−8 11.30 67.27 MTSS1
10 1,441,934 rs697734475 4.84 × 10^−9 6.21 −72.29 RGS21
13 176,619,148 rs81298447 9.39 × 10^−7 9.51 −139.70 ROBO1
16 39,179,599 rs81458910 3.64 × 10^−8 13.41 within PDE4D
[75]Open in a new tab
SSC, Sus scrofa chromosome; EPV, Explained phenotypic variance;
Distance, the SNP located upstream/downstream of the nearest gene.
3.3. Candidate Genes Search and Functional Annotation
By integrating association results from the PigBiobank database, we
annotated the top 10 phenotypic traits associated with each significant
SNP ([76]Table S1). Three SNPs (rs697734475, rs81298447, and
rs81458910) showed significant associations with carcass composition
and meat quality traits, including backfat thickness, lean meat
percentage, and loin muscle depth. Within 500 kb flanking regions of
the significant SNPs, we annotated 19 protein-coding genes ([77]Table
S2). GO and KEGG analyses suggest that these genes may be involved in
energy metabolism, lipid synthesis, and signaling, thereby affecting
muscle development ([78]Table S3, [79]Figure S2). The nearest genes to
each SNP and their genomic distances are detailed in [80]Table 2. The
most significantly associated SNP, rs697734475 (chr10:1441934; p = 4.84
× 10^−9), was flanked by two genes encoding regulators of G-protein
signaling: RGS21 (72.29 kb upstream) and RGS18 (78.94 kb downstream).
Furthermore, rs81381267 (chr4:14866061), accounting for 11.30% of the
phenotypic variance, was positioned adjacent to metastasis suppressor 1
(MTSS1), which encodes an actin-binding protein crucial for
cytoskeletal reorganization and cellular membrane dynamics [[81]18].
The roundabout guidance receptor 1 (ROBO1) gene, located proximal to
rs81298447 (chr13:176619148), encodes a receptor that binds SLIT2
ligands and mediates functions in cellular migration and angiogenesis
pathways [[82]19]. Notably, rs81458910 (chr16:39179599) accounted for
13.41% of the phenotypic variance and was located within the
phosphodiesterase 4D (PDE4D) gene, a compelling candidate gene involved
in regulating both myogenesis and adipogenesis processes.
4. Discussion
In modern pork production, premium meat cuts obtained through carcass
segmentation (e.g., ham products derived from porcine hind limbs)
significantly enhance commercial value [[83]20]. The GM, as a principal
muscular component of the hind limb, plays a crucial role in
determining both limb conformation and overall carcass merit. However,
due to its deep anatomical location, conventional phenotyping methods
are inadequate for precise morphological assessment, necessitating the
implementation of marker-assisted selection (MAS) to optimize breeding
strategies.
In this investigation, we analyzed GM morphological traits (length
[GML], width [GMW], and area [GMA]) in a crossbred population. Notably,
the GMA exhibited the highest coefficient of variation, indicative of
substantial environmental or genetic influences on this trait. Using
HIBLUP for variance component estimation, we identified
moderate-to-high heritability estimates (h^2 ≥ 0.4) for all GM traits
([84]Table 1), confirming significant genetic contributions to muscular
development. Comparative analysis revealed that GML demonstrated
greater heritability than GMW, suggesting that linear measurements may
be more informative than transverse dimensions for the genetic
evaluation of GM morphology. Importantly, GMA showed the highest
heritability (h^2 = 0.46 ± 0.10) with strong statistical significance,
highlighting its exceptional potential for genetic improvement through
selective breeding.
Based on these findings, we performed GWAS focusing specifically on
GMA. Unlike traditional GWAS models (e.g., MLM), the BLINK algorithm
employed here reduces false positives by iteratively adjusting for
population structure and linkage disequilibrium [[85]15]. The BLINK
model demonstrated higher statistical power than the conventional MLM
model [[86]16]. Our crossbred population, derived from Landrace and
Yorkshire lineages, enhances allele diversity and improves detection
power for minor-effect loci. This genetic heterogeneity enables more
robust identification of trait-associated variants within complex
architectures, significantly increasing GWAS resolution. Furthermore,
we employed estimated breeding values (EBVs) rather than raw phenotypic
measurements for GWAS, as EBVs incorporate genomic relationships to
increase detection power [[87]21].
A total of four SNPs associated with the GMA trait were identified by
GWAS. Among the identified variants, rs697734475 showed the strongest
statistical significance (p = 4.84 × 10^−9). The identified SNP is
flanked by two members of the Regulator of G-protein Signaling (RGS)
family—RGS18 and RGS21. The RGS proteins regulate heterotrimeric G
proteins through stimulation of guanosine triphosphatase (GTPase)
activity, thereby contributing to a wide range of downstream cellular
signaling [[88]22]. GO analysis also showed that these RGS family genes
are involved in G protein signaling inhibition ([89]Table S3). RGS18
exhibits tissue-specific expression patterns, with predominant
localization in hematopoietic lineages including progenitor cells and
megakaryocytes, where it plays a crucial role in modulating platelet
activation and thrombotic responses [[90]23]. In contrast, RGS21
demonstrates ubiquitous expression across multiple tissues and
participates in diverse physiological processes. In addition, several
studies have revealed RGS21 in the modulation of olfactory and
gustatory perception in animals [[91]24,[92]25,[93]26]. Given that
chemosensory genes can influence porcine fat deposition and meat
tenderness through feeding behavior modulation [[94]27,[95]28], we
speculated that RGS21 may indirectly regulate GM development via
analogous pathways. The SNPs rs81381267 and rs81458910 accounted for
11.30% and 13.41% of the phenotypic variance, respectively,
underscoring their substantial contribution to trait variability.
Notably, rs81458910 is precisely located within the PDE4D gene.
According to GeneCards, PDE4D encodes a cAMP-specific phosphodiesterase
that hydrolyzes the secondary messenger cyclic AMP (cAMP), serving as a
critical regulator of numerous essential physiological processes. The
iswine database [[96]29] expression profile demonstrates that PDE4D
exhibits tissue-specific expression patterns, with the highest
abundance in muscle-related tissues, suggesting its crucial involvement
in regulating skeletal muscle formation and function through
cAMP-mediated signaling pathways. The study by Xiong et al. [[97]30]
identified a critical association between PDE4D and the observed
variations in skeletal muscle growth across different pig breeds.
Moreover, single-nucleus RNA sequencing analyses have established PDE4D
as a molecular marker for adipocytes in both Laiwu and Large White
pigs, with potential implications for intramuscular fat deposition
[[98]31,[99]32,[100]33]. Collectively, these findings suggest that
PDE4D may be involved in regulating the growth and development of
muscle tissues, particularly the GM, representing a novel candidate
gene for porcine carcass traits and meat quality. However, studies have
demonstrated that mutations in the PDE4D gene are associated with
severe human disorders. For example, PDE4D was reported to be
associated with asthma and acrodysostosis [[101]34,[102]35]. Further
investigation is warranted to determine whether specific PDE4D
mutations may increase risks of certain defects.
Integrated GO and KEGG analyses identified key genes involved in
muscular lipid metabolism ([103]Table S3). SQLE and ELOVL7 were
significantly enriched in endoplasmic reticulum-related GO terms and
metabolic pathways (KEGG ssc01100; p < 0.05). SQLE (Squalene
Epoxidase), encoding squalene epoxidase, catalyzes the rate-limiting
conversion of squalene to 2,3-oxidosqualene in cholesterol
biosynthesis. Its inhibition induces apoptosis through ER stress and
altered cholesterol synthesis [[104]36]. Differential SQLE expression
between high- and low-lipid deposition groups in Nanyang pigs
[[105]37], along with its SNP (c.2565G>T) associations with meat
quality traits [[106]38], highlight its dual role in lipid metabolism
and muscle development. ELOVL7 (Fatty Acid Elongase 7), the
rate-limiting fatty acid elongase, mediates VLCFA synthesis
(C18:0-C20:0) [[107]39]. GWAS consistently associate ELOVL7-linked SNPs
with porcine fatty acid profiles
[[108]40,[109]41,[110]42,[111]43,[112]44], supported by its
downregulation in obese Yorkshire pigs [[113]45], suggesting its
importance in meat quality regulation. Additional key regulators
include: RNF139 (Ring Finger Protein 139), an ER-localized E3 ligase
that modulates lipid metabolism via SREBP2 processing and HMGCR
degradation [[114]46,[115]47]; NDUFB9 (NADH: ubiquinone oxidoreductase
subunit B9), whose expression correlates with adipogenesis and is
epigenetically regulated [[116]48,[117]49]. This integrated analysis
reveals critical molecular networks governing lipid metabolism and meat
quality in swine.
5. Conclusions
In this study, we revealed moderate heritability estimates for all
gluteus medius muscle size indicators. Based on a genome-wide
association study, we identified four significant SNPs (rs81381267,
rs697734475, rs81298447, and rs81458910) related to the gluteus medius
muscle area. Subsequent functional annotation identified two promising
candidate genes (PDE4D and RGS21). These findings provide novel
insights into the genetic architecture underlying porcine gluteus
medius muscle size.
Supplementary Materials
The following supporting information can be downloaded at:
[118]https://www.mdpi.com/article/10.3390/vetsci12080730/s1, Figure S1.
Three-dimensional PCA Plot. Figure S2. GO and KEGG enrichment results.
Table S1. Top 10 Phenotypes Associated with Genome-Wide Significant
SNPs from PigBiobank Meta-GWAS Analysis. Table S2. The distribution of
genes within a 1 Mb range around significantly associated SNPs for the
GMA trait. Table S3. Enrichment analysis of genes in the 1 Mb range of
SNP significantly associated with GMA traits.
[119]vetsci-12-00730-s001.zip^ (64.8KB, zip)
Author Contributions
Conceptualization, B.S., H.S. and Y.H.; methodology: C.B., H.S., and
Y.H.; data curation, J.K., C.C., J.L., S.L. and W.L.; formal analysis,
J.F. and X.Z.; writing—original draft, Y.H. and H.S.; writing—review
and editing, Y.H.. and H.S. All authors have read and agreed to the
published version of the manuscript.
Institutional Review Board Statement
This study was approved by the Institutional Animal Care and Use
Committee of Jilin University (No. SY202507020) for all experimental
protocols.
Informed Consent Statement
Not applicable.
Data Availability Statement
Upon reasonable request, the datasets of this study can be available
from the corresponding author.
Conflicts of Interest
The authors declare no conflicts of interest.
Funding Statement
This research was funded by the National Natural Science Foundation of
China, grant number 32202628.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data
contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI
and/or the editor(s) disclaim responsibility for any injury to people
or property resulting from any ideas, methods, instructions or products
referred to in the content.
References