Introduction

Because of its beneficial health effects, skeletal muscle strength is an important trait, even in populations that live in modern societies and no longer rely heavily on physical labor. Indeed, both in the elderly and the general population, muscle strength is inversely associated with all-cause mortality,1 osteoporosis,2 cardiovascular diseases,3 metabolic syndrome prevalence4 and other pathological conditions and chronic diseases.5 Heritability estimates vary from 31 to 78% with large differences between muscle groups, contraction velocities or muscle lengths (reviewed in Peeters et al6). Apart from a genetic component, muscle strength is influenced by environmental factors such as nutrition, training or social status, and by the interaction between genes and environment, rendering it a complex multifactorial trait.

The ‘Human gene map for performance and health-related fitness phenotypes’7 indicates that only a limited number of genes has been implicated in muscle strength and even fewer associations have consistently been replicated. Genetic linkage studies regarding muscle strength are even scarcer. To date, only Tiainen et al8 and our research group9, 10, 11, 12, 13, 14 reported results on genetic linkage for skeletal muscle strength characteristics. We were the first to perform linkage analyses on a unique collection of young male Caucasian siblings drawn from the Leuven Genes for Muscular Strength study (LGfMS).9 Single and multipoint microsatellite marker-based linkage analyses revealed significant or suggestive linkage of chromosomal regions 12q12-14, 12q22-23 and 13q14.2 to knee muscle strength.9, 11 It can, however, not be excluded that genes other than the original myostatin pathway genes underlie these linkage findings and further fine mapping of the region surrounding these candidate genes is therefore warranted.

In this study, we further investigated the 12q12-14 region. We applied a two-staged genetic fine mapping approach, followed by a replication of the most promising associations and an evaluation of mRNA variation in relation to a ACVR1B SNP, in an independent study sample. We are the first to reveal evidence for variations in the ACVR1B gene to determine – at least in part – muscle strength properties. Our observations fit with available physiological data that demonstrate a potential role for ACVR1B in the regulation of muscle mass, as it is part of the transforming growth factor β (TGFβ) pathway regulating myostatin – a negative regulator of muscle mass – signaling.15, 16

Materials and methods

Study design

A two-staged approach was designed to fine map the chr12q12-14 linkage peak. The most significant findings were then replicated in an independent study sample and the influence of a specific SNP on mRNA expression levels in quadriceps muscle tissue was tested. A schematic representation of the study design can be found in Figure 1.

Figure 1
figure 1

Study design. The different stages of the study are displayed according to their chronological order from left to right. Sample description, genotyping details and statistical analyses are presented from top to bottom. LGfMS, Leuven Genes for Muscular Strength study; BLSA, Baltimore Longitudinal Study of Aging; SPAH, study conducted within the framework of the Flemish Policy Research Centre Sport, Physical Activity and Health.

Fine mapping stage 1

SNPs were selected under the 1-LOD confidence interval of the previously defined linkage peak,9, 11 using an empirical two-step strategy, in which candidate genes are prioritized using a bioinformatics approach, and the top genes are chosen for further SNP selection with a linkage disequilibrium (LD)-based method, as described in Supplementary file 1 (available online) and elsewhere.17 A list of the candidate genes and corresponding polymorphisms is available in Supplementary file 2 (available online) (Supplementary Table S1).

Joint linkage and association analyses, and family-based association analyses tested the association of these polymorphisms with knee strength measurements.

Fine mapping stage 2

Candidate genes were ranked on the basis of (1) their significance in (a) the joint linkage and association approach, (b) the family-based association approach and (c) both methods; (2) their significance over different strength measurements; (3) the knowledge about their function in muscle tissue (known, putative, general function in different tissues or no known/unclear function). The highest ranked gene ACVR1B was selected for further follow-up. INHBC was also included because it is part of the same TGFβ signaling pathway as ACVR1B, even though it had a lower ranking. An extended sample was genotyped for the stage 1 SNPs, complemented with additional tagSNPs and polymorphisms likely to have functional consequences, such as exonic SNPs, SNPs in intron/exon boundaries (possibility to influence the splicing process), in (putative) transcription factor-binding sites or in highly conserved regions (Table 1). Assessment of potential functionality was based on queries from SNPselector,18 SNPseek (http://snp.wustl.edu/cgi-bin/SNPseek/index.cgi) and publicly available genetic databases.

Table 1 Stage 2 genes and corresponding polymorphisms

Replication

Genetic linkage and association analyses are prone to false-positive findings. Therefore, we applied a staged design, used complementary association methods and derived empirical P-values. Moreover, we replicated the associations with putatively functional polymorphisms, detected in the second stage of the fine mapping, in an independent sample.

Study samples

Two-staged fine mapping

Siblings analyzed in stage 1 and stage 2 of this study were selected from the LGfMS (748 men; 17–36 years), based on family size and DNA availability. The recruitment protocol and subject characteristics have been described elsewhere.9, 11 For stage 1, a total of 500 individuals were included (169 subjects overlap with the microsatellite-based linkage study11). For stage 2, a total of 536 subjects were included with an overlap of 464 subjects with stage 1. Subjects for whom genotyping failed in stage 1 were excluded from stage 2, and subjects for whom new DNA was collected between the two stages were included. Descriptive statistics of these samples can be found in Supplementary file 2 (Supplementary Table S2).

Replication sample

The replication sample consists of 193 Caucasian men (20–90 years) from the Baltimore Longitudinal Study of Aging (BLSA)19, 20, 21 and 74 healthy Caucasian men (60–78 years) from a study conducted within the framework of the Flemish Policy Research Centre Sport, Physical Activity and Health (SPAH).22, 23

The medical and ethical committee of the Katholieke Universiteit Leuven approved the LGfMS and SPAH studies. The BLSA experimental protocols were approved by the Institutional Review Boards for Human Subjects at Medstar Research Institute (Baltimore, MD, USA), Johns Hopkins Bayview Medical Center (Baltimore, MD, USA) and the University of Maryland (College Park, MD, USA). All subjects gave written informed consent before participating. Descriptive statistics of the replication subsamples can be found in Supplementary file 2 (Supplementary Tables S2 and S3).

Strength measurements

Isometric and dynamic strength of the knee extensors and flexors was measured using an isokinetic dynamometer as described in Supplementary file 1 and elsewhere (LGfMS,9, 11 BLSA20, 21 and SPAH23).

SNP genotyping

Detailed SNP genotyping methods can be found in Supplementary file 1. In short, DNA was extracted from whole blood (LGfMS, BLSA, SPAH) or from saliva (Oragene DNA Self-Collection Kits; Oragene, Ontario, Canada; LGfMS). Genotyping was performed on an Illumina Bead Array platform (Illumina Inc., San Diego, CA, USA; LGfMS stage 1), a Sequenom iPLEX Gold platform (Sequenom Inc., San Diego, CA, USA; LGfMS stage 2 and SPAH) or with Taqman allelic discrimination assays (Applied Biosystems, Life Technologies Corporation, Carlsbad, CA, USA; BLSA). Overall sample success rates, locus success rates and genotype call rates can be found in Supplementary file 1.

Quantification of ACVR1B mRNA expression

Muscle tissue (right vastus lateralis) was obtained from 16 healthy male volunteers (18–28 years) from an independent study conducted in our research center.24 A detailed description of the mRNA extraction and real-time PCR protocol for ACVR1B mRNA expression is available in Supplementary file 1 and elsewhere.25

Statistical analyses

Pedstats v0.6.426 was used to check pedigrees for Mendelian consistency and to test Hardy–Weinberg equilibrium (HWE). Unlikely genotypes were identified and discarded using the error checking option of Merlin v1.0.1.27

QTDT v2.5.1 software28 was applied to compare a test for linkage between the SNPs and the strength measurements with a test for linkage while modeling association.29 When association between a SNP and the studied trait is present, the linkage signal should become nonsignificant when association is added to the model. Merlin v1.0.127 was used to calculate IBD probabilities. Genetic distances were derived from physical distances (NCBI build 35) using Rutgers Map Interpolator based on Rutgers first-generation combined linkage-physical map (Rutgers map v.1).30

Haploview31 was used to examine the LD structure across the region of interest and to calculate r2 between all pairs of SNPs within a candidate gene.

Complementary to the combined linkage and association analysis, we conducted family-based association tests (FBAT v1.7.3) under the null hypothesis of ‘linkage and no association’.32, 33 The haplotype version of this test (HBAT)34 was used to obtain empirical P-values by means of the Monte-Carlo permutation procedure (10 000 permutations). An additive model was tested and trait offsets were specified as the sample mean of the trait. As only empirical P-values are reported, we should note that these were derived under the null hypothesis of ‘no linkage and no association’ since permutation tests are too computer-intensive under the alternative hypothesis. However, simulations show that tests under both null hypotheses render comparable results (data not shown).

SAS v9.1.3 (SAS Institute, Cary, NC, USA) was used to calculate descriptive statistics and to assess genotypic differences in muscle strength in the replication sample and in ACVR1B mRNA expression using analyses of (co)variance. Because the strength testing protocols between the replication samples differed (eg, in equipment, angle for the isometric test, measurement units), strength measurements were normalized to Z-scores (mean=0; standard deviation=1) per sample. Similar strength measurements were combined in the joined sample as described in Supplementary file 2, Supplementary Table S3. Because of the large age range of the replication individuals, age was included as a covariate in the analyses.

Probability levels <0.05 are considered significant. Instead of correcting P-values for multiple testing, we relied on the use of complementary association analysis methods, calculation of empirical P-values and a staged design including replication in an independent sample to minimize the possibility of false-positive findings.

Results

Fine mapping stage 1

In stage 1, candidate genes and SNPs in the 1-LOD region of the original chr12q12-14 linkage peak were selected using an empiric two-step approach. First, a bioinformatical prioritization approach was used to select 86 candidate genes out of the 454 genes in the region. Second, 181 tagging SNPs, determined on CEPH (Centre d’Etude du Polymorphisme Humain) genotypes downloaded from the Hapmap35 website, were selected in 74 candidate genes. For the remaining 12 genes, no polymorphisms meeting our criteria (Illumina designability rank=1 (ie, high success rate of assay design); MAF>0.05) were present. These polymorphisms were supplemented with 28 SNPs in the gaps in-between the genes, to enhance power for linkage.17 A list of the selected SNPs is available in Supplementary file 2 (Supplementary Table S1).

Of the 209 SNPs, 198 were successfully genotyped, of which 7 SNPs were discarded (2 were monomorphic and 5 departed from HWE at the P<0.001 level), yielding 191 SNPs for further analysis.

These SNPs were successfully genotyped in a sample of 499 young male siblings. Association between a SNP and the studied muscle strength characteristics was evaluated using a combined linkage and association approach.29 Consistent drops in –log P-values after modeling a joint test for linkage and association – indicating the presence of association between SNP and trait – were present for rs10784948 in contactin 1 (CNTN1) and rs7960176 in YY1 associated factor 2 (YAF2; Figure 2, open arrow) and for rs10783485 and rs10783486 in activin receptor 1B (ACVR1B; Figure 2, black arrow).

Figure 2
figure 2

Combined linkage and association analyses on selected strength measurements. A drop in –log P-value for the combined linkage and association analysis (thin line) compared to the linkage analysis (thick line) indicates evidence for association. Associated SNPs rs10783485 and rs10783486 in ACVR1B are marked with a black arrow and associated SNPs rs10784948 (CNTN1) and rs7960176 (YAF2) are marked with an open arrow.

In addition, family-based association analyses were performed. Figure 3 shows the empirical P-values for an additive model for the 191 SNPs. SNPs in five genes (ACVR1B, cyclin-dependent kinase 4 (CDK4), cullin-associated and neddylation-dissociated 1 (CAND1), inhibin β C (INHBC) and myosin IA (MYO1A)) were repeatedly highly suggestively associated (P<0.01 or −log P>2) with different strength measurements.

Figure 3
figure 3

Family-based association results for selected strength measurements using single SNP markers. Empirical P-values from HBAT analyses were calculated after 10 000 permutations. Strength measurements at 150° (bold line), at 120° (triangle), at 60°/s (cross) and at 120°/s (circle) are shown. Most suggestive significant results (P<0.05, −log P-value >1.3) were marked with associated genes.

Stage 2 candidate genes were ranked on the basis of significance in the different association tests, significance over different strength measurements and (putative) functionality in muscle tissue. ACVR1B got the highest ranking because of its high significance in both the combined linkage and association approach (Figure 2) and the family-based association analyses (Figure 3), and because the ACVR1B protein has a role in the molecular TGFβ pathway regulating the signaling of myostatin, a negative regulator of skeletal muscle mass.15, 16 Even though it ranked lower, INHBC was also included for follow-up because it is a part of the same superfamily of proteins as ACVR1B.

Fine mapping stage 2

In stage 2, 20 and 13 polymorphisms were selected within ACVR1B and INHBC, respectively, (Table 1 and Supplementary Figures S1 and S2; available in Supplementary file 2) based on their probability to influence ACVR1B and INHBC gene function or expression. Of these 33 SNPs, 7 failed during analyses (including rs10783486, a stage 1 tagSNP), and 10 were 100% homozygous. The resulting 16 informative SNPs were genotyped in 536 siblings. Genotype distributions for all successfully genotyped polymorphisms were in HWE (P>0.001).

ACVR1B

Family-based association results for the ACVR1B SNPs and knee muscle strength can be found in Table 2. For rs10783485 and rs746434 (two tagSNPs), the association results from stage 1 were confirmed in the extended sample (stage 2: N=536 versus stage 1: N=499; overlap of 464 individuals) with significant associations for most of the knee flexion strength and some of the extension strength parameters (0.00002<P<0.039). No additional subjects were genotyped for rs10783486, so stage 1 results are presented in Table 2. The conserved SNP rs11612312 and SNP rs2854464, located in a putative miRNA-binding site, both showed an association with the dynamic knee flexion and extension measurements at 60°/s. Additionally, the polymorphisms show (a trend toward) significance for isometric strength.

Table 2 Empirical P-values for association of ACVR1B polymorphisms and selected strength measurements

INHBC

Only two of four selected tagSNPs showed limited levels of association with muscle strength (Table 3). Marker rs533975 was associated with torque flexion at 60°/s and 120°/s (P<0.049) and rs2943693 with torque flexion at 60°/s, and torque extension at both 60°/s and 120°/s (P<0.021). None of the other SNPs showed association with isometric or dynamic knee torque.

Table 3 Empirical P-values for association of INHBC polymorphisms and selected strength measurements

Replication analyses

We replicated the associations with putatively functional polymorphisms, detected in the second stage of the fine mapping, in men selected from two independent studies. Given the limited evidence for association for the INHBC gene, we focused the replication effort on rs10783485, rs10783486 and rs2854464 in ACVR1B.

All three polymorphisms were genotyped in 192 men from the BLSA study.19 In the SPAH individuals22 (n=74), only rs10783485 and rs2854464 were genotyped because of the technical difficulties. All genotype distributions were in HWE (P>0.001).

Replication analyses tested whether individuals homozygous for the strength increasing allele – as determined in stage 2 of the fine mapping – differed significantly from other individuals.

Because of testing protocol differences, strength measurements were normalized per sample before the analyses. Age was included as a covariate in the analyses because of the large age range of the individuals. Rs10783485 only showed significant association with moderate velocity torque of the knee extensors (GG: 0.074±0.079 versus GT+TT: −1.15±0.070; P=0.38). Individuals homozygous for the rs2854464 strength increasing A allele had significantly higher dynamic knee extensor strength than GG and AG individuals (all P<0.036; Table 4).

Table 4 Muscle strength characteristics according to rs2854464 genotype in the combined sample

Quantification of mRNA expression

Effects at the mRNA level were tested only for rs2854464, located in a putative miR-24-binding site 985–1011 bp downstream of the stop codon of the ACVR1B gene, and inducing a higher strength in AA compared with AG+GG individuals (Tables 2, 3 and 4).

We genotyped this SNP in a limited sample (N=16) from an independent study conducted in our research center from which muscle tissue biopsies were available.24 Relative expression levels were 0.93±0.18 and 0.95±0.25 for AA homozygotes and G-allele carriers, respectively, and did not differ significantly (P=0.80).

Discussion

We described the results of a two-staged gene-centered fine mapping approach applied on a previously identified linkage peak in the 12q12-14 region, and identified genetic variants in the activin receptor 1B (ACVR1B) gene to be associated with human muscle strength in a sample form the Leuven Genes for Muscular Strength study. A replication effort in a (size-limited) sample consisting of 266 men from two independent studies partly supports this association.

This is the first report that demonstrates a possible genetic link between knee strength and ACVR1B, which is a member of the TGFβ superfamily of proteins – as is myostatin36 – and is known to be involved in the molecular pathway regulating myostatin and activin signaling. These signal pathways are triggered by the binding of the ligand to a type II receptor, the recruitment of a type I receptor and the formation of a heteromeric, active receptor complex. The type I receptor is the center of the signaling as it is essential for signal specificity as well as for signal propagation beyond the plasma membrane.15, 16

Cross-linking studies demonstrated that activin receptor IIB (ACVR2B) is the primary type II receptor for myostatin.37, 38 Blocking the activity of this receptor in mice models leads to dramatic increases in muscle mass, comparable with those seen in myostatin knock out mice.37, 39 Additional evidence for a role of activin receptor signaling in determination of human muscle mass and strength comes from Walsh et al,40 who showed a genetic association between ACVR2B and follistatin variants and skeletal muscle mass and strength. Moreover, resistance training could downregulate ACVR2B expression, suggesting a role for ACVR2B and the activin pathway in muscle response to strength training.41, 42

Although it has been recognized that activin receptors are important in myostatin signaling, published studies have focused on the initial binding of myostatin to ACVR2B, whereas no studies reported on the subsequent steps of the signaling cascade. Cross-linking studies, however, showed that after binding of myostatin to ACVR2B, a type I receptor (ACVR1B (ALK4) or TGFβ receptor 1 (ALK5)) is recruited.38

Combined linkage and association, and family-based association results show that genetic variation in ACVR1B can influence human muscle strength. Of particular interest is the association between rs2854464 and muscle strength, with 2% lower knee strength for G-allele carriers compared to AA homozygotes in the LGfMS sample. Follow-up replication analyses in an independent but size-limited study sample also show an enhanced strength for AA homozygotes with regard to dynamic knee extensor strength.

The rs2854464 polymorphism is located in a putative miR-24-binding site in the 3′ untranslated region (UTR) of the ACVR1B mRNA. Wang et al43 showed that miR-24 could decrease human ACVR1B expression at mRNA and protein levels. Published evidence shows that genetic variation might influence organismal phenotypes44, 45 by perturbing miRNA-mediated gene regulation. Similarly, we hypothesize that the rs2854464 polymorphism affects muscle strength by interfering with miRNA binding.

Recently, Sun et al46 showed that this miR-24 is involved in the inhibition of skeletal muscle differentiation by TGFβ. Even though miR-24 is not a muscle-specific miRNA, they suggested that miR-24 might function during differentiation and homeostatic maintenance of cardiac and skeletal muscle tissue. We hypothesize that the effect of miR-24 on TGFβ signaling is mediated through a miR-24-binding site in ACVR1B, one of the type I receptors important for TGFβ as well as myostatin signaling.

It should be noted that the significances found for several of the selected ACVR1B SNPs are not completely independent findings, as these SNPs are in (high) LD with each other. The rs2854464 polymorphism is part of a haplotype block consisting of rs746434, rs11612312, rs10783486, rs2172603 and rs2854464 (determined using the four gamete rule in Haploview31). Haplotype analyses using HBAT software34 confirm the single SNP analyses (data not shown). Although rs10783485 is not part of the same haplotype block, it does show considerable LD with SNPs in the haplotype block (r2 range 0.15–0.44) and the observed significant results could therefore be related to rs2854464.

Even though a set of significant association results was observed for INHBC gene variants, these were not consistent over the different strength measurements. Therefore, these associations were not considered to be compelling evidence for a role of INHBC gene variants in determining muscle strength variation.

We find less significant results for associations at higher contraction velocities and different influences on knee flexors and extensors, suggesting a velocity and/or muscle group specific set of genes.11 This is consistent with the observation that the importance of genetic factors decreases with increasing velocities in concentric torques due to genetic variation in contractile and elastic components, which contribute differently to strength production at different velocities.47

It could be argued that the currently applied two-stage strategy infers a bias toward genes with a known function in (the regulation of) muscle strength. Indeed, in both stages literature knowledge was used to prioritize genes for further follow-up. In addition, it should be noted that not all genetic variation in the linkage region was captured using our stage 1 SNP selection method because we only genotyped a limited set of SNPs with a MAF>5%. It can therefore not be ruled out that genes and/or polymorphisms other than the ones we investigated are involved in the linkage signal. Moreover, genes for which no significant associations were found should not be considered negatively associated as not all SNPs within these genes were covered (eg, rare alleles). However, we were aware of these limitations from the beginning of our study and acknowledge that additional research will be necessary to determine which other genes/polymorphisms, in addition to ACVR1B, could be responsible for the linkage signal. These analyses should take into account that since the onset of this study a considerable amount of new information about human genetic variation has become available (eg, the 1000 genomes project; http://www.1000genomes.org).

In summary, we conducted a genetic fine mapping to identify variations in specific genes underlying a previously reported linkage peak for knee muscle strength on chr12q12-14. We are the first to describe, and replicate, a genetic link of ACVR1B, a receptor involved in the myostatin signaling pathway, to the determination of variation in muscle strength phenotypes. A SNP, rs2854644, located within a putative miR-24-binding site in the 3′ UTR of the ACVR1B gene, was found to be significantly associated with human knee muscle strength in three independent non-athletes samples from a wide age range.