Introduction

Coal workers' pneumoconiosis (CWP) is a lung disease produced by the inhalation and deposition of occupational coal dust. There are more than 100 000 coal miners in the United States of America. In China, there are more than 5 000 000 coal miners and over 600 000 cases with silicosis, which are half of the cases in the world.1 CWP is characterized by the accumulation of inflammatory cells in the lung, thickening of the alveolar walls and formation of fibrotic nodules.2 However, only a part of the individuals exposed to coal dust or silica develop CWP in their life, suggesting that genetic susceptibility factors also have a role in the development of CWP.

microRNAs (miRNAs) are a family of endogenous, small and noncoding RNAs, 17–27nt in length, that negatively regulate gene expression by suppressing translation or degrading mRNAs.3 Strongly conserved among distantly related organisms, miRNAs are involved in a variety of biologic processes, including cell cycle regulation, differentiation, development, metabolism, neuronal patterning and aging.4 It has been estimated that the human genome contains about 1000 miRNAs5 and that they could regulate nearly 30% of human genes.6 Genetic variants including single-nucleotide polymorphisms (SNPs) or mutations in miRNAs may alter miRNA expression and/or maturation. For example, an SNP located at the eighth nucleotide of mature miR-125a in normal subjects blocks pri-miR-125a processing to pre-miRNA and thus reduces the miRNA-mediated translational suppression of its target.7 Recently, Hu et al.8 conducted a screening for common SNPs in miRNA sequences and identified four SNPs (rs2910164, rs2292832, rs11614913 and rs3746444) located at the pre-miRNA regions of miR-146a, miR-149, miR-196a2 and miR-499, respectively (Figure 1). Furthermore, they also evaluated associations between these SNPs in the pre-miRNAs and lung cancer susceptibility.9 As CWP is a dust-associated lung disease, it could have similar mechanisms underlying the development of lung cancer.

Figure 1
figure 1

Sequences and mature forms of microRNAs (miRNAs) (ad). Sequence for mature miRNA is bolded, and the position of single-nucleotide polymorphism in the miRNA is underlined.

In this study, we hypothesized that the four SNPs in the pre-miRNAs are associated with risk of CWP. To test this hypothesis, we genotyped the polymorphisms and assessed the associations with risk of CWP in our ongoing case–control study in the Chinese population.

Materials and methods

Study subjects

Four hundred ninety-six CWP patients and 513 controls were recruited from the coal mines of Xuzhou Mining Business Group, China, between January 2006 and December 2008. High kilovolt chest X-ray and physical examinations were performed for all, reconfirming the diagnoses based on the 2002 Chinese Roentgenodiagnostic Criteria of Pneumoconiosis, which is the same as the 1980 International Labour Office Classification of Pneumoconioses in the judgment of opacity profusion. The pneumoconiosis patients were classified into stage I, stage II and stage III according to the size, profusion and distribution range of opacities on chest X-ray. The participation rate of the cases was approximately 96%. The control subjects were workers matched with the CWP cases for dust exposure years and work types from the same company. The response rate of those controls contacted for participation in the study was approximately 88%. Those who refused to participate in the study did not want to donate their blood samples for the genetic testing needed in this study. Information about individual demographics as well as data on smoking status was obtained through face-to-face interviews. The research protocol was approved by the institutional review board of Nanjing Medical University.

SNP selection and genotyping

The study for SNP selection has been described in detail elsewhere.8 Briefly, there were a total of five common SNPs associated with 400 known human pre-miRNAs (miR-146a rs2910164, mirR-149 rs2292832, miR-196a2 rs11614913, miR-499 rs3746444 and miR-423 rs6505162) for the Chinese listed in the public database miRBase. As miR-423 rs6505162 SNP did not affect the hydrogen bands or predict secondary structure free energy, only four SNPs were included in the study. The polymorphism was determined using the PCR-restriction fragment length polymorphism (RFLP) method. The primers, lengths and restriction enzymes have been previously described.8 The gel electrophoresis pictures of the four SNPs are shown in Figure 2. The polymorphism analysis was performed by two persons independently in a blind manner. More than 10% of the samples were randomly selected for confirmation, and the results were 100% concordant. In addition, the PCR-RFLP results were confirmed by the direct sequencing method to verify the genotypes. As a result, we found that the genotypes using direct DNA sequencing were identical with the PCR-RFLP method.

Figure 2
figure 2

miR-146a rs2910164, miR-149 rs2292832, miR-196a2 rs11614913 and miR-499 rs3746444 genotypes determined by PCR-restriction fragment length polymorphism.

Statistical analysis

Differences in selected demographic variables, dust exposure years, the known risk factors, such as tobacco smoking, and each allele and genotype of four polymorphisms between the cases and controls were evaluated by using Student's t-test (for continuous variables) and χ2-test (for categorical variables). The Hardy–Weinberg equilibrium of the controls' genotype distributions was tested by a goodness-of-fit χ2-test. Unconditional univariate and multivariate logistic regression analyses were performed to obtain crude and adjusted odds ratios (ORs) for risk of CWP and their 95% confidence intervals (CIs). All tests were two-sided by using the SAS software (version 9.1; SAS Institute, Cary, NC, USA), unless indicated otherwise.

Results

Characteristics of the study subjects

The frequency distributions of selected characteristics of the cases and controls are presented in Table 1. There was no significant difference in the distribution of exposure years and tobacco smoking between the cases and controls (P=0.658 for exposure years, and P=0.110 for tobacco smoking). However, there was a significant difference between the cases and controls in terms of age (P<0.001). Specifically, light smokers (0–20 pack-years) had a 2.22-fold (95% CI, 1.61–3.06) increased risk compared with nonsmokers. In addition, of the 496 CWP patients, 228 (46.0%) were stage I, 213 (42.9%) were stage II and the remaining 55 (11.1%) were stage III.

Table 1 Distribution of selected variables between the coal workers' pneumoconiosis (CWP) cases and control subjects

Association between the pre-miRNA polymorphisms and CWP risk

As shown in Table 2, for the miR-149 rs2292832 polymorphism, the difference in the frequencies of genotypes and alleles between cases and controls was statistically significant (P=0.035 for genotypes and P=0.024 for alleles). The genotype distributions of these SNPs in our controls were in agreement with the Hardy–Weinberg equilibrium (P=0.060 for mirR-149 rs2292832, P=0.206 for miR-196a2 rs11614913 and P=0.136 for miR-499 rs3746444), but not for one SNP (P=0.009 for miR-146a rs2910164). In this study, the miR-149 rs2292832 CT genotype was statistically significantly associated with a deceased risk of CWP (adjusted OR=0.75, 95% CI=0.58–0.98), compared with the TT genotype. Furthermore, when we used the CT/CC genotypes as the reference, we found that the TT genotype was associated with a significantly increased risk of CWP (adjusted OR=1.31, 95% CI=1.01–1.69). However, no significant association with CWP risk was identified for the other SNPs in the pre-miRNAs examined in this study.

Table 2 Genotype and allele frequencies of pre-miRNA polymorphisms among the coal workers' pneumoconiosis (CWP) cases and controls and their associations with risk of CWP

Stratification analyses between the genotypes of four SNPs in the pre-miRNAs and risk of CWP

In the stratification analyses, we found that individuals with the TT genotype had a significantly increased risk of CWP than those with the CT/CC genotypes, and this increased risk was more pronounced among subgroups of those aged 68 years (OR=1.52, 95% CI=1.03–2.25), dust exposure 26 years (OR=1.42, 95% CI=1.04–1.93) and ever smokers (OR=1.48, 95% CI=1.00–2.20) (Table 3). In addition, significant associations were observed between the genotypes and patients with stages II and III (OR=1.50, 95% CI=1.05–2.14 for stage II, and OR=3.33, 95% CI=1.67–6.65 for stage III). However, no statistical evidence was found for the gene–environment interaction (data not shown).

Table 3 Stratification analyses between the genotypes of miR-149 rs2292832 polymorphism and coal workers' pneumoconiosis (CWP) risk

Discussion

In this study, we found that the miR-149 rs2292832 TT genotype was associated with a significantly increased risk of CWP compared with the CT/CC genotypes. In addition, the increased CWP risk was more significant in older smokers with long exposure years. Furthermore, statistical evidence was observed for the polymorphism and CWP patients with stages II and III. To the best of our knowledge, this is the first study of the association of common SNPs in pre-miRNAs with CWP risk.

The biogenesis of miRNAs involves a complex protein system, including members of the Argonaute family, Pol II-dependent transcription and the RNAase IIIs Drosha and Dicer.10 miRNAs are involved in crucial biological processes, including development, differentiation, apoptosis and proliferation.3, 11 From the systematic expression analysis of 217 mammalian miRNAs from 334 samples by Lu et al.,12 miR-149 was widely detected in human lung normal and tumor tissues. Sequence variations within the miRNA genes could potentially influence the processing and/or target selection of miRNAs.7 A few studies have been performed that examined sequence variations in miRNA regions.7, 8, 13, 14 For example, Hu et al.8 found that rs11614913 in the pre-miRNA-196a2 was associated with an increase in mature miRNA expression, which contributes to the development and prognosis of lung cancer. In this study, we found a positive association between rs2292832 in miR-149 and CWP risk. However, the potential of mechanism underlying this effect remains unknown. Whether the SNP rs2292832 in miR-149 could result in the expression of mature miRNA or not should be confirmed in further studies. In addition, many human miRNA transcripts have target sequences in the 3′-UTR to which miRNAs bind and exert posttranscriptional repression. As shown in Table 4, some of the predicted probable targets of miR-149 have been experimentally verified. For example, studies showed that N-deacetylase/N-sulfotransferase-1 (NDST1) is an essential NDSR isozyme in mouse embryonic development,15 and disruption of NDST1, resulting in severe malformations in the lung.16 Therefore, miR-149 was likely to affect the expression of the miRNA target gene NDST1 and contribute to the susceptibility of CWP. However, functional characterizations of SNP rs2292832 in miR-149 and its target mRNAs in CWP are warranted.

Table 4 Predicted most probable targets of miR-149

In this study, we also found that the increased risk associated with the rs2292832 TT genotype was more pronounced among older smokers with long exposure years. It is possible that the significant difference in the results may be due to the differences in the age of study subjects. For example, the control population is younger than the study population and may have lesser risk of CWP. Another possible explanation is that individuals in those subgroups may be more likely to have been exposed to some risk factors involved in the etiology of CWP risk, such as tobacco smoking, alcohol use or occupational coal mine dust.17 However, the lack of significant interactions suggested that the sample size of our study may be insufficient. After stratification for CWP stage, it appeared that the miR-149 rs2292832 TT genotype had an increased risk of CWP with stages II and III, but not with stage I. It has been posed that different stages of CWP have different etiology involving different genetic defects.18 Therefore, it seems plausible that the miR-149 rs2292832 polymorphism is involved in the higher stage of CWP patients. However, this hypothesis needs to be confirmed in further studies.

In conclusion, this study indicates that the SNP rs2292832 in miR-149 modulates the risk of CWP in the Chinese population. These findings should be further validated by larger, preferably prospective studies with more diverse ethnic groups.