Introduction

Connective tissue growth factor (CTGF) has recently been proposed to play a prominent role in the pathogenesis of diabetic nephropathy (van Nieuwenhoven et al. 2005). Diabetic nephropathy is the leading cause of end stage renal disease in the Western world (USRDS 2004), with substantial evidence for an inherited predisposition to diabetic nephropathy in a subset of type 1 diabetic patients (Quinn et al. 1996; Seaquist et al. 1989). To date, the major gene variants involved in predisposition to diabetic nephropathy have not been identified.

CTGF is an excellent biological candidate gene for nephropathy given its role in extracellular matrix remodelling and its upregulation by the profibrotic transforming growth factor beta (TGFβ1) (van Nieuwenhoven et al. 2005). Increased CTGF levels have been detected in the glomeruli of patients with diabetic nephropathy (Adler et al. 2001) and urinary CTGF excretion correlates with the amount of albuminuria in type 1 diabetic patients (Gilbert et al. 2003). Increased in vivo CTGF mRNA and protein levels have been demonstrated in the glomeruli of diabetic animals with nephropathy (Murphy et al. 1999; Wahab et al. 2001), while in vitro studies have shown CTGF induction by high glucose (Murphy et al. 1999), advanced glycation end products (Twigg et al. 2001), reactive oxygen species (Park et al. 2001) and mechanical strain (Hishikawa et al. 2001).

The CTGF gene maps to chromosomal location 6q23.1 (Martinerie et al. 1992); however, the genomic region has not yet been fully characterised. We screened the CTGF gene to identify variants and employed a case-control study to assess association between selected polymorphisms and diabetic nephropathy in an Irish population.

Subjects and methods

Ethical approval was obtained from the appropriate Research Ethics Committees and written, informed consent obtained from individuals prior to conducting this study. All patients were at least third generation Irish Caucasians diagnosed with type 1 diabetes mellitus before 31 years of age, and required insulin from diagnosis. Patients with nephropathy (cases, n=272) had diabetes for at least 10 years before the onset of proteinuria (>0.5 g/24 h), were on antihypertensive therapy, and demonstrated concurrent retinopathy. Patients without nephropathy (controls, n=367) had diabetes for at least 15 years with no evidence of microalbuminuria, were not in receipt of antihypertensive medication, and demonstrated no evidence of non-diabetic renal disease. There was no significant difference between cases and controls in mean age at onset [17.1 (SD 8.2) year vs 16.8 (SD 8.1) year], mean duration of diabetes ([26.9 (SD 8.3) year vs. 27.7 (SD 9.0) year] or mean HbA1c value [8.5 (SD 1.7)% vs. 8.4 (SD 1.6)%]. As expected, average blood pressure was higher and more variable in the cases compared to the controls [systolic = 150.1 (SD 22.6) mm Hg vs 126.9 (SD 16.6) mm Hg; diastolic = 86.5 (SD 11.4) mm Hg vs 76.1 (SD 7.3) mm Hg].

The nucleotide sequence of draft clone RP11-6918 for human chromosome 6 was downloaded from the National Centre for Biotechnology Information (http://www.ncbi.nlm.nih.gov). Reference mRNA (NM_001901, gi:4503122) and protein (NP_001892.1, GI:4503123) sequences were used to determine exon–intron boundaries. The promoter region was identified using ElDorado software from the Genomatix Suite, while MatInspector software (Cartharius et al. 2005) was used to assess the influence of identified single nucleotide polymorphisms (SNPs) in the promoter region on transcription factor binding sites. For screening purposes, 3,975 bases of genomic sequence for CTGF were divided into 11 overlapping fragments (S1). The CTGF gene, including the promoter, all exons, introns, 620 bp upstream of the 5′ untranslated region and 216 bp beyond the 3′ untranslated region were screened in 15 case and 15 control individuals. Each PCR product was then analysed on the WAVE (dHPLC) DNA Fragment Analysis System (Transgenomic, Crewe, UK) following the manufacturer’s recommendations. Differentially separating fragments (representing DNA variants) were bidirectionally sequenced to identify variants using an ABI PRISM 3100 Genetic Analyser (Applied Biosystems, Warrington, UK). Forty-six healthy, Irish controls were genotyped by direct capillary sequencing (Applied Biosystems) to establish allele frequencies for all variants. Genotyping was performed on case and control DNAs for SNPs c.−650G>C, c.289+94T>C and c.289+98T>C by Pyrosequencing (Biotage, Uppsala, Sweden), c.−220G>A employing TaqMan technology (Applied Biosystems) and c.−420InsT by direct capillary sequencing as this SNP was not amenable to the other technologies. Genotype frequencies were assessed for Hardy–Weinberg equilibrium using a χ2 goodness-of-fit test. The χ2 test for contingency tables was used to compare genotype and allele frequencies between case and control subjects with the level of significance set to P<0.05. Subgroup analyses on the basis of gender (female: cases, n=112; controls, n=209), duration of diabetes (<24 years: cases, n=101; controls n=143) and retinopathy (with retinopathy, n=399; no evidence of retinopathy, n=240) was performed for all four SNPs. The extent of linkage disequilibrium (LD) was quantified using Lewontin’s D′ (Lewontin 1964). Haplotype frequencies were estimated and compared between cases and controls using the Chaplin program (Epstein and Satten 2003).

Results and discussion

Comparison of our resequenced genomic data with the protein and mRNA reference sequences revealed the CTGF gene has five exons, where all exon–intron boundaries follow the consensus AG/GT rule (Mount 1982). The coding region of CTGF spans 1,909 bases from the start (ATG) to the stop (TGA) codons; however, we comprehensively screened and resequenced almost 4 kb of this gene, including the promoter, all exons, introns and beyond the untranslated regions. There are presently no SNPs in dbSNP validated by frequency data, and no haplotype tagged SNPs recorded in the genomic region of CTGF (gene_id=1490, NCBI, accessed 8 October 2005). As there was no genomic reference sequence available for the CTGF gene in public repositories, we have submitted our annotated sequencing data as GenBank accession number DQ097843. Our resequenced data established six differences with the GenBank recorded draft sequence for this region of chromosome 6, RP11–6918 (c.247G, c.754−29delT) and the partial promoter sequence published for CTGF, X92511 c.−401C>T, c.−274insG, c.−250insC and c.−246insG); this may be explained by the lower quality, draft raw sequence data generated by earlier, large-scale genotyping efforts. In total, we identified four SNPs in the promoter region, one in exon 2, two in intron 2 and three in the 3′ untranslated region of the CTGF gene. Two of these were previously recorded in dbSNP and we have obtained novel NCBI identifiers for the remaining eight SNPs (Table 1).

Table 1 Variant position, unique NCBI (http://www.ncbi.nlm.nih.gov) identifier and minor allele frequencies for identified variants in the connective tissue growth factor gene (CTGF) in 46 healthy controls

Using publicly available software, we identified two putatively functional SNPs in the promoter region of this gene, c.−650G>C and c.−484T>C, each of which are predicted to create a transcription factor binding site in the CTGF gene. The c.650G>C SNP is located two bases from a putative myeloid zinc finger 1 binding site previously reported by a Danish group (Blom et al. 2001) with a similar minor allele frequency to that found in our study (Table 2). The exonic SNP c.247G>C (rs7451102) was detected in only one individual in our healthy control population, but was predicted to cause a non-synonymous amino acid change from a small aspartic acid to a larger histidine residue (p.Asp83His) in the insulin-like growth factor binding protein domain (Pfam: 00219), thus having a potentially functional effect on the CTGF protein.

Table 2 Genotype and allele frequencies for genotyped CTGF SNPs in cases with diabetic nephropathy and controls without nephropathy. Data are n (%). P values are presented from the χ2 analysis for both genotype and allele frequency comparisons between these groups

Five SNPs demonstrated a minor allele frequency greater than 5% (Table 1) and were subsequently genotyped in an Irish, type 1 diabetic case-control cohort of 639 individuals (272 cases, 367 controls). The distribution of genotypes was found to be in Hardy–Weinberg equilibrium for all SNPs in both case and control groups. Significant association was not observed with either genotype (P>0.9) or allele (P>0.7) frequencies between case and control groups (Table 2). These sample sizes have 90% power to detect a doubling in the minor allele frequency in cases relative to controls (e.g. 10% vs 5%). Further subgroup analyses for gender (genotype P>0.4), duration of diabetes (genotype P>0.4) and retinopathy (genotype P>0.4) also failed to reveal significant association. Highly statistically significant LD was observed for all five loci (P<0.00001, Table 3). Haplotype frequencies were estimated separately in case and control groups, with the same two haplotypes being found at frequencies greater than 5% in each group. From 5′ to 3′ c.−650G>C, c.−420InsT, c.−220G>C, c.289+94T>C, c.289+98T>C),G–GTT was the most common haplotype (cases=76.4%, controls=78.1%), followed by G–GCC (cases=13.4%, controls=10.9%). A global test comparing these haplotype frequencies in cases and controls did not attain significance (χ2=1.32, df=2; P=0.57).

Table 3 Linkage disequilibrium (LD) between common SNPs in the CTGF gene. Data are |D′|, p<0.00001

The relatively low number of SNPs identified in the CTGF gene may reflect the diverse functions and high degree of conservation observed among different species (Moussad and Brigstock 2000). CTGF is involved in a multitude of physiological pathways and has been recently proposed as a potential therapeutic target for fibrotic diseases (Franklin 1997; van Nieuwenhoven et al. 2005; Twigg and Cooper 2004). Knowledge of the gene structure and variants should aid our understanding of CTGF gene function and further the development of strategies to ameliorate the development and progression of diabetic nephropathy. We have submitted the annotated, ‘corrected’ genomic sequence for the CTGF gene and have identified polymorphisms, seven of which are novel, which may influence the role of CTGF in disease pathologies. To our knowledge, this is the first report describing variations in the complete CTGF gene and assessing the influence of DNA variants on diabetic nephropathy. In our Irish type 1 diabetic population, our results suggest that common SNPs in CTGF do not play a major role in genetic susceptibility to diabetic nephropathy; however, this should be assessed in a larger study group.