Introduction

Breast and ovarian cancers are among the most common cancers in developed and developing countries.1 Approximately 5–10% of familial cases are due to germline mutations in susceptibility genes, such as BRCA1, BRCA2, CHEK2 and TP53. These susceptibility genes have key roles in cell cycle control, apoptosis and DNA repair.2,3

Mutations in the BRCA1 and BRCA2 genes are associated with the hereditary breast and ovarian cancer (HBOC) syndrome. Patients who have HBOC syndrome have a personal and family history of cancer mainly in the following organs: breast, ovarian, prostate and pancreas.2 In contrast, mutations in the CHEK2 and TP53 genes are also associated with breast and ovarian cancers, as well as with other cancers related to the Li–Fraumeni Syndrome and Li–Fraumeni-like syndrome.4,5

Many studies have demonstrated that mutations in these susceptibility genes could result in early-onset carcinoma as well as in a worse prognosis; therefore, it is important to use genetic testing to identify these deleterious mutations, so that risk-reducing and risk-prevention strategies can be applied.6,7

In previously characterized populations, it is easy to verify the genetic susceptibility of high-risk individuals, because some germline mutations have founder effects, such as the BRCA1 c.66_67delAG and c.5266dupC mutations and BRCA2 c.5946_5946delT in the Ashkenazi population; BRCA2 c.156_157insAlu in Portugal; BRCA2 999del5 in Iceland; CHEK2 c.1100 in European populations; and TP53 p.R337H in Southern Brazil.813

Thus, knowing that ethnic differences in cancer incidence and mortality are the results of genetic and epidemiological risk factor interactions, it is crucial to characterize a population which will allow for a more accurate risk assessment and proper genetic counseling for the individuals.14,15

The Brazilian population is one of the most heterogeneous in the world. Many populations have contributed to it, mainly Portuguese, African and Amerindian. However, other European populations, such as the Spanish and Italian, have also contributed to its genetic diversity.16 As the pattern of admixture in Brazil varies according to the region1721 and no study has been performed with the population from the Northeast region of Brazil, our aim was to verify the frequency of mutations of clinical significance in BRCA1, BRCA2, CHEK2 and TP53 in 106 patients at high risk for HBOC from the Northeast region of Brazil.

Materials and methods

Study population

A total of 106 unrelated Brazilian patients at high risk for HBOC who were referred to the Genetics Clinic of the Hospital Universitário Edgar Santos from Universidade Federal da Bahia between 2008 and 2013 were included in the study. The research was approved by the Research Ethics Committee of the institutions that were involved and all of the patients provided informed consent.

During genetic counseling, clinical and epidemiologic data were obtained, and genetic testing was offered to affected individuals who met at least one of the following criteria: (a) two or more family members affected by breast cancer; (b) one family member affected by breast cancer and another affected by ovarian cancer; (c) three or more family members affected by breast or ovarian cancer, with at least one affected with ovarian cancer; (d) breast cancer diagnosed before 45yr of age; (e) ovarian cancer diagnosed before 50yr of age; (f) bilateral breast cancer; (g) male breast cancer at any age; and (h) cancer in multiple organs including breast cancer. The patients’ available relatives were also analyzed.

Genotyping susceptibility genes

DNA was extracted from peripheral blood using the Gentra Puregene Blood kit (QIAGEN, Hilden, Germany) or the AxyPrep Blood genomic DNA Miniprep kit (Axygen Biosciences, Union City, CA, USA). The screening for mutations in BRCA1 was performed in two steps: (a) polymerase chain reaction (PCR) of each exon followed by single-strand conformation polymorphism analysis and electrophoresis of denatured DNA as previously described22 and (b) then all of the PCR fragments that presented a different migration pattern on the gel were analyzed by direct sequencing in an ABI PRISM 3130xl genetic analyzer (Applied BioSystems, Foster City, CA USA) using the BigDye™ terminator sequencing kit (Applied BioSystems) as recommend by the manufacturer. The primers used as well as the PCR cycling conditions are shown in Supplementary Table 1.

The initial screening for the mutations BRCA2 (c.5946_5946delT and c.156_157insAlu), CHEK2 (c.1100delC, c.444+1G>A and p.I157T) and TP53 (p.R337H) was performed by AS-PCR followed by electrophoresis in agarose gels (2%) or PCR/RFLP, which consisted of the following steps: (a) PCR reaction, (b) verification of amplification by electrophoresis in agarose gels (1.5%), (c) digestion with the appropriate restriction enzyme and (d) scoring other electrophoreses in agarose gels (2.5%).

The primers and PCR cycling were based on Struewing et al.,8 Machado et al.,9 Bayram et al.23 and Custodio et al.24 (Supplementary Table 2). All of the RFLP reactions used enzymes reagents of New England Biolabs (New England Biolabs, Beverly, MA, USA) and were performed according to this manufacturer.

Each PCR reaction contained 0.25 μM each primer, 0.5 mM dNTPs, 10 mM Tris-HCl (pH 8.9), 50 mM KCl, 1.5 or 2.0 mM MgCl2, 1 U Taq DNA polymerase (Invitrogen, Carlsbad, CA, USA), 25 ng genomic DNA and UltraPure DNase/RNase-Free Distilled Water (Invitrogen) up to 25 μl. Except in the PCR reaction for TP53’s mutation, Go Taq Flexi DNA polymerase was used (Promega, Madison, WI, USA).

The positive results were verified by two sequencing reactions. For the genotyping control, 10% of the negative samples were randomly selected and sequenced. The PCR reactions of the BRCA2 c.156_157insAlu were run with a positive control. The gene reference sequences were NG_005905.2, NG_012772.3, NG_008150.1 and NG_017013.2, which are available in GenBank (NCBI, NIH, USA). The sequences were analyzed using the CLC Bio sequence viewer version 6.8.1 (a QIAGEN Company) and BioEdit version 7.1.9 (Ibis Biosciences, Carlsbad, CA, USA) or FinchTV 1.4.0 (Geospiza, Seattle, WA, USA).

Genetic ancestry analysis

A micro-panel of nine ancestry informative markers was used to estimate the genetic ancestry: APO, AT3-I/D, CKMM, FY-NULL, GC*1S, GC*1F, LPL, PV92 and SB19.3. These ancestry informative markers were genotyped as described above by (a) PCR followed by electrophoresis, (b) PCR/RFLP, or (c) real-time PCR as described by Parra et al.25 and Shriver et al.26

The contents of the PCR reaction were as described above, except for the real-time PCR reactions, which used TaqMan SNP genotyping assays (Applied BioSystems) as recommended by the manufacturer and were monitored in ABI Prism 7700 Sequence Detection Systems (Applied Biosystems).

The restriction enzymes were also from New England Biolabs (New England Biolabs). The primers, enzymes, probes and PCR conditions are described in Supplementary Table 3.

Statistical analysis

The descriptive analyses were performed in Epi Info version 7. GENEPOP 1.227 and ADMIX 9528 were used to verify the allelic frequencies and estimate the genetic ancestry. The population structure was analyzed in STRUCTURE v.2.3.429 with 10.000 for the burn-in period interactions, 10 000 additional interactions and admixture option with the LOCPRIOR model and a K value from 1 to 3. The genotypes of the African (Nigerian), Amerindian and European (German and Spanish) populations were kindly provided by Dr Mark Shriver from Penn State University.

Results

All of the patients were female, and most of the patients had cancer before 50 years of age, with a mean age at diagnosis of 43.06 years (SD=±10.67). The majority of the cases were breast cancer (91.4%), followed by ovarian cancer (4.7%) (Table 1). Among the breast cancer cases, the most common histological type was the invasive ductal carcinoma (IDC) at 69.30% (70/101), and among the ovarian cancer cases, the most common was serous adenocarcinoma, at 77.77% (7/9).

Table 1 Baseline characteristics of the 106 patients analyzed

In addition to the personal cancer history, most of these women also had a family history (83.02%, Table 1), and the frequency of the cancers associated with HBOC (breast cancer, ovarian cancer, prostate cancer and pancreatic cancer) was significant among the first (81.82%)-, second (87.83%)- and third (78.71%)-degree relatives.

Regarding other risk factors, such as the use of oral contraceptives, parity and age at menarche, a significant percentage of the study population had used oral contraceptives for more than 5 years (64.15%) or also had children (76.42%), and the mean age at menarche was 12.89 years (SD=±1.63). Another risk factor that was analyzed was the smoking status, but a considerable portion of the subjects were nonsmokers (64.15%) (Table 1).

Knowing that the Brazilian population is composed of at least three ancestral population groups, a trihybrid admixture model was estimated in ADMIX 95, which showed a high European ancestry contribution 62.20% (s.e.=0.034) and considerable African and Amerindian ancestry contributions, 31.20% (s.e=0.011) and 6.60% (s.e.=0.036), respectively. This analysis was statistically significant (r2=0.991), and the study population fit the Hardy–Weinberg Equilibrium (P<0.005).

In addition, the population stratification analysis in STRUCTURE was also significant for K=3 (r=0.604); the bar plot in Figure 1 (left) shows the genetic admixture proportions of the Brazilian study population compared with the African, Amerindian and European populations based on the ancestry informative markers panel. The triangle plot in Figure 1 (right) demonstrates the distribution of the subjects according their genetic admixture and population of origin, in which is observed the heterogeneity degree of the study population clustered between the European and African populations. The average distances (expected heterozygosity) between individuals in the same cluster was greater in cluster 3 (0.319), in which the study population was more closely clustered than was that in cluster 1 (0.262) and cluster 2 (0.256), in which the African and Amerindian populations clustered (see triangle plot, Figure 1).The subjects also exhibited a distinct pattern in the assessment of self-reported race/color; most of the subjects self-reported as mulatto (40.57%), followed by white (34.91%) and black (20.75%). Almost all of the subjects were from the State of Bahia (83.02%), but there were also subjects from other regions of Brazil (15.09%) (Table 1).

Figure 1
figure 1

Distribution of the subjects according their estimated ancestral background (triangle plot) and population structure for the trihybrid model (bar plot). Note: red dots, African population; green dots, Amerindian subjects; blue, European subjects; yellow dots, Brazilian population (study population).

Ten unrelated patients were identified with mutations of clinical significance; the most frequent mutation was BRCA1 c.211A>G rs80357382 (4.71%; 5/10), followed by BRCA1 c.3331_3334delCAAG rs80357777 (3.77%; 4/10) and TP53 p.R337H rs121912664 (0.94%; 1/10). Six of these patients were from the countryside regions of the State of Bahia and four were from Salvador, the capital of Bahia. The clinic and pathological characteristics of the mutation carriers are described in Table 2, and Figure 2 shows the sequencing fragments of the identified mutations.

Table 2 Characteristics of the patients with BRCA1 and TP53 mutations
Figure 2
figure 2

Sequencing fragments of carriers and noncarriers for the BRCA1 p.R71G, BRCA1 3450del4 and TP53 p.R337H mutations. Note: Het, heterozygous type; Wt, wild type.

All of the mutation carriers had a family history of breast and ovarian cancer involving first-, second- and/or third-degree relatives (80.00%), except for two carriers who had no suggestive family history of any cancer (patients #21.1 and #106.1, Table 2). The pedigrees of three subjects who are carriers for each mutation are shown in the Figure 3, which presents the segregation of these mutations among first- and second-degree relatives, except for the TP53 p.R337H mutation carrier (Patient #97.1) for whom no family members were available for testing.

Figure 3
figure 3

Pedigree of families harboring the BRCA1 p.R71G, BRCA1 3450del4 and TP53 p.R337H mutations. Note: positive (+) symbol, mutation carrier; negative (−) symbol, noncarrier; *age unknown. The numbers between parentheses represent the age at diagnosis in years.

No patient with the Galician founder mutation (BRCA1 c.211A>G) reported a Spanish ancestry contribution in their families, whereas among the patients with the BRCA1 c.3331_3334delCAAG (3450del4), three reported a Portuguese and/or French ancestry contribution and the fourth patient reported an Italian ancestry contribution. The TP53 p.R337H mutation’s carrier did not report her ancestry contribution. Table 2 presents the estimated genetic ancestry proportion for each mutation carrier, which was similarly estimated for the study population as a whole as described above.

Discussion

Here, 106 unrelated high-risk subjects for HBOC from Northeast Brazil, a population that until now was uncharacterized, were analyzed. In this population, two mutations of clinical significance were found in BRCA1, 3450del4 and p.R71G (8.49%) and one in TP53, p.R337H (0.94%). These mutations were associated with aggressive clinic-pathological characteristics; most of the subjects had IDC and triple-negative tumors, and a family history of cancer involving first-, second- and/or third-degree relatives (Table 2 and Figure 3). Only two mutation carriers did not have a family history of any cancer, but both of these carriers had early-onset breast cancer (patients #21.1 and #106.1, Table 2).

The 3450del4 mutation has been described in many populations worldwide.3034 In South America, this mutation has been described among patients with HBOC from other regions of Brazil33,34 and among patients with ovarian cancer from Colombia.31 In fact, in Colombia, 3450del4 was the only mutation of clinical significance, suggesting a founder effect. Here, this mutation also had a significant frequency among the patients from Northeast Brazil, with a frequency higher (3.77%) than that previously observed in the others regions of Brazil, at 0.16%33 and 2.12%.34

The other mutation that was found in this study, BRCA1 p.R71G, which accounted for 50% of the mutations identified, is a Galician founder mutation that was first described by Vega et al.35 Based on the mtDNA analysis, the Galician population is very homogeneous, with traits of a Culdesac population with a striking similarity to the Basque population owing to its geographic location and cultural barriers (dialect).36 Others studies corroborate the restriction of this mutation in Galician or Hispanic descendants in Portugal,37 USA38,39 and Asturias (Northern Spain).30

Although no Brazilian patient with the p.R71G reported a Spanish ancestry contribution, these patients presented considerable European ancestry contributions (Table 2), which was statistically significant (r=0.604), and historical data corroborate the hypothesis that the Northeast region of Brazil has a high Spanish ancestry contribution, especially Galician, because the State of Bahia received more Galician immigrants than others states. In Salvador, the capital of Bahia, from 1883 to 1950, approximately 3–6 generations ago (~20 years per generation), 94.3% of the Spanish immigrants were from Galicia, of which 90.8% were from Pontevedra Province.40,41 It is also important to emphasize that the likelihood of a de novo mutation occurring in BRCA1 and BRCA2 genes is very low.42

The third mutation that was found, TP53 p.R337H (0.94%), is a low-penetrance mutation that has a founder effect in the Southern region of Brazil and is associated with breast cancer cases that are related to Li–Fraumeni-like syndrome syndrome.4,11 It seems that p.R337H originated in the Portuguese population because the haplotypes of the Portuguese (A3) and the Brazilian (A1) carriers differ by only one SNP, rs9894946:C>T.11

The frequency of the TP53 p.R337H mutation in Southeast and South Brazil varies from 0.51 to 8.6%,4345 and in the Northeast population, this mutation occurred at a significant frequency (0.94%). However, here the mutation carrier had a family history that was suggestive of HBOC and not Li–Fraumeni Syndrome or Li–Fraumeni-like syndrome (Figure 2) based on the NCCN Clinical Practice Guidelines in Oncology v.1.201046 and criteria that were proposed by Li–Fraumeni, Birch, Eeeles and Chompret.47

However, this association was also recently found by Cury et al.;48 therefore, this finding demonstrated that in Brazil, the TP53 p.R337H mutation is important for patients at high risk for HBOC as well as Li–Fraumeni Syndrome and Li–Fraumeni-like syndrome. Consequently, in agreement with it has been recently proposed, the combined screening of BRCA1, BRCA2 and TP53 should be offered to high-risk as well as early-onset breast cancer patients from Brazil.44,49

Interestingly, in this study we did not observe the BRCA1 mutation that is the most frequent among the Brazilians: the BRCA1 c.5266dupC (5382insC), which accounted for 56% of all of the identified mutations in patients from Southeast Brazil.50 The highest frequency that was ever found in Brazil was 5% by Ewald et al.51 among high-risk patients for HBOC from Porto Alegre and Rio de Janeiro. Other studies in different regions of Brazil also found BRCA1 c.5266dupC with a significant frequency,33,34,52 except among Ashkenazi women from Porto Alegre.53

BRCA1 c.5266dupC was first described as an Ashkenazi founder mutation, but an analysis of the origin of this mutation in the European population demonstrated that c.5266dupC originated from a common ancestor in Northern Europe long before becoming an Ashkenazi founder mutation.54 Because the Brazilian population has an ancestry contribution of many European populations, this mutation is likely to be found.

BRCA1 c.66_67delAG and BRCA2 6174delT, also considered Ashkenazi founder mutations,8 were found in other regions of Brazil34,53 as well as in Spanish and Latin America populations,26,35 and interestingly were not present in the population of Northeast Brazil; this result could also be because of the pattern of admixture of the Northeast population or even to the sampling size. However, the study population fit the Hardy–Weinberg Equilibrium (P<0.005).

The other mutation of BRCA2, c.156_157insAlu, a Portuguese founder mutation,9,10 was also not found in this population, although historical data demonstrated that most of the European immigrants who arrived in Brazil were Portuguese, and it was the Portuguese that settled the country (1500), with a great immigrant flux between 1850 and 1970 that was two times greater than that of the Spanish population. However, when considering the number of immigrants by state, in 1950, Bahia State registered 2509 Spanish immigrants and 1531 Portuguese immigrants. In 1970, the number of Spanish immigrants was also higher than that of the Portuguese, at 3225 and 1586, respectively.16,55

From the data presented here and the available historical data, it can be inferred that even though Salvador (the capital of the State of Bahia) was the first capital of Brazil (from 1549 to 1763), the presence of the Portuguese population in Bahia was smaller than that of the Spanish population, most likely due to the transference of the Portuguese court to Rio de Janeiro, the second capital of Brazil.16

In the estimated period of the first occurrence of BRCA2 c.156_157insAlu in Portugal, 120–130 generations ago (~20 years per generation),9 the Portuguese immigrants were well established in Brazil, but because of economic and political motives, their presence was greater in Southeast Brazil, mainly in Rio de Janeiro and São Paulo. In Rio de Janeiro, BRCA2 c.156_157insAlu was found in three unrelated patients with breast cancer.56

Therefore, due to the bottleneck effect result of the Portuguese population movements, (a) immigration from Portugal to Brazil (settlement, 1500), (b) emigration from the Northeast to the Southeast regions of Brazil (the establishment of Rio de Janeiro as the capital of Brazil in 1973) and (c) others fluxes did not allow the fixation of BRCA2 c.156_157insAlu in the population of Bahia.

The most frequent CHEK2 germline mutations worldwide, c.1100delC, c.444+1G>A and p.I157T,5,57 were also not found in any of the analyzed patients. These data are interesting because mutations of low-penetrance genes are more likely to be found than are those of high-penetrance genes.

Nevertheless, it has to be considered that the presence of these mutations could also be related to the pattern of the admixture of the population. Other studies reported results similar to those presented here,22,52 and in Brazil, only one study found c.1100delC, but the affected individual had a family history matching the clinical criteria for hereditary breast and colorectal cancer.58

In this study, the profile of a population that until now uncharacterized was analyzed; most of the subjects were from the largest state of the Northeast region (83.02%), but some were from other regions of Brazil (15.09%). In fact, it is important to consider that every population at some level is heterogeneous, some more than others, and that these variations can interfere with the level of genetic susceptibility.

Although the ancestry informative markers panel was relatively small, it could be observed that the study population was highly heterogeneous, presenting high levels of European ancestry contribution (62.20%), as well as significant African (31.20%) and Amerindian (6.60%) ancestry contributions. This high European ancestry contribution could be owing to fact that most of the subjects self-reported as white or mulatto (75.48%) and/or were from the countryside regions (52.83%), where there are more people with Caucasian traits than in Salvador, the capital of Bahia.17,18,59

However, this degree of heterogeneity was confirmed in the population stratification analysis that was also statistically significant (r=0.604). In the bar plot, the degree of the genetic admixture of the study population was higher than that of the other population groups analyzed. In the triangle plot, the distribution of the study population was between the European and African populations. However, most Brazilian subjects clustered with the European populations, in which the degree of heterozygosity between individuals was higher (0.319) compared with the clusters of the African (0.262) and Amerindian (0.256) populations.

In Figure 1 (triangle plot), some individuals from the Amerindian population were not near its cluster (cluster 2) or seemed closer to the European cluster (cluster 3); this effect could be related to the size of the panel. Nonetheless, we must consider that there is genetic variation among and within populations; with this panel, a higher degree of heterogeneity of the study population was demonstrated.

In conclusion, in this highly admixed population, three founder mutations, BRCA1 3450del4 and p.R71G and TP53 p.R337H, were found and can be added to the mutation panels that are used for high-risk HBOC patients and for early-onset breast cancer patients from this region. However, more importantly, these findings indicate that the combined screening of BRCA1/2 and TP53 is important and should be offered to high-risk patients. In addition, it is notable that this population seems to have a high Spanish ancestry contribution, considering that BRCA1 p.R71G was found in a total of five unrelated patients. However, further genetic ancestry analysis could demonstrate whether the haplotype of these Brazilian patients is similar to that of the Galician patients.

Funding

This study was supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (Process #408858/2006-0) and Secretaria de Saúde do Estado da Bahia (Contract #028/2011).