Integration of functional assay data results provides strong evidence for classification of hundreds of BRCA1 variants of uncertain significance

Lyra, Paulo C. M.; Nepomuceno, Thales C.; de Souza, Marcele L. M.; Machado, Géssica F.; Veloso, Mariana F.; Henriques, Taciane B.; dos Santos, Diandra Z.; Ribeiro, Iuly G.; Ribeiro, Roberto S.; Rangel, Leticia B. A.; Richardson, Marcy; Iversen, Edwin S.; Goldgar, David; Couch, Fergus J.; Carvalho, Marcelo A.; Monteiro, Alvaro N. A.

doi:10.1038/s41436-020-00991-0

Download PDF

Article
Open access
Published: 22 October 2020

Integration of functional assay data results provides strong evidence for classification of hundreds of BRCA1 variants of uncertain significance

Paulo C. M. Lyra Jr PhD¹,
Thales C. Nepomuceno PhD^2,3,4,
Marcele L. M. de Souza MSc¹,
Géssica F. Machado BS¹,
Mariana F. Veloso BS¹,
Taciane B. Henriques PhD¹,
Diandra Z. dos Santos MSc¹,
Iuly G. Ribeiro BS¹,
Roberto S. Ribeiro Jr BS¹,
Leticia B. A. Rangel PhD¹,
Marcy Richardson PhD⁵,
Edwin S. Iversen PhD⁶,
David Goldgar PhD⁷,
Fergus J. Couch PhD⁸,
Marcelo A. Carvalho PhD^2,9 &
…
Alvaro N. A. Monteiro PhD ORCID: orcid.org/0000-0002-8448-4801⁴

Genetics in Medicine volume 23, pages 306–315 (2021)Cite this article

4806 Accesses
17 Citations
13 Altmetric
Metrics details

Abstract

Purpose

BRCA1 pathogenic variant heterozygotes are at a substantially increased risk for breast and ovarian cancer. The widespread uptake of testing has led to a significant increase in the detection of missense variants in BRCA1, the vast majority of which are variants of uncertain clinical significance (VUS), posing a challenge to genetic counseling. Here, we harness a wealth of functional data for thousands of variants to aid in variant classification.

Methods

We have collected, curated, and harmonized functional data for 2701 missense variants representing 24.5% of possible missense variants in BRCA1. Results were harmonized across studies by converting data into binary categorical variables (functional impact versus no functional impact). Using a panel of reference variants we identified a subset of assays with high sensitivity and specificity (≥80%) and apply the American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) variant interpretation guidelines to assign evidence criteria for classification.

Results

Integration of data from validated assays provided ACMG/AMP evidence criteria in favor of pathogenicity for 297 variants or against pathogenicity for 2058 representing 96.2% of current VUS functionally assessed. We also explore discordant results and identify limitations in the approach.

Conclusion

High quality functional data are available for BRCA1 missense variants and provide evidence for classification of 2355 VUS according to their pathogenicity.

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

INTRODUCTION

Approximately 6% of unselected breast cancers and 15% of ovarian cancers can be attributed to germline pathogenic variants in BRCA1 and BRCA2.^1,2 BRCA1 pathogenic variant heterozygotes have a cumulative (to age 80 years) breast cancer risk of 72% (95% confidence interval [CI] 65–79%) and ovarian cancer risk of 44% (95% CI 36–53%).³ This represents roughly a 6-fold and 35-fold increase compared with the general population for breast and ovarian cancer, respectively.

The explosion in clinical germline genetic panel testing has led to an increase in findings of variants of uncertain clinical significance (VUS).⁴ VUS heterozygotes remain in the dark as to whether they are at increased risk or not. While for those with a strong family history of cancer risk assessment can be based on clinical factors, those without a family history, representing approximately half of all heterozygotes, have no such alternative. Thus, the inability to determine pathogenicity associated with a VUS poses a significant barrier to counseling and clinical management of VUS heterozygotes.^5,6,7 Currently, the Evidence-based Network for the Interpretation of Germline Mutant Alleles (ENIGMA) consortium utilizes three distinct classification frameworks for the purposes of clinical management of patients with variants in BRCA1.

A rule-based framework classifies variants based on the predicted effect inferred by changes in genetic code. Variants whose effects on the protein can be unambiguously inferred to cause premature protein termination (e.g., frameshift and nonsense variants) or the production of a noncanonical protein lacking functional domains (e.g., disruption of splicing donor or acceptor sites) are deemed pathogenic. Variants whose effects cannot be inferred from the genetic code cannot be classified by the rule-based framework. This is the case for missense variants, small in-frame insertions and deletions, and intronic variants outside the canonical splice sites.

To classify these problematic variants, a statistical multifactorial model framework was developed to take into account tumor pathology, family history, cosegregation, and co-occurrence data.^8,9 However, very rare alleles remain as VUS as these data may not be sufficient to achieve classification. For these rare missense variants, functional data has emerged as a powerful way to determine whether a variant leads to loss of function.¹⁰ However, functional data are not yet integrated in multifactorial statistical models.

Finally, the framework recommended by the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP), establishes categorical criteria for the strength of evidence and weighting of data sources, which allows for the introduction of functional data and provides an opportunity to assess a large number of VUS.¹¹

All three frameworks can be used in a complementary fashion and are based on the following common assumptions: (1) that loss-of-function variants constitute risk-associated BRCA1 variants and (2) that pathogenic missense variants confer the same risk increase as pathogenic truncating variants. Commercial and academic genetic testing laboratories may apply a variation or a combination of these frameworks to generate a five-tier system of pathogenic, likely pathogenic, variant of uncertain significance (VUS), likely benign, or benign.¹²

We started systematically collecting and curating BRCA1 functional data in 2014 and have made those data available through a publicly accessible web tool, the BRCA1 Circos (https://research.nhgri.nih.gov/bic/circos/).¹³ Here we provide an updated and comprehensive resource of functional data for missense variants in BRCA1. Further, we follow recent recommendations for the use of evidence from functional assays to rigorously assess their validity, harmonize data across studies, and integrate data to generate evidence for hundreds of variants.^10,14

MATERIALS AND METHODS

Literature curation, data extraction, and definitions

We annotated all published functional data for BRCA1 (OMIM 113705) missense variants (as of August 2019). Missense variants are defined as nucleotide changes that lead to the substitution of an amino acid residue using GenBank accession U14680 and LRG_292 as reference sequences.

For the purposes of this study, specific instances of an assay performed in a study were considered “tracks”. For example, a publication that reports cell viability, protein expression, and subcellular localization assays is represented as three separate tracks. The same underlying assay reported in different publications will be represented as a separate track. In one case, track 102, this single track collapses data from several independent publications with multiple biological and technical replicates runs using a joint analysis that takes into account batch effects.^15,16

Universe of variants

The published biological and biochemical assays included in this study focus on the effects of missense variants on protein function. There are 16,776 possible single-nucleotide substitutions in the BRCA1 coding region. We excluded nonsense (n = 803), stop codon reversions (n = 9), and synonymous (n = 3492) changes, resulting in 12,472 variants. As multiple nucleotide changes can generate the same amino acid substitution, we considered our starting “missense universe” all unique missense variants resulting from single-base substitutions (n = 11,009) in the reference BRCA1 (RefSeq NM_007294.3). For example, although single-nucleotide substitutions at codon 1775 in BRCA1 (ATG; Met) can generate nine mutant codons (CTG, GTG, TTG, ACG, AGG, AAG, ATC, ATA, and ATT), only five unique mutant amino acid residues result from these changes (Leu, Val, Thr, Arg, Lys, and Ile).

Next, we defined a set of “documented missense variants” (n = 2484) recorded in BRCA Exchange (https://brcaexchange.org/) as of August 2019, which represents all reported variants known to have been found in at least one individual. These variants include those observed in the context of clinical testing (e.g., ClinVar) or observed in exome and genome sequencing of large cohorts (e.g., gnomAD; https://gnomad.broadinstitute.org/). Documented variants are denoted as “1” if observed and “0” if not (track 9; Supplementary Table 1).

Harmonization

We retained each author’s determination of whether a given variant had a significant impact on the function being tested. There is a wide variety of methods and cutoff values used across studies and details for how each assay’s data were transformed into a binary categorical result (impact versus no impact) can be found in Supplementary Table 2.

To harmonize scores we assigned “0” to a variant that did not lead to a significant impact on the function being tested (functionally normal) and “1” to those that did (functionally abnormal). Scores “0” and “1” are presumably associated with (likely) nonpathogenic and (likely) pathogenic variants, respectively. Variants for which the results were inconclusive or intermediate did not receive a score. Some variants affect both splicing and amino acid composition and, in these cases, the determination of functional impact refers exclusively to its effect on protein function.

For track 102 we previously developed a “functional class” (fClass) scoring scheme¹⁷ using the posterior probability calculation of a variant being pathogenic in the transcriptional activation assays (PrDel) output by VarCall to generate functional classifications (fClass): PrDel ≤0.001 as fClass 1 (nonpathogenic), 0.001 < PrDel ≤ 0.05 as fClass 2 (likely not pathogenic), 0.05 > PrDel ≤ 0.95 as fClass 3 (uncertain), 0.95 < PrDel ≤ 0.99 as fClass 4 (likely pathogenic), and PrDel >0.99 as fClass 5 (pathogenic). We collapsed fClasses 1 with 2 and 4 with 5 and transformed them into binary categories: we assigned “0” to a variant that was a benign (fClass 1) or likely benign (fClass 2); and “1” to a variant that was pathogenic (fClass 5) or likely pathogenic (fClass 4).

Reference panel of variants

To assess the accuracy of each track we used a highly stringent reference panel combining data from the ENIGMA consortium¹⁸ and ClinVar. The ENIGMA reference panel is composed of 298 missense variants assigned to International Agency for Research on Cancer (IARC) classes by the multifactorial model.^8,9 While the ENIGMA panel has the advantage of being a set of variants systematically classified using a rigorous multifactorial statistical model,^8,9 it has a limited number of reference variants.

Thus, we assessed accuracy using an expanded panel which combines the ENIGMA panel with additional variants from ClinVar. The ClinVar data set was obtained by downloading data for all BRCA1 germline variants (8197 entries) on 24 May 2020. Missense-only entries with reference sequence NM_00794.3 and NM_00794.4 and review status of 1-star or better were retained. Variants reported as “conflicting interpretations of pathogenicity” and “uncertain significance” were removed, resulting in a data set composed of 295 variants. The ENIGMA reference panel was then merged with the ClinVar data set. Because the tracks assessed exclusively examined protein function and were not expected to identify variants with splicing effect, we excluded six missense variants with known effects on splicing for the calculation of specificity and sensitivity.^{11,18,19,20,21,22,23} We also removed the data on the reference variant C1787S because its classification of pathogenic was achieved in the context of a C1787S/G1788D haplotype.⁹ Track 102 is the only track that tested this haplotype context.¹⁵ To assess sensitivity and specificity we used the nonredundant ENIGMA + ClinVar [E + C] reference panel composed of 389 reference variants (Supplementary Table 3).

Sensitivity and specificity calculations

To assess the accuracy of different assays we calculated sensitivity and specificity for all tracks that had tested more than ten variants (reference and VUS) in parallel and included at least four pathogenic and four benign missense reference controls.

Reference variants are classified by the multifactorial model in a five-tier scale according to IARC recommendations:²⁴ variants with PrDel (probability of being pathogenic) ≤0.001 are assigned class 1 (nonpathogenic; benign); 0.001 < PrDel ≤ 0.05 are assigned class 2 (likely not pathogenic; likely benign); 0.05 > PrDel ≤ 0.95 are assigned class 3 (uncertain); 0.95 < PrDel ≤ 0.99 are assigned class 4 (likely pathogenic); and PrDel > 0.99 are assigned class 5 (pathogenic). We collapsed classes 1 with 2 and 4 with 5 and transformed them into binary categories: we assigned “0” to a variant that was a benign (IARC class 1) or likely benign (IARC class 2); and “1” to a variant that was pathogenic (IARC class 5) or likely pathogenic (IARC class 4). The multifactorial model does not include functional data, thus avoiding circularity.

Odds of pathogenicity calculations

To determine the strength of evidence associated with each track we estimated the odds of pathogenicity (OddsPath) for a theoretical assay that previously evaluated classified controls following the recommendations from the Clinical Genome (ClinGen) Resource Sequence Variant Interpretation Working Group.¹⁴

Because we excluded variants with intermediate results during data harmonization, we estimated an OddsPath that could be achieved by a perfect binary classifier.¹⁴ For each track, the OddsPath was calculated according to the formula: \(OddsPath = \frac{{\left[ {P2 \, \times\left( {1 - P1} \right)} \right]}}{{\left[ {\left( {1 - P2} \right) \,\times P1} \right]}}\) where P1 represents the pathogenic variants in the overall modeled data as a prior probability, and P2 the proportion of pathogenic variants with functionally abnormal or normal readouts as posterior probability.¹⁴ Results from these calculations (OddsPath for functionally abnormal variants, and OddsPath for functionally normal variants) were used to obtain a corresponding level of evidence strength (BS3 supporting, BS3 moderate, BS3, indeterminate, PS3 supporting, PS3 moderate, PS3) according to the Bayesian adaptation of the ACMG/AMP variant interpretation guidelines (Supplementary Table 4).²⁵ In this framework, each assay track receives PS and BS criteria that are applied to every variant scoring as loss of function (PS criterion) or with no functional impact (BS criterion), respectively. For example, all variants scoring as loss of function in a track with OddsPath for functionally abnormal variants >18.7 will receive evidence criterion PS3.

Data and code availability

All data sets used are available as supplementary materials associated with this article (Supplementary Tables 1–3). These data sets and codes for queries to generate summary statistics and variant callings in Supplementary Tables 5–11) are available through GitHub (github.com/FunctionalAssaYIntegration/FYI_BRCA1). Updated data sets are also available for download and can be visualized at FYI-HBOC (http://iscva.moffitt.org/fyi-hboc/build/).

RESULTS

The landscape of BRCA1 functional assays

The Functional AssaY Integration for BRCA1 (FYI-BRCA1) data set contains 140 tracks, including 131 tracks representing individual instances of functional assays from 37 publications (Fig. 1) (Supplementary Table 2). There was functional information for 2701 missense variants, of which 2465 are currently VUS (22% of all possible single-nucleotide missense changes and 40% of all reported missense variants) (Supplementary Table 5) according to the ENIGMA + ClinVar reference panel. Approximately 47% (62/131) of all tracks have tested ten or more variants in parallel and ~33.5% (44/131) have also tested at least four pathogenic and four benign controls (Supplementary Table 6). Controls are from a reference panel composed of 389 known missense variants assigned to IARC classes by the multifactorial model^8,9 or classified by ClinVar (Supplementary Table 3). Taken together, these data indicate that there is a wealth of untapped functional data to aid in classification of a significant fraction of BRCA1 missense VUS.

**Fig. 1: Overview of functional track assessment.**

Assessing the accuracy of functional assays

The ACMG/AMP rules state that validated functional assays can be used as a source of evidence to classify a VUS and specific recommendations have recently been published.^11,14 We followed the recommendations to define the disease mechanism and evaluate the applicability of general classes of assay (Supplementary Text) (Supplementary Fig. 1a). Most functional assays developed to date fall into the following 12 applicable classes: binding; focus formation; protein expression, stability, and folding; transcription activation; sensitivity; recombination; localization; proliferation; chromosome/mitotic apparatus; cell viability; catalytic activity; and cell cycle checkpoint (Supplementary Table 7).

The assay class “protein expression, stability, and folding” had one of the lowest fractions of tracks that met the criteria to be evaluated (i.e., that had tested at least ten variants and four benign and four pathogenic controls) and the lowest fraction of validated tracks. In contrast, 75% of recombination class tracks that met criteria were validated (Supplementary Table 7). We also evaluated tracks grouped by host (cell type or in vitro system used) of the assay. Although S. cerevisiae and E. coli showed <50% tracks validated, the data suggest that tracks using a range of different host categories can generate useful information (Supplementary Table 7).

Next, we evaluated the validity of individual tracks. After data harmonization, we used a reference panel of 389 variants (Supplementary Table 3) to calculate specificity and sensitivity for all tracks that had tested at least ten variants and four benign and four pathogenic controls (n = 44). Seven tracks achieved 100% sensitivity and 100% specificity. Twenty-two tracks with sensitivity and specificity ≥80% were considered appropriately validated for the purposes of variant interpretation (Fig. 2a) (Supplementary Table 8).

Applying the evidence to individual variant interpretation

We consider the 22 tracks for use in variant interpretation specified above (belong to an applicable assay class, inclusion of basic and variant controls, broadly accepted historically, validated) to be well-established and we refer to them as the “Hi Set”.^{15,26,27,28,29,30,31,32,33,34,35,36,37,38}

We determined the evidence strength (i.e., BS3 supporting, BS3 moderate, BS3, indeterminate, PS3 supporting, PS3 moderate, and PS3) according the Clinical Genome Resource (ClinGen) Sequence Variant Interpretation (SVI) Working Group recommended equivalence¹⁴ (Supplementary Table 8). Importantly, requiring ≥80% specificity and sensitivity eliminated all assays with evidence strength equivalent to “indeterminate” (Fig. 2b).

From a set of 2449 VUS tested by assays in the Hi Set we identified variants that had been tested once and assigned BS3_supporting, BS3_moderate, or BS3 to 1481 VUS and PS3_moderate or PS3 to 188 variants (Fig. 3) (Supplementary Table 9). Next we identified variants that had been tested more than once and all results were concordant. We assigned as final evidence the strongest assignment for each variant. For example, a variant tested three times and receiving BS3_supporting, BS3_moderate, and BS3_moderate would be assigned BS3_moderate as the final assignment (Supplementary Table 9). Finally, for variants tested across multiple assays, 117 variants had discordant results ranging from variants with 9:1 (benign:pathogenic) to variants with 1:13 (Supplementary Table 10). Based on the distribution of ratios for discordant results (Supplementary Fig. 1b), we propose that ratios of 3:1 or greater and 1:3 or smaller (1.5 ≤ log2 [Ratio Benign/Path] ≤ −1.5) constitute preponderance of evidence. Ratios of 2:1 and 1:2 were considered too weak while ratios of 4:1 and 1:4 would significantly reduce the numbers of discordant results resolved. In these 23 cases, we assigned the strongest evidence criteria among the tracks providing the results. In total we assigned evidence criteria to 2355 VUS (Fig. 3).

**Fig. 3: Overview of individual variant assessment using Hi Set.**

Estimating sensitivity and specificity of the Hi Set integrated approach

Reference variants (which were defined as reference pathogenic and benign variants without considering functional data) tested in the Hi Set were assigned evidence criteria using the same rules applied for the VUS, allowing us to estimate the sensitivity and specificity of the integrated framework based on the 22 Hi Set functional tracks (Fig. 3). Expectedly, since these reference variants were used to identify the tracks that comprise the Hi Set, the integrated approach had a low error rate (3.5%) and high sensitivity (0.92; 95% CI 0.84–0.96) and specificity (1.00; 95% CI 0.96–1). Most reference missense variants misclassified by the functional tracks also affect splicing, and thus prior knowledge of variants that might affect splicing improves sensitivity and specificity (Fig. 3).

VUS and reference variants with discordant results

There were 154 variants (117 VUS) with discordant results (Supplementary Table 10). We consider results discordant when a variant tested by multiple tracks scores both as loss of function and no functional impact. We looked into this set of variants to identify limitations of our approach.

First, we examined all 37 reference variants that initially scored as discordant (Supplementary Table 11). Variant p.C1787S (IARC class 5) scored four times of benign and one time as pathogenic. This variant was classified as pathogenic (IARC class 5) but only in the context of a haplotype with G1788D.⁸ Track 102 has tested C1787S, G1788D, and the double C1787S/G1788D. Both variants in conjunction contribute to loss of function, while each in isolation does not significantly impact the function.¹⁷ All other tracks have only tested one or the other variant but not the haplotype. Variant p.R71G (IARC class 5), which also affects splicing, scored as benign three times and once as pathogenic (from an assay that identifies defects in RNA abundance). These cases provide cautionary notes as variants with a preponderance of evidence in either direction may, in some cases, be assigned incorrectly, considering the reference panel as the “correct” assignment.

Five variants have conflict between two related tracks of phosphopeptide binding activity (track 27) and phosphopeptide binding specificity (track 28). It was observed that several variants, while not affecting binding of phosphopeptide, would bind phosphorylated and unphosphorylated peptides indiscriminately, a loss of function associated with cancer risk.²⁷ A similar case is encountered for conflicts in which a variant does not affect protein expression but the protein expressed has compromised function.

Some substitutions are more prone to generate discordant results

To determine whether there were classes of amino acid substitution (e.g., Arg → Glu) more prone to generate discordant results, we determined the frequency of 39 classes of substitutions represented in the set of discordant variants, and calculated their fold enrichment in the discordant set when compared with the tested set (Supplementary Fig. 2a). A small number of classes of substitution, such as Ile → Lys and Arg → Trp, were enriched in the discordant variant set, featuring prominently changes to/from hydrophobic and positively charged or polar amino acid residues (Supplementary Fig. 2b), suggesting that these substitutions are more affected by differences in experimental conditions or may lead to intermediate levels of activity.

Alternative approach

The process of assigning evidence criteria used in Fig. 3 is a relatively stringent approach. The harmonized data set, however, can filter tracks using different characteristics, and different assignment rules can be developed and compared. To illustrate this we have assigned evidence criteria using an alternative approach, although the relative contribution of the tracks with large number of variants does not change significantly (Supplementary Fig. 1c).

In the alternative approach, instead of preselecting tracks according to a specific specificity and sensitivity threshold, we used all 131 functional tracks. From a set of 2465 VUS tested we identified variants that had been tested once by any track and assigned BS3_moderate or BS3 to 1448 VUS and PS3 to 179 variants (Supplementary Fig. 3). Next we identified variants that had been tested more than once and all results were concordant. We assigned as final evidence the strongest assignment for each variant (Supplementary Table 11).

Finally, for VUS tested multiple times, 180 variants had discordant results ranging from 17:1 (benign:pathogenic) to 1:19. We used the same cutoff (1.5 ≤ log2 [Ratio Benign/Path] ≤ −1.5) to define preponderance of evidence. For those VUS we assigned BS3 and PS3 criteria. The remaining VUS were assigned by simple majority voting but, to reflect the discordance, were only assigned BS3_supporting and PS3_supporting criteria. Forty-two variants scored as benign in as many assays as they scored pathogenic) and remain unassigned until further testing. In total we assigned evidence criteria to 2421 VUS (Supplementary Fig. 3).

The alternative approach (majority voting) led to a small additional decrease in unassigned variants, mostly assigned to benign evidence criteria (BS3_supporting, BS3_moderate, and BS3) (Fig. 4). However, approximately 10% of variants achieving PS3 using the Hi Set were downgraded to PS3_moderate or PS3_supporting in the majority voting approach (Fig. 4).

**Fig. 4: Comparison of evidence criteria assignment for Hi Set and majority voting approaches.**

Protein modular domains and structural motifs

Most variants receiving PS3 criteria were part of a functional domain of BRCA1, either the RING finger or the BRCT domains (Fig. 5). Within these domains, some structures seem to be more sensitive to variation and therefore changes are likely to lead to loss of function. For example, some secondary structures, such as β2 in the first BRCT repeat have not yet recorded a variant with impact on function (Fig. 5b). While this could be due to the limited number of variants tested in these regions, the fact that β’2, the corresponding β sheet in the second BRCT domain, is also tolerant to changes suggest that these β sheets are not critical to function.

**Fig. 5: Location of pathogenic variants.**

DISCUSSION

BRCA1 pathogenic variant heterozygotes are at substantially increased risk for breast and ovarian cancer. Pathogenic variant heterozygotes affected with cancer can also benefit from the use of poly (ADP-ribose) polymerase (PARP) inhibitors for treatment. Therefore, accurate determination of a variant’s pathogenicity is critical to improve risk stratification and treatment outcomes in breast and ovarian cancer treatment. However, due to lack of information hundreds of BRCA1 variants remain as VUS, constituting a significant unmet clinical need.

Missense variants constitute the largest class of unclassified variants in BRCA1 and functional assays have emerged as an important source of evidence to aid in classification.¹⁰ Remarkably, a large number of experiments probing into the mechanisms of cancer predisposition due to defective BRCA1 have been conducted since its cloning, many using missense variants.

Here we systematically reviewed the biological basis of all functional assays testing missense variants in BRCA1 published in the last 23 years and conclude that all assay classes are applicable as they directly or indirectly measure a demonstrated function of BRCA1, although not all molecular functions have clearly established connections to the cancer phenotype. Of note, none of the tracks in this study is the result of a CLIA or European Communities Confederation of Laboratory Medicine (EC4) laboratory-developed test and this is a limitation of our approach. However, as these assays are unlikely to ever be conducted commercially, the published data are likely to remain the sole source of functional information for variants.

We reasoned that this wealth of experimental data could be used to mitigate the challenges of VUS for hereditary breast and ovarian cancer. We then harmonized the results for 2701 missense variants in BRCA1. Another limitation of our study was the harmonization of qualitative, semiquantitative, or quantitative data as binary categorical data and the fact that cutoffs to distinguish a normal from abnormal function are variable. Presumably, this approximation results in a loss of information. However, a general treatment to harmonize quantitative data across studies has several obstacles, including the lack of access of raw data from individual studies and the need to generate quantitative models capable of integrating all data sets. We believe this will be possible in the near future as there are several quantitative models for individual assays that could be adapted to integration.^16,33,36

We illustrate the utility of this data set by assigning ACMG/AMP evidence criteria using two scenarios. In the first scenario (Hi Set), only 22 tracks that (1) tested more than ten variants, (2) tested at least four benign and four pathogenic controls, and (3) achieved a specificity and sensitivity ≥80% using the [ENIGMA + ClinVar] reference, were considered. Using this approach, we assigned evidence criteria to 2355 VUS, which corresponded to 96.2% of all tested VUS.

The second scenario (majority voting) represents a less stringent one in which data from all 131 tracks were considered. Although the two data sets were not significantly different (the largest contributors to both scenarios are tracks with large number of variants tested, e.g., tracks 102, 131, 133, and 134) there was a small increase in the number of VUS assignments (2421) of variants tested, mostly due to increases in benign evidence criteria (BS3_supporting, BS3_moderate, BS3). We recommend the Hi Set approach for systematic assignment of ACMG/AMP evidence criteria to functional data. The availability of the data set will allow investigators to model the data using a variety of reference panels and criteria for choice of assay.

For variants with multiple tests, we also explored discordant results. Discordant results can be due to random variation, clerical errors (e.g., typographical errors in labels), experimental errors in one or multiple assays (e.g., sample swapping, incorrect pipetting), variants with intermediate effects that may be detected by the most sensitive but not all assays and variant impacts on risk independently of the function being tested. Our evidence criteria assignment can be improved by examining and adjudicating individual cases of discordant results.

Large data sets can also reveal more granular information about the role of specific protein segments and amino acid residues. For example, our analysis has shown that β2 and β’2 are relatively tolerant to changes, perhaps due to the fact that there is little contribution of these β sheets in the interrepeat BRCT interface.³⁷

In summary, we use a large body of experimental evidence to assign evidence criteria to an overwhelming majority of missense VUS in BRCA1 in a large scale application of ACMG/AMP evidence criteria. It is important to stress that according to recommendations of the SVI Working Group the functional evidence criteria are currently not meant to be standalone evidence for either a benign or pathogenic classification. At least one other evidence type would be required to reach a final classification. Future developments should take into consideration the impact of a variant on different phenotypes and devise ways to consider variants with intermediate effects.

References

Alsop K, Fereday S, Meldrum C, et al. BRCA mutation frequency and patterns of treatment response in BRCA mutation-positive women with ovarian cancer: a report from the Australian Ovarian Cancer Study Group. J Clin Oncol. 2012;30:2654–2663.
Article CAS Google Scholar
Fackenthal JD, Olopade OI. Breast cancer risk associated with BRCA1 and BRCA2 in diverse populations. Nat Rev Cancer. 2007;7:937–948.
Article CAS Google Scholar
Kuchenbaecker KB, Hopper JL, Barnes DR, et al. Risks of breast, ovarian, and contralateral breast cancer for BRCA1 and BRCA2 mutation carriers. JAMA. 2017;317:2402–2416.
Article CAS Google Scholar
Couch FJ, Shimelis H, Hu C, et al. Associations between cancer predisposition testing panel genes and breast cancer. JAMA Oncol. 2017;3:1190–1196.
Article Google Scholar
Szabo CI, Worley T, Monteiro AN. Understanding germ-line mutations in BRCA1. Cancer Biol Ther. 2004;3:515–520.
Article CAS Google Scholar
Monteiro AN, Couch FJ. Cancer risk assessment at the atomic level. Cancer Res. 2006;66:1897–1899.
Article CAS Google Scholar
Toland AE, Andreassen PR. DNA repair-related functional assays for the classification of BRCA1 and BRCA2 variants: a critical review and needs assessment. J Med Genet. 2017;54:721–731.
Article CAS Google Scholar
Goldgar DE, Easton DF, Deffenbaugh AM, Monteiro AN, Tavtigian SV, Couch FJ. Integrated evaluation of DNA sequence variants of unknown clinical significance: application to BRCA1 and BRCA2. Am J Hum Genet. 2004;75:535–544.
Article CAS Google Scholar
Easton DF, Deffenbaugh AM, Pruss D, et al. A systematic genetic assessment of 1,433 sequence variants of unknown clinical significance in the BRCA1 and BRCA2 breast cancer-predisposition genes. Am J Hum Genet. 2007;81:873–883.
Article CAS Google Scholar
Monteiro AN, Bouwman P, Kousholt AN, et al. Variants of uncertain clinical significance in hereditary breast and ovarian cancer genes: best practices in functional analysis for clinical annotation. J Med Genet. 2020;57:509–518.
Article CAS Google Scholar
Richards S, Aziz N, Bale S, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–424.
Article Google Scholar
Golubeva VA, Nepomuceno TC, Monteiro ANA. Germline missense variants in BRCA1: new trends and challenges for clinical annotation. Cancers (Basel). 2019;11:E522
Article Google Scholar
Jhuraney A, Velkova A, Johnson RC, et al. BRCA1 Circos: a visualisation resource for functional analysis of missense variants. J Med Genet. 2015;52:224–230.
Article CAS Google Scholar
Brnich SE, Abou Tayoun AN, Couch FJ, et al. Recommendations for application of the functional evidence PS3/BS3 criterion using the ACMG/AMP sequence variant interpretation framework. Genome Med. 2019;12:3.
Article Google Scholar
Fernandes VC, Golubeva VA, Di Pietro G, et al. Impact of amino acid substitutions at secondary structures in the BRCT domains of the tumor suppressor BRCA1: Implications for clinical annotation. J Biol Chem. 2019;294:5980–5992.
Article CAS Google Scholar
Iversen ES, Couch FJ, Goldgar DE, Tavtigian SV, Monteiro ANA. A computational method to classify variants of uncertain significance using functional assay data with application to BRCA1. Cancer Epidemiol Biomarkers Prev. 2011;20:1078–1088.
Article CAS Google Scholar
Woods NT, Baskin R, Golubeva V, et al. Functional assays provide a robust tool for the clinical annotation of genetic variants of uncertain significance. NPJ Genom Med. 2016;1:16001.
Article CAS Google Scholar
Parsons MT, Tudini E, Li H, et al. Large scale multifactorial likelihood quantitative analysis of BRCA1 and BRCA2 variants: An ENIGMA resource to support clinical variant classification. Hum Mutat. 2019;40:1557–1578.
Article CAS Google Scholar
Thomassen M, Blanco A, Montagna M, et al. Characterization of BRCA1 and BRCA2 splicing variants: a collaborative report by ENIGMA consortium members. Breast Cancer Res Treat. 2012;132:1009–1023.
Article CAS Google Scholar
Dines JN, Shirts BH, Slavin TP, et al. Systematic misclassification of missense variants in BRCA1 and BRCA2 “coldspots”. Genet Med. 2020;22:825–830.
Article Google Scholar
Ahlborn LB, Dandanell M, Steffensen AY, Jonson L, Nielsen FC, Hansen TV. Splicing analysis of 14 BRCA1 missense variants classifies nine variants as pathogenic. Breast Cancer Res Treat. 2015;150:289–298.
Article CAS Google Scholar
Gaildrat P, Krieger S, Thery JC, et al. The BRCA1 c.5434C->G (p.Pro1812Ala) variant induces a deleterious exon 23 skipping by affecting exonic splicing regulatory elements. J Med Genet. 2010;47:398–403.
Article CAS Google Scholar
Ladopoulou A, Konstantopoulou I, Armaou S, et al. A change in the last base of BRCA1 exon 23, 5586G->A, results in abnormal RNA splicing. Cancer Genet Cytogenet. 2002;134:175–177.
Article CAS Google Scholar
Plon SE, Eccles DM, Easton D, et al. Sequence variant classification and reporting: recommendations for improving the interpretation of cancer susceptibility genetic test results. Hum Mutat. 2008;29:1282–1291.
Article CAS Google Scholar
Tavtigian SV, Greenblatt MS, Harrison SM, et al. Modeling the ACMG/AMP variant classification guidelines as a Bayesian classification framework. Genet Med. 2018;20:1054–1060.
Article Google Scholar
Anantha RW, Simhadri S, Foo TK. et al. Functional and mutational landscapes of BRCA1 for homology-directed repair and therapy resistance. Elife. 2017;6:e21350.
Article Google Scholar
Lee MS, Green R, Marsillac SM, et al. Comprehensive analysis of missense variations in the BRCT domain of BRCA1 by structural and functional assays. Cancer Res. 2010;70:4880–4890.
Article CAS Google Scholar
Bouwman P, van der Gulden H, van der Heijden I, et al. A high-throughput functional complementation assay for classification of BRCA1 missense variants. Cancer Discov. 2013;3:1142–1155.
Article CAS Google Scholar
Lu C, Xie M, Wendl MC, et al. Patterns and functional implications of rare germline variants across 12 cancer types. Nat Commun. 2015;6:10086.
Article CAS Google Scholar
Towler WI, Zhang J, Ransburgh DJ, et al. Analysis of BRCA1 variants in double-strand break repair by homologous recombination and single-strand annealing. Hum Mutat. 2013;34:439–445.
Article CAS Google Scholar
Coyne RS, McDonald HB, Edgemon K, Brody LC. Functional characterization of BRCA1 sequence variants using a yeast small colony phenotype assay. Cancer BiolTher. 2004;3:453–457.
CAS Google Scholar
Thouvenot P, Ben Yamin B, Fourriere L, et al. Functional assessment of genetic variants with outcomes adapted to clinical decision-making. PLoS Genet. 2016;12:e1006096.
Article Google Scholar
Findlay GM, Daza RM, Martin B, et al. Accurate classification of BRCA1 variants with saturation genome editing. Nature. 2018;562:217–222.
Article CAS Google Scholar
Starita LM, Young DL, Islam M, et al. Massively parallel functional analysis of BRCA1 RING domain variants. Genetics. 2015;200:413–422.
Article CAS Google Scholar
Petitalot A, Dardillac E, Jacquet E, et al. Combining homologous recombination and phosphopeptide-binding data to predict the impact of BRCA1 BRCT variants on cancer risk. Mol Cancer Res. 2019;17:54–69.
Article CAS Google Scholar
Starita LM, Islam MM, Banerjee T, et al. A multiplex homology-directed DNA repair assay reveals the impact of more than 1,000 BRCA1 missense substitution variants on protein function. Am J Hum Genet. 2018;103:498–508.
Article CAS Google Scholar
Williams RS, Green R, Glover JN. Crystal structure of the BRCT repeat region from the breast cancer- associated protein BRCA1. Nat Struct Biol. 2001;8:838–842.
Article CAS Google Scholar
Langerud J, Jarhelle E, Van Ghelue M, Ariansen SL, Iversen N. Trans-activation-based risk assessment of BRCA1 BRCT variants with unknown clinical significance. Hum Genomics. 2018;12:51.
Article CAS Google Scholar

Download references

Acknowledgements

This work was funded by National Institutes of Health/National Cancer Institute (NIH/NCI) award CA116167; the H. Lee Moffitt Cancer Center & Research Institute, an NCI-designated Comprehensive Cancer Center (P30-CA076292); Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Finance Code 001; CAPES; Brazil); and Fundação de Amparo à Pesquisa do Estado do Espírito Santo, Brazil (FAPES) T.C.N. is a Fulbright Scholar.

Author information

Authors and Affiliations

Biotechnology/RENORBIO Program, Federal University of Espírito Santo, Vitória, ES, Brazil
Paulo C. M. Lyra Jr PhD, Marcele L. M. de Souza MSc, Géssica F. Machado BS, Mariana F. Veloso BS, Taciane B. Henriques PhD, Diandra Z. dos Santos MSc, Iuly G. Ribeiro BS, Roberto S. Ribeiro Jr BS & Leticia B. A. Rangel PhD
Instituto Nacional de Câncer, Programa de Pesquisa Clínica, Rio de Janeiro, Brazil
Thales C. Nepomuceno PhD & Marcelo A. Carvalho PhD
Divisão de Pesquisa Clínica, Instituto Nacional de Câncer, Rio de Janeiro, Brazil
Thales C. Nepomuceno PhD
Cancer Epidemiology Program, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
Thales C. Nepomuceno PhD & Alvaro N. A. Monteiro PhD
Ambry Genetics, Aliso Viejo, CA, USA
Marcy Richardson PhD
Department of Statistical Science, Duke University, Durham, NC, USA
Edwin S. Iversen PhD
Department of Dermatology, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
David Goldgar PhD
Mayo Clinic, Rochester, MN, USA
Fergus J. Couch PhD
Instituto Federal do Rio de Janeiro–IFRJ, Rio de Janeiro, Brazil
Marcelo A. Carvalho PhD

Authors

Paulo C. M. Lyra Jr PhD
View author publications
You can also search for this author in PubMed Google Scholar
Thales C. Nepomuceno PhD
View author publications
You can also search for this author in PubMed Google Scholar
Marcele L. M. de Souza MSc
View author publications
You can also search for this author in PubMed Google Scholar
Géssica F. Machado BS
View author publications
You can also search for this author in PubMed Google Scholar
Mariana F. Veloso BS
View author publications
You can also search for this author in PubMed Google Scholar
Taciane B. Henriques PhD
View author publications
You can also search for this author in PubMed Google Scholar
Diandra Z. dos Santos MSc
View author publications
You can also search for this author in PubMed Google Scholar
Iuly G. Ribeiro BS
View author publications
You can also search for this author in PubMed Google Scholar
Roberto S. Ribeiro Jr BS
View author publications
You can also search for this author in PubMed Google Scholar
Leticia B. A. Rangel PhD
View author publications
You can also search for this author in PubMed Google Scholar
Marcy Richardson PhD
View author publications
You can also search for this author in PubMed Google Scholar
Edwin S. Iversen PhD
View author publications
You can also search for this author in PubMed Google Scholar
David Goldgar PhD
View author publications
You can also search for this author in PubMed Google Scholar
Fergus J. Couch PhD
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo A. Carvalho PhD
View author publications
You can also search for this author in PubMed Google Scholar
Alvaro N. A. Monteiro PhD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alvaro N. A. Monteiro PhD.

Ethics declarations

Disclosure

M.R. is a full-time, salaried employee of Ambry Genetics. The other authors declare no conflicts of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table 11

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. If you remix, transform, or build upon this article or a part thereof, you must distribute your contributions under the same license as the original. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/.

Reprints and permissions

About this article

Cite this article

Lyra, P.C.M., Nepomuceno, T.C., de Souza, M.L.M. et al. Integration of functional assay data results provides strong evidence for classification of hundreds of BRCA1 variants of uncertain significance. Genet Med 23, 306–315 (2021). https://doi.org/10.1038/s41436-020-00991-0

Download citation

Received: 09 April 2020
Revised: 21 September 2020
Accepted: 22 September 2020
Published: 22 October 2020
Issue Date: February 2021
DOI: https://doi.org/10.1038/s41436-020-00991-0

Keywords

This article is cited by

PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework
- Alexander J. M. Dingemans
- Max Hinne
- Bert B. A. de Vries
Nature Genetics (2023)
How does re-classification of variants of unknown significance (VUS) impact the management of patients at risk for hereditary breast cancer?
- Ava Kwong
- Cecilia Yuen Sze Ho
- Edmond Shiu Kwan Ma
BMC Medical Genomics (2022)
Assessment of small in-frame indels and C-terminal nonsense variants of BRCA1 using a validated functional assay
- Thales C. Nepomuceno
- Ana P. P. dos Santos
- Marcelo A. Carvalho
Scientific Reports (2022)