Introduction

Traditionally, genetics has used a phenotype-driven approach to identify cases with an underlying genetic pathology and likely causative variants [1]. An example of this is for developmental delay (DD), where trio analysis of affected individuals, typically children, is used to identify likely causative variants [2, 3]. Some variants are known to have variable penetrance, which can be caused by many different mechanism, leading to reduced or increased risk [4].

One example of a disorder which causes DD is Rett syndrome, which is an X-linked disorder caused by MECP2 variants, predominantly affecting girls [5]. Mosaicism can alter the phenotype of this disease though skewed inactivation of the X-chromosomes (XCI), which was first identified in twins with differing phenotypes [6]. In 2007, a woman was identified, through her affected son, to be asymptomatic for Rett syndrome with a C-terminal truncation variant in MECP2, which usually produces a classical phenotype. This was the first case of non-penetrance and was predicted to be linked to highly skewed XCI. It also suggested that screening mothers of Rett syndrome patients is indicated, as this could affect reproductive decisions [7].

Genetic testing for inherited disease is frequently offered to family members of affected individuals. For example, BRCA1 and BRCA2 testing is offered to individuals with a known history of familial breast and ovarian cancer (FBOC). Screening has the advantage of informing life-time risk, and whether preventive action is advisable [8]. However, individuals with a family history of FBOC, who are negative for BRCA1 and BRCA2 variants established to be associated with an increased risk of breast and ovarian cancer, may still have a higher cancer risk relative to the wider population, indicating further risk factors are involved [9, 10].

Calculating prevalence and penetrance of variants requires population sampling and is more accurate the larger sample populations and more homogeneous cohorts [11]. Technological developments have reduced the costs and time associated with genome-wide sequencing [12] and NGS has been essential to projects such as ExAC, 1000 Genomes, and the 100,000 Genomes Project [13,14,15]. ExAC, a whole exome sequencing (WES) cohort of more than 60,000 people, harboured ~54 variants reported as disease causing and 41 of these were too common in the population to be classified as pathogenic [15]. ExAC has also been used successfully to confirm pathogenicity of variants validating clinical decisions [16].

A qualitative study of expert opinion on reporting of secondary findings, highlighted the challenges facing reporting without having complete penetrance studies linking genotype-to-phenotype in unaffected populations. There is concern over the extent to which family-based penetrance impacts pathogenicity through unknown risk modifiers [17]. Genetic profiling is used to inform clinical and reproductive decisions, so individuals may feel they should take action, especially if they have seen the effect of the disease and wish to prevent disease risk [18]. If there is uncertainty around penetrance between populations, there is a risk of misinformation which could lead to unnecessary intervention. If the mechanisms behind risk modifiers were identified, this information would allow genomic counsellors and clinicians to more accurately estimate risk informing clinical action.

This systematic review addressed the question: What is the evidence base for variant penetrance estimates in individuals with no family history of the condition?

Methods

Study design

A systematic search was performed and reported following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidance [19, 20]. This process allowed for an explicit and reproducible strategy for selection of literature. The methodological search was performed through Ovid, a journal search engine, facilitating searches in Medline, EMBASE and Global Health. Ovid also allowed for search terms to be exploded, to incorporate all known similar terms within literature to be incorporated into one search. The search was performed in September 2019.

Search strategy

A broad search strategy being deemed to be most appropriate to give the best chance of capturing all available evidence pertaining to the research question. Terms used included “incidental findings,” “genetic testing,” “genetic counselling,” “penetrance” and “genetic disease.” The complete bank of terms, including how the terms were exploded and combined using Boolean operators is documented in Appendix 1 as a full electronic search strategy for one of the databases. Free searching was used in addition to add unknown terms when exploding, such as secondary findings (SF). Reference searching of final papers was performed to identify further relevant papers.

Inclusion and exclusion criteria

Included papers had to have been published in English between January 2009 and September 2019. Key inclusion criteria were that the papers had to be primary literature or a systematic review of primary literature (see Table 1). There also had to be some reference to the use of a population identified through means other than family history. This could for example be the use of a cohort database, such as 1000 genomes, or through a population screen. Exclusion criteria were clearly defined. All articles unrelated to genetics were excluded, for example relating to medical imaging. Genetic related review papers, articles without abstracts, or articles linked to ethical, legal and social implications were also excluded. This review excluded papers related to epigenetic or mitochondrial disease and papers aiming to inform clinical management of a disease or case studies pertaining to individuals affected by disease. Papers were excluded for having a familial focus when estimating penetrance or similar risk estimates.

Table 1 Inclusion and exclusion criteria for papers to be included within the current review.

Search outcomes

Figure 1 is a PRISMA flow-chart showing the stages involved in the search. This search identified 1592 papers and reduced to 1356 once duplicates were removed. At each stage of analysis, two independent researchers screened papers and met to discuss and reach consensus on which papers to progress to the next stage. Disagreement over paper inclusion was resolved through recruitment of a third researcher. These papers were initially title searched to remove all papers deemed to be clearly irrelevant to the research question. This left 175 articles had their abstracts screened. Of these, 25 met the criteria for a full-text search. Ten studies met the inclusion criteria to be included in the final analysis. All final papers were identified by the Medline search, but only two were identified by searching Global Health alone. Reference searching and discussion with an experienced genomics researcher identified a further eight papers, of which two met the inclusion criteria and were retained.

Fig. 1: PRISMA flow-chart showing the stages of the systematic review.
figure 1

PRISMA flow diagram of study selection. The screening process identified ten studies from an initial pool of 1592 as being relevant to the current review and having satisfied the inclusion criteria. This PRISMA flow diagram demonstrates the process of identifying and screening relevant studies.

Quality assessment

Quality assessment of identified literature was performed by two independent researchers using a validated tool for qualitative and quantitative study appraisal [21]. This analysis generated a comparable score to assess the methodological strength of identified literature. Quality assessment scores were not used to exclude papers due to the paucity of literature in this area, however all included papers scored highly regardless.

Analysis of final literature

Due to the disparity of aims and methodologies of the identified papers, it was not feasible to do a meta-analysis of results. Therefore, a narrative synthesis was performed according to the Cochrane guidelines [22]. Thematic analysis was used to generate themes from the included studies [23]. This technique has been adopted for use in systematic reviews and is effective for analysis of small samples of quantitative literature [18, 24]. Identified themes were confirmed between researchers to minimise bias and ensure fair representation of identified papers within this review. The Cochrane Risk of Bias tool was referred to, to ensure assessment of the robustness of evidence in identified papers and to reconcile limitations within this review [25].

Results

Ten papers met the inclusion criteria for this search, with core features described in Table 2 [26,27,28,29,30,31,32,33,34,35]. These trials used either observational [27, 28] or case-control design which focused on specific diseases to identify the cohort of interest. The diseases these papers focused on were haemochromatosis [31], Huntington’s disease [34], familial and sporadic breast and ovarian cancer (FBOC and SPBCOC, respectively) [26], cystic fibrosis [32], retinitis pigmentosa [33], diabetes [29], Turner syndrome [35], 47,XXX syndrome [35] and prion disease [30].

Table 2 Description of key characteristics of final papers from the literature search.

All included studies scored full marks in the quality appraisal, with one exception. Cardenosa et al.’s study design was not entirely appropriate to the study aim, because only selected populations underwent BRCA1/2 sequencing, preventing comprehensive comparison. In addition, the sporadic population was disproportionately small relative to the familial population resulting in underpowering and researchers did not fully discuss how this limitation could impact the generalisability of their findings.

Summary of thematic analysis findings

Seven themes were identified from the included studies: variant classification, clinical implications, evidence, bias, study design, study limitations and disease risk modifiers. Categories and themes identified can be seen in Fig. 2. In the interest of brevity, only the themes most relevant to the aims of this review are discussed below.

Fig. 2: A schematic depicting the themes and categories generated from the included studies.
figure 2

The 48 categories generated by the thematic analysis are represented in this schematic as small boxes. The seven themes are represented by large boxes. The colours indicate the overarching theme that the category belongs to.

Disease risk modifiers

Most evidence for variations in penetrance comes from incomplete penetrance within families. Rose et al. identified the link between three or four copies of MRS1 in penetrance of retinitis pigmentosa [33]. Cassa et al. indicated that either erroneous findings or risk modifiers must occur at higher frequencies than previously believed, due to identifying a high proportion of variants annotated as pathogenic from the Human Gene Mutation Database (HGMD) in the 1000 genome cohort [27]. Chen et al. identified 13 individuals harbouring variants for severe childhood-onset disorders without apparent phenotype [28]. This suggests that risk modifiers exist even for variants which are classified as 100% penetrant, which has wide implications for the rare disease community.

Population studies were used to identify if there is variation in prevalence or penetrance of a disease.

Rose et al. compared European and Asian-origin populations, because Asian populations have higher penetrance for retinitis pigmentosa, and identified that Asian populations have a lower prevalence of the four copies of MSR1 genotype [33]. The approach to calculating prevalence varied between studies, with Thauvin-Robinet et al. using partners of CF patients, whereas Pedersen and Milman used a country-wide sampling approach [31, 36]. Thauvin-Robinet et al. used their approach to calculate the number of newborns with the [R117H] + [F508del] genotype, which they compared with the true number identified [31]. They also aimed to see if the IVS8 poly(T) played a role in modifying phenotype. However, T5, which is believed to be more pathogenic, had a very low prevalence in the French cohort. In populations with higher prevalence of T5, it has been associated with higher penetrance [37]. Pedersen and Milman used a genotype-driven approach to population sampling to calculate prevalence within the population [31]. They phenotyped a proportion of this population to calculate penetrance of these variants, which was a novel approach to penetrance estimation for haemochromatosis [38].

Tuke et al. identified mosaic X chromosome 45,X/46,XX in a higher than expected number of patients within women in the UK Biobank cohort (76/100,000) [35]. Turner syndrome is caused by 45,X genotype with key phenotypes related to short stature, cardiac disorders and reproduction. Comparison between these mosaic individuals and normal 46,XX women identified no significant increase in cardiac or reproductive incidents. Clinically this has significant implications as women identified as 45,X/46,XX appear to require no clinical follow-up.

Biological mechanisms for how a variant causes disease were presented either by building upon previous work, which has indicated a mechanism [26], or by studying the mechanism as part of the studies design [33]. Cardenosa et al. use a combination of biological evidence and case studies to select variants to study [26]. Rose et al. performed the only study in this review which aimed to identify the biological mechanisms underlying the risk variance [33]. In their study, they used a combination of in vivo and in vitro evidence to show that 3× MSR1 CNV induces haploinsufficiency through inhibition of transcription and that 4× MSR1 CNV may act as a rescue mechanism. It is likely that MSR1 has pleiotropic effects, as it modified the activity of other genes when inserted into the promoter region.

Evidence

Rose et al. were the only study to actively explain how the underlying biology affects disease risk [33]. Evidence quality is also dependent upon appropriate study design. Chen et al. included several aspects in their design to manage potential limitations [28]. They chose to only look at severe childhood-onset Mendelian diseases to increase certainty in findings. When they attempted to reduce stringency, the number of individuals identified and the subsequent complexity of verifying variants increased. A multistage filtering process was used, including assessing data quality to exclude candidates, to maximise the validity of the final candidates. They identified 13 individuals that are asymptomatic; however, only five could have confirmation sequencing performed which is part of the final stage of their process, so only these candidates definitively have been shown to harbour these variants.

Thauvin-Robinet et al. wanted to delineate the phenotype of R117H to calculate the prevalence and penetrance of [R117H] + [F508del] for causing CF [32]. They used a CF phenotype test and then genotyped for CF variants in phenotype-positive newborns, only identifying ~25% of predicted newborns. However, certain newborns may not present with the screened phenotype, so it could be a poor indicator for that genotype. Pedersen and Milman, by comparison, used a genotype-driven approach to identify prevalence and phenotyped afterwards to calculate penetrance [31, 38].

Laver et al. questioned the pathogenicity of R114W within HNF4A, due to being more common in the population relative to the expected population occurrence [29]. In addition, this variant did not exhibit the same phenotypes as other HNF4A MODY variants. They were able to identify that this variant is enriched in the MODY cohort and is not linked to type 2 diabetes. They were also able to demonstrate that relative to other HNF4A variants, R114W has a later median disease onset of approximately 10 years. This was consistent with the hypothesis that R114W has a reduced penetrance.

Sequeiros et al. used a combination of data samples to provide evidence that large normal alleles are common in the population, as an indicator that they are likely stably transmitted [34]. In addition, new intermediate alleles, identified through sperm analysis, are more unstable than those of identical size in the general population, although samples were only taken from four men [39].

Cardenosa et al. had the lowest score using the appraisal tool, largely due to uncertainty over the way they conducted their sequencing. They chose to only sequence BRCA1/2 in the FBOC group, and not the control group or the SPBCOC group. While BRCA1/2 are rarely found in sporadic cancer, only genotyping could confirm that BRCA1/2 is not present in the sporadic or control group [40]. Genotyping of BRCA1/2 in the control group would be important to indicate if BRCA1/2 variants occurred in the population. The researchers also grouped BRCA1/2 into one group with both breast and ovarian cancer, which may be confounding as BRCA1 and BRCA2 are separate genes and are associated with different cancer risks [41].

Cassa et al. designed their study to provide evidence of reduced or non-penetrance of variants listed by HGMD as pathogenic, within the 1000 Genomes cohort [27]. They identified 6917 variants of which 3744 had a minor allele frequency of ≥0.01. Some disease-associated variants have known varied penetrance, but this is insufficient to explain the volume of identified variants. There was a high number of disease variants (n = 583) identified in this cohort with a MAF ≥ 0.01. This study provided evidence of erroneous findings, and likely unknown risk modifiers within the cohort. They based their rational on the fact that many variants are identified in symptomatic populations in studies with small cohorts. There are often limited validation studies and unmatched control populations. Their study indicated that the probability of observing a variant may not be equivalent to the risk of developing the disease. In a previous study by the same researchers, 10.6% of HGMD variants had sufficient clinical relevance and scientific validity to be shared with research participants [42].

Minikel et al. were limited in the evidence which they could gather due to lack of standardisation of genotyping across the countries involved in case collection [30].

Bias and study limitations

Several of these studies worked to demonstrate the effect of bias and minimise its impact. Key stated limitations within this review were missing diagnosis [32] and follow-up [28, 31, 32]. Missing description of limitations was identified within Cardenosa et al. [26].

Chen et al. were unable to follow-up candidates due to limited consent. This is especially a challenge within the 23&me cohort and other direct-to-consumer genetic testing (DTC-GT) companies, where inadequate consent prevents re-contact. Chen et al. attempted to expand their analysis in candidates from the UK10K, a WES cohort [28]. However, only five of the identified variants were annotated and none indicated a putative role. WGS would probably be required to identify putative variants for further study.

Chen et al. attempted to collect the largest possible sample of individuals to optimise the chance of identifying candidates of interest, including a combination of genotyped, whole exome and whole genome cohorts. SNP arrays made up 88% of samples, with 68% being from 23&me alone [28]. SNP arrays had the lowest coverage (377 loci vs. 699 for WES) and were responsible for the low coverage of loci (36.5%). Chen et al. do highlight that the exact composition of commercial pan-ethnic screening panels cannot be scrutinised [28]. They attempt to overcome issues around incomplete screening panels by creating their own expanded allele panel, which is more comprehensive than other commercial panels.

Study design can also act to limit the scope of a study, as can be seen in the decision to only BRCA1/2 genotype FBOC limited the findings of Cardenosa et al. [26]. It is suspected that these individuals had already been BRCA genotyped and the researchers did not have the resources to BRCA1/2 genotype the rest of the cohort. As this may have confounded the result, the design limits the scope of the findings.

Cassa et al. based their rationale upon the hypothesis that some variants which are classified as pathogenic are erroneous findings, and some of the other studies identified provided evidence which further supported these claims [27]. Rose et al. identified a mechanism which underlies variance between populations, which Cassa et al. hypothesised would happen when asymptomatic individuals were studied further [27, 33]. Thauvin-Robinet et al. confirmed that R117H is more commonly associated with asymptomatic patients than affected patients, making it potentially an erroneous finding if it were used in clinical decisions, such as prenatal testing [32].

While potential factors which may have biased the papers have been discussed in detail, they are likely to only have a moderate to low risk of bias due to their efforts to consider and manage limitations and bias within their design [25].

Discussion

This systematic review has summarised the sources of data for genetic variant penetrance estimates in a population setting. It has shown that the evidence base in this context is extremely limited. Limitations of study design, strength of evidence and disease risk modifiers may account for discrepancies between estimates generated form the traditional phenotype first, family history based approach and observed prevalence in the wide population or within genotype first cohorts.

Disease risk modifiers are central to understanding why disease phenotype varies between individuals. The included studies addressed indications of varied penetrance, support for reduced risk variants and biological mechanisms underlying variants. Data from one of the studies [35] suggests that mosaicism may be a powerful modulator of disease pathogenicity. The included studies demonstrate evidence to suggest that disease risk modifiers are relatively common in disease cohorts. This evidence is most robust when biological evidence is coupled with population data and so it can be suggested that genomic data repositories be combined wherever possible to increase the strength of such studies. This information has the potential to impact genomic counselling and treatment considerations.

Observational studies can be used to reveal weaknesses within current research. This is particularly important where the discovery cohort is significantly different to the population in which the testing will occur. Large discrepancies between predicted penetrance and observed disease should cause cautious consideration of the original estimates and prompt discussions about how to counsel and manage patients. Standardisation of case selection across countries would improve penetrance studies because it would create greater certainty of the levels of ascertainment, especially regarding lower penetrance variants. In addition, it would allow for better ascertainment of founder variants in specific regions which guide clinical management of affected families.

Bias and limitations are common issues within research, although their origin differs. Bias is the systematic misrepresentation of findings, due to omission of outcomes from their derivation, whereas limitations are factors that a researcher cannot control, which restrict methodology and conclusions. Limitations have the power to bias findings if they are unreported. A number of the included studies contained significant limitations. One of these being the issue of correctly phenotyping patients. Missing diagnosis can occur when phenotypes are mild preventing sufficient recognition to prompt genetic testing. Certain phenotypes are ambiguous and common, so an individual could easily dismiss these symptoms. More newborns possessed the [R117H] + [F508del], who were not identified by phenotyping; however, without genotyping, they could not be identified to calculate true penetrance. To overcome this limitation, all newborns would need genotyping and clinical follow-up to calculate true penetrance [32]. When assessing individuals who are suspected to be asymptomatic, follow-up is essential to clarifying if they are truly non-penetrant or reduced penetrance. This could prove particularly difficult on a population level, where DTC-GT data has been used and inadequate consent prevents re-contact. This is just one issue which needs to be addressed within the wider discussion of consent for DTC-GT [43, 44].

Another issue of note to DTC-GT and GWAS alike is the use of SNP arrays. SNP arrays are designed primarily for use in genome-wide association (GWA) studies, which are based upon the premise that common variation contributing to disease are present in frequencies >1% in the population and these studies are dependent upon large samples to create statistical power [45, 46]. This means that they are less effective at picking up rare variants, especially between ethnic populations, where variant prevalence differs and the composition of the arrays may be designed for a specific population 46. Up to 40% of variants identified by DTC-GT companies are believed to be false positives and some variants which these companies classify as pathogenic are classified as benign by software [47]. Recent work has further highlighted how poor these arrays perform for rare pathogenic variants, with over 80% of those identified being false positives [48]. This has profound implications for downstream clinical follow-up.

The close to significant findings of FGFR2 for SPBCOC in the Cardenosa study, indicates either too small a sample size or that BRCA1/2 genotyping may have confounded results, especially as this variant has been significant for this group in other studies [49, 50]. A recent GWA study again linked FGFR2 to breast cancer risk, although a different variant within this gene [51]. The Cardenosa study also does not characterise the spread of age of onset among the cohort, despite penetrance being known to vary in BRCA1/2 FBOC cohorts [52]. The non-BRCA1/2 FBOC group had a higher rate of breast cancer than the BRCA1/2 FBOC group, so this information could have implications for further study, although this is not commented upon in the discussion [26]. The grouping of BRCA1 and BRCA2 into the same cohort also warrants explanation as they confer different cancer risks [53].

Clinical implications

The major clinical implications of these studies relate to genomic counselling; either to guide genetic testing [32,33,34], inform risk reporting of findings [28, 31], or to guide changes to improve research and clinical interactions [27, 28]. These studies all recognised a need for further research to expand the applicability of findings to other populations, to identify mechanisms underlying findings and to validate findings. Therefore, the clinical implications of these papers can be considered as a need to ensure sufficient evidence before reporting findings and that complete consent must be taken to ensure follow-up is possible.

Generalisability: secondary findings and the 100,000 Genome Project

The clinical implications of these papers and themes expand beyond the individual diseases into a wider discussion of how incidental findings or SF should be included in clinical practice. The American College of Medical Genetics and Genomics (ACMG) created a list of genes, which they recommended should be sequenced alongside any genomic test. These genes were selected for their potential actionability [54, 55]. The resultant debate has led to some very different opinions regarding if SF should be reported, and if they are, what criteria make them actionable.

ACMG listed RET variants causing medullary thyroid carcinoma in childhood as being clinically actionable [56]. However, there have been cases of reduced penetrance, including a woman with a high-risk variant who developed thyroid cancer at 87 [57]. Management of this variant would normally involve thyroid removal, which is a major operation with a long-term impact on quality of life. If there are risk modifiers involved, these need to be identified to improve clinical decisions and minimise the risk of unnecessary medical intervention. This is the foundation of much of the reticence around incorporating SF into clinical settings [17].

In the UK, the 100,000 Genome Project has raised a lot of questions about how SF (termed additional findings in the project) should be managed [58,59,60], especially with uncertainty around family-based penetrance [17].

There have been efforts to create quantitative measures of actionability to facilitate incorporation into clinical reporting [61, 62]. Much of the study into the evidence base of reporting has been inconclusive, because many variants lack validation [17, 27, 42, 63]. The impact of disease risk modifiers is a complicating factor in the reporting of SF, with this review identifying several studies suggesting risk modification is common [27, 28] and that there are identifiable mechanisms underlying variance in penetrance [26, 31,32,33]. The role of ascertainment bias was explicitly addressed in Laver et al. when trying to evaluate risk as a key aspect of clinical management of patients [30]. The authors discuss the impact of ascertainment bias and ways to minimise this when evaluating whether variants are reduced penetrance or likely benign.

Cardenosa et al.’s design came closest to calculating family-based penetrance by using control vs. sporadic vs. familial populations. The familial group was additionally characterised by BRCA1/2 sequencing, which made comparison between these populations difficult and somewhat reduced the validity of the data to address this specific question [26]. However, this study did not aim to calculate penetrance for these variants, rather to assess the strength of association, whereas Pedersen and Milman used population screening to calculate penetrance for HFE variants using a genotype-to phenotype approach [31].

The most direct approach to characterising penetrance differences requires comparison between control, familial and sporadic populations. Large-scale projects such as UK Biobank and the 100,000 Genome Project will provide an invaluable resource for this. The large populations allow for comparison, with knowledge of phenotypes, and if any candidates of interest are identified, the whole genome can be interrogated to identify putative risk modifiers. The participants have indicated if they are available for follow-up, and thus can be further characterised to assess if the variants they have are benign or exhibit reduced penetrance. Tissue samples are also stored to allow for confirmation sequencing of putative variants or individuals can be re-contacted to collect samples [64]. The longitudinal study of participants will support age-stratified penetrance estimates, which is especially relevant to cancer risk.

Alternatively, the approach taken by Pedersen and Milman is effective for calculating penetrance [31]. They were able to genotype a large number of individuals for a specific disease and then follow-up a proportion for phenotyping. This approach would only be able to identify family-based penetrance, if combined with clinical data to identify individuals with a known family history for the disease. Penetrance could then be calculated separately between the familial and the sporadic groups to identify any statistically significant differences between the groups. Statistical significance would be an indicator for risk modifiers and these individuals could be further genotyped to identify putative variants. This approach would favour diseases which are easily phenotyped and relatively common in the population, as haemochromatosis is in the Danish population.

Limitations of this review

The biggest limitation of this review is that the small number of papers identified and the diversity of aims and methodologies which prevented meta-analysis of the data and direct testing of whether penetrance estimates are significantly different in cohorts where patients lack a positive family history. This suggests that the evidence base as a whole is poor and that extensive further research is needed in this area, however anecdotally it appears where such comparisons have been carried out, variants have higher penetrance in cohorts with a positive family history.

Conclusion

It remains unclear how much evidence there is to suggest that family-based penetrance impacts variability in disease risk. That being said, there is some indication that mechanisms exist which could explain variability between sporadic and familial groups and between ethnic populations. Concerns over how best to manage SF, and pressure to actively report findings, make this a pressing issue, which needs to be studied further. Further research is warranted encompassing familial, sporadic and control groups for a given condition or population sampling combined with clinical and phenotypic data to assess variant penetrance more extensively.

Further research into the ethical consequences of reporting SF without specific penetrance estimates in the relevant population is needed before they should be integrated into clinical practice.