Introduction

Reproductive success of human females relies on a myriad of genes involved in numerous cellular processes. Early in embryogenesis, the oocyte pool needs to be established, needs to be shielded from DNA damage, and needs to successfully commence and arrest in meiosis before cyclical hormonal induction and maturation occurs after menarche. If any of these processes is disrupted, premature ovarian insufficiency (POI) can result. POI is a common form of female infertility characterised by menstrual disturbance (amenorrhea or oligomenorrhea) with raised gonadotropins and low estradiol before the age of 40. It is known to have a strong genetic basis, however, the majority of patients have no established cause [1]. The only genetic analyses routinely offered to POI patients include FMR1 premutation analysis, accounting for 3–13% of cases [2], and microarray/karyotyping detecting chromosomal rearrangements, responsible for 10–13% of cases [3].

Although further genetic studies are not routinely offered or clinically advised according to the European Society of Human Reproduction and Emrbyology (ESHRE) guidelines [4], they are increasingly being performed in a research context. Understanding genetic cause of POI allows for prompt and appropriate clinical care by the relevant medical professionals [5]. For example, certain genetic causes are associated with co-morbidities such as cancer predisposition or neurological problems. Specifically, when a diagnosis is made in a gene involved in DNA damage repair, the possibility of cancer predisposition arises. Chromosomal stability studies can be performed to determine patient sensitivity to DNA damage. With an elevated risk of DNA damage, patients should be vigilant to respond to any early signs of cancer. Variants in MCM9, for example, have been associated with a predisposition to colorectal cancer in addition to POI [6]. If such a genetic cause of POI is identified, affected individuals can engage with gastroenterologists and have regular colonoscopies for disease surveillance, leading to early detection and better prognosis and outcomes.

Many of the genes associated with POI pathogenesis are involved in meiosis (Fig. 1). For successful meiosis, several steps need to occur in a tightly regulated manner. Sister chromatids must be bound together by the ring-like cohesin complex. Sister chromatid cohesion allows the assembly of the synaptonemal complex (Fig. 1), which brings together homologous chromosomes. The synaptonemal complex consists of an axial/lateral element (SYCP2, SYCP3), a transverse filament (SYCP1) and a central element (SYCE1, SYCE2, SYCE3, SIX6OS1, TEX12) [7]. Paired chromosomes then need to be positioned appropriately, which requires genes involved in chromosomal tethering to the nuclear envelope and spindle migration, such as NUP107 and POF1B [8, 9].

Fig. 1: Diagram depicting proteins involved in meiosis.
figure 1

Those in bold have been implicated in human POI. Those with asterisks were identified for the first time in this study.

Meiotic cell division relies on effective DNA damage and repair to mediate homologous recombination and cross-over events (Fig. 1). This begins with the initiation of DNA double strand breaks by SPO11, and is regulated by accessory proteins such as MEI1, MEI4 and REC114. DNA double strand breaks are detected by the early recombination machinery, including the MRN complex (composed of MRE11A, RAD50 and NBN) (Fig. 1). EXO1 excises DNA at double strand breaks to generate single-stranded DNA, which can then be loaded by the meiosis-specific recombinases, DMC1 and RAD51, promoting strand invasion. Helicases have an important role in strand invasion via their ability to unwind the D loop to make homologous DNA accessible to the invading strand. There are two parallel helicase pathways, one involving the helicase, HELQ, and another involving the helicases MCM8 and MCM9, recruited by HROB [10, 11]. After strand invasion DNA is resolved by the late recombination machinery (Fig. 1), involving mismatch repair proteins, MSH4, MSH5, and endonucleases MLH1 and MLH3 [12, 13]. A number of factors tightly regulate the relative number of cross over events, such as HFM1 and FANCM [14, 15].

Variants in many of these “meiosis” genes cause POI, as summarised in Supplementary File 1. The dual role of many genes in meiosis as well as somatic DNA damage repair, means variants can manifest as POI-related syndromes. This makes it particularly important to understand the consequence of their disruption, to delineate which patients will be at risk of associated co-morbidities, such as cancer. Indeed, women with certain cancers, such as BRCA1-related breast cancer, have a greater likelihood of diminished ovarian reserve [16]. Women with POI do not have a general increased risk of cancer [17], however, individuals with POI resulting from certain genetic aberrations can have high cancer risk. Prompt recognition of genetic cause therefore enables surveillance for cancer development in relevant individuals, potentially improving prognosis.

Our aim was to identify new causes of POI in affected women via whole exome sequencing (WES). We found novel likely causative variants in six genes, all with a relevant role in meiosis, and including two genes never before associated with autosomal recessive human POI, REC8 and HROB.

Methods

Participants

Patients were recruited after clinical consultation as part of our ongoing research program investigating the genetics of POI (N = 80 patients, to date). Available family and medical history is included in Table 1. All patients had POI, defined by menstrual disturbance and elevated FSH (>20 mIU/mL) measured twice at least one month apart as per the ESHR guidelines [18], had no history of ovarian surgery, infection or gonadotoxic therapy, had normal 46,XX chromosomal complement, were negative for FMR1 premutation and negative for ovarian auto-antibodies. Patients with North African ancestry resided in France.

Table 1 Summary of patient clinical details.

General molecular techniques

Genomic DNA was extracted from EDTA-blood manually with the NucleoSpin® Blood XL kit (Macherey-Nagel) or with an automated system, Hamilton Microlab STAR and Nucleospin® Blood L kit (Macherey-Nagel), and were assessed by NanoDropTM 1000 spectrophotometer and Qubit dsDNA BR Assay (Thermo Fisher Scientific). Selected variants were validated by Sanger sequencing using BigDye v3.1 Terminators (Applied Biosystems) and ABI 3130X. Primer sequences are available on request.

Whole-exome sequencing (WES)

DNA underwent WES with SureSelect Human All Exon V6 (Agilent) capture and sequencing on the NovaSeq 6000 (Illumina) or with SureSelect Human All Exon V7 (Agilent) and sequencing on the NextSeq 500/550 (Illumina). WES data were processed using Cpipe [19] or C-GeVarA pipeline (Constitutional Genetc Variant Analysis).

We performed two phases of analysis (Table 2) as previously described [1] – the first was gene-centric and used two gene lists (POI genes and meiosis genes, Supplementary File 2) and the second was an unbiased variant-centric approach. Variant-centric analysis focused on high-priority variants (those considered likely to lead to loss of function: frameshift, nonsense and splice site variants) in any gene and with any inheritance, or potentially bi-allelic moderate-high priority variants (missense, in-frame indels, frameshift, nonsense or splice spite). Only variants with MAF < 0.005 in 1000 genomes and gnomAD (https://gnomad.broadinstitute.org/) and with high quality scores (Q > 50 and allele balance >25) were considered. Heterozygous “high” priority variants were subsequently discounted if they affected genes tolerant of LoF according to gnomAD data, affected genes not expressed in gonadal tissue according to GTEx data, fell within minimally-expressed transcripts that had not been validated and/or affected genes with no discernable link to ovarian biology. Missense variant pathogenicity was predicted in silico using Mutation Taster (http://www.mutationtaster.org/), Polyphen-2 (http://genetics.bwh.harvard.edu/pph2/), SIFT/Provean (http://provean.jcvi.org/), and CADD (Combined Annotation-Dependent Depletion) (https://cadd.gs.washington.edu/snv). The conservation of affected residues was assessed by Multiz Alignments of 100 vertebrates (UCSC Genome Browser https://genome.ucsc.edu/). Described variants were submitted to ClinVar (SCV001810162 - SCV001810169).

Table 2 Filtering summary.

Results

Variants in known POI genes

A homozygous MCM8 variant: c.1905 + 1G > C

Patient 1, born to consanguineous parents of Senegalese descent, experienced primary amenorrhea and was diagnosed with POI at age 28, with an FSH level of 82.1 IU/I. Ultrasound revealed small but detectable ovaries (0.45 cm2 right, 0.43 cm2 left). WES identified a homozygous MCM8:NM_182802.3 c.1905 + 1 G > C variant (Supplementary Fig. 1, Table 3). This variant is present in the gnomAD database, however, at a low frequency of 0.000016 (0.000062 in African population), and never homozygous. The variant is predicted to disrupt a canonical splice donor site (https://www.genomnis.com/access-hsf).

Table 3 Molecular detail of candidate variants in Patients 1–7.

A homozygous STAG3 variant: c.2627G > A p.(Gly876Glu)

Patient 2, born to consanguineous parents of Algerian descent, was diagnosed with POI at the age of 16 following primary amenorrhea and an FSH level of 75.7 IU/L. Pelvic ultrasound revealed atrophic ovaries. She has a similarly affected female cousin, however, DNA was not available for analysis. WES identified a homozygous STAG3: NM_012447.4 c.2627 G > A p.(Gly876Glu) variant (Supplementary Fig. 2, Table 3). Although not falling within a known functional domain, glycine at this site is moderately conserved (Supplementary Fig. 2). The only alternative amino acid in mammals is also small and non-polar (alanine). A large negatively charged residue, like glutamate, is never observed. The variant is absent from public databases, such as gnomAD. Most, but not all, online algorithms predict this to be a likely deleterious variant (Table 3).

A homozygous POF1B variant: c.986G > A p.(Arg329Gln)

Patient 3, of Mali descent (no known consanguinity), was diagnosed with POI at age 17 following delayed puberty and primary amenorrhea. Her FSH was elevated at 56.1 IU/L and she had low AMH at 0.07 ng/ml. She had two similarly affected sisters, however, their DNA was not available for analysis. Pelvic ultrasound detected two ovaries with substantial surface area (1.73 cm2 on the left and 4.08 cm2 on the right), however no detectable ovarian follicles. WES identified a homozygous POF1B: NM_024921.3 c.986 G > A p.(Arg329Gln) variant (Supplementary Fig. 3, Table 3). This variant has conflicting annotation in Clinvar. Predictions of pathogenicity using online algorithms are also conflicting (Table 3). The overall frequency of this variant is 0.002642 (African population, 0.004854), and it is never detected in homozygous state in gnomAD.

A homozygous NUP107 variant: c.1064G > A p.(Arg355His)

Patient 4, born to consanguineous parents of Moroccan descent, received a POI diagnosis at age 21 after experiencing primary amenorrhea associated with elevated FSH of 62 IU/L. Estradiol and AMH were low (5 pg/ml and 0.1 ng/ml) and her pubertal development was Tanner III. Pelvic MRI identified uterine hypoplasia and only one ovary of 2.4 cm2 without follicles. The apparent uterine “hypoplasia” could be due to estrogen deficiency; however, she was not evaluated for uterine recovery post-hormone replacement therapy. WES identified a homozygous NUP107: NM_020401.4 c.1064 G > A p.(Arg355His) variant that is invariably predicted pathogenic by online algorithms (Supplementary Fig. 4, Table 3).

A homozygous HFM1 variant: c.2410G > T p.(Glu804Ter)

Patient 5 is from a consanguineous Algerian pedigree and presented with secondary amenorrhea, not receiving a POI diagnosis until the age of 33 when she had an FSH level of 96 IU/L. She had an affected mother who was post-menopausal at 40 and an affected maternal grandmother who was post-menopausal at 28, as well as an infertile brother with azoospermia (Supplementary Fig. 5). She had a sister with regular menses but intellectual disability. Unfortunately, familial DNA was not available for analysis. Pelvic ultrasound revealed ovaries of normal size, however, they were devoid of follicles. Puberty and secondary sexual characteristics were normal. WES in this patient identified a homozygous HFM1: NM_001017975.4 c.2410 G > T p.(Glu804Ter) (Supplementary Fig. 5, Table 3). This is present in gnomAD, at a frequency of 0.000004 and only in heterozygous state. This nonsense variant falls within the 21st coding exon of this 39-exon gene and therefore is predicted to induce nonsense mediated decay [20], although this was not further investigated by RNA studies.

New genetic causes of POI

Biallelic loss-of-function REC8 variants

Patient 6 is from a French non-consanguineous pedigree and experienced menarche at 14 years of age with regular menstruation for 1.5 years followed by infrequent menstruation (~3 monthly) and amenorrhea from age 16. She commenced hormone treatment and was re-evaluated at age 19, when elevated FSH was elevated (114.1 IU/L) and POI was diagnosed (Table 1).

WES identified two heterozygous variants in REC8: NM_001048205.1:c.1035_1036dup p.(Glu346Glyfs*72) and c.624 + 1 G > A p.? (Fig. 2). The c.1035_1036dup introduces a frameshift into the 13th coding exon of this 19 exon gene, and is predicted to cause nonsense mediated decay [20]. The c.624 + 1 G > A affects a canonical donor splice site, and is predicted to disrupt gene splicing. Sequencing parental DNA revealed that these variants were compound heterozygous (Fig. 2C). Both variants are absent from gnomAD and predicted disease-causing (Table 3). REC8 is likely under selection against loss-of-function (LoF) variants having an observed:expected ratio of 0.42, and with no homozygous LoF variants being detected in gnomAD.

Fig. 2: WES identifies REC8 as a novel meiosis gene underlying human POI.
figure 2

A Diagram depicting meiotic cohesion with REC8 closing the ring-like structure surrounding DNA. B Schematic diagram of the REC8 gene indicating variant location and conserved domains. C Sanger sequencing demonstrates parental segregation consistent with compound heterozygosity. Left panel: the c.624 + 1 G > A splicing variant is paternally inherited. Right panel: the c.1035_1036dup p.(Glu346Glyfs*72) variant is maternally inherited.

A homozygous loss-of-function HROB variant

Patient 7, of Guinean descent and not known to be consanguineous, experienced delayed puberty and primary amenorrhea. Her first hormonal assessment was not until the age of 33, when FSH was found to be elevated at 34 IU/L, and ovaries could not be visualised by ultrasound.

WES identified an apparently homozygous HROB: NM_024032 cDNA c.421delG p.(Glu141ArgfsTer62) (Fig. 3). Parental DNA was not available to confirm true homozygosity, but based on WES coverage there was no evidence of a deletion of one allele. This variant falls in the third coding exon of this ten exon gene, and introduces a frameshift, likely leading to nonsense-mediated decay [20]. The variant is detected twice in heterozygous state in gnomAD giving a minor allele frequency of 0.000008 (Table 3) and is not detected in the African population in gnomAD. No homozygous LoF variants in this gene are reported in gnomAD.

Fig. 3: WES identifies HROB as a novel meiosis gene underlying human POI.
figure 3

A Diagram depicting the role of HROB in recruiting the helicase MCM8/9 complex to DNA double strand breaks. B HROB gene diagram indicating variant location and domains above (PPR proline rich region, DUF domain of unknown function). C Sanger validation of the homozygous HROB c. 421delG variant.

Discussion

A novel causative splice site variant in MCM8 associated with isolated POI

Given MCM8 is a known autosomal recessive POI gene, with previously reported LoF causative variants, we conclude that the novel splicing variant in Patient 1 is responsible for POI (Table 4). Six pathogenic variants in MCM8 have been reported in the literature, all but one being predicted LoF alleles, like the c.1905 + 1 G > C variant in Patient 1 that abolishes a canonical splice site (Supplementary Fig. 1). Interestingly, MCM8 has been associated with both isolated POI [21] and syndromic POI involving short stature and pilomatricomas [22]. MCM9, the binding partner of MCM8, has similarly been associated with both isolated and syndromic POI, including cancer predisposition [6, 23], highlighting the role of this complex in both somatic and germ cell DNA repair. The identification of a novel MCM8 variant associated with isolated POI supports the critical role of this gene primarily for meiosis.

Table 4 Variant based criteria.

A novel homozygous missense variant in STAG3 is a variant of uncertain significance

Missense variants in STAG3 are a recently identified cause of non-syndromic POI and azoospermia [24, 25]. Patient 2, who harbours the homozygous c.2627 G > A p.(Gly876Glu) missense variant in STAG3, presents with primary amenorrhea and atrophic ovaries, a phenotype consistent with previously reported patients, who usually have streak gonads. Without functional analysis, it remains possible that this is a benign variant and that POI is due to another genetic cause. If familial DNA could be obtained including from the similarly affected cousin, segregation studies could provide more definitive genetic diagnosis. Importantly, one of the reported patients with biallelic STAG3 variants suffered a germ cell tumour [26], suggesting cancer surveillance may be warranted, although the small case number prevents prediction of a statistically significant increase in risk.

Recurrence of a POF1B missense variant consolidates its status as pathogenic

The homozygous POF1B c.986 G > A p.(Arg329Gln) variant is previously reported to cause POI, but is classed as a variant of uncertain significance (VUS) in ClinVar. It was initially described as pathogenic because it fell within a 21 Mb linkage region determined by homozygosity-by-descent mapping, with an identical haplotype shared by four affected sisters but not an unaffected sister in a large consanguineous pedigree. Functional studies demonstrate this variant disrupts POF1B binding to non-muscle actin filaments [8] and results in disorganised F-actin at tight junctions [27]. Furthermore, this variant has been detected in a POI patient with a reciprocal translocation of the other X chromosome that causes non-random X-inactivation and therefore, functional homozygosity of the p.Arg329Gln variant [28]. We present a third independent family harbouring this variant in association with POI. Given the established functional evidence suggesting this variant is deleterious, and its recurrence in POI pedigrees, this variant is likely causative in Patient 3 (Table 4). Our study adds weight to the POF1B variant being pathogenic and suggests a need to update public resources, such as ClinVar.

A novel NUP107 causative variant consolidates a genotype:phenotype correlation

Interestingly, the c.1064 G > A p.(Arg355His) NUP107 variant detected in patient 4 is a novel genomic variant that alters the same amino acid as a previously reported variant, c.1066 C > T, p.Arg355Cys [29]. Although segregation studies and in silico analyses supported the previously reported variant being pathogenic, a knock-in mouse model had grossly normal ovarian pathology, and the authors concluded that using the ACMG guidelines, the variant was a VUS until additional human studies could confirm its pathogenicity [29]. Our discovery of a variant altering the same residue, and similarly abolishing its positive charge, provides this data, and establishes these variants as causative (Table 4).

NUP107 is ubiquitously expressed and encodes a component of the nuclear pore complex. Variants in NUP107 are an established cause of autosomal recessive nephrotic syndrome and Galloway-Mowat Syndrome-7 [30, 31]. Recently, the role of this gene in human ovarian function has been uncovered by the study of a large Palestinian pedigree with four individuals affected by isolated POI [32]. A homozygous missense variant segregated with POI in the family, and transgenic rescue experiments in fly validated that the variant impaired protein function and resulted in female infertility. The affected residue, p.Asp447, forms a salt bridge with the p.Arg355 residue affected by the variant identified in our study, and the other previously reported variant [32]. Taking into account the literature, and our newly described pathogenic variant, we speculate that variants affecting residues of this salt bridge, may disrupt NUP107 structure and/or function in a way that renders it incapable of its role in meiosis, but still able to perform its role in nephrotic function. Functional studies are required to investigate this theory.

HFM1 is an autosomal recessive POI gene that may also be involved in male infertility

Biallelic variants in HFM1 are a known, albeit rare, cause of POI, with one publication describing three affected individuals from two independent families. All three affected individuals presented with secondary amenorrhea in young adulthood, like Patient 5 of this study [33]. This is the first report of a pathogenic nonsense variant (Supplementary Fig. 5, Table 4). It is intriguing that this woman had an azoospermic brother, which raises the possibility that defects in HFM1 may account for infertility of both sexes in humans. This is in keeping with both male and female infertility of the HFM1 knockout mouse model [14]. To investigate further, familial DNA is required, but was not available.

Variants in REC8 are a new cause of autosomal recessive POI

The compound heterozygous LoF variants in REC8, identified in Patient 6, represent a new genetic cause of POI. REC8 encodes a meiosis-specific component of the cohesin complex that binds sister chromatids (Fig. 2A). The cohesin complex is a ring-like structure wrapped around chromosomes [34]. In humans, the cohesin complex responsible for binding sister chromatids during meiosis is composed of SMC1β and SMC3 which heterodimerise to form a V-like structure. This V-shape is closed to form a tripartite ring by a kleisin, either REC8 or RAD21L (Fig. 2A). STAG3 binds REC8 to stabilise the cohesin complex [35]. Consistent with an essential role in meiosis, male and female Rec8-deficient mice are sterile and have meiotic arrest with synapsed sister chromatids and the absence of crossovers [36, 37].

Heterozygous REC8 variants have been detected in POI patients, however these have had no functional validation or functional studies have revealed no impact on protein stability [38, 39]. One REC8 variant was confirmed to be inherited from an unaffected mother [39]. This suggests these variants are not causative in isolation, and potential oligogenic inheritance was proposed [39]. Furthermore, the mother of Patient 6 of this study, harbours one LoF allele and is currently 49 years of age and has not reached menopause, suggesting haploinsufficiency of REC8 is tolerated.

Given the role of REC8 in mammalian meiosis, the intimate relationship between REC8 and the known POI gene, STAG3, the POI-like phenotype of the Rec8-deficient mouse model and the severe nature of the biallelic variants identified in Patient 6, we consider these REC8 variants likely causative (Table 4).

Variants in HROB are a new cause of autosomal recessive POI

This study describes the first likely causative variant in HROB in a patient with isolated POI. HROB, also known as C17ORF53 or MCM8IP, is a recently identified factor required for homologous recombination [10]. It interacts with the MCM8/MCM9 helicase required for unwinding DNA at sites of damage to allow strand invasion and DNA synthesis-mediated repair [10] (Fig. 3A). Mice deficient in Hrob have meiotic arrest, leading to a POI-like phenotype [10].

Given variants in MCM8 and MCM9, encoding the helicases with which HROB interacts, cause POI [21, 23], it is likely that deleterious HROB variants are similarly causative. The POI-like phenotype in mice further indicates that Hrob is essential for mammalian meiosis, and is supportive of the homozygous frameshift HROB variant being causative (Table 4).

Expansion of the genotypic spectrum of POI patients with meiotic defects

Six patients of this study had likely causative biallelic variants in genes with a role in meiosis. We describe the first nonsense variant in HFM1, identify a novel NUP107 variant affecting a salt bridge previously implicated in POI, add new evidence to support the pathogenicity of a previously reported POF1B variant, and identify novel variants in MCM8 and STAG3 in association with POI. Importantly, we describe likely causative biallelic LoF variants in REC8 and HROB, genes that have not previously been shown as a cause of autosomal recessive human POI. The role of these genes in POI pathogenesis will be consolidated by finding additional affected families, but concordance with mouse model phenotypes, the relevance of the affected genes and the severe nature of the biallelic variants support these being bone fide new genetic causes. This study highlights the important role of meiosis in ovarian function and female reproduction.