DNA–protein crosslink proteases in genome stability

Ruggiano, Annamaria; Ramadan, Kristijan

doi:10.1038/s42003-020-01539-3

Download PDF

Review Article
Open access
Published: 04 January 2021

DNA–protein crosslink proteases in genome stability

Communications Biology volume 4, Article number: 11 (2021) Cite this article

6969 Accesses
38 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Proteins covalently attached to DNA, also known as DNA–protein crosslinks (DPCs), are common and bulky DNA lesions that interfere with DNA replication, repair, transcription and recombination. Research in the past several years indicates that cells possess dedicated enzymes, known as DPC proteases, which digest the protein component of a DPC. Interestingly, DPC proteases also play a role in proteolysis beside DPC repair, such as in degrading excess histones during DNA replication or controlling DNA replication checkpoints. Here, we discuss the importance of DPC proteases in DNA replication, genome stability and their direct link to human diseases and cancer therapy.

Bromodomain proteins: protectors against endogenous DNA damage and facilitators of genome integrity

Article Open access 21 September 2021

The roles of RNA in DNA double-strand break repair

Article Open access 02 January 2020

DNA end resection and its role in DNA replication and DSB repair choice in mammalian cells

Article Open access 30 October 2020

Introduction

Our genome is constantly exposed to various forms of DNA damage. DNA–protein crosslinks (DPCs) are common lesions and form when a protein—of any size and nature—becomes covalently bound to DNA after exposure to a physical or chemical crosslinker. They are bulky and impose physical obstacles to various DNA metabolic processes, such as DNA replication, repair, transcription and recombination (Fig. 1). Thus, the fidelity of DNA replication across DPCs and the accuracy of DPC repair pathways are pivotal in avoiding genomic instability, which can lead to ageing-associated diseases such as cancer. Whereas the consequences of unrepaired DPCs are well appreciated, the components, regulation and dynamics of the DPC repair pathway(s) are far from being understood.

**Fig. 1: Proteolysis protects cells from DPC toxicity during various DNA metabolic processes.**

DPCs are essentially removed by either canonical nucleases or dedicated proteases that degrade the protein component of the DPC (DPC proteolysis)^1,2. DPC proteolysis has gained recognition with the discovery of the metalloproteases Wss1 (weak suppressor of Smt3-1) in yeast and SPRTN (SprT-like N-terminal domain, also known as Spartan or DVC-1) in higher eukaryotes^3,4,5,6. Both proteases possess an intrinsic metalloprotease active center in specialized but phylogenetically distinguished domains (Fig. 2): the WLM (Wss1p-like metalloproteases) domain in Wss1 and SprT domain in SPRTN¹. Wss1 and SPRTN depend on DNA binding for their proteolytic activity but have no defined sequence specificity. This feature is advantageous considering the heterogeneous nature of crosslinked proteins. However, it also exposes chromatin-associated and potentially functional proteins to the risk of undesired proteolysis. Thus, DPC proteolysis must be tightly regulated to minimize such risks. A comprehensive understanding of these regulatory modes is still lacking⁷. Remarkable progress in the field of DPC repair has also been made in recent studies on emerging DPC proteases with functional overlap to Wss1 and SPRTN. These proteolytic activities are primarily employed in DPC repair.

**Fig. 2: The structural features of the known DPC proteases.**

Besides DPCs, other types of substrates have been described for these proteases, for example tightly bound, albeit not crosslinked, chromatin proteins with the potential to disrupt the fidelity of DNA replication, like trapped poly (ADP-ribose) Polymerase-1 (PARP-1). It is therefore becoming increasingly clear that DPC proteases guard multiple aspects of genome maintenance.

In this review, we briefly discuss DPC formation, and expand on DPC proteases and the first and essential step of DPC repair: proteolysis of the protein component. We revise how DPC proteases contribute to genome maintenance, especially during DNA replication, and why defects in their activities lead to human disease.

DNA–protein crosslinks

DPCs originate when proteins become crosslinked to DNA after exposure to physical or chemical agents, such as UV light or aldehydes, respectively (non-enzymatic DPCs), or as a result of faulty enzymatic reactions (enzymatic DPCs)⁸. Enzymatic DPCs are well exemplified by Topoisomerase-1 and Topoisomerase-2 cleavage complexes (Topo-1ccs, Topo-2ccs). During the physiological reaction of Topoisomerase on DNA, a transient, covalent intermediate (i.e., cleavage complex) forms between the catalytic tyrosine residue and the DNA phosphate group (phosphotyrosyl linkage). Stabilization of the cleavage complex (and formation of a DPC) can happen spontaneously if DNA is damaged, but is enhanced in the presence of poisons, e.g., camptothecin (CPT) or etoposide, for Topo-1 or Topo-2, respectively^9,10. Notably, Topo-1/2 poisons are widely exploited in cancer chemotherapy^2,11. Enzymatic DPCs also include crosslinks of DNMT1 (DNA methyltransferase 1) to the DNA methylation inhibitor 5-aza-2’-deoxycytidine (5azadC) incorporated into DNA^12,13, and of HMCES (5-Hydroxymethylcytosine binding, ES-cell-specific) to abasic sites in single-stranded DNA¹⁴.

In the case of non-enzymatic DPCs, virtually any protein—of variable size, structure and nature—in the vicinity of DNA can be crosslinked. One of the most potent crosslinkers, formaldehyde (FA), is heavily present in the environment and produced endogenously via processes like lipid peroxidation, and DNA, RNA or histone demethylation^{15,16,17,18,19}. Hence, FA release can occur in the surroundings of DNA, implying that DPCs form continuously and cells must constantly overcome DPC-induced toxicity.

Defective DPC repair leads to sensitivity to crosslinking agents, faulty DNA replication and cell cycle abnormalities, which pave the way for chromosomal instability and carcinogenesis in humans and mice²⁰. Hence, multiple pathways work to ensure a proper response to these insults. Nuclease-dependent mechanisms, like nucleotide excision repair (NER) and homologous recombination, operate in both bacteria and eukaryotic cells by excising the DNA flanking the DPC^{21,22,23,24,25}. However, NER seems to have a fairly limited role in overall DPC repair, as it can only remove small DPCs (8–10 kDa in size in mammalian cells)^5,23,26. Topo-1/2ccs can be excised by dedicated tyrosyl-DNA phosphodiesterases, TDP1 and TDP2, which cleave the phosphotyrosyl linkage²⁷; this is normally shielded and becomes accessible to TDPs after partial proteolysis or structural changes of Topoisomerases^{28,29,30,31,32}.

A much more pliable, less discriminate way to process DPCs is through proteolysis of the protein component operated by specialized DPC proteases, namely DPC proteolysis repair.

DPC proteases

Wss1 and SPRTN: the first members of a class of repair enzymes discovered

DPC proteolysis repair was discovered in yeast with the metalloprotease Wss1³. A concomitant study in Xenopus egg extract hinted at a similar pathway in metazoans³³. The dedicated enzyme in metazoans was later found to be SPRTN^4,5,6. However, phylogenetic analysis has revealed that Wss1 and SPRTN are not orthologs but functional homologs^1,11,34. The similarity between their sequences is limited within the protease domains—WLM and SprT—and around the active center typical of metalloproteases (HExxH), and they show 24% identity⁵ (Fig. 2).

Wss1 and SPRTN confer resistance to FA, showing a general role in DPC repair^3,4,5,6. This is also supported by in vitro cleavage studies, which demonstrate that Wss1 and SPRTN can process DNA-binding proteins of variable size and structure, exemplified by the smaller histone H3 and the larger Topo-2^3,4,5,6. Although Wss1 is capable of processing purified Topo-1, WSS1 deletion alone does not sensitize yeast cells to CPT, in striking contrast with the effects of SPRTN depletion in mammalian cells^3,5. This is due to the existence of a yeast protease with a similar function and will be discussed below. A good explanation for this promiscuity came from structural studies of the yeast WLM domain³⁵. As the protease domain lacks a substrate-binding pocket, it can accommodate a vast range of structures. DPC proteolysis leaves a remnant peptide, still of unidentified size, attached to DNA. Although less problematic for DNA replication and transcription, this remnant must be repaired via other activities (e.g., nucleases, tyrosyl-DNA phosphodiesterases) in coordination with or post-proteolysis. How this is achieved is not well understood.

Active DPC proteases are potentially dangerous and cells adopt strategies to restrain their activities⁷. First, Wss1 and SPRTN bind to and are activated by single-stranded (ss) and double-stranded (ds) DNA^{3,4,5,6,36,37,38}; this feature protects nuclear soluble proteins from unwanted proteolysis. The structure of the SprT domain from human SPRTN suggests that the DNA dependence relies on the active site being shielded in the absence of DNA; DNA binding exposes the narrow groove with catalytic residues for peptide cleavage³⁷. Second, Wss1 and SPRTN are capable of in trans self-cleavage in the presence of DNA; this property is a regulatory mechanism to shut down the protease activity when at the chromatin⁴. Third, the activity of DPC proteases is further sharpened by post-translational modifications (phosphorylation, ubiquitylation, SUMOylation, acetylation) of the DPC proteases and/or their substrates, and interaction with partner proteins, for instance the sliding clamp PCNA (proliferating cell nuclear antigen), the ATPase p97 or the Topo-1 binding protein TEX264. The latter point will be expanded in a separate paragraph.

Emerging DPC proteases in genome stability

Recently, proteases other than SPRTN have emerged as potential DPC repair enzymes. These are ACRC/GCNA, FAM111A and B, DDI1 and DDI2 in human, and their respective orthologs. Among the human enzymes, SPRTN is the only essential gene in variety of cancer cell lines (Table 1; www.depmap.org).

Table 1 Essentiality of genes encoding for DPC proteases.

Full size table

These emerging DPC proteases have been linked to processing of DPCs and tightly bound proteins (Figs. 2 and 3). Besides the protease domain, where conserved catalytic residues map (Fig. 2), they share remarkable similarities in overall domain organization, which underlies potentially similar patterns of regulation. Nonetheless, some proteases might be more specific for a certain cell cycle phase, a developmental stage, or a class of substrates.

**Fig. 3: Overlapping and non-overlapping functions of DPC proteases.**

Germ cell nuclear antigen; also known as acid repeat-containing protein (ACRC)]

Germ cell nuclear antigen (GCNA) contains a SprT metalloprotease domain conserved across eukarya^11,39. In metazoans, it is expressed in germ cells, where it was recently shown to target Topo-2ccs^40,41. GCNA ectopically expressed in mammalian culture cells has also been involved in the resolution of 5-azadC-induced DNMT1 crosslinks⁴² (Fig. 3). GCNA is not essential in mammalian cell lines (Table 1), but its knock-out (KO) increases the chances of embryonic lethality in flies, worms and zebrafish⁴⁰.

DNA damage inducible 1 (DDI1) and DDI2 proteins

DDI1 and DDI2 proteins are aspartic proteases that interact with ubiquitin and the proteasome via a ubiquitin-like domain (UBL)^43,44 (Fig. 2). They promote proteasome-dependent replication fork restart after replicative stress through degradation of replication termination factor 2 (RTF2)⁴⁴. A direct involvement in DPC repair has not yet been explored, however the S.cerevisiae homolog Ddi1 was recently shown to aid Wss1 in resolution of CPT-induced and FA-induced DPCs^45,46.

Family with sequence similarity 111 member A (FAM111A)

FAM111A and its homolog FAM111B are serine proteases. Not much is known about FAM111B, except for its association with the etiology of a form of poikiloderma with pulmonary fibrosis (POIKTMP)⁴⁷. FAM111A was previously described as a PCNA interactor and for its role in restricting viral replication in host cells^48,49,50. Only recently has it been implicated in the repair of Topo-1ccs and trapped PARP-1⁵¹ (Figs. 3 and 4).

**Fig. 4: Proteolysis of non-DPC substrates during DNA replication.**

Evidence that GCNA, DDIs and FAM111A cleave DPC substrates is currently lacking. However, they do possess catalytic activity. FAM111A self-cleavage has been shown in vitro and in cells^51,52; ScDdi1 and HsDDI2 process substrates modified with long ubiquitin chains in vitro^53,54. Moreover, complementation studies show that mutations in the putative catalytic residues of GCNA⁴⁰, ScDdi1⁴⁵, and FAM111A^51,52 generate non-functional proteins. For ScDdi1, the in vitro activity has been tested in the presence of dsDNA, an activator of Wss1 and SPRTN, without success⁴⁵. Although DNA-dependent activity has been established as a regulatory mechanism for Wss1 and SPRTN, different proteases might respond to different modes of regulation.

A common feature among DPC proteases is DNA binding, a confirmation of their chromatin-related function. Minimal DNA binding domains (DBDs, Fig. 2) have been mapped by testing truncated versions in either DNA binding assays (e.g., EMSA)—as for Wss1, SPRTN, and FAM111A^{3,4,5,37,51,55}—or complementation studies in cells—as for Ddi1’s HDD (helical domain of Ddi1)^45,46. Like SPRTN and Wss1, FAM111A can bind ssDNA, and the integrity of this domain is necessary for in vitro self-cleavage, as well as its function in cells⁵¹. A possible interpretation for the ssDNA dependent-proteolysis will be elaborated later.

Overlapping functions of DPC proteases

The existence of different DPC proteases indicates that cells have invested in multiple enzymes to counteract DPC-dependent toxicity (Fig. 3). This comes as little surprise considering the heterogeneous nature of crosslinked proteins. Although SPRTN does not have strict sequence specificity, it preferentially cleaves DNA-binding proteins in poorly structured regions rich in lysine, arginine and serine residues⁵. The existence of alternative proteases would ensure that crosslinked proteins lacking these features are efficiently processed to prevent genomic instability. Here, we highlight the functional overlap between Wss1, SPRTN and the other proteases.

In metazoans, GCNA is predominantly expressed in germ cells and early embryos³⁹. Therefore, most of the studies have been conducted in animal models rather than human cells. In Drosophila and C.elegans, GCNA mutation exposes germ cells and embryos to replication stress—formation of RPA/γH2Ax foci and hydroxyurea (HU) sensitivity—and genomic instability, e.g., chromosome segregation defects and micronuclei, which overall limit reproductive success^40,41. C.elegans fertility defects are exacerbated by concomitant mutations in the SPRTN homolog dvc-1^40,41. Drosophila embryos mutated in both Gcna and mh (maternal haploid, mh, is the Drosophila SPRTN homolog) do not complete embryogenesis⁴⁰. Setting a role in DPC repair along with DVC-1/mh, GCNA deletion increases total DPCs in germ cells and early embryos of flies, worms and zebrafish⁴⁰, with Topo-2cc being among the most abundant^40,41. Dvc-1 and gcna-1 mutant worm embryos are equally sensitive to formaldehyde⁴². Overall, this indicates an overlap between GCNA and DVC-1 at the organismal level. Germ cells and embryos are particularly vulnerable to the genomic instability resulting from DPC accumulation because mistakes would be inherited. Also, changes in gene expression and histone demethylation during embyogenesis⁵⁶ might especially expose germ cells to FA release¹⁷ and DPCs formation, which explains the need for having multiple DPC proteases.

S.cerevisiae Ddi1 and human FAM111A are required for tolerance to Topo-1ccs, a common target of yeast Wss1 and human SPRTN. Unlike wss1 mutation alone, the deletion of both WSS1 and DDI1 sensitizes yeast cells to CPT and identifies Ddi1 as the elusive, redundant protease for Topo-1cc repair^3,45. Genetic data additionally indicate that Ddi1 gives resistance to the DPC-inducing agent FA, along with Wss1^45,46.

SPRTN and FAM111A also have overlapping functions. Both proteins have been found at nascent DNA^5,50,57 and both prevent replication fork stalling in the presence of crosslinking agents, specifically, FA for SPRTN^5,58 and CPT for both FAM111A and SPRTN⁵¹. FAM111A depletion does not appear to reduce fork speed in unchallenged conditions, in contrast to SPRTN mutation^{5,51,59,60,61}. Thus, FAM111A might come into play when the DPC overload exceeds the capacity of SPRTN, which is expressed at low levels. Such interplay is purely speculative at present. Interestingly, FAM111A KO but not SPRTN hypomorphic cells are sensitive to PARP-1 inhibitors, which were shown to trap PARP to DNA, suggesting that some DPC proteases might have preference for certain substrates⁵¹.

Regulation of DPC proteases by post-translational modifications

Post-translational modifications (PTMs) have great regulatory potential, and most of the proteases discussed here can bind ubiquitin, via ubiquitin-associated domain (UBA) or ubiquitin-binding zinc finger (UBZ), or SUMO (Small ubiquitin-like modifier) via SUMO-interacting motif (SIM) (Fig. 2). Treatment of cells with DPC inducing agents, like FA and 5-azadC, induces signaling cascades, which culminate with the modification of the crosslinked proteins by ubiquitin and SUMO^42,62,63. We can predict that recruitment to or persistence on the damage site is modulated by PTMs. GCNA, for example, localizes to DNMT1 foci that form after 5-azadC treatment; this relocation depends on SUMOylation and GCNA’s SIMs⁴² (Fig. 2). SPRTN forms nuclear foci after exposure to FA and fails to do so upon pharmacological inhibition of the ubiquitylation pathway^42,62. Whether this relocation depends on its UBZ remains to be demonstrated. This initial evidence, substantiated by the predominance of SUMO-interacting and ubiquitin-interacting domains, proves that DPC proteases have the potential to be regulated via interaction with post-translationally modified proteins.

DPC proteases can themselves be modified for regulatory purposes. SPRTN exists in a mono-ubiquitylated form, but exposure to FA leads to its de-ubiquitylation—two different studies have identified the de-ubiquitylating enzymes VCPIP and USP11—and acetylation to allow recruitment to chromatin^4,64,65.

Wss1 and SPRTN interact with the ATPase Cdc48/p97 also known as valosin-containing protein (VCP) in mammals, via SHP (suppressor of high copy PP1)-box and VCP-interacting motif (VIM)^{3,36,66,67,68} (Fig. 2). Cdc48/p97 is a ubiquitin-dependent and SUMO-dependent chaperone acting with specific cofactors to unfold substrates for proteasomal degradation or disassembly from various macromolecules^69,70. The unfolding activity of p97 directly aids repair of Topo-1ccs, to which p97 is recruited via its novel cofactor TEX264 (Testis Expressed 264) before SPRTN-mediated proteolysis⁶⁸. Whether and how unfoldases are a constitutive requirement of DPC proteolysis repair remains to be established.

DPC proteases and DNA replication

DPCs can stall the CMG (Cdc45-Mcm2-7-Gins) replicative DNA helicase complex and/or DNA polymerase during DNA replication, depending on their size and location (leading or lagging strand)^33,71 (Fig. 1). Actively replicating cells are more susceptible to FA and CPT than non-replicating cells^5,72,73. If left unrepaired, these replication obstacles can cause prolonged fork stalling that leads to fork collapse, formation of double-strand breaks (DSBs) and genomic instability⁷⁴.

In yeast, Wss1 has been proposed to allow completion of DNA replication after FA exposure³, and Ddi1 is recruited to the damage site for removal of a model DPC in synchronized, S phase yeast cells⁴⁵. However, repair of DPCs in a DNA replication-dependent manner has been more directly shown for SPRTN, both in human cells⁵ and in Xenopus egg extract^75,76.

In line with S phase repair, SPRTN levels are higher in S/G2 phase—due to APC (Anaphase promoting complex)-Cdh1-dependent degradation in G1 phase⁶⁶—when it localizes directly at the replication fork^5,57. SPRTN interacts with PCNA via a PIP (PCNA-interacting protein)-box (Fig. 2), which is dispensable for SPRTN recruitment to the replication fork. A truncation variant lacking the C-terminus (and therefore the PIP-box) still localizes at nascent DNA in iPOND (isolation of proteins on nascent DNA) experiments⁵⁷ and rescues DNA fiber length⁶⁰. Thus, it is unclear how SPRTN interacts with the replication machinery for DPC repair.

In vitro, substrate processing by SPRTN is most effectively stimulated by ssDNA—with the potential to form hairpins—or dsDNA structures with nicks or gaps close to the crosslinked protein^4,37,38. This is consistent with results obtained in Xenopus egg extracts using a DPC plasmid, where SPRTN cleaves a model DPC without ongoing replication when there is a gap in the complementary DNA filament⁷⁵. During DNA replication in cells, uncoupling between helicase and stalled DNA polymerase exposes ssDNA and generates a dsDNA/ssDNA ‘hybrid’ that is compatible with the in vitro model. This specific DNA context presumably protects non-crosslinked chromatin-associated proteins from SPRTN activity³⁸.

GCNA is also active in S phase. Its role during DNA synthesis is supported by the replication stress (formation of RPA/γH2Ax foci) in mitotically active germ cells of GCNA mutant flies and worms⁴⁰, and by the fact that the Mcm2-7 components of the CMG helicase complex figure among the DPCs in GCNA mutant fly embryos⁴⁰. Another study links GCNA to the resolution of DNMT1 crosslinks, although this was shown with ectopically expressed GCNA in human somatic cells⁴². DNMT1 crosslinks form behind the replication fork, when DNMT1 is recruited to newly synthesized DNA to restore the DNA methylation pattern, where it can get crosslinked in the presence of 5-azadC. It remains to be determined whether effective DNMT1 crosslink repair needs GCNA catalytic activity.

FAM111A levels increase in late S phase and remain high during G2/M⁴⁸. FAM111A interacts with PCNA via a PIP-box (Fig. 2)^50,52. It is unclear to what extent FAM111A depletion impairs DNA replication. EdU incorporation studies have produced conflicting results^50,52,61. DNA combing assays in the absence of exogenous damage have either shown no reduction in the length of nascent DNA in FAM111A-depleted cells or a slight increase in tract length to compensate for the decrease in origin firing^51,61. Instead, FAM111A KO reduces replication speed upon treatment with CPT and PARP inhibitors and increases cellular sensitivity to these poisons⁵¹. Consistently, Topo-1cc foci accumulate in KO cells; an intact PIP-box and an intact protease domain are required to rescue these phenotypes⁵¹. Although in vitro proteolysis of Topo-1 has not been shown so far, FAM111A may be required for the removal of replication barriers. In this respect, the extent of overlap with SPRTN remains to be determined.

Strikingly, FAM111A overexpression causes EdU incorporation and cell cycle defects, and consequently increases the rate of DNA damage and apoptosis; this is probably due to unspecific cleavage of replication fork proteins, e.g., PCNA and RFC (Replication factor C)^50,52,61,77. However, overexpression of the PIP-box mutated variant recapitulates these phenotypes to some extent, arguing that binding to PCNA might not be the cause of the replication problems^50,52,61. These replication defects are exacerbated by overexpression of FAM111A variants carrying disease-causing mutations (Kenny–Caffey syndrome type 2, see below and Fig. 2)^52,61,77. Overexpression of FAM111B mutant variants, causative of POIKTMP (Fig. 2), largely mimics the phenotype of FAM111A overexpression⁵². In both cases, concomitant mutations in the protease domains abolish these defects, showing that the observed phenotypes are the direct cause of FAM111A/B-dependent proteolysis⁵². These data suggest that deregulated FAM111A/B enzymatic activity is detrimental to the cells. Thus, tight regulation of the level and activity of these proteases is critical for the successful completion of DNA replication.

Downstream pathway and translesion DNA synthesis

Translesion DNA synthesis (TLS) is a damage tolerance mechanism where the conventional DNA polymerase is exchanged for an error-prone TLS polymerase that bypasses the lesion. This switch depends on PCNA and its ubiquitylation by the E3 ubiquitin ligase Rad18⁷⁸. SPRTN was linked to TLS regulation after UV damage before its role in DPC repair was established^{66,67,79,80,81,82}. SPRTN interacts with both PCNA and ubiquitin, but how SPRTN regulates TLS is not entirely clear. Some studies suggest it is a positive regulator of TLS^79,81,82, while others propose that SPRTN prevents UV-induced, TLS-dependent mutagenesis^66,67,80.

Although TLS regulation by SPRTN was initially linked to UV lesion repair, it is plausible that bypass of the remnant peptide after bulk DPC proteolysis also requires TLS (Fig. 1). Some lines of evidence support this: (i) UV light can crosslink proteins to DNA^83,84,85; (ii) epistasis analysis shows that Rad18 and SPRTN work together⁵⁸; (iii) in Xenopus egg extracts, TLS polymerases are required to complete replication of a plasmid carrying a model DPC after digestion by SPRTN⁷⁵; (iv) in yeast, WSS1 deletion lowers mutation rates after FA³. Therefore, TLS might act immediately after DPC proteolysis to avoid fork stalling, since mutations are still more desirable than replication fork collapse.

There is one case where the accumulation of mutations can be beneficial. This is the hypermutation of immunoglobulin loci. SPRTN has been shown to favor mutations (DNA polymerase eta and zeta-dependent) at abasic sites and template switch-mediated gene conversion during immunoglobulin gene diversification in chicken B cells⁸⁶. Notably, abasic sites are a common source of DPC formation, and the hypermutation phenotype is consistent with TLS acting downstream of DPC proteolysis. In agreement, lower mutation rates at the immunoglobulin locus are observed in SPRTN-depleted cells⁸⁶. This suggests that SPRTN plays a role in generating the diversity of immunoglobulin repertoire and in the DNA damage tolerance pathway when DNA replication forks get stalled.

DPC repair outside S phase

Despite the numerous bodies of evidence linking SPRTN to replication-coupled DPC proteolysis, a replication-independent function of SPRTN has not been ruled out^4,5. SPRTN mutations or insufficiency manifest into hepatocellular carcinoma^57,60. The liver might be more exposed than other organs to DPCs because inhaled and ingested compounds can be catabolized into formaldehyde. However, the great majority of hepatocytes are quiescent⁸⁷, therefore it is plausible that SPRTN is operating outside of S phase, and that malfunctioning would lead to cancer. It is also possible that liver damage observed in SPRTN-mutated patients or SPRTN hypomorphic mice initiates during embryogenesis’ S phase but this defect emerges later.

Starvation-arrested worm larvae, where replication is not happening (cells are arrested in G1/S), are sensitive to formaldehyde⁴, indicating that processes other than DNA synthesis are affected. Another example of SPRTN activity outside S phase comes from Drosophila. The expression of a mutated mh (the SPRTN homolog) from the maternal genome in the zygote leads to loss of the paternal DNA during the first mitotic division and results in the lethality of haploid embryos—hence the name mh, “maternal haploid”^88,89. This loss is caused by the lack of condensation of the paternal DNA during mitosis, and it is inherently unlinked to DNA replication and S phase⁸⁸. The paternal DNA remodeling still relies on mh catalytic activity, but the mechanism is unclear^88,89.

GCNA also works outside S phase. Its expression peaks in mitosis, where it localizes on chromosomes⁴¹. Topo-2ccs are prominent GCNA targets^40,41. Topo-2ccs are abundant in mitosis because Topo-2 activity is required for separation of sister chromatids. Supporting a role for GCNA in Topo-2cc repair, GCNA and Topo-2 physically interact and colocalize, and mutant worms are sensitive to etoposide but not to camptothecin⁴¹.

Overall, these data confirm that DPC protease activity is essential to maintain genome integrity during DNA replication, as well as outside of S phase.

DPC proteases and transcription

DPCs are also potential obstacles for transcription²⁰. Although transcriptional stalling and mutagenesis can be threatening for genome stability⁹⁰, the effects of DPCs on the progression of RNA polymerases are a largely neglected field.

Some evidence linking DPC proteases and transcription has emerged. Rpb1, the largest and catalytic subunit of the RNA polymerase II, is degraded via Ddi1 and Wss1 after exposure to HU or UV light⁴⁵. Both proteases interact with Rpb1. However, this evidence provides insufficient proof that Ddi1 and Wss1 target Rpb1 as a crosslinked protein. Its degradation might be a consequence of transcription stalling due to roadblocks ahead of RNA polymerase II. In fact, under these circumstances Rpb1 degradation was described to rely on the ubiquitin-proteasome system, the ATPase Cdc48/p97 and SUMO^91,92. At any rate, a stalled, non-crosslinked Rpb1 would be equally “eligible” for degradation by DPC proteases, since other “non-DPC” substrates exist and will be discussed later in this review.

A link between transcription and FAM111A in human cells also exists. FAM111A overexpression reduces Rpb1 levels at the chromatin and, consequently, EU incorporation⁵². FAM111A also interacts with Rpb1 upon overexpression⁵². However, it is unclear if these phenotypes are related to direct Rpb1 proteolysis.

DPCs and proteasome

The most studied proteolytic machinery in the cell is the 26S proteasome (henceforth proteasome), which degrades proteins that have been previously modified by ubiquitin and unstructured by unfoldases^93,94. It is not surprising that proteasome activity has been tested towards DPCs²⁶. The involvement of the proteasome implies that the DPC must be modified with ubiquitin. Early sources of evidence showed that proteasome inhibition delayed Topo-1/2 degradation following exposure to Topo-1 and −2 poisons, suggesting that the proteasome is required for Topo-cc repair^95,96,97. However, these initial experiments failed to substantiate the formation of ubiquitylated Topo-ccs. Topoisomerase ubiquitylation was shown in whole-cell extracts – rather than on chromatin or Topo-ccs – and there was no increase in the topoisomerase ubiquitylation state after proteasomal inhibition^95,97. Moreover, these studies used very high doses of CPT (μM range)^96,97, which crosslink 90% of Topo-1⁹⁸ and might cause an overload of Topo-1ccs beyond the repair capacity of the other, replication-dependent, proteases.

More recent studies have directly shown that both enzymatic and non-enzymatic DPCs are modified by ubiquitin^{14,42,62,63,75}. However, whether this modification leads to proteasomal degradation seems controversial, as ubiquitylation might rather have a signaling role⁶².

Crosslinked proteins can be concomitantly modified by ubiquitin and SUMO (SUMO-1 or SUMO-2/3)^42,62,63,75. Modification via SUMO-2/3 prior to ubiquitylation hints at the involvement of SUMO-targeted ubiquitin ligases (STUbLs), which ubiquitylate SUMO-modified substrates for proteasomal degradation^42,63. However, scenarios have been described where these two modifications are independent, although the function of DPC SUMOylation in these cases remains unclear^62,75.

S.cerevisiae Ddi1 is a proteasome adapter⁹⁹ (Fig. 2). Ddi1’s UBL binds the proteasome subunit Rpn1^100,101, and, unlike conventional UBL domains, it also binds ubiquitin, along with UBA^43,102. However, there is no evidence that Ddi1 functions in DPC repair with the aid of the proteasome. In fact, overexpression of mutated versions lacking UBL and UBA domains complements DDI1 deletion in FA sensitivity experiments, while mutation of the putative catalytic domain does not⁴⁵. A different scenario is very likely for the human homologs DDI1 and DDI2. DDI1/2 bind the proteasome via their UBLs^43,44 (Fig. 2). DDI1/2 recruit the proteasome at stalled replication forks to remove RTF2, whose persistence would prevent fork restart after HU and cause chromosome instability⁴⁴.

As for proteasome-dependent DPC repair during replication, experiments in Xenopus egg extracts have provided the most compelling evidence^75,76. Instead, in mammalian cells, ubiquitylation of Topo-ccs via the STUbL RNF4 prior to proteasomal degradation is independent of replication⁶³. A recent report claims that the enzyme HMCES crosslinks to abasic sites ahead of the replication fork. In this way, HMCES shields the abasic sites from replication by TLS polymerases and prevents mutagenesis¹⁴. HMCES crosslinks are ubiquitylated and processed by the proteasome, but the role of other proteases has not been tested¹⁴. Although physiological crosslinking must happen during DNA synthesis to prevent switching to TLS polymerases, it has not been established that HMCES-DPC repair by the proteasome happens during replication. Post-replicative repair is plausible, as (i) HCMES-DPC per se represents an impediment to DNA polymerase progression and (ii) proteolysis by the proteasome would likely leave a peptide that can halt the DNA polymerase. In either case, a tolerance mechanism other than TLS must be in place to bypass the DPC or the remnant peptide (e.g., template switching). Therefore, post-replicative proteasome-dependent proteolysis remains a formal possibility in this case.

In conclusion, replication-coupled proteolysis by a large complex like the proteasome remains a matter of future investigation in mammalian cells. For example, it will be informative to determine whether proteasome subunits appear at nascent chromatin after treatment with DPC-inducing agents (e.g., by iPOND), since assessing the proteins enriched at stalled forks has been useful for other kinds of replication stress¹⁰³.

DPC proteases in processing of non-crosslinked, tightly bound substrates

Mass spectrometry has been performed on DPCs isolated from cells depleted of SPRTN⁵ and GCNA⁴⁰ to identify the most abundant crosslinked proteins. These screenings have proved that Topoisomerases, histones and Mcm subunits of the CMG helicase complex are the major DPCs in SPRTN-depleted cells and GCNA KO flies^5,40. Nucleic acid-binding proteins are expected to come up in such screenings⁵ since proteins acting in the vicinity of the DNA are most likely crosslinked. However, in vitro cleavage experiments are mainly performed on substrates that have the propensity for DNA binding but are not crosslinked to DNA^3,4,5,6,37. Hence DNA association rather than crosslinking per se is a requisite for cleavage. Recent studies have proven that this is true in cells as well. The three studies below illustrate that proteases process non-covalently DNA-associated proteins to protect cells from DNA replication errors.

Histones: A recent study postulates that Wss1 degrades unassembled, yet non-covalently bound, histones during replication stress¹⁰⁴. This Wss1 activity would protect cells from the unspecific binding of excess histones to the ssDNA accumulating after HU exposure, which can interfere with DNA metabolism¹⁰⁴ (Fig. 4A). The sensitivity of wss1 mutant yeast cells to HU is greatly enhanced by concomitant deletion of DDI1, and a catalytically inactive Ddi1 does not rescue cell survival^45,46. Thus, the protease Ddi1, like Wss1, might be required to cope with replication stress beyond DPCs. More interestingly, complementation studies in yeast suggest that hDDI1/2 retain the same function⁴⁶.

PARP-1: Tightly bound proteins can be as dangerous as DPCs for replication fork progression¹⁰⁵. This scenario is well illustrated by PARP-1 trapping. PARP-1 is an enzyme participating in DNA replication and many DNA repair pathways^106,107. PARP-1 catalyses the formation of poly-ADP ribose (PAR) chains—a reaction known as PARylation—that recruit other repair proteins; self-PARylation triggers an electrostatic dissociation from chromatin, allowing the repair reaction to proceed¹⁰⁶. Pharmacological inhibition causes PARP to become tightly bound—or trapped—on chromatin because of the defective self-PARylation. Chromatin persistence due to PARP inhibitors is more problematic than the catalytic inhibition per se¹⁰⁸. In fact, trapped PARP-1 can cause replication fork collapse and DSBs and has the potential of killing BRCA-deficient cells, a strategy that is used for treatment of cancer patients with BRCA mutations¹⁰⁹. FAM111A depletion sensitizes cells to niraparib and talazoparib⁵¹, two PARP inhibitors with a strong trapping capacity^108,110. Niraparib treatment reduces replication fork speed in FAM111A-depleted cells and causes replication stress. Rescue of the replication defect depends on FAM111A catalytic residues, PIP-box and DNA binding, implying that FAM111A uses its proteolytic activity to remove obstacles imposed by trapped PARP-1 to replication forks⁵¹ (Fig. 4B). Whether other proteases participate in the removal of tightly bound proteins remains to be established.

Chk1: The degradation of a non-crosslinked substrate echoes DPC removal: in either case, proteolysis eliminates potential obstacles for DNA replication and metabolism, which threaten cell viability^51,104. In striking contrast is the cleavage of the checkpoint kinase Chk1 by SPRTN during DNA synthesis⁵⁹. During S phase, SPRTN cleaves Chk1 at the chromatin, in a loosely structured region at the C-terminus, and releases truncated Chk1 N-terminal fragments with stronger kinase activity than the full-length protein⁵⁹ (Fig. 4C). This way SPRTN ensures a basal and physiological Chk1 activation to support DNA replication progression in the absence of exogenous damage, i.e., in the absence of insults that cause ssDNA accumulation and robust ATR-dependent Chk1 activation cascade^59,111. Overexpression of Chk1 N-terminal fragments restores normal DNA replication in SPRTN-depleted cells (i.e., replication fork speed and origin firing) and partially rescues developmental defects of SPRTN-deficient zebrafish embryos⁵⁹.

Besides uncovering an important role for Chk1 during unperturbed replication, this study also shows that the function of DPC proteases goes well beyond the proteolysis of replication fork barriers.

DPC proteases and diseases

Defective DNA replication, often referred to as DNA replication stress, is one of the major causes of cancer^112,113,114. DNA replication stress can be triggered by obstacles ahead of the replication fork, such as those imposed by crosslinked or tightly bound proteins. Since the activity of DPC proteases is linked to DNA replication, strong association exists between defective DPC proteases and human diseases^40,57,60.

Ruijs-Aalfs syndrome (RJALS)

Biallelic and monogenic mutations in SPRTN cause a rare autosomal recessive progeroid disease known as RJALS⁶⁰. This was the first disease to be linked to defective DPC proteolysis repair. The loss-of-function mutations identified in patients so far are a missense mutation that generates the catalytically inactive SPRTN-Y117C variant, and non-sense mutations that generate a truncated SPRTN, SPRTN-ΔC (Fig. 2), which retains partial functionality due to intact protease activity but defective cellular localization^4,5,57,60. RJALS patients show signs of premature aging—including cataracts, graying of the hair, lipodistrophy—and develop early-onset hepatocellular carcinoma (HCC)⁶⁰. These phenotypes have been recapitulated in SPRTN hypomorphic mice^57,115, and definitively prove that defective SPRTN alone is responsible for the RJALS phenotype.

Aging and cancer are seemingly conflicting manifestations, but truly two outcomes of the same underlying cellular defects—the accumulation of DNA damage and genomic instability^116,117. RJALS patient cells (patient-derived lymphoblastoid cell lines and fibroblasts) display DNA replication stress, G2/M leakage, increased number of DSBs, chromosomal aberrations and increased level of total DPCs. It is unclear why human patients and mice develop HCC while the rest of the body ages.

Kenny–Caffey syndrome type 2 (KCS2) and gracile bone dysplasia (GCLEB)

FAM111A mutations cause KCS2, a rare autosomal dominant disease characterized by short stature, hypoparathyroidism, hypocalcemia and abnormal bone development. KCS2 patients do not show any predisposition to develop cancer^118,119. Heterozygous mutations in FAM111A have also been found in patients with GCLEB, or osteocraniostenosis¹¹⁹. GCLEB is lethal in newborns because of severe skeletal abnormalities.

KCS2-causing and GCLEB-causing mutations alter FAM111A proteolytic activity. Substitution of R569 to Histidine (H) is a recurrent mutation in KCS2^118,119 and falls in the proximity of the catalytic residue S541 (Fig. 2). FAM111A-R569H and other disease-causing mutations enhance FAM111A self-cleavage in vitro, showing that they are gain-of-function mutations^51,52. Expression of FAM111A-R569H or other mutated variants causes replication and transcription defects and apoptosis^52,61,77. Thus, it seems plausible that the toxicity of the hyperactive protein arises from uncontrolled degradation of replication and transcription proteins (e.g., RFC, Rpb1)⁵²; however, it is unknown how these mutations result in the observed phenotypes.

Pediatric germ cell tumors (GCTs)

Mutations in GCNA associate with GCTs. Downregulation of GCNA expression—due to copy number loss and promoter hypermethylation—is found in 66% of GCTs⁴⁰ and correlates with a poor prognosis. These alterations in GCTs show that GCNA protease is critical for the genomic stability of germ cells.

Conclusions and perspectives

The topic of DPC proteolysis repair has undergone an extraordinary advance in the past year. The list of specific repair enzymes has come to include novel DPC proteases working with Wss1 and SPRTN. In most cases, direct proteolysis of DPC substrates remains to be formally tested with in vitro assays; however, experiments conducted in cells, such as sensitivity to crosslinking agents, have established that an intact catalytic domain is always essential. We anticipate that more rigorous and systematic in vitro studies will elucidate the optimal requirements for DPC proteolysis.

The fidelity of DNA replication and genome stability are compromised by loss or misregulation of these proteases. This is not only due to defective DPC repair but also to defective removal of non-covalent replication barriers. The latter exemplifies other important functions of DPC proteases in the maintenance of genome integrity.

DPC toxicity opens an interesting perspective for cancer treatment as well. The most commonly used chemotherapy agents are Topoisomerase poisons that induce enzymatic DPCs known as Topo-1/2ccs^2,11. Specific DPC protease inhibitors could synergize with DPC-inducing agents and be instrumental to chemotherapy. The recent finding that FAM111A confers resistance to PARP inhibitors⁵¹ has potential implications in cancer treatment, not only in light of the development of a FAM111A inhibitor. In fact, FAM111A expression varies among cancer cell lines⁵¹, thus the efficacy of PARP inhibitors might change. These types of information should help design better treatment strategies.

We expect that research will focus more imminently on the identification of substrates and the regulatory mechanisms, including interaction and coordination with other repair proteins for timely activation of the DPC protease activity. This basic knowledge of DPC proteases will pave the way for a better understanding of human diseases and cancer therapy.

References

Vaz, B., Popovic, M. & Ramadan, K. DNA–protein crosslink proteolysis repair. Trends Biochem. Sci. 42, 483–495 (2017).
Article CAS PubMed Google Scholar
Stingele, J., Bellelli, R. & Boulton, S. J. Mechanisms of DNA-protein crosslink repair. Nat. Rev. Mol. Cell Biol. 18, 563–573 (2017).
Article CAS PubMed Google Scholar
Stingele, J., Schwarz, M. S., Bloemeke, N., Wolf, P. G. & Jentsch, S. A DNA-dependent protease involved in DNA-protein crosslink repair. Cell 158, 327–338 (2014). This seminal study has uncovered the existence of a specialised enzyme for DPC repair.
Article CAS PubMed Google Scholar
Stingele, J. et al. Mechanism and regulation of DNA-protein crosslink repair by the DNA-dependent metalloprotease SPRTN. Mol. Cell 64, 688–703 (2016). This article has described the first layer of regulation for the protease SPRTN, which is its de-ubiquitylation after DPC formation.
Article CAS PubMed PubMed Central Google Scholar
Vaz, B. et al. Metalloprotease SPRTN/DVC1 orchestrates replication-coupled DNA-protein crosslink repair. Mol. Cell 64, 704–719 (2016). This article has discovered SPRTN as the first DNA-dependent metalloprotease in metazoans for removal of DNA-protein crosslinks during DNA replication.
Article CAS PubMed PubMed Central Google Scholar
Lopez-Mosqueda, J. et al. SPRTN is a mammalian DNA-binding metalloprotease that resolves DNA-protein crosslinks. Elife 5, 1–19 (2016).
Article CAS Google Scholar
Ruggiano, A. Biochemical sciences spotlight the trinity of SPRTN protease regulation. Trends Biochem. Sci. https://doi.org/10.1016/j.tibs.2020.10.007. (2020).
Stingele, J. & Jentsch, S. DNA-protein crosslink repair. Nat. Rev. Mol. Cell Biol. 16, 455–460 (2015).
Article CAS PubMed Google Scholar
Pommier, Y. DNA topoisomerase I inhibitors: chemistry, biology, and interfacial inhibition. Chem. Rev. 109, 2894–2902 (2009).
Article CAS PubMed PubMed Central Google Scholar
Nitiss, J. L. Targeting DNA topoisomerase II in cancer chemotherapy. Nat. Rev. Cancer 9, 338–350 (2009).
Article CAS PubMed PubMed Central Google Scholar
Fielden, J., Ruggiano, A., Popović, M. & Ramadan, K. DNA protein crosslink proteolysis repair: from yeast to premature ageing and cancer in humans. DNA Repair. 71, 198–204 (2018).
Article CAS PubMed PubMed Central Google Scholar
Santi, D. V., Norment, A. & Garrett, C. E. Covalent bond formation between a DNA-cytosine methyltransferase and DNA containing 5-azacytosine. Proc. Natl Acad. Sci. USA 81, 6993–6997 (1984).
Article CAS PubMed PubMed Central Google Scholar
Maslov, A. Y. et al. 5-Aza-2′-deoxycytidine-induced genome rearrangements are mediated by DNMT1. Oncogene 31, 5172–5179 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mohni, K. N. et al. HMCES maintains genome integrity by shielding abasic sites in single-strand DNA. Cell 176, 144–153.e13 (2019).
Article CAS PubMed Google Scholar
De Groot, A. C., Flyvholm, M. A., Lensen, G., Menné, T. & Coenraads, P. J. Formaldehyde-releasers: Relationship to formaldehyde contact allergy. Contact allergy to formaldehyde and inventory of formaldehyde-releasers. Contact Dermat. 61, 63–85 (2009).
Article Google Scholar
Pontel, L. B. et al. Endogenous formaldehyde is a hematopoietic stem cell genotoxin and metabolic carcinogen. Mol. Cell 60, 177–188 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hou, H. & Yu, H. Structural insights into histone lysine demethylation. Curr. Opin. Struct. Biol. 20, 739–748 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ayala, A., Muñoz, M. F. & Argüelles, S. Lipid peroxidation: production, metabolism, and signaling mechanisms of malondialdehyde and 4-hydroxy-2-nonenal. Oxid. Med. Cell. Longev. 2014, 1–31 (2014).
Article CAS Google Scholar
Traube, F. R. & Carell, T. The chemistries and consequences of DNA and RNA methylation and demethylation. RNA Biol. 14, 1099–1107 (2017).
Article PubMed PubMed Central Google Scholar
Ide, H., Nakano, T., Salem, A. M. H. & Shoulkamy, M. I. DNA–protein cross-links: formidable challenges to maintaining genome integrity. DNA Repair 71, 190–197 (2018).
Article CAS PubMed Google Scholar
Nakano, T. et al. Nucleotide excision repair and homologous recombination systems commit differentially to the repair of DNA-protein crosslinks. Mol. Cell 28, 147–158 (2007).
Article CAS PubMed Google Scholar
de Graaf, B., Clore, A. & McCullough, A. K. Cellular pathways for DNA repair and damage tolerance of formaldehyde-induced DNA-protein crosslinks. DNA Repair 8, 1207–1214 (2009).
Article PubMed PubMed Central CAS Google Scholar
Nakano, T. et al. Homologous recombination but not nucleotide excision repair plays a pivotal role in tolerance of DNA-protein cross-links in mammalian cells. J. Biol. Chem. 284, 27065–27076 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hoa, N. N. et al. Mre11 is essential for the removal of lethal topoisomerase 2 covalent cleavage complexes. Mol. Cell 64, 580–592 (2016).
Article CAS PubMed Google Scholar
Deshpande, R. A., Lee, J. H., Arora, S. & Paull, T. T. Nbs1 converts the human Mre11/Rad50 nuclease complex into an endo/exonuclease machine specific for protein-DNA adducts. Mol. Cell 64, 593–606 (2016).
Article CAS PubMed Google Scholar
Quievryn, G. & Zhitkovich, A. Loss of DNA–protein crosslinks from formaldehyde-exposed cells occurs through spontaneous hydrolysis and an active repair process linked to proteosome function. Carcinogenesis 21, 1573–1580 (2000).
Article CAS PubMed Google Scholar
Pommier, Y. et al. Tyrosyl-DNA-phosphodiesterases (TDP1 and TDP2). DNA Repair 19, 114–129 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yang, S. W. et al. A eukaryotic enzyme that can disjoin dead-end covalent complexes between DNA and type I topoisomerases. Proc. Natl Acad. Sci. USA 93, 11534–11539 (1996).
Article CAS PubMed PubMed Central Google Scholar
Debethune, L. Processing of nucleopeptides mimicking the topoisomerase I-DNA covalent complex by tyrosyl-DNA phosphodiesterase. Nucleic Acids Res. 30, 1198–1204 (2002).
Article CAS PubMed PubMed Central Google Scholar
Interthal, H., Chen, H. J. & Champoux, J. J. Human Tdp1 cleaves a broad spectrum of substrates, including phosphoamide linkages. J. Biol. Chem. 280, 36518–36528 (2005).
Article CAS PubMed Google Scholar
Schellenberg, M. J. et al. ZATT (ZNF451)–mediated resolution of topoisomerase 2 DNA-protein cross-links. Science 357, 1412–1416 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gao, R. et al. Proteolytic degradation of topoisomerase II (Top2) enables the processing of Top2·DNA and Top2·RNA covalent complexes by tyrosyl-DNA-phosphodiesterase 2 (TDP2). J. Biol. Chem. 289, 17960–17969 (2014).
Article CAS PubMed PubMed Central Google Scholar
Duxin, J. P., Dewar, J. M., Yardimci, H. & Walter, J. C. Repair of a DNA-protein crosslink by replication-coupled proteolysis. Cell 159, 346–357 (2014). This study has described for the first time that DPC proteolysis repair happens during DNA replication.
Article CAS PubMed PubMed Central Google Scholar
Reinking, H. K., Hofmann, K. & Stingele, J. Function and evolution of the DNA-protein crosslink proteases Wss1 and SPRTN. DNA Repair 88, 102822 (2020).
Article CAS PubMed Google Scholar
Yang, X. et al. Structural analysis of Wss1 protein from saccharomyces cerevisiae. Sci. Rep. 7, 1–9 (2017).
CAS Google Scholar
Balakirev, M. Y. et al. Wss1 metalloprotease partners with Cdc48/Doa1 in processing genotoxic SUMO conjugates. Elife 4, e06763 (2015).
Li, F., Raczynska, J. E., Chen, Z. & Yu, H. Structural insight into DNA-dependent activation of human metalloprotease spartan. Cell Rep. 26, 3336–3346.e4 (2019).
Article CAS PubMed Google Scholar
Reinking, H. K. et al. DNA structure-specific cleavage of DNA-protein crosslinks by the SPRTN protease. Mol. Cell 80, 102–113.e6 (2020). This article has described a second layer of regulation for the protease SPRTN, which is activated by a specific combination of ssDNA and dsDNA.
Article CAS PubMed PubMed Central Google Scholar
Carmell, M. A. et al. A widely employed germ cell marker is an ancient disordered protein with reproductive functions in diverse eukaryotes. Elife 5, e19993 (2016).
Bhargava, V. et al. GCNA preserves genome integrity and fertility across species. Dev. Cell 52, 38–52.e10 (2020).
Article CAS PubMed Google Scholar
Dokshin, G. A. et al. GCNA interacts with Spartan and Topoisomerase II to regulate genome stability. Dev. Cell 52, 53–68.e6 (2020).
Article CAS PubMed Google Scholar
Borgermann, N. et al. SUMOylation promotes protective responses to DNA‐protein crosslinks. EMBO J. 38, e101496 (2019).
Nowicka, U. et al. DNA-damage-inducible 1 protein (Ddi1) contains an uncharacteristic ubiquitin-like domain that binds ubiquitin. Structure 23, 542–557 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kottemann, M. C., Conti, B. A., Lach, F. P. & Smogorzewska, A. Removal of RTF2 from stalled replisomes promotes maintenance of genome integrity. Mol. Cell 69, 24–35.e5 (2018).
Article CAS PubMed Google Scholar
Serbyn, N. et al. The aspartic protease Ddi1 contributes to DNA-protein crosslink repair in yeast. Mol. Cell 77, 1066–1079.e9 (2020). This article has reported the existence of a second protease in S.cerevisiae with overlapping functions to Wss1 during DNA replication.
Article CAS PubMed Google Scholar
Svoboda, M., Konvalinka, J., Trempe, J. F. & Grantz Saskova, K. The yeast proteases Ddi1 and Wss1 are both involved in the DNA replication stress response. DNA Repair 80, 45–51 (2019).
Article CAS PubMed Google Scholar
Mercier, S. et al. Mutations in FAM111b cause hereditary fibrosing poikiloderma with tendon contracture, myopathy, and pulmonary fibrosis. Am. J. Hum. Genet. 93, 1100–1107 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fine, D. A. et al. Identification of FAM111A as an SV40 host range restriction and adenovirus helper factor. PLoS Pathog. 8, e1002949 (2012).
Panda, D., Fernandez, D. J., Lal, M., Buehler, E. & Moss, B. Triad of human cellular proteins, IRF2, FAM111A, and RFC3, restrict replication of orthopoxvirus SPI-1 host-range mutants. Proc. Natl Acad. Sci. USA 114, 3720–3725 (2017).
Article CAS PubMed PubMed Central Google Scholar
Alabert, C. et al. Nascent chromatin capture proteomics determines chromatin dynamics during DNA replication and identifies unknown fork components. Nat. Cell Biol. 16, 281–291 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kojima, Y. et al. FAM111A protects replication forks from protein obstacles via its trypsin-like domain. Nat. Commun. 11, 1–14 (2020). This study has discussed the existence of a second DPC protease in humans.
Hoffmann, S. et al. FAM111 protease activity undermines cellular fitness and is amplified by gain‐of‐function mutations in human disease. EMBO Rep. 21, e50662 (2020).
Yip, M. C. J., Bodnar, N. O. & Rapoport, T. A. Ddi1 is a ubiquitin-dependent protease. Proc. Natl Acad. Sci. USA 117, 7776–7781 (2020).
Article CAS PubMed PubMed Central Google Scholar
Dirac-Svejstrup, A. B. et al. DDI2 is a ubiquitin-directed endoprotease responsible for cleavage of transcription factor NRF1. Mol. Cell 79, 332–341.e7 (2020).
Article CAS PubMed PubMed Central Google Scholar
Toth, A., Hegedus, L., Juhasz, S., Haracska, L. & Burkovics, P. The DNA-binding box of human SPARTAN contributes to the targeting of Polη to DNA damage sites. DNA Repair. 49, 33–42 (2017).
Article CAS PubMed Google Scholar
Jambhekar, A., Dhall, A. & Shi, Y. Roles and regulation of histone methylation in animal development. Nat. Rev. Mol. Cell Biol. 20, 625–641 (2019).
Article CAS PubMed PubMed Central Google Scholar
Maskey, R. S. et al. Spartan deficiency causes accumulation of Topoisomerase 1 cleavage complexes and tumorigenesis. Nucleic Acids Res. 45, 4564–4576 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mórocz, M. et al. DNA-dependent protease activity of human Spartan facilitates replication of DNA-protein crosslink-containing DNA. Nucleic Acids Res. 45, 3172–3188 (2017).
PubMed PubMed Central Google Scholar
Halder, S. et al. SPRTN protease and checkpoint kinase 1 cross-activation loop safeguards DNA replication. Nat. Commun. 10, 1–18 (2019). This study has discovered one of the non-DPC-related functions of proteases.
Lessel, D. et al. Mutations in SPRTN cause early onset hepatocellular carcinoma, genomic instability and progeroid features. Nat. Genet. 46, 1239–1244 (2014). This article has described for the first time the link between SPRTN mutations and aging diseases in human.
Article CAS PubMed PubMed Central Google Scholar
Rios-Szwed, D. O. et al. FAM111A regulates replication origin activation and cell fitness. bioRxiv 2020.04.22.055574. https://doi.org/10.1101/2020.04.22.055574 (2020).
Vaz, B. et al. SPRTN protease and SUMOylation coordinate DNA-protein crosslink repair to prevent genome instability. bioRxiv 2020.02.14.949289. https://doi.org/10.1101/2020.02.14.949289 (2020).
Sun, Y. et al. A conserved SUMO-Ubiquitin pathway directed by RNF4/SLX5-SLX8 and PIAS4/SIZ1 drives proteasomal degradation of topoisomerase DNA-protein crosslinks. bioRxiv 707661. https://doi.org/10.1101/707661 (2019).
Perry, M. et al. USP11 deubiquitinates monoubiquitinated SPRTN to repair DNA-protein crosslinks. bioRxiv 2020.06.30.180471. https://doi.org/10.1101/2020.06.30.180471 (2020).
Huang, J. et al. Tandem deubiquitination and acetylation of SPRTN promotes DNA-protein crosslink repair and protects against aging. Mol. Cell 79, 824–835.e5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mosbech, A. et al. DVC1 (C1orf124) is a DNA damage-targeting p97 adaptor that promotes ubiquitin-dependent responses to replication blocks. Nat. Struct. Mol. Biol. 19, 1084–1092 (2012).
Article CAS PubMed Google Scholar
Davis, E. J. et al. DVC1 (C1orf124) recruits the p97 protein segregase to sites of DNA damage. Nat. Struct. Mol. Biol. 19, 1093–1100 (2012).
Article CAS PubMed Google Scholar
Fielden, J. et al. TEX264 coordinates p97- and SPRTN-mediated resolution of topoisomerase 1-DNA adducts. Nat. Commun. 11, 1–16 (2020). This article has identified the first DPC-specific cofactor for the ATPase p97.
Article CAS Google Scholar
van den Boom, J. & Meyer, H. VCP/p97-mediated unfolding as a principle in protein homeostasis and signaling. Mol. Cell 69, 182–194 (2018).
Article PubMed CAS Google Scholar
Franz, A., Ackermann, L. & Hoppe, T. Ring of change: CDC48/p97 drives protein dynamics at chromatin. Front. Genet. 7, 1–14 (2016).
Article CAS Google Scholar
Nakano, T. et al. Translocation and stability of replicative DNA helicases upon encountering DNA-protein cross-links. J. Biol. Chem. 288, 4649–4658 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hsiang, Y. H., Lihou, M. G. & Liu, L. F. Arrest of replication forks by drug-stabilized topoisomerase I-DNA cleavable complexes as a mechanism of cell killing by camptothecin. Cancer Res. 49, 5077–5082 (1989).
CAS PubMed Google Scholar
Kumari, A., Lim, Y. X., Newell, A. H., Olson, S. B. & McCullough, A. K. Formaldehyde-induced genome instability is suppressed by an XPF-dependent pathway. DNA Repair 11, 236–246 (2012).
Article CAS PubMed Google Scholar
Cortez, D. Replication-coupled DNA repair. Mol. Cell 74, 866–876 (2019).
Article CAS PubMed PubMed Central Google Scholar
Larsen, N. B. et al. Replication-coupled DNA-protein crosslink repair by SPRTN and the proteasome in Xenopus egg extracts. Mol. Cell 73, 574–588.e7 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sparks, J. L. et al. The CMG helicase bypasses DNA-protein cross-links to facilitate their repair. Cell 176, 167–181.e21 (2019).
Article CAS PubMed Google Scholar
Nie, M. et al. Activation of FAM111A protease induces defects in nuclear function that likely underlie its roles in disease and viral restriction. bioRxiv 2020.05.04.077594. https://doi.org/10.1101/2020.05.04.077594 (2020).
Branzei, D. & Psakhye, I. DNA damage tolerance. Curr. Opin. Cell Biol. 40, 137–144 (2016).
Article CAS PubMed Google Scholar
Juhasz, S. et al. Characterization of human Spartan/C1orf124, an ubiquitin-PCNA interacting regulator of DNA damage tolerance. Nucleic Acids Res. 40, 10795–10808 (2012).
Article CAS PubMed PubMed Central Google Scholar
Machida, Y., Kim, M. S. & Machida, Y. J. Spartan/C1orf124 is important to prevent UV-induced mutagenesis. Cell Cycle 11, 3395–3402 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ghosal, G., Leung, J. W. C., Nair, B. C., Fong, K. W. & Chen, J. Proliferating cell nuclear antigen (PCNA)-binding protein C1orf124 is a regulator of translesion synthesis. J. Biol. Chem. 287, 34225–34233 (2012).
Article CAS PubMed PubMed Central Google Scholar
Centore, R. C., Yazinski, S. A., Tse, A. & Zou, L. Spartan/C1orf124, a reader of PCNA ubiquitylation and a regulator of UV-induced DNA damage response. Mol. Cell 46, 625–635 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pashev, I. G., Dimitrov, S. I. & Angelov, D. Crosslinking proteins to nucleic acids by ultraviolet laser irradiation. Trends Biochem. Sci. 16, 323–326 (1991).
Article CAS PubMed Google Scholar
Smith, K. C. Dose dependent decrease in extractability of DNA from bacteria following irradiation with ultraviolet light or with visible light plus dye. Biochem. Biophys. Res. Commun. 8, 157–163 (1962).
Article CAS PubMed Google Scholar
SHETLAR, M. D., CHRISTENSEN, J. & HOM, K. Photochemical addition of amino acids and peptides to DNA. Photochem. Photobiol. 39, 125–133 (1984).
Article CAS PubMed Google Scholar
Nakazato, A. et al. SPARTAN promotes genetic diversification of the immunoglobulin-variable gene locus in avian DT40 cells. DNA Repair 68, 50–57 (2018).
Article CAS PubMed Google Scholar
Grishaw, W. J. A morphologic study of deoxyribonucleic acid synthesis and cell proliferation in regenerating rat liver; autoradiography with thymidine-H-3. Cancer Res. 22, 842–849 (1962).
Google Scholar
Delabaere, L. et al. The spartan ortholog maternal haploid is required for paternal chromosome integrity in the drosophila zygote. Curr. Biol. 24, 2281–2287 (2014).
Article CAS PubMed Google Scholar
Tang, X. et al. Maternal haploid, a metalloprotease enriched at the largest satellite repeat and essential for genome integrity in Drosophila embryos. Genetics 206, 1829–1839 (2017).
Article CAS PubMed PubMed Central Google Scholar
Brŕgeon, D. & Doetsch, P. W. Transcriptional mutagenesis: causes and involvement in tumour development. Nat. Rev. Cancer 11, 218–227 (2011).
Article CAS Google Scholar
Heckmann, I., Kern, M. J., Pfander, B. & Jentsch, S. A SUMO-dependent pathway controls elongating RNA Polymerase II upon UV-induced damage. Sci. Rep. 9, 17914 (2019).
Article PubMed PubMed Central CAS Google Scholar
Verma, R., Oania, R., Fang, R., Smith, G. T. & Deshaies, R. J. Cdc48/p97 mediates UV-dependent turnover of RNA Pol II. Mol. Cell 41, 82–92 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bedford, L., Paine, S., Sheppard, P. W., Mayer, R. J. & Roelofs, J. Assembly, structure, and function of the 26S proteasome. Trends Cell Biol. 20, 391–401 (2010).
Article CAS PubMed PubMed Central Google Scholar
Collins, G. A. & Goldberg, A. L. The Logic of the 26S Proteasome. Cell 169, 792–806 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mao, Y., Desai, S. D., Ting, C. Y., Hwang, J. & Liu, L. F. 26S proteasome-mediated degradation of topoisomerase II cleavable complexes. J. Biol. Chem. 276, 40652–40658 (2001).
Article CAS PubMed Google Scholar
Lin, C. P., Ban, Y., Lyu, Y. L., Desai, S. D. & Liu, L. F. A ubiquitin-proteasome pathway for the repair of topoisomerase I-DNA covalent complexes. J. Biol. Chem. 283, 21074–21083 (2008).
Article CAS PubMed PubMed Central Google Scholar
Desai, S. D., Liu, L. F., Vazquez-Abad, D. & D’Arpa, P. Ubiquitin-dependent destruction of topoisomerase I is stimulated by the antitumor drug camptothecin. J. Biol. Chem. 272, 24159–24164 (1997).
Article CAS PubMed Google Scholar
Hsiang, Y. H. & Liu, L. F. Identification of mammalian dna topoisomerase i as an intracellular target of the anticancer drug camptothecin. Cancer Res. 48, 1722–1726 (1988).
CAS PubMed Google Scholar
Finley, D. Recognition and processing of ubiquitin-protein conjugates by the proteasome. Annu. Rev. Biochem. 78, 477–513 (2009).
Article CAS PubMed PubMed Central Google Scholar
Elsasser, S. et al. Proteasome subunit Rpn1 binds ubiquitin-like protein domains. Nat. Cell Biol. 4, 725–730 (2002).
Article CAS PubMed Google Scholar
Gomez, T. A., Kolawa, N., Gee, M., Sweredoski, M. J. & Deshaies, R. J. Identification of a functional docking site in the Rpn1 LRR domain for the UBA-UBL domain protein Ddi1. BMC Biol. 9, 33 (2011).
Trempe, J. F. et al. Structural studies of the yeast DNA damage-inducible protein Ddi1 reveal domain architecture of this eukaryotic protein family. Sci. Rep. 6, 1–13 (2016).
Sirbu, B. M. et al. Identification of proteins at active, stalled, and collapsed replication forks using isolation of proteins on nascent DNA (iPOND) coupled with mass spectrometry. J. Biol. Chem. 288, 31458–31467 (2013).
Article CAS PubMed PubMed Central Google Scholar
Maddi, K. et al. Wss1 promotes replication stress tolerance by degrading histones. Cell Rep. 30, 3117–3126.e4 (2020). This study has discovered one of the non-DPC-related functions of proteases.
Article CAS PubMed Google Scholar
Fu, Y. V. et al. Selective bypass of a lagging strand roadblock by the eukaryotic replicative DNA helicase. Cell 146, 931–941 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pascal, J. M. The comings and goings of PARP-1 in response to DNA damage. DNA Repair 71, 177–182 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hanzlikova, H. et al. The importance of poly(ADP-Ribose) polymerase as a sensor of unligated Okazaki fragments during DNA replication. Mol. Cell 71, 319–331.e3 (2018).
Article CAS PubMed PubMed Central Google Scholar
Murai, J. et al. Trapping of PARP1 and PARP2 by clinical PARP inhibitors. Cancer Res. 72, 5588–5599 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lord, C. J. & Ashworth, A. PARP inhibitors: synthetic lethality in the clinic. Science 355, 1152–1158 (2017).
Article CAS PubMed PubMed Central Google Scholar
Murai, J. et al. Stereospecific PARP trapping by BMN 673 and comparison with olaparib and rucaparib. Mol. Cancer Ther. 13, 433–443 (2014).
Article CAS PubMed Google Scholar
Bruhn, C. & Foiani, M. A model of DNA damage response activation at stalled replication forks by SPRTN. Nat. Commun. 10, 16–18 (2019).
Article CAS Google Scholar
Gaillard, H., García-Muse, T. & Aguilera, A. Replication stress and cancer. Nat. Rev. Cancer 15, 276–280 (2015).
Article CAS PubMed Google Scholar
Macheret, M. & Halazonetis, T. D. DNA replication stress as a hallmark of cancer. Annu. Rev. Pathol. Mech. Dis. 10, 425–448 (2015).
Article CAS Google Scholar
Lukas, C. et al. 53BP1 nuclear bodies form around DNA lesions generated by mitotic transmission of chromosomes under replication stress. Nat. Cell Biol. 13, 243–253 (2011).
Article CAS PubMed Google Scholar
Maskey, R. S. et al. Spartan deficiency causes genomic instability and progeroid phenotypes. Nat. Commun. 5, 1–12 (2014).
Article CAS Google Scholar
López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M. & Kroemer, G. The hallmarks of aging. Cell 153, 1194 (2013).
Article PubMed PubMed Central CAS Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Article CAS PubMed Google Scholar
Isojima, T. et al. A recurrent de novo FAM111A mutation causes Kenny-Caffey syndrome type 2. J. Bone Miner. Res. 29, 992–998 (2014).
Article CAS PubMed Google Scholar
Unger, S. et al. FAM111A mutations result in hypoparathyroidism and impaired skeletal development. Am. J. Hum. Genet. 92, 990–995 (2013).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We deeply apologize to the researchers whose work was not cited. We thank a Medical Research Council (MRC) Programme grant to K.R. (MC_EX_MR/K022830/1). A.R. was supported by an EMBO LTF (ALTF 1109-2017). We would like to thank our DPhil student Gwendoline Hoslett for helping us with the text editing and her comments.

Author information

Authors and Affiliations

Medical Research Council (MRC) Oxford Institute for Radiation Oncology, Department of Oncology, University of Oxford, Roosevelt Drive, OX3 7DQ, Oxford, UK
Annamaria Ruggiano & Kristijan Ramadan

Authors

Annamaria Ruggiano
View author publications
You can also search for this author in PubMed Google Scholar
Kristijan Ramadan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R. wrote the draft and prepared the figures. K.R. initiated and supervised this work. A.R. and K.R prepared the final version of this manuscript.

Corresponding author

Correspondence to Kristijan Ramadan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ruggiano, A., Ramadan, K. DNA–protein crosslink proteases in genome stability. Commun Biol 4, 11 (2021). https://doi.org/10.1038/s42003-020-01539-3

Download citation

Received: 26 June 2020
Accepted: 01 December 2020
Published: 04 January 2021
DOI: https://doi.org/10.1038/s42003-020-01539-3

This article is cited by

Transcription-coupled DNA–protein crosslink repair by CSB and CRL4CSA-mediated degradation
- Marjolein van Sluis
- Qing Yu
- Jurgen A. Marteijn
Nature Cell Biology (2024)
Transcription-coupled repair of DNA–protein cross-links depends on CSA and CSB
- Christopher J. Carnie
- Aleida C. Acampora
- Julian Stingele
Nature Cell Biology (2024)
Targeting DNA damage response pathways in cancer
- Florian J. Groelly
- Matthew Fawkes
- Madalena Tarsounas
Nature Reviews Cancer (2023)
FAM111A is dispensable for electrolyte homeostasis in mice
- Barnabas P. Ilenwabor
- Heidi Schigt
- Jeroen H. F. de Baaij
Scientific Reports (2022)
Human topoisomerases and their roles in genome stability and organization
- Yves Pommier
- André Nussenzweig
- Caroline Austin
Nature Reviews Molecular Cell Biology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.