Introduction

Cells have evolved sophisticated mechanisms of DNA damage response and repair to minimize mutations that lead to genomic instability and cancer development. A variety of nucleases are critical DNA repair factors that process damaged DNA. In particular, the XPF family of structure-specific endonucleases is implicated in the repair of DNA lesions caused by UV and DNA interstrand crosslinking agents1,2. Mutation of these genes leads to genomic instability and cancer predisposition. XPF mutations can cause xeroderma pigmentosum, a genetic disease that is highly prone to skin cancer. Fanconi anemia (FA), complementation group M (FANCM) and FA-associated protein, 24-KD (FAAP24) belong to the XPF family of proteins and are implicated in the repair of UV and interstrand crosslink (ICL) damage1,3. FANCM mutation has been found in the rare genetic disorder FA4. Like xeroderma pigmentosum, FA is a genetically heterogeneous disease. Mutations in any one of the 15 FA genes can cause this disease5, with manifestation including developmental abnormalities and progressive bone marrow failure that eventually results in pancytopenia. At the cellular level, FA-mutant cells display hypersensitivity to DNA interstrand crosslinking agents, indicating the importance of the FA pathway in responding to complex DNA damage and strong replication blockage6,7.

FANCM and FAAP24 function coordinately with other FA and FA-associated proteins to protect cells against ICL damage. They dynamically interact with the FA core complex composed of FA proteins (FANCA, B, C, E, F, G and L) and FA-associated proteins (MHF1, MHF2, FAAP20 and FAAP100)8,9,10,11,12,13,14,15. In response to DNA damage, the FA core complex serves as an E3 ligase and monoubiquitinates FANCI and FANCD216,17,18. Nucleolytic ICL processing is then likely mediated by FAN1 and structure-specific nucleases associated with FANCP/SLX419,20,21,22,23,24, both of which are thought to be recruited through the binding to ubiquitin. Four other FA/breast cancer susceptibility proteins, FANCD1/BRCA2, FANCJ/BRIP1, FANCN/PALB2 and FANCO/Rad51C17,25,26,27,28,29, function downstream to reestablish replication forks, presumably via homologous recombination.

Previous studies implicated that the DNA-interacting activities of the FANCM/FAAP24 complex are involved in the FA pathway function. FANCM contains a DEAH helicase domain and a degenerated endonuclease domain, and exhibits ATP-dependent DNA translocase activity4,9,30. FAAP24 is an interacting protein of FANCM. FAAP24 also contains an ERCC4 domain likely possessing DNA-binding activity9. Together with the MHF1/MHF2 histone-fold complex, FANCM/FAAP24 cooperatively bind to and stabilize/remodel stalled replication forks15,31,32, and target the FA core complex to DNA damage sites to initiate the FA-dependent DNA damage response signaling33. FANCM/FAAP24 also carry out additional functions that are important for genomic integrity, including suppression of sister chromatid exchange (SCE) formation34, recombination-independent ICL repair34, and ATR-mediated checkpoint activation34,35,36,37.

Biochemically, FANCM and FAAP24 display affinity to various DNA structures with preferences to fork structure and single-stranded DNA (ssDNA), respectively. No detectable binding of FAAP24 to double-stranded DNA (dsDNA) was observed9,15. How FAAP24 exerts these diverse functions and interacts with FANCM in the context of the heterodimer was not well understood. In the present study, we determined the NMR structure of both N-terminal and C-terminal domains of FAAP24 independently. Based on the structural insights, we defined its ssDNA-binding activity and the critical residues of FAAP24 for the FA pathway activation, DNA damage checkpoint function, and cellular survival against ICLs. These data provide insights into how FAAP24 maintains genomic integrity at both structural and molecular levels.

Results

Solution structures of the ERCC4 and (HhH)2 domains of human FAAP24

The solution structures of the N-terminal domain (residues 1-139) and the C-terminal domain (residues 155-215) of human FAAP24 were determined using multi-dimensional NMR. 1 713 NOE and 172 dihedral restraints were used to calculate the N-terminal ERCC4-like domain of FAAP24. The C-terminal (HhH)2 domain of FAAP24 was determined based on 828 NOE restraints and 108 dihedral restraints. An ensemble of 10 conformers with the lowest energy and good convergence was selected out of 100 calculated structures. Statistical data regarding the quality and precision of the ensembles for the two domains are summarized in Supplementary information, Table S1. The backbone chemical shift assignment (CO, Cα, Cβ, amide H and N) of the full-length (FL) protein was also achieved successfully. However, the TALOS+ result indicates that a 22-residue linker connecting these two domains is exceedingly flexible in solution. The N-terminal tail (residues 1-18) appears unstructured and dynamic in solution.

The structures of the FAAP24 N-terminal and C-terminal domains show a typical ERCC4 nuclease domain comprising a central six-stranded β-sheet and five flanking α helices on both sides (Figure 1A) and an (HhH)2 domain (Figure 1B), respectively. Therefore, FAAP24 appears to be a member of the ERCC1/XPF family. Such conclusion might not be simply predicted via sequence alignment (Supplementary information, Figure S1).

Figure 1
figure 1

Solution structure of human FAAP24. (A) Solution structure of the N-terminal ERCC4 nuclease domain of FAAP24. Left panel, stereo-view showing the backbones of 10 superimposed NMR-derived structures of the N-terminal domain of FAAP24. Right panel, ribbon diagram of a representative NMR structure of the ERCC4 nuclease domain. The secondary structures of ERCC4 nuclease domain are labeled. (B) Solution structure of the C-terminal (HhH)2 domain of FAAP24. Left panel, stereo-view showing the backbones of 10 superimposed NMR-derived structures of the (HhH)2 domain of FAAP24. Right panel, ribbon diagram of a representative NMR structure of the (HhH)2 domain. The secondary structures of (HhH)2 domain are labeled. (C) Structural comparison of four ERCC4 nuclease domains. Cyan: solution structure of the ERCC4 domain of hFAAP24 (PDB: 2M9M); Green, crystal structure of ApeXPF ERCC4 domain (PDB: 2BHN); Wheat, crystal structure of hERCC1 ERCC4 domain (PDB: 2A1I); Yellow, solution structure of hERCC1 ERCC4 domain (PDB: 2JPD). (D) Structural comparison of four (HhH)2 domains. Cyan, solution structure of hFAAP24 (HhH)2 domain (PDB: 2M9N); Blue, solution structure of hFAAP24 (HhH)2 domain (PDB: 2LYH)39; Green, solution structure of hERCC1 (HhH)2 domain (PDB: 1Z00:A); Salmon, solution structure of hXPF (HhH)2 domain (PDB: 1Z00:B).

The structure of the ERCC4 nuclease domain of FAAP24 was compared with that of ApeXPF (PDB: 2BHN) and human ERCC1 (PDB: 2JPD and 2A1I) (Figure 1C). The RMSD values of these structural superimpositions are 2.1, 2.3 and 2.2 Å, respectively. There are still some minor differences probably related to the different length of the loops, helices or strands. It was noticed that the α4 helix (residues 96-112) of the FAAP24 ERCC4 nuclease domain was kinked, which might be caused by the presence of a residue Pro (P102) in the primary sequence of this helix. It was reported that the GDXnERKX3D motif in the ERCC4 nuclease domain of human ERCC1 would be critical for the nucleolytic activity, in which the residue Glu binds to the catalytic metal ion and mutation of the Arg and Lys led to reduced nuclease activity38. In FAAP24, the electrically charged residues ERK are substituted by three hydrophobic residues LYV (Supplementary information, Figure S2A). The electrostatic surfaces showed that the key Glu residues of ApeXPF and pfuXPF are located in a negatively charged cavity, while the same surface of FAAP24 is totally hydrophobic and is surrounded by positive charges (Supplementary information, Figure S2B). In addition, compared with XPF, the helix α3 of FAAP24 covers the active site. This ERK-to-LYV substitution and the orientation of helix α3 most likely degenerate the nuclease activity of FAAP24, consistent with experimental observations that the FANCM-FAAP24 complex has no endonuclease activity4,9.

The (HhH)2 domain of FAAP24 consists of two HhH motifs bridged by a connecting helix, which forms an integral five-helical domain bearing two HhH motifs, displaying a pseudo-two-fold symmetry (Figure 1B). The structure of the (HhH)2 domain of FAAP24 was compared with the recently reported NMR structure of the hFAAP24 (HhH)2 domain (PDB: 2LYH)39, the solution structures of the (HhH)2 domain of hERCC1 (PDB: 1Z00:A) and hXPF (PDB: 1Z00:B) (Figure 1D). The RMSD values of these structural superimpositions are 1.5, 2.5 and 2.6 Å, respectively. It showed that helices α7-α10 were generally well-superimposed between FAAP24 and ERCC1/XFP, while the helix α6 of FAAP24 and ERCC1/XPF showed a deflection angle of around 50 degrees (Supplementary information, Figure S2C). This deviation is consistent with the structure of the hFAAP24 (HhH)2 domain reported by Wienk et al.39. (HhH)2 domains are usually found in non-sequence-specific DNA-binding proteins40. A consensus GhG motif (h depicts a hydrophobic residue) located in the hairpin is directly involved in protein-DNA interaction. In FAAP24, a G168V169G170 motif is located between helices α6 and α7. This indicates the possibility that the (HhH)2 domain of FAAP24 could possess DNA-binding ability.

The (HhH)2 domain of FAAP24 is critical for both FANCM association and DNA binding

We next sought to determine which domain(s) of FAAP24 mediate its interaction with FANCM and DNA binding, respectively. To this end, we generated a series of deletion mutants covering the full-length protein (Figure 2A), and determined the FANCM-interacting domain(s) of FAAP24 via co-immunoprecipitation (Co-IP) experiments. We co-expressed triple-Myc-tagged full-length FAAP24 or its deletion mutants together with FANCM in HEK293T cells, and immunoprecipitated the protein complex using an anti-Myc antibody. As shown in Figure 2B, the C-terminal deletion mutant D5 completely lost the ability to associate with FANCM, suggesting that the C-terminal (HhH)2 motif is important for FANCM interaction.

Figure 2
figure 2

The (HhH)2 domain of FAAP24 is required for FANCM interaction and DNA binding. (A) Schematics of FAAP24 truncation mutants used for Co-IP experiment in B. Pink-shaded areas depict the degenerated nuclease motif. Green-shaded areas depict the tandem HhH repeat. (B) Co-IP of Myc-tagged wild-type FAAP24 or the truncation mutants with FANCM. HEK293T cells were co-transfected with plasmids encoding Myc-tagged FAAP24 and FANCM, and the cell lysates were subjected to immunoprecipitation with an Myc antibody. Asterisks indicate nonspecific bands associated with anti-Myc antibody. (C) SPR assay measuring ssDNA-binding affinity of wild-type FAAP24 and the C-terminal (HhH)2 domain truncation mutant (ΔC).

Tandem HhH motifs are found in many DNA-binding proteins40 and the (HhH)2 domain is therefore a plausible candidate domain that mediates the DNA-binding activity of FAAP24. To test this, we deleted the entire C-terminal (HhH)2 domain and tested the in vitro DNA-binding activity of the truncated protein (ΔC). As the preferred DNA substrate for FAAP24 is ssDNA9, we tested the full-length FAAP24 protein and the ΔC mutant for their binding of a 20-nt ssDNA in vitro using SPR assays. As shown in Figure 2C, the deletion of the (HhH)2 domain completely abolished the robust ssDNA-binding activity of FAAP24 as observed with the full-length protein, suggesting that the C-terminal (HhH)2 motif is also responsible for ssDNA binding.

Identification of residues important for FAAP24 binding to ssDNA

The nucleic acid-binding properties of FAAP24 were further characterized by NMR titration with 12-nt and 20-nt ssDNA (Figure 3A). We found that the overall fold of FAAP24 remains unaltered, which suggested that the observed chemical shift perturbations arose from specific contacts with ssDNA. Critical residues for DNA binding were identified using the backbone assignments for FAAP24. A comparison of the 15N HSQC spectra for FAAP24 in the presence and absence of DNA revealed that the following residues were markedly perturbed when FAAP24 interacts with either 12-nt or 20-nt ssDNA: G168, G170, K171 and K173 (the amide signal of K173 disappeared upon titration of ssDNA). Chemical shift perturbation data analysis showed that these most prominent changes in chemical shifts involved a cluster of residues within the first HhH motif of the FAAP24 (HhH)2 domain, especially the strictly conserved GVG motif (G168, G170) and two ensuing residues (K171, K173).

Figure 3
figure 3

Identification of critical residues of FAAP24 for ssDNA binding. (A) NMR titration experiments of FL-FAAP24 with 12-nt (upper left) and 20-nt (upper right) ssDNA. Five HSQC spectra were recorded with ssDNA:FL-FAAP24 ratio at 0:1, 0.3:1, 0.5:1, 1:1, and 2:1, respectively. For better illustration, only the first (red, ssDNA:FL-FAAP24 = 0:1) and the last (blue, ssDNA:FL-FAAP24 = 2:1) spectra were shown and overlapped. Peaks that were obviously shifted were labeled by arrows. In each panel, solid, dotted and dashed horizontal lines correspond to the averaged CSP, one and two standard deviations above the averaged CSP, respectively (see Materials and Methods for details). (B) SPR assay measuring ssDNA-binding affinity of the G168A mutant and the C-terminal (HhH)2 domain truncation mutant (ΔC). (C) SPR assay measuring ssDNA-binding affinity of K171A, K173A and K171A/K173A mutants.

To further verify whether the above-identified residues are indeed important for DNA binding, we performed the SPR assay on several single amino-acid substitution mutants. As shown in Figure 3B, the G168A mutation significantly reduced ssDNA-binding affinity of the FAAP24 protein, suggesting that the GVG motif is critical for DNA binding. However, the G170A mutation rendered recombinant FAAP24 protein insoluble, suggesting that a major structural disruption occurred with such substitution.

To verify that K171 and K173 are important for DNA binding, we generated K171A, K173A and K171A/K173A substitution mutants of FAAP24. As shown in Figure 3C, alanine substitution of the lysine residues resulted in a dramatic decrease of the in vitro DNA-binding activity. Quantitatively (Supplementary information, Figure S3A), the binding affinity of wild-type FAAP24 to ssDNA is high with a dissociation constant (Kd) of 3.69E-07M. When mutations were introduced at these positive residues (K171A, K173A and K171A/K173A), the binding affinity reduced to 3.14E-06M, 2.43E-06M and 5.10E-06M, respectively (up to 13-fold reduction). Substitution of the lysine residues with glutamic acid also greatly compromised the DNA-binding activity of FAAP24 (Supplementary information, Figure S3B).

Identification of residues important for in vivo chromatin association

Next we sought to systemically determine which residues are important for FAAP24 DNA binding in vivo. To this end, we generated a series of FAAP24 mutants with amino acid substitutions of conserved and NMR structure-implicated residues (Supplementary information, Figure S4). These mutations cover the core ERKxxxD motif in the ERCC4 domain and key residues in the (HhH)2 domain. We stably expressed triple-Myc-tagged FAAP24 mutants in a FAAP24−/− somatic knockout cell line we previously generated34, and tested the chromatin association of these mutants in the stable cell lines. As controls, we also generated in-parallel stable cell lines expressing wild-type FAAP24 and the D5 mutant. As shown in Figure 4, wild-type FAAP24 associates with chromatin with high affinity. In contrast, FAAP24 D5 mutant, which lacks the (HhH)2 domain, was completely deficient in chromatin association. This result further demonstrates that the (HhH)2 domain is indispensable for FAAP24 to bind DNA. Importantly, the V169A, K173A and K171A/K173A mutants showed significantly reduced affinity with chromatin, again indicating that the GVGKVK motif is important for DNA interaction. In addition, L177A, LLL176-178AAA, L187A and F201A/F211A mutants also displayed unfavorable chromatin association activity, suggesting that these hydrophobic residues are also important for DNA binding.

Figure 4
figure 4

Chromatin association of wild-type and mutant FAAP24. (A-B) FAAP24−/− cells stably complemented with Myc-tagged wild-type or mutant FAAP24 were fractionated, and western blot analysis was performed to detect FAAP24. Chr: chromatin fraction; WCE: whole-cell extract. Arrows indicate Myc-tagged FAAP24 proteins. The # marks Myc-tagged D5 mutant. Asterisks indicate either nonspecific bands associated with the anti-Myc antibody or potential degradation bands.

Consistently, SPR assay on selected mutants also confirmed that V169, LLL176-178, L187 and FF210-211 are important for in vitro DNA binding of FAAP24 (Supplementary information, Figure S5), further validating the importance of these residues in the DNA-binding affinity of FAAP24.

Identification of residues important for FANCM interaction

Our results from the in vitro DNA-binding assay and in vivo chromatin-association experiments are largely consistent with each other, suggesting that chromatin binding of FAAP24 is likely reflective of its intrinsic DNA-binding activity. As both FANCM and the FANCM/FAAP24 complex display in vitro DNA-binding activities, it is important to determine whether FANCM interaction impacts the DNA-binding activity of FAAP24. We tested FAAP24 mutants described above for FANCM interaction. We co-expressed triple-Myc-tagged FAAP24 mutants together with FANCM in HEK293T cells and tested FANCM-FAAP24 interaction by Co-IP experiments. As shown in Supplementary information, Figure S6, single amino-acid substitution within the (HhH)2 domain did not disrupt FAAP24 interaction with FANCM, suggesting that the protein-interacting surface may involve multiple residues of FAAP24. When two to three amino acids were substituted, which likely caused a more severe disruption of the interacting interface, most mutants, including GVG168-170AAA, LLL176-178AAA, VVG198-200AAA and F201A/F211A, lost the ability to associate with FANCM. We also examined the FAAP24 mutants for FANCM interaction in endogenous settings by using lysates from FAAP24−/− cells expressing various FAAP24 mutants for Co-IP experiments. The results (Figure 5) were consistent with those from Co-IP experiments of exogenous proteins. Notably, although two mutants (LLL176-178AAA and F201A/F211A) displayed both decreased DNA binding/chromatin association and abolished FANCM interaction, most point mutants are only defective for one of these two activities, suggesting that FANCM interaction and chromatin association are not interdependent.

Figure 5
figure 5

Co-IP of Myc-tagged wild-type or mutant FAAP24 with FANCM. (A-B) Lysates of FAAP24−/− cells stably complemented with Myc-tagged wild-type or mutant FAAP24 were subjected to immunoprecipitation with an Myc antibody, and western blot analysis was performed to detect FANCM and FAAP24. Arrows indicate Myc-tagged FAAP24 proteins. The # marks Myc-tagged D5 mutant. Asterisks indicate either nonspecific bands associated with the anti-Myc antibody or potential degradation bands.

Both FANCM-interacting and DNA-binding activities are important for FAAP24 function in the FA pathway

A systematic examination of FAAP24 mutants for FANCM interaction and DNA binding led to the identification of mutants defective for one or both of these functions, which suggests a possible segregation of the FANCM-association and DNA-binding functions of FAAP24, and allows a fine dissection of their importance in the FA pathway activation and other DNA damage response events.

We first treated the stable cells carrying various FAAP24 mutants (Supplementary information, Table S2) with mitomycin C (MMC) and examined FANCD2 monoubiquitination, a marker for the FA pathway activation. As shown in Figure 6A, FAAP24−/− cells displayed significant defect of FA pathway activation compared with parental wild-type HCT-116 cells. This defect can be fully rescued by complementation with full-length FAAP24 protein but not the D5 mutant, suggesting that the (HhH)2 domain is indispensable for FAAP24 function in the FA pathway. An ERCC4 domain mutant E63A/D65A was also competent in complementing the FANCD2 monoubiquitination deficiency of FAAP24−/− cells, suggesting that the diverged ERCC4 domain of FAAP24 is less important for the FA pathway activation. Among the (HhH)2 domain mutants, those defective for FANCM interaction (GVG168-170AAA and VVG198-200AAA) or DNA binding (K171A, K173A, K171A/K173A, V169A, L177A and L187A) only partially restored FANCD2 monoubiquitination, while mutants defective for both (LLL176-178AAA and F201A/F211A) showed little rescue of FANCD2 monoubiquitination, suggesting that both FANCM-interacting and DNA-binding activities of FAAP24 are important for the FA pathway activation.

Figure 6
figure 6

Both FANCM-interacting and DNA-binding activities of FAAP24 are required for the FA pathway activation. (A) FANCD2 monoubiquitination in FAAP24−/− cells stably complemented with wild-type or mutant FAAP24. Cells were mock-treated (Ctrl) or treated with 200 ng/ml MMC for 16 h. L and S represent monoubiquitinated and native forms of FANCD2, respectively. (B) Quantification of FANCD2 foci formation in FAAP24−/− cells stably complemented with wild-type or mutant FAAP24. Cells were mock-treated (Ctrl) or treated with 500 ng/ml MMC for 24 h. Data represent three independent experiments and error bars depict SD derived from five data sets.

We also tested FANCD2 foci formation in response to MMC damage in FAAP24−/− cells stably expressing FAAP24 mutants. As shown in Figure 6B and Supplementary information, Figure S7, the results were consistent with FANCD2 monoubiquitination data and further support that both FANCM-interaction and DNA-binding functions of FAAP24 are required for FA pathway activation.

DNA-binding activity of FAAP24 is critical for ATR-dependent checkpoint activation in response to ICL damage

We recently reported that FAAP24 but not FANCM is required for optimal activation of ATR-dependent checkpoint in response to ICL damage. We sought to determine whether FANCM interaction and/or DNA-binding activity of FAAP24 are important for its role in DNA damage checkpoint activation. FAAP24−/− cells stably complemented with various FAAP24 mutants were exposed to MMC and examined for their ATR-mediated checkpoint activation by blotting for Chk1 phosphorylation at Ser317. As shown in Figure 7A, FAAP24−/− cells displayed readily detectable checkpoint activation defect, which could be complemented by wild-type FAAP24 proteins. Mutants that are only defective for FANCM binding were able to fully rescue Chk1 phosphorylation. In contrast, mutants deficient in DNA binding were incompetent in restoring Chk1 activation, indicating that DNA-binding activity of FAAP24 is required for optimal activation of ATR-mediated checkpoint activation.

Figure 7
figure 7

DNA-binding activity of FAAP24 is required for ATR-mediated checkpoint activation in response to ICL damage. (A) Ser317 phosphorylation of Chk1 in FAAP24−/− cells stably complemented with wild-type or mutant FAAP24. Cells were mock-treated (Ctrl) or treated with 150 ng/ml MMC for 6 h. Chk1 and β-actin serve as loading controls. (B) Clonogenic survival of FAAP24−/− cells stably complemented with wild-type or mutant FAAP24. Error bars represent SD from four independent experiments with triplicated plates.

To functionally extend the above results, we tested the sensitivity of mutant-complemented FAAP24−/− cells to MMC. As shown in Figure 7B, the ERCC4 domain mutant E63A/D65A fully rescued the MMC-hypersensitive phenotype of FAAP24−/− cells, indicating that this domain is dispensable for crosslinking damage resistance. In contrast, the D5 mutant-complemented cells displayed low survival comparable to the FAAP24−/− cells, again suggesting that the (HhH)2 domain is critical for FAAP24 function. Mutants defective for FANCM interaction or DNA binding only partially corrected the MMC-hypersensitive phenotype of FAAP24−/− cells, while mutants defective for both showed little correction, pointing to a dual requirement of FANCM association and DNA-binding affinity for FAAP24 to function in cellular resistance to ICL damage.

Discussion

The DNA remodeling enzyme FANCM and its binding partner FAAP24 have been implicated in a number of DNA damage response and repair processes, including the FA pathway activation, ATR-dependent DNA damage checkpoint signaling, and recombination-independent repair of ICLs. These intercalated functions may either depend on the FANCM/FAAP24 complex function or rely on intrinsic biochemical activities of FANCM or FAAP24 alone. For example, FAAP24 functions cooperatively with FANCM to activate the FA pathway and suppress SCE formation, but it also independently promotes optimal ATR-mediated checkpoint activation in response to ICL damage34. To further understand how FAAP24 executes distinct functions to maintain genomic stability, we undertook structural and molecular approaches to dissect how FAAP24 interacts with FANCM and DNA, and how these activities contribute to FAAP24 functions in different settings.

FANCM/FAAP24 exhibit domains reminiscent of those of the XPF/ERCC1 and MUS81/EME1 structure-specific endonuclease complexes. However, neither FANCM nor FAAP24 has been found to possess nuclease activity. We found that folding of the ERCC4 domain of FAAP24 is in line with typical ERCC4 nuclease domains, and the overall structure highly resembles those of ApeXPF and human ERCC1 (Figure 1C). The critical core motif of a catalytic ERCC4 domain consists of ERKxxxD. In ApeXPF, the ERK residues are located in a central β-strand, while the following TSSD residues are part of a following helix structure1. In human FAAP24, the diverged ERKxxxD motif is folded in a similar manner: the LYV and TEAD residues are within a β-strand and a following helix, respectively. Thus, the loss of nuclease activity in FAAP24 does not seem to be attributed to severe local structural folding disruption. Rather, a change of amino-acid properties due to ERK-to-LYV degeneration is detrimental. Substitution of these residues likely interferes with an otherwise functional nuclease active site. Although an E63A/D65A mutant in our study appears to function as well as wild-type FAAP24, it remains unclear whether the diverged ERKxxxD motif in FAAP24 confers other functions.

Previously, the ERCC4 domain of ERCC1 was shown to bind ssDNA in the absence of its (HhH)2 domain41. NMR titration experiments indicated that N99, I102, L132, K213, A214 and the side-chain amide of Q134 contribute to such ssDNA-binding ability. Among them, the only positively charged residue K213 is fully conserved in the ERCC1 family of proteins. However, structural superimposition (Figure 1D) and sequence alignment (Supplementary information, Figure S1) between ERCC4 domains of ERCC1 and FAAP24 showed that the conserved Lys residue is replaced by a hydrophobic residue (V133) in FAAP24. This substitution most likely disrupts the ssDNA affinity of the FAAP24 ERCC4 domain.

On the other hand, (HhH)2 motifs are found in many DNA-binding proteins and are thought to be important for interacting with DNA. In the recently published work the (HhH)2 domain of FAAP24 was suggested to bind dsDNA when a FAAP24 truncate (residues 139-215) was used in the NMR titrations39. We believe that the truncated FAAP24 used in this study might have lost certain constrains from the ERCC4 domain, allowing the (HhH)2 domain to bind both ssDNA and dsDNA. This may underline the discrepancy between what was observed by Wienk et al.39 and our studies using the full-length FAAP24. Using both NMR titration and SPR assays, we found that the (HhH)2 domain of FAAP24 is essential for its ssDNA-binding activity (Figure 3A and 3B). The hairpins in a (HhH)2 domain are important for DNA binding. In particular, the classical glycine-hydrophobic residue-glycine (GhG) hairpins are critical for DNA interactions. In ApeXPF, two GIG hairpins are present and they cooperatively bind the backbone phosphates from two strands of a duplex DNA. Several positively charged residues immediately following the GIG hairpins also contribute to the interaction with the minor groove42. In contrast, FAAP24 contains only one classical GVG hairpin in the first HhH motif, which is immediately followed by positively charged K171 and K173. In the second HhH motif, there is no GhG hairpin and no alkaline residues following the VVG residues. As a result, the (HhH)2 domain of FAAP24 might only make contact with one strand of a duplex DNA. This may dictate that FAAP24 prefers ssDNA (Supplementary information, Figure S8A). In addition to the GVGKVK residues, we found that several hydrophobic residues within the (HhH)2 domain also contribute to ssDNA-binding affinity of FAAP24. Most likely, these residues are important for a stable (HhH)2 structure that allows a favorable DNA interaction. For example, the F210 and F211 residues reside in the tail of the last helix, and together with L158 and I166 on the first helix, they form a hydrophobic core that might be crucial for maintaining the overall (HhH)2 structure by controlling its opening (Supplementary information, Figure S8B).

FANCM contains an ERCC4 domain and an (HhH)2 domain in its C-terminus. However, the C-terminus of FANCM (FANCM1727-2048) by itself does not exhibit DNA-binding activities9. This is probably because that FANCM does not contain classical GhG hairpins in the (HhH)2 domain. The FANCM1727-2048/FAAP24 dimer could bind a variety of substrates with a preference for splayed-arm DNA, suggesting that FAAP24 is important for targeting FANCM to DNA9,33. This resembles the DNA-binding property of XPF/ERCC1. ERCC1 contains one classical GhG hairpin while XPF contains none, and it has been shown that ERCC1 binds ssDNA and is important for targeting XPF to DNA9,43.

FANCM also contains a helicase domain, which confers a DNA translocase activity. Although FANCM1727-2048 does not have DNA-binding activity, full-length FANCM can bind forked DNA structures15. Therefore, FANCM/FAAP24 complex may bind DNA through two independent surfaces, the (HhH)2 domain of FAAP24 and the helicase domain of FANCM. This allows the complex to bind replication fork structures synergistically. In fact, it has been proposed that FANCM/FAAP24 and MHF1/2 complexes cooperatively bind stalled replication fork such that FANCM binds to the branch while MHF1/2 and FAAP24 associate with dsDNA and ssDNA regions, respectively15. This cooperative binding model may underscore why both FANCM-interaction and DNA-binding activities of FAAP24 are required for optimal FA pathway activation in response to ICL damage that stalls replication forks (Figure 6 and Supplementary information, Figure S7).

We showed previously that the function of FAAP24 in FA pathway activation can be segregated from its function in DNA damage checkpoint signaling — the former is attained cooperatively with FANCM while the latter is independent of FANCM. Here we show that disrupting either FANCM interaction or DNA binding of FAAP24 compromised the activation of FA pathway, suggesting that both activities are important. A cooperative and more stable binding of stalled forks by the FANCM/FAAP24 complex may be required for an efficient recruitment of the FA core complex to the damage sites. On the other hand, we found that only the DNA-binding activity of FAAP24 was important for ATR-dependent checkpoint activation in response to ICL damage. This is consistent with the finding that FANCM seems dispensable for this process. FAAP24 was shown to have affinity toward ICL DNA structures, which is a plausible explanation of its role in ICL-induced checkpoint activation. Our results clearly demonstrated that DNA binding-defective FAAP24 mutants do not support robust ATR-dependent checkpoint activation by ICL damage, suggesting that the role of FAAP24 in this process strictly depends on its ssDNA-binding activity. It is possible that the ICL DNA-binding and the ssDNA-binding activities of FAAP24 are important for enhancing ATR/ATRIP recruitment in the context of ICL damage.

Future investigations, especially with crystal structures of FAAP24, co-crystal structures of FANCM C-terminus/FAAP24, with and without DNA, will be invaluable in understanding how these proteins interact with each other and how they associate with DNA. Structure of full-length FANCM/FAAP24 and structure-function analysis using genetic platforms will provide additional important insights into the functions of these proteins.

Materials and Methods

Cloning, protein expression and purification

The full-length human FAAP24 gene was PCR-amplified from pDONR201 and subcloned into the pET28a (Novagen) bacterial expression vector between NdeI and XhoI sites. FAAP24 N-terminal domain (residues 1 to 139) was subcloned into the pET-22b (Novagen) between NdeI and XhoI sites, C-terminal domain (residues 158-215) was subcloned into the pET-28a-sumo (Novagen) between BamHI and XhoI sites. Point mutations of FAAP24 were created using the standard PCR-based mutagenesis method. As a result all the constructs contained an N-terminal his-tag. The nucleotide sequences of the cloned DNAs were confirmed by DNA sequencing (Sangon). Recombinant proteins were expressed in Escherichia coli Rosetta host cells. Cells were grown at 37 °C to an OD600 = 0.8 and induced with 0.5 mM IPTG at 16 °C for 16 h. The target proteins were purified by Ni2+-IDA agarose affinity chromatography and further purified by two steps of column chromatography with HiTrap SPFF 5 ml and Superdex 75 120 ml columns (GE Healthcare).

Uniformly isotope-labeled FAAP24 was prepared by growing bacteria in M9 minimal medium using 15NH4Cl as the sole nitrogen source or 15NH4Cl and 13C6-glucose as the sole nitrogen and carbon sources, respectively. The NMR samples were concentrated to 0.3 mM (for titration experiments) or 0.5 mM (for structural determination) in 50 mM potassium phosphate pH 6.0, 50 mM NaCl, 1 mM β-ME, 0.5 mM EDTA.

Mutant proteins and NMR samples were prepared using the same protocol described above.

NMR spectroscopy

NMR spectra were recorded at 25 °C on either a Varian 700 MHz or a Bruker 850 MHz spectrometer both equipped with a cryogenic probe. The sequential backbone resonance assignments were achieved based on the following standard triple-resonance experiments: HNCACB, HN(CO)CACB, HNCO, HN(CA)CO and HNCA. Non-exchangeable side chain resonance assignments were achieved using the following spectra: HBHA(CO)NH, HCC(CO)NH, CC(CO)NH, HCCH-COSY and HCCH-TOCSY. Approximate interproton distance restraints were derived from two 3D NOESY spectra: 15N-edited NOESY (mixing time 100 ms) and 13C-edited NOESY (mixing time 100 ms). All of the NMR spectra were processed using NMRPipe and analyzed using NMRView or Sparky.

Structural determination

The solution structure of FAAP24 was calculated using the Xplor-NIH program, based on backbone dihedral angle restraints and NOE restraints. Backbone dihedral angle restraints (φ and ψ angles) were obtained by the analysis of the assigned backbone chemical shift values (C, CA, CB, N and HN) using TALOS+. Interproton distance restraints were derived from the 15N- or 13C-edited NOESY spectra. A total of 100 structures were calculated and the final 10 structures with the lowest total energy and least experimental violations were selected to represent the FAAP24 structure. The atomic coordinates of ERCC4 domain (residues 1-139) and (HhH)2 domain (residues 155-215) have been deposited in the Protein Data Bank (PDB) with the accession codes 2M9M and 2M9N, respectively. The chemical shift values of ERCC4 domain and (HhH)2 domain have been deposited in the Biological Magnetic Resonance Bank (BMRB) with the accession codes 19302 and 19303, respectively.

NMR titration

The interaction between FAAP24 and ssDNA was investigated using NMR chemical shift perturbation method. Two ssDNA (one 12 nt and one 20 nt) were chemically synthesized and purified with HPLC (Sangon). The lyophilized ssDNA samples were dissolved in titration buffer (50 mM K2HPO4-KH2PO4 pH 6.0, 200 mM NaCl, 1 mM β-ME, 0.5 mM EDTA) to make stock solutions of concentrations of 10 mM before titrating into the protein samples. Five HSQC spectra were acquired for each ssDNA ligand with a PRO/DNA ratio of 1:0, 1:0.3, 1:0.5, 1:1 and 1:2, respectively. Combined chemical shift perturbation was calculated by the formula (1),

where ΔH and ΔN are the chemical shift variations between the free and bound forms in the proton and nitrogen dimensions, respectively.

The SD of the CSP was calculated according to the following formula (2),

where CSPres is the residue-specific CSP.

Surface plasmon resonance (SPR) analysis

SPR analysis was deployed using a 20-nt substrate GCCCACCTGCAGGTTCACCC labeled with a biotin at the 5′-end. The solid DNA samples were dissolved in the SPR buffer (50 mM HEPES pH 7.5 and 100 mM NaCl). All SPR experiments were carried out at 25 °C using a BIAcore T100 instrument (GE Healthcare). Streptavidin was immobilized to a CM5 sensor chip by the amine-coupling method. The biotin-labeled DNA was captured by streptavidin at different channels for 180 resonance units (Ru). The binding experiments were carried out in the running buffer (50 mM HEPES pH 7.5 and 100 mM NaCl plus 0.005% [v/v] Tween-20) at the flow rate of 30 μl/min. Wild-type and mutant FAAP24 with different concentrations were injected over the DNA surface and blank flow cell for 60 s at a flow rate of 30 μl/min followed by 5 min dissociation phase. Bound protein could be dissociated from the DNA completely within 1-5 min. Data were analyzed with the BIAcore T100 evaluation software by fitting to steady state affinity model.

Cell culture

Human colorectal carcinoma HCT-116 and human embryonic kidney 293T (HEK293T) cell lines were obtained from the American Type Culture Collection (ATCC). All cells were all maintained in Dulbecco's Modification of Eagles Medium (DMEM) plus 10% fetal calf serum (FBS) and grown in a humidified 5% CO2-containing atmosphere at 37 °C.

Antibodies

FANCM and FAAP24 antibodies were described previously34. Sources of commercial antibodies are as follows: anti-β-actin (sc-69879, Santa Cruz); anti-β-tubulin (T4026, Sigma-Aldrich); anti-FANCD2 (sc-20022, Santa Cruz, and NB100-182, Novus); anti-histone H3 (05-928, Millipore); anti-Chk1 (sc-8408, Santa Cruz); anti-phospho-Chk1 Ser317 (2344, Cell Signaling); peroxidase-conjugated rabbit anti-mouse (315-035-048, Jackson ImmunoResearch); peroxidase-conjugated goat anti-rabbit (111-035-144, Jackson ImmunoResearch); horseradish peroxidase (HRP)-conjugated protein A (NA-9120V, GE Healthcare); and rhodamine Red-X-conjugated goat anti-rabbit (111-295-144, Jackson ImmunoResearch).

Western blot analysis

Whole-cell lysates were prepared by lysing cells with NETN-SDS buffer (20 mM Tris-HCl, pH 8.0, 50 mM NaCl, 1 mM EDTA, 0.5% Nonidet P-40, 0.5% deoxycholate, 0.5% SDS) followed by brief sonication. Western blot analysis was performed following standard protocol. Briefly, protein samples were mixed with 2× SDS buffer, boiled, separated with SDS-PAGE, and electrotransfered onto nitrocellulose membranes, and immunoblotting was carried out with primary antibodies as indicated. After washing for three times, the membrane was incubated with secondary anti-rabbit or anti-mouse antibody conjugated with horseradish peroxidase. The blot was developed with enhanced chemiluminescence reagent (Amersham Biosciences).

Cell fractionation

Cells were lysed with NETN buffer (20 mM Tris-HCl, pH 8.0, 100 mM NaCl, 1.5 mM MgCl2, 1 mM EDTA, 0.5% Nonidet P-40, 10 mM NaF, 1 mM Na3VO4, 2 μg/ml aprotinin and 1μg /ml pepstatin) at 4 °C for 30 min and centrifuged at 14 000 rpm for 10 min. The soluble fraction contains the cytosol and soluble nuclear proteins. The insoluble chromatin enriched pellet was washed with phosphate buffered saline (PBS) for three times and extracted with 0.2 N HCl for 30 min on ice, and the extract was neutralized with equal volume of 1 M Tris-HCl (pH 8.0).

Immunofluorescence staining

Cells were cultured on cover slips and treated with desired dosage of MMC for 24 h. Cells were then washed with PBS, fixed with 3% formaldehyde for 15 min and permeabilized with 0.5% Triton buffer (20 mM HEPES, pH7.4, 50 mM NaCl, 3 mM MgCl2, 300 mM sucrose, 0.5% Triton X-100) for 5 min at room temperature. Samples were incubated sequentially with desired primary and secondary antibodies for 30 min, and finally stained with 4′, 6-diamidino-2-phenylindole (DAPI) for 2 min in dark, all at room temperature. Cover slips were mounted on slides with 1, 4-phenylenediamine based anti-fade solution, and visualized under Nikon Eclipse 90i fluorescence microscopy. Nuclear foci were counted in at least 100 nuclei at 5 different locations for each sample.

Clonogenic survival assay

Cells were seeded at a density of 1-3 × 105 cells per 100 mm culture plate and allowed to attach and recover overnight. Cells were then exposed to MMC at dosages ranging from 100 nM to 5 000 nM for 1 h. After MMC treatment, cells were seeded at appropriate densities in triplicates for each dosage group, and grown in MMC-free medium for approximately 14 days. Cell colonies were then fixed with 6% glutaraldehyde (v/v) and stained with 0.5% crystal violet (w/v). Colonies containing more than 50 cells were counted, and relative survival was calculated as follows: % survival = no. of colonies counted/(no. of cells plated × plating efficiency) × 100.