Identification of replication fork-associated proteins in Drosophila embryos and cultured cells using iPOND coupled to quantitative mass spectrometry

Munden, Alexander; Wright, Madison T.; Han, Dongsheng; Tirgar, Reyhaneh; Plate, Lars; Nordman, Jared T.

doi:10.1038/s41598-022-10821-9

Download PDF

Article
Open access
Published: 28 April 2022

Identification of replication fork-associated proteins in Drosophila embryos and cultured cells using iPOND coupled to quantitative mass spectrometry

Alexander Munden¹,
Madison T. Wright²,
Dongsheng Han¹,
Reyhaneh Tirgar¹,
Lars Plate^1,2 &
…
Jared T. Nordman¹

Scientific Reports volume 12, Article number: 6903 (2022) Cite this article

2607 Accesses
4 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Replication of the eukaryotic genome requires the formation of thousands of replication forks that must work in concert to accurately replicate the genetic and epigenetic information. Defining replication fork-associated proteins is a key step in understanding how genomes are replicated and repaired in the context of chromatin to maintain genome stability. To identify replication fork-associated proteins, we performed iPOND (Isolation of Proteins on Nascent DNA) coupled to quantitative mass spectrometry in Drosophila embryos and cultured cells. We identified 76 and 278 fork-associated proteins in post-MZT embryos and Drosophila cultured S2 cells, respectively. By performing a targeted screen of a subset of these proteins, we demonstrate that BRWD3, a targeting specificity factor for the DDB1/Cul4 ubiquitin ligase complex (CRL4), functions at or in close proximity to replication forks to promote fork progression and maintain genome stability. Altogether, our work provides a valuable resource for those interested in DNA replication, repair and chromatin assembly during development.

A high-resolution protein architecture of the budding yeast genome

Article 10 March 2021

Identification of DHX9 as a cell cycle regulated nucleolar recruitment factor for CIZ1

Article Open access 22 October 2020

Genome-wide and sister chromatid-resolved profiling of protein occupancy in replicated chromatin with ChOR-seq and SCAR-seq

Article 06 August 2021

Introduction

Each and every time a cell divides it must accurately replicate both its genetic and epigenetic information. Core replication factors are known to assemble at replication forks to replicate the genome (e.g. helicase, polymerases). There are, however, likely hundreds of proteins that function at or in close proximity to the replication fork to facilitate replication of difficult-to-replicate sequences, propagate epigenetic information and coordinate replication with other chromatin-related processes such as transcription^1,2,3. Replication of the eukaryotic genome requires thousands of replication forks functioning simultaneously to complete replication in a timely manner. Errors generated at a single replication fork during genome duplication can result in mutations or genomic alterations with the potential to cause cell lethality or drive tumor formation⁴. Further complicating DNA replication is the need to allow regulatory flexibility to accommodate cell-type specific changes in cell division and cell cycle rates that occur during cell differentiation and development^5,6. How replication fork composition and activity is remodeled in response to difficult-to-replicate regions of the genome and in response to changes in developmentally programed changes in S phase regulation has yet to be defined.

Drosophila provides an ideal system to understand how developmentally-programed changes in S phase impact replication fork composition. In contrast to the ~ 8 h S phases associated with mitotic cell division in differentiated cells, S phases during early embryonic development are extremely rapid. During early embryogenesis and prior to the maternal-to-zygotic transition (MZT) S phases are 3–4 min in length. S phase gradually lengthens as development approaches the MZT and slows to ~ 75 min at the MZT^5,7. While S phase length can drastically differ during development, the rate of replication fork progression is similar in these different contexts⁸. The chromatin context that replication forks must navigate also changes with development. In pre-MZT embryos, chromatin is devoid of heterochromatin and transcription is largely inactive. Around the time of the MZT, condensed heterochromatin is formed and zygotic transcription is activated^5,9. In fact, the extension of S phase at the MZT is largely driven by the onset of late replication and the bulk of S phase is dedicated to replication of heterochromatic sequences^5,10.

In recent years, several techniques have been established to isolate active replication forks to identify replication fork-associated proteins^3,11,12. One technique, isolation of proteins on nascent DNA (iPOND), has become widely employed due to its ease of use and the only technical requirement being a short pulse of the nucleotide analog 5-ethynyl-2′-deoxyuridine (EdU)¹¹. For iPOND, cells are incubated with a brief pulse of EdU and proteins are crosslinked to nascent DNA. EdU can be biotinylated using click chemistry and newly synthesized DNA and associated proteins are purified using streptavidin beads¹³. A key to identifying proteins at or in close proximity to active replication forks, rather than general chromatin-associated proteins, is a chase sample where a thymidine chase is introduced after the EdU pulse. Proteins enriched in pulse only samples relative to the chase samples are largely replication fork-associated proteins^2,11,14. Another advantage of iPOND is that it can be coupled to quantitative mass spectrometry to identify replication fork-associated proteins in an unbiased manner^1,2,15. While iPOND coupled to quantitative mass spectrometry has been used extensively in mammalian cultured cells, it has not been applied in Drosophila or in a developing organism.

To identify proteins at or in close proximity to active replication forks in Drosophila, and to determine if replication fork composition is influenced by development, we have performed iPOND in combination with tandem mass tag (TMT)-based quantitative mass spectrometry in Drosophila post-MZT embryos and cultured cells. Using an iPOND-TMT approach, together with a stringent statistical analysis, we identified 76 and 278 replication fork-associated proteins in post-MZT embryos and Drosophila cultured S2 cells, respectively. While we have confirmed many known replication fork components, we have identified many proteins that do not have known roles at the replication fork. By performing a targeted RNAi-based screen of select factors, we have identified the Cul4 E3 ubiquitin ligase specificity factor, BRWD3, as a replication fork-associated protein that affects replication fork progression.

Results

Establishing iPOND in the developing embryo

To define the landscape of proteins at or in close proximity to active replication forks during development, we turned to Drosophila due to its well-characterized S phase programs that are known to significantly change during development. To identify replication fork-associated proteins, we chose to use iPOND because it does not require protein tags or extensive multi-step purifications¹¹. We chose post-MZT Drosophila embryos (3–5 h after egg laying—AEL) for our embryonic sample. During this developmental time point, S phase is ~ 75 min with the bulk of that time devoted to replication of heterochromatin (Fig. 1A)⁵.

Nucleotide analogs and other small molecules are unable to enter embryos without permeabilization or direct injection¹⁶. To obtain sufficient EdU-labeled embryos for iPOND, we developed a large-scale permeabilization strategy. Starting with 3–5 h embryos collected from a population cage, embryos were permeabilized and pulse-labeled with EdU for 10′ using custom collection baskets (see “Experimental procedures”; Fig. 1B). Using this approach, we could routinely isolate 100–200 mg of EdU-labeled embryos from a single collection basket. To determine if EdU-labeled 3–5 h post-MZT embryos could be used for iPOND, we biotinylated embryos using Click chemistry as previously described¹³ and the EdU-labeling efficiency was determined by staining embryos with fluorescently labeled streptavidin¹³ (Fig. 1C). Two key controls were also used; following the 10′ pulse of EdU, embryos were immediately transferred into medium containing thymidine for 30′ (chase sample). Second, age-matched embryos were mock treated and biotinylated exactly as the pulse and chase samples (no EdU control). To determine if iPOND could be used to isolate replication-dependent chromatin, we performed a single-step purification of biotinylated EdU-containing chromatin from our pulse, chase and no-EdU samples. We probed lysates for histone H3 as a mark of total chromatin¹¹ (Fig. 1D). We found that the recovery of chromatin was dependent on EdU incorporation. This indicates that iPOND can be applied to Drosophila embryos to isolate native chromatin.

iPOND mass spectrometry identifies proteins at or in close proximity to active replication forks in Drosophila embryos

Now that we established iPOND as a technique to identify proteins at or in close proximity to active replication forks in Drosophila embryos, we wanted to identify the repertoire of proteins associated with replication forks in early embryos in an unbiased manner using quantitative mass spectrometry. Therefore, we coupled our iPOND purifications to tandem mass tag (TMT) labeling, which allows us to multiplex and quantify the relative abundance of peptides across multiple biological replicates in a single mass spectrometry experiment¹⁷. We optimized the amount of labeled embryos necessary for reproducible purification and mass spectrometry experiments. Ultimately, we found that 0.5 g of EdU-labeled embryos routinely provided robust and reproducible mass spectrometry results. We collected EdU-pulsed embryos from four biological replicates of 3–5 h embryos. EdU-pulsed embryos were either fixed immediately (pulse) or chased with thymidine for 30 min prior to fixation (chase). After EdU purification and verification via Western blot, peptides derived from pulse and chase samples were TMT labeled, separated using multidimensional protein identification technology (MudPIT) and quantified by mass spectrometry (Fig. 2A).

To identify proteins at or in close proximity to active replication forks, we focused on proteins that were enriched in the EdU pulse samples relative to the thymidine chase controls. To ensure that the differences were not due to differences in purification and/or labeling efficiencies, we normalized proteins to histone H4 (see “Experimental procedures”). Using this analysis, we identified 76 proteins that were significantly enriched in either the EdU pulse or thymidine chase samples (Supp. Table 1). Most of the proteins enriched in our experiments were derived from the EdU pulse samples (Fig. 2B; 74 of 76). This is likely due to the relatively short thymidine chase time we adopted to allow us to focus on replication fork-associated proteins. Several lines of evidence indicate that our iPOND strategy is effective in isolating replication fork-associated proteins from Drosophila embryos. First, out of the top 25 enriched proteins in our data set, 18 are known replication factors (Supp. Table 2). Second, a Gene Ontology (GO) analysis of the 76 proteins enriched in the EdU pulse samples was highly enriched for DNA replication and DNA replication-associated processes (Fig. 2C). Therefore, we conclude that iPOND is an effective strategy to identify proteins at or in close proximity to active replication forks during Drosophila embryogenesis.

We wanted to categorize the proteins in our data set in an unbiased manner to identify existing and potentially new protein networks centered around the replication machinery. To this end, we analyzed all enriched proteins using the STRING network in Cytoscape to identify proteins known to interact with one another. This analysis identified a network cluster of 27 known DNA replication and repair proteins (Fig. 2D). This cluster contains known helicase subunits, DNA polymerases, clamp loading factors and other factors (Fig. 2D). Additionally, it contains proteins involved in replication fork stability and response to DNA damage (Mei-41/ATR, Timeout/Timeless, CG10336/Tipin and Claspin)^{18,19,20,21,22}. Unexpected clusters were also identified. For example, we identified a cluster of proteins involved in RNA processing and a cluster involved in nuclear organization and the nuclear pore. We also identified several unique replication fork-associated proteins that did not readily form interaction networks. Together, we conclude that iPOND can be used in Drosophila embryos to identify existing and potentially new replication fork-associated proteins.

iPOND mass spectrometry identifies proteins at or in close proximity to active replication forks in Drosophila cultured cells

To extend the utility of iPOND in Drosophila, we performed iPOND in Drosophila S2 cultured cells. We previously performed iPOND in S2 cells, but did not couple iPOND to quantitative mass spectrometry²³. First, we validated that iPOND functions in S2 cells (Supp. Fig. 1A). Next, we performed iPOND coupled to quantitative mass spectrometry with TMT labeling using ~ 10⁹ cells/biological replicate (Fig. 3A). Similar to our results in embryos, all of the enriched proteins were found in the pulse sample rather than the chase (Fig. 3B). This is likely due to the short chase time we used in these experiments. One difference we noted, however, is that we identified 278 proteins at or in close proximity to active replication forks in S2 cells (compared to 76 in embryos) (Supp. Table 3). While significantly higher than embryos, this protein number is similar to other iPOND and iPOND-like data sets in mammalian cells^1,3.

Similar to our embryo data set, multiple lines of evidence indicate that our purifications successfully captured replication fork-associated proteins. Out of the top 25 enriched proteins in our data set, 22 are known replication factors (Supp. Table 4). Next, a Gene Ontology (GO) analysis of proteins enriched in the EdU pulse samples was highly enriched for DNA replication and DNA replication-associated processes (Fig. 3C). We attempted to generated an unbiased interaction network map using the STRING network in Cytoscape for the 278 enriched proteins, however, the networks were too dense to effectively visualize any meaningful interaction network hubs (Supp. Fig. 1B). For prioritization, we selected the proteins with an adjusted p value of < 0.05 and greater than 1.8-fold enrichment in the pulse relative to the chase. These stringent statistical cutoffs revealed 99 high-confidence proteins that were ultimately used to generate interaction network clusters, which consisted of known replication fork factors, further validating this data set and statistical analysis (Fig. 3D). Also, we identified networks containing proteosome components, RNA processing factors, protein phosphatase 4 complex (PP4) and a number of proteins with no recognized network connections (Fig. 3D).

BRWD3 affects genome stability and replication fork progression

To determine if any of the replication fork-associated factors we identified affect genome stability, we used RNAi to deplete select factors and measured the global level of DNA damage. We chose to perform this targeted screen in S2 cells rather than post-MZT embryos due to the rapid and efficient depletion that can be obtained in S2 cells without the need to generate new reagents²⁴. To quantify the global levels of DNA damage, we measured the level of phosphorylated H2Av (ɣ-H2Av), the Drosophila equivalent to mammalian ɣ-H2Ax, which is found at double strand breaks and stalled replication forks by immunofluorescence^25,26. Of the 15 factors we chose, some but not all have known functions in DNA replication or DNA repair^{18,27,28,29,30,31,32,33,34,35}. We validated knock down efficiency and the effect on cell proliferation for all factors (Supp. Fig. 2A,B). As our negative control, we used a non-targeting RNA to GFP that is not present in S2 cells. As a positive control we targeted DNA polymerase alpha (DNA pol⍺), which is necessary for continual priming of the lagging strand (Fig. 4A)³⁶. Depletion of several factors resulted in increased H2Av phosphorylation (Fig. 4A). For example, depletion of Cul4, RTEL, ELG1 and BRWD3 all caused increased DNA damage consistent with mammalian studies^27,28,29,30. Interestingly, knockdown of polybromo, a component of the Brahma chromatin remodeling complex³⁷, also caused an increase in DNA damage (Fig. 4A). Depletion of several factors caused a decrease in ɣ-H2Av signal intensity, suggesting these factors contribute to DNA damage detection or signaling (Fig. 4A). Consistent with this hypothesis, depletion of mei41 (the Drosophila ATR ortholog) decreased ɣ-H2Av intensity.

BRWD3 is a targeting specificity factor for the DDB1/Cul4 ubiquitin ligase complex (CRL4)³⁸. In mammalian cells, one of the BRWD3 orthologs DCAF14/PHIP associates with replication forks upon DNA replication stress²⁹. Depletion of DCAF14 results in a modest increase in DNA damage, which is exacerbated upon replication stress²⁹. Depletion of BRWD3 in Drosophila S2 cells causes an increase in ɣ-H2Ax levels in unstressed cells (Fig. 4A). This suggests that BRWD3/DCAF14 has an evolutionarily conserved role at the replication fork to maintain genome stability. Given these observations, we wanted to determine if BRWD3 affects replication fork progression in unchallenged Drosophila cells. To this end, we developed a DNA combing protocol for Drosophila S2 cells. While we initially attempted to perform DNA combing with both IdU and CldU nucleoside analogs, we were unable to successfully perform combing with IdU (data not shown). This would have allowed us to measure fork rate, fork asymmetry and inter-origin distance. To solely measure the rate of fork progression, we performed DNA combing analysis with CldU as the sole nucleotide analog. As positive and negative controls, we used interfering RNAs against DNA pol⍺ and GFP, respectively. Depletion of BRWD3 caused a decreased rate of fork progression in untreated cells (Fig. 4B). To rule out an off-target effect of the RNAi construct, we performed the DNA combing assay with an independent RNAi construct (Supp. Fig. 3A). We also validated the knock down efficiency of both RNAi constructs by Western blot (Supp. Fig. 3B). Thus, we conclude that BRWD3 functions at or in close proximity to the replication fork to promote replication fork progression and genome stability in Drosophila.

Discussion

By developing a large-scale EdU labeling protocol in Drosophila embryos, we were able to perform iPOND in a developing organism. By coupling iPOND to quantitive mass spectrometry we identified 76 replication fork-associated proteins in Drosophila post-MZT embryos. Giving confidence to this method of identifying replication fork-associated proteins, 32 proteins we identified have known roles in DNA replication or repair. We note, however, that not all known replication fork-associated proteins were identified in our data set. Multiple reasons likely explain this observation. First, we used a stringent statistical cut off in our analysis to avoid false positives (see “Experimental procedures”). Second, either due to loss in purification or difficulty in mass spectrometry, some replication proteins are simply not detected by mass spectrometry, resulting in false negatives. Therefore, we suspect that 76 proteins are an underestimate of the total number of replication fork-associated proteins in post-MZT embryos.

While iPOND-TMT identified 76 proteins in post-MZT embryos, the same technique uncovered 278 proteins in Drosophila cultured cells. Although this number of proteins is higher than what we observed in embryos, it is similar to recent iPOND and iPOND-like experiments coupled to quantitative mass spectrometry performed in mammalian cells^1,3. The difference in protein number between post-MZT embryos and cultured cells could be due to cell-type-specific factors in S2 cells or technical differences when performing iPOND in embryos vs. cultured cells. It should be noted, however, that the rate of replication fork progression is similar in Drosophila embryos and cultured cells⁸. Given that a single pulse of EdU is the only technical limitation with iPOND, it seems unlikely that differences in the amount of EdU labeling are responsible for the differences in protein number between the two developmental states. One complicating factor when trying to compare iPOND data sets for cell-type-specific factors is that lack of a protein in one sample could be due to a limitation in peptide detection in mass spectrometry. Therefore, we cannot use the lack of a protein in one developmental sample as direct evidence that a protein is cell-type specific. Nonetheless, our data reveal numerous replication fork-associated proteins in Drosophila embryos and cultured cells that can serve as a resource for anyone interested in replication fork composition and activity.

One factor that we identified as a replication fork-associated protein is BRWD3. Interestingly, one of the BRWD3 orthologs in mammalian cells also functions at the replication fork to maintain genome stability upon replication stress²⁹. One key difference, however, is that in Drosophila BRWD3 functions at or in close proximity to active replication forks in the absence of exogenous replication stress. Therefore, while BRWD3 and DCAF14 are both substrate specificity factors for CRL4, they likely function differently in mammalian cells and Drosophila. While it is tempting to speculate that CRL4^BRWD3 targets a critical factor for ubiquitylation at the replication fork, further work will be necessary to test this hypothesis. For example, BRWD3 could alter the activity of a factor that directly controls fork progression away from a replication fork. Therefore, although BRWD3 can be found at or in close proximity to active replication forks, it’s effect on replication fork progression could be indirect.

In summary, we have developed a protocol for the biochemical isolation of replication fork-associated proteins in Drosophila embryos and cultured cells. Our work suggests that replication fork composition can be modulated during development. Importantly, we have provided a resource of replication fork-associated factors in Drosophila for those interested in DNA replication, DNA repair and chromatin dynamics during replication.

Experimental procedures

EdU pulsing of embryos

Oregon R flies were expanded into population cages on grape juice plates supplemented with wet yeast. Cages were kept at 25 °C in a humidified room and plates changed daily. Prior to embryo collections, flies were precleared for at least 2 h. To acquire post-MZT embryos, flies were allowed to lay for 2 h, and the plate was aged for 3 h at 25 °C to obtain 3–5 h AEL embryos. Embryos were transferred to a container with a wire mesh bottom, washed in water and embryos were dechorionated in 50% bleach for 2 min. After washing, embryos were arranged in a monolayer on the mesh and bucket were dried with paper towels. Embryos were allowed to air dry 4–10 min, then submerged in octane for precisely 3.5 min with gentle shaking. Embryos were then air dried for 1 min while shaking. Permeabilized embryos were pulsed with 10 μM EdU in EBR for 10 min. For chase samples, EdU-pulsed embryos were transferred to a new solution containing 20 μM of thymidine for an additional 30 min. After pulse/chasing, embryos were transferred to a scintillating flask in 10 mL of heptane. 10 mL of 4% PFA was added (2% final) and embryos were shaken vigorously at room temperature for 20 min. After fixation, the bottom layer of PFA was removed and an equal volume of methanol was added. Embryos were shaken by hand for 1 min, settled and heptane was removed. Embryos were washed in methanol twice and transferred to PBS + 0.1% Triton X-100 and permeabilized overnight at 4 °C. For each batch of embryos, a small fraction was taken and biotinylated and incubated with 568-Streptavidin to ensure that at least 50% of embryos were labeled. Successful collections were pooled to obtain 500 μL of embryos per biological replicate.

EdU pulsing of S2 cells

S2 cells were obtained directly from the DGRC. Cells were confirmed negative for mycoplasma contamination via PCR. Cells were grown in Schneider’s Drosophila Medium with 10% heat-inactivated FBS (Gemini Bio Products) and 100 U/mL of Penicillin/Streptomycin (Fisher Scientific) and kept at 25 °C. Cells were pulsed as described in¹³. Briefly, three T225 flasks of 70% confluent cells were pulsed with 10 μM of EdU for 9 min. Cells were scraped and spun down for 3 min at 300×g. 10 mL of 2% paraformaldehyde (PFA) was added to each flask and samples were fixed at room temperature on a nutator for 20 min. Paraformaldehyde was neutralized with glycine and cells were centrifuged for five minutes at 900×g at 4 °C and resuspended in PBS with 0.1% Triton X-100 at 4 °C until processing. For the chase sample, after centrifuging the cells were resuspended in cell media with 20 μM thymidine and incubated for 30 min in the cell culture incubator before fixation. Three T225 flasks were pooled for each replicate (~ 7.5e⁸ cells per replicate).

iPOND

Embryos and S2 cells were biotinylated as described in¹³. Briefly, PBS, CuSO₄, Biotin-Azide, and sodium ascorbate were mixed and added to labeled cells and embryos for 30 min. After biotinylation, cells or embryos were washed with PBS + 0.1% Triton X-100. A crude nuclear extract was generated by douncing embryos in Buffer 1 (15 mM HEPES pH 7.6, 10 mM KCl, 5 mM MgCl₂, 0.1 mM EDTA, 0.5 mM EGTA, 350 mM sucrose)³⁹ twelve times using a B-type homogenizer and centrifuged for 15 min at 8000×g. This pellet was resuspended in 1.2 mL of LB3 (1 mM EDTA, 0.5 mM EGTA, 10 mM Tris pH 7.5, 100 mM NaCl, 0.1% Na-Deoxycholate, 0.5% N-Lauroyl sarcosine)⁴⁰ with 2× protease inhibitors. Cells were resuspended in 1.2 mL LB3 lysis buffer with 2× protease inhibitors.

Samples were sonicated in a Bioruptor Plus (Diagenode) at high power, 10 cycles at 30″ seconds on/30″ seconds off. After a short break, samples were vortexed and this was repeated until 40 total cycles were achieved. 100 μL of Streptavidin C1 Dynabeads were extensively washed with LB3 and added to each sample. Samples were incubated at 4 °C for 2 h on a nutator. The unbound material was reserved to verify chromatin fragmentation. Beads were washed five times in LB3, with the 4th wash containing 500 mM NaCl. To elute, samples were incubated at 65 °C overnight on thermoblock in 1:1 combination of LB3:SB (20% glycerol, 20% SDS, 120 mM Tris pH 6.8). The next day, the eluate was removed from the beads and added to 2× Laemmli buffer with DTT and boiled for 10 min. This lysate was used for Western blot and mass spectrometry experiments.

Western blotting

Lysates from iPOND samples were loaded onto a 4–15% Mini-Protean Stain-free protein gel (BioRad). After running the gel, samples were transferred onto 0.2 μM PVDF using the Transblot Turbo system (BioRad). Membranes were blocked in 5% milk, and incubated with the appropriate antibody for 1 h at room temperature. Histone H3 (abcam 21054, 1:3000) was used to verify the success of iPOND. After washing in TBS + 0.1% Tween-20 (TBST), secondary antibodies (Jackson Labs) conjugated with HRP were added at 1:10,000 (mouse) or 1:20,000 (rabbit). After 30 min at room temperature, membranes were washed with TBST, incubated with Clarity ECL for 5 min (Bio-Rad) and visualized using a Bio-Rad ChemiDoc MP Imaging System.

TMT labeling

After verifying iPOND was successful by Western blot (5% of total material), the remaining purified material was precipitated using methanol and chloroform and washed with methanol to remove excess detergent. Protein was resuspended in 5 μL fresh 1% Rapigest. 32.5 μL of mass spectrometry grade water with HEPES (pH 8.0 at a final concentration of 100 mM). Disulfide bonds were reduced with freshly made 5 mM TCEP and incubated for 30 min at room temperature. Fresh Iodoacetamide was added at a final concentration of 10 mM to acetylate free sulfhydryl bonds. Protein was digested overnight with 0.5 μg trypsin at 37 °C with shaking and covered from light. The next day, samples were labeled using a TMT10plex kit (Thermo Scientific catalog #90110). TMT labels were resuspended in acetonitrile and each sample was incubated with the appropriate amount of TMT reagent for 1 h at room temperature. Excess label was neutralized with 0.4% final concentration of ammonium bicarbonate for 1 h. Samples were mixed and acidified with formic acid to a pH 2. The mixed sample was reduced to 1/6 of the original volume using a SpeedVac, and brought back up to original volume with Buffer A (5% acetonitrile, 0.1% formic acid). Rapigest was cleaved by incubating for 1 h at 42 °C. The samples were centrifuged at 14,000 rpm for 30 min and the supernatant was transferred to a fresh tube and stored at − 80 °C until mass spectrometry analysis.

Liquid chromatography–tandem mass spectrometry

MudPIT microcolumns were prepared as previously described⁴¹. Peptide samples were directly loaded onto the columns using a high-pressure chamber. Samples were then desalted for 30 min with buffer A (97% water, 2.9% acetonitrile, 0.1% formic acid v/v/v). LC–MS/MS analysis was performed using a Q-Exactive HF (Thermo Fisher) or Exploris480 (Thermo Fisher) mass spectrometer equipped with an Ultimate3000 RSLCnano system (Thermo Fisher). Embryo samples were analyzed on the Exploris480 while the S2 cell culture were analyzed on the Q-Exactive HF. MudPIT experiments were performed with 10 µL sequential injections of 0, 10, 30, 60, and 100% buffer C (500 mM ammonium acetate in buffer A), followed by a final injection of 90% buffer C with 10% buffer B (99.9% acetonitrile, 0.1% formic acid v/v) and each step followed by a 130 min gradient from 5 to 80% B with a flow rate of 300 nL/min when using the Q-Exactive HF and 500 nL/min when using the Exploris480 on a 20 cm fused silica microcapillary column (ID 100 um) ending with a laser-pulled tip filled with Aqua C18, 3 µm, 100 Å resin (Phenomenex). Electrospray ionization (ESI) was performed directly from the analytical column by applying a voltage of 2.0 kV when using the Q-Exactive HF and 2.2 kV when using the Exploris480 with an inlet capillary temperature of 275 °C. Using the Q-Exactive HF, data-dependent acquisition of mass spectra was carried out by performing a full scan from 300 to 1800 m/z with a resolution of 60,000. The top 15 peaks for each full scan were fragmented by HCD using normalized collision energy of 38, 0.7 m/z isolation window, 120 ms maximum injection time, at a resolution of 45,000 scanned from 100 to 1800 m/z and dynamic exclusion set to 60 s. Using the Exploris480, data-dependent acquisition of mass spectra was carried out by performing a full scan from 400 to 1600 m/z at a resolution of 120,000. Top-speed data acquisition was used for acquiring MS/MS spectra using a cycle time of 3 s, with a normalized collision energy of 36, 0.4 m/z isolation window, 120 ms maximum injection time, at a resolution of 45,000 with the first m/z starting at 110. Peptide identification and TMT-based protein quantification was carried out using Proteome Discoverer 2.4. MS/MS spectra were extracted from Thermo Xcalibur .raw file format and searched using SEQUEST against a Uniprot Drosophila melanogaster proteome database (downloaded February 6th, 2019 and containing 21,114 entries). The database was curated to remove redundant protein and splice-isoforms, and supplemented with common biological MS contaminants. Searches were carried out using a decoy database of reversed peptide sequences and the following parameters: 10 ppm peptide precursor tolerance, 0.02 Da fragment mass tolerance, minimum peptide length of 6 amino acids, trypsin cleavage with a maximum of two missed cleavages, dynamic methionine modification of 15.995 Da (oxidation), static cysteine modification of 57.0215 Da (carbamidomethylation), and static N-terminal and lysine modifications of 229.1629 Da.

iPOND-TMT data analysis

To determine enrichment or depletion of the proteins, the TMT intensities for each protein was log₂ transformed and samples were normalized based on median TMT intensity per channel. Log2-transformed, median normalized TMT intensities were further normalized to the level of Histone H4, as the resulting incorporation of this histone should be identical between each sample. Enrichment values were calculated based on this normalized data. Cellular localization data was determined for each protein using the Gene Ontology Cellular Compartment (FlyBase v2021_05). Proteins that lacked any nuclear or chromatin compartmental data were removed from the datasets. To determine if a protein was significantly enriched or depleted in the pulse or chase embryo samples, an unpaired t-test was performed for each protein. Our uncorrected p values were validated because our positive controls (known replication proteins) were identified. For S2 cell data, an unpaired t-test was performed for each protein with a Benjamini, Krieger and Yekutieli multiple test correction and false discovery rate of 5%⁴².

For the pathway enrichment analysis of enriched proteins, PANTHER Gene Ontology was used^42,43,44. Enriched proteins were inputted and the default background for Drosophila melanogaster was selected. The biological process pathway was used, and the results were exported to Excel and the top 10 pathways were chosen by q-value, and visualized in Graphpad Prism.

For network clustering, all of the proteins enriched in the embryo and S2 pulse were loaded as separate networks in Cytoscape v3.9.0⁴⁵. The resulting interactions were visualized using the STRING network with the stringApp, using the Drosophila melanogaster setting with 0 additional interactors and a confidence score cutoff of 0.8^46,47.

RNAi and immunofluorescence in S2 cells

RNAi in S2 cells was performed as described⁴⁸. Briefly, dsRNA against each candidate RNA was designed to be 200–500 bp. Primers used to generate dsRNA are listed in Supplemental Table 5. The dsRNAs were synthesized using the Invitrogen MEGAscript T7 Transcription Kit (Ambion). For each sample, 1.5 million S2 cells were seeded in 1 ml non-serum medium in a 6-well plate and 30 µg of dsRNA was added. After 45 min incubation at room temperature, 2 ml of serum-containing medium was added and cells were incubated for an additional 5 days. Reverse transcription and quantitative PCR (RT-qPCR) and Western blotting were performed to determine the knock down efficiency using a rabbit anti-BRWD3 antibody at 1:500⁴⁹. For immunofluorescence, RNAi-treated cells were attached to Concanvan A-coated slides for 15 min, fixed for 15 min in 4% paraformaldehyde and permeabilized for 15 min in PBS supplemented with 0.3% Triton-X-100 (PBT). Cells were then blocked for 60 min in blocking buffer, containing 1% BSA and 0.2% goat serum in 0.1% PBT. After blocking, cells were incubated with rabbit anti-γ-H2Av (1:500, Rockland, # 600-401-914) antibody overnight at 4 °C in blocking buffer. After washing with PBS, cells were incubated with goat anti-rabbit IgG secondary antibody (1:500, Life Technologies, # A11011) in blocking buffer for 1 h at room temperature and stained with DAPI (0.1 µg/mL) in PBT for 10 min and mounted in Vectasheild (Vector Labs). All images were obtained using Nikon Ti-E inverted microscope with a Zyla sCMOS digital camera with a 20× oil objective. For each biological replicate, all samples were captured at the same magnification and same exposure time. For quantitative analysis of γ-H2Av levels, regions of interest (ROIs) were defined based on the DAPI signal. The mean signal intensity of ɣ-H2Av was extracted for each ROI. The signal was normalized to the DAPI signal intensity to account for differences in the total amount of DNA. 350 randomly selected cells were used for each biological replicate. Two biological replicates were used for the data analysis. Kruskal–Wallis one-way analysis of variance was performed in GraphPad Prism for statistical significance.

DNA molecular combing

Drosophila S2 cells were pulsed with 20 μM of CldU nucleoside (Sigma-Aldrich, C6891) for 20 min. Cells were washed with PBS then ~ 1.5–3.0 million cells were embedded in agarose plugs. The assay was performed as described in Genomic Vision’s manufacturer instructions. The stretched and denatured DNA was stained with a CldU-specific antibody (Abcam Cat#ab6326) for 1 h, washed in PBS, then probed with a secondary antibody (Thermo, A11007) for 30 min. Coverslips were washed with PBS then mounted. Stained coverslips were imaged using a Nikon Ti-E inverted microscope with a Zyla sCMOS digital camera with a 40× oil objective. For each sample, 200 DNA fiber lengths were measured manually using Nikon NIS-Elements AR v4.40. Investigator was blinded to sample identity. Two biological replicates were performed per sample. The length of a given fiber is directly proportional to the rate of replication fork progression. Therefore, fiber lengths were converted to fork progression rates given the 20-min pulse time (Fiber length * 20 min/2 kb min⁻¹). Kruskal–Wallis one-way analysis of variance followed by Dunn’s multiple comparisons post-test was performed in GraphPad Prism for statistical significance.

Data availability

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD031165 http://www.ebi.ac.uk/pride/archive/projects/PXD031165.

References

Wessel, S. R., Mohni, K. N., Luzwick, J. W., Dungrawala, H. & Cortez, D. Functional analysis of the replication fork proteome identifies BET proteins as PCNA regulators. Cell Rep. 28, 3497-3509.e4 (2019).
Article CAS PubMed PubMed Central Google Scholar
Dungrawala, H. et al. The replication checkpoint prevents two types of fork collapse without regulating replisome stability. Mol. Cell 59, 998–1010 (2015).
Article CAS PubMed PubMed Central Google Scholar
Alabert, C. et al. Nascent chromatin capture proteomics determines chromatin dynamics during DNA replication and identifies unknown fork components. Nat. Cell Biol. 16, 281–291 (2014).
Article CAS PubMed PubMed Central Google Scholar
Tomasetti, C., Li, L. & Vogelstein, B. Stem cell divisions, somatic mutations, cancer etiology, and cancer prevention. Science 355, 1330–1334 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Shermoen, A. W., McCleland, M. L. & O’Farrell, P. H. Developmental control of late replication and S phase length. Curr. Biol. 20, 2067–2077 (2010).
Article CAS PubMed PubMed Central Google Scholar
Matson, J. P. et al. Rapid DNA replication origin licensing protects stem cell pluripotency. elife 6, e30473 (2017).
Article PubMed PubMed Central Google Scholar
Yuan, K., Seller, C. A., Shermoen, A. W. & O’Farrell, P. H. Timing the drosophila mid-blastula transition: A cell cycle-centered view. Trends Genet. 32, 496–507 (2016).
Article CAS PubMed PubMed Central Google Scholar
Blumenthal, A. B., Kriegstein, H. J. & Hogness, D. S. The units of DNA replication in Drosophila melanogaster chromosomes. Cold Spring Harb. Sym. 38, 205–223 (1974).
Article CAS Google Scholar
Tadros, W. & Lipshitz, H. D. The maternal-to-zygotic transition: A play in two acts. Development 136, 3033–3042 (2009).
Article CAS PubMed Google Scholar
Seller, C. A. & O’Farrell, P. H. Rif1 prolongs the embryonic S phase at the Drosophila mid-blastula transition. PLoS Biol. 16, e2005687 (2018).
Article PubMed PubMed Central CAS Google Scholar
Sirbu, B. M. et al. Analysis of protein dynamics at active, stalled, and collapsed replication forks. Genes Dev. 25, 1320–1327 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gambus, A. et al. GINS maintains association of Cdc45 with MCM in replisome progression complexes at eukaryotic DNA replication forks. Nat. Cell Biol. 8, 358–366 (2006).
Article CAS PubMed Google Scholar
Dungrawala, H. & Cortez, D. The nucleus. Methods Mol. Biol. 1228, 123–131 (2014).
Article Google Scholar
Sirbu, B. M. et al. Identification of proteins at active, stalled, and collapsed replication forks using isolation of proteins on nascent DNA (iPOND) coupled with mass spectrometry. J. Biol. Chem. 288, 31458–31467 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cortez, D. Chapter two proteomic analyses of the eukaryotic replication machinery. Methods Enzymol. 591, 33–53 (2017).
Article CAS PubMed PubMed Central Google Scholar
Limbourg, B. & Zalokar, M. Permeabilization of Drosophila eggs. Dev. Biol. 35, 382–387 (1973).
Article CAS PubMed Google Scholar
McAlister, G. C. et al. Increasing the multiplexing capacity of TMTs using reporter ion isotopologues with isobaric masses. Anal. Chem. 84, 7469–7478 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sibon, O. C. M., Laurençon, A., Hawley, R. S. & Theurkauf, W. E. The Drosophila ATM homologue Mei-41 has an essential checkpoint function at the midblastula transition. Curr. Biol. 9, 302–312 (1999).
Article CAS PubMed Google Scholar
Benna, C. et al. Drosophila timeless2 is required for chromosome stability and circadian photoreception. Curr. Biol. 20, 346–352 (2010).
Article CAS PubMed Google Scholar
Lee, E.-M. et al. Drosophila Claspin is required for the G2 arrest that is induced by DNA replication stress but not by DNA double-strand breaks. DNA Repair 11, 741–752 (2012).
Article CAS PubMed Google Scholar
Gotter, A. L., Suppa, C. & Emanuel, B. S. Mammalian TIMELESS and tipin are evolutionarily conserved replication fork-associated factors. J. Mol. Biol. 366, 36–52 (2007).
Article CAS PubMed Google Scholar
Saldivar, J. C., Cortez, D. & Cimprich, K. A. The essential kinase ATR: Ensuring faithful duplication of a challenging genome. Nat. Rev. Mol. Cell Biol. 18, 622–636 (2017).
Article CAS PubMed PubMed Central Google Scholar
Munden, A. et al. Rif1 inhibits replication fork progression and controls DNA copy number in Drosophila. Elife 7, e39140 (2018).
Article PubMed PubMed Central Google Scholar
Echeverri, C. J. & Perrimon, N. High-throughput RNAi screening in cultured cells: A user’s guide. Nat. Rev. Genet. 7, 373–384 (2006).
Article CAS PubMed Google Scholar
Madigan, J. P., Chotkowski, H. L. & Glaser, R. L. DNA double-strand break-induced phosphorylation of Drosophila histone variant H2Av helps prevent radiation-induced apoptosis. Nucleic Acids Res. 30, 3698–3705 (2002).
Article CAS PubMed PubMed Central Google Scholar
Mah, L.-J., El-Osta, A. & Karagiannis, T. C. γH2AX: A sensitive molecular marker of DNA damage and repair. Leukemia 24, 679–686 (2010).
Article CAS PubMed Google Scholar
Jin, J., Arias, E. E., Chen, J., Harper, J. W. & Walter, J. C. A family of diverse Cul4-Ddb1-interacting proteins includes Cdt2, which is required for S phase destruction of the replication factor Cdt1. Mol. Cell 23, 709–721 (2006).
Article CAS PubMed Google Scholar
Vannier, J.-B. et al. RTEL1 is a replisome-associated helicase that promotes telomere and genome-wide replication. Science 342, 239–242 (2013).
Article ADS CAS PubMed Google Scholar
Townsend, A., Lora, G., Engel, J., Tirado-Class, N. & Dungrawala, H. DCAF14 promotes stalled fork stability to maintain genome integrity. Cell Rep. 34, 108669 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bell, D. W. et al. Predisposition to cancer caused by genetic and functional defects of mammalian Atad5. PLoS Genet. 7, e1002245 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bandura, J. L. et al. Humpty dumpty is required for developmental DNA amplification and cell proliferation in Drosophila. Curr. Biol. 15, 755–759 (2005).
Article CAS PubMed Google Scholar
Mansfield, E., Hersperger, E., Biggs, J. & Shearn, A. Genetic and molecular analysis of hyperplastic discs, a gene whose product is required for regulation of cell proliferation in Drosophila melanogaster imaginal discs and germ cells. Dev. Biol. 165, 507–526 (1994).
Article CAS PubMed Google Scholar
Kappes, F. et al. The DEK oncoprotein is a Su(var) that is essential to heterochromatin integrity. Genes Dev. 25, 673–678 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sousa-Nunes, R., Chia, W. & Somers, W. G. Protein Phosphatase 4 mediates localization of the Miranda complex during Drosophila neuroblast asymmetric divisions. Gene Dev. 23, 359–372 (2009).
Article CAS PubMed PubMed Central Google Scholar
Klymenko, T. et al. A polycomb group protein complex with sequence-specific DNA-binding and selective methyl-lysine-binding activities. Genes Dev. 20, 1110–1122 (2006).
Article CAS PubMed PubMed Central Google Scholar
Muzi-Falconi, M., Giannattasio, M., Foiani, M. & Plevani, P. The DNA polymerase ^ɦ-primase complex: Multiple functions and interactions. Sci. World J. 3, 21–33 (2003).
Article CAS Google Scholar
Thompson, M. Polybromo-1: The chromatin targeting subunit of the PBAF complex. Biochimie 91, 309–319 (2009).
Article CAS PubMed Google Scholar
Jackson, S. & Xiong, Y. CRL4s: The CUL4-RING E3 ubiquitin ligases. Trends Biochem. Sci. 34, 562–570 (2009).
Article CAS PubMed PubMed Central Google Scholar
Shao, Z. et al. Stabilization of chromatin structure by PRC1, a polycomb complex. Cell 98, 37–46 (1999).
Article CAS PubMed Google Scholar
MacAlpine, H. K., Gordân, R., Powell, S. K., Hartemink, A. J. & MacAlpine, D. M. Drosophila ORC localizes to open chromatin and marks sites of cohesin complex loading. Genome Res. 20, 201–211 (2010).
Article CAS PubMed PubMed Central Google Scholar
Fonslow, B. R. et al. Single-step inline hydroxyapatite enrichment facilitates identification and quantitation of phosphopeptides from mass-limited proteomes with MudPIT. J. Proteome Res. 11, 2697–2709 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ashburner, M. et al. Gene ontology: Tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Article CAS PubMed PubMed Central Google Scholar
Carbon, S. et al. The gene ontology resource: Enriching a GOld mine. Nucleic Acids Res. 49, D325–D334 (2020).
Google Scholar
Mi, H., Muruganujan, A., Ebert, D., Huang, X. & Thomas, P. D. PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 47, D419–D426 (2019).
Article CAS PubMed Google Scholar
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Szklarczyk, D. et al. The STRING database in 2021: Customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49, D605–D612 (2021).
Article CAS PubMed Google Scholar
Doncheva, N. T., Morris, J. H., Gorodkin, J. & Jensen, L. J. Cytoscape StringApp: Network analysis and visualization of proteomics data. J. Proteome Res. 18, 623–632 (2019).
Article CAS PubMed Google Scholar
Rogers, S. L. & Rogers, G. C. Culture of Drosophila S2 cells and their use for RNAi-mediated loss-of-function studies and immunofluorescence microscopy. Nat. Protoc. 3, 606–611 (2008).
Article CAS PubMed Google Scholar
Morgan, M. A. J. et al. A cryptic Tudor domain links BRWD2/PHIP to COMPASS-mediated histone H3K4 methylation. Genes Dev. 31, 2003–2014 (2017).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Martina Brienza-Ramos for assistance with EdU labeling of embryos used in this study. We thank David Cortez and Kavi Mehta for advice and guidance with DNA combing. We thank A. Shilatifard for providing the Drosophila anti-BRWD3 antibody. We thank Sarah Wessel and members of the Nordman lab for providing critical feedback on the manuscript. Figures 2 and 3 created in part using Biorender.com. This work was supported by National Institutes of Health (NIH) General Medical Sciences awards R35GM133552 to L.P. and R35GM128650 to J.T.N. M.T.W was supported by the Vanderbilt Chemistry-Biology Interface Training Program (T32GM065086) and the National Science Foundation Graduate Research Fellowship Program.

Author information

Authors and Affiliations

Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37212, USA
Alexander Munden, Dongsheng Han, Reyhaneh Tirgar, Lars Plate & Jared T. Nordman
Department of Chemistry, Vanderbilt University, Nashville, TN, USA
Madison T. Wright & Lars Plate

Authors

Alexander Munden
View author publications
You can also search for this author in PubMed Google Scholar
Madison T. Wright
View author publications
You can also search for this author in PubMed Google Scholar
Dongsheng Han
View author publications
You can also search for this author in PubMed Google Scholar
Reyhaneh Tirgar
View author publications
You can also search for this author in PubMed Google Scholar
Lars Plate
View author publications
You can also search for this author in PubMed Google Scholar
Jared T. Nordman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M. and J.T.N. planned and designed the research; A.M., R.T., D.S. performed experiments; A.M., M.W., D.H., and R.T. analyzed data with supervision from L.P. and J.T.N.; A.M. and J.T.N. wrote the manuscript with input from all of the authors. A.M., M.W., D.H., R.T., L.P. and J.T.N. edited the manuscript.

Corresponding author

Correspondence to Jared T. Nordman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Munden, A., Wright, M.T., Han, D. et al. Identification of replication fork-associated proteins in Drosophila embryos and cultured cells using iPOND coupled to quantitative mass spectrometry. Sci Rep 12, 6903 (2022). https://doi.org/10.1038/s41598-022-10821-9

Download citation

Received: 18 January 2022
Accepted: 13 April 2022
Published: 28 April 2022
DOI: https://doi.org/10.1038/s41598-022-10821-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.