Introduction

Leishmania species cause a large spectrum of diseases in humans ranging from skin lesions to visceral damage, which is lethal, if left untreated. Treatment options for leishmaniasis are limited and toxic and no effective vaccine is currently available (https://www.dndi.org/ diseases-projects/leishmaniasis/). Within its mammalian host, Leishmania replicates in the phagolysosome compartment of macrophages where it encounters various stress stimuli that trigger important changes in gene expression1,2,3,4,5 and parasite metabolism6,7. Most of those stresses lead to DNA damage or protein misfolding that has to be corrected. As many other eukaryotes8, Leishmania has evolved quality control systems that cooperate to eliminate damaged proteins9,10,11,12.

Recently, we have undertaken studies to characterize the Leishmania valosin-containing protein (VCP)/p97/Cdc48 ortholog (VCP and p97 in metazoa, Cdc48 in yeast)10, one of the key quality control components in recycling or degrading misfolded proteins or aggregates. VCP belongs to the AAA + (Associated with diverse cellular Activities) family of ATPases that hydrolyze ATP and use the resulting energy to extract polyubiquitinated target proteins from membranes, organelles, and large protein assemblies and delivered them to proteasomal degradation13,14,15,16. As a central component of the Ubiquitin Proteasome System (UPS), VCP/p97 plays a critical role in cellular proteostasis13,17,18. Indeed, VCP/p97 is involved in the endoplasmic reticulum (ER)-associated protein degradation15,19, the mitochondrion-associated protein degradation20, ribosomal quality control21, the extraction of chromatin-bound proteins22 or of damaged lysosomes by autophagy23, genome stability24, and stress granules clearance25. Mutations in this well-conserved protein can lead to protein aggregation and have been linked to several diseases, including neurodegenerative and muscular disorders and cancer26,27,28.

Each monomer of the hexameric VCP/p97 protein is composed by an N-terminal domain followed by two tandem ATPase domains (D1 and D2) separated by a short linker, and an unstructured C-terminal tail29. The N-terminal domain can be further subdivided into two subdomains, Nn (15–95 aa) and Nc (104–175 aa). This structure allows the association of VCP with a large variety of cofactors/adaptors which determine substrate specificity, target the ATPase to different cellular locations, or modify the ubiquitin chain attached to the substrate30,31,32. So far, about 30–40 cofactors have been identified in mammals but their exact functions are still poorly understood. Whether they have a substrate-recruiting, processing or regulatory function, most cofactors interact with the Nn or Nc subdomains of VCP via a small number of conserved binding modules, while a lower number binds to the unstructured C‐terminal tail formed by the last 7 amino acids30,31,32. In mammals, most VCP cofactors contain the ubiquitin regulatory X domain (UBX) or UBX-like (UBXL) with similar three-dimensional structure described for ubiquitin. The UBX module interacts with the Nn-Nc cleft of the VCP through the Rx(3)FPR motif. Proteins such as UBXD1 to UBXD6, UBXD9 and UBXD11 contain only a UBX domain. The UBA (ubiquitin associated)-UBX cofactors, such as UBXD7, UBXD8 (FAF2), UBXD10 (p47), UBXD12 (FAF1) and UBXD13 also accommodate an UBA domain that is fundamental for interacting with ubiquitinated substrates16,33. Cofactors such as p47 harbor, in addition to the UBX and UBA domains important for their function in the UPS, a SHP (BS1, binding segment 1) motif as another site for interaction with the Nc subdomain of VCP31,32. Cofactors harboring either a VIM (VCP-interacting motif) or VBM (VCP-binding motif) motif also interact with the same hydrophobic pocket of the N-domain32. One of the most studied VCP cofactor, the heterodimer UFD1-NPL4 (UN), interacts with VCP to form the VCP-UFD1-NPL4 complex which extracts polyubiquitinated proteins from membranes and macromolecular complexes and is involved in a series of biological processes, including ER-associated degradation (ERAD)34. Few cofactors have been reported to interact with the C-terminus of VCP. These harbor a PUB (PNGase/UBA or UBX-containing proteins) or a PUL (PLAP, Ufd3p, and Lub1p) domain which forms a hydrophobic pocket for interactions with the C-terminal tail of VCP35 following association of key amino acids such as Leu804 and the aromatic side chain of the penultimate tyrosine805 residue30,31,32.

Our initial studies on the ubiquitin selective chaperone VCP/p97 in Leishmania (LiVCP) demonstrated its essential role in the parasite intracellular development and survival under heat stress10. In this study, we provide novel insights into the LiVCP interaction network. A series of immunoprecipitation experiments coupled to liquid chromatography-tandem mass spectrometry analysis were used to identify the major interacting partners of LiVCP. These studies uncovered p47, UFD1, NPL4, FAF2 and PUB1 as the core LiVCP cofactors. Network proteomics for each cofactor confirmed close partnership with LiVCP and revealed the presence of multiple LiVCP complexes in Leishmania, including the well-characterized LiVCP-LiUFD1-LiNPL4 complex. Gene Ontology analysis of each cofactor proteome combined with digitonin fractionation and immunofluorescence studies support cofactor subcellular compartmentalization. Furthermore, in silico models based on 3D homology modeling and protein–protein docking predicted that, with the exception of NPL4, conserved binding modules within key Leishmania VCP cofactors interact with specific binding sites in the hexameric LiVCP protein, as described for their mammalian orthologs. Altogether, these results allowed us to build the Leishmania VCP protein network, the first characterized in parasitic protozoa.

Results

Identification of the Leishmania VCP/p97 cofactors and core proteome

VCP/p97 is a hexameric protein that can interact with a large number of protein cofactors through its N-terminal and C-terminal domains27. These interactions are key for its activity and functional diversity. This study aimed to identify the Leishmania VCP interactome based on the human VCP/p97 network from the BioGrid database (Supplementary Table S1) as well as on protein–protein interaction experiments. According to the e-value similarity accessed by protein BLAST (blastP), most proteins from the human VCP network have homologs in Leishmania (Supplementary Fig. S1).

Seeking for the identification of Leishmania VCP cofactors, we carried out a series of immunoprecipitation (IP) experiments with either C- or N-terminally HA-tagged Leishmania infantum VCP (LiVCP; LinJ.36.1420)10 followed by LC–MS/MS, database searching, and peptide identification. To allow VCP expression from its endogenous locus, we replaced one LiVCP genomic copy by a HA-epitope tagged LiVCP (LiVCP-HA) (Fig. 1A). Both episomal and integrated LiVCP-HA versions were expressed at levels comparable to the endogenous VCP protein (Fig. 1B) and yielded similar IP results. From seven independent IP-MS/MS experiments (see Methods), 218 proteins were initially identified as potential LiVCP interacting partners (Supplementary Table S2). To decrease the number of false positives in the LiVCP network building, we applied a stringent filter. First, we calculated the average number of peptides for each protein in all seven IPs and then we excluded proteins also found in five independent IP-MS/MS studies conducted with HA-tagged proteins unrelated to the VCP complex (Fig. 1C and Supplementary Table S2). From this analysis, 24 proteins were selected to specifically associate with LiVCP. Among those, 15 proteins with an average of exclusive unique peptide count (EUPC) ≥ 2.0 were classified as the LiVCP core partners (Fig. 1D and Supplementary Table S2). These share conserved domains with known VCP cofactors in other eukaryotes, such as the p47 (Shp1) UBX/UBA- and SEP-containing protein (LinJ.22.0200), the UBX-containing protein FAF (Fas-associated factor) shown a higher homology to the human FAF2 protein (UBXD8 or ETEA; LinJ.35.1960), the ubiquitin fusion degradation protein 1 (UFD1; LinJ.36.6780), the nuclear protein localization 4 (NLP4; LinJ.25.1320) known to form a heterodimer with UFD134, an ubiquitin associated UBA/TS-N domain protein (LinJ.24.1650) with an identified ER membrane signal peptide, and two PUB (PNGase/UBA or UBX) domain containing proteins (LinJ.11.0920 and LinJ.09.1060). In addition, the LiVCP proteome core includes the serine/threonine phosphatase type 1 (PP1) (LinJ.34.0900), the PP1 regulator-like protein (LinJ.05.1200) whose human homolog SDS22 is known to interact with VCP36, three mitochondrial proteins—a stomatin-like protein (LinJ.05.1040), an ATP-dependent zinc metallopeptidase (LinJ.36.2850) and the AAA + FtsH protease (LinJ.36.2850), a tetratricopeptide repeat protein part of the ER membrane protein complex subunit 2 (LinJ.17.0230) and three hypothetical proteins (LinJ.36.5080, LinJ.03.0250, LinJ.15.1570) (Fig. 1D) with some similarity to the yeast nuclear envelope protein Nsp1-like C-terminal region, syntaxin-like domain, and putative nuclear jumonji-like domain, respectively.

Figure 1
figure 1

The Leishmania LiVCP core partners identified by co-immunoprecipitation and mass spectrometry studies. (A) Schematic draw of the construct used to express the L. infantum valosin‐containing protein (LiVCP) tagged with an HA epitope at the C‐terminus (VCPHA) either episomally or integrated into the L. infantum VCP endogenous locus. (B) Western blotting to evaluate LiVCP-HA expression levels using anti-TbVCP and anti-HA antibodies. The alpha-tubulin antibody was used as protein loading control. The uncropped blots are shown in Supplementary Fig. S20 (panel A). (C) Venn diagram demonstrating the number of proteins identified by mass spectrometry in LiVCP-HA immunoprecipitates of seven independent experiments that were filtered against 5 unrelated HA-tagged proteins (for more details see Methods and Supplementary Table S2). (D) LiVCP core partners identified in (C) with an average of exclusive unique peptide count (EUPC) of ≥ 2. The five LiVCP core cofactors studied here (Lip47, LiUFD1, LiNPL4, LiFAF2 and LiPUB1) are indicated in bold. For the complete list, see Supplementary Table S2.

Apart from the above core partners, LiVCP co-immunoprecipitated heat shock proteins, chaperonins, T-complex proteins, mitochondrial integrity and stress response proteins, translation factors and ribosomal proteins, components of the trypanothione and peroxidase systems, RNA-binding proteins, and proteasome subunits (Supplementary Table S2) in line with its function in a broad array of ubiquitin-dependent protein quality control pathways13,14,15,16.

‘Network proteomics’ analysis to identify interacting partners of the core LiVCP cofactors and associations between cofactor-bound proteins

The interaction of p47, UFD1, NPL4, FAF2 and PUB1 with the Leishmania VCP ortholog was further confirmed by inverse co-immunoprecipitation studies. C- or N- terminally HA-tagged Lip47, LiNPL4, LiUFD1, LiFAF2, or LiPUB1 proteins cloned into vector pSP \(\alpha\) ZEO \(\alpha\) were transfected and stably expressed in L. infantum promastigotes (Supplementary Fig. S2A). Immunoprecipitation of these HA-tagged cofactor versions using an anti-HA antibody followed by western blotting with a specific anti-TbVCP antibody10,37 confirmed their association with LiVCP (Fig. 2A). The co-immunoprecipitation of LiUFD1, LiNPL4, Lip47, LiFAF2 and LiPUB1 with LiVCP (Fig. 1D and Table 1) and the detection of LiVCP in LiUFD1, LiNPL4, Lip47, LiFAF2 and LiPUB1 immunoprecipitates by western blotting (Fig. 2A) and mass spectrometry (Fig. 2B, Table 1, Supplementary Table S2) confirmed their close partnership.

Figure 2
figure 2

Reciprocal interactions between the Leishmania LiVCP and its key cofactors. (A) LiVCP detection by western blotting with an anti-TbVCP antibody following immunoprecipitation of C- or N- terminally HA-tagged LiUFD1, Lip47, LiFAF2, LiNPL4 and LiPUB1 proteins ectopically expressed in Leishmania. The uncropped blots derived from different gels are shown in Supplementary Fig. S20 (panels B-D) together with other repetitions of this experiment. (B) Detection of LiVCP and its cofactors LiUFD1, Lip47, LiFAF2 and LiPUB1 by mass spectrometry after co-immunoprecipitation using anti-HA magnetic beads for recombinant Leishmania expressing C- or N- terminally HA-tagged cofactor proteins. The average of EUPC for two independent experiments for each cofactor is shown here. For the complete list of mass spectrometry results, see Supplementary Table S2.

Table 1 Proteins specifically associated with one or more Leishmania LiVCP cofactors as determined by immunoprecipitation and LC–MS/MS studies.

As done for LiVCP, we carried out IP-MS/MS studies with five unrelated proteins to VCP complexes (average EUPC > 3.6; LiVCP threshold) to filter out non-specific LiVCP cofactor partners (Fig. 3A and Supplementary Table S2). The remaining proteins that co-immunoprecipitated with Lip47 (145), or LiFAF2 (164), or LiUFD1 (46) and or LiPUB1 (98) can be considered as putative cofactor interacting partners (Fig. 3A, Table 1 and Supplementary Table S2).

Figure 3
figure 3

Network proteomics to gain novel insights into LiVCP cofactor complexes. (A) Venn diagrams demonstrating the number of proteins identified by mass spectrometry in Lip47, LiFAF2, LiUFD1 and LiPUB1 co-immunoprecipitates of two independent experiments for each cofactor (see Methods) after applying a filter of five unrelated HA-tagged proteins (average of EUPC = 3.6 on 5 unrelated proteins; see Supplementary Table S2 for details) to reduce non-specific interactions. (B) Gene ontology–cellular component (GO-CC) analysis of proteins identified in (A) for each LiVCP cofactor: Lip47 (145), LiFAF2 (164), LiUFD1 (46), and LiPUB1 (98) according to their terms listed on TriTrypDB. The same stringent filter (grey in A) was applied for each cofactor (see Supplementary Table S2 for details). Gene Ontology biological process and molecular function can be seen in Supplementary Fig. S19.

Lip47 co-immunoprecipitated LiPUB1, the serine/threonine protein phosphatase 1 (PP1) catalytic subunit beta (LinJ.28.0730), a PP1 regulator-like protein that is homologous to the yeast SDS22 protein (LinJ.05.1200), and the PP1 phosphatase inhibitor that is homologous to the yeast Ypi1 protein (LinJ.07.0840) (Table 1). In Saccharomyces cerevisiae, these three proteins form a ternary complex that is important for the nuclear localization of PP1 and whose assembly and quality control requires Cdc48 and its adaptor p4738.

As reported in other eukaryotes34, the Leishmania LiUFD1 protein complexes with NPL4 to form the heterodimer cofactor UFD1-NPL4 but also associates with LiFAF2 (Table 1). Complex formation between VCP-NPL4-UFD1, FAF1 and polyubiquitinated proteins was shown previously to promote ER-associated degradation39. Other proteins only associated with the LiUFD1 proteome include a mitochondrial serine carboxypeptidase (CBP1; LinJ.18.0450), a phenylalanine-4-hydroxylase (LinJ.28.1390), peroxin PEX12 (LinJ.19.1240) and three hypothetical conserved proteins (LinJ.03.0250, LinJ.29.2300, LinJ.12.0550) (Table 1 and Supplementary Table S2). Interestingly, the hypothetical protein LinJ.03.0250 was also found in LiVCP immunoprecipitates (Fig. 1C), and some of the other LiUFD1 interacting proteins have homologs known to be part of the VCP network in other systems (see Discussion).

Proteins solely detected in LiFAF2 pull-down include a tetratricopeptide repeat containing protein of the ER membrane protein complex (LinJ.17.0230) also found in LiVCP immunoprecipitates (Fig. 1C), several peroxisomal/glycosomal proteins that are homologs of the peroxins PEX2 (LinJ.25.2330), PEX10 (LinJ.25.2400) and PEX11 (LinJ.35.3740), two fatty acid elongases (ELO1: LinJ.14.0680 and ELO3: LinJ.14.0760), a cell differentiation protein-like (LinJ.34.4180), and hypothetical protein LinJ.36.5360 (Table 1).

The LiPUB1 cofactor co-immunoprecipitated a serine palmitoyltransferase 1-like protein (LinJ.34.3530; SPT1), also found in the LiUFD1 proteome (Table 1). Serine palmitoyltransferase catalyzes the first and rate-limiting step in sphingolipid (ceramide) biosynthesis and has been associated with the resistance to heat stress and apoptosis40,41.

Interestingly, several LiVCP cofactor-associated proteins are common to more than two cofactors (see Table 1). For example, Lip47, LiFAF2 and LiPUB1 co-immunoprecipitated the tubulin polymerization-promoting protein TPPP/p25-alpha (LinJ.34.1620) that stabilizes microtubules42 and also protects against abnormal forms of prion proteins43, as well as the root hair defective 3 GTP-binding protein (RHD3) (LinJ.32.0380) that is analogous to the mammalian atlastin GTPases involved in shaping ER tubules44. An ADF/cofilin factor (LinJ.29.0520) of the family of actin remodeling proteins45 co-immunoprecipitated with both Lip47 and LiFAF2. The ribosomal protein S33 (LinJ.26.1610) and the hypothetical protein LinJ.33.1130 co-immunoprecipitated with LiFAF2 and LiPUB1. The V-type proton ATPase subunit D (LinJ.05.1140), a multi-subunit membrane protein complex that is evolutionarily related to F-type adenosine triphosphate synthases and A-ATP synthases46, co-immunoprecipitated with Lip47 and LiUFD1. The LiUFD1, LiFAF2 and LiPUB1 cofactors co-immunoprecipitated the trypanosomatid functional analog of Tom40 (ATOM40; LinJ.35.4920), which is the central pore of the TOM complex involved in the import of mitochondrial proteins47. Finally, all LiVCP core cofactors co-immunoprecipitated a fatty acid elongase (ELO2) (LinJ.14.0700).

General features of the LiVCP core cofactors

Proteins specifically co-immunoprecipitated with LiUFD1, Lip47, LiFAF2, or LiPUB1 were submitted to Gene Ontology – cellular component (GO-CC) enrichment analysis using the tool provided by TriTrypDB (Fig. 3B and Supplementary Table S3). An enrichment of a 19-fold for vacuolar proteins was seen in the LiUFD1 proteome. Nuclear proteins were exclusively enriched within the Lip47 proteome and endoplasmic reticulum-related proteins were enriched only within the LiFAF2 and LiPUB1 proteomes.

Molecular function (MF) and biological process (BP) analyses were also done aside comparison with the LiVCP proteome (Supplementary Table S3). Globally, the molecular function analysis predicted proteins with translation factor activity and structural constituents of the ribosome, RNA-binding activity, structural molecule activity and peptidase activity. ATPase and ligase activities were exclusively enriched within the LiFAF2 proteome while transmembrane transporter activity was only associated with the LiUFD1 proteome.

The biological process analysis showed that cofactor-associated proteins were mostly involved in translation, protein targeting and folding, and several metabolic processes. Protein folding and tRNA metabolic processes were found only associated with the LiFAF2 proteome while protein targeting and mitotic cell cycle were enriched in the LiPUB1 proteome (Supplementary Table S3). Overall, Gene Ontology analysis of each proteome indicated specific compartment-association for each cofactor.

Subcellular localization of the LiVCP core cofactors

To determine the subcellular localization of the core LiVCP cofactors, we carried out digitonin fractionation and indirect immunofluorescence studies. As a central player to the endoplasmic reticulum (ER)-associated protein degradation19,48 and the translocation of damaged mitochondrial proteins from the outer mitochondrial membrane (OMM) into the cytosol20,49, VCP has been found associated with both ER and OMM fractions, similarly to what we have described previously for the Leishmania LiVCP10.

First, we treated Leishmania parasites with increasing concentrations of digitonin (20 μM-10 mM) and carried out western blotting on the different cellular sub-fractions using an anti-HA antibody (all cofactors were HA-epitope tagged). Our results indicate an association of LiFAF2 and partly of LiPUB1 proteins with the organellar fraction and an enrichment of LiUFD1, Lip47 and LiNPL4 proteins with the cytosolic fractions (Fig. 4A). Immunofluorescence studies using an antibody directed against the ER BiP protein confirmed that LiFAF2 and partly LiPUB1 co-localize to endoplasmic reticulum (Fig. 4B, 4C). No co-localization was found with the mitochondrion (data not shown). Immunofluorescence studies to detect Lip47 demonstrated a partial co-localization with the histone H3 nuclear marker (Fig. 4D), as also suggested by GO analysis (Fig. 3B).

Figure 4
figure 4

Subcellular localization of LiVCP cofactors. (A) Western blot of L. infantum digitonin‐fractionated promastigotes (20 μM-10 mM) using the anti‐HA antibody to detect the C- or N-terminally HA-tagged LiUFD1, LiNPL4, Lip47, LiFAF2 and LiPUB1 proteins ectopically expressed in Leishmania. The 20 μM to 200 μM digitonin fractions are enriched with cytosolic proteins, the 1 mM and 10 mM fractions contain mostly organellar proteins, and the pellet fraction contains membrane‐associated proteins. Antibodies against the cytosolic or mitochondrial HSP70 were used as controls. The uncropped blots are shown in Supplementary Fig. S20 (panels E-I). Immunofluorescence studies showing the localization of C- or N-terminally HA-tagged LiFAF2 (B), LiPUB1 (C) and Lip47 (D) (in green). An anti‐HA antibody was used as primary antibody followed by Alexa Fluor 488 anti-mouse as secondary antibody. Nuclear and kinetoplast DNA was stained with DAPI (blue). Endoplasmic reticulum (ER) putative co-localization was assessed using an anti‐BiP antibody (red in B and C) as an ER marker. An Alexa Fluor 555 anti-rabbit was used as secondary antibody for BiP. Putative nuclear co-localization for Lip47 was assessed using an anti‐histone H3 (H3) antibody (red in D).

Predicted molecular interactions between the conserved domains of LiVCP cofactors and the Leishmania VCP protein based on 3D homology modeling and protein–protein docking

It has been shown previously that the VCP/p97/Cdc48 hexamer can bind to its various cofactors through the UBX or PUB domains or a ubiquitin-like fold or through one of the linear binding motifs such as SHP32. The L. infantum LiVCP encodes a protein of 784 amino acids that is highly conserved among its eukaryotic orthologs (75% aa identity with the human VCP/p97 protein)10 and harbors all the characteristic domains of VCP orthologs (Fig. 5A, top). Based on sequence similarity and conservation to mammalian VCP cofactor orthologs (Supplementary Fig. S1) and on the EUPC for each cofactor in LiVCP immunoprecipitates (Fig. 1), we selected Lip47, LiUFD1, LiNPL4, LiFAF2, and LiPUB1 as the core LiVCP cofactors for further investigation. Multiple sequence alignments of these LiVCP cofactors along with mammalian, yeast, trypanosomatid and non-trypanosomatid species helped us to generate phylogenetic trees revealing that Lip47, LiFAF2, LiUFD1, LiNPL4 and LiPUB1 are evolutionary distant from other eukaryotes but they harbor binding modules/motifs that are conserved and could represent specific domains for interaction with LiVCP (Supplementary Figs. S3-S7).

Figure 5
figure 5

3D homology modeling of the major LiVCP cofactors and their docking prediction into the L. infantum LiVCP protein. (A) Top, domain architecture of LiVCP. The N domain is subdivided into subdomains Nn (colored in yellow) and Nc (colored in orange) and the two ATPase domains are in blue (D1) and fuchsia (D2), respectively. The Walker A (A) and Walker B (B) motifs as well as the Sensor 1 (S1) residues and arginine (R) fingers are indicated. Bottom, top (left) and side views (right) of the LiVCP hexamer 3D structure model built by homology with the H. sapiens p97 hexamer (PDBid entry 5C18). (B) Top, domain composition of Lip47. Bottom, electrostatic potential surface of LiVCP (colored according to hydrophobicity) with stick representation of the UBX GMPP binding motif of Lip47. Key interactions are shown. (C) Top, domain composition of LiFAF2. Bottom, electrostatic potential surface of LiVCP with stick representation of the UBX GFPP binding motif of LiFAF2. Middle panel between B and C shows molecular surface of LiVCP with the UBX binding site indicated (Nn-Nc pocket). (D) Top, domain composition of LiUFD1. Bottom, electrostatic potential surface of LiVCP with stick representation of the SHP1 binding site QPTFAGAGRTL. (E) Top, domain composition of LiPUB1. Bottom, electrostatic potential surface of LiPUB1 with a stick representation of the C-terminus of LiVCP 780DDLYS784. Middle panel between D and E shows molecular surface of LiVCP with the SHP binding site (Nc) and C-terminus tail indicated. 3D homology models of LiUFD1, Lip47, LiFAF2 and LiPUB1 were respectively built by homology based on the templates with pdb entries: 5C1B_V, 1S3S_H, 2MX2 and 2HPL, respectively using the modeling software MODELLER80. For further details, see Supplementary Figs. S9, S11, S13 and S18. UBA (Ubiquitin-associated),SEP (Shp, eyes-closed, p47),SHP (BS1, binding segment 1); UBX (Ubiquitin regulatory X); UAS, domain of unknown function found in FAF2 and other proteins; PUB (PNGase/UBA or UBX) containing proteins. Further information about the respective domains can be found at https://www.ebi.ac.uk/interpro/.

To investigate whether LiVCP-cofactor interactions identified by immunoprecipitation studies take place within the previously predicted binding sites in the hexameric VCP protein30,32 and the conserved binding modules in Lip47, LiFAF2, LiUFD1, LiNPL4 and LiPUB1, we constructed 3D models by homology modeling and a protein–protein docking approach. 3D homology models are helpful in refining protein–protein interaction predictions that have been based on a sequence match alone as interface binding is more directly determined by the structure of the binding site rather than its sequence. The LiVCP homohexamer 3D molecular structure was obtained through homology modeling by superposition with the H. sapiens p97 homohexamer (PDBid: 5C18) template (Fig. 5A, bottom).

The Lip47 protein presents the same domain composition (UBA, SEP and UBX) than its human ortholog (Fig. 5B, upper). Similarly to the human p47, Lip47 harbors the sequence GMPP in the UBX domain (Supplementary Fig. S8) that docks in silico into the Nn-Nc pocket of LiVCP (Fig. 5B and Supplementary Fig. S9)50. Although Lip47 shares only 25% amino acid identity with its human ortholog, it has 41% sequence similarity and 73% sequence coverage and contains the two SHP motifs (SHP1 and SHP2) (Supplementary Fig. S8) that together with the UBX domain are known to be key for the interaction with VCP31. The SHP site is typically characterized by the consensus sequence FxGxGx2h that was recently summarized as h(x)1−2F/W(x)0−1GxGx2L (h, hydrophobic residue; x, any amino acid)32 and in Lip47, SHP1 and SHP2 sites, FYGRGQRL and FQGHGHRL, respectively are conserved (Supplementary Fig. S8). In addition, the Ramachandran plot showed that the amino acid residues of the Lip47 3D homology model are found in most favoured (87%) and additional allowed regions (8.7%) (Supplementary Table S4), which makes structure predictions reliable.

The LiFAF2 protein interacts with LiVCP through the GFPP motif in its UBX C-terminal domain as predicted by 3D homology modeling (Fig. 5C, Supplementary Fig. S10) and protein–protein docking upon superposition of the available structure of the human 40GYPP43 motif in the ovarian tumor domain-containing protein 1 (OTU1) and the homologous part within the Leishmania FAF2 protein (Supplementary Fig. S11). Surprisingly, LiFAF2 lacks the ubiquitin-associated (UBA) domain that is present in the human FAF1 and FAF2 proteins and is fundamental for interacting with ubiquitinated substrates51.

UFD1 is known to assemble with NPL4 to form the heterodimer UFD1/NPL4 (U/N), one of the most important p97/VCP cofactors, that recognizes polyubiquitinated substrates and together with VCP extracts them from organellar membranes or macromolecular complexes52,53. U/N binds to the N-terminal domain of VCP via the ubiquitin-like domain (UBL) of NPL4 and the SHP box of UFD132. The Leishmania LiUFD1 protein also contains the UBA domain and an UT6 region that accommodates the sequences QPTFAGAGRTL and RLKALGGGGRS corresponding to the SHP1 and SHP2 sites, respectively (Fig. 5D, top and Supplementary Fig. S12). 3D homology modeling predicts binding of LiUFD1 to LiVCP via its SHP1 site (Fig. 5D, bottom). Furthermore, docking essays propose that the SHP1 motif of LiUFD1 (reversed comparatively to the human SHP1) interacts with the same surface as in its human ortholog using amino acid residues Phe273, Arg278 and Leu280 (Phe228, Arg234 and Leu235 in human Ufd1) (Fig. 5D bottom and Supplementary Fig. S13).

In contrast to its eukaryotic orthologs, the Leishmania NPL4 lacks the N-terminal ubiquitin regulatory X (UBX)-like domain bound to the N-domain of Cdc48 hexamer, as well as the two Zn2+-finger domains important for anchoring the adjacent Mpr1/Pad1 N-terminal (MPN) domain to the top of the D1 ATPase ring34. Interestingly, protein–protein docking simulations showed that despite the absence of the UBX-like domain, LiNPL4 seems to interact with the N-terminus of LiVCP through its C-terminal GMPP motif (aa 350–353) (Supplementary Figs. S14, S15). Similarly to other eukaryotes34, the U/N heterodimer in Leishmania also recognizes polyubiquitinated substrates as shown by LiNPL4 and LiUFD1 IPs followed by western blotting (Supplementary Fig. S16).

A representative member of the PUB (PNGase/UBA or UBX-containing) domain proteins in Leishmania, LiPUB1, which has no homolog in humans, contains a conserved region (Supplementary Fig. S17) that is predicted by 3D homology modeling (Fig. 5E) and protein docking (Supplementary Fig. S18) to interact with the last five amino acids 780DDLYS784 of the LiVCP C-terminus, similarly to other PUB domain proteins 54.

Altogether, despite the evolutionary distance of LiVCP cofactors from their mammalian orthologs, in silico models based on structural data from the human orthologs using 3D homology modeling and protein–protein docking indicated that the conserved binding modules of Leishmania VCP core cofactors (except LiNPL4) interact with specific binding sites in the hexameric LiVCP, similarly to their mammalian orthologs.

The first trypanosomatid VCP interaction network

The resulting datasets from LiVCP, Lip47, LiUFD1, LiFAF2 and LiPUB1 IP-MS/MS studies were mined to identify interacting partners that are specific to each LiVCP cofactor as well as partners that are shared among multiple cofactors and LiVCP complexes (Fig. 6A) and allowed us to build the first LiVCP interaction network (Fig. 6B). To decrease the number of false positives, we excluded proteins that were co-immunoprecipitated with unrelated HA-tagged proteins (average of EUPC = 0.0). From the total number of proteins co-immunoprecipitated with Lip47, LiUFD1, LiFAF2, and LiPUB1 (Fig. 3A) only 10, 17, 23 and 9, respectively together with 24 proteins specifically associated to LiVCP (Fig. 1,Table 1 and Supplementary Table S2) were considered to build the topology of LiVCP network (83 proteins in total, 47 shown in Fig. 6B). The list of proteins and their intersections are detailed in Supplementary Table S5 whereas the complete list of interacting proteins is presented in Supplementary Table S2.

Figure 6
figure 6

The Leishmania VCP protein interaction network. (A) A Venn diagram representing the number of proteins identified by mass spectrometry in independent immunoprecipitates of LiVCP and its key cofactors LiUFD1, Lip47, LiFAF2 and LiPUB1. For high stringency filtering and network clarity, only proteins absent in IPs with HA-tagged unrelated proteins (EUPC = 0.0 on five unrelated experiments) were considered for this analysis. The intersections of list of elements were calculated with the Venn tool at https://bioinformatics.psb.ugent.be/webtools/Venn/. For the list of proteins used and their intersection, see Supplementary Table S5. (B) Integrated interaction map of the LiVCP network using the data generated in this work showed as in (A). For clarity, protein names were simplified. Filled color circles for LiVCP and its cofactors are represented as in (A). Bigger circles represent higher average of EUPC in LiVCP IP experiments. The proteins LiUFD1, Lip47, LiFAF2, LiPUB1 and LiVCP were used as bait (see Methods) and are represented with bold border circles. The direction of interactions is represented with a target arrow shape. Spring-embedded layout was applied on Cytoscape 3.5.1.

In addition to the known cofactors, eight proteins were exclusively associated with the LiVCP proteome and might also be considered as cofactors of the Leishmania VCP protein. These include another PUB domain protein (PUB2) (LinJ.11.0920), an OTU1 (ovarian tumor domain-containing protein 1-like cysteine protease (LinJ.36.6280) that is homologous to the S. cerevisiae deubiquitylation enzyme OTU1 shown to interact with VCP55, a protein present in the outer mitochondrial membrane (LinJ.29.2220), a P-type H + -ATPase (LinJ.18.1510), a microtubule-associated protein (LinJ.26.1950), a member of the FYVE zinc finger proteins (LinJ.36.2570) that serve as regulators of endocytic membrane trafficking and receptor signaling56, a type 1 protein serine/threonine phosphatase (PP1) (LinJ.34.0900) and a Leishmania specific hypothetical protein (LinJ.36.5080) (Fig. 6B and Supplementary Table S5).

Our analysis revealed that the LiUFD1 and LiFAF2 proteomes share five proteins in common with LiVCP. These include the mitochondrial stomatin-like protein 2 (SPL-2) (EUPC > 20), the membrane-integrated mitochondrial AAA/FtsH protease57, an UBA domain containing protein (LinJ.24.1650), and hypothetical conserved proteins LinJ.15.1570 and LinJ.36.4740 (Table 1, Fig. 6B). LiVCP and Lip47 mutually associate with the serine/threonine protein PP1 phosphatase catalytic subunit, the PP1 phosphatase inhibitor Ypi1, and the SDS22 (EUPC = 20) PP1 regulator-like protein (Table 1, Fig. 6B and Supplementary Table S5). LiVCP and LiFAF2 mutually co-immunoprecipitate the tetratricopeptide repeat protein (LinJ.17.0230) and an uncharacterized protein family (LinJ.07.0450), both subunits of the ER membrane protein complex (Table 1, Fig. 6B and Supplementary Table S5). LiVCP and LiUFD1 mutually interact with the hypothetical protein LinJ.03.0250 (Table 1, Fig. 6B and Supplementary Table S5). Specific interactions between cofactor-associated partners that do not involve an association with LiVCP are described in detail in the ‘network proteomics’ section (see also Table 1 and Fig. 6B).

Overall, using combined datasets from multiple IP/MS–MS studies for LiVCP and its main cofactors, we unveiled the first Leishmania LiVCP interactome. Many of the LiVCP interactors have homologs that are known to interact with VCP in other eukaryotic systems but others do not and may be relevant to study further as part of distinct Leishmania VCP complexes.

Discussion

In this study, we have characterized the valosin‐containing protein (VCP) interaction network, the first in parasitic protozoa. We have employed a series of co-immunoprecipitation and mass spectrometry analyses coupled with in silico models that allowed us to identify the Leishmania VCP proteome with its major cofactors and their interacting partners, as well as, to predict molecular interactions between conserved domains within these cofactors and specific binding sites in the hexameric LiVCP. Our data support several similarities but also important differences between the Leishmania VCP protein network and VCP complexes characterized in other eukaryotes.

Most of the known classes of VCP cofactors in other eukaryotes have orthologs in Leishmania. However, cofactors harboring the VIM (VCP-interacting motif)/VBM (VCP-binding motif) motifs58 were not found in any of our immunoprecipitation experiments, and with the exception of the ERAD-associated E3 ubiquitin-protein ligase HRD1 (LinJ.15.1460) no homologs of this class of proteins were depicted in the Leishmania genome. While LiVCP is highly conserved among different eukaryotic organisms and harbors all the characteristic domains of the human p97/VCP ortholog10, most of the major LiVCP cofactors are phylogenetically distant from their eukaryotic orthologs. However, despite their evolutionary distance from yeast and mammalian orthologs, in silico models based on 3D homology modeling and protein–protein docking indicated that the conserved binding modules of Leishmania VCP cofactors interact with specific binding sites in the hexameric LiVCP, similarly to their eukaryotic orthologs. This is the case for Lip47 and LiFAF2 that bind the Nn-Nc pocket of LiVCP through the sequence GMPP/GFPP in their UBX domain. Surprisingly, LiFAF2 lacks the UBA domain and theoretically should have lost the ability of binding ubiquitinated substrates59. Similarly to its yeast ortrholog34, LiUFD1 interacts with the N-terminus of LiVCP through its conserved SHP motif. LiPUB1 while has no human homolog yet interacts with the C-terminal region of LiVCP, as described for the human VCP-PUB complex.

One of the important differences unveiled by our study concerns LiNPL4, which lacks the UBX-like domain and the two Zn2+-finger motifs shown to bind VCP. Indeed, recent studies on the structure of the Cdc48-Ufd1-Npl4 complex from the thermophilic fungus Chaetomium thermophilum demonstrated that Npl4 interacts with the Cdc48 N-terminal region through its UBX-like domain and uses the Zn2+-finger motifs to anchor its MPN domain to the top of the D1 ATPase ring34. On the other hand, Ufd1 interacts with Npl4 to form the Ufd1-Npl4 heterodimer through a short segment of its UT6 domain34 which is also conserved in the Leishmania Ufd1 ortholog. Our 3D homology modeling and docking experiments predicted binding of LiNPL4 to LiVCP through its C-terminus GMPP motif. All NPL4 orthologs of the genus Leishmania, Leptomonas and Phytomonas harbor the GMPP motif and a variant of this motif, GM-[EDS]-PP, is present in Trypanosoma species. This unique feature in Trypanosomatidae paves the way for the design of inhibitors specifically targeting NPL4 binding to VCP. Ufd1-Npl4 and p47 are substrate-recruiting cofactors shown to bind VCP in a mutually exclusive manner as they compete for the same binding modules30,31. This seems also to be the case in Leishmania. Lip47 was neither found in the LiUFD1 proteome nor LiUFD1 in the Lip47 pull-down. Moreover, they do not share common interacting partners, except for the V-type proton ATPase subunit D.

Our analysis led us to identify the first PUB domain proteins in Leishmania and to unveil LiPUB1 as one of the major LiVCP cofactors interacting with the C-terminal region of LiVCP. Furthermore, new prospects were opened with the yet non-studied PUB domain protein LiPUB2 identified here as a putative LiVCP cofactor. In addition, a third UBX- and PUB domain-containing protein (LinJ.36.0140; LiPUB3) was retrieved by searching the L. infantum genome database. PUB domain proteins known to bind to the C-terminus of the VCP hexamer have been associated with autophagy of damaged lysosomes23 but the physiological significance of these interactions remains to be determined. Here, we found that a serine palmitoyltransferase-like protein was associated solely with the LiPUB1 proteome. Interestingly, a homolog of this protein in yeast is required for resistance to heat shock and plays an essential role in the removal or refolding of denatured or aggregated cytoplasmic proteins60. We have reported recently that LiVCP is essential for the parasite survival under heat stress10, and PUB proteins with the help of a serine palmitoyltransferase may contribute to this process.

Interestingly, our study revealed new LiVCP interacting partners exclusively associated with the LiVCP proteome that might function as LiVCP cofactors. These include a PUB domain protein (LiPUB2), an outer mitochondrial membrane protein, a P-type H + -ATPase with homology to fungal and plant proton pumps61, a FYVE zinc-finger protein known to bind phosphatidylinositol 3-phosphate in membranes of endocytic vesicles that regulate membrane trafficking and receptor signaling56, a microtubule-associated protein, and several hypothetical conserved proteins that may be part of distinct VCP-complexes. LiVCP, and also Lip47, associates with the type 1 protein serine/threonine phosphatase (PP1) complex consisting of the PP1 catalytic subunit Glc7/PP1-B, the SDS22 regulator-like protein and the protein phosphatase PP1 inhibitor YPI1. In S. cerevisiae, nuclear localization of Glc7 requires Sds22 and Ypi162. It has been shown recently that nuclear PP1 activity is positively regulated by the AAA-ATPase Cdc48 and its cofactor Shp1 (p47)63 to promote the assembly of the Glc7-Sds22-Ypi1 PP1 complex and to ensure its quality control38. Interestingly, here we found that Lip47 interacts with a large number of nuclear proteins and that is partly localized to the nucleus. From an evolutionary point of view, it is interesting that the interaction of LiVCP with Lip47 may regulate the assembly of the Glc7-Sds22-Ypi1 complex like is the case in yeast. LiVCP was found also associated, although not in all immunoprecipitates, with substrate-processing cofactors like the thioesterase ovarian tumor domain-containing protein 1 (OTU1), a deubiquitinating enzyme involved in ERAD through the interaction of its UBX-like (UBXL) domain with the N-terminus of Cdc4864. Although it is yet unclear how the polyubiquitinated substrates are released from the p97/VCP/Cdc48 complex and passed on to the proteasome65, OTU1 can be the deubiquitinase involved in this process as its UBXL domain interacts with VCP55. Although the Leishmania OTU1 ortholog does not harbor a well-defined UBXL domain, interestingly the 39GYPP42 loop of UBXL in S. cerevisiae shown not only to be critical for the interaction with VCP but also for its role in the ERAD pathway55 it is conserved in Leishmania (49GFPP52).

Another novel finding here is the association of the Leishmania VCP and also of its cofactors LiFAF2 and LiUFD1 with a stomatin-like protein 2 (SLP-2). Stomatins are members of the SPFH (stomatin, prohibitin, flotillin, HflC/K) superfamily that localize to the mitochondrial inner membrane scaffolding for the spatial organization of inner membrane proteases regulating mitochondrial dynamics, quality control, and cell survival66. Accordingly, our data support the association of LiUFD1 with transmembrane transport and peptidase activity proteins. It is possible that LiVCP associates with SLP-2 through UFD1, which also interacts with FAF2. Also present in the LiVCP proteome and associated with both LiFAF2 and LiUFD1 cofactors is the mitochondrial ATP-dependent AAA + protease FtsH whose proteolytic and chaperone-like activity is crucial to the protein degradation protein quality control of mitochondrial and chloroplast membranes67. This suggests a new model for membrane protein degradation mediated by ATP-dependent proteolytic systems57.

Our data support a subcellular compartmentalization for the key LiVCP cofactors. LiFAF2 and LiPUB1 are associated with organelles, LiFAF2 most likely with endoplasmic reticulum (ER), Lip47 is partially localized to the nucleus, LiUFD1 is possibly associated with cytosolic vacuoles, and NPL4 is enriched in the cytoplasm. Interestingly, LiFAF2 is associated with the peroxisomal biogenesis factors PEX2, PEX10 and PEX11. Similarly to other eukaryotes68, recent data in trypanosomatids support de novo biogenesis of peroxisomes (glycosomes) from the ER69. In higher eukaryotes PEX2, PEX10 and PEX12 are part of the peroxisomal E3 ubiquitin ligase complex required for pexophagy, a pathway to degrade ubiquitinated peroxisomes that involves an AAA ATPase complex with a striking similarity to p9770. Glycosome turnover in Leishmania is mediated by autophagy71 and the VCP/p97 quality control system, possibly through its FAF2 cofactor, may contribute to the control of glycosome homeostasis and turnover in these parasites. LiFAF2 also associates with two fatty acid elongases of the ELO GNS1/SUR4 family involved in the membrane-bound fatty acid chain elongation72. Interestingly, an ELO2 fatty acid elongase was found in the proteome of all four LiVCP cofactors (p47, UFD1 FAF2, PUB1). In trypanosomatids, the fatty acid elongation pathway occurs in the membrane of the ER73 and this could explain its association with components of the ERAD pathway.

In summary, this study allowed us to build the first VCP protein interaction network in trypanosomatids through the identification of known and novel interacting partners potentially associated with distinct VCP complexes. Our proteomics datasets identified biologically relevant functions for the Leishmania VCP cofactors and provided an important resource for further investigation of VCP function in several cellular processes related to protein quality control in these parasites.

Methods

Parasite strains, plasmid constructs and transfections

Leishmania infantum MHOM/MA/67/ITMAP-263 was used in this study. L. infantum promastigotes were cultured in SDM-79 medium supplemented with 10% heat-inactivated FCS (Multicell Wisent Inc., Canada) and 5 μg/ml hemin at pH 7.0 and 25 °C. The Lip47 (LinJ.22.0200/LINF_220008200), LiFAF2 (LinJ.35.1960/LINF_350024700), LiUFD1 (LinJ.36.6780/LINF_360076400), LiNPL4 (LinJ.25.1320/ LINF_250019000), and LiPUB1 (LinJ.09.1060/LINF_090016300) genes were amplified and cloned into pSP72αZEOα. An HA epitope tag was added either at their N-or C-terminus. Primers used in this study are shown in Supplementary Table S6. The generated plasmids were independently transfected into L. infantum promastigotes. Purified plasmid DNA (10–20 μg, Qiagen Plasmid Mini Prep Kit, Toronto, Ontario, Canada) or linearized fragments (10–20 μg for the targeting cassette 5′flank‐αZEOα‐VCPHA‐3′flank from10 were transfected into Leishmania by electroporation as described previously74. Stable transfectants were selected and cultured with 0.6 mg/ml zeocin (Sigma).

Protein lysate preparations and western blots

Western blots were performed following standard procedures. The anti-mouse HA tag monoclonal antibody (1:3,000; ABM), the anti-mouse HSP70 (cytosolic) antibody (1:400, Acris), the anti-rabbit HSP70 (mitochondrial) antibody (1:2000, kindly provided by Dr. Osvaldo de Melo Neto, Recife, Brazil), and the anti-mouse T. brucei (Tb)VCP antibody (1:5,000;37 kindly provided by Dr. James D. Bangs (Department of Microbiology & Immunology, Jacobs School of Medicine & Biomedical Sciences, University at Buffalo, USA) were used in this study. As a secondary antibody, we used the anti-mouse Horseradish peroxidase (HRP) (Cell Signalling) or the anti‐rabbit‐HRP antibody (GE Healthcare). Blots were visualized by chemoluminescence with Pierce ECL2 western blotting kit (Thermo Scientific). Digitonin fractionation was carried out in the presence of increasing concentrations of digitonin (20 μM to 10 mM) as described previously75 and each fraction was analysed by western blot.

Immunofluorescence studies

Immunofluorescence studies were done as described previously9,10. The anti-mouse HA tag monoclonal antibody (1:1,000; ABM), the anti-mouse H3 Histone 3 antibody (1:500; ABM) and the rabbit polyclonal anti‐T. brucei BiP (kindly provided by Dr. JD Bangs) were used as a primary antibody followed by Alexa Fluor 488 anti-mouse and Alexa Fluor 555 anti-rabbit as secondary antibodies. Nucleus and kinetoplast DNA was stained with DAPI (blue). The cells were observed under a Nikon epifluorescence microscope with a 100X objective and oil immersion. Images acquisition was performed with ImagePro Plus software and ImageJ.

Immunoprecipitation studies

Immunoprecipitation (IP) studies and mass spectrometry analysis were carried out as described previously9. Briefly, L. infantum promastigotes expressing HA-tagged protein were lysed, mixed with Pierce anti-HA magnetic beads (Thermo Scientific, Canada) and kept at -20 °C and subjected to mass spectrometry analysis or alternatively resolved in SDS-PAGE, excised, trypsin-digested, and analyzed by mass spectrometry.

Sample preparation and LC–MS/MS analysis

Protein digestion and mass spectrometry analyses were performed by the Proteomics Platform of the CHU de Québec Research Center (Quebec, Qc, Canada) as described previously76. Briefly, bands of interest were extracted from gels, placed in 96-well plates and then washed with water. Tryptic digestion was performed on a liquid handling robot (MultiProbe, Perkin Elmer) according to manufacturer’s instructions using the protocol detailed in77 with some modifications as suggested in78. Proteins were reduced with 10 mM DTT and alkylated with 55 mM iodoacetamide. Iodoacetamide derivative of cysteine was specified as a fixed modification and oxidation of methionine and deamidation of Asparagine and Glutamine were specified as a variable modification. Two missed cleavages were allowed. Trypsin digestion was performed using 126 nM of modified porcine trypsin (Sequencing grade, Promega, Madison, WI) at 37 °C for 18 h. Digestion products were extracted using 1% formic acid, 2% acetonitrile followed by 1% formic acid and 50% acetonitrile. The recovered peptides were pooled, vacuum-centrifuged, dried and then resuspended into 12 µl of 0.1% formic acid, and 5 µl were analyzed by mass spectrometry. Peptide samples were separated by online reversed-phase (RP) nanoscale capillary liquid chromatography (nanoLC) and analyzed by electrospray mass spectrometry (ES MS/MS). The experiments were performed with an Ekspert NanoLC425 (Eksigent) coupled to a 5,600 + mass spectrometer (Sciex, Framingham, MA, USA) equipped with a nanoelectrospray ion source. Peptide separation took place on a self-packed picofrit column (New Objective) with reprosil 3u, 120A C18, 15 cm × 0.075 mm internal diameter, (Dr Maisch). Peptides were eluted with a linear gradient from 5–35% solvent B (acetonitrile, 0.1% formic acid) for 35 min at a flow rate of 300 nL/min. Mass spectra were acquired by a data dependent acquisition mode using Analyst software version 1.7. Each full scan mass spectrum (400 to 1,250 m/z) was followed by collision-induced dissociation of the twenty most intense ions. Dynamic exclusion was set for a period of 12 s and a mass tolerance of 100 ppm.

Database searching

MGF peak list files were created using Protein Pilot version 4.5 software (Sciex). MGF sample files were then analyzed using Mascot (Matrix Science, London, UK; version 2.5.1). Mascot was searched with a fragment ion mass tolerance of 1.0 Da and a parent ion tolerance of 1.0 Da. Scaffold (version Scaffold_4.8.4, Proteome Software Inc., Portland, OR) was used to validate MS/MS based peptide and protein identifications based on the Leishmania Infantum TriTrypDB (version 9.0 released April 2016, 8,589 entries). Peptide identifications were accepted if they could be established at greater than 5.0% probability to achieve an FDR less than 1.0% by the Scaffold Local FDR algorithm. Protein identifications were accepted if they could be established at greater than 99.0% probability to achieve a FDR less than 1.0% and contained at least 2 identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm79. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony.

3D homology modeling and protein–protein docking

To construct 3D models of L. infantum JPCM5 proteins LiUFD1 (LinJ.36.6780), Lip47 (SEP domain, LinJ.22.0200), LiFAF2 (UBX domain, LinJ.35.1960), LiPUB1 domain (LinJ.09.1060), LiNPL4 (LinJ.25.1320) and LiVCP proteins, we searched for their orthologs in PDB database using BlastP and Delta-Blast. Then, 3D structure models were built using the modelling software MODELLER80 based on their homologous structure, PDBid : 5C1B_V, 1S3S_H, 2MX2_A, 2HPL_B, 6CDD_ A and 5C18_D, respectively. The QPTFAGAGRTL SHP1 peptide of LiUFD1 and the DDLYS peptide of LiVCP were modeled using as templates PDBid: 5C1B_V and 2HPL, respectively. The LiVCP hexamer was obtained by superposition with the H. sapiens p97 hexamer (PDBid: 5C18). The quality of the models was assessed by Ramachandran plot analysis through PROCHECK81. Proteins and peptides of L. infantum were docked into LiVCP using HDOCK82 and HPEPDOCK83, respectively. The electrostatic potential surfaces of 3D models and images were generated with PyMOL software (https://pymol.org/).

Multiple alignments and phylogeny

We used the amino acid sequence of each L. infantum JPCM5 protein to search by BlastP and Delta-Blast for close homologs in bacteria, archaea, fungi, plants and animals. The sequences extracted from databases were aligned with Clustal Omega84. To establish the phylogenetic relationships between L. infantum proteins and those of prokaryotic and other eukaryotic organisms, the sequences were aligned with Clustal Omega84 and a phylogenetic tree was constructed using PhyML85. The gene ontology (GO) term enrichment analysis was performed using the tool provided by TriTrypDB (https://tritrypdb.org) with Fisher exact test filtering for false discovery rate (FDR) lower than 0.05. Cellular Component, Molecular Function and Biological Process were carried out separately.

Experimental design and statistical rationale

Filter for LiVCP IP-MS/MS: Seven IP-MS/MS studies were considered for LiVCP (Supplementary Table S2). To select proteins that specifically associate with LiVCP, we calculated the average number of peptides for each protein in all seven experiments, and then excluded all proteins also found in five independent IP-MS/MS studies conducted with HA-tagged proteins unrelated to the VCP complex. From the selected 24 LiVCP specific interacting proteins, 15 with an average of exclusive unique peptide count (EUPC) ≥ 2.0 were classified as the LiVCP core and are presented in Fig. 1C and Supplementary Table S2.

Filter for LiVCP cofactors IP-MS/MS: Two independent experiments were performed for each LiVCP cofactor (Lip47, LiFAF2, LiUFD1 and LiPUB1) and only proteins with an average of exclusive unique peptide count EUPC ≥ 2.0 in both experiments for each cofactor were considered (Supplementary Table S2). To exclude non-specific partners, IP-MS/MS studies with proteins unrelated to LiVCP complexes were carried out and proteins with an average of EUPC > 3.6 (LiVCP threshold) were used to filter out non-specific LiVCP cofactor associated proteins. The remaining proteins co-immunoprecipitated with Lip47, or LiFAF2, or LiUFD1 and or LiPUB1 were considered as putative LiVCP cofactors. Cytoscape v 3.5.186 was used to plot network analyses of all new interactions for LiVCP, Lip47, LiFAF2, LiUFD1 and LiPUB1.

Filter for network plotting: To decrease the number of false positive results in the LiVCP network building, we only show proteins that were not co-immunoprecipitated with all unrelated HA-tagged proteins (average of EUPC = 0.0). From the total number of proteins co-immunoprecipitated with Lip47, or LiUFD1 or LiFAF2 and or LiPUB1, only 10, 17, 23 and 9, respectively were considered together with the 24 LiVCP specific interactors (Fig. 1, Table 1, and Supplementary Table S2) to build the topology of LiVCP network. No statistical analysis was needed.