Intermolecular interactions underlie protein/peptide phase separation irrespective of sequence and structure at crowded milieu

Poudyal, Manisha; Patel, Komal; Gadhe, Laxmikant; Sawner, Ajay Singh; Kadu, Pradeep; Datta, Debalina; Mukherjee, Semanti; Ray, Soumik; Navalkar, Ambuja; Maiti, Siddhartha; Chatterjee, Debdeep; Devi, Jyoti; Bera, Riya; Gahlot, Nitisha; Joseph, Jennifer; Padinhateeri, Ranjith; Maji, Samir K.

doi:10.1038/s41467-023-41864-9


Article
Open access
Published: 04 October 2023

Manisha Poudyal¹^na1,
Komal Patel^1,2^na1,
Laxmikant Gadhe¹,
Ajay Singh Sawner¹,
Pradeep Kadu¹,
Debalina Datta¹,
Semanti Mukherjee¹,
Soumik Ray¹,
Ambuja Navalkar¹,
Siddhartha Maiti¹^nAff3,
Debdeep Chatterjee¹,
Jyoti Devi¹,
Riya Bera¹,
Nitisha Gahlot¹,
Jennifer Joseph ORCID: orcid.org/0000-0002-0441-8856¹,
Ranjith Padinhateeri ORCID: orcid.org/0000-0001-5485-1474¹ &
…
Samir K. Maji ORCID: orcid.org/0000-0002-9110-1565^1,2

Nature Communications volume 14, Article number: 6199 (2023)


Abstract

Liquid-liquid phase separation (LLPS) has emerged as a crucial biological phenomenon underlying the sequestration of macromolecules (such as proteins and nucleic acids) into membraneless organelles in cells. Unstructured and intrinsically disordered domains are known to facilitate multivalent interactions driving protein LLPS. We hypothesized that LLPS could be an intrinsic property of proteins/polypeptides but with distinct phase regimes irrespective of their sequence and structure. To examine this, we studied many (a total of 23) proteins/polypeptides with different structures and sequences for LLPS study in the presence and absence of molecular crowder, polyethylene glycol (PEG-8000). We showed that all proteins and even highly charged polypeptides (under study) can undergo liquid condensate formation, however with different phase regimes and intermolecular interactions. We further demonstrated that electrostatic, hydrophobic, and H-bonding or a combination of such intermolecular interactions plays a crucial role in individual protein/peptide LLPS.

Introduction

Liquid-liquid phase separation (LLPS) of biomolecules (proteins/nucleic acids) is now well-established as a ubiquitous phenomenon for the formation of membraneless organelles^1,2,3,4,5,6. These phase-separated, condensed compartments not only support various cellular functions^7,8,9 but are also useful for macromolecular sequestration/storage and cellular signaling/communication^1,9. Although many studies have shown that LLPS can play a vital role in the normal physiological functions of cells^1,2,10,11, it can also be associated with malfunctions^1,6,12,13. Concentrations of protein increase by several orders of magnitude inside the condensate^14,15,16 compared to their endogenous levels. This often leads to toxic protein aggregation and nucleation of amyloid fibril formation associated with various human diseases, such as amyotrophic lateral sclerosis (ALS), Alzheimer’s disease (AD), and Parkinson’s disease (PD)^{1,17,18,19,20,21,22}. It is widely accepted that intra- and inter-molecular interactions driving protein phase separation are embedded in the protein/peptide sequence and the respective structure^1,4,6,14,23. In this context, the conformational properties of intrinsically disordered regions (IDRs), low complexity domains (LCDs), and prion-like domains (PLDs) are known to facilitate multivalent interactions that are prerequisites for condensate formation^{23,24,25,26,27,28}. Further, the specific arrangement of amino acids in protein sequences under various conditions can regulate LLPS^23,27,29,30 through common mechanisms that promote these multivalent interactions (such as electrostatic and cation-π interactions)^{23,25,26,29,31}. The nature of the overall interactions between the proteins makes the condensates responsive to the cellular microenvironment^32,33. By exploiting this knowledge, it is also possible to design artificial peptides/proteins with tunable LLPS properties^8,34,35.

However, emerging evidence indicates that a significant proportion of proteins in the human proteome reside at concentrations just below their respective solubility limit³⁶. The concentration levels not only depend on the extent of endogenous expression of individual proteins; but can also be greatly affected by the efficiency of the protein turnover machinery of the cell. The transition from soluble to LLPS state (reaching the saturation concentration) thus, is not associated with a very high energy barrier^2,33,37,38. Seemingly, alterations such as post-translational modifications, changes in cellular or subcellular localization, the effect of counterions, and metabolites (such as ATP) can significantly modulate the phase behavior of various proteins^{5,31,39,40,41,42,43}. Apart from intrinsically disordered proteins (IDPs), globular proteins (such as lysozyme⁴⁴) are also capable of undergoing LLPS. Since the basis of most supramolecular assemblies (aggregates/precipitates/LLPS/crystals) is the intermolecular interactions, by tuning the extent of such interactions, it is experimentally feasible to explore conditions that drive LLPS of globular proteins as well.

Further, for any multivalent molecule with even the most weakly attractive interactions, the null expectation is that there exists a concentration regime in which self-assembly will occur. While this prediction can be clearly confirmed by theory, simulation, and synthetic polymers, whether it holds true under experimentally accessible conditions for real proteins and polypeptides remains less well-established. Here, we assessed the ability of 23 different proteins/polypeptides with diverse structures/sequences to undergo homotypic phase separation in the presence and absence of the macromolecular crowder PEG-8000. Our results confirm that both folded and disordered proteins can be driven to form dynamic, reversible liquid-like condensates in a concentration-dependent manner. The driving forces and kinetics for assembly vary from protein to protein, and the observed saturation concentrations scale directly with the apparent intermolecular binding constant (K_D). Moreover, our data confirm that a variety of distinct chemical modes can drive phase separation. To further explore these observations, we designed polypeptides based on neutral (Gly), hydrophobic (Val), positively charged (Arg), and negatively charged (Asp) amino acids and observed that even these simple model peptides could undergo pseudo-homotypic phase separation under appropriate solution conditions. Taken together, our results suggest that the ability of a protein or peptide to undergo phase separation under some solution conditions should be the null model for any in vitro system. Our results caution researchers against ascribing functional significance to in vitro assays without considering the physiological relevance of those conditions. In parallel, our results also suggest that the regulation of intracellular phase transitions may be an unavoidable facet of cell biology, regardless of whether the resulting assemblies are functional.

Results

Liquid-liquid phase separation (LLPS) of a diverse library of proteins

To address if LLPS is a generic phenomenon of proteins, we first examined whether a diverse library of proteins could undergo LLPS in vitro in the presence of a molecular crowder (in our case, PEG-8000). We chose this library of proteins from multiple species with varied sequences, structures, and properties (Supplementary Table 1). Also, to exclude the influence of various cellular factors and other parameters such as salt, we have used a 20 mM sodium phosphate buffer (pH 7.4) in the presence of PEG-8000 as a molecular crowder. We first generated the three-dimensional surface image of proteins superimposed with their secondary structures using PYMOL (v 2.5.2) to understand the diversity of structures and distribution of charge (Fig. 1a and Supplementary Fig. 1). We then examined all the protein sequences in silico, using IUPred2A⁴⁵, SMART⁴⁶, CatGranule⁴⁷ and PONDR⁴⁸ for predicting the presence of IDRs, LCDs, their LLPS and disorder propensities, respectively. Our data revealed that a subset of proteins possesses LLPS propensity as well as sequence/s featuring intrinsic disorders/LCD regions. On the other hand, many proteins, such as lysozyme (LYS) and β-lactoglobulin (β-lac) did not exhibit any such features (Supplementary Fig. 2 and Supplementary Table 2). To test whether these proteins can undergo LLPS in vitro, we purified all the proteins using size exclusion chromatography (SEC) and examined for LLPS using fluorescence microscopy (by labeling the proteins with NHS-Rhodamine) in the presence of PEG-8000 (Fig. 1b and Supplementary Fig. 3). To construct the LLPS regimes, proteins at varying concentrations were incubated with different concentrations of PEG-8000 at physiological pH 7.4. We observed that all proteins undergo LLPS at different concentrations, thereby exhibiting a varied phase regime (Fig. 1c and Supplementary Fig. 4a) with different condensate size distributions (Supplementary Fig. 5). 
The integrity of all the proteins was evident from the protein band/s in SDS-PAGE after LLPS (Supplementary Fig. 6).

**Fig. 1: Liquid-liquid phase separation of various proteins in vitro.**

From this phase regime, we further determined the apparent saturation concentration (C_sat) of the proteins in the presence of 10% (w/v) PEG-8000. Here, we define the apparent C_sat as the minimum protein concentration where we observed phase separation under the microscope within 12 h of waiting time. Note that the apparent saturation concentration is abbreviated as C_sat for easy referencing in the subsequent sections. Importantly, we do not know at present the exact saturation concentration for the proteins. However, we think that the C_sat determined using microscopy in our study is not significantly different from the saturation concentration of the proteins as we observe an increase in light scattering measurements (see later) along with microscopic observations.

Proteins such as lactoferrin (LT), γ-globulin (GG), and catalase (CATA) required as low as 1 µM concentration to undergo LLPS, while ubiquitin (Ub), α-Syn and RNase A required a very high protein concentration (≥500 µM) for phase separation (Supplementary Fig. 4a).

To further support our microscopic observations, we calculated the dilute phase concentration after phase separation using the centrifugation method⁴⁹ (Supplementary Fig. 7). For the proteins LT, β-cas, α-amy, LYS and RNase A, the C_sat determined from the microscopic study was approximately consistent with the C_sat calculated from the dilute phase protein concentration. Another subset of proteins (GG, CATA, Tau, Alb, Cyt c, Hb, CA, Mb, α-Syn, Ub) showed a slightly higher C_sat by microscopy than the C_sat estimated through centrifugation. This could be a case of overestimation due to the resolution limit of the grid of the microscopy-based phase regime. However, for Chymo, β-lac, BSA, and α-lac, the centrifugation estimate is higher than the microscopy estimate. This could possibly be due to the relatively smaller size of the condensates, for which the centrifugation speed used may not have been sufficient to separate the dense and dilute phases⁵⁰ (Supplementary Fig. 7).

To further evaluate the kinetics of LLPS, we performed a static light scattering experiment (at 350 nm) on each protein at its respective C_sat in the presence of 10% (w/v) PEG-8000 over time (Fig. 1d). Similar to their phase behavior, the data revealed that the kinetics of LLPS also varied across different proteins (Fig. 1d, e). At the end of the light scattering experiments, condensate formation by the proteins was further verified using differential interference contrast (DIC) microscopy (Supplementary Fig. 4b). The light scattering data were fitted with a sigmoidal growth kinetics model (see Methods) and the t_1/2 for LLPS was determined for all the proteins. The data revealed that many proteins with low C_sat exhibited faster LLPS kinetics than proteins with higher C_sat (Fig. 1e). Overall, the data provide promising evidence in support of LLPS being a generic phenomenon of proteins in a crowded microenvironment.
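As a rough illustration of how a t_1/2 can be read off a sigmoidal scattering trace, the sketch below generates a synthetic curve and locates the time at which the signal reaches half of its total rise. The exact kinetics model used in the study is given in its Methods and is not reproduced here; the generic Boltzmann-type sigmoid, the function names, and all parameter values are assumptions made for illustration only.

```python
import math


def sigmoid(t, y0, y_max, k, t_half):
    """Generic sigmoidal growth curve (assumed form, not the paper's exact model)."""
    return y0 + (y_max - y0) / (1.0 + math.exp(-k * (t - t_half)))


def estimate_t_half(times, signal):
    """Estimate t_1/2 as the time at which the signal crosses the midpoint
    between its minimum and maximum, using linear interpolation."""
    lo, hi = min(signal), max(signal)
    mid = lo + 0.5 * (hi - lo)
    for i in range(1, len(times)):
        y_prev, y_curr = signal[i - 1], signal[i]
        if y_prev < mid <= y_curr:
            frac = (mid - y_prev) / (y_curr - y_prev)
            return times[i - 1] + frac * (times[i] - times[i - 1])
    raise ValueError("signal never crosses its half-maximal value")


# Synthetic trace with hypothetical parameters (time in minutes, arbitrary units).
times = [float(t) for t in range(0, 61)]
trace = [sigmoid(t, y0=0.05, y_max=1.0, k=0.3, t_half=30.0) for t in times]
t_half_est = estimate_t_half(times, trace)
```

For noisy experimental traces, a least-squares fit of the full curve would be more robust than this midpoint interpolation.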

Role of PEG in protein phase separation

PEG is a non-ionic polymer that increases the tendency of proteins to self-assemble into condensates by inducing intermolecular interactions⁵¹. Previous studies indicated that PEG might not be an inert crowder, as it can interact with proteins, and some amino acids have been shown to possess a higher interaction potential with PEG⁵². To examine whether PEG directly participates in liquid condensate formation under the present experimental conditions, we performed phase separation experiments with a subset of proteins in the presence of FITC-labeled PEG (5% FITC-labeled PEG-5000 + 5% PEG-8000). We observed no selective sequestration of PEG inside the protein condensates, whether it was added before or after condensate formation, as evident from the confocal microscopy images (Fig. 2a and Supplementary Fig. 8a, b). Since the fluorescence intensity is linearly proportional to concentration, we calculated the apparent partition coefficient using the fluorescence intensity of PEG inside ($\mathrm{PEG}_{\mathrm{inside}}$) and outside ($\mathrm{PEG}_{\mathrm{outside}}$) of the condensate. For instance, the apparent partition coefficient in the case of β-lac is 0.01 (≈0). Indeed, this was the case for the subset of proteins we tested, confirming that there is no PEG sequestration inside the protein condensates under study. This indicates that condensate formation is mainly driven by protein-protein interactions.
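The partition calculation can be sketched in a few lines. The text does not spell out the formula, so the version below assumes the apparent partition coefficient is the ratio of background-corrected PEG fluorescence inside the condensate to that outside; the function name, the optional background term, and the example intensities are hypothetical.

```python
def apparent_partition_coefficient(i_inside, i_outside, background=0.0):
    """Ratio of PEG fluorescence inside vs. outside a condensate.

    Assumed definition: (I_inside - background) / (I_outside - background).
    Values near 0 would indicate that PEG is not enriched in the dense phase.
    """
    inside = i_inside - background
    outside = i_outside - background
    if outside <= 0:
        raise ValueError("outside intensity must exceed the background")
    return inside / outside


# Hypothetical intensities (arbitrary units) chosen to give a ratio of 0.01.
k_app = apparent_partition_coefficient(i_inside=2.0, i_outside=200.0)
```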

**Fig. 2: Role of PEG in liquid-liquid phase separation of proteins in vitro.**

Further, the effect of PEG polymer length on protein LLPS was evaluated using 10% (v/v) PEG-300, with each protein kept at its respective C_sat. We observed condensate formation by the proteins, but with slower kinetics than in the presence of PEG-8000. A subset of proteins such as GG, Chymo, β-lac and BSA exhibited LLPS after 24 h, while LYS, Alb, RNase A and Ub required 48 h for observable condensate formation (Supplementary Fig. 8c). This suggests that lowering the molecular weight of PEG might increase the time required for condensate formation; however, it does not alter the propensity of the proteins to phase separate in identical buffer conditions.

More importantly, we also examined conditions under which the proteins can phase separate readily in the absence of PEG. For this, we performed the LLPS study using proteins purified by size-exclusion chromatography and tested different conditions (high protein concentration, pH and/or NaCl) for phase separation in the absence of PEG-8000 (Fig. 2b, Supplementary Fig. 8d and Supplementary Table 3). We observed that GG and LT undergo LLPS at high protein concentrations in the absence of PEG. For some proteins, a change in pH (LYS and RNase A) or the addition of salt (BSA, Alb and β-cas) induced condensate formation. However, in the case of α-Syn, both an alteration of pH (pH 5.5) and the addition of salt (1 M) were required to induce condensate formation at a very low protein concentration (10 μM), in contrast to the high concentration (600 μM) required for α-Syn to undergo LLPS in the presence of PEG-8000. Hence, our study indicates that proteins can undergo phase separation in the absence of PEG (crowder), but under different conditions. Therefore, a molecular crowder such as PEG might only facilitate intermolecular interactions via a depletion mechanism and/or an osmotic pressure effect⁵³, whereas protein-protein interactions play the major role in condensate formation.

Finally, we performed LLPS reactions of all proteins using the cytoplasmic extract⁵⁴ of HeLa cells to assess the effect of other biomolecules on protein LLPS under physiological crowding conditions (Fig. 2c and Supplementary Fig. 9a, b). We used NHS-Rhodamine labeled proteins at their respective C_sat for LLPS in the cellular extract. Interestingly, all proteins (except Ub and α-Syn) showed LLPS in the cell extract, although with much larger condensates than the corresponding condensates in the PEG-buffer system (Fig. 2c and Supplementary Fig. 9b). In the case of α-Syn, we observed aggregate formation, and Ub showed no LLPS even after a long incubation, suggesting that more specific conditions might be required for condensate formation by these proteins in cell lysate (Supplementary Fig. 9b). Condensate fusion upon contact and FRAP studies of selected proteins revealed that all these protein condensates in cytoplasmic extract possess liquid-like behavior (Supplementary Fig. 9c–e).

Liquid-like property of the phase separated condensates

Typical characteristics of phase separated condensates include condensate fusion upon contact, temperature reversibility, and rapid fluorescence recovery after photobleaching (FRAP). To examine the dynamic nature of the molecules inside the condensates, we performed FRAP using 10% (v/v) NHS-Rhodamine labeled proteins. At the initial time of condensate formation (0 h), most of the proteins showed rapid recovery of fluorescence (~80-100% recovery) with a short half-life (t_1/2) (< 5 s); while a few proteins showed partial recovery (e.g., LT and GG showed 50-60% recovery) with higher t_1/2 values (>10 s) (Fig. 3a–c and Supplementary Fig. 10a). We hypothesized that extensive intermolecular interactions might result in the viscoelastic transition leading to reduced fluorescence recovery (also supported by their very low C_sat). The liquid-like property of the condensates was further supported by fusion events (Fig. 3d, Supplementary Fig. 10b and Supplementary movie 1, 2) and the dissolution of condensates upon increased temperature (at 45 °C). The protein condensates, however, reappeared upon incubating back to 37 °C (Fig. 3e and Supplementary Fig. 10c), suggesting their thermo-reversible property. To examine whether LLPS is associated with the conformational transition of the proteins, we isolated the dense and dilute phases of all proteins through centrifugation and performed circular dichroism (CD). We observed no substantial change in secondary structure/s upon phase separation (Fig. 3f and Supplementary Fig. 11) as CD spectra of proteins in dense and dilute phases are essentially similar. Note, we performed CD spectroscopy of dense phase proteins after dilution. This is unavoidable for CD study due to very high dynode voltage and light scattering of the original dense phase suspension. Therefore, we further performed FTIR spectroscopy to analyze the secondary structure of the intact dilute and dense phase of the proteins. 
Our deconvoluted FTIR spectra (Supplementary Fig. 12) and secondary structure estimation data (Supplementary Table 4) revealed that the gross secondary structure remains the same after phase separation for all the proteins, with subtle secondary structural changes for a few proteins. The morphology of the liquid condensates formed by the various proteins was further examined using transmission electron microscopy (TEM). The TEM micrographs mostly showed circular protein-rich condensates (Fig. 3g and Supplementary Fig. 10d). The data, therefore, suggest that proteins can form thermo-reversible, liquid condensates without significant alteration in their secondary structures.

**Fig. 3: Liquid-like properties of the various protein condensates.**

Maturation and rigidification of protein condensate over time

The viscoelastic transition of protein condensates is often associated with toxic amyloid fibril formation in various neurodegenerative diseases such as ALS, AD, and PD^17,18,19,20. However, such viscoelastic transition can also help in various cellular functions^5,6,55 including oocyte dormancy (Balbiani body⁵⁶) and heterochromatin assembly^57,58,59. We wanted to investigate whether the condensates formed by the various proteins in our study also undergo rigidification with time. We incubated various protein condensates for 48 h (at 37 °C) and performed FRAP and temperature reversibility (Fig. 4a–d and Supplementary Fig. 13a, b) studies. FRAP analysis of condensates at 48 h revealed substantially slower recovery (higher t_1/2) for most of the proteins compared to freshly formed liquid condensate (0 h) (Fig. 4c). Intriguingly, a few proteins (GG, LT, Tau, α-Syn, β-cas, and CATA) did not recover after photo-bleaching at 48 h (Fig. 4c), suggesting their viscoelastic transition, which might be due to the change in material property because of the extent of intermolecular interaction and protein arrangement inside the condensate. This was also consistent with the thermo-reversibility study as these condensates did not dissolve upon increasing the temperature to 45 °C (Fig. 4d and Supplementary Fig. 13b). To examine the possible structural changes due to rigidification, we performed CD spectroscopy for a subset of proteins (which showed negligible fluorescence recovery after 48 h) (Fig. 4e and Supplementary Fig. 14). The data suggest that except for α-Syn, the rest of the proteins did not undergo substantial structural changes during the viscoelastic transition. The FTIR spectroscopic study of the dense and dilute phase of these proteins further showed that except α-Syn, no other proteins showed significant structural changes upon phase separation and subsequently their viscoelastic transition after 48 h (Supplementary Fig. 15), consistent with CD data. 
To examine whether the loss of dynamicity by any of the proteins was associated with amyloid fibril formation, we performed a ThT (which binds to amyloid aggregates) fluorescence assay¹⁸. The data suggest that except for α-Syn (which binds strongly to ThT, as expected¹⁸), none of the proteins showed any significant ThT binding (Fig. 4f). This indicates that crystal-like native packing/protein vitrification and/or amorphous aggregation might result in their rigidification^1,2,6,22,60. To further characterize the morphology of the condensates after 48 h, we analyzed the condensates using TEM (Fig. 4g and Supplementary Fig. 13c). The TEM images of GG and LT condensates showed a multiphasic nature, as evident from different electron-dense/sparse regions, indicating protein assembly in the condensate (Fig. 4g). We also observed aggregate-like morphology around α-Syn condensates, as previously reported^18,40,61,62 (Fig. 4g). The data suggest that partial or full rigidification might occur for protein condensates upon ageing, with or without structural transition.

**Fig. 4: The viscoelastic transition of protein condensates.**

Correlation of sequence- and structure-specific parameters with C_sat for LLPS of various proteins

We hypothesized that proteins undergo LLPS through different intermolecular interactions based on their surface-exposed charge, hydrophobicity and H-bonding capacity, which arise from the different structural folds and amino acid sequences of the proteins. According to the Flory-Huggins (FH) theory⁶³, the important criteria driving phase separation are (a) the number of residues capable of intermolecular interactions [which is directly proportional to molecular weight (MW)] and (b) their respective interaction strengths. However, this is true for a (semi)-flexible polymer; for a globular protein, the exposed residues scale with the surface area. In the limit of a spherical globule, the radius of gyration is $R_g = B N^{0.33}$ [where N is the number of residues and B is a constant], while the surface area (SA) of a sphere is defined as $SA = 4\pi R_g^{2}$, such that $SA = 4\pi (B N^{0.33})^{2}$, i.e., SA is proportional to $N^{2/3}$ (or, given that all amino acids have approximately the same mass, SA is proportional to $\mathrm{MW}^{2/3}$). We plotted the molecular weight and also (MW)^0.66 of all proteins against the respective C_sat. However, we did not observe any clear correlation between the parameters (Fig. 5a and Supplementary Fig. 16a). In both cases, we see an overall negative trend, but proteins within a similar range of molecular weight show very different C_sat. Achieving a perfect correlation between the sequence-specific quantities and C_sat is unlikely, as C_sat depends on many factors and molecular weight is just one of them. Further, the FH theory best explains the liquid-liquid phase separation of homopolymers and does not account for the complexity of proteins, including sequence variations and electrostatic interactions^64,65. Thus, protein LLPS might not be explained using FH theory by interaction strength alone. 
Various factors, such as protein conformation and its susceptibility to change with concentration, length of the protein, sequence specificity, etc. might also dictate protein LLPS.
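The surface-area scaling argument above can be checked numerically with a short sketch. The function below simply evaluates $SA = 4\pi (B N^{0.33})^{2}$; the function name, the value B = 1 and the residue counts are hypothetical and serve only to show that doubling the chain length increases the exposed area by a factor of about $2^{0.66}$ rather than 2.

```python
import math


def globule_surface_area(n_residues, b=1.0, exponent=0.33):
    """Surface area of a spherical globule with R_g = B * N**exponent.

    b is an arbitrary prefactor (hypothetical value); only ratios between
    proteins are meaningful in this sketch.
    """
    r_g = b * n_residues ** exponent
    return 4.0 * math.pi * r_g ** 2


# Doubling the number of residues (100 -> 200) scales the area by 2**0.66.
ratio = globule_surface_area(200) / globule_surface_area(100)
```

This is why (MW)^0.66, rather than MW itself, is the natural quantity to compare with C_sat for folded proteins.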

**Fig. 5: Intermolecular interactions govern LLPS of all proteins.**

We further derived the solvent-accessible surface area (SASA) using available PDB structure files for the proteins. For intrinsically disordered proteins, we used a solved structure ensemble from the Protein Ensemble Database (proteinensemble.org) and calculated the averaged properties from the structures. We calculated SASA using the SASA algorithm available in the VMD (Visual Molecular Dynamics)⁶⁶ software package. We then plotted the SASA values against the C_sat for all the proteins (Fig. 5b) and observed that the correlations are weak, suggesting that multiple factors determine C_sat. Further, using the solvent-accessible residues, we decomposed each protein into polar, hydrophobic, aromatic, and charged residue contributions and calculated global weighting parameters, which enable a 4-parameter model to be globally fit to the data, i.e., an equation of the form

$$C_{\mathrm{sat}} = A1 \times N_{\mathrm{Polar}} + A2 \times N_{\mathrm{Hydrophobic}} + A3 \times N_{\mathrm{Charge}} + A4 \times N_{\mathrm{Aromatic}} \tag{1}$$

where C_sat is measured, N_* are calculable from sequence and A1-A4 are constants (unit concentration) that can be globally fit. We indeed find that the best-fit parameters led to the relation that predicts the C_sat:

$$\log_{10}\left(C_{\mathrm{sat}}\right) = -0.041 \times N_{\mathrm{Polar}} + 0.016 \times N_{\mathrm{Hydrophobic}} - 0.005 \times N_{\mathrm{Charge}} + 0.026 \times N_{\mathrm{Aromatic}} + 2.24 \tag{2}$$

Here, each quantity ($N_{\mathrm{Polar}}$, $N_{\mathrm{Hydrophobic}}$, $N_{\mathrm{Charge}}$, and $N_{\mathrm{Aromatic}}$) is defined as the effective exposure of the respective residues on the surface. By supplying these four quantities from the structure, one can predict the C_sat using the above equation. Note that we used log (C_sat), as the energy is proportional to the log of concentration. We compared the predicted C_sat to the experimentally determined C_sat (Fig. 5c). Further, we also validated this predictive model using a few other proteins (FUS, TDP43, p53 and HSA), which were not included in our LLPS study. Note that the structures of FUS (AlphaFold ID: AF-P35637-F1) and TDP43 (AlphaFold ID: AF-Q13148-F1) were taken from the AlphaFold database, whereas the structural information for p53 (PDB ID: 8F2H) and HSA (PDB ID: 4LB2) was acquired from the Protein Data Bank. Interestingly, we observed that the C_sat predicted using the model approximately correlated with the experimental C_sat reported in the literature^31,67,68,69. The correlation plot shows an overall positive correlation, with a Pearson correlation coefficient of 0.77. However, the correlation is not perfect. We further tested different sequence-specific parameters and observable properties of the liquid condensates against C_sat; however, we did not find any apparent correlation (Supplementary Fig. 16a–h). This might be due to the fact that the driving forces of phase separation are complex and diverse, and could not be easily extrapolated at present.
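A minimal sketch of how the fitted relation in Eq. (2) could be applied is given below. The coefficients and intercept are taken directly from Eq. (2); the function names, the Pearson helper, and the residue-exposure inputs in the example are hypothetical, and the concentration unit of the predicted C_sat is assumed to match that of the experimental values used in the fit.

```python
import math

# Coefficients and intercept copied from Eq. (2).
COEFFS = {"polar": -0.041, "hydrophobic": 0.016, "charge": -0.005, "aromatic": 0.026}
INTERCEPT = 2.24


def predict_log10_csat(n_polar, n_hydrophobic, n_charge, n_aromatic):
    """Evaluate Eq. (2): log10(C_sat) from effective surface exposures."""
    return (COEFFS["polar"] * n_polar
            + COEFFS["hydrophobic"] * n_hydrophobic
            + COEFFS["charge"] * n_charge
            + COEFFS["aromatic"] * n_aromatic
            + INTERCEPT)


def predict_csat(n_polar, n_hydrophobic, n_charge, n_aromatic):
    """Convert the log10 prediction back to a concentration."""
    return 10.0 ** predict_log10_csat(n_polar, n_hydrophobic, n_charge, n_aromatic)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical exposures, for illustration only (not values from the study).
log_csat_example = predict_log10_csat(10, 10, 10, 10)
```

In practice, `pearson_r` would be applied to the lists of predicted and measured log10(C_sat) values to reproduce the kind of comparison shown in Fig. 5c.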

Role of various intermolecular interactions responsible for protein LLPS

To investigate the role of various inter-molecular interactions (electrostatic, H-bonding and hydrophobic) responsible for protein LLPS, we performed condensate dissolution assay using a sequential titration of NaCl (disrupts electrostatic interaction)^13,32, 1,6 hexanediol (disrupts hydrophobic interaction)^70,71 and urea (disrupts H-bonding and van der Waals interaction)^72,73 (Fig. 5d). We used up to 2 M urea, as this concentration range might not significantly unfold globular proteins^74,75 (Supplementary Fig. 17), rather may break the inter-molecular H-bonding for condensate dissolution. We further hypothesized that the addition of one or a combination of these molecules will disrupt the preformed condensate and thereby reveal the nature of intermolecular interaction responsible for its formation/stabilization. The condensate dissolution was assayed using static light scattering at 350 nm (Fig. 5e) for each protein condensate immediately after its formation. The dissolution of the preformed condensates was further verified using DIC imaging of the LLPS solution at the highest concentration of each additive (Fig. 5f and Supplementary Fig. 18a). The data showed that the preformed condensates of LT and GG were mostly disrupted by the addition of salt ( > 70% decrease in light scattering value). While considerably lower effect was observed on the addition of 1,6 hexanediol and urea in these proteins (Fig. 5g). This suggests that electrostatic interaction is playing a major role in phase separation (or maintaining the phase separated state) of LT and GG. In contrast, the light scattering value of BSA, Alb, α−Syn and β−lac condensates mostly dropped by the addition of 1,6 hexanediol suggesting that hydrophobic interaction played a major role in the formation and/or maintaining these protein condensates. 
Indeed, the ANS binding study (probing the exposed hydrophobic surface⁷⁶) showed an increase in ANS fluorescence for BSA and a moderate increase for Alb, CATA and β−lac, suggesting hydrophobic interactions might play a role in LLPS of these proteins (Supplementary Fig. 18b). On the other hand, urea showed a major impact on the dissolution of Ub and β−cas condensates, indicating that these proteins undergo phase separation majorly by H-bonding and other van der Waals interactions (Fig. 5g). Important to note that in the titration experiments, the second additive and the third additive is not purely in phosphate buffer but in the presence of the previous additive. To rule out the possibility that the sequence of additives in titration experiments might affect the relative contribution of each interaction for LLPS, we performed a condensate dissolution assay using the altered sequence of additives for selected proteins. The data suggest that the sequence of addition does not alter the outcome of intermolecular interactions responsible for condensate formation by dissolution assay (Supplementary Fig. 18c). Further, based on the decrease in light scattering values, we calculated the relative percentage of three major types of interactions responsible for individual protein phase separation. Our data clearly suggest that either or combination of electrostatic, H-bonding and hydrophobic interactions are responsible for condensate formation and stabilization (Fig. 5g). However, the mode/extent of intermolecular interactions may differ depending upon the microenvironment and post-translational modifications of the protein³¹. For instance, in the case of α−Syn, the phosphomimetic mutation, S129E undergoes phase separation faster with a lower C_sat (200 μM) in comparison to the wild-type (Supplementary Fig. 19a). 
Also, upon titrating S129E and wild-type protein with different additives, the nature of intermolecular interaction driving phase separation differs (Supplementary Fig. 19b).
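The conversion of scattering decreases into relative interaction percentages can be sketched as follows. This is only an illustrative sketch: the text does not give the exact formula, so the normalization used here (each additive's drop in scattering divided by the summed drop over the titration) is an assumption, and the function name and the example intensities are hypothetical.

```python
from typing import Dict, List, Tuple


def relative_contributions(initial: float, steps: List[Tuple[str, float]]) -> Dict[str, float]:
    """Estimate relative interaction percentages from a sequential titration.

    initial: scattering intensity of the untreated condensate solution.
    steps:   ordered (interaction label, intensity after that additive) pairs.
    ASSUMPTION: the contribution of an interaction is the drop in scattering
    caused by its additive, normalized to the total drop over all additives.
    """
    drops: Dict[str, float] = {}
    previous = initial
    for interaction, intensity in steps:
        # A rise in scattering is treated as "no dissolution" rather than a negative share.
        drops[interaction] = max(previous - intensity, 0.0)
        previous = intensity
    total = sum(drops.values())
    if total == 0:
        return {name: 0.0 for name in drops}
    return {name: 100.0 * drop / total for name, drop in drops.items()}


# Hypothetical intensities (arbitrary units), not data from the study.
example = relative_contributions(
    initial=100.0,
    steps=[("electrostatic", 25.0), ("hydrophobic", 15.0), ("H-bonding", 10.0)],
)
```

Because the additives are applied sequentially, each drop is measured against the intensity left by the previous additive, which mirrors the titration order described above.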

After predicting that proteins might mostly use either electrostatic or hydrophobic interactions (or a combination of both) for LLPS, we examined the kinetics of LLPS for selected proteins in the presence of NaCl or 1,6-hexanediol using static light scattering (at 350 nm) (Fig. 5h and Supplementary Fig. 18d). Our data showed that LLPS of LT and Chymo (predicted to phase separate through electrostatic interactions) was largely inhibited by the addition of 150 mM NaCl, while the presence of 10% (w/v) 1,6-hexanediol had no effect. In contrast, LLPS of BSA and Alb (which bind ANS owing to their exposed hydrophobic surface) was substantially inhibited by the presence of 10% (w/v) 1,6-hexanediol, but no difference in scattering intensity was observed in the presence of 150 mM NaCl (Fig. 5h and Supplementary Fig. 18d).

Protein-protein interaction strength determines C_sat

To find the correlation between intermolecular interaction strength and the saturation concentration required for protein LLPS (C_sat), we measured homotypic protein-protein interactions using a label-free technique, surface plasmon resonance (SPR) spectroscopy. We selected a subset of proteins based on their C_sat (low, intermediate and high) and immobilized each protein on CM3/CM5 chips. Different concentrations of the respective protein were then passed through the microfluidic channel, enabling interaction with the immobilized protein. We fitted the resultant response curves using a one-state/two-state model and determined the respective K_D values (equilibrium dissociation constant; a lower K_D indicates a higher binding affinity). Our results showed strong protein-protein interaction for GG, Chymo and β−cas (low K_D, < 500 nM), whereas Mb and CA showed an intermediate tendency for homotypic protein-protein interaction, and α−Syn and Ub showed a much lower interaction tendency (high K_D, >60 μM) (Figs. 6a and b). When we plotted K_D against the respective C_sat, we found a strong correlation between the binding affinity of the proteins and their C_sat (Fig. 6c). This suggests that binding affinity/interaction strength determines the tendency and/or saturation concentration required for LLPS⁷⁷. To further delineate how inter-protein interaction strength dictates the C_sat, we performed SPR studies of GG and α-Syn in the presence of 150 mM NaCl and of Ub in the presence of 2 M urea. In the presence of NaCl, GG showed no effective intermolecular interaction (Fig. 6d), which is consistent with the fact that LLPS of GG occurs through electrostatic interaction and is inhibited in the presence of salt. Interestingly, our previous study showed that the presence of salt increases the tendency of α-Syn to phase separate, with a drastic reduction in C_sat⁴⁰. 
We found that the K_D of α-Syn in the presence of NaCl is ~7-fold lower than that of α-Syn without salt, signifying stronger binding, which is also consistent with their respective C_sat. Furthermore, to understand the importance of H-bonding for LLPS, we chose Ub for the determination of interaction strength in the presence and absence of 2 M urea. We indeed detected no measurable intermolecular binding of Ub in the presence of urea (an accurate K_D could not be determined due to a very low response unit), in contrast to the control without urea (Fig. 6d, e). The data suggest that the strength of intermolecular interaction dictates the tendency, feasibility and saturation concentration required for protein LLPS. This might be tightly regulated in the cellular milieu to promote or prevent protein LLPS as needed.
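To illustrate how a K_D can be extracted from SPR responses, the sketch below uses the simplest steady-state 1:1 (Langmuir) binding model, R_eq = R_max·C/(K_D + C), and recovers K_D by a Hanes-Woolf linearization (C/R_eq plotted against C). This is a simplified stand-in and not the kinetic one-state/two-state fitting used in the study; the function names, the concentration series and the R_max value are hypothetical.

```python
from typing import List, Tuple


def langmuir_response(conc: float, k_d: float, r_max: float) -> float:
    """Steady-state response of a 1:1 binding model (conc and k_d in the same unit)."""
    return r_max * conc / (k_d + conc)


def fit_kd_hanes_woolf(concs: List[float], responses: List[float]) -> Tuple[float, float]:
    """Return (K_D, R_max) from a least-squares line through C/R versus C.

    For 1:1 binding, C/R = C/R_max + K_D/R_max, so slope = 1/R_max and
    intercept = K_D/R_max.
    """
    ys = [c / r for c, r in zip(concs, responses)]
    n = len(concs)
    mean_x = sum(concs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in concs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(concs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept / slope, 1.0 / slope


# Hypothetical, noise-free titration in nM with K_D = 500 nM and R_max = 100 RU.
concs_nm = [62.5, 125.0, 250.0, 500.0, 1000.0, 2000.0]
responses_ru = [langmuir_response(c, k_d=500.0, r_max=100.0) for c in concs_nm]
kd_fit, rmax_fit = fit_kd_hanes_woolf(concs_nm, responses_ru)
```

With real sensorgrams, a very low response (as reported for Ub in urea) makes such a fit unreliable, which is consistent with the statement that an accurate K_D could not be determined in that case.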

**Fig. 6: Correlation of homotypic protein-protein interaction and phase separation by proteins.**

Minimalistic peptide-based model determining different intermolecular interactions responsible for phase separation

We hypothesized that if intermolecular interactions are the only necessary prerequisites for phase separation assisted by crowding, then even small polypeptides at optimum concentration can undergo LLPS, however, with different modes of interactions (hydrophobic/electrostatic/H-bonding) dictated by their amino acid sequence (Figs. 7a, 8a). To examine this, we designed a minimalistic model of 10-residue polypeptides [(Gly)₁₀, (Val)₁₀, (Arg)₁₀, and (Asp)₁₀] and characterized them using MALDI and LC-MS (Supplementary Fig. 20). We speculated that the polypeptide (Gly)₁₀ would require a very high concentration for LLPS because glycine, the simplest amino acid, lacks side chains and therefore polyvalency²⁶. Moreover, this peptide would undergo LLPS only via intermolecular H-bonding. In contrast, (Val)₁₀ might interact intermolecularly through hydrophobic interactions, which would facilitate its LLPS. Interestingly, both peptides showed LLPS at high concentrations in 20 mM sodium phosphate buffer, pH 7.4, in the presence of 10% PEG-8000, exhibiting varied phase regimes (Fig. 7b, c). In the presence of 10% PEG-8000, (Gly)₁₀ showed LLPS at peptide concentrations ≥ 2 mM, while (Val)₁₀ showed LLPS at ≥ 1 mM (Fig. 7b–d). The (Gly)₁₀ and (Val)₁₀ condensates were further characterized by fluorescence microscopy using labeled peptides (10% N-terminal NHS-Rhodamine labeled peptide + 90% unlabeled peptide). We observed condensate fusion upon contact and complete fluorescence recovery after photobleaching at both 0 h and 48 h, confirming their liquid-like property (Fig. 7e and Supplementary Fig. 21a, b). Further, the morphology of the condensates, examined using TEM, revealed homogeneous electron density of the condensate state of these polypeptides (Fig. 7f). The data, therefore, suggest that small homo-polypeptides also undergo LLPS but with a relatively high C_sat compared to the other proteins under study. 
This indicates that intermolecular interactions between these polypeptides are much less prevalent compared to large proteins. We investigated the mode of intermolecular interaction responsible for (Gly)₁₀ and (Val)₁₀ LLPS using a preformed condensate dissolution assay similar to that used for proteins (Fig. 7g). The light scattering and DIC microscopy data suggest that the phase separation of (Gly)₁₀ is mainly disrupted by the addition of 2 M urea but not by 1,6-hexanediol or NaCl (Fig. 7g). When similar experiments were performed with (Val)₁₀, the condensates were disrupted only in the presence of 1,6-hexanediol and not in the presence of either urea or NaCl (Fig. 7g). A similar observation was made when we allowed both peptides to form condensates in the presence and absence of the different additives (Fig. 7h, i and Supplementary Fig. 21c). To further examine the contribution of multivalency to C_sat, we chose glycine polypeptides. When the LLPS study was performed with increasing polypeptide length in 20 mM sodium phosphate buffer, pH 7.4, in the presence of 10% PEG-8000, we observed that (Gly)₅, (Gly)₆, (Gly)₇, (Gly)₈ and (Gly)₉ required concentrations of 40 mM, 25 mM, 20 mM, 12 mM and 8 mM, respectively, for their LLPS (Supplementary Fig. 21d, e). Overall, the polymer length and C_sat of glycine polypeptides showed a negative linear correlation (R² value: 0.955), suggesting that a decrease in polypeptide length increases the C_sat and vice versa (Fig. 7j). It is important to note that, as per theory, C_sat is expected to decay exponentially rather than linearly with length (or valence)⁶³. However, within the range of glycine peptide lengths used in this study, we found a linear correlation. An exponential decay of C_sat might become apparent on further increasing the polymer length.
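The length dependence quoted above can be checked with an ordinary least-squares fit. The sketch below uses the concentrations given in the text; including (Gly)₁₀ at 2 mM as a sixth point is an assumption about which points enter the fit, and the helper name is hypothetical. It also fits ln(C_sat) against length as a rough way to compare the linear description with an exponential one.

```python
import math
from typing import List, Tuple


def linear_fit(xs: List[float], ys: List[float]) -> Tuple[float, float, float]:
    """Ordinary least squares for y = slope * x + intercept; returns (slope, intercept, R^2)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy * sxy) / (sxx * syy)
    return slope, intercept, r_squared


# Peptide length (residues) and C_sat (mM) as quoted in the text.
# ASSUMPTION: (Gly)10 at 2 mM is included alongside (Gly)5 to (Gly)9.
lengths = [5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
c_sat_mm = [40.0, 25.0, 20.0, 12.0, 8.0, 2.0]

slope, intercept, r2_linear = linear_fit(lengths, c_sat_mm)
# Exponential decay C_sat = A * exp(-k * length) is linear in ln(C_sat).
_, _, r2_log = linear_fit(lengths, [math.log(c) for c in c_sat_mm])
```

With these six points the linear fit has a negative slope and an R² of roughly 0.95, in line with the value reported above. Over such a narrow length range, a comparison of R² values alone cannot settle whether the underlying dependence is linear or exponential.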

**Fig. 7: Liquid-liquid phase separation of neutral homo polypeptides.**

**Fig. 8: Liquid-liquid phase separation of charged polypeptides.**

In contrast to neutral polypeptides [(Gly)₁₀ and (Val)₁₀], the liquid condensate formation of charged homo-polymers might occur upon neutralization of charged residues^25,34. To examine this, we designed the 10-residue polypeptides (Arg)₁₀ and (Asp)₁₀ (Fig. 8a) and monitored their LLPS behavior in 20 mM sodium phosphate buffer, pH 7.4, in the presence of 10% PEG-8000. Interestingly, both peptides showed condensate formation only in the presence of high salt and at high peptide concentrations (Fig. 8b, c). Our data showed that (Arg)₁₀ and (Asp)₁₀ formed condensates in the presence of 4 M NaCl [with 10% (w/v) PEG-8000] at concentrations of ≥ 2 mM and ≥ 8 mM, respectively (Fig. 8c and Supplementary Fig. 22a). These data suggest that in the charge-neutralized state, poly-Arg might possess higher polyvalency for LLPS than poly-Asp. The liquid nature of the condensates was further characterized by fusion and FRAP studies. (Arg)₁₀ condensates fused upon contact to form larger condensates, and all the polypeptide condensates showed complete fluorescence recovery after photobleaching, confirming their liquid-like property at both 0 h and 48 h (Fig. 8d and Supplementary Fig. 22b, c). To further examine whether intermolecular interactions between oppositely charged polypeptides facilitate LLPS, we monitored the co-LLPS of (Arg)₁₀ and (Asp)₁₀. When the two oppositely charged polypeptides were mixed at their C_sat (Fig. 8e and Supplementary Fig. 22d) as well as at different ratios (Supplementary Fig. 22f), we observed spontaneous phase separation in the absence of salt. We found that the C_sat for co-LLPS was 0.75 mM for both peptides when mixed together, suggesting that charge neutralization favors their co-LLPS (Supplementary Fig. 22f). In identical conditions, however, the individual polypeptides did not show any LLPS (Fig. 8e and Supplementary Fig. 22d). 
Further, the morphology of the condensates was examined using TEM, which revealed homogeneous electron density of the condensate state of these polypeptides (Fig. 8f). We further investigated the nature of the interaction responsible for LLPS of the charged polypeptides by applying the dissolution assay to preformed condensates, monitored using light scattering and DIC imaging (Fig. 8g). We observed that LLPS of (Arg)₁₀ was mainly disrupted by the addition of 20% (w/v) 1,6-hexanediol, whereas dissolution of (Asp)₁₀ condensates was observed on addition of 2 M urea. This suggests that upon charge neutralization, (Arg)₁₀ and (Asp)₁₀ polypeptides undergo LLPS through hydrophobic and H-bonding interactions, respectively. A similar observation was made when the additives were added before phase separation and condensate formation was examined using DIC microscopy (Supplementary Fig. 22e).

Discussion

Increasing evidence underscores the ability of a wide range of proteins to form condensates, either in relation to the formation of membrane-less organelles that perform normal cellular functions^1,7,8,9 or as nucleation centers for protein aggregation^{17,18,19,20,21,22}. LLPS might be tightly regulated based on the protein localization in specific organelles where it performs its native function^{1,5,22,78,79,80} and the presence of DNA/RNA or other co-factors in the cell^{24,38,78,81,82,83,84,85,86}. Recent studies indicated that the condensate state might be a proteome-wide phenomenon³³ and may be considered a fundamental state of proteins⁸⁷, besides the native and the amyloid state^88,89. Also, several recent studies have provided prediction tools and physical frameworks describing the molecular grammar driving condensate formation by a wide range of proteins^23,27,30,90. Although proteins (both folded and disordered) and polypeptides form liquid condensates in a given experimental condition, the underlying driving forces resulting in condensate formation remain unclear. Phase separation is thermodynamically favored when a protein has enough concentration and adequate interaction strength⁸⁷. The thermodynamics of condensate formation is a complex interplay between entropy and enthalpy, where the decrease in entropy (ΔS) due to molecular clustering must be compensated by a favorable change in enthalpy (ΔH), which generally is achieved through intermolecular interactions^64,65. Weak multivalent interactions hold the higher-order molecular arrangement inside the condensate and maintain the liquid-like property. These include cation-π, π-π, charge-dipole and hydrophobic interactions^32,91.

It has been well-documented that low-complexity, intrinsically disordered, or prion-like domains promote LLPS^{23,24,25,26,27,28}. Due to the lack of a specific fold, these domains generally provide more multivalency for intermolecular clustering, a prerequisite for LLPS. Further, intrinsically disordered proteins are also known to bind different partners for functionality through a short segment while a significant stretch of the protein maintains the overall conformational flexibility of the complex⁹². The binding interaction of such fuzzy proteins may also result in protein assemblies responsible for biomolecular condensates^90,92,93. On the other hand, multidomain globular proteins can also facilitate intermolecular contacts due to multiple binding sites (similar to patchy colloids^94,95) and can form liquid condensates under suitable reaction conditions. However, the chances and strength of both specific and non-specific intermolecular interactions can be further enhanced by an increase in protein concentration (or by crowding), which may favor LLPS. In this concentration regime, the intermolecularly ordered or self-assembled state is thermodynamically more favorable than individual protein molecules. At low concentrations, however, proteins are more diffusive in nature with no intermolecular ordering. Thus, the concentration of a protein molecule finely balances the intermolecular interaction and determines the collective behavior of proteins⁸⁷. Indeed, we observed that various proteins with diverse sequences and structures could readily undergo LLPS, however, with a wide variation in their C_sat and kinetics (Fig. 1). This is expected as the nature of amino acids and their patterning^96,97,98 in three-dimensional space would dictate the extent of intermolecular interaction determining the C_sat. The growth kinetic study further indicates that all proteins above the C_sat give rise to a liquid condensate state after a lag time. 
Understanding the nucleation mechanism of LLPS, which is otherwise a thermodynamically uphill process, is crucial and an area of ongoing research. Recently, Martin et al. found a multistep nucleation process prior to detectable LLPS in which small complexes with nanoscale size distributions are formed even at sub-critical protein concentrations⁹⁹. After the initial complex formation, which is an energetically unfavorable process, monomers are recruited into the complex with high affinity to form mesoscale clusters¹⁰⁰, which can further grow by classical homogeneous nucleation. Recent studies have further suggested that at sub-saturation concentrations, some proteins form nanoclusters^50,101. The presence of nanoclusters at sub-saturation concentrations and their increase in size with concentration suggest that many proteins might phase separate through nanocluster formation; such nanoclusters could persist even after macroscopic phase separation, as shown for α-Syn⁵⁰. However, the relationship between nanocluster formation below the saturation concentration and macroscopic phase separation above it is currently not known, although nanocluster formation could be linked with macroscopic condensate formation for a particular protein (and could be encoded by the protein sequence). Apart from nanoclusters, “mesoscopic” liquid-like protein clusters several hundred nanometers in size have been observed for many proteins in different experimental conditions^{102,103,104,105}. Transient complex formation prior to mesoscopic condensate formation has been described as a common feature of several biomolecules¹⁰⁴. 
We speculate that although LLPS of proteins of various structures and sequences is detectable above the saturation concentration, cluster formation might start even below the saturation concentration, and the subsequent growth of the clusters might be modulated by the intermolecular interactions of the proteins, which results in different saturation concentrations for detectable LLPS. However, this needs further investigation. It is important to note that the condensates formed in our case are above the saturation concentration, as the size of the condensates is much larger than the size of the nanoclusters observed below the saturation concentration⁵⁰. Nevertheless, nanocluster formation and the presence of small condensates, which might not be sedimented by ultracentrifugation or be visible under the confocal microscope (limit is 500 nm), might also affect the determination of the saturation concentration.

A major theory that is often used to explain phase separation is the FH theory⁶³. Although FH theory does not account for nanocluster formation, it takes into account the Flory parameter χ, which represents the strength of the monomer-solvent interaction averaged over the protein^64,65. Hence, the implication of possible nanocluster formation by proteins and its effect on C_sat is not clear at present and is beyond the scope of FH theory.
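For orientation, the textbook Flory-Huggins (FH) mean-field expressions can be written down in a few lines. The sketch below is not an analysis performed in this study: it uses the standard lattice form for a polymer of N segments in a monomeric solvent, f/kT = (φ/N) ln φ + (1 − φ) ln(1 − φ) + χφ(1 − φ), and N counts lattice segments, which need not equal the number of residues. All function names are hypothetical.

```python
import math
from typing import Tuple


def fh_free_energy(phi: float, n_segments: float, chi: float) -> float:
    """Flory-Huggins free energy of mixing per lattice site, in units of kT."""
    if not 0.0 < phi < 1.0:
        raise ValueError("phi must lie strictly between 0 and 1")
    return (
        (phi / n_segments) * math.log(phi)
        + (1.0 - phi) * math.log(1.0 - phi)
        + chi * phi * (1.0 - phi)
    )


def fh_curvature(phi: float, n_segments: float, chi: float) -> float:
    """Second derivative of the free energy with respect to phi.

    Negative curvature marks the spinodal (locally unstable) region.
    """
    return 1.0 / (n_segments * phi) + 1.0 / (1.0 - phi) - 2.0 * chi


def fh_critical_point(n_segments: float) -> Tuple[float, float]:
    """Critical volume fraction and critical chi for a chain of n_segments."""
    root_n = math.sqrt(n_segments)
    phi_c = 1.0 / (1.0 + root_n)
    chi_c = 0.5 * (1.0 + 1.0 / root_n) ** 2
    return phi_c, chi_c


phi_c, chi_c = fh_critical_point(100.0)
```

In this mean-field picture, demixing becomes possible only when χ exceeds the critical value, and the critical value falls as N grows, so longer chains phase separate at weaker effective interaction strength. Nanoclusters below saturation are outside this description, as noted above.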

We have shown that both folded and disordered proteins can form reversible, dynamic condensates in a concentration-dependent manner. Interestingly, phase separation by proteins does not require misfolding or a drastic structural transition, suggesting that a high enough concentration (or factors promoting intermolecular interaction) is sufficient for inducing LLPS (Figs. 3 and 5). Consistent with previous studies^17,18,20, most LLPS systems maintain their liquid-like nature; upon aging, however, a subset of proteins indeed shows a certain extent of viscoelastic transition (partial rigidification) (Fig. 4). We found that gradual rigidification does not necessarily coincide with amyloid fibril formation. The viscoelastic transition of liquid condensates might also occur due to crystal-like packing/amorphous aggregation in the dense LLPS milieu^1,2,6,22. This suggests that rigidification of liquid condensates might be specific to proteins with respect to sequence/structure and could preserve the structure (and therefore the function) of most of the proteins^1,2,6,22,60. In this context, mesoscopic clusters of a p53 mutant protein were also reported to serve as essential nucleation sites for higher-order solid condensates, such as misfolded protein aggregates forming amyloid fibrils, apart from macroscopic protein-rich condensates¹⁰⁵.

There might be a possibility that the cellular environment, sequence and structure of protein might dictate protein aggregation either from liquid condensate and/or small/large mesoscopic clusters. Moreover, there can also be a rearrangement of existing molecular machineries and component systems in cells that may give rise to condensates of a few tens of nanometers in diameter^37,106.

Although most of the proteins under study harness a combination of intermolecular interactions for their condensate formation and stability (Fig. 5), it is apparent that the driving forces are complex and diverse, and multiple factors can determine C_sat. Moreover, hydrophobic interaction played an important role in the phase separation of many proteins/peptides under study, consistent with previous studies of protein LLPS^{18,70,107,108}. It seems that H-bonding interaction also promotes phase separation for globular proteins where electrostatic and/or hydrophobic interaction sites are less prevalent. Interestingly, our designed peptide condensate data clearly showed that (Gly)₁₀ and (Asp)₁₀ (in the presence of NaCl) undergo condensate formation through H-bonding interaction and have a much higher C_sat than (Val)₁₀ and/or (Arg)₁₀, where we observed hydrophobic interaction playing a major role. Therefore, proteins/peptides undergoing LLPS through H-bonding require a much higher concentration so that enough interactions are possible for network formation in the confined space for condensate formation.

Further, the C_sat for LLPS was also shown to be strongly correlated with the intermolecular protein-protein interaction strength (Fig. 6c). It is also dictated by the molecular weight (polymer length/amino acid number in proteins) and the nature of amino acid side chains^{6,11,29,36,91}. For example, a stretch of glycine-rich polypeptide with higher polypeptide flexibility and no sidechain polyvalency might decrease the extent of intermolecular interaction²⁶. However, hydrophobic amino acids (such as Val) and aromatic amino acids might increase the interaction strength due to hydrophobic and other interactions (such as cation-π)^23,26,31 when present in proteins. This interaction strength is clearly reflected in C_sat, as (Gly)₁₀ requires double the polypeptide concentration (2 mM) for LLPS in comparison to (Val)₁₀ (1 mM). Further, homopolymers of charged amino acids might not undergo LLPS due to charge repulsion unless their charges are neutralized³⁴. Indeed, our data showed that (Arg)₁₀ and (Asp)₁₀ homopolymers undergo LLPS either in the presence of salt^{24,38,40,81,84} (Fig. 8b) or when they are mixed (Fig. 8e).

It is important to note that in vivo LLPS depends on other factors, for example, the presence of other biomolecules or the microenvironment. It is possible that active cellular processes might modify the crowded milieu and hence maintain protein solubility³⁶. Therefore, the relevance of the present study to in vivo systems is not clear at this point and needs further investigation, since in vivo LLPS most likely involves multi-component systems, to which our single-component experimental conditions might not be applicable. However, the present study indicates that in a given condition, proteins/peptides in general might phase separate, irrespective of in vivo relevance, as in vivo and in vitro conditions might differ significantly.

In conclusion, our study suggests that proteins/polypeptides with different structures and sequences can undergo LLPS, although with different apparent C_sat (Supplementary Fig. 23). The presence of IDRs might provide an advantage in undergoing phase separation as they have higher polyvalency as well as a low structural order, resulting in a substantially greater number of molecular interactions^1,6,14,18,23. However, this phenomenon can be protein-specific (with specific sequence and structure) and might not be applicable to all proteins/peptides in general (Supplementary Fig. 2). Moreover, once a protein undergoes LLPS, its subsequent rigidification might require very high concentration and/or specific interactions. Deregulation of protein quality control and turnover mechanisms in cells might pave the way for aberrant phase transition^1,6,12,109. A similar generic state hypothesis has also been proposed for amyloid fibril¹¹⁰ formation by proteins and polypeptides, with the argument that cellular/subcellular conditions, protein quality control machinery and protein expression/post-translational modification do not allow such a transition in cells. Also, nature has perhaps evolved a ‘negative design’ for proteins, which prevents amyloidogenesis¹¹¹.

Methods

All the reagents and chemicals used for the study were purchased from Sigma (USA) unless mentioned otherwise. The product information of the proteins is provided in Supplementary Tables 1 and 2. NHS-Rhodamine (Catalog no. 46406), and Fluorescein-5 isothiocyanate (FITC) (Catalog no. F1906) were procured from ThermoFisher Scientific (USA). FITC-PEG-COOH, molecular weight 5000 (Catalog no. PHB-3925) was purchased from Creative PEG Works (North Carolina, US). The protease inhibitor cocktail (PIC) was obtained from Roche Applied Science (Catalog no. 05056489001). 1-Hydroxybenzotriazole hydrate (HOBt) (Catalog no. 157260), Triisopropylsilane (TIPS) (Catalog no. 233781), Trifluoroacetic acid (TFA) (Catalog no. T6508), N, N’-Diisopropylcarbodiimide (DIC) (Catalog no. D4781), and Polyethylene glycol molecular weight 300 (Catalog no. 202371-5 G) were purchased from Sigma (USA). N, N-Dimethylformamide (DMF) (Catalog no. 8.22275.2521), Dichloromethane (DCM) (Catalog no. 1.94508.2521), Acetonitrile (ACN) (Catalog no. 60003025001730), and Diethyl ether (Catalog no. 1.07026.0521) were purchased from Merck Millipore. Wang resin (100–200 mesh, 0.7 mmol/ g) (Catalog no. 8.55002), and 4-(Dimethylamino) pyridine (DMAP) (Catalog no. 8.51055) were purchased from Novabiochem (Germany). The polypeptides, pentaglycine (Catalog no. G5755), and hexaglycine (Catalog no. G5630) were purchased from Sigma-Aldrich (USA).

In silico analysis of proteins

The FASTA sequences of all proteins were obtained from UniProt (Supplementary Table 1). These protein sequences were used for various in silico analyses. The online tool IUPred2A⁴⁵ was used to identify the disordered regions of all proteins using the amino acid sequence as input. It provides a score between 0 and 1 for each amino acid residue, which corresponds to the probability of the residue being part of a disordered region. SMART⁴⁶ (Simple Modular Architecture Research Tool) was used to identify and annotate low-complexity regions from the amino acid sequence. The LLPS propensity was predicted using the catGRANULE⁴⁷ algorithm. The propensity score was determined and plotted for all the proteins. PONDR⁴⁸ is an algorithm used for predicting naturally disordered regions. The percentage of disorder for all proteins was determined using PONDR (VLXT predictor) and plotted against C_sat. All the data were plotted using OriginPro 2021 (Origin Lab, USA) software.
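A percentage of disorder can be derived from per-residue predictor scores as sketched below. This is an illustrative sketch only: the 0.5 cutoff is the conventional threshold for scores on a 0 to 1 scale and is an assumption here, the function name is hypothetical, and the example scores are made up rather than taken from any predictor output.

```python
from typing import Sequence


def percent_disordered(scores: Sequence[float], threshold: float = 0.5) -> float:
    """Percentage of residues whose disorder score is at or above the threshold.

    scores:    one score between 0 and 1 per residue, as reported by a
               per-residue disorder predictor.
    threshold: ASSUMED cutoff separating ordered from disordered residues.
    """
    if not scores:
        raise ValueError("scores must contain at least one residue")
    disordered = sum(1 for score in scores if score >= threshold)
    return 100.0 * disordered / len(scores)


# Made-up scores for an eight-residue toy sequence.
toy_scores = [0.10, 0.20, 0.45, 0.55, 0.70, 0.90, 0.30, 0.60]
toy_percent = percent_disordered(toy_scores)
```

The resulting percentage per protein is the kind of quantity that can then be plotted against C_sat.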

Expression and purification of α-synuclein (α-Syn) and Tau protein

α-Syn was expressed and purified using previously established protocols with slight modifications^112,113. Briefly, competent E. coli BL21 (DE3) cells were transformed using cloned plasmid and the expression was induced using isopropyl-β-D-thiogalactoside (IPTG) (1 mM). Following this, the cells were centrifuged at 1699 x g for 30 min at 4 °C. The pellet was resuspended in lysis buffer (50 mM Tris, 10 mM EDTA, 150 mM NaCl) and PIC (Roche) was added to prevent proteolytic cleavage. The cells were further lysed using a probe sonicator (Sonics & Materials Inc.) at 40% amplitude with 3 s ON and 1 s OFF pulse for 10 min. The solution was then heated at 95 °C for 20 min and centrifuged at 8603 x g for 30 min. The supernatant was used for nucleic acid precipitation using 10% streptomycin sulfate (136 μl/ml) and glacial acetic acid (228 μl/ml). The solution was then centrifuged at 8603 x g for 30 min at 4 °C to remove nucleic acid. Following this, the protein precipitation was carried out using saturated ammonium sulfate (equal volume). The solution was kept at 4 °C for 4 h for complete precipitation and centrifuged at 10621 x g for 30 min at 4 °C. The protein was further washed using ammonium sulfate solution (50%) and centrifuged at 10621 x g. Finally, the protein was washed using ammonium acetate (100 mM) and precipitated using ethanol. This step was repeated three times. The solution was centrifuged and the pellet was dissolved in a minimum volume of ammonium acetate (100 mM) and lyophilized. The lyophilized protein was redissolved in 20 mM sodium phosphate buffer, pH 7.4, and further purified using size exclusion chromatography (SEC) in the Q Sepharose column before the LLPS experiment. The purity of the protein was confirmed by SDS-PAGE and Coomassie blue staining method.

Expression of full-length wild-type Tau protein (2N4R isoform containing 441 residues) was carried out by transforming tau/pET29b plasmid (Addgene id 16316) into E. coli BL21 (DE3) competent cells. The expression and purification protocol of Tau protein were similar to α-Syn with minor modifications. Briefly, bacterial cells were grown in the presence of Kanamycin in Luria broth (LB) media at 37 °C to an optical density value between 0.7-1. Protein expression was induced with 1 mM IPTG followed by 4 h incubation at 37 °C in 200 rpm rotation. Cells were harvested by centrifugation and resuspended in 60 ml of lysis buffer (50 mM Tris, 10 mM EDTA, and 150 mM NaCl, 5 mM DTT at pH 8.0). PIC was added to the lysis buffer to prevent proteolytic cleavage. The cells were lysed by sonication (40% amplitude, 3 s ON and 1 s OFF) for 15 min using a probe sonicator (Sonics and Materials Inc., USA) and heat-denatured in hot water at 95 °C for 20 min. Cell debris and other denatured proteins were pelleted down by centrifugation at 10621 x g, 4 °C for 30 min. DNA was precipitated from the supernatant using streptomycin sulfate [10% (w/v)] and glacial acetic acid. After DNA removal, an equal volume of saturated ammonium sulfate was added and incubated at 4 °C overnight for protein precipitation. The solution was centrifuged twice at 15294 x g, 4 °C for 30 min. Pellet was dissolved in 100 mM ammonium acetate and reprecipitated in an equal volume of ethanol. The final pellet was redissolved in a minimum volume of 100 mM ammonium acetate, flash-frozen with liquid nitrogen, and lyophilized. The lyophilized protein powder was stored at -20 °C until used for experiments. The required amount of protein was dissolved in equilibrating buffer (20 mM sodium phosphate buffer, 1 mM DTT) and further purified by size exclusion chromatography in the Q Sepharose column before the experiment. The purity of the protein was confirmed by the standard SDS-PAGE and Coomassie blue staining method.

Size exclusion chromatography (SEC) of proteins

All the commercially purchased and recombinantly expressed/purified (α-Syn and Tau) proteins were dissolved in a filtered 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide). The Superdex 200™ 10/300 SEC column was pre-equilibrated with 3 column volumes of 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide) and the protein solutions were injected into the column. The proteins were isolated and the purity of the protein from SEC was confirmed using SDS-PAGE. The protein concentrations were determined by UV spectroscopy (Jasco V650, Japan) using the Beer-Lambert law (c = A/εl), where c is the molar protein concentration, l is the path length in cm, A is the absorbance at the respective wavelength, and ε is the molar absorption coefficient at the respective wavelength. The absorbance at 280 nm (A₂₈₀) was used to determine the protein concentration for all the proteins except the chromophore-containing proteins Hb (A₄₀₆, ε₄₀₆ = 270548 M⁻¹cm⁻¹)¹¹⁴, Mb (A₄₀₈, ε₄₀₈ = 129000 M⁻¹cm⁻¹)¹¹⁵, Cyt c (A₄₁₀, ε₄₁₀ = 101600 M⁻¹cm⁻¹)¹¹⁶, and CATA (A₄₀₅, ε₄₀₅ = 324000 M⁻¹cm⁻¹)¹¹⁷, whose concentrations were determined using the extinction coefficient of the respective chromophore group.
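The Beer-Lambert calculation described above amounts to a single division. In the sketch below, the extinction coefficients are those quoted in the text, whereas the function name, the 1 cm path length and the example absorbance reading are hypothetical.

```python
# Molar extinction coefficients (M^-1 cm^-1) quoted in the text for the
# chromophore-containing proteins, keyed by protein abbreviation.
CHROMOPHORE_EPSILON = {
    "Hb": 270548.0,    # at 406 nm
    "Mb": 129000.0,    # at 408 nm
    "Cyt c": 101600.0, # at 410 nm
    "CATA": 324000.0,  # at 405 nm
}


def concentration_molar(absorbance: float, epsilon: float, path_cm: float = 1.0) -> float:
    """Beer-Lambert law, c = A / (epsilon * l), returning molar concentration."""
    if epsilon <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return absorbance / (epsilon * path_cm)


# Hypothetical reading: A408 = 0.645 for Mb in a 1 cm cuvette.
mb_conc = concentration_molar(0.645, CHROMOPHORE_EPSILON["Mb"])
mb_conc_um = mb_conc * 1e6  # convert M to micromolar
```

For samples measured after dilution, the result would additionally be multiplied by the dilution factor.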

Solid-phase peptide synthesis

All the peptides were synthesized by 9-fluorenylmethoxycarbonyl (FMOC) chemistry using the manual solid-phase peptide synthesis method¹¹⁸. The synthesis was performed on a scale of 0.20–0.25 mmol on Wang resin. In a typical synthesis, the first amino acid was loaded on Wang resin by dissolving 1 eq. of amino acid and HOBt in DMF, followed by the addition of 1 eq. of DIC and finally a catalytic amount of DMAP (0.1 eq.). The coupling was allowed to proceed for 2-3 h, and the resin was washed several times with DMF and DCM after completion of the reaction. The FMOC group was removed using 25% piperidine in DMF. Subsequent couplings were performed using DIC/HOBt as coupling agents with an equivalent amount of the next amino acid. After the synthesis of the desired length polypeptide, the peptide was cleaved off from the resin using a standard cleavage cocktail, TFA: Phenol: TIPS: water (88:5:2:5). Further, the cleavage solution was transferred into an ice-cold ether solution to precipitate the peptides. After precipitation, the ether solution was evaporated and the peptides were redissolved in ammonium bicarbonate (50 mM).

The synthesized peptides [(Gly)₁₀, (Asp)₁₀, and (Arg)₁₀] were purified using HPLC equipped with a reverse phase-C18 column. A 90 min mobile-phase gradient from 10% ACN/water (0.1% TFA) to 90% ACN/water was run at a flow rate of 1 ml/min. The samples were prepared at a stock concentration of 5 mg/ml, and 200 μl aliquots of the peptide solution were injected using an autosampler injector. The instrument was provided with a UV-Vis detector (dual-wavelength) and absorbance at 195 nm was recorded. The data acquired at 195 nm, where the analytes had maximum molar absorptivity, were used for analysis. Using these parameters, all the synthesized polypeptides were separated. However, (Val)₁₀ was not purified using HPLC, since it exhibits poor solubility in the given HPLC mobile phase gradient system, and was therefore used as synthesized. The purified polypeptides were characterized using ESI LC-MS and MALDI analysis.

Fluorescent labeling of protein/peptide

The NHS-Rhodamine and FITC labeling of protein/peptides was done as per the manufacturer’s protocol (ThermoFisher Scientific, USA). Briefly, 5X molar excess of FITC/rhodamine (dissolved in DMSO) was added to the protein obtained after SEC. For FITC, the mixture was incubated on a magnetic stirrer at 4 °C for 6 h in the dark with slow rotation. For NHS-Rhodamine labeling, the protein mixture was incubated for 2 h at room temperature in the dark with slow stirring. The excess dye was removed by dialysis using different molecular weight cut-off membranes depending upon the molecular weight of the proteins in 20 mM sodium phosphate buffer (pH 7.4) at 4 °C for 48 h, with regular buffer exchange in 6 h intervals. The concentration of the labeled protein was determined as per the manufacturer’s protocol. The polypeptides were labeled as mentioned previously. The excess FITC/rhodamine dye was removed by dialysis in 20 mM sodium phosphate buffer (pH 7.4) for 12 h with regular buffer exchange with 2 h intervals at 4 °C. After dialysis, the labeled polypeptide solution was lyophilized and the concentration was determined by redissolving the dry weight in 20 mM sodium phosphate buffer (pH 7.4). For all experiments, we used 1:10 (v/v) of labeled versus unlabeled protein/polypeptide, unless mentioned otherwise.

In vitro liquid-liquid phase separation of proteins and peptides

For LLPS experiments, acid-treated coverslips were used⁴⁰. To do this, the glass slides and 12 mm coverslips (Blue Star, India) were kept in aqua regia [1:3 (v/v) nitric acid/hydrochloric acid] for 12 h and thoroughly washed with Milli-Q. After every wash, the pH of the Milli-Q was checked until it reached 7.0. The slides and coverslips were air-dried in a laminar air-flow hood under sterile conditions and used for all the subsequent LLPS experiments.

For LLPS, the proteins purified using SEC were used to prepare the reaction mixture at different protein and PEG-8000 concentrations [0%, 5%, 10%, 15% and 20% (w/v)] in 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide) to determine the phase regime. Similarly, LLPS experiments were also performed in the presence of PEG-300 to study the effect of PEG length on phase separation. For this, the SEC-purified proteins at their respective C_sat were used to prepare the reaction mixture in the presence of 10% (v/v) PEG-300 in 20 mM sodium phosphate buffer (pH 7.4). Moreover, proteins were tested for LLPS in the absence of PEG-8000 under different conditions (high concentration, addition of NaCl and/or change in pH) in 20 mM sodium phosphate buffer, pH 7.4. The experiment was repeated two times.

The polypeptides (Arg)₁₀ and (Asp)₁₀ were dissolved in 20 mM sodium phosphate buffer (pH 7.4). For (Val)₁₀ and (Gly)₁₀, 1 mg of the respective polypeptide was dissolved in 20 μl of TFA to obtain a homogenous solution and the volume was adjusted to 50 µl by the addition of 20 mM sodium phosphate buffer (pH 7.4). TFA was removed by nitrogen gas purging and 20 mM sodium phosphate buffer was added to obtain a stock solution of 5 mM for both polypeptides. For LLPS of polypeptides, the reaction mixture at different polypeptide concentrations and PEG-8000 concentrations [0%, 5%, 10%, 15% and 20% (w/v)] in 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide) was prepared to determine the phase regime. The reaction mixture (proteins/polypeptides) was drop-casted on the acid-treated slides and sandwiched with an acid-treated 12 mm glass coverslip (Blue Star, India). The coverslips were sealed using commercially available nail paint. The slides were incubated at 37 °C in a moist chamber and phase separation was monitored using a 63X oil immersion objective in DIC (differential interference contrast) mode and fluorescence mode under a DMi8 microscope (Leica Microsystems, Germany). All the images were obtained at 16-bit depth with 2048 × 2048 pixels resolution unless mentioned otherwise. The images were analyzed using ImageJ (NIH, Bethesda, USA) software.

For co-LLPS of peptides, FITC labeled (Asp)₁₀ was mixed with NHS-Rhodamine labeled (Arg)₁₀ in the presence of PEG-8000 (10% w/v) in 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide) at various peptide concentration ratios. For all the experiments, we used 1:10 (v/v) of labeled versus unlabeled peptides, unless mentioned otherwise. The mixture was drop-casted on acid-treated glass slides and sandwiched with a 12 mm acid-treated coverslip. The coverslip was sealed using commercially available nail paint. These were used during the microscope image acquisition of co-LLPS condensates. The images were processed using ImageJ (NIH, Bethesda, USA) software.

Determination of PEG partitioning into condensates

The partitioning of PEG inside condensates was monitored using FITC-labeled PEG-5000. For LLPS, a subset of NHS-Rhodamine labeled proteins at their respective C_sat were mixed in the presence of 10% PEG (5% (w/v) FITC-labeled PEG-5000 + 5% (w/v) PEG-8000) in 20 mM phosphate buffer (pH 7.4). Similarly, in another case, a subset of proteins at C_sat was allowed to undergo LLPS in the presence of PEG-8000, in which immediately after LLPS, 5% (w/v) FITC labeled PEG-5000 was added to verify PEG partitioning. The condensate formation was observed using a confocal microscope (LSM 780 Zeiss Axio-Observer Z1 microscope (inverted)) equipped with iPlan-apochromat 63X/1.4 NA oil immersion objective and with an appropriate fluorescence channel. The images were obtained with a frame size of 1024 pixels x 1024 pixels with 8 bit-depth unless mentioned otherwise. The images were processed using ImageJ (NIH, Bethesda, USA) software. The experiment was repeated two times.

Thereafter, the apparent partition coefficient of PEG into the condensates was calculated as $\frac{\mathrm{PEG}_{\mathrm{inside}}}{\mathrm{PEG}_{\mathrm{outside}}}$, the ratio of the average fluorescence intensities inside and outside the condensates. For example, in the case of β-lac, the average fluorescence intensity of $\mathrm{PEG}_{\mathrm{inside}}$ was 7.63 (A.U.) and that of $\mathrm{PEG}_{\mathrm{outside}}$ was 404.34 (A.U.). The number of condensates used for fluorescence intensity determination was >50. Therefore, the apparent partition coefficient for β-lac is 7.63 (A.U.)/404.34 (A.U.) ≈ 0.02, i.e., ~0. Similar calculations were done for the other seven proteins and the apparent partition coefficient was ~0.
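As a minimal sketch of this ratio, assuming the per-region mean intensities have already been measured (for example in ImageJ), the calculation could look as follows. The function name is hypothetical, and only the two β-lac averages quoted above are taken from the text.

```python
from statistics import mean


def apparent_partition_coefficient(inside, outside):
    """Ratio of mean fluorescence inside condensates to mean outside."""
    mean_outside = mean(outside)
    if mean_outside == 0:
        raise ValueError("mean outside intensity must be non-zero")
    return mean(inside) / mean_outside


# Average intensities quoted in the text for beta-lac (A.U.). In practice
# each list would hold one value per measured condensate or background ROI.
k_beta_lac = apparent_partition_coefficient([7.63], [404.34])
print(f"apparent partition coefficient: {k_beta_lac:.3f}")
```

A value far below 1, as here, indicates that the labeled PEG is largely excluded from the condensates.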

Fluorescence and confocal microscopy

The in vitro liquid condensate formation for all the NHS-Rhodamine labeled proteins and peptides [1:10 (v/v) labeled to unlabeled protein/peptide] was observed using a DMi8 microscope (Leica Microsystems, Germany) under DIC and fluorescence mode using an appropriate fluorescence channel (560 nm/488 nm) at 16-bit depth with 2048 × 2048 pixels resolution. The FRAP analysis at C_sat of proteins (0 h and 48 h after LLPS) was performed using a laser scanning confocal microscope (LSM 780 Zeiss Axio-Observer Z1 microscope (inverted)) equipped with an iPlan-apochromat 63X/1.4 NA oil immersion objective and an appropriate fluorescence channel (560 nm). The images were obtained with a frame size of 512 × 512 pixels at 8-bit depth unless mentioned otherwise. The images were processed using ImageJ (NIH, Bethesda, USA) software.

Preparation of HeLa cell cytoplasmic extract

HeLa cells (authenticated cell line procured from NCCS cell repository, Pune, India) were used for the preparation of the cytoplasmic extract (CE) for the study as per previously established protocol^54,119. The cell density of 5 × 10⁵/mL was used such that the cell confluency was less than 80%. The cells were washed three times with PBS and were harvested using trypsin. The cell solution was incubated in hypotonic buffer [20 mM Tris-HCl (pH 7.4), 10 mM KCl, 2 mM MgCl₂, 1 mM EGTA, 0.5 mM DTT and 0.5 mM PMSF] for 3 mins at 4 °C to enhance fractionation. NP-40 (0.1%) was added to the cells and incubated for 3 mins for membrane lysis. The cell suspension was then centrifuged at ~1000 x g for 5 mins at 4 °C. The supernatant containing the cytoplasmic extract was collected and centrifuged at ~15,000 x g for 3 mins at 4 °C for removing the debris. The supernatant was collected and dialyzed with 20 mM sodium phosphate buffer (pH 7.4) at 4 °C for 6 h using a 1 kDa cut-off membrane. The total protein concentration was determined using Bradford’s protein estimation assay. For the LLPS study, all the proteins were mixed with CE (10 mg/mL protein concentration) at their respective C_sat using NHS-Rhodamine labeled [10% (v/v) labeled to unlabeled] proteins. The condensate formation was observed using a DMi8 microscope (Leica Microsystems, Germany) under DIC and fluorescence mode using an appropriate fluorescence channel (560 nm) at 16-bit depth with 2048 × 2048 pixels resolution.

Determination of C_sat by centrifugation

The SEC-purified protein samples at approximately three to four times the respective C_sat (determined through microscopic observation) in the presence of PEG-8000 (10% w/v) were incubated in an Eppendorf tube at 37 °C in a moist chamber for LLPS. Immediately after LLPS, the samples were centrifuged at ~150,000 x g for 30 min using an ultracentrifuge (Beckman Coulter Optima™ Max-XP, USA). 20 μL of the supernatant was taken and diluted in 20 mM sodium phosphate buffer (pH 7.4), and the absorbance at 280 nm (A₂₈₀) was measured using UV spectroscopy (Jasco V650, Japan). The absorbance at 280 nm was used for determining the concentration of the dilute phase of all the proteins except the chromophore-containing proteins Hb (A₄₀₆, ε₄₀₆ = 270548 M⁻¹cm⁻¹), Mb (A₄₀₈, ε₄₀₈ = 129000 M⁻¹cm⁻¹), Cyt c (A₄₁₀, ε₄₁₀ = 101600 M⁻¹cm⁻¹), and CATA (A₄₀₅, ε₄₀₅ = 324000 M⁻¹cm⁻¹), whose concentrations were determined using the extinction coefficient of the respective chromophore group. The concentration in the dilute phase after phase separation was considered as C_sat,¹²⁰ which is consistent with the microscopic observation.

Light scattering measurements

The saturation concentration (C_sat) for LLPS of the respective protein sample in the presence of 10% PEG-8000 (w/v) (LLPS-inducing condition) was used for static light scattering (SLS) measurements. The excitation and emission wavelengths were both set at 350 nm and the slit width was kept at 5 nm for both. The measurements were acquired in continuous mode using a spectrofluorometer (JASCO FP 8500, USA). The experiment was performed twice. The light scattering intensity was plotted against time, which resulted in a sigmoidal curve. The data were background corrected, normalized, and fitted using the Boltzmann equation, and t_1/2 was calculated as follows:

$$y={y}_{0}+\frac{{y}_{\max }-{y}_{0}}{1+{e}^{-k(t-{t}_{1/2})}}$$

(3)

Where y is the light scattering intensity at a particular time point, y_max is the maximum light scattering intensity, y₀ is the light scattering intensity at t₀, and k is the fitted rate constant. The data were plotted using OriginPro 2021 (Origin Lab, USA) software. The t_1/2 was determined using Eq. 3 and the graph was plotted using the GraphPad Prism 8 software.
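To make the role of t_1/2 in Eq. 3 concrete, the following sketch evaluates the sigmoid and checks that the curve sits halfway between y₀ and y_max at t = t_1/2. The parameter values are hypothetical and normalized, not fitted values from the study, and the interpolation helper is an added, model-free cross-check rather than the fitting procedure carried out in OriginPro.

```python
import math


def boltzmann(t, y0, y_max, k, t_half):
    """Sigmoid of Eq. 3: y0 + (y_max - y0) / (1 + exp(-k (t - t_half)))."""
    return y0 + (y_max - y0) / (1.0 + math.exp(-k * (t - t_half)))


def half_time_by_interpolation(times, values):
    """First time a rising trace crosses its midpoint (linear interpolation)."""
    low, high = min(values), max(values)
    target = low + 0.5 * (high - low)
    for i in range(len(times) - 1):
        v0, v1 = values[i], values[i + 1]
        if v0 <= target <= v1 and v1 != v0:
            frac = (target - v0) / (v1 - v0)
            return times[i] + frac * (times[i + 1] - times[i])
    return None  # no crossing found


# Hypothetical, already normalized parameters (not values from the study).
y0, y_max, k, t_half = 0.0, 1.0, 0.5, 20.0
times = [float(t) for t in range(41)]
trace = [boltzmann(t, y0, y_max, k, t_half) for t in times]

midpoint = boltzmann(t_half, y0, y_max, k, t_half)  # halfway between y0 and y_max
estimate = half_time_by_interpolation(times, trace)
print(midpoint, estimate)
```

On noisy experimental traces, the interpolated crossing is only a rough starting value; a nonlinear least-squares fit of Eq. 3 gives t_1/2 directly as a parameter.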

To investigate the nature of intermolecular interactions responsible for protein LLPS, a sequential titration assay was performed using static light scattering at 350 nm. To do so, NaCl (disrupts electrostatic interaction), 1,6-hexanediol (disrupts hydrophobic interaction) and urea (disrupts H-bonding and van der Waals forces) stock solutions were prepared. 100 μl of SEC-purified protein samples at their respective C_sat in the presence of 10% (w/v) PEG-8000 were incubated in an Eppendorf tube at 37 °C in a moist chamber for LLPS. Immediately after LLPS (0 h), the samples were sequentially titrated with increasing concentrations of NaCl (50-150 mM) followed by 1,6-hexanediol (2-15% w/v) and urea (0.5-2 M). Note that NaCl was added first to the pre-formed condensates, followed by 1,6-hexanediol and then urea. The 1,6-hexanediol samples therefore already contained NaCl, and the urea samples already contained both NaCl and 1,6-hexanediol. Considering the volume of work, the dissolution assay using a scrambled sequence of the additives was performed only for LT, BSA and β-cas. After the addition of each concentration of the additives, the light scattering was recorded using a spectrofluorometer for 30 s and the value at the 15th second was used for data analysis. For studying the kinetics of LLPS in the presence of additives, the protein samples at their respective C_sat in the presence of 10% (w/v) PEG-8000 and NaCl (150 mM) or 1,6-hexanediol (10% w/v) were used for the measurements. The light scattering intensity was plotted against time in OriginPro 2021 (Origin Lab, USA) software. Two independent experiments were performed for this assay.

Fluorescence Recovery After Photobleaching (FRAP)

For FRAP experiments, NHS-rhodamine labeled [10% labeled and 90% unlabeled (v/v)] protein/peptide mixtures in the presence of PEG-8000 (10% w/v) or cytoplasmic extract at their respective C_sat were incubated in an Eppendorf tube at 37 °C in a moist chamber for LLPS. At different time points (0 h and 48 h), the samples were drop-casted on acid-treated glass slides and covered with a 12 mm acid-treated coverslip. The condensate was bleached and fluorescence recovery was determined using a previously established protocol¹⁸. The experiments were performed using the built-in FRAP module of a Zeiss Axio-Observer Z1 confocal microscope with a 63X oil-immersion objective (NA 1.4). A 561 nm DPSS 561-10 laser (at 100% laser power) was used to bleach the center of the condensate, and two other regions of interest (ROI) with the same diameter were also recorded to determine the background and passive bleaching corrections. The fluorescence intensity after bleaching was simultaneously recorded for all three ROIs using the Zen Pro 2011 (Zeiss, Germany) software provided with the instrument. The images were obtained with a frame size of 512 × 512 pixels at 8-bit depth. The fluorescence recovery data were background corrected, normalized, and fitted using the single exponential recovery function in OriginPro 2021 (Origin Lab, USA) software, and t_1/2 was determined. The equation used for fitting is as follows^17,121,122,123,124:

$$I\left(t\right)=A\left(1-\exp \left(\frac{-t}{\tau }\right)\right)+C$$

(4)

Where, τ is the fluorescence recovery time constant, ‘A’ corresponds to the mobile fraction of the fluorescent probe, and ‘C’ is the Y-intercept of the recovery curve.

The half-time of the recovery (t_1/2) was calculated from,

$${t}_{1/2}=\tau \ln \left(2\right)$$

(5)
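The relation between Eq. 4 and Eq. 5 can be checked numerically: at t = τ ln(2), the exponential term equals 1/2, so the signal has recovered half of the amplitude A above the offset C. The sketch below uses hypothetical parameter values, not values from the study, and the function names are illustrative.

```python
import math


def frap_recovery(t, amplitude, tau, offset):
    """Single exponential recovery of Eq. 4: I(t) = A (1 - exp(-t / tau)) + C."""
    return amplitude * (1.0 - math.exp(-t / tau)) + offset


def recovery_half_time(tau):
    """Half-time of recovery from Eq. 5: t_1/2 = tau * ln(2)."""
    return tau * math.log(2)


# Hypothetical fit results for a normalized recovery curve.
A, tau, C = 0.8, 10.0, 0.1
t_half = recovery_half_time(tau)
intensity_at_half = frap_recovery(t_half, A, tau, C)  # expected: C + A / 2

print(f"t_1/2 = {t_half:.2f} (same time unit as tau)")
print(f"I(t_1/2) = {intensity_at_half:.3f}")
```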

The graph was plotted using the OriginPro 2021 (Origin Lab, USA) software.

Thioflavin T (ThT) fluorescence assay

For the ThT fluorescence assay, 100 µl of unlabeled, SEC-isolated samples of the proteins that showed low fluorescence recovery at 48 h after LLPS (β-cas, CATA, GG, LT, Tau, and α-Syn) were incubated at 37 °C for LLPS (0 h and 48 h). At both time points, the sample was diluted in 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide) to a final concentration of 10 µM and the ThT fluorescence assay was performed. To do so, 1 µl of 1 mM ThT dye (prepared in 10 mM Tris-HCl buffer, pH 8.0, 0.01% sodium azide) was added to the protein samples. ThT fluorescence measurements were recorded using a spectrofluorimeter (JASCO FP 8500, USA) at an excitation wavelength of 450 nm and an emission range of 460-500 nm with a slit width of 5 nm for both excitation and emission. The graph was plotted using the GraphPad Prism 8 software at the emission maximum (λ_max ~ 480 nm) after background correction. The experiment was repeated two times.

Transmission Electron Microscopy (TEM)

TEM analysis was performed for a subset of proteins (proteins that showed substantial rigidification in the FRAP data; α-Syn, LT, GG, Tau, β-cas, and CATA) at their respective C_sat immediately after LLPS (0 h) and after 48 h of incubation. For sample preparation, the coverslip containing the LLPS solution was removed from the slide of the LLPS samples (0 h and 48 h) and was directly transferred onto the EM grid (Electron Microscopy Sciences, USA) and incubated for 5 min. The grids were stained using uranyl formate (1% w/v) for 5 min and excess stain was removed with filter paper. The grids were directly air-dried without any further washes before imaging. Imaging was done using a JEOL field emission gun transmission electron microscope (JEM 2100 F, JEOL, Japan) at 200 kV with 10,000× magnification. The images were recorded digitally using the Gatan microscopy suite® (Gatan, USA).

8-anilino-1-naphthalenesulfonic acid (ANS) binding assay

To determine the extent of the exposed hydrophobic surface of proteins, an ANS fluorescence binding assay was performed. Briefly, 3 µl of 5 mM ANS (prepared in 20 mM phosphate buffer, pH 7.4) was added to 100 µl of each SEC-purified protein sample (10 µM) in the presence of PEG-8000 (10% w/v) in 20 mM sodium phosphate buffer (pH 7.4, 0.01% sodium azide). The mixture was incubated for 5 min in the dark at room temperature. The fluorescence intensity measurements were done using a spectrofluorimeter (JASCO FP 8500, USA) with 370 nm as the excitation wavelength and 400-600 nm as the emission wavelength range. The slit width was set to 5 nm for both excitation and emission. The acquired spectral intensities at the emission wavelength of 475 nm were plotted after background correction using GraphPad Prism 8. The experiment was repeated three times.

Circular dichroism (CD) study

The far-UV circular dichroism spectra for the dilute and dense phase of proteins (with PEG-8000 10% w/v) immediately after LLPS (0 h) and after 48 h incubations were recorded using JASCO-1500 CD spectrophotometer (USA) in a 0.1 cm microcuvette (Hellma Forest Hills, NY). The samples were prepared by high-speed centrifugation, which resulted in the separation of two different phases (dilute phase monomeric protein and the dense liquid condensate)⁴⁹. After removing most of the upper dilute phase, a small amount of the solution at the bottom of the tube was taken and diluted to 200 µl for CD measurements. Note, the dilution of the dense phase protein is unavoidable for CD due to very high dynode voltage and light scattering in the CD of the original dense phase suspension. The spectra were recorded for the wavelength range of 260-198 nm at 20 °C with a scanning speed of 200 nm/min. Three accumulations for each sample were acquired and the experiment was done in duplicate. The buffer subtraction and smoothing of the data were done as per the manufacturer’s instructions. The data was plotted using KaleidaGraph software.

Fourier-transform infrared (FTIR) spectroscopy

FTIR spectroscopy was performed to determine the secondary structure of the proteins. All proteins which are purified from SEC were incubated at 37 °C for LLPS in the presence of 10% (w/v) PEG-8000. After phase separation, the dilute and dense phases of proteins were separated using high-speed centrifugation. The dilute and dense phases of the proteins without dilution were spotted on the KBr pellet and were subsequently dried under IR (infra-red) lamp. Vertex 80 FTIR system equipped with a DTGS detector (Bruker, Leipzig, Germany) was used to record the spectra in the range of 1800–1500 cm⁻¹. Each spectrum was recorded using an average of 32 scans at a resolution of 4 cm⁻¹. Fourier self-deconvolution (FSD) method was used to deconvolute the spectra corresponding to the wavenumbers 1700 − 1600 cm⁻¹ ¹²⁵. The Lorentzian curve fitting procedure was employed to fit the spectra using Opus-65 software (Bruker, Leipzig, Germany) as per the manufacturer’s instruction. The FTIR spectroscopy was also done after 48 h of phase separation for the selected proteins, which showed low FRAP recovery (β-cas, CATA, GG, LT, Tau, and α-Syn). The data was plotted using KaleidaGraph software. The experiments were performed twice with similar observations. The statistical significance was calculated using a two-tailed t-test (95% confidence interval) with p-values, p < 0.001, p < 0.002, p < 0.033, and p > 0.12 indicated by (***), (**), (*) and (ns), respectively.

Protein sequence analysis and correlation plot parameters

The molecular weight, positively charged and aromatic residues of the proteins were determined using the Expasy ProtParam tool from the protein sequences obtained from Uniprot (Supplementary Table 1). Solvent-accessible surface area (SASA) was calculated using an existing SASA algorithm present in the VMD (Visual Molecular Dynamics)⁶⁶ software package.

For determining the C_sat using the predictive model, we computed the solvent-accessible surface area (SASA) using available PDB structure files for the proteins. For IDPs, we used solved structure ensembles from the Protein Ensemble Database (proteinensemble.org) and calculated the averaged SASA values from the structure ensembles. The structures of FUS (AlphaFold ID: AF-P35637-F1) and TDP43 (AlphaFold ID: AF-Q13148-F1) were obtained from the AlphaFold database, whereas the structural information of p53 (PDB ID: 8F2H) and HSA (PDB ID: 4LB2) was obtained from the Protein Data Bank. The amino acids were grouped into four categories, namely aromatic (WFYH), polar (PTSNQ), charged (RKED), and hydrophobic (ACGILMV). The effective exposure of each residue (amino acid) was calculated as the ratio of the exposed surface area (from SASA calculations) to the maximum possible exposed surface area of that amino acid. The maximum solvent-accessible surface area for each amino acid was taken from Tien et al.¹²⁶. For each amino acid, the effective exposure lies in the range of 0 to 1, where 1 means the amino acid is completely solvent accessible and 0 means the amino acid is buried within the protein and is not solvent accessible. Then, we computed the net exposure for each category (aromatic, polar, charged, and hydrophobic) by summing the effective exposure of every residue in the respective category. Once the net exposure of aromatic, polar, charged and hydrophobic residues was obtained for the individual proteins, we fitted these data using the following model.
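The per-category summation described above can be sketched as follows. The residue categories are those given in the text; the function name, the input format, the clipping of ratios at 1, and in particular the maximum-SASA numbers are assumptions made for illustration (the numbers are placeholders, not the published scale, which should be substituted in real use).

```python
# Residue categories as defined in the text (one-letter codes).
CATEGORIES = {
    "aromatic": set("WFYH"),
    "polar": set("PTSNQ"),
    "charged": set("RKED"),
    "hydrophobic": set("ACGILMV"),
}


def net_exposure(residues, max_sasa):
    """Sum the effective exposure (SASA / max SASA) per residue category.

    residues: iterable of (one_letter_code, sasa) pairs, one per residue.
    max_sasa: dict mapping one-letter code to its maximum possible SASA.
    """
    totals = {name: 0.0 for name in CATEGORIES}
    for code, sasa in residues:
        # Assumption: clip at 1 so each residue contributes between 0 and 1.
        exposure = min(sasa / max_sasa[code], 1.0)
        for name, members in CATEGORIES.items():
            if code in members:
                totals[name] += exposure
                break
    return totals


# Placeholder maximum SASA values (illustrative only, not the published scale).
toy_max_sasa = {"W": 200.0, "S": 100.0, "K": 200.0, "A": 100.0}
# Hypothetical per-residue SASA values, as would come from a SASA calculation.
toy_residues = [("W", 100.0), ("S", 25.0), ("K", 150.0), ("A", 0.0), ("A", 50.0)]

exposure_by_category = net_exposure(toy_residues, toy_max_sasa)
print(exposure_by_category)
```

For an ensemble of structures, the same function could be applied to each conformer and the resulting totals averaged.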

$$\log ({\mathrm{C}}_{\mathrm{sat}})=\mathrm{A}\times {\mathrm{N}}_{\mathrm{Polar}}+\mathrm{B}\times {\mathrm{N}}_{\mathrm{Hydrophobic}}+\mathrm{C}\times {\mathrm{N}}_{\mathrm{Charge}}+\mathrm{D}\times {\mathrm{N}}_{\mathrm{Aromatic}}+\mathrm{E}$$

(6)

Here each quantity ($N_{\mathrm{Polar}}$, $N_{\mathrm{Hydrophobic}}$, $N_{\mathrm{Charge}}$ and $N_{\mathrm{Aromatic}}$) is defined as the summation of the effective exposure of the respective residues on the surface, and C_sat is the saturation concentration at which the protein undergoes phase separation, as obtained from experiments. To find the parameters A, B, C, D and E, we performed a multiple regression analysis to fit the model to the available data for the proteins under this study (the data being $N_{\mathrm{Polar}}$, $N_{\mathrm{Hydrophobic}}$, $N_{\mathrm{Charge}}$, $N_{\mathrm{Aromatic}}$ and C_sat). We used the scikit-learn package in Python to perform the multiple regression and obtained the five unknown parameters (A, B, C, D and E). Thus, by supplying the quantities $N_{\mathrm{Polar}}$, $N_{\mathrm{Hydrophobic}}$, $N_{\mathrm{Charge}}$, and $N_{\mathrm{Aromatic}}$ from the structure of a protein, one can predict the approximate C_sat using the above equation.
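A minimal sketch of such a fit is given below. It uses ordinary least squares through NumPy rather than scikit-learn, which is a substitution made here to keep the example dependency-light; the function names are hypothetical, and the data are synthetic numbers generated from made-up coefficients purely to show that the fit recovers them. No values from the study are used, and the base of the logarithm is left to whoever prepares the `log_csat` input.

```python
import numpy as np


def fit_csat_model(exposures, log_csat):
    """Least-squares fit of Eq. 6.

    exposures: rows of [N_Polar, N_Hydrophobic, N_Charge, N_Aromatic].
    log_csat:  log(C_sat) for each protein (same row order).
    Returns the array [A, B, C, D, E].
    """
    x = np.asarray(exposures, dtype=float)
    design = np.column_stack([x, np.ones(len(x))])  # last column fits E
    coeffs, *_ = np.linalg.lstsq(design, np.asarray(log_csat, dtype=float), rcond=None)
    return coeffs


def predict_log_csat(coeffs, exposure_row):
    """Evaluate Eq. 6 for one protein."""
    return float(np.dot(coeffs[:4], exposure_row) + coeffs[4])


# Synthetic demonstration data (NOT from the study).
rng = np.random.default_rng(0)
demo_exposures = rng.uniform(5.0, 50.0, size=(12, 4))
demo_coeffs = np.array([0.02, -0.05, 0.01, -0.08, 2.0])  # made-up A, B, C, D, E
demo_log_csat = demo_exposures @ demo_coeffs[:4] + demo_coeffs[4]

fitted = fit_csat_model(demo_exposures, demo_log_csat)
prediction = predict_log_csat(fitted, demo_exposures[0])
print(np.round(fitted, 4), round(prediction, 4))
```

With real data, the number of proteins is small relative to the five parameters, so the fitted coefficients should be read together with their uncertainty.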

All the graphs were plotted using the OriginPro 2021 (Origin Lab, USA) software.

Surface plasmon resonance (SPR) analysis

The homotypic protein-protein interactions were determined using surface plasmon resonance (SPR) spectroscopy (BIAcore T200, GE Healthcare). SEC-isolated proteins were immobilized on the CM5 or CM3 sensor chip, where immobilization levels of ~800-1000 response units were achieved for all proteins. The same protein at increasing concentrations (GG: 25-500 nM, β-cas: 30 nM-2 μM, Chymo: 0.1-1 μM, CA: 5-80 μM, Mb: 10-60 μM, α-Syn: 5-100 μM and Ub: 25-250 μM) was injected into the microfluidic channel. To determine the effect of NaCl on GG and α-Syn, we immobilized the respective protein on the sensor chip and allowed the same protein to pass over the chip in the presence of 150 mM salt. A similar experiment was done with Ub in the presence of 2 M urea. The contact time and dissociation time for the protein samples were set as follows: GG (120 s and 360 s), β-cas (90 s and 400 s), Chymo (50 s and 120 s), CA (90 s and 300 s), Mb (45 s and 120 s), α-Syn (90 s and 300 s), Ub (60 s and 360 s). The contact time and flow rate for regeneration were set as follows: GG (30 s and 40 μl/min), β-cas (60 s and 50 μl/min), Chymo (20 s and 20 μl/min), CA (30 s and 25 μl/min), Ub (30 s and 35 μl/min). The response unit for the blank run (20 mM sodium phosphate buffer, pH 7.4) was used as a baseline and was subtracted from the response unit of the protein samples. The resultant response unit for the protein samples after the blank correction was fitted with the two-state model for all proteins except α-Syn and Chymo (one-state model, using the SPR module). From the respective fitted sensorgram, the dissociation constant (K_D) was determined using Biacore T200 software. The response unit curve was plotted with respect to time using OriginPro 2021 (Origin Lab, USA) software.
The statistical significance was calculated using a two-tailed t-test (95% confidence interval) with p-values, p < 0.001, p < 0.002, p < 0.033, and p > 0.12 indicated by (***), (**), (*) and (ns), respectively.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The authors declare that all the data supporting the findings of this study are available within the paper and in supplementary information files. All the data analysis was performed using published tools and packages and has been cited in the paper and supplementary information text. PDB (Protein Data Bank) IDs and PED (Protein Ensemble Database) IDs used in our study are available on the PDB and PED servers. PDB ID: 1B0L, 1QG5, 3V03, 1REX, 1FS3, 8F2H, 4LB2. PED ID: e001 (https://proteinensemble.org/entries/PED00017). Source data are provided with this paper.

References

Shin, Y. & Brangwynne, C. P. Liquid phase condensation in cell physiology and disease. Science 357, eaaf4382 (2017).
Hyman, A. A., Weber, C. A. & Jülicher, F. Liquid-Liquid Phase Separation in Biology. Annu. Rev. Cell Dev. Biol. 30, 39–58 (2014).
Lyon, A. S., Peeples, W. B. & Rosen, M. K. A framework for understanding the functions of biomolecular condensates across scales. Nat. Rev. Mol. Cell Biol. 22, 215–235 (2021).
Alberti, S. Phase separation in biology. Curr. Biol. 27, R1097–R1102 (2017).
Brangwynne, C. P. et al. Germline P granules are liquid droplets that localize by controlled dissolution/condensation. Science 324, 1729–1732 (2009).
Banani, S. F., Lee, H. O., Hyman, A. A. & Rosen, M. K. Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 18, 285–298 (2017).
Mitrea, D. M. & Kriwacki, R. W. Phase separation in biology; functional organization of a higher order. Cell Commun. Signal. 14, 1–20 (2016).
Ray, S. & Maji, S. K. Predictable phase-separated proteins. Nat. Chem. 12, 787–789 (2020).
Li, P. et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 483, 336–340 (2012).
Franzmann, T. M. et al. Phase separation of a yeast prion protein promotes cellular fitness. Science 359, eaao5654 (2018).
Franzmann, T. M. & Alberti, S. Protein Phase Separation as a Stress Survival Strategy. Cold Spring Harb. Perspect. Biol. 11, a034058 (2019).
Alberti, S. & Hyman, A. A. Are aberrant phase transitions a driver of cellular aging? BioEssays 38, 959–968 (2016).
Lin, Y., Protter, D. S. W., Rosen, M. K. & Parker, R. Formation and Maturation of Phase-Separated Liquid Droplets by RNA-Binding Proteins. Mol. Cell 60, 208–219 (2015).
Wang, J. et al. A Molecular Grammar Governing the Driving Forces for Phase Separation of Prion-like RNA Binding Proteins. Cell 174, 688–699 (2018).
Dada, S. T. et al. Spontaneous nucleation and fast aggregate-dependent proliferation of α-synuclein aggregates within liquid condensates at neutral pH. Proc. Natl. Acad. Sci. USA 120, (2023).
McCall, P. M. et al. Quantitative phase microscopy enables precise and efficient determination of biomolecular condensate composition. bioRxiv https://doi.org/10.1101/2020.10.25.352823 (2020).
Patel, A. et al. A Liquid-to-Solid Phase Transition of the ALS Protein FUS Accelerated by Disease Mutation. Cell 162, 1066–1077 (2015).
Ray, S. et al. α-Synuclein aggregation nucleates through liquid–liquid phase separation. Nat. Chem. 12, 705–716 (2020).
Ambadipudi, S., Biernat, J., Riedel, D., Mandelkow, E. & Zweckstetter, M. Liquid–liquid phase separation of the microtubule-binding repeats of the Alzheimer-related protein Tau. Nat. Commun. 8, 275 (2017).
Wegmann, S. et al. Tau protein liquid–liquid phase separation can initiate tau aggregation. EMBO J. 37, e98049 (2018).
Galkin, O., Chen, K., Nagel, R. L., Hirsch, R. E. & Vekilov, P. G. Liquid–liquid separation in solutions of normal and sickle cell hemoglobin. Proc. Natl. Acad. Sci. 99, 8479–8483 (2002).
Alberti, S. & Hyman, A. A. Biomolecular condensates at the nexus of cellular stress, protein aggregation disease and ageing. Nat. Rev. Mol. Cell Biol. 22, 196–213 (2021).
Martin, E. W. et al. Valence and patterning of aromatic residues determine the phase behavior of prion-like domains. Science 367, 694–699 (2020).
Maharana, S. et al. RNA buffers the phase separation behavior of prion-like RNA binding proteins. Science 360, 918–921 (2018).
Pak, C. W. et al. Sequence Determinants of Intracellular Phase Separation by Complex Coacervation of a Disordered Protein. Mol. Cell 63, 72–85 (2016).
Bremer, A. et al. Deciphering how naturally occurring sequence features impact the phase behaviours of disordered prion-like domains. Nat. Chem. 14, 196–207 (2022).
Borcherds, W., Bremer, A., Borgia, M. B. & Mittag, T. How do intrinsically disordered protein regions encode a driving force for liquid–liquid phase separation? Curr. Opin. Struct. Biol. 67, 41–50 (2021).
Hughes, M. P. et al. Atomic structures of low-complexity protein segments reveal kinked β sheets that assemble networks. Science 359, 698–701 (2018).
Brangwynne, C. P., Tompa, P. & Pappu, R. V. Polymer physics of intracellular phase transitions. Nat. Phys. 11, 899–904 (2015).
Ruff, K. M. et al. Sequence grammar underlying the unfolding and phase separation of globular proteins. Mol. Cell 82, 3193–3208.e8 (2022).
Qamar, S. et al. FUS Phase Separation Is Modulated by a Molecular Chaperone and Methylation of Arginine Cation-π Interactions. Cell 173, 720–734 (2018).
Nott, T. J. et al. Phase Transition of a Disordered Nuage Protein Generates Environmentally Responsive Membraneless Organelles. Mol. Cell 57, 936–947 (2015).
Hardenberg, M., Horvath, A., Ambrus, V., Fuxreiter, M. & Vendruscolo, M. Widespread occurrence of the droplet state of proteins in the human proteome. Proc. Natl. Acad. Sci. 117, 33254–33262 (2020).
Fisher, R. S. & Elbaum-Garfinkle, S. Tunable multiphase dynamics of arginine and lysine liquid condensates. Nat. Commun. 11, 4628 (2020).
Dzuricky, M., Rogers, B. A., Shahid, A., Cremer, P. S. & Chilkoti, A. De novo engineering of intracellular condensates using artificial disordered proteins. Nat. Chem. 12, 814–825 (2020).
Vecchi, G. et al. Proteome-wide observation of the phenomenon of life on the edge of solubility. Proc. Natl. Acad. Sci. 117, 1015–1020 (2020).
Narayanan, A. et al. A first order phase transition mechanism underlies protein aggregation in mammalian cells. Elife 8, e39695 (2019).
Riback, J. A. et al. Composition-dependent thermodynamics of intracellular phase separation. Nature 581, 209–214 (2020).
Patel, A. et al. ATP as a biological hydrotrope. Science 356, 753–756 (2017).
Sawner, A. S. et al. Modulating α-Synuclein Liquid–Liquid Phase Separation. Biochemistry 60, 3676–3696 (2021).
Feric, M. et al. Coexisting Liquid Phases Underlie Nucleolar Subcompartments. Cell 165, 1686–1697 (2016).
Kaiser, T. E., Intine, R. V. & Dundr, M. De novo formation of a subnuclear body. Science 322, 1713–1717 (2008).
Molliex, A. et al. Phase Separation by Low Complexity Domains Promotes Stress Granule Assembly and Drives Pathological Fibrillization. Cell 163, 123–133 (2015).
Taratuta, V. G., Holschbach, A., Thurston, G. M., Blankschtein, D. & Benedek, G. B. Liquid-liquid phase separation of aqueous lysozyme solutions: effects of pH and salt identity. J. Phys. Chem. 94, 2140–2144 (1990).
Mészáros, B., Erdős, G. & Dosztányi, Z. IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res. 46, W329–W337 (2018).
Letunic, I. & Bork, P. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 46, D493–D496 (2017).
Bolognesi, B. et al. A Concentration-Dependent Liquid Phase Separation Can Cause Toxicity upon Increased Protein Expression. Cell Rep. 16, 222–231 (2016).
Xue, B., Dunbrack, R. L., Williams, R. W., Dunker, A. K. & Uversky, V. N. PONDR-FIT: A meta-predictor of intrinsically disordered amino acids. Biochim. Biophys. Acta - Proteins Proteomics 1804, 996–1010 (2010).
Milkovic, N. M. & Mittag, T. Determination of Protein Phase Diagrams by Centrifugation. in Intrinsically Disordered Proteins: Methods and Protocols (eds. Kragelund, B. B. & Skriver, K.) 685–702 (Springer US, 2020). https://doi.org/10.1007/978-1-0716-0524-0_35.
Ray, S. et al. Mass photometric detection and quantification of nanoscale α-synuclein phase separation. Nat. Chem. 1–11 (2023) https://doi.org/10.1038/s41557-023-01244-8.
Annunziata, O. et al. Effect of polyethylene glycol on the liquid-liquid phase transition in aqueous protein solutions. Proc. Natl. Acad. Sci. USA. 99, 14165–14170 (2002).
Knowles, D. B. et al. Chemical Interactions of Polyethylene Glycols (PEGs) and Glycerol with Protein Functional Groups: Applications to Effects of PEG and Glycerol on Protein Processes. Biochemistry 54, 3528–3542 (2015).
Cohen, J. A., Podgornik, R., Hansen, P. L. & Parsegian, V. A. A phenomenological one-parameter equation of state for osmotic pressures of PEG and other neutral flexible polymers in good solvents. J. Phys. Chem. B 113, 3709–3714 (2009).
Freibaum, B. D., Messing, J., Yang, P., Kim, H. J. & Taylor, J. P. High-fidelity reconstitution of stress granules and nucleoli in mammalian cellular lysate. J. Cell Biol. 220, e202009079 (2021).
Kato, M. & McKnight, S. L. A Solid-State Conceptualization of Information Transfer from Gene to Message to Protein. Annu. Rev. Biochem. 87, 351–390 (2018).
Boke, E. et al. Amyloid-like Self-Assembly of a Cellular Compartment. Cell 166, 637–650 (2016).
Larson, A. G. & Narlikar, G. J. The Role of Phase Separation in Heterochromatin Formation, Function, and Regulation. Biochemistry 57, 2540–2548 (2018).
Falk, M. et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature 570, 395–399 (2019).
Strickfaden, H. et al. Condensed Chromatin Behaves like a Solid on the Mesoscale In Vitro and in Living Cells. Cell 183, 1772–1784 (2020).
Halfmann, R. A glass menagerie of low complexity sequences. Curr. Opin. Struct. Biol. 38, 18–25 (2016).
Hardenberg, M. C. et al. Observation of an α-synuclein liquid droplet state and its maturation into Lewy body-like assemblies. J. Mol. Cell Biol. 13, 282–294 (2021).
Stender, E. G. P. et al. Capillary flow experiments for thermodynamic and kinetic characterization of protein liquid-liquid phase separation. Nat. Commun. 12, 7289 (2021).
Leibler, L. Theory of Microphase Separation in Block Copolymers. Macromolecules 13, 1602–1617 (1980).
Flory, P. J. Thermodynamics of high polymer solutions. J. Chem. Phys. 10, 51–61 (1942).
Huggins, M. L. Some Properties of Solutions of Long-chain Compounds. J. Phys. Chem. 46, 151–158 (1942).
Humphrey, W., Dalke, A. & Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 14, 33–38 (1996).
Schmidt, H. B., Barreau, A. & Rohatgi, R. Phase separation-deficient TDP43 remains functional in splicing. Nat. Commun. 10, 1–14 (2019).
Safari, M. S. et al. Anomalous Dense Liquid Condensates Host the Nucleation of Tumor Suppressor p53 Fibrils. iScience 12, 342–355 (2019).
Patel, C. K., Singh, S., Saini, B. & Mukherjee, T. K. Macromolecular Crowding-Induced Unusual Liquid-Liquid Phase Separation of Human Serum Albumin via Soft Protein-Protein Interactions. J. Phys. Chem. Lett. 13, 3636–3644 (2022).
Krainer, G. et al. Reentrant liquid condensate phase of proteins is stabilized by hydrophobic and non-ionic interactions. Nat. Commun. 12, 1085 (2021).
Kroschwald, S., Maharana, S. & Simon, A. Hexanediol: a chemical probe to investigate the material properties of membrane-less compartments. Matters 3, 201702000010 (2017).
Zou, Q., Habermann-Rottinghaus, S. M. & Murphy, K. P. Urea effects on protein stability: Hydrogen bonding and the hydrophobic effect. Proteins Struct. Funct. Bioinforma. 31, 107–115 (1998).
Auton, M., Holthauzen, L. M. F. & Bolen, D. W. Anatomy of energetic changes accompanying urea-induced protein denaturation. Proc. Natl. Acad. Sci. 104, 15317–15322 (2007).
Klotz, I. M. Equilibrium constants and free energies in unfolding of proteins in urea solutions. Proc. Natl. Acad. Sci. USA. 93, 14411–14415 (1996).
Guinn, E. J. & Marqusee, S. Exploring the Denatured State Ensemble by Single-Molecule Chemo-Mechanical Unfolding: The Effect of Force, Temperature, and Urea. J. Mol. Biol. 430, 450–464 (2018).
Campioni, S. et al. A causative link between the structure of aberrant protein oligomers and their toxicity. Nat. Chem. Biol. 6, 140–147 (2010).
Dignon, G. L., Zheng, W., Best, R. B., Kim, Y. C. & Mittal, J. Relation between single-molecule properties and phase behavior of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA 115, 9929–9934 (2018).
Sabari, B. R. et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science 361, eaar3958 (2018).
Reber, S. et al. The phase separation-dependent FUS interactome reveals nuclear and cytoplasmic function of liquid–liquid phase separation. Nucleic Acids Res. 49, 7713–7731 (2021).
Sabari, B. R., Dall’Agnese, A. & Young, R. A. Biomolecular Condensates in the Nucleus. Trends Biochem. Sci. 45, 961–977 (2020).
Zhang, H. et al. RNA Controls PolyQ Protein Phase Transitions. Mol. Cell 60, 220–230 (2015).
Hnisz, D., Shrinivas, K., Young, R. A., Chakraborty, A. K. & Sharp, P. A. A Phase Separation Model for Transcriptional Control. Cell 169, 13–23 (2017).
Klosin, A. et al. Phase separation provides a mechanism to reduce noise in cells. Science 367, 464–468 (2020).
Ditlev, J. A., Case, L. B. & Rosen, M. K. Who’s In and Who’s Out—Compositional Control of Biomolecular Condensates. J. Mol. Biol. 430, 4666–4684 (2018).
Roden, C. & Gladfelter, A. S. RNA contributions to the form and function of biomolecular condensates. Nat. Rev. Mol. Cell Biol. 22, 183–195 (2021).
Larson, A. G. et al. Liquid droplet formation by HP1α suggests a role for phase separation in heterochromatin. Nature 547, 236–240 (2017).
Fuxreiter, M. & Vendruscolo, M. Generic nature of the condensed states of proteins. Nat. Cell Biol. 23, 587–594 (2021).
Chiti, F. & Dobson, C. M. Protein Misfolding, Amyloid Formation, and Human Disease: A Summary of Progress Over the Last Decade. Annu. Rev. Biochem. 86, 27–68 (2017).
Poudyal, M., Sakunthala, A., Mukherjee, S., Gadhe, L. & Maji, S. K. Phase separation and other forms of α-Synuclein self-assemblies. Essays Biochem. 66, 987–1000 (2022).
Hatos, A., Monzon, A. M., Tosatto, S. C. E., Piovesan, D. & Fuxreiter, M. FuzDB: A new phase in understanding fuzzy interactions. Nucleic Acids Res. 50, D509–D517 (2022).
Burke, K. A., Janke, A. M., Rhine, C. L. & Fawzi, N. L. Residue-by-Residue View of In Vitro FUS Granules that Bind the C-Terminal Domain of RNA Polymerase II. Mol. Cell 60, 231–241 (2015).
Fuxreiter, M. Fuzzy protein theory for disordered proteins. Biochem. Soc. Trans. 48, 2557–2564 (2020).
Freiberger, M. I., Wolynes, P. G., Ferreiro, D. U. & Fuxreiter, M. Frustration in Fuzzy Protein Complexes Leads to Interaction Versatility. J. Phys. Chem. B 125, 2513–2520 (2021).
Li, W. et al. Charge-induced patchy attractions between proteins. J. Phys. Chem. B 119, 503–508 (2015).
Cai, J. & Sweeney, A. M. The Proof Is in the Pidan: Generalizing Proteins as Patchy Particles. ACS Cent. Sci. 4, 840–853 (2018).
Lin, Y. H., Forman-Kay, J. D. & Chan, H. S. Sequence-Specific Polyampholyte Phase Separation in Membraneless Organelles. Phys. Rev. Lett. 117, 1–6 (2016).
Ruff, K. M., Harmon, T. S. & Pappu, R. V. CAMELOT: A machine learning approach for Coarse-grained simulations of aggregation of block-copolymeric protein sequences. J. Chem. Phys. 143, (2015).
Chan, H. S. & Dill, K. A. Solvation: Effects of molecular size and shape. J. Chem. Phys. 101, 7007–7026 (1994).
Martin, E. W. et al. A multi-step nucleation process determines the kinetics of prion-like domain phase separation. Nat. Commun. 12, (2021).
Vekilov, P. G. Nucleation. Cryst. Growth Des. 10, 5007–5019 (2010).
Kar, M. et al. Phase-separating RNA-binding proteins form heterogeneous distributions of clusters in subsaturated solutions. Proc. Natl. Acad. Sci. USA. 119, 1–12 (2022).
Vorontsova, M. A., Chan, H. Y., Lubchenko, V. & Vekilov, P. G. Lack of Dependence of the Sizes of the Mesoscopic Protein Clusters on Electrostatics. Biophys. J. 109, 1959–1968 (2015).
Li, Y., Lubchenko, V., Vorontsova, M. A., Filobelo, L. & Vekilov, P. G. Ostwald-like ripening of the anomalous mesoscopic clusters in protein solutions. J. Phys. Chem. B 116, 10657–10664 (2012).
Chan, H. Y. & Lubchenko, V. A mechanism for reversible mesoscopic aggregation in liquid solutions. Nat. Commun. 10, (2019).
Yang, D. S. et al. Mesoscopic protein-rich clusters host the nucleation of mutant p53 amyloid fibrils. Proc. Natl. Acad. Sci. USA. 118, (2021).
Rajeev, P. et al. Nanoscale regulation of Ca2+ dependent phase transitions and real-time dynamics of SAP97/hDLG. Nat. Commun. 13, 4236 (2022).
Murthy, A. C. et al. Molecular interactions underlying liquid−liquid phase separation of the FUS low-complexity domain. Nat. Struct. Mol. Biol. 26, 637–648 (2019).
Reichheld, S. E., Muiznieks, L. D., Keeley, F. W. & Sharpe, S. Direct observation of structure and dynamics during phase separation of an elastomeric protein. Proc. Natl. Acad. Sci. 114, E4408–E4415 (2017).
Mateju, D. et al. An aberrant phase transition of stress granules triggered by misfolded protein and prevented by chaperone function. EMBO J. 36, 1669–1687 (2017).
Dobson, C. M. The structural basis of protein folding and its links with human disease. Philos. Trans. R. Soc. London. Ser. B, Biol. Sci. 356, 133–145 (2001).
Richardson, J. S. & Richardson, D. C. Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation. Proc. Natl. Acad. Sci. 99, 2754–2759 (2002).
Volles, M. J. & Lansbury, P. T. Relationships between the Sequence of α-Synuclein and its Membrane Affinity, Fibrillization Propensity, and Yeast Toxicity. J. Mol. Biol. 366, 1510–1522 (2007).
Singh, P. K. et al. Curcumin modulates α-Synuclein aggregation and toxicity. ACS Chem. Neurosci. 4, 393–407 (2013).
Prahl, S. Optical absorption of hemoglobin. Oregon Med. Laser Cent. 15, 770–774 (1999).
Loreto, S. et al. Hydration and Confinement Effects on Horse Heart Myoglobin Adsorption in Mesoporous TiO2. J. Phys. Chem. C 122, 23393–23404 (2018).
Oellerich, S., Lecomte, S., Paternostre, M., Heimburg, T. & Hildebrandt, P. Peripheral and Integral Binding of Cytochrome c to Phospholipids Vesicles. J. Phys. Chem. B 108, 3871–3878 (2004).
Samejima, T. & Yang, J. T. Reconstitution of Acid-denatured Catalase. J. Biol. Chem. 238, 3256–3261 (1963).
Condron, M. M., Monien, B. H. & Bitan, G. Synthesis and Purification of Highly Hydrophobic Peptides Derived from the C-Terminus of Amyloid β-Protein. Open Biotechnol. J. 2, 87–93 (2008).
Senichkin, V. V., Prokhorova, E. A., Zhivotovsky, B. & Kopeina, G. S. Simple and Efficient Protocol for Subcellular Fractionation of Normal and Apoptotic Cells. Cells 10, (2021).
Ceballos, A. V., McDonald, C. J. & Elbaum-Garfinkle, S. Methods and Strategies to Quantify Phase Separation of Disordered Proteins. Methods Enzymol. 611, 31–50 (2018).
Aumiller, W. M. J., Pir Cakmak, F., Davis, B. W. & Keating, C. D. RNA-Based Coacervates as a Model for Membraneless Organelles: Formation, Properties, and Interfacial Liposome Assembly. Langmuir 32, 10042–10053 (2016).
Elbaum-Garfinkle, S. et al. The disordered P granule protein LAF-1 drives phase separation into droplets with tunable viscosity and dynamics. Proc. Natl. Acad. Sci. 112, 7189–7194 (2015).
Banani, S. F. et al. Compositional Control of Phase-Separated Cellular Bodies. Cell 166, 651–663 (2016).
Taylor, N. O., Wei, M.-T., Stone, H. A. & Brangwynne, C. P. Quantifying Dynamics in Phase-Separated Condensates Using Fluorescence Recovery after Photobleaching. Biophys. J. 117, 1285–1300 (2019).
Yang, H., Yang, S., Kong, J., Dong, A. & Yu, S. Obtaining information about protein secondary structures in aqueous solution using Fourier transform IR spectroscopy. Nat. Protoc. 10, 382–396 (2015).
Tien, M. Z., Meyer, A. G., Sydykova, D. K., Spielman, S. J. & Wilke, C. O. Maximum allowed solvent accessibilites of residues in proteins. PLoS One 8, (2013).

Acknowledgements

We acknowledge the IIT Bombay central facilities and SAIF for access to the TEM, FTIR, LC-MS, MALDI, SPR and confocal microscopy facilities. We thank Prof. Sudipta Maiti, TIFR, Mumbai, for helping us use the HPLC facility, and Prof. Guruswamy Kumaraswamy, IIT Bombay, Mumbai, Prof. M. Muthukumar, University of Massachusetts, Amherst, and Prof. Roland Riek, ETH Zurich, for reading the manuscript and for their valuable suggestions. S.K.M. acknowledges DST-SUPRA (SPR/2021/000103), DST-SERB (File no. CRG/2019/001133), and DBT-Basic Science (File no. BT/PR22749/BRB/10/1576/2016) for financial support. M.P. acknowledges DST-INSPIRE, Government of India, for the fellowship.

Author information

Siddhartha Maiti
Present address: Department of Bioengineering, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh, 466114, India
These authors contributed equally: Manisha Poudyal, Komal Patel.

Authors and Affiliations

Department of Biosciences and Bioengineering, IIT Bombay, Powai, Mumbai, 400076, India
Manisha Poudyal, Komal Patel, Laxmikant Gadhe, Ajay Singh Sawner, Pradeep Kadu, Debalina Datta, Semanti Mukherjee, Soumik Ray, Ambuja Navalkar, Siddhartha Maiti, Debdeep Chatterjee, Jyoti Devi, Riya Bera, Nitisha Gahlot, Jennifer Joseph, Ranjith Padinhateeri & Samir K. Maji
Sunita Sanghi Centre of Aging and Neurodegenerative Diseases, IIT Bombay, Powai, Mumbai, 400076, India
Komal Patel & Samir K. Maji

Contributions

M.P., K.P., L.G., A.S.S., P.K., S.M., S. Maiti, D.C., J.D., R.B., and N.G. performed the in vitro experiments. J.J. performed the in silico study. M.P., K.P., D.D., S.R., and A.N. participated in analyzing the data. The study was conceived by S.K.M. and designed by S.K.M., R.P., M.P., and K.P. P.K. prepared the illustration. S.K.M., M.P., K.P., S.R., and R.P. participated in the writing of the manuscript.

Corresponding author

Correspondence to Samir K. Maji.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Deepak Nair and the other, anonymous, reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary movie 1

Supplementary movie 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Poudyal, M., Patel, K., Gadhe, L. et al. Intermolecular interactions underlie protein/peptide phase separation irrespective of sequence and structure at crowded milieu. Nat Commun 14, 6199 (2023). https://doi.org/10.1038/s41467-023-41864-9

Received: 29 October 2022
Accepted: 21 September 2023
Published: 04 October 2023
DOI: https://doi.org/10.1038/s41467-023-41864-9
