The role of directional interactions in the designability of generalized heteropolymers

Cardelli, Chiara; Bianco, Valentino; Rovigatti, Lorenzo; Nerattini, Francesca; Tubiana, Luca; Dellago, Christoph; Coluzza, Ivan

doi:10.1038/s41598-017-04720-7

Download PDF

Article
Open access
Published: 10 July 2017

The role of directional interactions in the designability of generalized heteropolymers

Chiara Cardelli¹,
Valentino Bianco¹,
Lorenzo Rovigatti^1,2,
Francesca Nerattini¹,
Luca Tubiana¹,
Christoph Dellago¹ &
…
Ivan Coluzza¹

Scientific Reports volume 7, Article number: 4986 (2017) Cite this article

2003 Accesses
19 Citations
Metrics details

Subjects

An Author Correction to this article was published on 12 March 2018

This article has been updated

Abstract

Heteropolymers are important examples of self-assembling systems. However, in the design of artificial heteropolymers the control over the single chain self-assembling properties does not reach that of the natural bio-polymers, and in particular proteins. Here, we introduce a sufficiency criterion to identify polymers that can be designed to adopt a predetermined structure and show that it is fulfilled by polymers made of monomers interacting through directional (anisotropic) interactions. The criterion is based on the appearance of a particular peak in the radial distribution function, that we show being a universal feature of all designable heteropolymers, as it is present also in natural proteins. Our criterion can be used to engineer new self-assembling modular polymers that will open new avenues for applications in materials science.

Population-based heteropolymer design to mimic protein mixtures

Article Open access 08 March 2023

Polymers with controlled assembly and rigidity made with click-functional peptide bundles

Article 30 October 2019

Emergence of low-symmetry foldamers from single monomers

Article 20 November 2020

Introduction

Gaining control of self-assembly processes is key for the generation of smart materials, with applications ranging from energetics¹, to photonic crystals² and biomimetic scaffolding³. Among the most promising and versatile classes of systems that undergo self-assembly are polymers composed of monomers of different species, also known as heteropolymers^4,5,6,7. Remarkable examples are natural self-assembling systems such as DNA⁸, RNA⁹ and, in particular, proteins^10,11. Their extremely rich self-assembling behaviour is mainly controlled by the sequence of chemically different building blocks, or monomers, that compose the polymer. The collection of all possible monomer types is the alphabet of the system. For proteins, different sequences of the same alphabet of 20 amino acids lead to the huge number of proteins expressed in nature, making the alphabet extremely versatile. Indeed, it is the specific sequence that drives a heteropolymer to uniquely collapse (fold) into a target conformation, providing precise control over the resulting structure.

Artificial heteropolymers synthesis and manipulation is at an advanced stage, as demonstrated by the numerous materials based on block copolymers⁴. Typically, block copolymers are synthesised from two monomer types (AB copolymers), but there are examples of linear chains synthesis with an alphabet size of 4 (ABCD block copolymers)^6,7 or star polymers with up to 7 types¹². The different types are organized in patterns of blocks of the same repeated monomer, from two up to five blocks⁴, and by controlling the relative length of the blocks it is possible to realize rich phase diagrams^{4,5,13,14,15,16,17,18,19}).

However, it is extremely difficult to drive the folding of a single chain towards a very specific structure, especially with the large variety of different structures and the high accuracy, fractions of inter-monomer distances, obtained by biopolymers^{11,20,21,22,23} that is necessary for many applications (e.g. catalysts). Recently, efforts to overcome this limitation have been made^{24,25,26,27,28}, in particular by Khokhlov et al.²⁹, where copolymer chains have been iteratively “painted” to undergo a hydrophobic-hydrophilic micro-phase separation and by Moreno et al.³⁰, who have engineered specific hydrophilic-hydrophilic patterns to produce a similar micro-phase separation. However, contrary to biopolymers, these two methodologies do not provide control over the detailed shape of the target structure.

According to mean field theories (MFT)^31,32,33 it should be possible to construct artificial heteropolymers that, similarly to proteins, drive the collapse of specific sequences into given target structures. The identification of such sequences is normally referred to as design and, in what follows, we define designability as the property of a heteropolymer to have at least one heterogeneous sequence that reliably folds into a given stable target structure³³. A clear review of the MFT mentioned above can be found in the seminal works of Pande et al.³³, who showed that the designability of a heteropolymer increases with the alphabet size and decreases with the conformational entropy per bead. Thus, the designability of a heteropolymer can be enhanced following two strategies: i) by increasing the alphabet size as in Go-models³⁴, where each amino acid can specifically interact only with a subset of residues and, ii) by decreasing the configurational entropy per particle, i.e. reducing the number of possible configurations.

A straightforward way to adopt the second strategy is represented by designing lattice heteropolymers^{32,35,36,37,38,39}, in which the number of possible configurations (thus the configurational entropy) is reduced by the topology of the lattice.

In order to adopt the second strategy for off-lattice polymers a possibly effective way is to introduce directional interactions, i.e. interactions that depend on the mutual orientations of the monomers, in addition to the heterogeneous isotropic interactions. In this way, the conformational entropy per bead decreases because of constraints on the mutual orientations. In fact, the directionality of the interactions introduces a frustration in the system that “pre-sculpts” the configurational space of the system, which will contain just a fraction of the total compact configurations. Another effective way to reduce the conformational entropy is to increase the chain stiffness. An important contribution in this direction is represented by the “tube protein model”, developed by Maritan and co-workers^{40,41,42,43,44}, in which control of the conformational free-energy landscape is achieved by tuning the total hydrophobicity and the chain stiffness of short polymers, which can be made designable in some specific cases^41,43. Natural proteins take advantage of both routes by exploiting the directionality of the hydrogen bonds and the geometry of the backbone^42,43,44,45. From an experimental standpoint, in fact, decreasing the conformational entropy via these two routes is more feasible then increasing the alphabet like in Go-Models, which might be the reason why directional interactions (such as hydrogen bonds) are so important in biology.

In this work, we show that the presence of directional interactions in a generalised heteropolymer allows for the realisation of compact structures with functionalized regions on their surfaces with an accuracy comparable to natural catalysts, i.e. of the order of a fraction of amino acids distances. To give an idea of the huge potential of such an approach, in Fig. 1 we present simulation snapshots of heteropolymers that self-assemble with high-accuracy into predetermined structures: two spherical objects with patterned (functionalized) surfaces. The sequences we design are able to make the two heteropolymers reliably fold into a Janus-like (top) and a triblock-Janus-like (bottom) object with DRMSD (distance root mean square displacement see Eq. 3) of 0.36 and 0.25 with respect to the target structure which correspond to an RMSD (root mean square displacement see Eq. 4) of 0.72 and 0.25 with respect to the target structure, respectively. Depending on the chemical nature of the functionalization, such folded structures could exhibit catalytic functions or be used as building blocks in a hierarchical self-assembling process⁴⁶. Of course, these applications would also require a precise control of inter-chain interactions in order to avoid amorphous aggregations⁴⁷.

However, in order to exploit the folding of artificial heteropolymers for applications, we ought to first address the following question: “Can we predict a priori whether an engineered heteropolymer is designable?” In other words, if one engineers heteropolymer architectures that differ in terms of number, geometry or nature of the directional interaction, is it possible to predict a priori which ones are designable (i.e., for which ones one can find at least one specific sequence that drives the system to fold into a given target structure)?

Here we show that the answer to this question is represented by a distinct peak in the radial distribution function dominating over the random packing of the heteropolymers. A minimal set of directional interactions is an effective way to induce such a peak (Fig. 2), which guarantees the designability (Fig. 3).

Results and Discussion

Our working hypothesis is that the addition of directional interactions will result in designable chains as previously suggested by results obtained for the “patchy polymer” model^48,49 and the “caterpillar” protein model^23,50. In the following we will use two heteropolymer models with different chain stiffness (see Methods): the freely-rotating chain (FRC), marked by open locks in the figures, and the freely-jointed chain (FJC), marked by closed locks in the figures. Briefly, the “chemical character” of each monomer is given by a isotropic interaction potential, represented as a simple square-well like shape, with a different pre-factor (more or less attractive or repulsive) for every different pair of monomer types. The directional interaction between the patches is homogeneous along the chain and is the potential derived by Irbäck et al.⁵¹, commonly used to model hydrogen bonds (see Methods). By changing the number of patches per monomer we will probe the emergence of designability starting from a standard heteropolymer (no patches) up to chains of monomers with 10 patches. The number of patches unambiguously identifies their arrangement on their surface, which is chosen so as to yield the most symmetrical geometry (on the poles for two patches, equispaced on the equator for three patches, on a tetrahedron for four patches, etc). The bottom of Fig. 3 shows all the patch arrangements employed.

Strictly speaking, proving that a heteropolymer with a certain number of patches is not designable would require testing all possible sequences for the presence of a unique collapsed equilibrium structure. Here we rely on the statistical definition of designability, where the methodology of the Monte Carlo simulations SEEK, DESIGN and FOLD (SDF) was proven to be able to discriminate between designable and not-designable structures⁴⁹. Briefly, the SDF method: (i) seeks the most designable target structure through an extensive sampling of the heteropolymer conformations and sequences (Supp. Mat. Fig. 1); (ii) designs the sequence that should optimally fold into the target structure; (iii) tests whether the designed sequence correctly folds into the target structure. If the folding fails with the most favourable target structure, the heteropolymer with a certain number of patches is labelled as non-designable. More details can be found in Sec. Methods 1. Here we apply the SDF method to chains made of 50 monomers. From now on, all the distances will be in units of the bead radius R_bead and the energies in units of k_BT_Ref, where T_Ref is a reference temperature that sets the scale of interactions.

In Fig. 3 we show the results for the designability for all cases studied. The systems without directional interactions (patches) are not designable for any alphabet. By adding directional interactions, the heteropolymer becomes designable for a wide range of patch numbers and alphabet sizes, both for the FRC and FJC models. The emergence of designability coincides with the appearance of a peak in the pair radial distribution function—highlighted with the grey band in Fig. 2a)—located at the distance $r\simeq 3$, at which the patch-patch interaction is most favourable. The presence of such an isolated intermediate peak between the first ($r\simeq 2$) and second random close packing neighbours ($r\simeq 4$) indicates that the directional interactions are inducing a geometrical frustration in the system. The frustration strongly biases towards a subset of compact conformations. In fact, in the non-designable case without patches, the directional interaction peak is not present. For one patch, the directional interaction peak is present but is lower than the peak corresponding to the close packing. These chains with one patch are not always designable (borderline), and their designability depends on the alphabet size q (Fig. 3). Increasing the number of patches the directional interaction peak becomes dominating over the random packing peak, making the chains designable in a broad interval of q (3 to 20). Increasing further the number of patches up to 10, the random packing peak increases again, splits and moves towards higher distances (Supp. Mat. Fig. 4) due to the self-avoiding of the many patches and also the system loses directionality: this corresponds in turn to a loss of designability (Fig. 3). It is important to stress that the strength of the directional interaction is such that even the chains with one patch are always maximally bonded, implying that arbitrarily increasing the relative strength of the directional interactions will not suppress the first peak of the g(r) (Supp. Mat. Fig. 1). Thus, when the patch-patch peak dominates over the random close packing peak, the system becomes robustly designable in a broad interval of alphabet sizes q, both for the FRC and FJC models.

According to the MFT³³, given an alphabet of size q, a system is designable when the sequence entropy per particle ln(q) exceeds the conformational entropy per particle ω. Hence, the designable–to–not-designable transition, where ω ~ ln(q), allows us to estimate ω. Following the diagram in Fig. 3, we estimate ω to be >ln(20) for 0 patches, ~ln(5) for one patch in the freely rotating chain, and <ln(3) for up to 6 patches. At 10 patches ω increases again to a value between ln(3) and ln(20). Since e^ω is related to the number of compact conformations, the latter is reduced by the patches by approximately an order of magnitude. A more precise evaluation of ω requires the study of intermediate values of ln(q) which cannot be achieved simply changing the alphabet.

The presence of the directional interaction peak is a fundamental fingerprint of designability. Indeed, we find it to be a general feature also of natural proteins. In Fig. 2b we show the radial distribution function for some characteristic examples out of 145 analysed high resolution protein structures from the Protein Data Bank. Here, the conformational space is shaped by the directionality of the hydrogen bonds, which forces the carbon C_α of the amino acids to be at the typical distances for the different types of secondary structure. Hence, the peaks analysed in Fig. 2b are equivalent to the directional interaction peak highlighted in grey in the patchy polymers. Since proteins are precisely designable because of the peculiar hydrogen-bond geometry of the backbone, we expect the latter to be an ideal template for general heteropolymers and patchy-polymers.

These results are in agreement with MFT^31,32,33, which predicts that the decrease of conformational entropy allows for better sequence design. Here, we introduce the presence of a peak in the g(R) not related to the random packing as an estimate of the reduction of the conformational entropy, in order to have a simple tool to anticipate the designability of different polymer architectures. Thus, we propose the presence of such a peak as a general criterion to engineer designable polymers able to fold into unique target structures, with the same accuracy and versatility of natural proteins. The geometry of the protein skeleton is a particular choice, and our results suggest that, by following our criterion, several others can be found.

Another important result is the characterization of the transition from a not-designable to a designable number of patches (Fig. 3). We start by noting that the freely rotating chain is sensitive to the choice of the alphabet: a minimum alphabet size q = 7 is required to guarantee designability for any number of patches (Fig. 3). To characterise the designable–to–not-designable transition, we have performed the SDF trial for each point in the designability diagram. The last step of the SDF is the calculation of the free energy difference between structures, grouped together according to the distance root mean square displacement (DRMSD) from the target conformation at DRMSD = 0 (see Methods). The accuracy of the refolding varies considerably for each scenario and depends on the alphabet size, the number of patches and the local environment of the monomers. In Fig. 4 we show the FOLDING free energy profiles for some significant points in the designability diagram for q = 3 and q = 20 (Supp. Mat. Fig. 3). At low temperature (T = 0.4) all curves show a global free energy minimum located at different DRMSD, however, they are considered folded only if the global minimum corresponds to the folded structure and not to a disordered molten globule structure. The different nature of the global minimum can be discriminated by increasing T and pushing the system to unfold (Supp. Mat. Fig. 2). What is striking in the figure is the high refolding accuracy of the FJC model, with smooth folding profiles and global minima very close to the target structure. We ascribe such a high accuracy to the stronger constraint experienced by monomers in the FJC compared to the FRC model, in agreement with MFT results^31,32,33. In fact, the patches in the FJC have less rotational freedom, thus the conformational entropy ω is decreased further. Finally, we note that such high accuracy is particularly interesting for single-patch chains, which, according to the designability criterion based on the g(r), is a borderline case (Fig. 2a).

In conclusion, we extend the concept of designability to heteropolymers, bridging the gap between naturally foldable biopolymers and artificial polymers. We demonstrate that there is a minimal set of ingredients that makes it possible to exert a precise control over the conformation of the folded structure. Indeed, we show that starting from a traditional heteropolymer model, we can attain designability by introducing a few directional interactions (patches). The patches make the system designable even with alphabet sizes comparable to the case of block-copolymers (3 and 5), which are synthesizable and manipulable with the current advanced technology^4,6,7. From an experimental standpoint, in fact, small alphabet sizes are more feasible. We demonstrate that the directional interaction peak in the g(R) is a universal fingerprint for designability and can be used as a criterion to engineer new heteropolymers. This criterion will allow artificial heteropolymer to self-assemble into an enormous variety of highly exotic structures with high control over the detailed shape. This will expand the possibilities of current copolymer technology to the full potential of biopolymers, providing a way of going beyond the current protein- and DNA-based materials^22,52,53.

Heteropolymer design could find several applications depending on the scale of the building units. At the micrometre scale, chains could be realised using colloidal particles with applications in the design of materials with new mechanical, electronic or photonic properties tailored in 3D. Colloidal particles are widely available, their interactions can be tuned to be either attractive or repulsive and can range from a few nm to several micrometres⁵⁴. Flexible strings of polymeric colloids were made by inducing electric dipoles to line the particle up before thermal fusion⁵⁵. Such strings were observed to collapse into a random compact configuration under influence of an induced depletion attraction. Control over the sequence can theoretically be achieved in a similar fashion as for protein synthesis where the chains are grown from a solid surface⁵⁶. Each monomer is loaded in the sample alternating washing cycles to remove the previously unbound monomers. The limiting factor of this technique is in the bonding probability that must be extremely high in order for the growth to continue until chain lengths of 20–50 residues. Currently, we are working on strategies to overcome such limitations in close collaboration with experimental groups. At the atomic and nanometre scales, control of the sequence is in reach of current polymer synthesis technology^6,7,12. As we show, artificial chains could be designed to place chemical groups with a precision similar to what protein can achieve^57,58, but also to work in different environments than the ones suitable for proteins or DNA.

Methods

The patchy polymer model we employ has already been proven to be able to refold artificial sequences into unique target structures^48,49. Here we consider two standard heteropolymer models where heterogeneous isotropically-interacting monomers are bonded along the chain via a harmonic potential with two different anchoring geometries: (i) in the freely rotating chain (FRC) model the spring bonds the centers of the monomers; (ii) in the freely jointed chains (FJC) model the spring connects two anchoring points on the monomer surface and opposite to each other (see bottom of Fig. 3 for a sketch). An experimental realisation of the FJC model could be represented by covalently bonded chemical units or surface grafted colloidal particles. While possible experimental examples of patchy particles that could serve as monomers for the FRC model, i.e. where the patches can rotate with respect to the bead, are the lock and key colloids as in ref.⁵⁹, the solid colloids with surface-mobile DNA linkers in ref.⁶⁰, DNA coated emulsion droplets with mobile DNA patches⁶¹, colloidal particles with an induced electric dipole⁵⁵. The isotropic and bonded interactions are complemented by non-specific additional attractions provided by patches arranged on the monomer surface. The resulting directional interaction is anisotropic in nature as it depends not only on the distance between two monomers but also on their relative orientations. The key parameters of the models are the alphabet size of the isotropic interactions, the number of patches (which is the same for all the monomers in a chain) and their the geometrical arrangement. The isotropic interaction energy E_AB(r) between two different sub-units of types A and B is represented as a simple-square-well like shape (Fig. 5)

$${E}_{AB}\,(r)=\{\begin{array}{cc}{\epsilon }_{AB}\,[1-\frac{1}{1.0+{{\rm{e}}}^{2.5({r}_{max}-r)}}] & {\rm{i}}{\rm{f}}\,r > {R}_{bead}\\ {\rm{\infty }},= & {\rm{i}}{\rm{f}}\,r < {R}_{bead}\end{array}$$

(1)

where r is the distance between the centres of the beads and R_bead is the hard core radius, which is the same for each bead. ${\epsilon }_{AB}$ is a different pre-factor for every different pair of monomers. The cut-off distance ${r}_{max}=6{R}_{bead}$ is the distance at which ${E}_{AB}={\epsilon }_{AB}/2$ and was derived with a trial and error approach on coarse-grained proteins in the caterpillar protein model^23,50.

As directional interaction between the patches we employ the potential derived by Irbäck et al.⁵¹, commonly used to model hydrogen bonds. It is represented by a 10–12 Lennard-Jones type potential multiplied by a factor containing the angles between the patches and the bead radius (Fig. 5), so that the energy is minimum if the patches face each other (when they are opposite to each other the radial part of the potential is ~0)

$${E}_{p}=s\,{\epsilon }_{p}{(\cos {\theta }_{1}\cos {\theta }_{2})}^{\nu }\,[5{(\frac{\sigma }{R})}^{12}-6{(\frac{\sigma }{R})}^{10}].$$

(2)

Here R is the distance of the patches as in Fig. 5 (right inset), ${\epsilon }_{p}=3.1\,{k}_{B}T$ and ν = 2⁵¹ while we set σ = R_bead. The scaling factor s is chosen to not over favour the patch contribution over the isotropic one. If its value is too large all sequences form regular structures that depend solely on the symmetries of the patch arrangements on the beads. On the other hand, if it is too small all sequences fail to self-assemble and collapse into random glassy three-dimensional structures. Using a trial and error approach, we found 4 to be good number⁴⁹. The neighbour beads along the chain are bonded via a harmonic spring potential.

In order to find whether the polymer is designable or not, we identify for each different number of patches and alphabet size at least one pattern (sequence) that has a global free energy minimum into a given structure. To increase the chances to find such a pattern, we first perform a SEEK MC simulation, in order to find potentially designable target structures that are not known from nature, unlike for proteins. In the SEEK, we explore at the same time different structures and sequences, and we extract a target structure from the global minimum of the free energy landscape obtained (see Supp. Mat. Fig. 1). The global minimum corresponds to the structure with the highest number of sequences that fold into it, thus the most designable one⁶². Other structures further from the global minimum might be good candidates as well, so the solution is not necessarily unique. On the contrary, if this structure will be not designable, it is highly unlikely and thus unfeasible for practical purposes to find another structure that will be designable.

The target structure is then redesigned in the DESIGN, where we explore only the different sequences maintaining the structure frozen. Here we choose the optimised sequence in the global minimum of the free energy, which in our method corresponds to a low potential energy and a high heterogeneity of the sequence.

Starting from a fully stretched structure we then perform a FOLDING Monte Carlo simulation, in order to study the self-assembling properties of this pattern. Here we explore the conformational space keeping the pattern fixed in the designed sequence.

We project the FOLDING free energy onto an order parameter, namely the root mean square displacement of the inter-particle distance (DRMSD) between the target structure and each sampled structure:

$$DRMSD=\frac{1}{N}\sqrt{\sum _{ij}\,{(|{\rm{\Delta }}{\overrightarrow{r}}_{ij}|-|{\rm{\Delta }}{\overrightarrow{r}}_{ij}^{T}|)}^{2}}$$

(3)

where ${\rm{\Delta }}{\overrightarrow{r}}_{ij}$ is the distance between the sphere i and j while ${\rm{\Delta }}{{\overrightarrow{r}}_{ij}}^{T}$ is the same distance calculated over the target structure, and N is the chain length (50 in our case). Most studies dedicated to proteins adopt another order parameter, the RMSD. This differ from the DRMSD in that its definition contains the positions of the atoms instead of their distances:

$$RMSD=\frac{1}{N}\sqrt{\sum _{ij}\,{(|{\overrightarrow{r}}_{ij}|-|{{\overrightarrow{r}}_{ij}}^{T}|)}^{2}}$$

(4)

The DRMSD has already been shown to be a proper order parameter to study the folding process²³. DRMSD = 0 corresponds uniquely to the target structure. The closer the global minimum is to DRMSD = 0, the smaller is the corresponding ensemble of structures. Thus, if the free energy landscape has a clear global minimum close to DRMSD = 0, we can identify at least one pattern that drives the system to fold into a unique target structure: the polymer is designable. Based on this qualitative definition we already clearly separate two groups: one with global minimum with position DRMSD < 0.6 and one with global minimum in the range $DRMSD\in [0.8,1.4]$.

However, in order to define more precisely a threshold we increase the temperature and push the system to unfold. Upon increasing the temperature the minimum significantly shifts towards higher DRMSD (as in Supp. Mat. Fig. 2), consistently with a temperature induced folding-unfolding transition. In some cases, our temperature resolution was high enough to observe two simultaneous minima. The configurations with DRMSD values corresponding to the position of the second minimum are the molten globule structures. Interestingly, for the same alphabet size the position of the molten globule minimum is conserved for different number of patches (see Supp. Mat. Fig. 2), and for 0 patches is the only global minimum observed at all temperatures (as in Supp. Mat. Fig. 2). Hence, the 0 patch chain was never designable. Thus, we choose as definition for the threshold between folded and not folded for each alphabet size (grey area in Fig. 4) the DRMSD range spanning 1K_BT from the position of the global minimum of the free energy of the case with 0 patches. Systems with global minimum at any temperature below this area are labeled as designable, the others as non-designable.

We observed from the simulations that the position of the minimum corresponding to the molten globule does not change significantly with the number of patches while it varies with the alphabet size. The latter is because the bare heteropolymer without directional interactions, although it never folded, starts to feel the influence of the larger alphabets on its designability. Even when the system does not reach the folded state for the target structure that we identified via the SEEK, and hence the SDF trials fail, it might still be possible to find a handful of structures that are designable. However, since a heteropolymer with few and hard-to-find designable structures is not a good candidate for potential applications, we label it as not-designable.

In all Monte Carlo simulations we enhance the sampling with the Virtual Move Parallel Tempering algorithm⁶³, performing each simulation at 16 different temperatures in the set [3, 2.5, 2.0, 1.6, 1.4, 1.2, 1.0, 0.9, 0.8, 0.75, 0.7, 0.65, 0.6, 0.55, 0.5, 0.4]. The SEEK, DESIGN and FOLDING steps are each composed by 10 independent simulations, run until we observe that the free energy landscape does not vary anymore and that all the 10 independent simulations give the same results, i.e. the free energy surfaces for each of the 10 independent simulations overlap within the statistical error. So each time we categorise a minimum in the FOLDING free energy landscape as folded or non-folded, the minimum is found always at the same position over 10 independent simulations.

For the radial distribution functions of proteins, the normalisation has been performed on the same ideal gas with an average density, to make the g(r) of proteins with different lengths comparable. All the g(r) have been calculated by neglecting the contribution of the beads (or amino acids) directly connected to each other along the chain, in order to ignore their trivial contribution to the first neighbour’s peak.

Data Availability

The datasets published in the current study are available in the website of our research group, http://homepage.univie.ac.at/ivan.coluzza/Home_Page/News/Entries/2017/5/13_Supplementary_data_for_The_role_of_directional_interactions_in_the_designability_of_generalized_heteropolymers.html. While the codes and the files necessary to launch are available in the git-bucket repository, https://bitbucket.org/viennafolding/bionic-proteins.

Change history

12 March 2018
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.

References

Umena, Y., Kawakami, K., Shen, J.-R. & Kamiya, N. Crystal structure of oxygen-evolving photosystem II at a resolution of 1.9 Å. Nature 473, 55–60 (2011).
Article ADS CAS PubMed Google Scholar
Furumi, S. Active lasing from organic colloidal photonic crystals. Journal of Materials Chemistry C 1, 6003–6012 (2013).
Article CAS Google Scholar
Yang, Y. et al. Combinatorial polymer scaffold libraries for screening cell-biomaterial interactions in 3d. Advanced Materials 20, 2037–2043 (2008).
Article ADS CAS Google Scholar
Lodge, T. P. Block copolymers: Past successes and future challenges. Macromolecular Chemistry and Physics 204, 265–273 (2003).
Article CAS Google Scholar
Capone, B., Hansen, J.-P. & Coluzza, I. Competing micellar and cylindrical phases in semi-dilute diblock copolymer solutions. Soft Matter 6, 6075 (2010).
Article ADS CAS Google Scholar
Wang, G., Luo, X., Liu, C. & Huang, J. Synthesis of ABCD 4-Miktoarm star-shaped quarterpolymers by combination of the click chemistry with multiple polymerization mechanism. Journal of Polymer Science Part A: Polymer Chemistry 46, 2154–2166 (2008).
Article ADS CAS Google Scholar
Yang, L., Zhou, H., Shi, G., Wang, Y. & Pan, C.-Y. Synthesis of ABCD 4-miktoarm star polymers by combination of RAFT, ROP, and Click Chemistry. Journal of Polymer Science Part A: Polymer Chemistry 46, 6641–6653 (2008).
Article ADS CAS Google Scholar
Rothemund, P. W. K. Folding DNA to create nanoscale shapes and patterns. Nature 440, 297–302 (2006).
Article ADS CAS PubMed Google Scholar
Conde, J., Oliva, N., Atilano, M., Song, H. S. & Artzi, N. Self-assembled rna-triple-helix hydrogel scaffold for microrna modulation in the tumour microenvironment. Nature materials (2015).
Chino, M. et al. Artificial Diiron Enzymes with a De Novo Designed Four-Helix Bundle Structure. European Journal of Inorganic Chemistry 2015, 3371–3390 (2015).
Article CAS PubMed PubMed Central Google Scholar
Huang, P.-S. et al. High thermodynamic stability of parametrically designed helical bundles. Science 346, 481–485 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hirao, A., Higashihara, T. & Inoue, K. Successive Synthesis of Well-Defined Asymmetric Star-Branched Polymers up to Seven-Arm, Seven-Component ABCDEFG Type by an Iterative Methodology Based on Living Anionic Polymerization. Macromolecules 41, 3579–3587 (2008).
Article ADS CAS Google Scholar
Hamley, I. W. Block copolymers in solution: fundamentals and applications (John Wiley & Sons, 2005).
Leibler, L. Theory of microphase separation in block copolymers. Macromolecules 13, 1602–1617 (1980).
Article ADS CAS Google Scholar
Matsen, M. W. & Schick, M. Stable and unstable phases of a diblock copolymer melt. Physical Review Letters 72, 2660 (1994).
Article ADS CAS PubMed Google Scholar
Fredrickson, G. The equilibrium theory of inhomogeneous polymers (Oxford University Press, USA, 2006).
Martnez-Veracoechea, F. J. & Escobedo, F. A. Bicontinuous phases in diblock copolymer/homopolymer blends: simulation and self-consistent field theory. Macromolecules 42, 1775–1784 (2009).
Article ADS Google Scholar
Eurich, F., Karatchentsev, A., Baschnagel, J., Dieterich, W. & Maass, P. Soft particle model for block copolymers. The Journal of chemical physics 127, 134905 (2007).
Article ADS CAS PubMed Google Scholar
Lodge, T. P., Bang, J., Li, Z., Hillmyer, M. A. & Talmon, Y. Introductory Lecture Strategies for controlling intra-and intermicellar packing in block copolymer solutions: Illustrating the flexibility of the self-assembly toolbox. Faraday discussions 128, 1–12 (2005).
Article ADS CAS PubMed Google Scholar
Shakhnovich, E. I. Proteins with selected sequences fold into unique native conformation. Phys. Rev. Lett. 72, 3907–3910, http://link.aps.org/doi/10.1103/PhysRevLett.72.3907 (1994).
Seno, F., Vendruscolo, M., Maritan, A. & Banavar, J. R. Optimal protein design procedure. Phys. Rev. Lett. 77, 1901–1904, http://link.aps.org/doi/10.1103/PhysRevLett.77.1901 (1996).
Gonen, S., DiMaio, F., Gonen, T. & Baker, D. Design of ordered two-dimensional arrays mediated by noncovalent protein-protein interfaces. Science 348, 1365–1368 (2015).
Article ADS CAS PubMed Google Scholar
Coluzza, I. Transferable coarse-grained potential for de novo protein folding and design. PloS one 9, e112852 (2014).
Article ADS PubMed PubMed Central Google Scholar
Kochendoerfer, G. G. et al. Design and chemical synthesis of a homogeneous polymer-modified erythropoiesis protein. Science 299, 884–887, http://science.sciencemag.org/content/299/5608/884 (2003).
Hawker, C. J. & Wooley, K. L. The convergence of synthetic organic and polymer chemistries. Science 309, 1200–1205, http://science.sciencemag.org/content/309/5738/1200 (2005).
Anderson, J. A., Sknepnek, R. & Travesset, A. Design of polymer nanocomposites in solution by polymer functionalization. Phys. Rev. E 82, 021803, http://link.aps.org/doi/10.1103/PhysRevE.82.021803 (2010).
Hoshino, Y. et al. The rational design of a synthetic polymer nanoparticle that neutralizes a toxic peptide in vivo. Proceedings of the National Academy of Sciences 109, 33–38, http://www.pnas.org/content/109/1/33.abstract (2012).
Lebouille, J. G. J. L., Leermakers, F. A. M., Cohen Stuart, M. A. & Tuinier, R. Design of block-copolymer-based micelles for active and passive targeting. Phys. Rev. E 94, 042503, http://link.aps.org/doi/10.1103/PhysRevE.94.042503 (2016).
Khalatur, P. G., Khokhlov, A. R. & Krotova, M. K. Evolutionary Approach in Copolymer Sequence Design. Macromolecular Symposia 252, 36–46 (2007).
Article CAS Google Scholar
Moreno, A. J. et al. Advantages of orthogonal folding of single polymer chains to soft nanoparticles. Macromolecules 46, 9748–9759 (2013).
Article ADS CAS Google Scholar
Gutin, A. M. & Shakhnovich, E. Ground-state of random copolymers and the discrete Random Energy-model. The Journal of Chemical Physics 98, 8174–8177 (1993).
Article ADS CAS Google Scholar
Shakhnovich, E. I. & Gutin, A. M. Engineering of stable and fast-folding sequences of model proteins. Proceedings of the National Academy of Sciences of the United States of America 90, 7195–9 (1993).
Article ADS CAS PubMed PubMed Central Google Scholar
Pande, V. S., Grosberg, A. Y. & Tanaka, T. Heteropolymer freezing and design: Towards physical models of protein folding. Reviews of Modern Physics 72, 259–314 (2000).
Article ADS CAS Google Scholar
Go, N. & Taketomi, H. Respective roles of short-range and long-range interactions in protein folding. Proceedings Of The National Academy Of Sciences Of The United States Of America 75, 559–563 (1978).
Article ADS CAS PubMed PubMed Central Google Scholar
Bryngelson, J. D. & Wolynes, P. G. Spin glasses and the statistical mechanics of protein folding. Proceedings of the National Academy of Sciences of the United States of America 84, 7524–8 (1987).
Article ADS CAS PubMed PubMed Central Google Scholar
Yue, K. & Dill, K. A. Inverse protein folding problem: designing polymer sequences. Proceedings of the National Academy of Sciences 89, 4163–4167 (1992).
Article ADS CAS Google Scholar
Coluzza, I., Sprik, M. & Ciccotti, G. Constrained reaction coordinate dynamics for systems with constraints. Molecular Physics 101, 2885–2894 (2003).
Article ADS CAS Google Scholar
Coluzza, I. & Frenkel, D. Designing specificity of protein-substrate interactions. Physical Review. E 70, 51917 (2004).
Article ADS Google Scholar
Coluzza, I. & Frenkel, D. Monte Carlo study of substrate-induced folding and refolding of lattice proteins. Biophysical Journal 92, 1150–6 (2007).
Article ADS CAS PubMed Google Scholar
Micheletti, C., Seno, F., Maritan, A. & Banavar, J. R. Design of proteins with hydrophobic and polar amino acids. Proteins: Structure, Function and Genetics 32, 80–87 (1998).
Article CAS Google Scholar
Clementi, C., Maritan, A. & Banavar, J. Folding, Design, and Determination of Interaction Potentials Using Off-Lattice Dynamics of Model Heteropolymers. Physical Review Letters 81, 3287–3290, http://arxiv.org/abs/cond-mat/9802269 (1998).
Maritan, A., Micheletti, C., Trovato, A. & Banavar, J. R. Optimal shapes of compact strings. Nature 406, 287–90 (2000).
Article ADS CAS PubMed Google Scholar
Hoang, T. X., Trovato, A., Seno, F., Banavar, J. R. & Maritan, A. Geometry and symmetry presculpt the free-energy landscape of proteins. Proceedings of the National Academy of Sciences of the United States of America 101, 7960–4 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar
Magee, J., Vasquez, V. & Lue, L. Helical Structures from an Isotropic Homopolymer Model. Physical Review Letters 96, 2078028 (2006).
Article Google Scholar
Banavar, J. R., Cieplak, M., Hoang, T. X. & Maritan, A. First-principles design of nanomachines. Proceedings of the National Academy of Sciences of the United States of America 106, 6900–3 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Glotzer, S. C. & Solomon, M. J. Anisotropy of building blocks and their assembly into complex structures. Nature materials 6, 557–562 (2007).
Article PubMed Google Scholar
Leonhard, K., Prausnitz, J. & Radke, C. Solvent–amino acid interaction energies in 3-d-lattice mc simulations of model proteins. aggregation thermodynamics and kinetics. Physical Chemistry Chemical Physics 5, 5291–5299 (2003).
Article CAS Google Scholar
Coluzza, I., van Oostrum, P., Capone, B., Reimhult, E. & Dellago, C. Sequence Controlled Self-Knotting Colloidal Patchy Polymers. Physical Review Letters 110, 075501 (2013).
Article ADS PubMed Google Scholar
Coluzza, I., van Oostrum, P. D., Capone, B., Reimhult, E. & Dellago, C. Design and folding of colloidal patchy polymers. Soft Matter 9, 938–944 (2013).
Article ADS CAS Google Scholar
Coluzza, I. A coarse-grained approach to protein design: learning from design to understand folding. PloS one 6, e20853 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Irbäck, A., Sjunnesson, F. & Wallin, S. Three-helix-bundle protein in a Ramachandran model. Proceedings of the National Academy of Sciences of the United States of America 97, 13614–8 (2000).
Article ADS PubMed PubMed Central Google Scholar
Seeman, N. C. DNA in a Material World. Nature 421, 427–431 (2003).
Article ADS PubMed Google Scholar
Lee, J. B. et al. A Mechanical Metamaterial Made From a DNA Hydrogel. Nature Nanotechnology 7, 816–820 (2012).
Article ADS CAS PubMed Google Scholar
Bianchi, E., Blaak, R. & Likos, C. N. Patchy colloids: state of the art and perspectives. Physical Chemistry Chemical Physics: PCCP 13, 6397–410 (2011).
Article ADS CAS PubMed Google Scholar
Vutukuri, H. R. et al. Colloidal analogues of charged and uncharged polymer chains with tunable stiffness. Angewandte Chemie (International ed. in English) 51, 11249–53 (2012).
Article CAS Google Scholar
Durek, T. & Becker, C. F. Protein semi-synthesis: new proteins for functional and structural studies. Biomolecular engineering 22, 153–172 (2005).
Article CAS PubMed Google Scholar
Gannavaram, S., Sirin, S., Sherman, W. & Gadda, G. Mechanistic and computational studies of the reductive half-reaction of tyrosine to phenylalanine active site variants of d-arginine dehydrogenase. Biochemistry 53, 6574–6583 (2014).
Article CAS PubMed Google Scholar
Sirin, S. et al. A computational approach to enzyme design: Predicting ω-aminotransferase catalytic activity using docking and mm-gbsa scoring. Journal of chemical information and modeling 54, 2334–2346 (2014).
Article CAS PubMed Google Scholar
Sacanna, S., Irvine, W. T. M., Chaikin, P. M. & Pine, D. J. Lock and key colloids. Nature 464, 575–578 (2010).
Article ADS CAS PubMed Google Scholar
van der Meulen, S. A. & Leunissen, M. E. Solid colloids with surface-mobile dna linkers. Journal of the American Chemical Society 135, 15129–15134 (2013).
Article PubMed Google Scholar
Feng, L., Pontani, L.-L., Dreyfus, R., Chaikin, P. & Brujic, J. Specificity, flexibility and valence of dna bonds guide emulsion architecture. Soft Matter 9, 9816–9823 (2013).
Article ADS CAS Google Scholar
Helling, R. et al. The designability of protein structures. Journal Of Molecular Graphics & Modelling 19, 157–167 (2001).
Article CAS Google Scholar
Coluzza, I. & Frenkel, D. Virtual-move parallel tempering. Chemphyschem: a European journal of chemical physics and physical chemistry 6, 1779–83 (2005).
Article CAS PubMed Google Scholar
Finkelstein, A. V. & Ptitsyn, O. Protein physics: a course of lectures (Academic Press, 2002).
Miyazawa, S. & Jernigan, R. L. Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. Journal of Molecular Biology 256, 623–644 (1996).
Article CAS PubMed Google Scholar
Betancourt, M. R. & Thirumalai, D. Pair potentials for protein folding: choice of reference states and sensitivity of predicted native states to variations in the interaction schemes. Protein Science 8, 361–9 (1999).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Achille Giacometti and Tatjana Škrbić for fruitful discussions. We acknowledge support from the Austrian Science Fund (FWF) project P23846-N16, the Mahlke-Obermann Stiftung and the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no. 609431, VSC Research Center funded by the Austrian Federal Ministry of Science, Research and Economy (bmwfw). The computational results presented have been achieved using the Vienna Scientific Cluster (VSC). V. B. acknowledges support from the Austrian Science Fund (FWF) project: M 2150-N36. L.R. acknowledges support from the Austrian Science Fund (FWF) through the Lise-Meitner Fellowship No. M 1650-N27 and from the European Commission through the Marie SkodowskaCurie Fellowship No. 702298-DELTAS.

Author information

Authors and Affiliations

Faculty of Physics, University of Vienna, Boltzmanngasse 5, A-1090, Vienna, Austria
Chiara Cardelli, Valentino Bianco, Lorenzo Rovigatti, Francesca Nerattini, Luca Tubiana, Christoph Dellago & Ivan Coluzza
Rudolf Peierls Centre for Theoretical Physics, University of Oxford, 1 Keble Road, Oxford, UK
Lorenzo Rovigatti

Authors

Chiara Cardelli
View author publications
You can also search for this author in PubMed Google Scholar
Valentino Bianco
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Rovigatti
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Nerattini
View author publications
You can also search for this author in PubMed Google Scholar
Luca Tubiana
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Dellago
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Coluzza
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.C. designed the research, C.C. performed the simulations, I.C., V.B. and C.C. performed the data analysis. C.C., F.N., V.B., L.R., L.T., I.C., C.D. wrote the manuscript and discussed the research.

Corresponding author

Correspondence to Ivan Coluzza.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A correction to this article is available online at https://doi.org/10.1038/s41598-018-22649-3.

Electronic supplementary material

SUPPLEMENTARY MATERIAL

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cardelli, C., Bianco, V., Rovigatti, L. et al. The role of directional interactions in the designability of generalized heteropolymers. Sci Rep 7, 4986 (2017). https://doi.org/10.1038/s41598-017-04720-7

Download citation

Received: 17 February 2017
Accepted: 18 May 2017
Published: 10 July 2017
DOI: https://doi.org/10.1038/s41598-017-04720-7

This article is cited by

Protein design under competing conditions for the availability of amino acids
- Francesca Nerattini
- Luca Tubiana
- Ivan Coluzza
Scientific Reports (2020)
Demolishing Prejudices to Get to the Foundations: A Criterion of Demarcation for Fundamentality
- Flavio Del Santo
- Chiara Cardelli
Foundations of Science (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.