Visualization of macromolecular structures

O'Donoghue, Seán I; Goodsell, David S; Frangakis, Achilleas S; Jossinet, Fabrice; Laskowski, Roman A; Nilges, Michael; Saibil, Helen R; Schafferhans, Andrea; Wade, Rebecca C; Westhof, Eric; Olson, Arthur J

doi:10.1038/nmeth.1427

Download PDF

Review Article
Published: 01 March 2010

Visualization of macromolecular structures

Seán I O'Donoghue¹,
David S Goodsell²,
Achilleas S Frangakis³,
Fabrice Jossinet⁴,
Roman A Laskowski⁵,
Michael Nilges⁶,
Helen R Saibil⁷,
Andrea Schafferhans¹,
Rebecca C Wade⁸,
Eric Westhof⁴ &
…
Arthur J Olson²

Nature Methods volume 7, pages S42–S55 (2010)Cite this article

9217 Accesses
114 Citations
14 Altmetric
Metrics details

Subjects

Software

Abstract

Structural biology is rapidly accumulating a wealth of detailed information about protein function, binding sites, RNA, large assemblies and molecular motions. These data are increasingly of interest to a broader community of life scientists, not just structural experts. Visualization is a primary means for accessing and using these data, yet visualization is also a stumbling block that prevents many life scientists from benefiting from three-dimensional structural data. In this review, we focus on key biological questions where visualizing three-dimensional structures can provide insight and describe available methods and tools.

Visualizing RNA conformational and architectural heterogeneity in solution

Article Open access 09 February 2023

Jienyu Ding, Yun-Tzai Lee, … Yun-Xing Wang

HyperBeta: characterizing the structural dynamics of proteins and self-assembling peptides

Article Open access 08 April 2021

Marco S. Nobile, Federico Fontana, … Fabrizio Gelain

Integrative structural modeling of macromolecular complexes using Assembline

Article 29 November 2021

Vasileios Rantos, Kai Karius & Jan Kosinski

Main

Decades ago, when structural biology was still in its infancy, structures were rare and structural biologists often dedicated years of their life to studying just one structure at atomic detail. The first tools used for visualizing macromolecular structures were tools for specialists.

Today's situation is very different: the rate at which structures are solved has greatly increased, with over 60,000 high-resolution protein structures now available in the consolidated Worldwide Protein Data Bank (wwPDB)¹. These data provide a wealth of detailed information that can yield significant insight into macromolecular function. To use this information most effectively, visualization tools were developed and are increasingly becoming everyday tools for biologists. For example, many biochemists regularly view protein structures to gain insight into protein function (Fig. 1). Chemists look at ligand-binding sites as part of drug design. Molecular biologists view RNA structures and complexes with proteins to gain insight into RNA signal and message processing. Some aspects of structure visualization remain mostly the domain of the specialist, such as molecular motion and large-scale molecular assemblies. Even in these intrinsically more complex fields, however, resources are beginning to enable bench biologists to visualize and use this information.

**Figure 1: Visualizing a tyrosine kinase structure (PDB 1QCF)⁹⁷.**

However, although structural information is now viewed and used by a large and diverse group of scientists, most of them are not prepared to spend months learning complex user interfaces or scripting languages. Even today, complex user interfaces in visualization tools are often a stumbling block, preventing many scientists from benefiting from structural data. Even structural experts have come to expect ease of use from molecular graphics tools, in addition to improved speed, features and capabilities.

In the past, molecular graphics tools were invariably stand-alone, designed to view one molecular system at once. Today's tools are increasingly internet aware, often integrated tightly with structure databases (Table 1), as well as with databases containing sequences and other features (for example, domains, single-nucleotide polymorphisms (SNPs), interactions).

Table 1 Selected resources for finding and visualizing macromolecules

Full size table

Today, we are spoiled for choice when it comes to molecular graphics tools for viewing proteins and other macromolecular structures. Indeed, the sheer range of available tools can be overwhelming. Many molecular graphics tools have been developed to address diverse requirements, as documented in recent reviews^2,3,4 and in several web resources maintaining lists of such tools (see footnote to Table 1). Most of these tools have a large set of features in common, including standard representations (ribbon, space-filling, ball-and-stick and so on) and coloring schemes (element-based coloring of atoms, coloring by secondary structure and so on). It is beyond the scope of this review to comprehensively compare all of these tools; instead, we focus on key biological questions for which visualizing structures can provide insight, and we highlight practical methods and tools with outstanding features that are particularly suited to addressing these questions.

Protein structures

Finding three-dimensional structures. For a biochemist looking to use three-dimensional structures to gain insight into the functions of a particular protein, the typical first step is a search for relevant structures. This task is considerably simplified by the remarkable degree to which all experimentally determined protein three-dimensional structures are consolidated into a single data repository, the Worldwide Protein Data Bank (wwPDB)¹. Three primary distribution sites (RSCB PDB⁵, PDB Europe and PDB Japan; Table 1) provide access to the same underlying data bank, each with a wide range of integrated visualization and analysis tools. In addition, the PDB is mirrored at many other sites, some of which provide innovative visualization tools tailored to make specific questions easier to answer (Table 1). Most of these sites offer, embedded directly in their web-pages, one or more molecular graphics tools (for example, Jmol, PyMol, KiNG and Mage⁶). Increasingly, the process of finding and visualizing structures is becoming one seamless step for most users.

Finding structures from sequence. Several websites (for example, RCSB PDB⁵) allow the user to find structures using a sequence identifier or BLAST search (Table 1). Entrez Structure and SRS 3D⁷ allow the sequence to be aligned to any related three-dimensional structure (Fig. 1f). So far, experimental three-dimensional structures have been determined for less than 1% of all known proteins (based on direct links from PDB to protein sequences in UniProt⁸). However, for around 42–48% of all proteins, at least part of their sequence is considered significantly similar to a PDB entry, so that some structural information can be inferred^9,10. Several websites (for example, Swiss-Model¹¹) provide comparative models for such cases^12,13. Each service uses slightly varying cut-off criteria for defining 'significant sequence similarity' (for example, in some cases depending on the length of aligned regions), but generally >40% sequence identity to a PDB structure is considered sufficiently good to create a high-quality comparative model structure¹⁰. These comparative models can be accessed at a single consolidated website, the Protein Model Portal (PMP)¹⁰. The original PDB templates also include information on experimental conditions, ligands and cofactors, which can be relevant in deciding to use or discard a comparative model.

For sequences where no template PDB structure can be found by the above resources, it may be possible to calculate a structure using so-called ab initio methods¹⁴. However, in spite of progress¹⁵, ab initio methods still require much improvement¹⁴ and we recommend they be used with caution.

Getting a first impression. To gain an initial overview of a protein structure, it is often useful to choose a representation that hides side chain atoms; ribbon-like representations do that well and also convey information about secondary structure (Fig. 1a–d). Ligand molecules are best displayed in space-filling or ball-and-stick atom representations. Many of the websites in Table 1 provide such a view (for example, FirstGlance, among others), some by default. Typically, each protein chain is colored differently, thus giving a quick insight into the number of molecules present in the PDB entry. To highlight overall shape and form, nonphotorealistic rendering can be very effective (Fig. 1e), especially with images for presentation and publication.

Some molecular graphics tools (for example, Chimera¹⁶, Cn3D¹⁷, OpenAstexViewer¹⁸, SRS 3D⁷, STRAP¹⁹ and Swiss-PdbViewer²⁰) offer an integrated view of both the amino acid sequence and the three-dimensional structure, and further enable interaction between these two views (Fig. 1j). For example, clicking on a residue in the sequence view causes the corresponding residue to be highlighted and selected in the three-dimensional view, and vice versa. This feature can significantly help a scientist in understanding and using three-dimensional structures. For example, by viewing the location of key residues or sequence motifs, a scientist can assess whether they are likely to be accessible for posttranslational modification, such as phosphorylation²¹. Some viewers (for example, STRAP¹⁹) go one step further, showing structure integrated with a multiple sequence alignment viewer—a feature we anticipate will continue to become available for other viewers²².

For publication and presentations, some viewers can create impressive, ray-traced images (for example, Amira, Chimera¹⁶ ICM-Browser, Molscript²³ plus Raster3D²⁴, PMV²⁵, PyMOL, VMD²⁶).

The majority of PDB structures are derived from X-ray crystallography (Box 1, Fig. 2), about 13% from NMR spectroscopy (Box 2, Fig. 3) and less than 1% from electron microscopy (Box 3). These three experimental methods often require specific considerations and visualization methods (discussed in each display box).

**Figure 2: Caution for beginners: symmetry in crystal structures.**

**Figure 3: Visualization of an NMR ensemble for SH3 (ref. 108).**

Viewing sequence features on three-dimensional structures. A very straightforward way to use three-dimensional structures to gain insight into function is by coloring based on features such as domains, SNPs, exon boundaries, secondary structure and so forth. (Fig. 1a–d,f). The ability to easily see where sequence features are located in the three-dimensional structure can be of substantial practical value to bench biochemists and molecular biologists. For example, the spatial location of residues within the structure and the proximity to solvent can help in designing primers and mutation experiments. The ability to show such views for a wide range of features is a particular strength of SRS 3D⁷ and SPICE²⁷ and is also facilitated by JenaLib²⁸, PDBsum²⁹ and Entrez Structure. Viewers such as STRAP³⁰ that provide easy access to multiple sequence alignment information mapped onto three-dimensional structures can help locate key conserved residues. ProSAT2 (ref. 31) can display SNPs and also predict their effects, allowing a scientist to gauge the potential impact of a SNP on the protein structure.

Protein-protein binding sites. Typically, as part of its biological role, a protein will bind to several other proteins through comparatively large but flat binding surfaces. In fact, a large percentage of PDB entries contain not just a single protein chain but several. In some cases, this means identical subunits assembled together; in other cases, it means a complex of several different protein chains. The arrangement of subunits, and of the interface residues that form the subunit-subunit contacts, is often of biological significance. Several websites specialize in finding and visualizing subunit-subunit interface residues³². In PDBsum²⁹ the interacting residues, and the types of their interaction across the interface, are shown schematically. MolSurfer (Table 1) provides a range of methods that help users explore macromolecular interfaces.

For symmetric assemblies (dimers, trimers and so on), the PDB entry of an X-ray crystal structure will often have explicit three-dimensional coordinates for only one monomer. To construct the coordinates for all subunits in the biologically relevant assembly, we recommend PISA³³ (see Box 1, Fig. 2).

Comparing related structures. It is often informative to visualize two related structures superimposed—for example, two states of the same molecule, or two proteins with homologous sequences, or two structural homologs found by structural comparison tools³⁴. Many molecular graphics tools offer automatic superposition as a standard feature (for example, MOLMOL³⁵, MOE, PyMOL or VMD²⁶). These tools allow the researcher to specify a portion of the molecule to be superimposed. The results are highly dependent on the regions chosen for the superposition. Typically, the researcher identifies a more-or-less rigid core of the molecule and superimposes this region using a subset of the atoms (typically the α-carbons or the backbone atoms). But many other combinations are possible for addressing specific questions (Figs. 1i and 4d–f). For difficult cases—for example, low sequence similarity or large regions that cannot be aligned in sequence—it is best to use more robust, dedicated superimposition tools (for example, STAMP³⁶, STRAP¹⁹ or THESEUS³⁷).

**Figure 4: Visualizing ligand-binding sites.**

Molecular surfaces and electrostatic potentials. Many tools can generate molecular surfaces, most commonly the so-called Connolly surface³⁸, which is derived by rolling a sphere the radius of a water molecule around the atomic van der Waals surface of the molecule. This surface, also known as the solvent-excluded surface, can be used as a canvas to map a wide variety of properties such as residue conservation scores, hydrophobicity (Fig. 1h), depth-cue information (Fig. 1e), mean-force potentials³⁹ and electrostatics (Fig. 1g). Such colored surfaces (sometimes called texture mappings) can give insight into molecular interactions and conformational changes, for example, by highlighting surface regions with complementary shape and charge. The molecular surface can also be used to estimate the energetics of molecular interactions, including the entropic cost of desolvation, by calculating the area buried from solvent upon binding of other molecules⁴⁰.

Although many program can generate a surface, the program MSMS⁴¹ is widely used as it provides a good estimate of molecular surface area and volume, and the most relevant molecular geometry when analyzing molecular interactions and interfaces.

Box 1: X-ray crystal structures

About 86% of PDB entries are derived from X-ray crystallography. Each X-ray structure has a resolution value, that is, a measure of the crystal order. The expected error in the three-dimensional atomic coordinates is correlated to, but much smaller than, the resolution. An average resolution is about 2.5 Å, and 1.2 Å is very high quality¹¹². At low resolution (≥4 Å), there is a significant chance of errors in the structure. Each atom also has a B-factor, a parameter correlated with molecular motion, and many molecular graphics tools can use this parameter to color the molecule. However, the correlation with motion is only partial, as other effects contribute to the B-factor¹¹³. Regions with very high B-factor (>>80) should generally be treated as of unknown structure. The goodness of fit of the structure to the X-ray crystal data is indicated by the R-value and free R-value¹¹⁴, where values of around 20% are considered to indicate a good structure. The difference between the R-value and the free R-value should be low. The goodness of fit can be examined at the Electron Density Server¹¹⁵. It can be useful to also check further independent measures of structure accuracy, and many are available directly on the web page for each structure at the RCSB PDB site. The PDBe site also has several useful services for assessing accuracy, particularly PDBsum²⁹. A very useful, and independent, accuracy estimate is also available from ProSA-Web¹¹¹, which can show residue-specific quality scores mapped on the three-dimensional structure (Table 1). When viewing crystal structures, it is important to be aware that PDB entries only give explicit coordinates of the 'asymmetric subunit' of the crystal (that is, the smallest portion of a crystal needed to produce the unit cell of the crystal) which often is only part of the full biologically relevant assembly (Fig. 2).

A primary limitation of X-ray crystallography is the need to crystallize the macromolecule; this is especially difficult for membrane proteins, for proteins with natively disordered regions and for transient complexes. To aid crystallization, macromolecules are often tampered with, for example, by mutating surface residues and truncating segments of sequence; in addition, modifications present in vivo are often missing in X-ray crystal structures. To help with structure calculation, macromolecules are often crystallized with seleno-methionine, heavy metal ions, or other impurities (sometimes even other proteins) that are not present in vivo and that may sometimes distort the structure.

X-ray crystal structures are calculated by fitting an atomic model of the molecule into an electron-density map (EDM) or isosurface and can be visualized by tools such as COOT¹¹⁶, FRODO¹¹⁷ and O¹¹⁸. Highly mobile parts of the molecule often are missing in the EDM and are usually removed from the molecular model, resulting in missing atoms or residues. Current methods for structure determination are dominated by the concept of a rigid structure, although improved methods have very recently been proposed¹¹⁹ that account for molecular motion and produce structural ensembles similar to NMR structures (Box 2).

Box 2: NMR structures

About 13% of PDB entries are derived from NMR spectroscopy. NMR structures are usually deposited in the PDB as an ensemble of 10–50 structures (Fig. 3), providing a visual representation of precision. The ensemble precision derives not from the dynamics of the molecule in solution but from the lack of data to describe the structure fully. The method used for structure calculation also affects the ensemble precision, and recently a significantly improved method has been developed that ensures that the ensemble precision more truly reflects the data¹²⁰. Often, a single 'minimized average' structure, or a representative structure, is also provided in the PDB. The ensemble precision is often measured as the root-mean-square (r.m.s.) deviation to the average structure, where a value of about 2.0 Å is typical¹²¹; more precise ensembles with lower r.m.s. deviation may indicate an overfitting of data, rather than high quality. An ensemble r.m.s. deviation of ≥5 Å generally indicates a low quality structure, although values this low may occur, for example, in a structure comprising two well ordered domains connected by a flexible linker region. There is still no standardized measure for assessing the goodness of fit of structures to NMR data, although it is common to report the r.m.s. deviation of violations of distance and other constraints. As with X-ray structures, it can be useful to check independent measures of accuracy (for example, available at RCSB PDB, PDBe and via CAME/ProSA-Web¹¹¹).

Compared with X-ray crystallography, NMR has the advantage that it is not necessary to crystallize a molecule; instead, NMR usually studies biomolecules in solution, hence arguably in a more natural state. NMR structures also lack the heavy metal contaminants of many X-ray structures. NMR can also dynamically track specific reactions in living cells¹²² and can more easily study weak associations as well as disordered protein states¹²³. However, NMR has the disadvantage that it imposes an upper limit on the size of the molecule studied: the largest molecule solved by NMR so far is 82 kDa (ref. 124), and most NMR structures are 25–30 kDa or less. Another disadvantage is that, for the same molecular system, NMR usually produces less precise structures than does X-ray crystallography. As with X-ray crystallography, studying structures by NMR often requires tampering with the target macromolecule. Frequently, truncated segments, often single domains, are studied by NMR, rather than full-length proteins. In addition, mutations of surface residues are often introduced to avoid aggregation. Finally, post-translational modifications that are present in vivo are often missing in NMR structures. NMR structures are calculated automatically from constraints derived from the NMR spectra, primarily constraints on interatomic distances¹²⁵. However, manual checks are sometimes needed during structure determination, and several molecular graphics tools (for example, MOLMOL³⁵) offer the possibility of easily displaying NMR data, such as distance constraints, directly on the structure. In contrast to X-ray crystal structure calculation, it is customary to retain residues with very little or no data, with the result that some regions of the molecule can be very divergent in the ensemble. The display of ensembles requires prior superposition of the structures. Some graphics packages offer automatic superposition (VMD²⁶, MOLMOL³⁵), but for more difficult cases, with highly divergent regions, dedicated programs should be used; for example, THESEUS³⁵.

Box 3: Macromolecular structures from electron microscopy

The Electron Microscopy Database (EMDB, http://emdatabank.org/) now has nearly 700 entries, mostly three-dimensional electron density maps (EDMs) of macromolecular and cellular structures. For about 250 of these entries, an atomic model has been calculated, usually by fitting an X-ray crystal structure (for example, Fig. 5b): these atomic-detail models may be deposited in linked entries in the PDB, where they account for about 0.4% of PDB entries.

A significant advantage of electron microscopy structure determination^126,127,128 is that it can be used to study a wide range of sample types, from large, ordered assemblies such as helical arrays to isolated complexes (single particles) and irregular objects such as cells or subcellular components. The upper size limit is mainly the sample thickness (up to a few hundred nanometers). Compared with X-ray and NMR structures, electron microscopy is almost always at lower resolution. Three-dimensional maps are viewed by choosing an appropriate density threshold value, normally one that gives a surface enclosing the correct molecular volume and displaying the isosurface (for example, with Chimera¹⁶). EMDB provides two map viewers, of which we recommend OpenAstexViewer¹⁸ because it displays surfaces well and lets the user change threshold levels.

Transmission electron microscopy images are projections, and three-dimensional structure determination involves the collection and merging of different projections (views) of the object. The main task is usually the determination of the relative positions and orientations for the set of views. Atomic structures of components in larger assemblies that have been determined by crystallography can be docked into the electron microscopy map. Often there are conformational changes between a structure in a crystal lattice and in solution. Flexible fitting makes it possible to account for changes, such as hinge rotations, in fitting. In the most favorable cases, the resolution of macromolecular electron microscopy structures can reach about 3 Å, although for cell or tissue sections, radiation damage limits the resolution to >>30–40 Å. At this high resolution end, the conformation of protein and nucleic acid backbones and bulky side chains can be determined directly from the electron microscopy density¹²⁹ with the same tools as in X-ray crystallography.

However, there are uncertainties in determining the resolution and in validation for noncrystalline samples. When atomic-detail structures are known that correspond to any part of the electron microscopy structure, docking of the atomic coordinates into the map provides an independent test of reliability. Two density maps can be compared by maximizing their cross-correlation (for example, using Chimera), and the comparison can be visualized using semitransparent and solid or wire mesh surface displays that can be overlaid. For tomographic reconstruction and for three-dimensional reconstruction of sections, IMOD¹⁰⁹ is commonly used.

Ligand binding sites

Interactions between macromolecules and small molecules often occur in buried active sites; these may be catalytic active sites, allosteric sites, or sites that may either disrupt or stabilize protein-protein interactions. The PDB at present contains over 37,000 binding sites involving about 10,000 different types of ligand molecules. A range of methods are available to characterize and visualize these sites, depending on the questions asked by the end user.

Annotation and highlighting. For gaining an initial insight into the atomic interactions in the binding site, a useful representation is to display ligands using a ball-and-stick representation and to display only backbone atoms of the protein or nucleic acid, except for those residues in direct contact with ligands (Fig. 4a). Many molecular graphics tools have been developed to support working with small molecules (for example, DS Visualizer, MOE, PMV²⁵, PyMOL, STRAP¹⁹, Swiss-PdbViewer²⁰, SYBYL, VMD²⁶, WHAT IF⁴², Yasara; Table 1) Almost any can implement such views, and those with scripting capabilities can often be programmed to recreate this view on demand.

In addition, many PDB entries or related files (for example, UniProt) have annotations indicating which residues form the binding site. It can be instructive to display these annotations directly on three-dimensional structures, and many molecular graphics tools enable such displays (for example, JenaLib²⁸, PDBsum²⁹, ProSAT2 (ref. 31), SRS 3D⁷ and Ligand Explorer in the RCSB PDB).

Surface-based approaches. Structural details of binding sites are widely used in rational drug design, usually to generate ideas for classes of compounds for screening⁴³. A common question is to ask what kinds of small molecules may bind to a given binding site. Many molecular graphics viewers allow the surface to be colored by local properties, such as hydrogen bonding ability, hydrophobicity or electrostatics, to allow exploration of chemical complementarily (Fig. 1g,h). The local curvature of the surface may also be used to evaluate steric complementarily.

Volume-based approaches. An alternative approach is to analyze the space around the target molecule, highlighting regions that may form strong interactions with small molecules. Some tools (for example, AutoLigand⁴⁴) allow probe atoms, such as carbon atoms or oxygen atoms, to be scanned through the entire space and the interaction energies of the probes with the molecule to be evaluated. The resultant three-dimensional data sets are then rendered to show the areas of most favorable interaction⁴⁵. More recently, atomic probes have been used to create maps of the atomic affinity. These may be rendered using isocontours, text-mapped clipping planes or volume rendering (Fig. 4b,c). Many researchers are now analyzing these volume data sets to identify and visualize ligand-sized regions of maximal affinity⁴⁴.

Sequence-profile approaches. Another approach to identify ligand binding sites uses multiple sequence alignments mapped onto three-dimensional structures⁴⁶. This approach is based on the observation that binding site residues tend to be more conserved than other positions, so it can be particularly useful when little is known about a protein. Even for well studied proteins, however, these methods sometimes find binding sites not previously noticed. Some examples of such services are TraceSuite⁴⁷, ETV⁴⁸ and others^48,49,50.

Multiple ligands. A three-dimensional structure gives a snapshot of a single state; however, in some cases, several different structures of the same protein exist with different ligands. We can use this information to help explore the range of conformations available to the system. For example, such comparisons can highlight interactions common to all known binding partners, which may help to guide the search for further possible binding partners^51,52,53. For such comparisons, it can be useful to try different sets of atoms for superposition—for example, the ligand alone, or all atoms involved in the binding site. Each of these superimpositions can highlight different aspects of the conformational differences.

Often, it is of interest to compare structures with multiple ligands obtained by means of docking tools (for example, FlexX⁵⁴, AutoDock⁵⁵). To preselect promising compounds, computational chemists can scan large libraries of drug-like molecules and dock 'hits' into the binding site of the protein target⁵⁶. Subsequently, the docked structures can be inspected visually to find ways of enhancing the predicted strength of binding⁵⁷. Some docking tools now provide graphical interfaces (for example, FlexV and AutoDockTools) for the preparation of the input structures and the analysis of the results. These tools allow the comparison of interaction geometries of different ligands with the same protein.

Two useful resources for comparing multiple ligand structures are Relibase^58,59 and Superligands⁶⁰, which both contain information about all ligands in the PDB and take special care to ensure the assignment of chemically correct atom and bond types. Both resources allow searching by identifiers as well as chemical substructure searches and similarity searches; Relibase also offers keyword searches and sequence similarity searches. The structures can be displayed in two or in three dimensions in embedded viewers. When exploring a specific protein, it is especially useful to search for similar complexes; Relibase lists similar proteins with their respective ligands, which can subsequently be superimposed and displayed in the embedded OpenAstexViewer¹⁸ (Fig. 4d and Supplementary Fig. 1). The extended functionalities of Relibase+ (which requires a paid license) give an analysis of the differences in the superimposed structures (protein movements and ligand overlap).

PDBsum can also help visualize multiple ligands binding to the same protein by superimposing the protein's different structural models in the PDB and identifying any 'ligand clusters'; that is, sites where the ligands from the different structures overlap (Fig. 4e).

Multiple proteins and ligands. Finding features that are specific to a given target adds another level of complexity when studying protein-ligand interactions. To identify features determining selectivity, it is useful to compare the target binding site with binding sites of similar proteins. The “similar binding site” as well as the “similar ligand” search of Relibase can help to identify and compare similar protein complexes. Here, again, the Relibase+ comparison table is especially useful for detecting differences in the protein binding sites—mutations, insertions and residue movements. MOE provides a similar facility to help compare multiple proteins bound to multiple ligands (Fig. 4f).

Structural visualization can be useful for predicting side effects and 'off-label' uses of known drugs by comparing the target binding site to other known protein structures^61,62. Some graphic tools support this: for example, Relibase+ offers a search for “similar cavities,” where the protein comparison is based on physico-chemical properties rather than residues, hence finding remote similarities not evident from sequence similarity.

Structural visualization can also help in developing more selective drugs. Although promising, such approaches remain speculative, and their success will be fundamentally limited, as the PDB contains only a small fraction of all binding site geometries. A complementary approach is to use the much larger set of known protein-drug interactions where no three-dimensional structure is available. For example, STITCH⁶³ can be used to show a network featuring all proteins known to interact with a given drug, based on a wide range of experimental databases, including the PDB (Fig. 4i). In the future, we anticipate that such approaches will be improved, and that PDB data will be increasingly incorporated into network visualization methods⁶⁴.

Schematic illustrations. For presentations and printouts, it can be useful to highlight key interactions in the binding site using simplified schematic illustrations produced by tools such as LIGPLOT⁶⁵, PoseView⁶⁶ and Ligand:Protein Interaction Diagrams⁶⁷ (part of MOE). These illustrations show the ligand and interacting protein side chains 'flattened' in a plane, and indicating relevant hydrogen bonds, covalent bonds, unbonded contacts and water-mediated hydrogen bonds (Fig. 4g,h). For comparing different complexes, LIGPLOT⁶⁵ and MOE allow the user to generate a series of plots for related proteins binding the same or different ligands. Equivalent components of each plot are plotted in the same relative location, thus highlighting residues and interactions present in some of the structures but missing in others.

RNA structures

Over 4,000 nucleic acid three-dimensional structures are on deposit in the Nucleic Acid Databank (NDB⁶⁸), mostly RNA structures, either determined experimentally or by ab initio prediction. NDB is also synchronized with the PDB¹, and RNA structures account at present for nearly 8% of PDB entries. Many standard aspects of visualizing three-dimensional structures of RNA can be performed completely adequately by molecular graphics tools designed for proteins, such as PyMOL and Swiss-PdbViewer²⁰ (Table 1).

Knowing the secondary structure of an RNA molecule often gives significant insight into its function, much more so than for protein secondary structure. RNA secondary structure can be derived either from multiple sequence alignments or from thermodynamic predictions, although the process requires specialized features and capabilities not available in most tools for visualizing protein alignments or structures. Multiple sequence alignment is particularly important in RNA research; alignments can be used to find covariations between nucleotide positions, which are then taken as evidence for a contact between the two nucleotide positions, and these contacts in turn define secondary structure (Fig. 5).

**Figure 5: Visualization of RNA structure in one, two and three dimensions.**

Because of these special-purpose requirements, the RNA community has developed their own specialized visualization tools (Supplementary Table 1) for viewing RNA secondary structure. Some of these RNA tools (for example, S2S Assemble⁶⁹) provide an integrated environment for interactively visualizing multiple sequence alignments, intramolecular contacts and RNA three-dimensional structures (Fig. 5). The most useful tools provide the option to manually edit the two-dimensional contacts, allowing not only reorientations of elements but also deletion and addition of nucleotides or a whole element, such as a helix.

At present, two of the main challenges in RNA visualization are as follows: first, RNA often adopts multiple structures depending on experimental conditions, and none of the available tools can deal with this properly. Second, RNA in vivo usually occurs in complex with proteins, however the RNA-specific tools cannot yet manage such complexes. RNA researchers can use standard molecular graphics tools to view such complexes, but of course this means losing RNA-specific features and capabilities.

Molecular motion

Biomacromolecules are dynamic entities, and motion is usually essential to function⁷⁰. Visualizing dynamic molecular processes is often key toward understanding these processes. Recently, several visualization tools have become available that allow quick and easy exploration of dynamic transitions between two known states of a molecule. For example, the Yale Morph Server⁷¹ (http://molmovdb.org/) provides morphed animations of potential plausible pathways between two structures; Moviemaker⁷² (http://tinyurl.com/moviemaker-v1/) is a web server that permits the user to generate simple animations of a variety of types of protein motion. These tools provide very approximate, often simply schematic, descriptions of the molecular motions.

To explore large-amplitude, low-frequency motions, such as protein domain flexing, methods based on normal mode analysis and elastic network models provide a computationally efficient approach⁷³. There are now several websites, for example, NOMAD-ref⁷⁴ and ANM⁷⁵, where even a novice user can enter a PDB file, compute normal modes, and visualize and analyze the results.

At a slightly higher level of complexity, several programs allow users to generate conformational ensembles and trajectories using constraint-based methods. Such programs include tCONCOORD⁷⁶ and FIRST/FRODA⁷⁷. One application of these methods is to identify segmental flexibility in proteins. The researcher identifies rigid domains in the protein connected by flexible tethers, then defines the geometry of the hinge or shear motions that occur as the proteins change conformation⁷⁸. The Database of Macromolecular Movements⁷¹ provides a service for analyzing hinge motion in proteins. Other websites enable molecular motions to be analyzed by means of hierarchical, multiresolution flexibility trees⁷⁹.

More realistic and detailed studies of motion require molecular dynamics simulations, which typically simulate 10–100 ns of motion in ∼1-fs time-steps. Unfortunately, such calculations are generally too CPU-intensive to be provided as a free service; hence, users usually need to calculate their own trajectories. For a first look at molecular dynamics simulations, DSMM⁸⁰ (http://tinyurl.com/dsmm-eml/) is a site that collects movies showing molecular dynamics simulations. Generally, molecular dynamics simulations are recorded as trajectory files that can be played back in a range of molecular graphics tools that support molecular dynamics (Table 1). There is as yet no unified resource to deposit or access trajectory files, although there are several initiatives in this direction—for example, the MoDEL Molecular Dynamics Extended Library (http://mmb.pcb.ub.es/model/). A related project, called Dynameomics⁸¹ (http://www.dynameomics.org/), provides online interactive views of simulations of 30 proteins and plans to extend this to all known protein folds. Such services are still very new, and we can expect significant advances in the next few years.

Of the molecular graphics tools with molecular dynamics support, VMD²⁶ is probably the most widely used. It can display 'movies', analyze properties such as atomic fluctuations and allows flexible integration with other computational tools and with the user's own scripts. Although VMD is popular, many other molecular graphics tools support molecular dynamics trajectories, and each tool often has unique features that may be useful for particular projects (Table 1).

In general, visualization of molecular dynamics trajectories remains challenging owing to intrinsic complexity, such as the large number of atoms involved and the many orders of magnitude in time relevant for biological processes. The most straightforward visualization is to superimpose several molecular dynamics snapshots (Fig. 6a). While often useful, this method has obvious limits. Overall motion can be viewed using 'sausage-like' representations (Fig. 3b); however, often dimension-reduction methods are needed⁵⁸. An increasing number of such methods are being developed for visualization of specialized cases—for example, transient cavities (Fig. 6b) and molecular diffusion (Fig. 6c–e).

**Figure 6: Visualizations of molecular motion.**

Large macromolecular assemblies

X-ray crystallography is being used to solve the structures of larger and more complex systems, and there is now considerable overlap in the size range of structures from X-ray crystallography and from electron microscopy (Box 3). It is common to see electron microscopy isosurfaces into which atomic-detail X-ray structures have been fitted. Meanwhile, electron microscopy continues to produce higher-resolution density maps of large assemblies and of single particles, such as viruses or other isolated complexes (Box 3), in addition to tomograms of higher-order, unique structures such as cell sections or isolated organelles⁸².

These data on large-scale assemblies that integrate data from X-ray crystallography (Box 1), NMR spectroscopy (Box 2), electron microscopy (Box 3) and even light microscopy^82,83,84 pose many new challenges for visualization. Many of these data are not at atomic detail, so other representations must be used. In addition, the systems can be very large, and there are often issues with computational and graphics performance. There is a need for high-performance, interactive visualization of such large assemblies, and across very different distance scales, although some tools, such as Amira (Visage Imaging) and PMV²⁵, were designed with such challenges in mind.

At present, researchers typically use a hierarchical approach to visualizing large macromolecular assemblies. For portions for which atomic information is available, atomic representations may be used, and then abstracted to simpler, surface-based representations. These surfaces may then be integrated with density sections or volumes from the lower-resolution methods (for example, electron microscopy tomography). This approach scales nicely from the level of atoms to the level of cells, allowing the use of simpler, more abstracted representations of the individual components as one moves to large systems, such as intracellular components (Fig. 7a) or even whole-cell visualization (Fig. 7b), and to multiscale movies⁸⁵.

**Figure 7: Two examples of multiscale, hierarchical visualization.**

Visualization hardware

Most of this review has focused exclusively on software developments, tacitly assuming that computer and display hardware are adequate for all visualization tasks we require. In the early days of molecular graphics tools, hardware limitations were a key issue; display systems were often very expensive, and they relied on nonstandard hardware. Significant effort in software development was directed toward ameliorating hardware limitations. Today, although most molecular graphics tools run comfortably on standard desktop computers, many hardware issues remain, particularly for the more complex visualization tasks, such as the study of molecular motion and of large assemblies.

Stereo capabilities can greatly enhance molecular graphics and, although available for many years on expensive and specialized systems, stereo is only just now becoming available for desktop LCD screens.

For particularly large assemblies, computational speed is often still an issue—here, it is important to use a top-of-the-line graphics card, and also to use molecular graphics tools that can take advantage of hardware acceleration. Fortunately, most tools can, the principal exceptions being RasMol and Chime.

Immersive virtual reality. Visualizing large, complex and multi-scale macromolecular assemblies, especially combined with molecular motion, is not only challenging computationally, but ultimately may require display systems significantly better than current computer monitors can provide. Immersive virtual reality is very promising, enabling the user to virtually enter a microscopic world, flying through and interactively manipulating macromolecules. Experimental immersive environments—for example, CAVE⁸⁶—have been in development for over 20 years, and concepts from this research have been used to enhance the user experience of several molecular graphics tools (for example, Yasara; Table 1). But such techniques have yet to find widespread use for molecular visualization—partially because of the still high cost and cumbersome nature of such systems, but perhaps also because the sense of immersion is not critical for interaction with the molecular world.

Today, however, some of the hardware components for virtual reality are becoming affordable and practical, such as head-mounted displays with head tracking, and a variety of haptic devices (mechanical input devices that are touch sensitive), such as the Wii controller, as well as devices such as wired gloves that can provide force feedback. These improvements are largely driven by the gaming market and are expected to continue rapidly. For most molecular graphics tools, minimal modifications should be required to allow them to work with such hardware, and some tools have been built with such support already in mind (for example, VMD²⁶ and SRS 3D⁷). However fully exploiting the promise of virtual reality will require substantial further software development, particularly to the user interface layer.

Physical models. Today, molecular visualization relies almost exclusively on computer-generated images. Although physical wooden and wire molecular models played a critical early role in structural chemistry and biology, the advent of three-dimensional interactive computer graphics in the 1970's provided new and much improved utility in macromolecular structure determination and analysis. However, a more recent technology, computer autofabrication, or 'solid printing', initially developed for industrial rapid prototyping, is now being used to produce physical molecular models. Such models bring back the properties of real-object perception and manipulation that were lost when the model resided only in the computer. Over the past decade, the variety of such printers has increased steadily as the entry price has dropped to below $10,000 and printing services have sprouted to fill this new niche. Because accurate and complex tangible models can be produced automatically as computer 'printouts' of molecular geometrical representations, the barrier to custom production has disappeared. Physical models with functional parts have been autofabricated with analog physical constraints, affinities and/or structural behavior of the molecular system⁸⁷ (Fig. 8).

**Figure 8: Tangible models in research.**

Such models have begun to be used for structural research. As persistent objects they are convenient, accessible and naturally manipulable. They can be used as springboards to ideas and hypotheses⁸⁸. Such characteristics also make physical models useful in multidisciplinary collaborations, helping structural experts communicate better with other colleagues. In addition, physical models lend themselves to teaching⁸⁹. We are in the early stages of learning how to best use physical models in structural biology education and research, perhaps comparable to where computer graphics was in the 1970s. This is an ongoing area of research^90,91,92.

Future perspectives

Methods for visualizing molecular structures are very mature. In the near future, we can expect more effective computational approaches for representing, analyzing and synthesizing ever-more-complex molecular systems. Increased collaboration with the graphic design community will also lead to the development of more effective and intelligible rendering approaches. However, we expect that most of the advances in molecular visualization will come in the areas of computer interfaces, user interaction and new ways to represent and visualize nonspatial information. These changes will help structures reach an even broader audience. Navigating a synthesis of structural data with image data⁸² and genomic^22,93 and biological network information⁶⁴ will require new methods that combine spatial and dynamic representations with statistical and high-dimensional abstract relationships. We also anticipate that collaborative community editing of structure-related data sources (for example, Proteopedia⁹⁴) will change how scientists relate to structural data, and to each other. The fields of information visualization and visual analytics have developed over the past decade to address problems in making such complex data intelligible and navigable^95,96.

Some of the drawbacks of immersive virtual reality may be overcome by the emerging technology of augmented reality (Fig. 8d), which provides inexpensive and accessible ways to interact in intuitive and perceptually rich ways with our computational models. Whatever direction new technologies will take us, the roles of macromolecular visualization in understanding, gaining insight and developing ideas will remain the same.

Note: Supplementary information is available on the Nature Methods website.

Accession codes

Accessions

Protein Data Bank

References

Berman, H., Henrick, K. & Nakamura, H. Announcing the worldwide Protein Data Bank. Nat. Struct. Biol. 10, 980 (2003).
CAS PubMed Google Scholar
Goddard, T.D. & Ferrin, T.E. Visualization software for molecular assemblies. Curr. Opin. Struct. Biol. 17, 587–595 (2007).
CAS PubMed PubMed Central Google Scholar
Tate, J. Molecular visualization. Methods Biochem. Anal. 44, 135–158 (2003).
CAS PubMed Google Scholar
Olson, A.J. & Pique, M.E. Visualizing the future of molecular graphics. SAR QSAR Environ. Res. 8, 233–247 (1998).
CAS PubMed Google Scholar
Berman, H.M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
CAS PubMed PubMed Central Google Scholar
Richardson, D.C. & Richardson, J.S. The kinemage: a tool for scientific communication. Protein Sci. 1, 3–9 (1992).
CAS PubMed PubMed Central Google Scholar
O'Donoghue, S.I., Meyer, J.E.W., Schafferhans, A. & Fries, K. The SRS 3D module: integrating sequence, structure, and annotation data. Bioinformatics 20, 2476–2478 (2004).
CAS PubMed Google Scholar
Bairoch, A. et al. The Universal Protein Resource (UniProt). Nucleic Acids Res. 33 (Database issue), D154–D159 (2005).
CAS PubMed Google Scholar
Schafferhans, A., Meyer, J.E.W. & O'Donoghue, S.I. The PSSH database of alignments between protein sequences and tertiary structures. Nucleic Acids Res. 31, 494–498 (2003).
CAS PubMed PubMed Central Google Scholar
Arnold, K. et al. The Protein Model Portal. J. Struct. Funct. Genomics 10, 1–8 (2009).
CAS PubMed Google Scholar
Schwede, T., Kopp, J., Guex, N. & Peitsch, M.C. SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res. 31, 3381–3385 (2003).
CAS PubMed PubMed Central Google Scholar
Krieger, E., Nabuurs, S.B. & Vriend, G. Homology modeling. Methods Biochem. Anal. 44, 509–523 (2003).
CAS PubMed Google Scholar
Cozzetto, D. et al. Evaluation of template-based models in CASP8 with standard measures. Proteins 77 (Suppl 9), 18–28 (2009).
CAS PubMed PubMed Central Google Scholar
Ben-David, M. et al. Assessment of CASP8 structure predictions for template free targets. Proteins 77 (Suppl 9), 50–65 (2009).
CAS PubMed Google Scholar
Bradley, P., Misura, K.M. & Baker, D. Toward high-resolution de novo structure prediction for small proteins. Science 309, 1868–1871 (2005).
CAS PubMed Google Scholar
Pettersen, E.F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
CAS PubMed Google Scholar
Wang, Y., Geer, L.Y., Chappey, C., Kans, J.A. & Bryant, S.H. Cn3D: sequence and structure views for Entrez. Trends Biochem. Sci. 25, 300–302 (2000).
CAS PubMed Google Scholar
Hartshorn, M.J. AstexViewer: a visualisation aid for structure-based drug design. J. Comput. Aided Mol. Des. 16, 871–881 (2002).
CAS PubMed Google Scholar
Gille, C. & Frommel, C. STRAP: editor for STRuctural Alignments of Proteins. Bioinformatics 17, 377–378 (2001).
CAS PubMed Google Scholar
Guex, N. & Peitsch, M.C. SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18, 2714–2723 (1997).
CAS PubMed Google Scholar
Zanzoni, A., Ausiello, G., Via, A., Gherardini, P.F. & Helmer-Citterich, M. Phospho3D: a database of three-dimensional structures of protein phosphorylation sites. Nucleic Acids Res. 35 Database issue, D229–D231 (2007).
CAS PubMed Google Scholar
Procter, J.B. et al. Visualization of multiple alignments, phylogenies and gene family evolution. Nat. Methods 7, S16–S25 (2010).
CAS PubMed Google Scholar
Kraulis, P.J. Molscript: a program to produce both detailed and schematic plots of protein structures. J. Appl. Crystallogr. 24, 946–950 (1991).
Google Scholar
Merritt, E.A. & Bacon, D.J. Raster3D: photorealistic molecular graphics. Methods Enzymol. 277, 505–524 (1997).
CAS PubMed Google Scholar
Sanner, M.F. A component-based software environment for visualizing large macromolecular assemblies. Structure 13, 447–462 (2005).
CAS PubMed Google Scholar
Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. Graph. 14, 33–38 (1996). Widely used and versatile tool for displaying, animating and analyzing large biomolecular systems. Particularly suited for MD simulations.
CAS PubMed Google Scholar
Prlic, A., Down, T.A. & Hubbard, T.J. Adding some SPICE to DAS. Bioinformatics 21 (Suppl. 2), ii40–ii41 (2005).
CAS PubMed Google Scholar
Huehne, R. & Suehnel, J. The Jena Library of Biological Macromolecules – JenaLib. Preprint at 〈http://precedings.nature.com/documents/3114/version/1/〉 (2009).
Laskowski, R.A. PDBsum: summaries and analyses of PDB structures. Nucleic Acids Res. 29, 221–222 (2001).
CAS PubMed PubMed Central Google Scholar
Gille, C. Structural interpretation of mutations and SNPs using STRAP-NT. Protein Sci. 15, 208–210 (2006).
CAS PubMed PubMed Central Google Scholar
Gabdoulline, R.R., Ulbrich, S., Richter, S. & Wade, R.C. ProSAT2–protein structure annotation server. Nucleic Acids Res. 34 (Web Server issue), W79–W83 (2006).
CAS PubMed PubMed Central Google Scholar
Bordner, A.J. & Gorin, A.A. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces. BMC Bioinformatics 9, 234 (2008).
PubMed PubMed Central Google Scholar
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 (2007).
CAS PubMed Google Scholar
Kolodny, R., Koehl, P. & Levitt, M. Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. J. Mol. Biol. 346, 1173–1188 (2005).
CAS PubMed PubMed Central Google Scholar
Koradi, R., Billeter, M. & Wuthrich, K. MOLMOL: a program for display and analysis of macromolecular structures. J. Mol. Graph. 14, 51–55 29–32 (1996).
CAS PubMed Google Scholar
Russell, R.B. & Barton, G.J. Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels. Proteins 14, 309–323 (1992).
CAS PubMed Google Scholar
Theobald, D.L. & Wuttke, D.S. THESEUS: maximum likelihood superpositioning and analysis of macromolecular structures. Bioinformatics 22, 2171–2172 (2006).
CAS PubMed Google Scholar
Connolly, M.L. Solvent-accessible surfaces of proteins and nucleic acids. Science 221, 709–713 (1983).
CAS PubMed Google Scholar
Sippl, M.J. Boltzmann's principle, knowledge-based mean fields, and protein folding. An approach to the computational determination of protein structures. J. Comput. Aided Mol. Des. 7, 473–501 (1993).
CAS PubMed Google Scholar
Wesson, L. & Eisenberg, D. Atomic solvation parameters applied to molecular dynamics of proteins in solution. Protein Sci. 1, 227–235 (1992).
CAS PubMed PubMed Central Google Scholar
Sanner, M.F., Olson, A.J. & Spehner, J.-C. Reduced surface: an efficient way to compute molecular surfaces. Biopolymers 38, 305–320 (1996).
Article CAS PubMed Google Scholar
Vriend, G. WHAT IF: a molecular modeling and drug design program. J. Mol. Graph. 8, 52–56 (1990).
CAS PubMed Google Scholar
Hunter, W.N. Structure-based ligand design and the promise held for antiprotozoan drug discovery. J. Biol. Chem. 284, 11749–11753 (2009).
CAS PubMed PubMed Central Google Scholar
Harris, R., Olson, A.J. & Goodsell, D.S. Automated prediction of ligand-binding sites in proteins. Proteins 70, 1506–1517 (2008).
CAS PubMed Google Scholar
Goodford, P.J. A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. J. Med. Chem. 28, 849–857 (1985).
CAS PubMed Google Scholar
Campbell, S.J., Gold, N.D., Jackson, R.M. & Westhead, D.R. Ligand binding: functional site location, similarity and docking. Curr. Opin. Struct. Biol. 13, 389–395 (2003).
CAS PubMed Google Scholar
Lichtarge, O., Bourne, H.R. & Cohen, F.E. An evolutionary trace method defines binding surfaces common to protein families. J. Mol. Biol. 257, 342–358 (1996).
CAS PubMed Google Scholar
Morgan, D.H., Kristensen, D.M., Mittelman, D. & Lichtarge, O. ET viewer: an application for predicting and visualizing functional sites in protein structures. Bioinformatics 22, 2049–2050 (2006).
CAS PubMed Google Scholar
Laskowski, R.A., Watson, J.D. & Thornton, J.M. ProFunc: a server for predicting protein function from 3D structure. Nucleic Acids Res. 33 (Web Server issue), W89–W93 (2005).
CAS PubMed PubMed Central Google Scholar
Kinoshita, K., Murakami, Y. & Nakamura, H. eF-seek: prediction of the functional sites of proteins by searching for similar electrostatic potential and molecular surface shape. Nucleic Acids Res. 35 (Web Server issue), W398–W402 (2007).
PubMed PubMed Central Google Scholar
Wolber, G. & Kosara, R. Pharmacophores from macromolecular complexes with LigandScout. in Pharmacophores and Pharmacophore Searches (ed. Langer, T. & Hoffmann, R.D.) vol. 32, 131–150 (Wiley-VCH, Weinheim, Germany, 2006).
Vulpetti, A. & Pevarello, P. An analysis of the binding modes of ATP-competitive CDK2 inhibitors as revealed by X-ray structures of protein-inhibitor complexes. Curr. Med. Chem. Anticancer Agents 5, 561–573 (2005).
CAS PubMed Google Scholar
Zou, J. et al. Towards more accurate pharmacophore modeling: Multicomplex-based comprehensive pharmacophore map and most-frequent-feature pharmacophore model of CDK2. J. Mol. Graph. Model. 27, 430–438 (2008).
CAS PubMed Google Scholar
Rarey, M., Kramer, B., Lengauer, T. & Klebe, G. A fast flexible docking method using an incremental construction algorithm. J. Mol. Biol. 261, 470–489 (1996).
CAS PubMed Google Scholar
Morris, G.M. et al. AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem. 30, 2785–2791 (2009).
CAS PubMed PubMed Central Google Scholar
Zoete, V., Grosdidier, A. & Michielin, O. Docking, virtual high throughput screening and in silico fragment-based drug design. J. Cell. Mol. Med. 13, 238–248 (2009).
CAS PubMed PubMed Central Google Scholar
Karkola, S., Alho-Richmond, S. & Wahala, K. Pharmacophore modelling of 17β-HSD1 enzyme based on active inhibitors and enzyme structure. Mol. Cell. Endocrinol. 301, 225–228 (2009).
CAS PubMed Google Scholar
Hendlich, M. Databases for protein-ligand complexes. Acta Crystallogr. D Biol. Crystallogr. 54, 1178–1182 (1998).
CAS PubMed Google Scholar
Gunther, J., Bergner, A., Hendlich, M. & Klebe, G. Utilising structural knowledge in drug design strategies: applications using Relibase. J. Mol. Biol. 326, 621–636 (2003). Provides several detailed examples showing how Relibase can aid structure-based drug design.
CAS PubMed Google Scholar
Michalsky, E., Dunkel, M., Goede, A. & Preissner, R. SuperLigands—a database of ligand structures derived from the Protein Data Bank. BMC Bioinformatics 6, 122 (2005).
PubMed PubMed Central Google Scholar
Xie, L., Li, J. & Bourne, P.E. Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors. PLOS Comput. Biol. 5, e1000387 (2009).
PubMed PubMed Central Google Scholar
Kinnings, S.L. et al. Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis. PLOS Comput. Biol. 5, e1000423 (2009).
PubMed PubMed Central Google Scholar
Kuhn, M., von Mering, C., Campillos, M., Jensen, L.J. & Bork, P. STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 36 (Database issue), D684–D688 (2008). Useful and easy-to-use tool for visualizing graphical networks showing interactions between proteins and small molecules. Underlying data is consolidated from many sources, including PDB.
CAS PubMed Google Scholar
Gehlenborg, N. et al. Visualization of omics data for systems biology. Nat. Methods 7, S56–S68 (2010).
CAS PubMed Google Scholar
Wallace, A.C., Laskowski, R.A. & Thornton, J.M. LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions. Protein Eng. 8, 127–134 (1995). Widely used for generating simplified, two-dimensional schematic diagrams of protein-ligand interactions from the three-dimensional coordinates.
CAS PubMed Google Scholar
Stierand, K., Maass, P.C. & Rarey, M. Molecular complexes at a glance: automated generation of two-dimensional complex diagrams. Bioinformatics 22, 1710–1716 (2006).
CAS PubMed Google Scholar
Clark, A.M. & Labute, P. 2D depiction of protein-ligand complexes. J. Chem. Inf. Model. 47, 1933–1944 (2007).
CAS PubMed Google Scholar
Berman, H.M. et al. The nucleic acid database: a comprehensive relational database of three-dimensional structures of nucleic acids. Biophys. J. 63, 751–759 (1992).
CAS PubMed PubMed Central Google Scholar
Jossinet, F. & Westhof, E. Sequence to Structure (S2S): display, manipulate and interconnect RNA data from sequence to structure. Bioinformatics 21, 3320–3321 (2005).Offers the most complete set of features for viewing RNA structures. Recommended for advanced users. Also available by web services.
CAS PubMed Google Scholar
Ringe, D. & Petsko, G.A. The 'glass transition' in protein dynamics: what it is, why it occurs, and how to exploit it. Biophys. Chem. 105, 667–680 (2003).
CAS PubMed Google Scholar
Flores, S. et al. The Database of Macromolecular Motions: new features added at the decade mark. Nucleic Acids Res. 34 (Database issue), D296–D301 (2006).
CAS PubMed Google Scholar
Maiti, R., Van Domselaar, G.H. & Wishart, D.S. MovieMaker: a web server for rapid rendering of protein motions and interactions. Nucleic Acids Res. 33 (Web Server issue), W358–W362 (2005).
CAS PubMed PubMed Central Google Scholar
Chennubhotla, C., Rader, A.J., Yang, L.W. & Bahar, I. Elastic network models for understanding biomolecular machinery: from enzymes to supramolecular assemblies. Phys. Biol. 2, S173–S180 (2005).
CAS PubMed Google Scholar
Lindahl, E., Azuara, C., Koehl, P. & Delarue, M. NOMAD-Ref: visualization, deformation and refinement of macromolecular structures based on all-atom normal mode analysis. Nucleic Acids Res. 34 (Web Server issue), W52–W56 (2006).
CAS PubMed PubMed Central Google Scholar
Eyal, E., Yang, L.W. & Bahar, I. Anisotropic network model: systematic evaluation and a new web interface. Bioinformatics 22, 2619–2627 (2006).
CAS PubMed Google Scholar
Seeliger, D. & De Groot, B.L. tCONCOORD-GUI: visually supported conformational sampling of bioactive molecules. J. Comput. Chem. 30, 1160–1166 (2009).
CAS PubMed Google Scholar
Thorpe, M.F., Lei, M., Rader, A.J., Jacobs, D.J. & Kuhn, L.A. Protein flexibility and dynamics using constraint theory. J. Mol. Graph. Model. 19, 60–69 (2001).
CAS PubMed Google Scholar
Gerstein, M., Lesk, A.M. & Chothia, C. Structural mechanisms for domain movements in proteins. Biochemistry 33, 6739–6749 (1994).
CAS PubMed Google Scholar
Zhao, Y., Stoffler, D. & Sanner, M. Hierarchical and multi-resolution representation of protein flexibility. Bioinformatics 22, 2768–2774 (2006).
CAS PubMed Google Scholar
Finocchiaro, G., Wang, T., Hoffmann, R., Gonzalez, A. & Wade, R.C. DSMM: a database of simulated molecular motions. Nucleic Acids Res. 31, 456–457 (2003).
CAS PubMed PubMed Central Google Scholar
Kehl, C., Simms, A.M., Toofanny, R.D. & Daggett, V. Dynameomics: a multi-dimensional analysis-optimized database for dynamic protein data. Protein Eng. Des. Sel. 21, 379–386 (2008).
CAS PubMed Google Scholar
Walter, T. et al. Visualization of image data from cells to organisms. Nat. Methods 7, S26–S41 (2010).
CAS PubMed PubMed Central Google Scholar
Goodsell, D.S. Visual methods from atoms to cells. Structure 13, 347–354 (2005).
CAS PubMed Google Scholar
Goodsell, D.S. Making the step from chemistry to biology and back. Nat. Chem. Biol. 3, 681–684 (2007).
CAS PubMed Google Scholar
McGill, G. Molecular movies. coming to a lecture near you. Cell 133, 1127–1132 (2008).
CAS PubMed Google Scholar
Cruz-Neira, C., Sandin, D.J., DeFanti, T.A., Kenyon, R.V. & Hart, J.C. The CAVE: audio visual experience automatic virtual environment. Commun. ACM 35, 64–72 (1992).
Google Scholar
Gillet, A., Sanner, M.F., Stoffler, D. & Olson, A.J. Tangible interfaces for structural molecular biology. Structure 13, 483–491 (2005).
CAS PubMed Google Scholar
Olson, A.J., Hu, Y.H. & Keinan, E. Chemical mimicry of viral capsid self-assembly. Proc. Natl. Acad. Sci. USA 104, 20731–20736 (2007).
CAS PubMed PubMed Central Google Scholar
Herman, T. et al. Tactile teaching: exploring protein structure/function using physical models. Biochem. Mol. Biol. Educ. 34, 247–254 (2006).
CAS PubMed Google Scholar
Creem, S.H. & Proffitt, D.R. Grasping objects by their handles: a necessary interaction between cognition and action. J. Exp. Psychol. Hum. Percept. Perform. 27, 218–228 (2001).
CAS PubMed Google Scholar
Kozma, R. The material features of multiple representations and their cognitive and social affordances for science understanding. Learning and Instruction 13, 205–226 (2003).
Google Scholar
Zhang, J. & Patel, V.L. Distributed cognition, representation, and affordance. Pragmatics & Cognition 14, 333–341 (2006).
Google Scholar
Nielsen, C.B., Cantor, M., Dubchak, I., Gordon, D. & Wang, T. Visualizing genomes: techniques and challenges. Nat. Methods 7, S5–S15 (2009).
Google Scholar
Hodis, E. et al. Proteopedia—a scientific 'wiki' bridging the rift between three-dimensional structure and function of biomacromolecules. Genome Biol. 9, R121 (2008).
PubMed PubMed Central Google Scholar
Cook, K., Earnshaw, R. & Stasko, J. Discovering the unexpected. IEEE Comput. Graph. Appl. 27, 15–19 (2007).
PubMed Google Scholar
Kerren, A., Stasko, J.T., Fekete, J.D. & North, C. Information Visualization (Springer, New York, 2008).
Schindler, T. et al. Crystal structure of Hck in complex with a Src family-selective tyrosine kinase inhibitor. Mol. Cell 3, 639–648 (1999).
CAS PubMed Google Scholar
Xu, W., Harrison, S.C. & Eck, M.J. Three-dimensional structure of the tyrosine kinase c-Src. Nature 385, 595–602 (1997).
CAS PubMed Google Scholar
Lam, P.Y. et al. Rational design of potent, bioavailable, nonpeptide cyclic ureas as HIV protease inhibitors. Science 263, 380–384 (1994).
CAS PubMed Google Scholar
Gangloff, M. et al. Crystal structure of a mutant hERα ligand-binding domain reveals key structural features for the mechanism of partial agonism. J. Biol. Chem. 276, 15059–15065 (2001).
CAS PubMed Google Scholar
Brzozowski, A.M. et al. Molecular basis of agonism and antagonism in the oestrogen receptor. Nature 389, 753–758 (1997).
CAS PubMed Google Scholar
Robertson, M.P. et al. The structure of a rigorously conserved RNA element within the SARS virus genome. PLoS Biol. 3, e5 (2005).
PubMed Google Scholar
Petrek, M. et al. CAVER: a new tool to explore routes from protein clefts, pockets and cavities. BMC Bioinformatics 7, 316 (2006).
PubMed PubMed Central Google Scholar
Spaar, A., Dammer, C., Gabdoulline, R.R., Wade, R.C. & Helms, V. Diffusional encounter of barnase and barstar. Biophys. J. 90, 1913–1924 (2006).
CAS PubMed Google Scholar
Al-Amoudi, A., Diez, D.C., Betts, M.J. & Frangakis, A.S. The molecular architecture of cadherins in native epidermal desmosomes. Nature 450, 832–837 (2007).
CAS PubMed Google Scholar
Boggon, T.J. et al. C-cadherin ectodomain structure and implications for cell adhesion mechanisms. Science 296, 1308–1313 (2002).
CAS PubMed Google Scholar
Marina, A., Waldburger, C.D. & Hendrickson, W.A. Structure of the entire cytoplasmic portion of a sensor histidine-kinase protein. EMBO J. 24, 4247–4259 (2005).
CAS PubMed PubMed Central Google Scholar
Mal, T.K., Matthews, S.J., Kovacs, H., Campbell, I.D. & Boyd, J. Some NMR experiments and a structure determination employing a {¹⁵N,²H} enriched protein. J. Biomol. NMR 12, 259–276 (1998).
CAS PubMed Google Scholar
Kremer, J.R., Mastronarde, D.N. & McIntosh, J.R. Computer visualization of three-dimensional image data using IMOD. J. Struct. Biol. 116, 71–76 (1996).
CAS PubMed Google Scholar
Sayle, R.A. & Milner-White, E.J. RASMOL: biomolecular graphics for all. Trends Biochem. Sci. 20, 374 (1995).
CAS PubMed Google Scholar
Wiederstein, M. & Sippl, M.J. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 35 (Web Server issue), W407–W410 (2007).
PubMed PubMed Central Google Scholar
Rhodes, G. Crystallography Made Crystal Clear: A Guide for Users of Macromolecular Models 3rd edn. (Academic Press, 2006).
Glykos, N.M. On the application of molecular-dynamics simulations to validate thermal parameters and to optimize TLS-group selection for macromolecular refinement. Acta Crystallogr. D Biol. Crystallogr. 63, 705–713 (2007).
CAS PubMed Google Scholar
Brünger, A.T. The free R factor: a novel statistical quantity for assessing the accuracy of crystal structures. Nature 355, 472–474 (1992).
PubMed Google Scholar
Kleywegt, G.J. et al. The Uppsala electron-density server. Acta Crystallogr. D Biol. Crystallogr. 60, 2240–2249 (2004).
PubMed Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
PubMed Google Scholar
Jones, T.A. Diffraction methods for biological macromolecules. Interactive computer graphics: FRODO. Methods Enzymol. 115, 157–171 (1985).
CAS PubMed Google Scholar
Jones, T.A., Zou, J.Y., Cowan, S.W. & Kjeldgaard, M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A 47, 110–119 (1991).
PubMed Google Scholar
Levin, E.J., Kondrashov, D.A., Wesenberg, G.E. & Phillips, G.N. Jr. Ensemble refinement of protein crystal structures: validation and application. Structure 15, 1040–1052 (2007).
CAS PubMed PubMed Central Google Scholar
Rieping, W., Habeck, M. & Nilges, M. Inferential structure determination. Science 309, 303–306 (2005).
CAS PubMed Google Scholar
Nederveen, A.J. et al. RECOORD: a recalculated coordinate database of 500+ proteins from the PDB using restraints from the BioMagResBank. Proteins 59, 662–672 (2005).
CAS PubMed Google Scholar
Selenko, P. & Wagner, G. Looking into live cells with in-cell NMR spectroscopy. J. Struct. Biol. 158, 244–253 (2007).
CAS PubMed Google Scholar
Eliezer, D. Biophysical characterization of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 19, 23–30 (2009).
CAS PubMed PubMed Central Google Scholar
Tugarinov, V., Choy, W.-Y., Orekhov, V.Y. & Kay, L.E. Solution NMR-derived global fold of a monomeric 82-kDa enzyme. Proc. Natl. Acad. Sci. USA 102, 622–627 (2005).
CAS PubMed PubMed Central Google Scholar
Markwick, P.R., Malliavin, T. & Nilges, M. Structural biology by NMR: structure, dynamics, and interactions. PLOS Comput. Biol. 4, e1000168 (2008).
PubMed PubMed Central Google Scholar
Wang, L. & Sigworth, F.J. Cryo-EM and single particles. Physiology (Bethesda) 21, 13–18 (2006).
Google Scholar
Frank, J. Three-dimensional Electron Microscopy of Macromolecular Assemblies 2nd edn. (Oxford University Press, 2006).
Frank, J. ed. Electron Tomography 2nd edn. (Springer, 2006).
Yu, X., Jin, L. & Zhou, Z.H. 3.88 Å structure of cytoplasmic polyhedrosis virus by cryo-electron microscopy. Nature 453, 415–419 (2008).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Thanks to M. Berynskyy and L. Biedermannova for assistance with Figure 6. This work was partly supported by the European Union Framework Programme 6 grant 'TAMAHUD' (LSHC-CT-2007-037472). R.C.W. gratefully acknowledges the support of the Klaus Tschira Foundation.

Author information

Authors and Affiliations

European Molecular Biology Laboratory, Heidelberg, Germany
Seán I O'Donoghue & Andrea Schafferhans
The Scripps Research Institute, La Jolla, California, USA
David S Goodsell & Arthur J Olson
Goethe University, Frankfurt, Germany
Achilleas S Frangakis
Institut de Biologie Moléculaire et Cellulaire du Centre National de la Recherche Scientifique (CNRS), Université de Strasbourg, Strasbourg, France
Fabrice Jossinet & Eric Westhof
European Bioinformatics Institute, Cambridge, UK
Roman A Laskowski
Institut Pasteur, Paris, France
Michael Nilges
Institute of Structural and Molecular Biology, Birkbeck College, London, UK
Helen R Saibil
Heidelberg Institute for Theoretical Sciences (HITS), Heidelberg, Germany
Rebecca C Wade

Authors

Seán I O'Donoghue
View author publications
You can also search for this author in PubMed Google Scholar
David S Goodsell
View author publications
You can also search for this author in PubMed Google Scholar
Achilleas S Frangakis
View author publications
You can also search for this author in PubMed Google Scholar
Fabrice Jossinet
View author publications
You can also search for this author in PubMed Google Scholar
Roman A Laskowski
View author publications
You can also search for this author in PubMed Google Scholar
Michael Nilges
View author publications
You can also search for this author in PubMed Google Scholar
Helen R Saibil
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Schafferhans
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca C Wade
View author publications
You can also search for this author in PubMed Google Scholar
Eric Westhof
View author publications
You can also search for this author in PubMed Google Scholar
Arthur J Olson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Seán I O'Donoghue.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figure 1 and Supplementary Table 1 (PDF 729 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

O'Donoghue, S., Goodsell, D., Frangakis, A. et al. Visualization of macromolecular structures. Nat Methods 7 (Suppl 3), S42–S55 (2010). https://doi.org/10.1038/nmeth.1427

Download citation

Published: 01 March 2010
Issue Date: March 2010
DOI: https://doi.org/10.1038/nmeth.1427

This article is cited by

Enhancing Enzyme Activity and Thermostability of Bacillus amyloliquefaciens Chitosanase BaCsn46A Through Saturation Mutagenesis at Ser196
- Guo Jing
- Gao Wenjun
- Cai Zhiqiang
Current Microbiology (2023)
Network pharmacology of iridoid glycosides from Eucommia ulmoides Oliver against osteoporosis
- Ting Wang
- Liming Fan
- Yang Li
Scientific Reports (2022)
Developing novel methods to image and visualize 3D genomes
- Tszshan Ma
- Long Chen
- Juntao Gao
Cell Biology and Toxicology (2018)
LiteMol suite: interactive web-based visualization of large-scale macromolecular structure data
- David Sehnal
- Mandar Deshpande
- Jaroslav Koča
Nature Methods (2017)
Aquaria: simplifying discovery and insight from protein structures
- Seán I O'Donoghue
- Kenneth S Sabir
- Andrea Schafferhans
Nature Methods (2015)