Main

“Each level of organization represents a threshold where objects, methods and conditions of observation suddenly change . . . Biology has then to articulate these levels two by two, to cross each threshold and unveil its peculiarities of integration and logic.” François Jacob, in The Logic of Life (Penguin, London, 1989).

Rather like those of Albert Einstein, DNA's popular images are hardly representative. While it is fashionable in these post-genome days to show it as an endless string of A's, C's, G's and T's, this year's anniversary will surely be replete with two kinds of picture. One shows the famous double helix, delightfully suggesting the twin snakes of Wisdom and Knowledge intertwining around the caduceus, the staff of the medic's god Hermes. The other reveals the X-shaped symbol of inheritance, the chromosome.

But it is rare that DNA looks this good. For only a couple of hours during the early stages of the cell cycle, as the cell prepares to divide, the genome is compacted into its distinctive chromosomal fragments (Fig. 1). The rest of the time you will search the eukaryotic cell in vain for those molecular tetrapods. What you find instead in the cell nucleus is, apparently, a tangled mess.

Figure 1: Inset: coloured scanning electron micrograph of a pair of human chromosomes.
figure 1

INSET: A. SYRED/SPL. MAIN IMAGE: DRISCOLL, YOUNGQUIST & BALDESCHWIELER, CALTECH/SPL

Main image: scanning tunnelling micrograph showing approximately three turns of a DNA double helix. The image is created by scanning a fine point just above the surface of a DNA molecule and electronically recording the height of the point as it moves across the specimen.

And don't think that this will, on closer inspection, turn out to be woven from that elegant, pristine double helix. Rather, the threads are chromatin — a filamentary assembly of DNA and proteins — in which only very short stretches of the naked helix are fleetingly revealed. Although chromosomes are often equated with DNA, there is actually about twice as much protein as DNA in chromatin. And about 10 per cent of the mass of a chromosome is made up of RNA chains, newly formed (or in the act of forming) on the DNA template in the process called transcription.

Zooming in on DNA

If we want to know how DNA really functions, it is not enough to zoom in to the molecular level with its beautifully simple staircase of base pairs. Textbooks tend, understandably, to show replication as the steady progress of the DNA-synthesizing enzyme DNA polymerase along a linear single strand laid out like a railway line (see article in this issue by Alberts, page 431), and RNA polymerase doing likewise in transcription. One has the impression of the genome as a book lying open, waiting to be read.

However, it is not so straightforward. The book is closed up, sealed, and packed away. Moreover, the full story is not merely what is written on the pages; these operations on DNA involve information transmission over many length scales. Perhaps those who do not routinely have to delve into the intricacies of genome function have acquired such a simplistic picture of it all because, until relatively recently, these length scales were considered largely out of bounds for molecular science. We know about molecules; we know about cells and organelles; but the stuff in between is messy and mysterious.

We speak of molecular biology and cell biology, but no one really talks of mesobiology. Yet that is the level of magnification at which much of the action takes place: the scale of perhaps a few to several hundred nanometres. How DNA is arranged on these scales seems to be central to the processes of replication and transcription that we have come to think of in terms of neat base pairings, yet it is precisely here that our understanding remains the most patchy.

Partly that's because the mesoscale represents, quite literally, a difficult middle ground. It encompasses too many atoms for one to apply straightforward molecular mechanics, with its bond bending and breaking; yet the graininess still matters, the continuum has not yet become a good approximation. As Bustamante and colleagues show elsewhere in this issue (see page 423), looking at DNA on a scale where it flexes and twists like a soft rod reveals how the mechanical and the molecular interact.

Take the problem of supercoiling, for example. The closed loops of bacterial DNA can develop twists like those in a Möbius strip, which either 'overwind' or 'underwind' the helix. Generally there is some degree of underwinding (negative supercoiling) such that there is one negative supercoil for every 200 base pairs (bp) or so. This has an energy cost of around –9 kcal mol−1, which manifests itself in physiological effects. In bacteria, too much supercoiling can inhibit growth, which is why enzymes called topoisomerases exist to release it. On the other hand, negative supercoiling tends to unwind the double helix, which is needed to initiate strand separation for DNA replication.

Although the chromosomal DNA of eukaryotes has free ends, it too is prone to supercoiling, as it seems typically to be attached in large loops to a filamentous structure called the nuclear matrix that coats the inside of the nuclear membrane. The attachment may in fact be necessary for both replication and transcription to take place.

Packaging problem

Stretched into a linear double helix, the three billion or so base pairs of human DNA would measure 1.8 m. This strand, snipped into 46 chromosomes, has to be packed into a nucleus just 6 μm or so across. As a result, the DNA chains are far from the idealized picture of molecules floating in an infinite solvent. They have a density of around 100 mg per ml, comparable to that of a highly viscous polymer gel.

The packing ratio for the chains is therefore enormous. In the smallest human chromosome, a length of DNA 14 mm long is compressed into a chromosome about 2 μm long: a packing ratio of 7,000. The first stage in solving this packaging problem is to wind the DNA around protein disks to form a bead-like nucleosome (see article in this issue by Felsenfeld and Groudine, page 448). Each disk is an octamer of four types of histone protein; a fifth histone, called H1, seals the DNA to the disk at the point where the winding starts and ends. Each nucleosome, 6 nm high by 11 nm in diameter, binds around 200 bp of DNA in two coils, and there is very little 'free' DNA between adjacent nucleosomes: sometimes as little as 8 bp.

The string of nucleosomes forms a fibre about 10 nm thick, which is then packaged into a filament three times as wide. This 30-nm fibre is the basic element of chromatin — yet we still don't know its structure. It is widely held to be composed of nucleosomes arranged in a solenoid, but hard evidence for this is scanty. How many celebrations of the double helix will admit that, 50 years on, we don't really know what DNA at large in the cell looks like?

The 30-nm fibre is further folded and condensed to give a packing ratio of around 1,000 in chromosomes during interphase (the time between cell divisions), and around ten times that in the X-shaped chromosomes of mitosis (cell division). How this happens is even more of a mystery. For mitotic chromosomes it was thought until only recently that there might be a contiguous protein scaffold holding the whole affair together; but now it seems that the structural integrity must come from chromatin crosslinking1. All the histones seem to have higher-order structural functions. Multi-subunit protein complexes in yeast called SWI/SNF and RSC (both of which seem to have human homologues) are chromatin-remodelling machines, which distort histone–DNA contacts or transfer histones between DNA molecules, exposing the DNA to attack by DNA-cleaving enzymes called nucleases. How they work remains hazy2. According to one recent study3, DNA engaged by such complexes 'behaves as if it were free and bound at the same time'. Or in other words, as if 'free' and 'bound' were notions too simplistic to have much meaning here. What is clear is that these chromatin-shaping machines are important in transcription: cells lacking RSC are no longer viable.

There are in fact two types of chromatin in the nucleus of an interphase eukaryotic cell. Euchromatin is the most abundant: it is relatively dispersed, with the tangled-net appearance of a polymer gel. Heterochromatin is much denser (virtually solid-like), comparable to the density of mitotic chromosomes, and is confined to a few small patches. The invitation is to regard euchromatin as 'active' DNA, unpacked enough to let the transcription apparatus get to work on it, while heterochromatin is compressed, like a big data file, until needed. But like just about any other generalization about DNA's structure and behaviour, this one quickly breaks down. Clearly only a fraction of a cell's euchromatin is made up of transcribable DNA in the first place (so why not pack the rest away?); and even chromosomes containing a large amount of heterochromatin can be transcriptionally active. Some researchers think that 'euchromatin' and 'heterochromatin' are just blanket terms for many things we don't understand: further hierarchies of DNA organization yet to be revealed.

Structured chaos

Certainly, there seems to be more to the nucleus than a disorderly mass of DNA. It is a constantly changing structure, but not randomly: there is method in there somewhere. Specific chromosomes occupy discrete nuclear positions during interphase, and these positions can change in a deterministic way in response to changes in the cell's physiological state.

And the euchromatin itself has an internal logic, albeit one only partly decoded. It has been proposed that DNA has sequences called scaffold/matrix-attached regions (S/MARs), recurring typically every 10–100 kbp, that bind to the nuclear matrix to divide up the chromosome into loops2. Yet the existence of not only S/MARs, but also the nuclear scaffold itself, has been questioned. There is no sign of the scaffold during mitosis, and the material it is thought to be composed of may be nothing more than a mess of denatured proteins.

Be that as it may, the organization of the loops seems to be important for compaction of DNA and for the regulation of gene expression, and each loop may act as an independent unit of gene activity. In other words, there is at least one level of superstructural organization in the chromosomes that makes its influence felt at the scale of molecular information transfer. Topoisomerase II is one of several proteins that bind specifically to the putative S/MARs, suggesting that these points are important for controlling supercoiling in the strands.

With all this high-level structure, transcription of DNA is not so much a matter of slotting the parts in place as tugging on the rope. DNA is highly curved around the nucleosomes, the inward-facing groove compressed and the outer one widened. RNA polymerase, at 13 by 14 nm, is about the same size as the nucleosome, yet it binds to a region of DNA around 50 bp long: about a quarter of the entire histone-bound length. So clearly some DNA must leave the surface of the histone core for transcription to proceed. But this core need not be displaced completely. The histone disk actually has a considerable amount of mobility, sometimes described as a corkscrew motion through the DNA coil. The reality is undoubtedly more complex, involving a kind of diffusion of localized defects in the DNA–histone contact.

If all of this destroys the pretty illusion created by the iconic model of Watson and Crick, it surely also opens up a much richer panorama. The fundamental mechanism of information transfer in nucleic acids — complementary base pairing — is so elegant that it risks blinding us to the awesome sophistication of the total process. These molecules do not simply wander up to one another and start talking. They must first be designated for that task, and must then file applications at various higher levels before permission is granted, forming a complex regulatory network (see article in this issue by Hood and Galas, page 444). For those who would like to control these processes, and those who seek to mimic them in artificial systems, the message is that the biological mesoscale, far from being a regime where order and simplicity descend into unpredictable chaos, has its own structures, logic, rules and regulatory mechanisms. This is the next frontier at which we will unfold the continuing story of how DNA works.