## Abstract

High precision measurements of molecules containing more than one heavy isotope may provide novel constraints on element cycles in nature. These so-called clumped isotope signatures are reported relative to the random (stochastic) distribution of heavy isotopes over all available isotopocules of a molecule, which is the conventional reference. When multiple indistinguishable atoms of the same element are present in a molecule, this reference is calculated from the bulk (≈average) isotopic composition of the involved atoms. We show here that this referencing convention leads to apparent negative clumped isotope anomalies (anti-clumping) when the indistinguishable atoms originate from isotopically different populations. Such statistical clumped isotope anomalies must occur in any system where two or more indistinguishable atoms of the same element, but with different isotopic composition, combine in a molecule. The size of the anti-clumping signal is closely related to the difference of the initial isotope ratios of the indistinguishable atoms that have combined. Therefore, a measured statistical clumped isotope anomaly, relative to an expected (e.g. thermodynamical) clumped isotope composition, may allow assessment of the heterogeneity of the isotopic pools of atoms that are the substrate for formation of molecules.

## Introduction

Analysis of the isotopic composition of molecules is one of the key tools for studying element cycles on earth. For the light elements H, C, N and O with relatively small heavy-to-light isotope ratios at natural abundance, the standard analytical instruments have largely limited isotope analysis to single-substituted isotopocules (isotopically substituted molecules). Studies of multiply substituted isotopocules, referred to as clumped isotopes, were only occasionally carried out, often using isotopically enriched substrates or labeling experiments^{1,2,3,4,5}. Recent analytical advancements using isotope ratio mass spectrometry^{6,7,8} or laser spectroscopy^{9} have enabled high precision measurements of clumped isotopes in several molecules such as CO_{2}, CH_{4}, O_{2} and N_{2}O^{8,10,11,12,13,14,15} and the field is rapidly expanding.

Since multiply substituted isotopocules are thermodynamically more stable than single substituted ones, classical isotope theory predicts small but measurable positive clumped isotope anomalies for most molecules under natural conditions^{16,17,18,19}. These clumped isotope signatures depend on temperature, which is the basis of the new field of clumped isotope thermometry^{20}.

Yeung *et al*.^{14} and Wang *et al*.^{13} reported negative heavy isotope clumping in photosynthetic O_{2} formation and in biogenic CH_{4}, respectively. Yeung *et al*.^{14} attributed the negative Δ values (see equation 3 for definition) in photosynthetic O_{2} to different isotopic composition of the two O atoms originating from different sites in the oxygen evolving complex of photosystem II. Triggered by this observation we investigated this further and show here that negative clumping anomalies are necessarily expected whenever two or more indistinguishable atoms of the same element but with different isotopic composition combine in a molecule. The atoms do not need to share a common bond but can be at distant places in a molecule.

Yeung^{21} recently presented an analysis of such apparent statistical clumped isotope effects in combination with other isotope effects. In our paper, we restrict the analysis to statistical clumped isotope effects and phrase the calculations exclusively in terms of isotope ratios in order to elucidate the underlying general nature of these apparent isotope signatures. The fundamental origin of the apparent statistical clumped isotope effect is thoroughly presented and visualized geometrically. We then provide a general mathematical formalism for apparent statistical clumped isotope signatures in any multiple isotope system. Finally we demonstrate quantitatively how a certain measured statistical anti-clumping signal can be used to determine the isotopic heterogeneity of indistinguishable atoms in a molecule.

## Origin of the statistical negative clumped isotope signatures

We describe and calculate the statistical clumped isotope effects in terms of heavy-to-light isotope ratios ^{i}*R* of individual atoms and molecules, where the index *i* indicates the mass of the atom or molecule. The same letter *R* is used for both atomic isotope ratios and molecular isotopocule ratios. In particular, for molecules with multiple heavy isotopes (clumped isotopes), the heavy-to light isotopocule ratio is defined as

When two atoms with atomic heavy-to-light isotope ratios *R*_{1} and *R*_{2} (where *R*_{i} can be, e.g., ^{2}H/^{1}H, ^{13}C/^{12}C, ^{18}O/^{16}O, etc.) combine in a molecule in a purely random manner, i.e., without any isotope effect, the ratio of molecules that include the heavy isotopes of both of these atoms relative to molecules including only light isotopes is simply the product of the atomic isotope ratios of the two atoms

Clumped isotope signatures Δ_{i} are then by convention calculated as the relative difference between a certain (measured) clumped isotopocule ratio and the random clumped isotope ratio, usually reported in per mill (‰).

The apparent negative statistical clumped isotope signatures that we describe in this paper are fundamentally related to this referencing convention, in particular the choice of the reference ratio ^{i}*R*_{cl,random} that is required to calculate Δ_{i} in Eq. 3. When the isotope ratios *R*_{1} and *R*_{2} of the involved atoms are individually known, ^{i}*R*_{cl,random} = *R*_{1} · *R*_{2} can be precisely calculated. This is always the case when heavy isotopes of different elements clump together (e.g. ^{13}C and ^{18}O in CO or in CO_{2}). It also holds when molecules of the same element, for which the individual isotope ratios are known, clump together (e.g. ^{15}N^{α} and ^{15}N^{β} in N_{2}O, where α and β indicate the central and terminal position of the N atom in the linear NNO molecule, which can be determined independently^{22,23}). This is graphically illustrated in Fig. 1, where *R*_{1} and *R*_{2} are plotted on the x- and y-axis and their product, ^{i}*R*_{cl,random}, is shown as the blue area.

When a molecule contains indistinguishable atoms of the same element, it is impossible to determine the individual atomic isotope ratios of these atoms. Nevertheless, the atoms may originate from isotopically distinct populations with isotope ratios *R*_{1} and *R*_{2}. To calculate the correct value of ^{i}*R*_{cl,random} we would therefore need to know the individual isotope ratios *R*_{1} and *R*_{2}. However, since these isotope ratios cannot be retrieved for indistinguishable atoms, it is common (and reasonable) to assign the bulk (≈average, see below) isotopic composition of the atoms. Through this choice, the real stochastic clumped isotope ratio *R*_{1} ⋅ *R*_{2} (blue area in Fig. 1) is substituted by the approximated value *R*_{av} ⋅ *R*_{av} (red area in Fig. 1). Both areas have the same perimeter 2 ⋅ (*R*_{1} + *R*_{2}) =2 ⋅ (*R*_{av} + *R*_{av}), but the red square has a larger area than the blue rectangle, i.e., *R*_{av} ⋅ *R*_{av} > *R*_{1} ⋅ *R*_{2}. Replacing the area of the blue rectangle by the one of the red square in the denominator of Eq. 3 causes a systematic negative artifact. This produces the apparent negative statistical clumped isotope effect. The anti-clumping signal is the larger the more different the individual isotope ratios of the indistinguishable atoms are. In the following chapters we derive the general mathematical formalism to calculate these apparent statistical clumped isotope effects in any multi-isotope system. We also show that the measurement of statistical anti-clumping in principle allows quantifying the heterogeneity of the isotopic composition of indistinguishable atoms in a molecule, i.e. to reconstructing the blue area from the red area in Fig. 1.

We emphasize that the apparent anti-clumping signature is not related to a physical isotope effect, but is a mathematical artifact that originates from the referencing convention. It will never occur when the contributing atoms are distinguishable (thus never for atoms from different elements, e.g. for ^{13}C-^{18}O clumping in CO_{2}), but it will always occur when two indistinguishable atoms of the same element combine in a molecule (e.g. for ^{18}O-^{18}O clumping in CO_{2}). Table 1 shows a selection of common atmospheric molecules and specific clumping signatures for which statistical anti-clumping will occur or not occur, respectively.

### Clumping of indistinguishable atoms in one molecule

#### Molecules with two atoms of the same element (e.g. N_{2}, O_{2}, H_{2})

As mentioned above, two indistinguishable atoms of the same element in one molecule may generally originate from different reservoirs or involve different fractionation effects such that their isotope ratios *R*_{1} and *R*_{2} represented two distinct pools when the molecule formed. Since the two atoms are now indistinguishable, we cannot independently measure the isotope ratios *R*_{1} and *R*_{2}. In fact, in conventional isotope ratio measurements of single substituted isotopocules the arithmetic average ratio of the two ratios (e.g. ^{29}*R* = 2 ^{15}*R*_{av} for ^{15}N measurements in N_{2}) is determined. For rare heavy isotopes (*R*_{1}, *R*_{2} ≪1), this average ratio ^{15}*R*_{av} is generally similar to the bulk isotope ratio *R*_{bulk} of the sample (see Supplementary Information). In this case, it is common and reasonable to assign *R*_{bulk} ≈ *R*_{av} to each of the indistinguishable atoms for further calculations. For the remainder of this paper, we use *R*_{bulk} = *R*_{av}, which considerably simplifies the formulas and removes the dependency of the apparent clumped isotope signal on the isotope ratio. The differences between using *R*_{bulk} and *R*_{av} are discussed in detail in the Supplementary Information.

The stochastically expected (random) ratio of isotopocules with two heavy atoms relative to the light isotopocules, from a population of atoms with average heavy isotope ratio of , is

However, when the atoms represent different isotopic pools with possibly different isotope ratios, the real clumped isotope ratio, *R*_{cl}, of doubly substituted isotopocules relative to the light isotopocules is

The apparent statistical clumped isotope Δ is the relative difference between the real and the stochastically expected clumped isotope ratio

The clumped isotope composition is always negative, except for the case *R*_{1}* *=* R*_{2}, for which Δ = 0. Thus, when two atoms of the same element with different isotopic composition combine in a molecule, the resulting molecule will always have an apparent negative clumping signature. The black curve in Fig. 2a shows the size of this quadratic statistical negative isotope clumping according to Eq. (6). The negative clumping signal Δ does not depend on the absolute value of the underlying isotope ratios, but only on the relative difference of the isotope ratios. In the following we refer to this effect as “statistical clumped isotope signature”. Eq. 6 was first derived for the case of formation of molecular O_{2} in photosynthesis by Yeung *et al*.^{14}, who indeed observed negative clumped isotope signals relative to the thermodynamically expected values for photosynthetic O_{2}.

### Generalization: Molecules with three or more atoms of the same element – complete substitution

When a molecule contains three or more atoms of the same element, these atoms can generally represent populations with different isotope ratios *R*_{1}, *R*_{2}, … *R*_{n}. We first consider the case of full heavy-isotope substitution, which is the generalization of the two-atom case presented above. As it is not possible to independently measure the individual isotope ratios *R*_{i} of the indistinguishable atoms, the arithmetic average isotope ratio ^{18}*R*_{av}(≈^{18}*R*_{bulk}, see Supplementary Information) is assigned to each of the indistinguishable atoms for further calculations

As the atoms are all assigned the same atomic isotope ratio *R*_{av}, the stochastically expected ratio of fully substituted isotopocules relative to non-substituted isotopocules from this population of atoms is the *n*-th power of *R*_{av}.

This is the *n*-dimensional equivalent of replacing blue rectangle in Fig. 1 by the red square. The real (=observed) ratio of fully-substituted isotopocules is the product of all isotope ratios involved, which is identical to the *n*-th power of the geometric mean of the isotope ratios

Thus, the statistical clumped isotope signature for fully substituted isotopocules is

This equation applies to any set of indistinguishable atoms in a molecule. Since the arithmetic mean is always larger or equal than the geometric mean, the statistical clumped isotope signal is always negative, except for the case where all ratios *R*_{i} are identical, in which case the arithmetic and geometric means are equal and thus Δ = 0. Figure 2 shows the variation of Δ with the relative difference of the isotope ratios in the 2-, 3-, 4- and 10-atom systems. For the cases illustrated in Fig. 2a, only one isotope ratio is varied and the isotope ratios of all other atoms are kept constant and identical. For the same relative difference in isotope ratio of a single atom, the clumping signal decreases with increasing number of atoms. Although the 10-atom case may not be of much practical use, it is included to emphasize the point that the statistical negative heavy isotope clumping does not require the heavy isotopes to be linked directly by a common chemical bond. For example, statistical negative D-D clumping in ethane (C_{2}H_{6}, Table 1) may involve pairs of hydrogen atoms at any of the 6 positions.

The isotope combinations presented in Fig. 2a all include the point where all isotope ratios are equal, which corresponds to Δ = 0‰. However, in general, the isotope ratios of the atoms at different positions are not equal. Figures 2b,c show the clumping signal for the 3- and 4-atom cases when one ratio is varied again, and the other ratios are held constant, but at different values for the individual atoms. Now the situation that all isotope ratios are equal cannot occur and the parabola-shaped curves are shifted towards negative Δ values. The y-axis offset increases with increasing difference of the individual isotope ratios, thus with the heterogeneity of the isotopic composition of the individual indistinguishable atoms. The curves in Fig. 2b,c are selected 2-dimensional cross-sections of a multi-dimensional space, which illustrate the effect of varying one of the multiple isotope ratios relative to a fixed set of other ratios. In practice, a certain combination of isotope ratios among indistinguishable atoms will correspond to one single value of Δ and we will show below that the statistical clumped isotope signal Δ is a measure for the heterogeneity of the isotopic composition of indistinguishable atoms in a molecule.

### Molecules with three or more atoms of the same element – incomplete substitution

#### Clumping of two heavy isotopes in molecules with three indistinguishable atoms (e.g. ^{18}O-^{18}O or ^{17}O-^{17}O clumping in O_{3} or NO_{3})

As a first example we consider molecules with three indistinguishable atoms and calculate the clumping signature of double substituted isotopocules. The three atoms generally represent three isotopically different populations with isotope ratios *R*_{1}, *R*_{2} and *R*_{3}. However, as the atoms are indistinguishable, the individual ratios cannot be determined and for further calculations they are assigned the average isotope ratio, which can be determined from measurement of the single substituted isotopocules

The stochastically expected isotope ratio of isotopocules with exactly two out of three possible heavy isotopes relative to the light isotopocules from a population of indistinguishable atoms with assigned isotope ratio *R*_{av} is

The factor 3 gives the number of possible permutations of two heavy isotopes over three atom positions. The real probability for finding a molecule with exactly two heavy atoms from the three atoms with isotope ratios *R*_{1}, *R*_{2} and *R*_{3} is

Thus, the statistical clumped isotope signal for clumping of two out of 3 possible heavy isotopes, Δ_{2/3}, is

Since all terms in the numerator and denominator are squares and thus positive, Δ is always negative except for the case *R*_{1} = *R*_{2} = *R*_{3} when Δ_{2/3} = 0. Some examples for the clumping of two out of three heavy isotopes in a molecule are shown in Fig. 3a. The solid line again includes the case where all ratios are identical and Δ = 0. The dashed and dotted lines show examples where one isotope ratio varies and the other ones are constant but different, so that the case that all are equal does not occur. Again, the dashed and dotted curves are shifted to more negative Δ values.

### Clumping of two heavy isotopes in molecules with four indistinguishable atoms (e.g. D-D clumping in CH_{4})

We now consider molecules with four indistinguishable atoms and calculate the clumping signature of doubly-substituted isotopocules. The four atoms generally represent isotopically distinct pools with isotope ratios *R*_{1}, *R*_{2}, *R*_{3} and *R*_{4}. As the ratios cannot be determined individually, they are assigned the average atomic isotope ratio

The stochastically expected probability for forming a molecule with exactly two out of four possible heavy isotopes from a population of indistinguishable atoms with assigned isotope ratio *R*_{av} is

The factor 6 again gives the number of possible permutations of 2 heavy isotopes over 4 atom positions . However, the real probability to form an isotopocule with exactly two heavy atoms from the four atoms with different isotope ratios *R*_{1}, *R*_{2}, *R*_{3} and *R*_{4} is

Thus, the clumped isotope signal Δ is

Since Δ can again be expressed as a negative sum of squares, it is always negative, except for *R*_{1} = *R*_{2} = *R*_{3} = *R*_{4} where Δ_{2/4} = 0. Some examples for the statistical clumped isotope effect of two out of four heavy isotopes in a molecule are shown in Fig. 3b. An important example for this case is the D-D clumping in methane. The reservoirs that supply the different hydrogen atoms in the formation of methane can vary considerably and significant apparent statistical anti-clumping is expected.

### Generalization: Clumping of *m* heavy isotopes in molecules with *n* indistinguishable atoms

For the general case of *n* indistinguishable atoms that represent isotopic pools with isotope ratios *R*_{1}, *R*_{2}, … *R*_{n}, we assign again the arithmetic mean isotope ratio *R*_{av} to each of the atoms.

The stochastically expected ratio of isotopocules with exactly *m* out of a possible *n* heavy atoms relative to the light isotopocules from a population of indistinguishable atoms with this average heavy isotope ratio *R*_{av} is

The real ratio of isotopocules with *m* heavy isotopes relative to non-substituted isotopocules (*R*_{cl}) from *n* atoms with isotope ratios *R*_{1}, *R*_{2}, … *R*_{n} is

Thus, the clumped isotope signal Δ is

Eq. 22 is the most general equation to calculate statistical negative isotope clumping in any multi-isotope system. In the case where all ratios *R*_{i} are identical, *R*_{ji} = *R*_{av}, the numerator becomes identical to the denominator because the number of possible subsets {j_{1}, … j_{m}} out of {1, … *n*} is equal to the binomial coefficient and the products reduce to the factors (*R*_{av})^{m}. In this case, Δ = 0.

### More than 2 stable isotopes: ^{17}O-^{18}O clumping for oxygen

So far we have formally only treated molecules with two stable isotopes. Oxygen has three stable isotopes ^{16}O, ^{17}O and ^{18}O. Clumping of heavy isotopes of the same sort (i.e., clumping of multiple ^{17}O atoms or multiple ^{18}O atoms in one molecule) follows the examples and general rules outlined above (see Table 1). However, it is also possible that the heavy ^{17}O and ^{18}O atoms clump together in one molecule.

### O_{2} and other molecules with two indistinguishable O atoms

In the case of molecular O_{2}, the apparent statistical clumping signal for ^{17}O-^{18}O clumping was derived in Yeung *et al*.^{14}. Since the isotope ratios of the individual O atoms cannot be determined individually, they are assigned the average heavy isotope ratios

The stochastically expected (random) ratio of molecules with one ^{17}O and one ^{18}O isotope relative to ^{16}O^{16}O from this population of O atoms is

The real probability for forming ^{17}O^{18}O molecules from two atoms with isotope ratio ^{17}*R*_{1}, ^{18}*R*_{1} and ^{17}*R*_{2,} ^{18}*R*_{2} is

Therefore,

For normal mass dependent fractionation, where ref. 24, with three-isotope exponent *β *≈ 0.53 ref. 25, ^{17}*R*_{1} and ^{18}*R*_{1} are either both smaller or both larger than ^{17}*R*_{2} and ^{18}*R*_{2}, thus again the clumped isotope signature ^{35}Δ is always <0 ( = 0 if ^{i}R_{1} and ^{i}R_{2} are identical). This can also be shown by inserting the mass dependent fractionation relation as follows:

As the atoms do not need to share the same bond to generate statistical isotope clumping, the equation also applies to ^{17}O-^{18}O clumping of other molecules with 2 O atoms (importantly ^{17}O-^{18}O clumping in CO_{2}). The resulting negative clumping signature is shown in Fig. 4.

### Heavy isotope clumping for three indistinguishable oxygen atoms (e.g. O_{3})

When a molecule has three indistinguishable (or not distinguished) oxygen atoms, we assign the average ratios and to each of the atoms. The statistical clumping signatures for complete ^{17}O or ^{18}O substituted isotopocules and for ^{17}O-^{17}O and ^{18}O-^{18}O clumping can be derived according to the formalism derived above. Here we also consider the remaining combinations where ^{17}O and ^{18}O clump together in one molecule.

Since there are 3! = 6 different possibilities to distribute the three distinguishable atoms (^{16}O, ^{17}O and ^{18}O) over the three positions, the stochastically expected clumping is

The real clumping is calculated by considering explicitly all 6 configurations of ^{16}O, ^{17}O and ^{18}O

and thus

As argued above for O_{2}, for normal mass dependent fractionation, ^{17}*R*_{i} and ^{18}*R*_{i} are either both smaller or both larger than ^{17}*R*_{j} and ^{18}*R*_{j}, thus Δ is always < 0 (=0 if all ratios are identical). Inserting the mass dependent fractionation relation again, Eq. 30 can be transformed to

The corresponding values of Δ are shown in Fig. 5 together with all other possible isotope clumping combinations for ^{17}O and ^{18}O isotopes in O_{3}. These are always calculated as the relative difference of the explicitly calculated clumped isotope ratio ^{i}*R*_{cl} from the stochastically expected heavy isotope ratio ^{i}*R*_{cl,random} (Equation 30). For example, for ^{17}O-^{17}O-^{18}O clumping there are three possible configurations to distribute the atoms (^{17}O, ^{17}O and ^{18}O) over the three positions, so the stochastically expected clumping is

The real clumping is calculated by considering explicitly all 3 configurations

and thus

The corresponding equation for ^{17}O-^{18}O-^{18}O clumping is easily derived by exchanging ^{17}O and ^{18}O.

Important examples of molecules with three oxygen atoms are O_{3} and the nitrate ion NO_{3}^{−}. For O_{3}, the central and terminal O atoms can be distinguished with suitable techniques^{2,26,27,28,29}. In these cases, statistical anti-clumping only occurs when the indistinguishable terminal O atoms are involved (Table 1). In many cases, however, the isotopic composition of O_{3} is determined without position information^{30,31,32,33} and in this case the atoms can be treated as effectively indistinguishable and need to be treated according to the formalism developed here.

### Statistical clumped isotope signals and the heterogeneity of the isotope ratios of indistinguishable atoms

The derivations above show that statistical combination of indistinguishable atoms in a molecule leads to apparent negative clumped isotope signals. The size of the apparent negative clumping in the molecule increases with increasing difference in isotopic composition between the individual atoms, so it may actually contain scientifically relevant information. In order to investigate this further, we created random (within a certain range) sets of isotope ratios for all multi-isotope systems with up to 5 indistinguishable atoms and calculated the apparent negative multi-isotope clumping signature Δ. Figure 6 shows that for each multi-isotope set all these random sets of isotope ratios yield Δ values that fall on distinct curves when Δ is plotted versus the relative standard deviation of the individual isotope ratios. For each of the multi-isotope clumping combinations with *m* out of *n* heavy isotopes, the curves can be parameterized as:

Note that the Δ values in Fig. 6 are plotted as ‰, so the factor 1/2 in Eq. 35 corresponds to 500‰ in the fit curves.

In the case of m = 2 (2 heavy isotopes in any multi-isotope system) the fit is perfect, but for more heavy atoms clumping in one molecule there is some scatter around these fit lines. This originates from the fact that there is no fixed analytical relation between the arithmetic and geometric means. Figure 7 shows the relative deviation of the explicitly calculated Δ values and the approximation using Eq. 35 for the randomly chosen sets of isotope ratios with known standard deviation and average values. The outer envelopes of the point clouds for each multi-isotope system define the error with which the statistical clumped isotope signal Δ in a molecule can be predicted from Eq. 35 when the standard deviation and the mean of the individual isotope ratios are known.

Scientifically, the opposite relation

is more attractive, since the isotopic variability among indistinguishable atoms in a molecule is usually not known, but Δ may be measurable^{14}. This means that measurement of the statistical clumped isotope anomaly Δ could provide a novel tracer to determine the heterogeneity (quantified by the standard deviation) of the isotopic pools of indistinguishable atoms in a molecule. For example, in the 2-isotope system O_{2}, a Δ value of 1.5‰ below the thermodynamically expected value as measured by Yeung *et al*.^{14} would indicate a relative standard deviation in the isotope ratios of the two O atoms of about 5.5% (Fig. 6, blue curve).

Figure 8 shows the relative error that is made when Eq. 36 is used to calculate the relative standard deviation of the isotope ratios from the Δ value for randomly chosen sets of (known) isotope ratios. The outer envelope for each isotope system quantifies the error with which for a group of indistinguishable atoms can be derived from Δ. Above Δ values of −1.5‰ the relative error in is generally below 1%, and above Δ values of −5‰ the relative error is still only 2%. Thus, whereas we are not able to measure the isotopic composition of individual indistinguishable atoms, the apparent statistical isotope clumping provides a means to obtain information about the heterogeneity of the isotope ratios with quite good precision.

## Conclusions

The statistical combination of indistinguishable atoms with different isotope ratios in a molecule always leads to apparent negative clumped isotope signals. We emphasize the term apparent, because this signal does not relate to a physical negative clumping process. The underlying reason is that in the calculation of the stochastic reference value for calculating Δ, the actual isotopic composition at each individual atom position is replaced by the average of the isotopic composition of all indistinguishable atoms (which is similar bulk isotopic composition of the molecule for small isotope ratios). Thus the apparent statistical Δ is by nature an artifact originating from our limitation to measure the isotope ratios of indistinguishable atoms. Using the formalism presented in this paper, this apparent statistical clumping signal can be calculated for any multi-isotope system.

We have calculated here the pure apparent statistical heavy isotope clumping values. In nature these apparent clumping signals will always occur in combination with thermodynamic heavy isotope clumping and possible other kinetic isotope effects that lead to clumped isotope anomalies (Yeung^{21}. The statistical clumped isotope signatures will *always* occur whenever two or more indistinguishable atoms clump together in a molecule. For isotope heterogeneities of a few percent, the effect is of the same order of magnitude as the thermodynamic effects in many molecules. Thus, it is important to take these effects into consideration when interpreting isotopic clumping of indistinguishable atoms in nature. Furthermore, when the statistical clumping signature can be separated from other contributions, its magnitude provides quantitative information on the heterogeneity of the isotopic composition of the indistinguishable atoms in a molecule.

## Additional Information

**How to cite this article**: Röckmann, T. *et al*. Statistical clumped isotope signatures. *Sci. Rep.* **6**, 31947; doi: 10.1038/srep31947 (2016).

## References

- 1.
Kaiser, J., Röckmann, T. & Brenninkmeijer, C. A. M. Assessment of

^{15}N^{15}N^{16}O as a tracer of stratospheric processes.*Geophys. Res. Lett.***30**, 10.1029/2002GL016073 (2003). - 2.
Janssen, C., Guenther, J., Krankowsky, D. & Mauersberger, K. Relative formation rates of

^{50}O_{3}and^{52}O_{3}in^{16}O-^{18}O mixtures.*J. Chem. Phys.***111**, 7179–7182 (1999). - 3.
Mauersberger, K., Erbacher, B., Krankowsky, D., Günther, J. & Nickel, R. Ozone isotope enrichment: Isotopomer-specific rate coefficients.

*Science***283**, 370–372 (1999). - 4.
Hauck, R. D. & Bouldin, D. R. Distribution of Isotopic Nitrogen in Nitrogen Gas During Denitrification.

*Nature***191**, 871–872 (1961). - 5.
Mroz, E. J.

*et al.*Detection of Multiply Deuterated Methane in the Atmosphere.*Geophys. Res. Lett.***16**, 677–678, 10.1029/Gl016i007p00677 (1989). - 6.
Eiler, J. M. & Schauble, E.

^{18}O^{13}C^{16}O in Earth’s atmosphere.*Geochim. Cosmochim. Acta***68**, 4767–4777, 10.1016/j.gca.2004.05.035 (2004). - 7.
Yeung, L. Y., Young, E. D. & Schauble, E. A. Measurements of

^{18}O-^{18}O and^{17}O-^{18}O in the atmosphere and the role of isotope-exchange reactions.*J. Geophys. Res.***117**, D18306, 18310.11029/12012JD017992 (2012). - 8.
Eiler, J. M.

*et al.*A high-resolution gas-source isotope ratio mass spectrometer.*Int J. Mass Spect.***335**, 45–56 (2013). - 9.
Ono, S.

*et al.*Measurement of a Doubly Substituted Methane Isotopologue,^{13}CH_{3}D, by Tunable Infrared Laser Direct Absorption Spectroscopy.*Anal. Chem.***86**, 6487–6494, 10.1021/ac5010579 (2014). - 10.
Stolper, D. A.

*et al.*Formation temperatures of thermogenic and biogenic methane.*Science***344**, 1500–1503, 10.1126/science.1254509 (2014). - 11.
Stolper, D. A.

*et al.*Combined^{13}C-D and D-D clumping in methane: Methods and preliminary results.*Geochim. Cosmochim. Acta***126**, 169–191, 10.1016/j.gca.2013.10.045 (2014). - 12.
Stolper, D. A.

*et al.*Distinguishing and understanding thermogenic and biogenic sources of methane using multiply substituted isotopologues.*Geochim. Cosmochim. Acta***161**, 219–247, 10.1016/j.gca.2015.04.015 (2015). - 13.
Wang, D. T.

*et al.*Nonequilibrium clumped isotope signals in microbial methane.*Science***348**, 428–431, 10.1126/science.aaa4326 (2015). - 14.
Yeung, L. Y., Ash, J. L. & Young, E. D. Biological signatures in clumped isotopes of O

_{2}.*Science***348**, 431–434, 10.1126/science.aaa6284 (2015). - 15.
Magyar, P. M., Orphan, V. J. & Eiler, J. M. Insights into Mechanisms of Nitrous Oxide Generation from Measurement of Nine N

_{2}O Isotopologues*Goldschmidt Abstracts,***2015**, 1970 (2015). - 16.
Urey, H. C. The thermodynamic properties of isotopic substances

*J. Chem. Soc.*562–581 (1947). - 17.
Bigeleisen, J. & Mayer, M. G. Calculation of equilibrium constants for isotopic exchange reactions.

*J. Chem. Phys.***15**, 261–267 (1947). - 18.
Richet, P., Bottinga, Y. & Javoy, M. A review of hydrogen, carbon, nitrogen, oxygen, sulphur, and chlorine stable isotope fractionation among gaseous molecules.

*Ann. Rev. Earth Planet. Sci.***5**, 65–110 (1977). - 19.
Wang, Z. G., Schauble, E. A. & Eiler, J. M. Equilibrium thermodynamics of multiply substituted isotopologues of molecular gases.

*Geochim. Cosmochim. Acta***68**, 4779–4797 (2004). - 20.
Eiler, J. M. “Clumped-isotope” geochemistry—The study of naturally-occurring, multiply-substituted isotopologues.

*Earth Planet. Sci. Lett.***262s**, 309–327 (2007). - 21.
Yeung, L. Y. Combinatorial effects on clumped isotopes and their significance in biogeochemistry.

*Geochim. Cosmochim. Act*, 10.1016/j.gca.2015.1009.1020 (2016). - 22.
Brenninkmeijer, C. A. M. & Röckmann, T. Mass spectrometry of the intramolecular nitrogen isotope distribution of environmental nitrous oxide using fragment-ion analysis.

*Rap. Commun. Mass Spectrom***13**, 2028–2033 (1999). - 23.
Toyoda, S. & Yoshida, N. Determination of nitrogen isotopomers of nitrous oxide on a modified isotope ratio mass spectrometer.

*Anal. Chem.***71**, 4711–4718 (1999). - 24.
Kaiser, J., Röckmann, T. & Brenninkmeijer, C. A. M. Contribution of mass-dependent fractionation to the oxygen isotope anomaly of atmospheric nitrous oxide.

*J. Geophys. Res.***109**, D03305, 10.1029/2003JD004088 (2004). - 25.
Young, E. D., Galy, A. & Nagahara, H. Kinetic and equilibrium mass-dependent isotope fractionation laws in nature and their geochemical and cosmochemical significance.

*Geochim. Cosmochim. Acta***66**, 1095–1104 (2002). - 26.
Vicars, W. C., Bhattacharya, S. K., Erbland, J. & Savarino, J. Measurement of the

^{17}O-excess (Δ^{17}O) of tropospheric ozone using a nitrite-coated filter.*Rapid Commun. Mass Spectrom.***26**, 1219–1231, 10.1002/Rcm.6218 (2012). - 27.
Janssen, C. Intramolecular isotope distribution in heavy ozone (

^{16}O-^{18}O-^{16}O and^{16}O-^{16}O-^{18}O).*J. Geophys. Res.***110**, D08308, 08310.01029/02004JD005479 (2005). - 28.
Larsen, R. W., Larsen, N. W., Nicolaisen, F. M., Sorensen, G. O. & Beukes, J. A. Measurements of

^{18}O-enriched ozone isotopomer abundances using high-resolution Fourier transform far-IR spectroscopy.*J. Mol. Spectrosc.***200**, 235–247 (2000). - 29.
Johnson, D. G., Jucks, K. W., Traub, W. A. & Chance, K. V. Isotopic composition of stratospheric ozone.

*J. Geophys. Res.***105**, 9025–9031 (2000). - 30.
Shaheen, R., Janssen, C. & Röckmann, T. Investigations of the photochemical isotope equilibrium between O

_{2}, CO_{2}and O_{3}.*Atmos. Chem. Phys.***7**, 495–509 (2007). - 31.
Johnston, J. C., Röckmann, T. & Brenninkmeijer, C. A. M. CO

_{2}+O(^{1}D) isotopic exchange: Laboratory and modeling studies.*J. Geophys. Res.***105**, 15213–15229 (2000). - 32.
Früchtl, M., Janssen, C. & Röckmann, T. Experimental study on isotope fractionation effects in visible photolysis of O

_{3}and in the O+O_{3}odd oxygen sink reaction.*J. Geophys. Res.***120**, 4398–4416, 10.1002/2014JD022944 (2015). - 33.
Chakraborty, S. & Bhattacharya, S. K. Oxygen isotopic fractionation during UV and visible light photodissociation of ozone.

*J. Chem. Phys.***118**, 2164–2172 (2003).

## Acknowledgements

This work was carried out as part of the program of the Netherlands Earth System Science Centre (NESSC), financially supported by the Dutch Ministry of Education, Culture and Science (OCW).

## Author information

## Affiliations

### Institute for Marine and Atmospheric research Utrecht (IMAU), Utrecht University, Utrecht, The Netherlands

- T. Röckmann
- , M. E. Popa
- , M. C. Krol
- & M. E. G. Hofmann

### Wageningen University, Wageningen, Netherlands

- M. C. Krol

### SRON Netherlands Institute for Space Research, Utrecht, Netherlands

- M. C. Krol

## Authors

### Search for T. Röckmann in:

### Search for M. E. Popa in:

### Search for M. C. Krol in:

### Search for M. E. G. Hofmann in:

### Contributions

T.R., M.E.P. and M.E.G.H. developed the conceptual framework for the manuscript and discussed the content. M.C.K. contributed code to produce the figures. T.R. developed the mathematical formalism and wrote the manuscript with input from all authors.

### Competing interests

The authors declare no competing financial interests.

## Corresponding author

Correspondence to T. Röckmann.

## Supplementary information

## PDF files

## Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

## About this article

## Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.