Abstract
Physicists have long wondered whether the gravitational interactions between matter and antimatter might be different from those between matter and itself. Although there are many indirect indications that no such differences exist and that the weak equivalence principle holds, there have been no direct, freefall style, experimental tests of gravity on antimatter. Here we describe a novel direct test methodology; we search for a propensity for antihydrogen atoms to fall downward when released from the ALPHA antihydrogen trap. In the absence of systematic errors, we can reject ratios of the gravitational to inertial mass of antihydrogen >75 at a statistical significance level of 5%; worstcase systematic errors increase the minimum rejection ratio to 110. A similar search places somewhat tighter bounds on a negative gravitational mass, that is, on antigravity. This methodology, coupled with ongoing experimental improvements, should allow us to bound the ratio within the more interesting near equivalence regime.
Introduction
There are many compelling experimental and theoretical arguments^{1,2,3,4,5,6,7,8,9,10} that suggest that the gravitational mass of antimatter cannot differ from the gravitational or inertial mass of normal matter, that is, that the weak equivalence principle holds. For instance, one such argument comes from the absence of anomalies in Eötvös experiments conducted with differing atoms^{4}; the differing number of virtual particle–antiparticle pairs in such atoms might have caused gravitational anomalies to occur. However, all of these arguments are indirect and are not universally accepted^{11,12,13,14}; they rely on assumptions about the gravitational interactions of virtual antimatter, on postulates such as CPT invariance, or on other theoretical premises. Although these arguments may well be correct, in a world in which physicists have only recently discovered that we cannot account for most of the matter and energy in the universe, it would be presumptuous to categorically assert that the gravitational mass of antimatter necessarily equals its inertial mass. Moreover, the baryogenesis problem suggests that our understanding of antimatter is incomplete; gravitational asymmetries have been proposed as an explanation^{7,15,16}. (Note that ref. 7 ultimately rejected gravity as a solution to the baryogenesis problem because of a thermodynamic proof of the weak equivalence principle. This proof was later challenged^{10}.)
There have not yet been any direct^{14}, freefall or gravitational balance, tests of the gravitational interactions of observable matter and antimatter. Direct gravitational experiments with nonneutral antimatter, for example, isolated positrons or antiprotons, are exceedingly difficult because the electrical forces overwhelm the gravitational forces^{17}. Employing neutral antihydrogen^{18,19,20,21,22,23,24,25} or positronium^{26} eliminates this complication. The AEGIS project^{27} at CERN was formed to conduct direct experimental tests of gravity on antihydrogen, and is now in its final construction phase. A second experiment, GBAR, has recently been approved at CERN^{28}, and a third experiment was proposed at Fermilab^{29}.
This article describes a novel method that yields directly measured limits on the ratio of the gravitational to inertial mass of antimatter, accomplished essentially by searching for the free fall (or rise) of 434 groundstate antihydrogen atoms in the ALPHA^{30,31,32} experiment at CERN. Our results set statistical bounds on the value of F≡M_{g}/M, the ratio of the gravitational mass M_{g} to the inertial mass M of antihydrogen. (M is assumed numerically equal to the mass of hydrogen.) In the absence of systematic errors, we find that F must be <75 at a statistical significance level of 5%; worstcase systematic errors increase this limit to F<110. A similar search places somewhat tighter bounds on a negative F, that is, on antigravity. Refinements of our technique, coupled with larger numbers of coldtrapped antiatoms, should allow us to bound F more tightly in future experiments and approach the F≈1 regime of widespread interest.
Results
Antihydrogen trapping
ALPHA traps antihydrogen atoms by producing and capturing them in a minimumB trap^{33}. These traps confine those antiatoms whose magnetic moment is aligned such that they are attracted to the minimum in the trap magnetic field B, and whose kinetic energy is below the trap well depth, . In ALPHA (see Fig. 1), this magnetic minimum is created by an octupole magnet that produces transverse fields of magnitude 1.54 T at the trap wall at R_{Wall}=22.3 mm, and two mirror coils that produce axial fields of 1 T at their centres. The mirror coil centres are offset by ±138 mm from the trap centre. (The relative orientation of these coils and the trap boundaries are shown in Fig. 1.) These fields are superimposed on a uniform axial field of 1 T produced by an external solenoid^{34,35}.
The general methods by which antiatoms are captured are described in refs 30,31,32,36; in this article we concentrate only on the last phase of the experiments, during which antiatoms are released from the minimumB trap by turning off the octupole and mirror fields. The escaping antiatoms are then detected when they annihilate on the trap wall; a siliconbased annihilation vertex imaging detector^{37} records the times (binned to 0.1 ms) and locations (azimuthal FWHM of 8 mm) of these annihilations.
Annihilation time history on release
The time history of the annihilations is critical to our analysis. This history is governed by the nearexponential decay of the octupole and mirror fields after the magnet turnoff is initiated. The fields decay with time constants of ∼9.5 ms (^{38}). (Throughout this paper, times t are referenced to the initiation of the magnet shutdown.) At t=20 ms, for example, the maximum octupole field is ∼0.18 T and the mirror fields are ∼0.12 T. The trapping potential depth, which was originally ∼540 mK at t=0 ms, is reduced to ∼11 mK in the radial direction at t=20 ms. (Here we use kelvin as an energy unit.) Note that the 1 T solenoidal field, which is oriented parallel to the trap axis (the direction), is never varied. The well depth, which is proportional to the change in the magnitude of the total magnetic field as one progresses outwards from the trap centre, diminishes more slowly (∼80 mK at 20 ms) in the axial direction than in the radial direction. This is because the directed mirror fields add linearly to the solenoidal field, while the  and directed octupole fields add in quadrature to this field. Consequently, almost all of our trapped antihydrogen escapes radially^{31}.
Previous studies using the ALPHA apparatus have shown that the antiatoms have a distribution in centreofmass energy ɛ that scales approximately like below the trapping threshold^{31,38}. An antiatom can escape the evershallower trap when its energy is greater than the trap depth. However, there is no onetoone correspondence between the escape time of an antiatom and its initial energy because it can take some time for an antiatom to find the ‘hole’ in the trap potential. Computer simulations of this process, described in ref. 38, show that antiatoms of a given initial energy escape over a temporal range of at least 10 ms. The simulations discussed in ref. 38 did not include a gravitational force; to aid in our interpretation of the current experimental data, we extended these simulations to include gravity by the addition of a gravitational term to the equation of motion:
where is the centreofmass position of the antiatom, and g is the local gravitational acceleration. Previous measurements^{39} on ALPHA established that the magnitude of the magnetic moment equals that of hydrogen to the accuracy required in this paper; its direction is assumed to adiabatically track the external magnetic field.
Simulation studies
To model the experiment, we simulated the effects of gravity on an ensemble of groundstate antihydrogen atoms randomly selected from the energy distribution described above. These antiatoms are first propagated for 50 ms in the fullstrength trap fields to effectively randomize their positions, and then propagated in the postshutdown decaying fields until they annihilate on the trap wall. The results of a typical simulation are shown in Fig. 2 for F=100, which exaggerates the effects of gravity relative to the baseline of F=1 expected from the equivalence principle. As can be seen in Fig. 2, there is a tendency for the antiatoms to annihilate in the bottom half (y<0) of the trap. This tendency is pronounced for antiatoms annihilating at later times. This is because, as shown in Fig. 3 and in Table 1, the confining potential well associated with the magnetic and gravitational forces in equation 1 is most skewed by gravitational effects late in time when the magnetic restoring force is relatively weak, and the remaining particles are those with the lowest energy. We note that while the number of late annihilating antiatoms is dependent on the exact energy distribution used to initialize the simulations, the annihilation locations of these antiatoms are not; for the purposes of this paper, the exact distribution is unimportant.
Reverse cumulative average analysis
To determine an experimental limit on F, we compare our data set of 434 observed antihydrogen annihilation events to computer simulations at various F’s. Our statistics suffer from the fact that escaping antiatoms are most sensitive to gravitational forces at late times, but relatively few of the events occur at late times. For example, even with the cooling due to the adiabatic expansion that occurs as the trap depth is lowered, only 23 antiatoms out of the 434 annihilate after 20 ms. Moreover, inspection of the simulation data in Fig. 2 shows that even when there is a pronounced tendency for the antiatoms to fall down, some still annihilate near the top of the trap. To obtain a qualitative understanding of the data, we use the reverse cumulative average 〈yt〉: the average of the y positions of all the annihilations that occur at time t or later (see Methods). This reverse cumulative average highlights the more informative latetime events while still including as many events as possible into the average. Figure 4 plots 〈yt〉 for the events and the simulations at several values of F. These plots suggest that an upper bound on F can be established from the data, at a value somewhere between F=60 and 150.
Monte Carlo analysis
Although the visual approach taken in Fig. 4 is striking, a more sophisticated analysis is necessary for a quantitative assessment of F. Specifically, our problem is this: given our event set of experimental annihilations {(y,t)}_{Ev}, where y is the observed position of a given annihilation and t is the time of this annihilation, and given a family of similar sets of simulated pseudoannihilations {(y,t)}_{F} at various F, how can we determine which values of F can be excluded with reasonable confidence? In other words, which sets {(y,t)}_{F} are unlikely to be compatible with {(y,t)}_{Ev}? (In this paper, the phrase ‘pseudoannihilations’ or ‘pseudoevents’ always refers to simulation results. The unqualified word ‘events’ always refers to experimental results.) We make this determination with a Monte Carlo analysis based on an overall test statistic, that is, a figureofmerit, Φ, which is sensitive to discrepancies between the real and simulated data. Our choice of Φ is closely related to a Fisher’s combined test^{40} based on KolmogorovSmirnov (KS)^{41} statistics. The exact definition of Φ is described in the Methods section. In brief, for every F, we calculate the test statistic Φ_{Ev} for the experimental events. This Φ_{Ev} compares {(y,t)}_{Ev} to a reference distribution compiled from a third (∼300,000 simulated annihilations) of the simulation data set {(y,t)}_{F}. The test statistic Φ is small when it is likely that the 434 events could have been drawn from the reference distribution, and large when it is unlikely that the events could have been so drawn, that is, when there is a significant disparity between the distribution of the actual events and the reference distribution of the simulated annihilations at the hypothesized F.
Next, to approximate the sampling distribution for Φ, we distribute the remaining pseudoannihilations in {(y,t)}_{F} into N pseudoevent subsets of 434 points. In total there are about 900,000 pseudoevents in {(y,t)}_{F}, so N is about 1,400. Each of these pseudoevent sets is representative of what we would have observed if the ratio of the inertial to the gravitational mass really was F. Then, we calculate the set of test statistics {Φ_{i;F}} for each of these pseudoevent sets, and count the number N_{>} for which Φ_{i;F}≥Φ_{Ev}, that is, the number of pseudoevent sets that are less compatible with the reference distribution than the actual events. From N_{>}, we obtain a Monte Carlo estimate of the overall Pvalue, P=N_{>}/N, for the goodnessoffit test on the actual data set compared with the simulations. The results of this analysis are shown in Fig. 5, from which we conclude that F>75 is excluded at a significance level of 5%.
A similar Monte Carlo analysis comparing the actual event data to F=1 simulations gives an unsurprising overall Pvalue of 0.3. Thus, the event data are not incompatible with F=1, but we cannot conclude that F≈1.
Systematic error analysis
In the ∼800 trapping trials used to obtain our 434 point event set, we would expect approximately one cosmic ray to be misclassified as an antihydrogen atom^{30,32}. Thus, cosmic rays are an insignificant source of error in this analysis. The cosmic ray background does, however, preclude our using annihilation data from times later than 30 ms, as the current data rate would not be comfortably above the cosmic rate at such late times.
Previously, we calculated^{31} that more than 99.5% of antihydrogen atoms held longer than 400 ms will have decayed to the ground state. The 434 trapped antiatoms employed in the analysis were all held for times longer than this. Thus, we expect that virtually all of our antiatoms are in the ground state, and are largely immune to Stark effect/polarization forces that might have otherwise overwhelmed the gravitational forces. The largest electric fields in our trap during the magnet shutdown phase come from the ‘bias’ potential that we use to discriminate between antihydrogen atoms and antiprotons^{30,31,38} and exist in the 0.75mm gap between the electrodes. These fields are on the order of 10 V mm^{−1}. The energy that a groundstate antihydrogen atom would acquire approaching this gap is about five orders of magnitude less than the F=1 gravitational potential drop across the trap diameter. Furthermore, such a high field exists only in a very small volume of the trap. The ‘patch’ fields^{17} that plague charged particle gravity tests perturb the antiatom energy by about two orders of magnitude less than the bias electric fields. The annihilation detection algorithm determines the locations of the antiatom annihilations from the tracks of the pions that result from each annihilation. The smearing that results from the limited spatial resolution of the detector is well characterized^{37} and is incorporated into our analysis (see Methods).
The largest uncertainty in limiting F comes from our neglect, up to this point, of systematic effects from mechanical misalignments and from magnetic field errors. For example, the detector might not be perfectly centred on the trap axis. This misalignment is limited by mechanical constraints to be no more than ±5 mm. Such a misalignment would cause an apparent shift in the annihilation locations at early times as well as late, resulting in a bias in the average of the entire event set, 〈yt=0〉, of ±2.5 mm if at the constraint limit. (These errors differ from the detector smearing errors, which were calculated assuming that the detector was perfectly centred.) A somewhat smaller error would result from the octupole axis being displaced from the trap axis, which would cause a shift in the real annihilation locations. Like the detector displacement error, this displacement would cause a bias in overall average 〈yt=0〉. A bias of unknown origin is indeed visible in the event data: 〈yt=0〉=−1.3±0.8 mm. Simulations incorporating an octupole axis displacement show that this overall bias would correspond to a axis displacement of only −0.06 mm. Perhaps coincidentally, this is nearly identical to the maximum displacement allowed by mechanical constraints. We have performed a broad survey (see Supplementary Note 1) of other magnetic field errors consistent with the mechanical tolerances of our device. This survey shows that the largest biases that could result from magnetic errors are usually smaller than, and at worst comparable to, the largest bias possible from an octupole axis displacement. Thus, in the absence of fortuitous cancelations, the relatively small measured bias in 〈yt=0〉 limits the size of the effects of these errors at the late times when the experiment is most sensitive to gravity. Taking the maximally allowed detector and octupole displacement errors as representative of the worstcase systematic errors, we have modelled their effects in the statistical calculations and, as shown in Fig. 5, determined that the worstcase exclusion region is F>110, still at a significance level of 5%. Similarly, analysis of favourable systematic errors, say because of a fortuitous octupole axis displacement of −0.05 mm that would eliminate the 〈yt=0〉 bias, yields a best case exclusion of F>65 based on statistics alone.
Some perspective on the size of the systematic errors can be found by calculating 〈yt=0〉 for the untrapped antihydrogen atoms and antiprotons that annihilate on the wall during the antihydrogen synthesis process. In an observed sample of over 270,000 of such antiatoms, the y mean was +0.86±0.03 mm. However, the orbital dynamics of untrapped antihydrogen and antiprotons are quite different from the dynamics of trapped antihydrogen, and there are effects that can lead to average vertical displacements of the opposite sign. A Monte Carlo simulation of our detector, which includes the effects of dead regions, gives a mean value for y of +0.01±0.06 mm. A hitherto unutilized experimental sample of 120 trapped antihydrogen atoms had a y mean of +2.2±1.4 mm. (This sample was not otherwise utilized because the atoms in this sample could not be guaranteed to have been trapped for more than 400 ms. Hence, these atoms were not necessarily in the ground state^{31}.) These means do not entirely reconcile with each other or with the y mean of the standard sample of trapped atoms (−1.3±0.8 mm), and we have no certain explanation of their differences. However, the range of means predicted by our analysis of the detector axis displacements encompasses all these values; thus, we allow for larger errors in our worstcase analysis.
We set a limit on antigravity by inverting the sign of g in equation 1, or, equivalently, by making F negative. We find that F<−12 is excluded by statistics alone, with a worstcase limit from systematic errors of F<−65. However, because the systematic effects are not very well characterized for such small F, it is more conservative to only exclude F<−65.
Importance of detailed studies of the orbital dynamics
We stress that our determination of F relies on detailed simulations of antiatom trajectories in the timedependent trap magnetic fields; other gravitational measurements using trapped antihydrogen would likely require a similar analysis. A recent publication, ref. 42, briefly mentions an experimental bound on F of 200. So far as we can discern from the oneparagraph description of the experiment, the measurement implicitly assumes thorough dynamical mixing between the transverse and axial directions. Previous antihydrogen simulations^{31,38} show that these two directions are poorly coupled. This is because the trapping potential is nearly separable, and approximate independent constants of the motion exist for the transverse and axial degreesoffreedom. Mixing only occurs due to end effects from the finite axial length of the magnetic system or from large size, smallspatialscale magnetic errors unlikely to be present. Indeed, analytic calculations show that these constants of motion are adiabatically conserved for a broad range of parameters^{43}. Furthermore, experiments^{44} on the evaporative cooling of hydrogen atoms—a procedure closely analogous to the procedure outlined in ref. 42—show that the evaporation is essentially one dimensional, not three; that is, the transverse and axial directions do not couple. Thus, it is not surprising that simulations based on the best model we can construct from the limited information available in ref. 42 show that no effects of gravity could be observed using the techniques described in ref. 42 for F≤200, or indeed, for F’s significantly greater than 200 (ref. 45).
Discussion
We report directly measured limits on the ratio of the gravitational mass to the inertial mass of antimatter. On the basis of goodnessoffit tests comparing the positions of actual and simulated annihilation events, we can rule out ratios above F=75 (statistics alone) and F=110 (including worstcase systematic effects) for gravity, and below F=−65 (combined systematic and statistical effects) for antigravity, at the 5% significance level. Obviously, our limits are far from the F=1 regime where one could test for small deviations from the weak equivalence principle, but the methodology described here, coupled with planned and ongoing improvements to the ALPHA apparatus, should allow us to improve the measurement substantially. Simulations show that by cooling the antiatoms, perhaps with lasers, to 30 mK or lower, and by lengthening the magnetic shutdown time constant to 300 ms, we would have the statistical power to measure gravity to the F=±1 level (see Fig. 6). Cooling obviously increases the relative influence of gravity on the antiatom trajectories. The longer shutdown times are necessary to take full advantage of adiabatic expansion cooling of these slower antiatoms. They also allow the antiatoms to find and annihilate on the portions of the trap wall where the trapping well depth is lowest. Systematic errors pose a significant challenge for low F measurements, however, and will need to be addressed. In summary, our experiments are an important first step towards a precise gravitational measurement with trapped, neutral antimatter. The current work clearly demonstrates the potential for using a carefully prepared, wellcharacterized sample of trapped antihydrogen atoms as a source for direct, ballistic studies of the gravitational behaviour of antimatter. The use of untrapped neutral antimatter for gravitational measurements, as pursued by other groups^{27,28}, is, as yet, unproven.
Methods
Simulations
Antihydrogen trajectories were simulated using codes developed to establish that ALPHA trapped antihydrogen^{38}. The codes use an adaptive RungeKutta stepper to propagate antihydrogen atoms in the magnetic and gravitational fields of the trap. The model for the spatial structure and temporal behaviour of the magnetic field was experimentally verified by studying the trajectories of antiprotons^{38}. (Also see Supplementary Note 2.) The numeric value of the antihydrogen magnetic moment used in the simulations was set equal to that of the positron alone; the small deviations to the antihydrogen magnetic moment from the antiproton are not significant for the experiments reported here.
As described previously, the simulations are initiated with antiatoms with a random energy consistent with a distribution. Antiatoms with energies up to 650 mK, well above the nominal trapping depth of ∼540 mK, are included. Most of the antiatoms with energy above 540 mK are lost during the 50 ms randomization period before the magnet shutdown is initiated, but some, those on quasitrapped orbits^{31,38,46}, are retained. The gravity analysis is almost independent of the exact distribution of these quasitrapped antiatoms, however, because they are lost at very early t. Spatially, the simulations were initiated with antiatoms that originate in a region mimicking the dimensions of the experimental positron plasma. The 50 ms randomization period is sufficient to distribute these antiatoms within the trap^{38}, but may not entirely randomize them. To look for effects of insufficient randomization, simulations were also run with randomization times of 1 and 10 s. Some differences were observed, but these differences were significantly smaller than the differences caused by the detector displacement errors discussed above. We note that almost 75% of the antiatoms used in this analysis were held for times between 0.4 and 1.4 s, so the 1s simulations model the approximate entire lifetime of the majority of the antiatoms.
Antihydrogen energy distribution
To model the behaviour of antiatoms during the magnet shutdown, we need to know the initial antihydrogen velocity distribution. ALPHA synthesizes antihydrogen atoms by injecting antiprotons into a positron plasma. The positron plasma is typically at a temperature of ∼40 K (^{30}); before antihydrogen forms, the antiprotons thermalize on the positrons, giving them a temperature that approaches 40 K 47. The resultant antihydrogen inherits the centreofmass kinetic energy of the antiprotons from which they are formed, so it too has an initial temperature of about 40 K. Most of these antihydrogen atoms are far too energetic to be trapped; only those with an energy near or below the trapping depth of 540 mK are sufficiently cold to be trapped. These trapped antiatoms are deep within the Maxwellian distribution, where the energy distribution scales like . Strong evidence that the true energy distribution is close to this comes from comparing the annihilation times of the actual antiatoms with the annihilation times of simulated antiatoms for several different distributions (see Fig. 7a). This comparison is shown in Fig. 7b, where it is clear that the Maxwellian distribution best fits the experimental events. However, there are some differences between the two; for example, the simulations slightly underpredict the number of late annihilating antiatoms. Fortunately, the analysis is not very sensitive to the details of the distribution, so the small deviations from Maxwellian visible in Fig. 7b are unimportant. For instance, Fig. 7c shows the annihilation locations for antiatoms that annihilate between 20 and 22 ms, and the differences between the three distributions plotted are barely discernible. Figure 7d shows the influence of the choice of distribution on the reverse cumulative average 〈yt〉, and the differences are also small.
Reverse cumulative average
The reverse cumulative average is formally defined to be 〈yt〉=(1/N_{t})Σ_{n}y_{n}, where {y_{n}} is the set of annihilation locations, and the sum is over all of the N_{t} elements of {y_{n}} that occur after time t and before the late cutoff at 30 ms used to exclude the cosmic ray background. In Fig. 4, 〈yt〉 is shown for both the event data and the simulation data at the given F’s. The Monte Carlo error bands in Fig. 4 are calculated by dividing the ∼900,000 point simulation set at given F into about 2,100 subsets of length 434—the size of the actual event sample. Then, at every t, 〈yt〉 is calculated for each subset and the results ordered. The error band at every t is then defined by the 5 and 95% quantiles of the ordered 〈yt〉.
Detector resolution
The detector determines the locations of the antiatom annihilations by triangulation of the pion tracks produced by each annihilation. This process was extensively studied using the GEANT3 code^{48}, and a probability density function for the azimuthal resolution error was determined^{37}. This error was incorporated into the simulation results by adding random angular offsets consistent with this probability density function to each of the simulated annihilation angular locations.
Statistical analysis
To find the probability that the events are compatible with the simulations at a given F, we employ a test statistic akin to Fisher’s combined statistic^{40} aggregating KS tests in different (overlapping) time windows:
where P_{KS}(t;F) is the approximate Pvalue for a onesided, twosample KS test^{41,49} for a given F. The KS test, described in the next paragraph, indicates how compatible the y annihilation distribution of a specific trial data set, windowed between t and 30 ms, is with the y annihilation distribution of a similarly windowed reference data set. Specifically, at every F we extract a ∼300,000 point subset from the simulation data to serve as a reference data set. Then we compute P_{KS}(t;F) at every start time t and integrate using a numerical quadrature rule with a fixed time increment of 0.3 ms. Carrying out this procedure using the event data set for the trial distribution, we get the Φ_{Ev} defined earlier. Carrying out this identical procedure using the remaining N≈1,400 pseudoevent sets as the trial distributions, we get the set {Φ_{i;F}}. Under the null hypothesis, namely, that there is no difference between the distributions for a given F, the P_{KS}(t;F) themselves should be uniformly distributed. As originally introduced, Fisher’s combined test statistic was intended for independent tests, for which the overall Pvalue is χ^{2} distributed. In our case, the KS Pvalues are correlated in t because the t windows overlap, so the Pvalue of the combined test statistic is estimated by Monte Carlo sampling. Thus, P=N_{>}/N, where the integer N_{>} counts the number of Φ_{i,F} for which Φ_{i,F}>Φ_{Ev}.
For each time window and F, the KS test computes a ‘distance’ between the cumulative distribution function (CDF) for y for a trial event or pseudoevent set, and a reference distribution CDF. A greater distance reflects a lower probability that samples drawn from the reference set could deviate from the ‘average’ of that set by more than the trial set. These distances translate to approximate KS Pvalues, P_{KS}(t;F), through a wellstudied universal function^{49,50}. As our reference CDFs are rigorously stochastically ordered, yielding strictly declining P_{KS}(t;F) for increasing F (t held fixed) once P is small, we can employ a onesided KS test rather than the more typical twosided test. When the number of samples between t and 30 ms in the trial set, k is greater than 4, we use the standard asymptotic expansion^{49} for the distance to P_{KS} function; for smaller k we use the direct smallsample formulae. The P_{KS} for small k are generally close to unity, and contribute little to Φ. The estimated P_{KS} include ‘twosample’ corrections to account for the sampling error in the reference CDFs; however, these corrections are very small because the simulation sample sizes are large. Any approximations involved in calculating the P_{KS} do not greatly affect the overall Pvalue, as the former are not interpreted directly in terms of Type I (false positive) errors, but are only used to compute the combined test statistic Φ whose Pvalue is determined by Monte Carlo methods.
Note that for the analysis of the compatibility of the events with F=1, which yielded an overall Pvalue of 0.3, the KS Pvalues are not small and the use of the onesided KS test is not justified. Hence, in this case only, we used the twosided KS test.
We have approached the statistical analysis from the perspective of significance testing, that is, by seeking to reject hypotheses corresponding to sufficiently large values of F for which the data appear incompatible. If desired, however, the unrejected interval, −65<F<110, which includes systematic errors, could also be interpreted as a confidence region for F (with a coverage probability of 95% corresponding to our 5% significance level).
Event data set
The event data set analysed here includes all those antihydrogen atoms trapped in the ALPHA apparatus in 2010 and 2011 that were held for more than 400 ms, escaped the trap within 30 ms of the magnet shutdown initiation, and whose annihilation locations reconstructed to be within z=±138 mm of the trap centre. Regions beyond z=±138 mm were excluded because the trap wall has a significant inward step at these z locations.
Additional information
How to cite this article: The ALPHA Collaboration and Charman, A.E. Description and first application of a new technique to measure the gravitational mass of antihydrogen. Nat. Commun. 4:1785 doi: 10.1038/ncomms2787 (2013).
References
 1
Good, M. L. and the equivalence principle. Phys. Rev. 121, 311–313 (1961).
 2
Pakvasa, S., Simmons, W. A. & Weiler, T. J. Test of equivalence principle for neutrinos and antineutrinos. Phys. Rev. D 39, 1761–1763 (1989).
 3
LoSecco, J. M. The case for neutrinos from SN 1987a. Phys. Rev. D 39, 1013–1019 (1989).
 4
Adelberger, E. G., Heckel, B. R., Stubbs, C. W. & Su, Y. Does antimatter fall with the same acceleration as ordinary matter? Phys. Rev. Lett. 66, 850–853 (1991).
 5
Hughes, R. J. & Holzscheiter, M. H. Constraints on the gravitational properties of antiprotons and positrons from cyclotronfrequency measurements. Phys. Rev. Lett. 66, 854–857 (1991).
 6
Apostolakis, A. et al. Tests of the equivalence principle with neutral kaons. Phys. Lett. B 452, 425–433 (1999).
 7
Morrison, P. Approximate nature of physical symmetries. Am. J. Phys. 26, 358–368 (1958).
 8
Schiff, L. I. Sign of the gravitational mass of a positron. Phys. Rev. Lett. 1, 254–255 (1958).
 9
Schiff, L. I. Gravitational properties of antimatter. Proc. Natl Acad. Sci. 45, 69–80 (1959).
 10
Nieto, M. M. & Goldman, T. The arguments against ‘antigravity’ and the gravitational acceleration of antimatter. Phys. Rep. 205, 221–281 (1991).
 11
Goldman, T., Hynes, M. & Nieto, M. M. The gravitational acceleration of antiprotons. Gen. Relat. Gravit. 18, 67–70 (1986).
 12
Nieto, M. M., Goldman, T., Anderson, J. D., Lau, E. L. & PerezMercader, J. Theoretical motivation for gravitation experiments on ultralowenergy antiprotons and antihydrogen. Preprint at http://arxiv.org/abs/hepph/9412234 (1994).
 13
Chardin, G. Motivations for antigravity in General Relativity. Hyperfine Interact. 109, 83–94 (1997).
 14
Fischler, M., Lykken, J. & Roberts, T. Direct observation limits on antimatter gravitation. Preprint at http://arxiv.org/abs/0808.3929 (2008).
 15
Hajdukovic, D. Do we live in the universe successively dominated by matter and antimatter? Astrophys. Space Sci. 334, 219–223 (2011).
 16
BenoitLévy, A. & Chardin, G. Introducing the DiracMilne universe. Astron. Astrophys. 537, A78 (2012).
 17
Witteborn, F. C. & Fairbank, W. M. Experiments to determine the force of gravity on single electrons and positrons. Nature 220, 436–440 (1968).
 18
Poth, H. Physics with Antihydrogen. 2nd Conference on the Intersections between Particle and Nuclear Physics, Lake Louise: Canada, (1986).
 19
Goldman, T., Hughes, R. J. & Nieto, M. M. Gravitational acceleration of antiprotons and of positrons. Phys. Rev. D 36, 1254–1256 (1987).
 20
Deutch, B. I. et al. Antihydrogen production by positroniumantiproton collisions in an ion trap. Physica Scripta T22, 248–255 (1988).
 21
Gabrielse, G. Trapped antihydrogen for spectroscopy and gravitation studies: Is it possible? Hyperfine Interact. 44, 349–356 (1988).
 22
Beverini, N., Lagomarsino, V., Manuzioa, G., Scuri, F. & Torelli, G. Possible measurements of the gravitational acceleration with neutral antimatter. Hyperfine Interact. 44, 357–363 (1988).
 23
Adelberger, E. G. & Heckel, B. R. Adelberger and Heckel reply. Phys. Rev. Lett. 67, 1049 (1991).
 24
Poggiani, R. A possible gravity measurement with antihydrogen. Hyperfine Interact. 76, 371–377 (1993).
 25
Phillips, T. J. Antimatter gravity studies with interferometry. Hyperfine Interact. 109, 357–365 (1997).
 26
Mills, A. P. Jr. & Leventhal, M. Can we measure the gravitational free fall of cold Rydberg state positronium? Nucl. Instr. Meth. B 192, 102–106 (2002).
 27
Kellerbauer, A. et al. Proposed antimatter gravity measurement with an antihydrogen beam. Nucl. Instrum. Meth. Phys. Res. B 266, 351–356 (2008).
 28
Chardin, G. et al. Proposal to measure the gravitational behaviour of antihydrogen at rest. Report No. CERNSPSC2011029/ SPSCP342 30/09/2011 (CERN: Meyrin, Switzerland, (2011).
 29
Cronin, A. D. et al. Letter of intent: Antimatter gravity experiment (AGE) at Fermilab. http://www.fnal.gov/directorate/program_planning/Mar2009PACPublic/AGELOIFeb2009.pdf (2009).
 30
Andresen, G. B. et al. Trapped antihydrogen. Nature 468, 673–676 (2010).
 31
Andresen, G. B. et al. Confinement of antihydrogen for 1000 seconds. Nat. Phys. 7, 558–564 (2011).
 32
Andresen, G. B. et al. Search for trapped antihydrogen. Phys. Lett. B 695, 95–104 (2011).
 33
Pritchard, D. E. Cooling neutral atoms in a magnetic trap for precision spectroscopy. Phys. Rev. Lett 51, 1336–1339 (1983).
 34
Bertsche, W. et al. A magnetic trap for antihydrogen confinement. Nucl. Instr. Meth. Phys. Res. A 566, 746–756 (2006).
 35
Andresen, G. B. et al. Production of antihydrogen at reduced magnetic field for antiatom trapping. J. Phys. B: At. Mol. Opt. Phys 41, 011001 (2008).
 36
Andresen, G. B. et al. Autoresonant excitation of antiproton plasmas. Phys. Rev. Lett. 106, 025002 (2011).
 37
Andresen, G. B. et al. Antihydrogen annihilation reconstruction with the ALPHA silicon detector. Nucl. Instr. Meth. Phys. Res. A 684, 73–81 (2012).
 38
Amole, C. et al. Discriminating between antihydrogen and mirrortrapped antiprotons in a minimumB trap. New J. Phys. 14, 015010 (2012).
 39
Amole, C. et al. Resonant quantum transitions in trapped antihydrogen atoms. Nature 483, 439–443 (2012).
 40
Fisher, R. A. Statistical Methods for Research Workers Oliver and Boyd (1925).
 41
Stephens, M. A. EDF statistics for goodness of fit and some comparisons. J. Amer. Stat. Assoc. 69, 730–737 (1974).
 42
Gabrielse, G. et al. Trapped antihydrogen in its ground state. Phys. Rev. Lett. 108, 113002 (2012).
 43
Surkov, E. L., Walraven, J. T. M. & Shlyapnikov, G. V. Collisionless motion of neutral particles in magnetostatic traps. Phys. Rev. A 49, 4778–4786 (1994).
 44
Pinkse, P. W. H., Mosk, A., Weidemüller, M., Reynolds, M. W., Hijmans, T. W. & Walraven, J. T. M. Onedimensional evaporative cooling of magnetically trapped atomic hydrogen. Phys. Rev. A 57, 4747–4760 (1998).
 45
Zhmoginov, A., Charman, A., Fajans, J. & Wurtele, J. S. Nonlinear dynamics of antihydrogen magnetostatic traps: implications for gravitational measurements. Preprint at http://arxiv.org/abs/1303.2738 (2013).
 46
Coakley, K., Doyle, J., Dzhosyuk, S., Yang, L. & Huffman, P. Chaotic scattering and escape times of marginally trapped ultracold neutrons. J. Res. Natl. Stand. Technol. 110, 367–376 (2005).
 47
Robicheaux, F. & Hanson, J. D. Threebody recombination for protons moving in a strong magnetic field. Phys. Rev. A 69, 010701 (2004).
 48
Brun, R., Bruyant, F., Maire, M., McPherson, A. C. & Zanarini, P. GEANT 3: User’s Guide Geant 3.10, Geant 3.11; rev. version CERN (1987).
 49
Gail, M. H. & Green, S. B. Critical values for the onesided twosample KolmogorovSmirnov statistic. J. Amer. Stat. Assoc. 71, 757–760 (1976).
 50
Stephens, M. A. Use of the KolmogorovSmirnov, Cramervon Mises and related statistics without extensive tables. J. Roy. Stat. Soc. 32, 115–122 (1970).
Acknowledgements
This work was supported by: CNPq, FINEP/RENAFAE (Brazil); ISF (Israel); FNU (Denmark); VR (Sweden); NSERC, NRC/TRIUMF, AITF, FQRNT (Canada); DOE, NSF, LBNLLDRD (USA); and EPSRC, the Royal Society and the Leverhulme Trust (UK). We are grateful for the efforts of the CERN AD team, without which these experiments could not have taken place.
Author information
Affiliations
Consortia
Contributions
This retrospective analysis was based on data collected in 2010 and 2011 by the entire ALPHA collaboration, using the antihydrogen trapping apparatus and methods developed by the collaboration. The gravity analysis described here was proposed and first carried out by J.F. and J.S.W. The analysis employed computer simulations developed by J.F.; the results of these simulations were checked with independent simulations developed by F.R. The statistical techniques were refined by A.E.C., and the final implementation of these techniques was carried out by A.Z. E.B., A.O. and F.R. proposed alternative statistical treatments that yielded similar results. This article was written by J.F. and J.S.W., with help from A.E.C., M.C., J.S.H. and A.Z., and then improved and approved by all the authors.
Corresponding authors
Ethics declarations
Competing interests
The author declares no competing financial interests.
Additional information
A full list of authors for the ALPHA Collaboraton and their affiliations appears at the end of the paper.
Supplementary information
Supplementary Information
Supplementary Figures S1S2 and Supplementary Notes 12 (PDF 603 kb)
Rights and permissions
This work is licensed under a Creative Commons AttributionNonCommercialNoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/byncnd/3.0/
About this article
Cite this article
The ALPHA Collaboration., Charman, A. Description and first application of a new technique to measure the gravitational mass of antihydrogen. Nat Commun 4, 1785 (2013). https://doi.org/10.1038/ncomms2787
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/ncomms2787
Further reading

Investigation of the fine structure of antihydrogen
Nature (2020)

Fundamental symmetry tested using antihydrogen
Nature (2020)

Optical trapping of antihydrogen towards an atomic anticlock
Hyperfine Interactions (2020)

Interference of several gravitational quantum states of antihydrogen in GBAR experiment
Hyperfine Interactions (2019)

Gravitational and matterwave spectroscopy of atomic hydrogen at ultralow energies
Hyperfine Interactions (2019)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.