Introduction

Earth's proton aurora is formed when charged protons precipitate into the atmosphere loss cone, within a few degrees at the equator and subsequently collide with the neutral atmosphere at low altitudes1. Proton aurora can provide important information for understanding magnetosphere-ionosphere interaction by supplementing the direct imaging of the magnetosphere2. Since charged protons are easily trapped inside the Earth's magnetic field due to a minimum magnetic field existing at the equator, such proton precipitation requires a scattering mechanism to break the first adiabatic invariant. In the basically collisionless and tenuous magnetosphere, interactions between protons and plasma waves can induce such non-adiabatic scattering. One important plasma wave, electromagnetic ion cyclotron (EMIC) wave can interact with protons and efficiently scatter protons into the atmosphere3,4,5,6. However, strong EMIC waves are present primarily along the plasma plume in the duskside7,8,9 and on the dayside in the outer magnetosphere L > 610,11,12. This appears to be difficult for EMIC wave alone to explain the broad MLT distribution of proton auroral emission from the morning sector to the dusk sector associated with the precipitating protons at lower L-shells. Hence, the fundamental and long-outstanding problem as to what mechanism is primarily responsible for proton auroral emission still remains unresolved. Another important wave, fast magnetosonic (MS) wave, also named equatorial noise13, can resonate with protons, potentially leading to rapid pitch angle scattering of protons. Furthermore, MS waves can propagate eastward (later MLT) or westward (earlier MLT) over a broad region of MLT14,15,16. However, it has not been possible so far to determine whether MS wave is indeed another generating mechanism of the proton aurora, because simultaneous observations concerning MS wave activity, proton pitch distribution and proton auroral emissions are challenging due to extremely difficult observational conditions. Fortunately, such simultaneous observations were serendipitously found in the unique events on September 16, 2003, to identify such mechanism.

Results

On September 16, 2003 an interplanetary coronal mass ejection (CME) interacted with the Earth's magnetosphere, leading to a small geomagnetic storm (Dst ≈ −50 nT). The Cluster spacecrafts, which stayed around the noon near the Earth's equator for about ten minutes then, recorded strong MS wave activities for a duration of ~ 10 minutes in the frequency range ~ 10–150 Hz and around the noon local time sector from 11.47 to 11.41 MLT (Fig. 1a). MS wave is confined within a few degrees of the geomagnetic equator and exhibits a notable harmonic structure, viz., spaced at 4fcp and 12fcp (Fig. 1a). Moreover, MS wave has a normal angle θ ≈ 90° (Fig. 1b) and an ellipticity ≈ 0 (Fig. 1c), implying that MS wave propagates very obliquely with respect to the ambient magnetic field direction17.

Figure 1
figure 1

Satellite data on 16 September 2003 storm.

Data collected by the Cluster STAFF instrument during ~ 10 minutes duration for MS wave power (a), wave normal angle (b), the angle between the Earth's magnetic field and the normal to the plane of the wave; and wave ellipticity (c), the degree of elliptical polarization. MS waves are right hand polarized electromagnetic waves which occur as a series of narrow tones, spaced at multiples of the proton gyrofrequency fcp up to the lower hybrid resonance frequency fLH. a, MS waves maximizes basically at frequencies from ~ 10–150 Hz, spaced at 4fcp (fcp ≈ 5 Hz) and 12fcp. The white dotted line denotes the lower hybrid resonance frequency fLH. b–c, The observed waves have a high normal angle θ ≈ 90° and a high degree of elliptical polarization (ellipticity ≈ 0), indicating that the k vector is approximately perpendicular to the ambient magnetic field direction. d, Pitch angle distribution of protons for different indicated energies (2–10 keV) measured by CIS instrument. Proton fluxes peak at a pitch angle of 90° and drop dramatically at small pitch angles. e–j, Auroral snapshots for northern hemisphere and southern hemisphere as a function of magnetic local time (MLT) and magnetic latitude (MLat) obtained by FUV-SI12 onboard IMAGE spacecraft when IMAGE spacecraft travels at the location ~ 15 Mlat and altitude = 8 RE. Proton aurora bands are present from the morning sector to dusk sector 09:00–18:00 MLT, with the strongest emission basically in 14:00–18:00 MLT. The auroral emission is asymmetric with a more intensity and a broader latitude in the northern hemisphere than those in the southern hemisphere.

Figure 1d shows a pancake distribution of proton flux for different indicated energies (2–10 keV) measured by CIS instrument. Proton flux maximizes at a pitch angle of 90° and drops remarkably at small pitch angles at each energy, particularly for energy above 3.53 keV. Since a pancake distribution is produced when protons with smaller pitch angles have been scattered into the loss cone, any endeavor to identify whether MS waves are really responsible for proton auroral precipitation must also explain the formation of pancake distributions.

Figure 1e–j show auroral snapshots observed by FUV-SI12 instrument onboard IMAGE spacecraft18 for the northern hemisphere and southern hemisphere produced by the precipitation of 2–10 keV protons originating from the plasma sheet. The SI12 instrument detects the Doppler-shifted Lyman-α photons corresponding to precipitating charge-exchanged protons with energies of a few keV19,20. Proton aurora covers a broad MLT region 09:00–18:00 MLT, with stronger emissions roughly in 14:00–18:00 MLT. Such a broad MLT distribution of proton aurora requires MS waves which can produce proton scattering to occur over a similar broad MLT region.

Unfortunately, there is no direct wave data over such a broad MLT region during this event. Here, we adopt the previously developed programs21,22 to trace MS waves with different frequencies based on the wave data (Fig. 1a–b). MS waves are launched at 12:00 and 15:00 MLT for different locations at the geomagnetic equator. The modeled results (Fig. 2) confirm that MS waves can indeed propagate over the similar MLT region from the morning sector to the dusk sector 09:00–18:00 MLT.

Figure 2
figure 2

Ray-tracing of MS waves.

Ray paths representing MS waves are initiated at the equator, at MLT = 12:00 (a) and 15:00 (b), with the ray-tracing parameters as shown in Table 1. The corresponding color scale indicates the initial wave frequency of each ray in Hz. The black circles indicate the initial positions L = 5.0, 5.5 and 6.0, where L is the distance in Earth radii (1RE = 6, 370 km) from the centre of the Earth to the equatorial crossing of a given magnetic field line. MS waves can propagate eastward (later MLT) or westward (earlier MLT) basically from the morning sector to the dusk sector 0900–1800 MLT, corresponding to the occurrence of proton auroral emission.

Resonant interactions between MS waves and protons take place when the wave frequency equals a multiple of the proton gyrofrequency in the proton reference frame. These cyclotron MS-proton interactions produce efficient exchange of energy and momentum between waves and protons, leading to stochastic proton scattering. Such stochastic scattering can be evaluated in terms of pitch angle and momentum diffusion coefficients induced by cyclotron MS-proton interactions. Then, the dynamic evolution of the proton distribution function can be calculated by solving the Fokker-Planck diffusion equation associated with pitch-angle and momentum diffusion coefficients23. Here, we use this method to model evolutions of the proton distribution following their injection into the magnetosphere.

Calculation of the diffusion coefficients requires a detailed knowledge of the amplitudes and spectral properties of the waves. A standard way is to assume that the wave spectral density obey a Gaussian distribution in wave frequency and wave normal angle24 (see the details in Methods). To allow the data fitting more reasonable, we average the observed wave magnetic field intensity over the indicated time period in this event and then present a least squares Gaussian fit to the observed two-band spectral intensity (Fig. 1a), together with the corresponding fitting parameters as shown in Figure 3. We then calculate bounce-averaged diffusion coefficients for MS waves as shown in Figure 4a–c. All the diffusion coefficients cover a broad region of pitch angle and energy, with high values at lower pitch angles. In particular, the pitch angle diffusion rate above ~ 7 keV can approach 0.01/s around the loss-cone, allowing efficient pitch scattering of protons into the loss-cone in a time scale of tens of seconds. Moreover, pitch angle and cross diffusion coefficients are higher than momentum diffusion coefficients particularly at lower pitch angles, implying that pitch angle diffusion dominates over the energy diffusion while cross diffusion coefficients should also contribute to proton-MS interaction.

Figure 3
figure 3

Gaussian Fitting curves.

The modeled Gaussian fit (solid) to the observed two-band wave spectra (dot) over a three-minute period 17:58:25–18:01:25 is shown, together with the fitted wave amplitudes Bt, the peak wave frequencies fm, the bandwidths δf, the lower and upper bands f1 and f2.

Figure 4
figure 4

Diffusion rates Proton bounce-averaged pitch angle (〈Dαα〉, a), momentum (〈Dpp〉, b) and cross (〈Dαp〉, c) diffusion rates for resonant MS wave interactions with protons.

Pitch angle 〈Dαα〉 and cross 〈Dαp〉 are respectively higher than momentum 〈Dpp〉. Combined scattering by all three diffusion rates leads to rapid pitch-angle scattering between 2 and 10 keV at lower pitch angles and the resultant proton auroral precipitation into the atmosphere on a timescale comparable to tens of seconds. Evolution of the proton flux Starting with an initial condition representative of protons (d), we show the proton flux due to scattering by MS waves from a numerical solution to the two-dimensional Fokker-Planck diffusion equation. MS wave causes a rapid loss of low-energy (2–10 keV) protons within tens of seconds (e–f), leading to the strongest proton auroral precipitation from the morning sector to the dusk sector 0900–1800 MLT.

Using those diffusion rates in Figure 4a–c, we solve a two-dimensional Fokker-Planck diffusion equation by a recently introduced hybrid difference method25 and calculate phase space density f evolution of protons due to MS waves. We then simulate the evolution of differential flux j by the subsequent conversion j = p2f and extend the results to the range of 90°–180° due to the mirror symmetry. The results (Fig. 4d–f) confirm that MS waves can produce a rapid pitch angle scattering of protons within tens of seconds and substantially modify the entire population of injected protons, leading to the formation of pancake distributions at energies 2–10 keV. It should be mentioned that anomalous depressions (or dips) occur around 90° because of the absence of resonant scattering at (or around) 90°. Moreover, the diffusion rates around 90° are very small in the energy range 1–3 keV and gradually increase above 3 keV. Consequently, those dips around the pitch angle 90° are more noticeable when energy increases (Fig. 4a–c). Although our simulation results are sensitively dependent on adopted model parameters and display anomalous depressions around 90°, the dominant features of Figure 4f are very similar to the Cluster data shown in Figure 1d. Considering that MS waves can propagate over a broad MLT region in this event (Fig. 2), we therefore conclude that MS waves are indeed responsible for the precipitation of energetic protons into the atmosphere, producing the resultant proton aurora (Fig. 1e–j).

Discussion

The simultaneous observation and corresponding modeling presented in this study firstly link MS waves to the origin of proton aurora. Our results definitively demonstrate that, as presented in Fig. 5, MS waves can produce rapid proton precipitation responsible for proton aurora, naturally accounting for the remnant pancake proton distribution left behind in space and the broad MLT distribution of proton aurora. Although our simulations were performed around the noon 11:30 MLT, this conclusion should be valid over the entire region of excited MS waves since the basic properties of MS-proton interaction should not change no matter whether the amplitude and morphology of MS waves are different under different locations.

Figure 5
figure 5

Protons scattered by MS waves.

Schematic diagram showing the solar wind protons, MS waves and the proton aurora. The image of the Earth was created by HZG and XFL. (1) The energetic protons originating from the plasmasheet are trapped in the Earth's magnetic field and drift westward around the Earth. (2) MS waves propagate eastward or westward over a broad region from the morning side to the dusk side. (3) Wave-particle interaction occurs in the same region. (4) It acts as an unexpected mechanism to scatter the trapped protons into the atmosphere, yielding the proton aurora.

As shown in Figure 1, the proton auroral emission maximizes in the afternoon sector, which appears to be inconsistent with the recent statistical survey of MS waves observed on THEMIS spacecraft26 that such waves are strongest in the dawn to pre-noon sector. However, the proton aurora emission intensity primarily depends on two factors: the number of the precipitating protons and the MS wave intensity. Since the precipitating protons originating from the plasma sheet drift westwards around the Earth (see Fig. 5), the protons encounter resonance with the afternoon sector MS waves at first, allowing part of protons precipitating into the atmosphere. Then the rest of protons continue resonating with MS waves on other sectors in their drifting. Moreover, the potentially existing plume EMIC waves along the duskside (though not observed directly here) may also contribute to the proton scattering. This probably explains why the proton aurora peak in the afternoon sector, instead of in the dawn to pre-noon sector.

It should be point out that, we use the quasi-linear theory of wave-particle interaction. Analogous to the Particle-in-cell treatment, the quasi-linear theory has been frequently adopted by the space plasma physics and magnetosphere research community to treat wave-particle interaction. The quasi-linear theory is valid by assuming that each individual particle randomly walks in velocity space, resonates with a succession of uncorrelated waves and is scattered in a small amount in pitch angle and energy each time. Those conditions are basically satisfied in the Earth's radiation belts for naturally generated MS waves, where the bandwidth is generally above the proton cyclotron frequency up to the lower hybrid frequency. Moreover, this study is intended to propose a new mechanism of proton aurora by MS-driven scattering, but not to exclude EMIC waves as a potential wave responsible for the proton scattering. The relative contribution to the proton aurora emission from MS or EMIC waves deserves a future study.

Finally, in a departure from the previous works27, we focus on the pitch-angle scattering by the MS waves instead of the instability of MS waves in this study. The enhanced MS waves appear to be excited by those injected anisotropic protons with a typical ring distribution at energies of ~10 keV or above prior to this event. Unfortunately, there is no direct data on MS waves or energetic proton distributions before this event because the Cluster satellite doesn't stay in the radiation belt. We leave the instability of MS waves to a future study.

Methods

The ray tracing of waves is performed by using the following standard Hamiltonian equations28:

where R, ω, k and t represent the position vector of a point on the ray path, the wave frequency, the wave vector and the group time, respectively. The wave dispersion relation D obeying D(R, ω, k) = 0 at every point along the ray path, has well been documented in the previous work. The spatial variation in D can be written:

where B0 is a ambient magnetic field and Nc is the background plasma density.

Here, two Cartesian coordinate systems are adopted for the ray-tracing calculation15,21. The first is Earth centered Cartesian coordinate system (OXYZ), in which Z axis points north along the geomagnetic axis; and the X and Y axes stay in the geomagnetic axis equatorial plane. The second is a local Cartesian system, in which the z axis points along the direction of the ambient magnetic field, the x axis is orthogonal to the z axis and stays in the meridian plane pointing away from the Earth at the equator and the y axis completes the right-handed set. The wave vector k makes an angle θ with the z axis and the projection of k onto the xy plane makes an angle η with the x axis, viz., . η = 0°, 90°, 180° and 270° correspond to the perpendicular component k pointing away from the Earth, toward later MLT (eastward), toward the Earth and toward earlier MLT (westward), respectively.

For ray-tracing, we adopt a dipole magnetic field model and the global core plasma density model29. Considering that MS wave propagate very obliquely, we choose the initial wave normal angle θ = 88° and the initial azimuthal angle η = 150° (toward earlier MLT) and 30° (toward later MLT). The other ray-tracing parameters for different wave frequencies and source locations are shown in Table 1.

Table 1 Parameters for Ray Tracing

To calculate the diffusion rates, we assume that the MS wave spectral density follows a typical Gaussian frequency distribution with a center fm, a half width δf, a band between f1 and f230.

here Bt is the wave amplitude in units of Tesla and erf is the error function.

We choose the wave normal angle distribution to also satisfy a standard Gaussian form:

where X = tan θ (θ1 ≤ θ ≤ θ2, X1,2 = tan θ1,2), with a half-width Xω and a peak Xm. Based on the observation, we choose Xm = tan 89°, Xω = tan 86°, X1 = XmXω, X2 = Xm + Xω; and the maximum latitude for the presence of MS waves λm = 5°. We assume that the wave spectral intensity remain constant along the dipolar geomagnetic field line.

As shown in Figure 3, there is a cross region between two bands with the cross frequency fcr = 70 Hz. We consider contribution from harmonic resonances up to n = ±20 for the lower band and n = ±30 for the upper band. To avoid repeating calculation from the cross region of two bands, the lower band stops at fcr = 70 Hz; and the upper band starts at fcr = 70 Hz. We then compute MS-driven bounce-averaged diffusion coefficients at the location L = 4.6.

The evolution of the proton phase space density f is calculated by solving the bounce-averaged pitch angle and momentum diffusion equation

here p is the proton momentum, G = p2Te) sin αe cos αe with αe being the equatorial pitch angle, the normalized bounce time Te) ≈ 1.30 − 0.56 sin αe; 〈Dαα〉, 〈Dpp〉 and 〈Dαp〉 = 〈D〉 denote bounce-averaged diffusion coefficients in pitch angle, momentum and cross pitch angle-momentum. The explicit expressions of those bounce-averaged diffusion coefficients for MS waves can be found in the previous work24,25.

The initial condition is modeled by a kappa-type distribution function of energetic protons31:

here np, l, κ and Γ denote the number density of energetic protons, the loss-cone index, the spectral index and the gamma function. represents the effective thermal parameter normalized by the proton rest mass energy mpc2 (~ 938 MeV).

Solution of the diffusion equation (6) requires to choose the appropriate and realistic initial and boundary conditions in order for a realistic simulation of this event. Boundary conditions in pitch angle are taken: f = 0 at the loss-cone αe = αL (sin αL = L−3/2(4 − 3/L)−1/4) to simulate a rapid precipitation of protons inside the loss cone (Fig. 1d) and ∂f/∂αe = 0 at αe = 90°. For the energy boundary conditions, f is assumed to remain constant at the lower boundary 0.5 keV and the upper boundary 20 keV, respectively.

Based on the observational data, we choose the following values of parameters: , l = 0.01, κ = 3 and np = 2.9 cm−3. We solve the equation using the recently developed hybrid difference method25, which is efficient, stable and easily parallel programmed. The numerical grid is set to be 91 × 101 and uniform in pitch angle and natural logarithm of momentum.