## Introduction

Approximately half of global primary production occurs in the oceans1. In the vast expanse of the oligotrophic oceans, marine primary production is limited by nitrogen2. In these regions, nitrogen (N2) fixation by diazotrophs has been hypothesized to be an important source of new nitrogen ultimately influencing the uptake and sequestration of CO23,4,5. For instance, in the North Atlantic subtropical gyre, a significant seasonal carbon drawdown in the absence of measurable nutrients has been attributed to episodic and patchy N2 fixation events6,7. These events are, however, difficult to capture using current methods, which rely on discrete sampling. Furthermore, most observations to date have been collected in tropical and subtropical open oceans, overlooking the potential role of coastal regions8,9. The limited number of observations impede our ability to close regional marine nitrogen budgets, scale estimates globally, and identify factors controlling N2 fixation10.

Marine N2 fixation is generally believed to be regulated by various factors including light, temperature, nutrients, and trace-metal availability11. While low light, low temperature, and high bioavailable nitrogen have traditionally been assumed to limit N2 fixation, recent studies have reported significant diazotrophic activities in darker, colder, and more nitrogen-rich environments, thereby broadening the putative biogeography of marine N2 fixation12,13,14,15,16. The discovery of these new niches has been accompanied by increasing appreciation for the large diversity of species fixing N2, ranging from the well-known Trichodesmium and diatom-diazotroph associations (DDA) to more recently-recognized unicellular cyanobacteria17 and non-cyanobacterial diazotrophs18.

Taken together, the large uncertainty in regional and global budgets of marine N2 fixation, associated with its patchy and recently broadened biogeography, demands new tools to adequately map this important biogeochemical process. We recently deployed a new method across more than 6000 km of the North Atlantic to revisit the geographical distribution and assess the controlling factors of N2 fixation. Our near-real-time, continuous high-resolution measurements allowed us to locate hotspots of N2 fixation and adapt our sampling strategy to characterize plankton communities and environmental properties19 (Methods). The contribution of N2 fixation to net community production (NCP) was simultaneously evaluated at high resolution. Our new observations reveal hotspots of marine N2 fixation along the Eastern Seaboard, and highlight the overlooked significance of N2 fixation to coastal and global nitrogen and carbon cycling.

## Results and Discussion

### N2 fixation distribution and controlling factors

Our survey revealed substantial variability and diel cycling behavior in surface N2 fixation rates, which ranged from less than 0.01 to nearly 15 nmol N L−1 h−1 (Fig. 1a and Supplementary Fig. 1). When integrated over 24-h N2 fixation diel cycles, continuous estimates of daily surface N2 fixation rates (≤0.19–97.6 nmol N L−1 d−1) were in line with discrete N2 fixation rates concurrently determined by 15N2 incubations (n = 7, r = 0.97, p < 0.01, Fig. 1b and Methods). To extrapolate surface measurements to the entire euphotic zone, we derived an empirical relationship between surface and depth-integrated N2 fixation rates (Supplementary Fig. 2). Our lower-end measurements of less than 10 μmol N m−2 d−1 are within the range of published rates near Bermuda20 (Fig. 1c). In contrast, high N2 fixation rates reaching 3000 μmol N m−2 d−1 near the New Jersey coast are among the top 2% of rates ever reported in the global ocean21, further underscoring the high N2 fixation along the continental shelf of the eastern seaboard22, and the value of high-frequency observations for identifying hotspots. The coastal region (bathymetry ≥ −200 m) stands in sharp contrast to open ocean areas, with depth-integrated N2 fixation rates in the coastal sectors (geometric mean of 577 μmol N m−2 d−1) being on average an order of magnitude larger than open ocean rates (geometric mean of 85 μmol N m−2 d−1).

Sea surface temperature did not appear to drive the spatial variability of N2 fixation in our study area (Supplementary Fig. 3 and Supplementary Fig. 4). In open ocean regions, N2 fixation rates varied substantially even when the temperature range was narrow. Moreover, high N2 fixation rates were observed at temperatures ranging from 23 °C off New Jersey to 30 °C near the Florida coast. Fixed nitrogen is known to suppress N2 fixation, but the threshold of inhibition differs among diazotrophs and can be fairly high23. Observed dissolved inorganic nitrogen did not effectively regulate N2 fixation rates (Supplementary Fig. 4). We also note that a recent study shows that these coastal waters may be nitrogen limited in summer24. In addition, the excess of nitrogen in subsurface waters $$\left( {{\mathrm{N}}^ \ast = \left[ {{\mathrm{NO}}_3^ - } \right] - [{\mathrm{PO}}_4^{3 - }] \times 16} \right)$$, commonly used as a geochemical proxy for the distribution of N2 fixation25, was not a strong predictor of overlying N2 fixation rates (Fig. 1c and Supplementary Fig. 4). This comparison should be interpreted with caution, as N* integrates over longer spatial and temporal scales than our observations. N* is also not well resolved in coastal waters and may be affected by other processes such as atmospheric nitrogen deposition26. In contrast, some of the N2 fixation hotspots coincided with high phosphorus concentration and excess phosphorus $$\left( {{\mathrm{P}}^ \ast = [{\mathrm{PO}}_4^{3 - }] - [{\mathrm{NO}}_3^ - ]/16} \right)$$ at the ocean surface (Fig. 1b and Supplementary Fig. 4). Regions of N2 fixation have been hypothesized to be coupled to areas of denitrification via the upwelling of waters deficient in nitrogen relative to phosphorus27. The phytoplankton bloom near the New Jersey coast, where the highest N2 fixation rates were observed, recurs almost every summer and may be associated with local upwelling28. High N2 fixation rates have also been reported in other upwelling systems worldwide, including the equatorial Atlantic Ocean29, the northwest African coastal upwelling30, and the Benguela Upwelling System31. The excess phosphorus may also result from terrestrial and/or riverine runoff. For example, N2 fixation and carbon sequestration in the tropical North Atlantic were shown to be enhanced by the Amazon River plume32.

A recent study in the Eastern South Pacific suggested Fe, rather than phosphorus, may limit N2 fixation33, questioning the spatial coupling between N2 fixation and denitrification34. However, we did not find a strong relationship between dissolved Fe and N2 fixation rates across our study (Supplementary Fig. 4). Dissolved Fe ranged from 0.5 nmol L−1 near Bermuda to around 1.8 nmol L−1 along the Florida coast, which is higher than Fe measured in the Eastern South Pacific. Fe concentration is admittedly a poor predictor of Fe availability, as concentrations merely reflect snapshots of the complex interactions between sources and sinks35. Consequently, we calculated Fe* ($${\mathrm{Fe}}^ \ast = {\mathrm{Fe}} - {R}_{{\mathrm{Fe}}}{\mathrm{PO}}_4$$; where $${R}_{{\mathrm{Fe}}} = 0.47\;{\mathrm{mmol}}{\mathrm{Fe}}:1\;{\mathrm{mol}}\;{\mathrm{PO}}_4$$) to evaluate whether Fe potentially limits phytoplankton growth36. Positive values of Fe* across the study area indicate Fe was not limiting. The coupling between N2 fixation and N loss may be a dominant factor in regions where Fe is abundant, notably in coastal oceans. We hypothesize that, in contrast to the Eastern South Pacific, North American coastal waters support substantial N2 fixation due to high sedimentary nitrogen loss37 and high Fe input (e.g., from sediment and atmospheric dust deposition)38 (Supplementary Fig. 5). Interestingly, N2 fixation rates correlated well with dissolved manganese (Mn) concentrations. While the correlation of N2 fixation to Mn could be coincidental or symptomatic of other factors, it deserves further investigation as the physiological requirement for Mn in marine diazotrophs is poorly characterized.

Our N2 fixation measurements strongly correlated to satellite estimates of chlorophyll-a concentrations ([Chl]) (Fig. 1a and Supplementary Fig. 4). This is unexpected as N2 fixation is generally believed to be most significant where nitrogen is limited, such as the low biomass regions of the subtropical gyres. The low-[Chl] waters of the Sargasso Sea, typically viewed as N2 fixation hotspots39, exhibited lower N2 fixation rates than those measured in the Mid-Atlantic Bight. This pattern was further supported by a meta-analysis, which showed that N2 fixation is correlated to [Chl] in the global ocean (Supplementary Fig. 6). This pattern may be related to the stimulation of non-autotrophic N2 fixation by organic matter in high [Chl] waters40. However, while our field observations identify phosphorus, P*, and [Chl] as predictors of spatial variations in N2 fixation in our study area, a meta-analysis of published results shows that none of the putative regulating factors of N2 fixation can satisfactorily explain variations in volumetric rates globally (Supplementary Fig. 6). We posit that N2 fixation is likely driven by a complex interplay of spatially-variable environmental factors, also reflecting the heterogeneity and the large diversity of marine diazotrophs and their niches.

### Distribution of diazotrophic phylotypes

Diazotrophs and their potential hosts were identified via high-throughput quantitative 16S rRNA and 18S rRNA gene sequencing from our 2015 cruise (Methods). Although the 16S rRNA gene approach differs from the nifH method for characterizing diazotrophs in terms of specificity and coverage41, it provides some insights into the broad distribution of diazotrophs. To address the cases where organisms may not be capable of N2 fixation despite sharing a similar 16S rRNA gene with diazotrophs, we only searched for diazotrophs known to fix N2 among our 16S rRNA gene sequences. Distinct diazotrophic communities were found to dominate in different ecological domains (Fig. 2a). Heterotrophic groups, which include members known to be diazotrophs42, were more abundant than diazotrophic cyanobacteria in the open ocean, where N2 fixation was relatively low. These observations are consistent with recent recognition of the widespread distribution of non-cyanobacterial diazotrophs, whose activities remain poorly constrained18. Trichodesmium peaked off South Carolina. Richelia intracellularis showed relatively high abundances in the oligotrophic open ocean and peaked in coastal waters, where its hosts—Rhizosolenia and Hemiaulus—were also found at relatively high abundances (Fig. 2b). The most striking N2 fixation hotspot off the New Jersey coast was likely driven by a UCYN-A bloom that reached 2 × 107 16S rRNA gene copies L−1, which is of comparable magnitude to UCYN-A abundances (2.5–3.5 × 107 L−1) observed previously in the same region using the nifH method22. One of UCYN-A’s hypothesized hosts, Braarudosphaera bigelowii also flourished in this region. Across all samples, the ratio of UCYN-A (16S rRNA gene):B. bigelowii (18S rRNA gene) varied from 0.1 to 35 with a median of 0.24. The two organisms co-occurred in most samples, consistent with previous studies that suggest obligate symbiosis43,44. There is a growing interest in UCYN-A’s unusual physiological and ecological traits45. Its genetic diversity, evolution, and the niches inhabited by its different lineages deserve further investigation. Meanwhile, its presence in the coastal oceans provides new opportunities to study this unique organism. The divergent geographic distribution of different diazotrophs in our research area likely reflects their respective niches. For example, warm seawater is more favorable to Trichodesmium46, while UCYN-A prefers temperate environments17. This difference may partly explain why Trichodesmium abundances peaked off the South Carolina coast (~30 °C) while UCYN-A dominated in regions with lower temperatures (~23 °C). Overall, the quantitative 16S rRNA and 18S rRNA gene sequencing methods revealed spatial patterns of diazotrophs and their hosts despite the assumptions of our quantitative sequencing methods, e.g., equivalent recovery efficiency for both the standard and the natural sequences in the sample (Methods).

### Variable contribution of N2 fixation to new production

N2 fixation has been estimated to be an important source of new nitrogen in oligotrophic waters, supporting as much as 50% of new production4,47, yet the contribution of N2 fixation to productivity in coastal oceans remains relatively understudied48. To assess the proportion of production fueled by N2 fixation, we combined N2 fixation observations with underway estimates of NCP based on high-frequency measurements of the dissolved O2/Ar ratio (Methods). NCP was mostly positive, with higher rates along the North American coast where high [Chl] and N2 fixation rates were also observed (Fig. 3a). NCP was relatively low in the open ocean likely due to nutrient limitation. As a rough estimate, we converted N2 fixation rates and NCP to their carbon equivalents using a theoretical C:N:O2 stoichiometry of 106:16:138. Regional differences in stoichiometry would modify but not erase the large gradients of N2 fixation and NCP observed over our cruise transects. We found that the contribution of N2 fixation to NCP varied substantially over the western North Atlantic (Fig. 3b). Across large portions of the oligotrophic subtropical ocean, no more than 20% NCP was generally fueled by N2 fixation (with some high excursions). Other mechanisms of nutrients supply, such as revised estimates of vertical nitrate flux49, must therefore be invoked to support the NCP rates we observed and those that have been reported in the Sargasso Sea. In contrast, the ratio of N2 fixation to NCP exceeded 50% in some regions off the Cape Hatteras and New Jersey coasts (Fig. 3b). The high contribution of N2 fixation to primary production off the New Jersey coast is supported by dual-tracer 15N2 and 13C incubations (Supplementary Fig. 7). Despite methodological differences between these techniques, both methods independently capture similar spatial patterns of contribution of N2 fixation to biological carbon fixation (see NCP estimates in Methods). Our results highlight that N2 fixation is not only high in coastal regions, but may also contribute significantly to marine production.

### Updated N2 fixation budget and global implications

To contextualize our findings, we performed a meta-analysis combining 15N2 incubations collected during our cruises with discrete N2 fixation measurements compiled from the literature, not including our underway continuous measurements (Supplementary Fig. 8). Our updated database contains over 80% more depth-integrated observations (1172 points in total) than the most up-to-date database currently available in the literature (630 points)21. Less than 15% of observations reported in the literature were collected in coastal waters. However, these observations support our findings of high N2 fixation rates in the neritic environment (Fig. 4), notably on the eastern American coast5,22, eastern Arabian Sea50, and estuaries of the Baltic Sea14. The similar magnitude of depth-integrated N2 fixation rates in coastal and open oceans leads to significantly higher volumetric N2 fixation rates in the coastal oceans due to shallower integration depths (Fig. 4 and Methods). However, recent reports of N2 fixation in the deep ocean12,13 may reverse the pattern of marginally higher depth-integrated N2 fixation rates in coastal waters, if N2 fixation is integrated to the aphotic zone of the open ocean.

In light of these new observations, we reassessed the budgets of marine N2 fixation globally as well as separately for the neritic and oceanic regions (Supplementary Fig. 9). Our updated geometric and arithmetic mean estimates of marine N2 fixation for the global ocean at 70.8 Tg N yr−1 and 196.1 Tg N yr−1, respectively, are slightly higher than other estimates21,51 (Table 1 and Supplementary Table 1). Earlier studies report coastal N2 fixation rates of 15 Tg N yr−1, with most activity associated with benthic diazotrophs52. Our updated analysis shows that a significant portion of N2 fixation is also occurring in the water column of coastal regions, contributing an additional 6.6 (geometric) or 16.7 (arithmetic) Tg N yr−1 to the global budget. This nitrogen input could support the equivalent of 95 Tg C yr−1 of primary production. These updated N2 fixation budgets have large uncertainties since they are sensitive to the extrapolation method to scale the sparse data to the global ocean. Recent studies have also identified methodological issues in historical observations, which could lead to under-53 or over-54 estimations. However, the number of observations resulting from the revised dissolved 15N2 incubation method is currently too limited to derive robust global estimates. Nevertheless, after accounting for both aquatic and sedimentary N2 fixation, the coastal oceans may play a larger role in the nitrogen cycle than previously considered.

Using an underway method recently developed in our lab for continuous high-resolution N2 fixation measurements, we mapped N2 fixation at unprecedented scales across the western North Atlantic, identifying hotspots of N2 fixation in the Mid-Atlantic Bight. Our study challenges the classic paradigm of N2 fixation distribution and further underscores the central role coastal regions play in the global cycling of nutrients and carbon55. With coastal regions being exposed to ever-increasing anthropogenic disturbances56, expansion of coastal monitoring efforts using high-resolution methods will be critical to evaluate ongoing biogeochemical changes and their global repercussions.

## Methods

### Underway N2 fixation rate measurements

N2 fixation rates were estimated at high resolution using a continuous underway method of Flow-through incubations for Acetylene-Reduction Assays by Cavity ring-down laser Absorption Spectroscopy (FARACAS). A description of the method is presented in Cassar et al. (2018)19 with a brief outline below. Nitrogenase activity in seawater is estimated based on the conventional technique of acetylene (C2H2) reduction to ethylene (C2H4)57,58,59. Instead of injecting C2H2 gas directly into the incubation bottle, C2H2 gas produced from high-purity calcium carbide (Acros Organics) is first dissolved in 0.2-μm filtered seawater that is collected from a trace-metal clean towfish (Geofish) to make a C2H2–H2O tracer. The dissolved C2H2 approach was previously applied for measuring nitrogenase activity in estuarine sediments60. The C2H2–H2O tracer is then mixed at a constant ratio using a two-channel peristaltic pump (Masterflex) with a continuous stream of seawater supplied by the Geofish. The mixture of C2H2–H2O tracer and seawater is continuously pumped into a 9-L flow-through incubation reactor (Chemglass) at a flow rate of ~100 mL min−1. The short flow-through incubation, with an e-folding residence time of 90 min (i.e. ~63% of the seawater in the incubation reactor replaced in 90 min), minimizes the effects of C2H2 on metabolic processes and on microbial community structure61,62. The incubation reactor is lit by a strand of cool-light LEDs fitted with blue filters to simulate the light quality at the ocean surface. The light intensity is instantly calculated and adjusted based on the ship’s location and local time. A water jacket on the incubation reactor is flushed with a high-flow rate of continuous surface seawater to mimic the in situ sea surface temperature. Downstream of the flow-through incubation reactor, the seawater flows into a gas extraction chamber. This gas extraction chamber consists of a glass frit with medium-size pores (Chemglass) and a custom-built gas-water separation system. A flow of 35 mL min−1 of C2H4-free air controlled by a mass flow controller (OMEGA) continuously purges the incubated seawater, extracting ethylene out of the seawater, and carrying it to a Cavity Ring-Down laser absorption Spectrometer (CRDS, Picarro) for analyses. This CRDS ethylene analyzer measures ethylene concentrations in real time at ppb levels with high frequency and accuracy63. Approximately every 3 h, the incubation reactor is bypassed to determine the background ethylene concentration in the mixture of C2H2–H2O tracer and seawater. The difference between the incubation ethylene and background ethylene concentrations represents ethylene production rates during the incubation period. Finally, ethylene production rates are converted to N2 fixation rates using a conversion factor of 4:158,64,65,66. We acknowledge that 4:1 is a theoretical ratio with uncertainties. However, our comparison of FARACAS to the 15N2 addition method shows good agreement when applying this conversion factor19. In the current configuration, the detection limit of FARACAS is 0.19 nmol N L−1 d−1, which is also comparable to the 15N2 addition method.

### Discrete N2 fixation and primary production incubations

For comparison to our underway survey of N2 fixation, discrete 15N2 incubation experiments in parallel with 13C additions were also conducted at eight stations during the 2015 cruise using methods detailed in previous studies30,51. Seawater samples were collected from each station at three levels in the euphotic zone, including the surface (5 m), an intermediate depth above the Deep Chlorophyll Maximum (DCM), and the DCM. Four liters of seawater were immediately filtered onto glass microfiber filters (MGF, 0.7 μm, Sartorius) to determine natural concentrations and isotopic signatures of particulate organic carbon (POC) and particulate nitrogen (PN). For incubation experiments, 4.5-L Nalgene polycarbonate bottles were first partly filled with natural seawater. Then, 15N2-enriched filtered seawater (98% 15N atom%, Eurisotop, batch number 23/051301) and NaH13CO3 solution (99%, Eurisotop) were added into the incubation bottles, reaching approximate final enrichments of 2 15N atom% and 7 13C atom%. Finally, 4.5-L Nalgene bottles were topped off with natural seawater from sampled depths and capped with septum-fitted screw caps. Incubations were subsequently performed for 24 h in on-deck incubators covered by blue light filters simulating light intensity at the sampled depths. Incubators were flushed with surface seawater to avoid heating due to sunlight. Finally, incubated seawater was filtered onto MGF filters, which were stored at −20 °C until further analysis on land. Filters were treated and analyzed for POC, PN, δ13CPOC, and δ15NPN using an Elemental Analyzer-Isotope Ratio Mass Spectrometer (EA-IRMS; EuroVector Euro EA 3000 coupled to a Delta V Plus, Thermo Scientific) to calculate corresponding carbon uptake and N2 fixation rates. The N2 fixation rates measured by our underway method closely match the results obtained from discrete incubation experiments (n = 7, r = 0.97, p < 0.01). A more comprehensive inter-method comparison can be found in Figure 5 of Cassar et al. (2018)19, showing good agreement between the two methods.

### Nutrients and trace-metal analyses

Nutrient samples were collected from a CTD rosette equipped with 24 12-L Niskin bottles. Seawater was subsampled in acid-washed 15-mL polypropylene vials and immediately preserved at −20 °C. Nitrate + Nitrite and phosphate were analyzed on land using an Automatic Nutrients Analyzer with detection limits of 0.03 μM and 0.014 μM, respectively.

For trace-metal analyses, all reagents, standards, and blanks were prepared in acid-cleaned low-density polyethylene (LDPE) or Teflon-fluorinated ethylene propylene (FEP) bottles. Bottles were cleaned following GEOTRACES protocols. Trace metal samples were collected in surface seawater (~5 m) using a towed fish (UCSC) deployed along side of the ship while underway67. During stops, the towfish was recovered from seawater to avoid contamination. Surface seawater was pumped through Teflon tubing to a sink located in a home-made clean plastic bubble installed within the chemistry lab on the ship. There, seawater was filtered in-line from the Teflon tubing outlet using 0.22-µm pore-size Acropak filter cartridges and collected in acid-washed 60-mL LDPE bottles that were triple-rinsed with ~20 mL of filtered seawater before final sample collection. Samples for dissolved trace metal were then acidified to pH = 2 with concentrated HCl (Ultrapur grade, Merck) under a laminar flow hood equipped with HEPA filter. Samples were then double-bagged and stored in the dark, at room temperature, until analysis. On land, all analyses were performed in cleanroom environments at the Pôle Spectrométrie Océans (Brest, France).

Seawater samples were introduced to a PFA-ST nebulizer and a cyclonic spray chamber via a SeaFASTpico introduction system (Elemental Scientific Incorporated, Omaha, NE), following the protocol of Lagerström et al. (2013)68. High-purity grade solutions and water (Milli-Q, 18.2 MΩ cm) were used to prepare the following reagents on a daily basis. Buffer was made from 0.5 M acetic acid (Ultrapur grade, Merck) and 0.6 M ammonium hydroxide (Ultrapur grade, Merck) and was adjusted to pH = 8.3. Elution acid was made of 1.6 M HNO3 (Ultrapur grade, Merck) in Milli-Q water and spiked with 1 μg mL−1 In (PlasmaCAL calibration standards) to allow for drift correction. Autosampler and column rinsing solutions were made from 0.012 M HCl (Ultrapur grade, Merck) in Milli-Q water.

Mixed element standard solution was prepared gravimetrically using high-purity standards (Fe, Mn, Cd, Co, Zn, Cu, Pb; PlasmaCAL calibration standards) in 0.8 M HNO3 (Ultrapur grade, Merck). A six-point calibration curve was prepared by standard additions of the mixed element standard to our in-house standard (North Atlantic filtered seawater, collected at 55.87445° N/48.09345° W, 40 m depth, 0.15 nM) and run at the beginning, the middle and the end of each run. Final concentrations of samples and procedural blanks were calculated from In-normalized data. Precision was assessed through replicate samples (every tenth sample was a replicate) and accuracy was determined from analysis of consensus seawater (SAFe S1 and D2, and GSP, GSC).

### Diazotrophic community structure analysis

Diazotrophic phylotypes were identified and quantified using data obtained from 16S rRNA amplicon sequencing of environmental DNA, targeting the V4 region69. Eukaryotic hosts of some diazotroph taxa were similarly detected using amplicon sequencing of the V4 region of the 18S rRNA gene. Detailed experimental protocols are described in Wang et al. (2018) including sample collection, addition of internal controls for quantitative sequencing, DNA extraction, primer sequences, PCR amplification steps, and procedures for the analysis of sequencing data70. This quantitative analysis has previously been described and applied in other environments71,72. Here, the processes are described briefly. From 0.2 to 1 L of seawater (average of 0.8 L) pumped from a towed fish were filtered onto a 0.22-μm polycarbonate filter using a peristaltic pump. The low-volume samples were typically collected in coastal waters, where high biomass led to clogging of filters. The volume filtered was recorded for each sample. The filter was flash-frozen immediately in liquid nitrogen and stored at −80 °C. Following the cruise, DNA extraction was performed using the Qiagen DNeasy Plant Mini Kit according to the manufacturer’s instructions, with several modifications adapted from Moisander et al. (2008)73. Prior to bead beating, 3.04 ng of Thermus thermophilus (ATCC #27634D-5) genomic DNA and 0.679 ng of Schizosaccharomyces pombe (ATCC #24843D-5) genomic DNA were added to each sample as internal DNA standards, each in 50 μL volumes. These additions introduced ~5,780,000 and 2,800,000 copies of S. pombe and T. thermophilus rRNA sequences sample−1, amounts expected to constitute ≤1% of total reads sample−1 following sequencing70. PCR cycle parameters are detailed in Wang et al. (2018). Following PCR purification using the Qiagen QIAquick PCR Purification Kit, samples were pooled at equimolar concentrations. Illumina MiSeq sequencing (300 bp PE reads, V3 chemistry) was performed at the Sequencing and Genomic Technologies Shared Resources core facility at the Duke Center for Genomic and Computational Biology (Durham, USA). Raw rRNA sequences and metadata are available from the NCBI Sequence Read Archive under accession number SRP126177.

We used QIIME to process and analyze our Illumina sequencing data following the pipeline described in Fadrosh et al. (2014)74,75. Taxonomy tables reporting raw counts of 16S rRNA gene and 18S rRNA gene were produced by open-reference operational taxonomic unit (OTU) picking at the 97% threshold using the Usearch 6.1 algorithm and the SILVA ribosomal RNA database76,77,78. The SILVA ribosomal RNA database was supplemented with the addition of full length 16S rRNA gene sequences of UCYN-A1 and UCYN-A2 (accession: NC_013771, CP001842, JPSP01000003, and JPSP01000022). Absolute abundances of the 16S rRNA gene or 18S rRNA gene for each OTU were subsequently calculated based on the number of internal standard sequences recovered71. Finally, the concentrations of 16S and 18S rRNA genes in the environment were calibrated for the volume of seawater sample filtered. Common diazotrophs observed from clone library studies across the global ocean and their eukaryotic hosts were picked out from our 16S and 18S taxonomy tables, respectively42,73,79,80,81.

The internal standard method is subject to a number of limitations and caveats71. A key assumption of the approach is that the recovery efficiency of the standard is equivalent to the recovery efficiency of the natural sequences in the sample. Variation of rRNA gene copy number is also an important consideration. However, while recovery of the standard may differ from recovery of natural taxa in the same sample, variation in standard recovery efficiency from sample to sample will reflect differences in starting material, losses during elution, and other processes as long as the same PCR and library preparation protocol is followed. In that case, any biases in the quantitative measurement due to amplification biases or DNA extraction recoveries should be consistent across the samples. Therefore, the 16S rRNA approach is informative when providing the spatial distribution and abundance patterns of diazotrophic taxa.

### NCP estimates

NCP reflects the balance between plankton community photosynthesis and respiration. An excess of photosynthesis leads to a net production of particulate and dissolved organic carbon, which can either accumulate at the ocean surface or be exported to depth. To estimate the proportion of NCP fueled by N2 fixation, we measured NCP underway using the O2/Ar method82. Oxygen concentrations in the surface ocean are influenced by biological processes, such as photosynthesis and respiration, as well as physical processes including bubble injection, temperature, and pressure changes. Due to the similar solubility properties of O2 and Ar, the biological O2 supersaturation ([O2]sat) can be calculated by removing the effects of physical processes determined from Ar supersaturation ([Ar]sat)83. Biological O2 supersaturation and undersaturation reflect the metabolic state of the surface ocean, suggesting autotrophic or heterotrophic conditions, respectively84,85. Under steady-state conditions within the mixed layer and when vertical mixing is negligible over the residence time of O2 at the ocean surface, NCP can be estimated based on the exchange of biological O2 with the atmosphere using the equations below.

$${\mathrm{NCP}} \approx k_{{\mathrm{O}}_2} \ast \left[ {\mathrm{O}_2} \right]_{{\mathrm{sat}}} \ast \Delta {(\mathrm{O}_2}/{\mathrm{Ar})}$$
(1)
$$\Delta \left({\mathrm{O}}_{2}/{\mathrm{Ar}}\right) = [\frac{{([{\mathrm{O}}_{2}]/[{\mathrm{Ar}}])}}{{([{\mathrm{O}}_{2}]/[{\mathrm{Ar}}])_{{\mathrm{sat}}}}} - 1]$$
(2)

$$k_{{\mathrm{O}}_2}$$ is the gas exchange velocity for oxygen86,87. The uncertainties in the NCP estimate are mainly from errors associated with $$k_{{\mathrm{O}}_2}$$ and vertical mixing of O2. Dissolved O2/Ar ratios in surface seawater were continuously measured by Equilibrator Inlet Mass Spectrometry (EIMS) during the 2015 and 2016 cruises. O2/Ar-NCP estimates were converted to carbon-NCP assuming a constant O2/C stoichiometry88,89.

We note that our observations of NCP fueled by N2 fixation should be interpreted with caution mainly due to differences in timescales of integration. Our O2/Ar-NCP observations integrate productivity over 3–4 days in this region, while the N2 fixation measurements reflect hourly or daily rates. We cannot rule out the possibility that high N2 fixation rates occurred during late-stages of a phytoplankton bloom when nitrogen was exhausted29 or, conversely that release of N by diazotrophs relieved N starvation and initiated rapid growth of non-N2 fixers90. This artefact in integration timescales is circumvented with dual-tracer 15N2 and 13C incubations, which also show a high contribution of N2 fixation to primary production off the coast of New Jersey (Supplementary Fig. 7). In addition, negative NCP values accompanied by detectable N2 fixation and heterotrophic diazotrophs were observed over a large portion of the transition zone between the neritic and open ocean regions. These observations may be attributed to transient net heterotrophy, advective transport of organic matter, or vertical mixing of O2-depleted waters. Further studies within a Lagrangian framework will be required to explore the coupling between N2 fixation and the net metabolic status of marine systems.

The contribution of N2 fixation to NCP measured by FARACAS and O2/Ar method shows a similar spatial pattern as the contribution of N2 fixation to NPP measured by 15N2/13C incubation. 13C-based primary production measures yield rates closer to NPP than NCP91. Therefore, the 15N2/13C-based approach to assessing N2 fixation’s contribution to biological production should relate to our FARACAS-O2/Ar according to the following equation:

$$\frac{{{\mathrm{N}}_2\;{\mathrm{fixation}}}}{{{\mathrm{NCP}}}} \ast {\mathrm{export}}\;{\mathrm{ratio}} = \frac{{{\mathrm{N}}_2\;{\mathrm{fixation}}}}{{{\mathrm{NPP}}}}$$
(3)

Where the $${\mathrm{export}}\;{\mathrm{ratio}} = \frac{{{\mathrm{NCP}}}}{{{\mathrm{NPP}}}}$$. In some cases, we estimate a contribution of N2 fixation to NCP of 80–100%. Should we assume an export ratio of 8.4% for the oligotrophic Sargasso Sea based on BATS estimates92, the contribution of N2 fixation to NPP implied by our approach (6–8%) is approximately in line with our discrete incubation-based estimates (Supplementary Fig. 7). Coastal environments likely exhibit higher export ratios of 0.2–0.393, which would yield a range of contribution of N2 fixation to NPP of up to 16–30%. Thus, while discrepancies exist between 15N2/13C and FARACAS-O2/Ar-based approaches, the broad relationship between these quantities is as expected.

### Nitrogen budget via N2 fixation in the global ocean

An updated database of depth-integrated and volumetric N2 fixation rates over the global ocean is presented in Supplementary Fig. 8. The complete dataset of global N2 fixation is shown in the Supplementary Data 1, which includes 1172 depth-integrated and 4299 volumetric N2 fixation measurements. We conducted a Welch’s t-test to evaluate whether the N2 fixation rates in the coastal oceans (bathymetry ≥ −200 m) are significantly higher than in the open ocean. This one-tailed hypothesis was examined at the 0.01 significance level. N2 fixation rates were first log-transformed since they are approximately log-normally distributed (Fig. 4a–c). Based on these analyses, the volumetric N2 fixation rates at different depths and at surface are significantly larger in coastal regions than in the open ocean (p < 0.01), while depth-integrated N2 fixation rates appear to be similar in both systems.

Nitrogen inputs through N2 fixation were further evaluated for coastal and open oceans separately by scaling to the surface areas of the respective regions. The areal extents of the coastal and open oceans were calculated using ArcGIS. Land (orange), coastal (cyan), and open ocean (blue) regions were delineated using bathymetric contour lines (GEBCO One Minute Grid), with depth criteria of 0 m and −200 m as shown in Supplementary Fig. 9. Surface areas were calculated under a Cylindrical Equal Area Projection (World) with 10° latitudinal bands except at latitudes higher than 50° N/50° S. We calculated global budgets after removing outliers identified using a Tukey’s test. These outliers include some extremely high values in the Indian Ocean50 and some extremely low rates in the eastern North Atlantic Ocean94 (Supplementary Fig. 8). Global budgets including outliers were also computed but may be substantially biased and are not shown here. Geometric means were used because depth-integrated and volumetric N2 fixation rates are approximately log-normally distributed (Fig. 4a–c). Flux budgets were determined by multiplying the geometric mean of N2 fixation rates by the area of each latitudinal band. Uncertainties were estimated based on the propagation of errors. Regional and global N2 fixation rates are presented in Table 1 and Supplementary Table 1. For comparison, we also present budgets based on the arithmetic mean of N2 fixation rates. Large uncertainties are expected in high latitude regions because of the limited number of observations.