## Abstract

Marine viruses are critical drivers of ocean biogeochemistry, and their abundances vary spatiotemporally in the global oceans, with upper estimates exceeding 10^{8} per ml. Over many years, a consensus has emerged that virus abundances are typically tenfold higher than microbial cell abundances. However, the true explanatory power of a linear relationship and its robustness across diverse ocean environments is unclear. Here, we compile 5,671 microbial cell and virus abundance estimates from 25 distinct marine surveys and find substantial variation in the virus-to-microbial cell ratio, in which a 10:1 model has either limited or no explanatory power. Instead, virus abundances are better described as nonlinear, power-law functions of microbial cell abundances. The fitted scaling exponents are typically less than 1, implying that the virus-to-microbial cell ratio decreases with microbial cell density, rather than remaining fixed. The observed scaling also implies that viral effect sizes derived from ‘representative’ abundances require substantial refinement to be extrapolated to regional or global scales.

Viruses of microbes have been linked to central processes across the global oceans, including biogeochemical cycling^{1,2,3,4,5,6} and the maintenance and generation of microbial diversity^{1,4,7,8,9}. Virus propagation requires that virus particles both contact and subsequently infect cells. The per cell rate at which microbial cells—including bacteria, archaea and microeukaryotes—are contacted by viruses is assumed to be proportional to the product of virus and microbial abundances^{10}. If virus and microbe abundances were related in a predictable way it would be possible to infer the rate of virus–cell contacts from estimates of microbial abundance alone.

Virus ecology underwent a transformation in the late 1980s with the recognition that virus abundances, as estimated using culture-independent methods, were orders of magnitude higher than estimates via culture-based methods^{11}. Soon thereafter, researchers began to report the ‘virus to bacterium ratio’ (VBR) as a statistical proxy for the strength of the relationship between viruses and their potential hosts in both freshwater and marine systems^{12}. This ratio is more appropriately termed the ‘virus-to-microbial cell ratio’ (VMR), a convention that we use here (Supplementary Section 1).

Observations accumulated over the past 25 years have noted a wide variation in the VMR, yet there is a consensus that a suitable first approximation is that the VMR is equal to 10 (Supplementary Table 1). This ratio also reflects a consensus that typical microbial abundances are approximately 10^{6} per ml and typical virus abundances are approximately 10^{7} per ml^{13,14}. Yet, the use of a fixed ratio carries with it another assumption—that of linearity—that is, if microbial abundance were to double, then viruses are expected to double as well. An alternative is that the relationship between virus and microbial abundances is better described in terms of a nonlinear relationship, for example, a power law.

In practice, efforts to predict the regional or global-scale effects of viruses on marine microbial mortality, turnover and even biogeochemical cycles depend critically on the predictability of the relative density of viruses and microbial cells. The expected community-scale contact rate, as inferred from the product of virus and microbial abundances, is a key factor for inferring virus-induced cell lysis rates at a site or sites (see, for example, ref. 15), which also depend on diversity^{16}, latent infections^{17} and virus–microbe infection networks^{18}. Here, we directly query the nature of the relationship between virus and microbial densities via a large-scale compilation and re-analysis of abundance data across marine environments.

## Results

### VMR exhibits substantial variation in the global oceans

In the compiled marine survey data (Fig. 1, Table 1 and the ‘Materials and methods’), 95% of microbial abundances range from 5.0 × 10^{3} to 4.1 × 10^{6} per ml, and 95% of virus abundances range from roughly 3.7 × 10^{5} to 6.4 × 10^{7} per ml (Fig. 2a). Both microbial and virus concentrations generally decrease with depth, as reported previously (for example, see ref. 19). We separated the samples according to depth using an operational definition of the near-surface and sub-surface, corresponding to samples taken at depths ≤100 m and >100 m, respectively. The cutoff of 100 m was chosen as a typical depth scale for the euphotic zone in systems with low to moderate chlorophyll^{20}. The precise depth varies spatiotemporally. Our intent was to distinguish zones strongly shaped by active planktonic foodweb dynamics in well-lit waters (the ‘near-surface’) from dark mesopelagic waters shaped primarily by decaying particle fluxes with greater depth (the ‘sub-surface’). The median VMR for the near-surface samples (≤100 m) is 10.5, and the median VMR for the sub-surface samples (>100 m) is 16.0. In that sense, the consensus 10:1 ratio does accurately represent the median VMR for the surface data. We also observe substantial variation in the VMR, as has been noted in earlier surveys and reviews (Supplementary Table 1). Figure 2b shows that 95% of the variation in VMR in the near-surface ocean lies between 1.4 and 160, and between 3.9 and 74 in the sub-surface ocean. For the near-surface ocean, 44% of the VMR values are between 5 and 15, 16% are less than 5 and 40% exceed 15. This wide distribution, both for the near-surface and the sub-surface, demonstrates the potential limitations in using the 10:1 VMR, or any fixed ratio, as the basis for a predictive model of virus abundance derived from estimates of microbial abundance.

### Virus abundance does not vary linearly with microbial abundance

Figure 3 shows two alternative predictive models of the relationship between logarithmically scaled virus and microbial abundances for water column samples. The models correspond to a fixed-ratio model and a power-law model. To clarify the interpretation of fitting in log–log space, consider a fixed-ratio model with a 12:1 ratio between virus and microbial abundance, *V* = 12 × *B*. Then, in log–log space the relationship is
$$\begin{array}{}\text{(1)}& \mathrm{lo}{\mathrm{g}}_{10}(V)=\phantom{\rule{thinmathspace}{0ex}}\mathrm{lo}{\mathrm{g}}_{10}12\phantom{\rule{thinmathspace}{0ex}}+\phantom{\rule{thinmathspace}{0ex}}\mathrm{lo}{\mathrm{g}}_{10}B\end{array}$$which we interpret as a line with a *y* intercept of log_{10}12 = 1.08 and a slope (change in log_{10}*V* for a one-unit change in log_{10}*B*) of 1. By the same logic, any fixed-ratio model will result in a line with slope 1 in the log–log plot, and the *y* intercept will vary logarithmically with VMR. The alternative predictive model is that of a power law: *V* = *cB*^{α1}. In log–log space, the relationship is
$$\begin{array}{}\text{(2)}& \mathrm{lo}{\mathrm{g}}_{10}V\phantom{\rule{thinmathspace}{0ex}}=\phantom{\rule{thinmathspace}{0ex}}\mathrm{lo}{\mathrm{g}}_{10}c\phantom{\rule{thinmathspace}{0ex}}+\phantom{\rule{thinmathspace}{0ex}}{\alpha}_{1}\mathrm{lo}{\mathrm{g}}_{10}B\end{array}$$
$$\begin{array}{}\text{(3)}& \mathrm{lo}{\mathrm{g}}_{10}V\phantom{\rule{thinmathspace}{0ex}}=\phantom{\rule{thinmathspace}{0ex}}{\alpha}_{0}\phantom{\rule{thinmathspace}{0ex}}+\phantom{\rule{thinmathspace}{0ex}}{\alpha}_{1}\mathrm{lo}{\mathrm{g}}_{10}B\end{array}$$The slope *α*_{1} of a fitted line on log-transformed data denotes the power-law exponent that best describes the relationship between the variables. The intercept *α*_{0} of a fitted line on log-transformed data denotes the logarithmically transformed pre-factor.

The 10:1 line has residual squared errors of −16% and −25% in the surface and deep samples, respectively (Supplementary Table 3). In both cases, this result means that a 10:1 line explains less of the variation in virus abundance compared with a model in which virus abundance is predicted by its mean value across the data. To evaluate the generality of this result, we considered an ensemble of fixed-ratio models, each with a different VMR. In the near-surface samples, we find that all fixed-ratio models explain less of the variation (that is, have negative values of *R*^{2}) than a ‘model’ in which virus abundance is predicted to be the global mean in the data set (Supplementary Fig. 1). This reflects the failure of constant-ratio (that is, linear) models to capture the cluster of high VMRs at low microbial density apparent in the density contours of Fig. 2a and the shoulder of elevated high VMR frequency in Fig. 2b. The largest contributor to this cluster of points is the ARCTICSBI study (Table 1). In the sub-surface samples, fixed-ratio models in which the VMR varies between 12 and 22 do have positive explanatory power, but all perform worse than the power-law model (Supplementary Fig. 1). In contrast, the best-fitting power-law model explains 15 and 64% of the variation in the data for near- and sub-surface samples, respectively (Supplementary Table 3). The best-fit power-law scaling exponents are 0.42 (with 95% confidence intervals (CIs) of (0.39,0.46)) for near-surface samples and 0.53 (with 95% CIs of (0.52,0.55)) for sub-surface samples.

The difference between a linear and a power-law model can be understood, in part, by comparing predictions of viral abundances as a function of variation in microbial abundances. For example, doubling the microbial abundance along either regression line is not expected to lead to a doubling in virus abundance, but rather a 2^{0.42} = 1.3- and 2^{0.53} = 1.4-fold increase, respectively. The difference between models becomes more apparent with scale, for example, 10- and 100-fold increases in near-surface microbial abundances are predicted to be associated with 10^{0.53} = 3.4- and 100^{0.53} = 11-fold increases in viral abundances, respectively, given a power-law model. The power-law model is an improvement over the fixed-ratio model in both the near- and sub-surface, even when accounting for the increase in parameters (Supplementary Table 3). In the near-surface, refitting the surface data without outliers improves the explanatory power to approximately *R*^{2} = 0.3 from an *R*^{2} = 0.65 for the sub-surface (Methods and Supplementary Fig. 2). The power-law exponents in the near- and sub-surface are qualitatively robust to variation in the choice of depth threshold, for example, as explored over the range between 50 and 150 m (Supplementary Fig. 3). In summary, the predictive value of a power-law model is much stronger in the sub-surface than in the near-surface, where confidence in the interpretation of power-law exponents is limited.

### Study-to-study measurement variation is unlikely to explain the intrinsic variability of virus abundances in the surface ocean

Next, we explored the possibility that the variation in methodologies affected the baseline offset of virus abundance measurements and thereby decreased the explanatory power of predicting virus abundances based on microbial abundances. That is, if *V** is the true and unknown abundance of viruses, then it is possible that two studies would estimate ${\hat{V}}_{1}={V}^{\ast}(1+{\u03f5}_{1})$ and ${\hat{V}}_{2}={V}^{\ast}(1+{\u03f5}_{2})$, where |*ε*_{1}| and |*ε*_{2}| denote the relative magnitude of study-specific shifts. We constrain the relative variation in measurement, such that the measurement uncertainty is 50% or less (see ‘Materials and methods’). The constrained regression model improves the explanatory power of the model (Supplementary Table 3), but, in doing so, the model forces 18 of the 25 studies to the maximum level of measurement variation permitted (Supplementary Fig. 4). We do not expect differences in measurement protocols to explain the nearly two orders of magnitude variation in estimating virus abundance, given the same true virus abundance in a sample. Note that when sub-surface samples were analysed through the constrained power-law model, there was only a marginal increase of 2% in *R*^{2} and, moreover, 9 of the 12 studies were fit given the maximum level of measurement variation permitted (Supplementary Fig. 4). The constrained intercept model results suggest that the observed variation in virus abundance in the surface oceans is not well explained strictly by the variation in measurement protocol between studies.

### VMR decreases with increasing microbial abundance, a hallmark of power-law relationships

We next evaluate an ensemble of power-law models, *V*_{i} = *c*_{i}*N*^{αi}, where index *i* denotes the use of distinct intercepts and power-law exponents for each survey. The interpretation of this model is that the nonlinear nature of the virus-to-microbial relationship may differ in distinct oceanic realms or due to underlying differences in sites or systems, rather than due to measurement differences. Figure 4 presents the results of fitting using the study-specific power-law model in the surface ocean samples. Study-specific power-law fits are significant in 18 of 25 cases in the surface ocean. The median power-law exponent for studies in the surface ocean is 0.50. Furthermore, of those significant power-law fits, the 95% distribution of the power-law exponent excludes a slope of one and is entirely less than one in 11 of 18 cases (Fig. 5). This model, in which the power-law exponent varies with study, is a significant improvement in terms of *R*^{2} (Supplementary Table 3). For sub-surface samples, study-specific power-law fits are significant in 10 of 12 cases in the sub-surface (Supplementary Fig. 5). The median power-law exponent for studies in the sub-surface is 0.67. Of those significant power-law fits, the central 95% distribution of the power-law exponent is less than one in 6 of 10 cases (Supplementary Fig. 6). A power-law exponent of less than one means that the virus abundance increases less than proportionately given increases in microbial abundance. This study-specific analysis extends the findings that nonlinear, rather than linear, models are more suited to describing the relationship between virus and microbial abundances. We find that the dominant trend in both near-surface and sub-surface samples is that the VMR decreases as microbial abundance increases. The increased explanatory power by study is stronger for near-surface than for sub-surface samples. This increase in *R*^{2} comes with a caveat: study-specific models do not enable *a priori* predictions of virus abundance given a new environment or sample site, without further effort to disentangle the biotic and abiotic factors underlying the different scaling relationships.

## Discussion

Viruses are increasingly considered in efforts to describe the factors controlling marine microbial mortality, productivity and biogeochemical cycles^{3,4,33,34,35,36}. Quantitative estimates of virus-induced effects can be measured directly, but are often inferred indirectly using the relative abundance of viruses to microbial cells. To do so, there is a consensus that assuming the VMR is 10 in the global oceans—despite the observed variation—is a reasonable starting point. Here, we have re-analysed the relationship of virus to microbial abundances in 25 marine survey data sets. We find that 95% of the variation in VMR ranges from 1.4 to 160 in the near-surface ocean and from 3.9 to 74 in the sub-surface. Although the 10:1 ratio accurately describes the median of the VMR in the surface ocean, the broad distribution of VMR implies that microbial abundance is a poor quantitative predictor of virus abundance. Moreover, increases in microbial abundance do not lead to proportionate increases in virus abundance. Instead, we propose that the virus to microbial abundance relationship is nonlinear and that the degree of nonlinearity—as quantified via a power-law exponent—is typically less than 1. This sublinear relationship can be interpreted to mean that the VMR decreases as an increasing function of microbial abundance and generalizes earlier observations^{13}. Power-law relationships between virus and microbial abundance emerge from complex feedbacks involving both exogeneous and endogenous factors. The question of exogenous factors could be addressed, in part, by examining environmental covariates at survey sites. For example, if microbial and virus abundances varied systematically with another environmental co-factor during a transect, then this would potentially influence the inferred relationship between virus and microbial abundances. In that same way, variation in environmental correlates, including temperature and incident radiation, may directly modify virus life history traits^{37,38}. Also, some of the marine survey data sets examined here constitute repeated measurements at the same location (for example, in the Bermuda Atlantic Time-series Study (BATS)). Time-varying environmental factors could influence the relative abundance of microbes and viruses. It is also interesting to note that viruses-induced mortality is considered to be more important at eutrophic sites^{13}, where microbial abundance is higher, yet the observed decline in VMR with microbial abundance would suggest the opposite.

It could also be the case that variation in endogenous factors determines total abundances. Endogenous factors can include the life history traits of viruses and microbes that determine which hosts are infected by which viruses^{18}, as well as the quantitative rates of growth, defence and infection. For example, relative strain abundances are predicted to depend on niche differences according to the ‘kill-the-winner’ theory, which presupposes tradeoffs between growth and defence^{1,39}. Similarly, the recent hypothesis of a complementary ‘king-of-the-mountain’ mechanism suggests that relative abundance relationships may depend on life history trait differences, even when tradeoffs are not strict^{40}. In both examples, total abundances may nonetheless depend on other factors, including the strength of grazing.

The analysis of abundance relationships also requires a consideration of variation in time. As is well known, virus–microbe interactions can lead to intrinsic oscillatory dynamics. Indeed, previous observations of a declining relationship between VMR and microbial abundance have been attributed to changing ratios across phytoplankton bloom events, including possible virus-induced termination of blooms^{13}. Similar arguments have been proposed in the analysis of tidal sediments^{41}. Alternatively, observations of declining VMR with microbial density have been attributed to a variation in underlying diversity^{42}. Another factor potentially complicating abundance predictions is that episodic events, including the induction of lysogenic populations, influence total microbial and viral counts. Varying degrees of lysogenic and co-infection relationships have been measured in marine virus–host systems^{14,17,43}, the consequences of which may differ from those given interactions with lytic viruses, as is commonly the focus of model- and empirical-based studies. Whatever the mechanism(s), it is striking that virus abundances in some surveys can be strongly predicted via alternative power-law functions of microbial abundances. Mechanistic models are needed to further elucidate these emergent macroecological patterns and relationships, akin to recent efforts to explain emergent power laws between terrestrial predators and prey^{44}.

The present analysis first separated the abundance data according to depth and then according to survey as a means to identify different relationships between virus and microbial abundances in the global oceans. The predictive value of total microbial abundance is strong when considering sub-surface samples. In contrast, microbial abundance is not a strong predictor of virus abundance in near-surface samples, when using linear or nonlinear models. The predictive power of nonlinear models improved substantially in the near-surface when evaluating each marine survey separately. The minimal predictive value of microbial cell abundances for inferring viral abundances in the near-surface when aggregating across all surveys is problematic given that virus–microbe interactions have significant roles in driving microbial mortality and ecosystem functioning^{3,5,33}. Indeed the aggregation of abundance measurements in terms of total microbial abundances may represent part of the problem.

At a given site and time of sampling, each microbial cell in the community is potentially targeted by a subset of the total viral pool. In moving forward, understanding the variation in virus abundance and its relationship to microbial abundance requires a critical examination of correlations at functionally relevant temporal and spatial scales, that is, at the scale of interacting pairs of viruses and microbes. These scales will help inform comparisons of virus–microbe contact rates with viral-induced lysis rates, thereby linking abundance and process measurements. We encourage the research community to prioritize examination of these scales of interaction as part of efforts to understand the mechanisms underlying nonlinear virus–microbe abundance relationships in the global oceans.

## Methods

### Data source

Marine virus abundance data were aggregated from 25 studies (Table 1). A total of 5,671 data points were aggregated. The data collection dates ranged from 1996 to 2012. Data were primarily collected from coastal waters in the northern hemisphere, predominately during the summer months, with the notable exceptions of long-term coastal monthly monitoring sites, that is, the studies USC MO, BATS and MOVE.

### Data processing

Analyses of the data were performed using R version 3.1.1. Scripts and original data are provided at https://github.com/WeitzGroup/Virus_Microbe_Abundance.

### Power-law model

A power-law regression model used the log_{10} of the predictor variable, microbial abundance per ml *N* and the log_{10} of the outcome variable, virus abundance per ml *V*. The power-law regression was calculated using the equation log_{10}*V* = *α*_{0} + *α*_{1}log_{10}*N*. The *α*_{0} and *α*_{1} parameters were fit via ordinary least squares (OLS) regression to minimize the sum of square error.

### Constrained variable-intercept model

The constrained model is a ‘mixed-effects’ regression model using the same predictor and outcome variables, log_{10} of microbial abundance per ml and the log_{10} virus abundance per ml, respectively. This model includes study-specific intercepts, which were constrained such that the values for any of the intercepts were restricted to one standard error above or below the intercept value taken from the power-law model. The standard error value for this model came from the power-law model. The equation for this model is $V\phantom{\rule{negativethinmathspace}{0ex}}=\phantom{\rule{negativethinmathspace}{0ex}}{\alpha}_{0}^{(i)}\phantom{\rule{negativethinmathspace}{0ex}}+\phantom{\rule{negativethinmathspace}{0ex}}{\alpha}_{1}N$, where ${\alpha}_{0}^{(i)}$ is the study-specific intercept and *α*_{1} is the slope common to all studies, *N* is the predictor variable and *V* is the outcome variable.

### Variable slope and variable intercept model

A power-law model where the exponent and intercept varied with each study was evaluated using the same predictor variable, log_{10} microbial abundance per ml, and the same outcome variable, log_{10} virus abundance per ml. In this model, there was a study-specific *α*_{0} and *α*_{1} and an OLS regression calculated using the equation $V\phantom{\rule{negativethinmathspace}{0ex}}=\phantom{\rule{negativethinmathspace}{0ex}}{\alpha}_{0}^{(i)}\phantom{\rule{negativethinmathspace}{0ex}}+\phantom{\rule{negativethinmathspace}{0ex}}{\alpha}_{1}^{(i)}\phantom{\rule{1pt}{0ex}}N$.

### Bootstrapping model CIs

Bootstrap analyses of the power-law model and mixed-effects models were conducted to derive 95% CIs surrounding the parameters estimated by the models. For all models the original data set was sampled with replacement, by study, to arrive at a bootstrap sample data set; this process was repeated 10,000 times. Distributions for all parameters were generated and the 2.5, 50 and 97.5% points were identified from among the 10,000 parameter estimates.

### Outlier identification

Outliers in the data were identified by calculating the top and bottom 2% of estimated VMRs amongst the entire 5,671 samples. The outliers corresponded to ratios below 1.81 and above 128. Those samples with VMRs that fell outside these bounds were considered outliers. There were 218 outlier samples taken at depths of ≤100 m and 10 outlier samples taken at depths of >100 m.

### Depth cutoff robustness

The cutoff point for which data were partitioned into either the near-surface or the sub-surface was varied from 50 m to 150 m in 1 m increments. For each step, a power-law model was evaluated for both the near-surface and the sub-surface.

## References

- 1.
Thingstad, T. F. Elements of a theory for the mechanisms controlling abundance, diversity, and biogeochemical role of lytic bacterial viruses in aquatic systems.

*Limnol. Oceanogr.***45,**1320–1328 (2000). - 2.
Suttle, C. A. Viruses in the sea.

*Nature***437,**356–361 (2005). - 3.
Brussaard, C. P. D.

*et al.*Global-scale processes with a nanoscale drive: the role of marine viruses.*ISME J.***20,**575–578 (2008). - 4.
Rohwer, F. & Thurber, R. V. Viruses manipulate the marine environment.

*Nature***459,**207–212 (2009). - 5.
Weitz, J. S. & Wilhelm, S. W. Ocean viruses and their effects on microbial communities and biogeochemical cycles.

*F1000 Biol. Reports***4,**17 (2012). - 6.
Jover, L. F., Effler, T. C., Buchan, A., Wilhelm, S. W. & Weitz, J. S. The elemental composition of virus particles: implications for marine biogeochemical cycles.

*Nature Rev. Microbiol.***12,**519–528 (2014). - 7.
Weitz, J. S., Hartman, H. & Levin, S. A. Coevolutionary arms races between bacteria and bacteriophage.

*Proc. Natl Acad. Sci. USA***102,**9535–9540 (2005). - 8.
Avrani, S., Schwartz, D. A. & Lindell, D. Virus-host swinging party in the oceans: incorporating biological complexity into paradigms of antagonistic coexistence.

*Mobile Genet. Elements***2,**88–95 (2012). - 9.
Payet, J. P. & Suttle, C. A. To kill or not to kill: the balance between lytic and lysogenic viral infection is driven by trophic status.

*Limnol. Oceanogr.***58,**465–474 (2013). - 10.
Murray, A. G. & Jackson, G. A. Viral dynamics: a model of the effects of size, shape, motion and abundance of single-celled planktonic organisms and other particles.

*Marine Ecol. Progr. Ser.***89,**103–116 (1992). - 11.
Bergh, O., Borhseim, K. Y., Bratbak, G. & Heldal, M. High abundance of viruses found in aquatic environments.

*Nature***340,**467–468 (1989). - 12.
Maranger, R. & Bird, D. F. Viral abundance in aquatic systems—a comparison between marine and fresh-waters.

*Marine Ecol. Progr. Ser.***1210,**217–226 (1995). - 13.
Wommack, K. E. & Colwell, R. R. Virioplankton: viruses in aquatic ecosystems.

*Microbiol. Mol. Biol. Rev.***64,**69–114 (2000). - 14.
Weinbauer, M. G. Ecology of prokaryotic viruses.

*FEMS Microbiol. Rev.***28,**127–181 (2004). - 15.
Mojica, K. D. A.

*et al.*Phytoplankton community structure in relation to vertical stratification along a north–south gradient in the Northeast Atlantic Ocean.*Limnol. Oceanogr.***60,**1498–1521 (2015). - 16.
Edwards, R. A. & Rohwer, F. Viral metagenomics.

*Nature Rev. Microbiol*.**3,**504–510 (2005). - 17.
Brum, J. R., Hurwitz, B. L., Schofield, O., Ducklow, H. W. & Sullivan, M. B. Seasonal time bombs: dominant temperate viruses affect southern ocean microbial dynamics.

*ISME J.*http://dx.doi.org/10.1038/ismej.2015.125 (2015). - 18.
Weitz, J. S.

*et al.*Phage–bacteria infection networks.*Trends Microbiol.***21,**82–91 (2013). - 19.
Danovaro, R.

*et al.*Marine viruses and global climate change.*FEMS Microbiol. Rev.***35,**993–1034 (2011). - 20.
Morel, A. & Berthon, J. Surface pigments, algal biomass profiles, and potential production of the euphotic layer: relationships reinvestigated in view of remote-sensing applications.

*Limnol. Oceanogr.***340,**1545–1562 (1989). - 21.
Parsons, R. J., Breitbart, M., Lomas, M. W. & Carlson, C. A. Ocean time-series reveals recurring seasonal patterns of virioplankton dynamics in the northwestern Sargasso Sea.

*ISME J.***6,**273–284 (2011). - 22.
Fuhrman, J. A.

*et al.*Annually reoccurring bacterial communities are predictable from ocean conditions.*Proc. Natl Acad. Sci. USA***103,**13104–13109 (2006). - 23.
De Corte, D., Sintes, E., Yokokawa, T., Reinthaler, T. & Herndl, G. J. Links between viruses and prokaryotes throughout the water column along a north Atlantic latitudinal transect.

*ISME J.***60,**1566–1577 (2012). - 24.
Li, W. K. W. & Dickie, P. M. Monitoring phytoplankton, bacterioplankton, and virioplankton in a coastal inlet (Bedford Basin) by flow cytometry.

*Cytometry***440,**236–246 (2001). - 25.
Yang, Y., Yokokawa, T., Motegi, C. & Nagata, T. Large-scale distribution of viruses in deep waters of the pacific and southern oceans.

*Aquatic Microbial Ecol.***710,**193–202 (2013). - 26.
Clasen, J. L., Brigden, S. M., Payet, J. P. & Suttle, C. A. Evidence that viral abundance across oceans and lakes is driven by different biological factors.

*Freshwater Biol.***53,**1090–1100 (2008). - 27.
Balsom, A. L.

*Macroinfaunal Community Composition and Biomass, and Bacterial and Viral Abundances from the Gulf of Alaska to the Canadian Archipelago: a Biodiversity Study*. MSc thesis, Univ. Tennessee (2003). - 28.
Strzepek, R. F.

*et al.*Spinning the ‘ferrous wheel’: the importance of the microbial community in an iron budget during the FeCycle experiment.*Global Biogeochem. Cycles*GB4S26 (2005).**19,** - 29.
Matteson, A. R.

*et al.*Production of viruses during a spring phytoplankton bloom in the South Pacific Ocean near New Zealand.*FEMS Microbiol. Ecol.***79,**709–719 (2012). - 30.
Rowe, J. M.

*et al.*Constraints on viral production in the Sargasso Sea and North Atlantic.*Aquatic Microbial Ecol.***520,**233–244 (2008). - 31.
Wilhelm, S. W.

*et al.*UV radiation induced DNA damage in marine viruses along a latitudinal gradient in the southeastern Pacific Ocean.*Aquatic Microbial Ecol.***310,**1–8 (2003). - 32.
Wang, K., Eric Wommack, K. & Chen, F. Abundance and distribution of

*Synechococcus*spp. and cyanophages in the Chesapeake Bay.*Appl. Environ. Microbiol.***770,**7459–7468 (2011). - 33.
Suttle, C. A. Marine viruses—major players in the global ecosystem.

*Nature Rev. Microbiol.***5,**801–812 (2007). - 34.
Danovaro, R.

*et al.*Major viral impact on the functioning of benthic deep-sea ecosystems.*Nature***454,**1084–1U27 (2008). - 35.
Brum, J. R. & Sullivan, M. B. Rising to the challenge: accelerated pace of discovery transforms marine virology.

*Nature Rev. Microbiol.***130,**147–159 (2015). - 36.
Weitz, J. S.

*et al.*A multitrophic model to quantify the effects of marine viruses on microbial food webs and ecosystem processes.*ISME J.***90,**1352–1364 (2015). - 37.
Suttle, C. A. & Chan, A. M. Marine cyanophages infecting oceanic and coastal strains of

*Synechococcus*: abundance, morphology, cross-infectivity and growth characteristics.*Marine Ecol. Progr. Ser.***92,**99–109 (1993). - 38.
De Paepe, M. & Taddei, F. Viruses’ life history: towards a mechanistic basis of a trade-off between survival and reproduction among phages.

*PLoS Biol.***4,**e193 (2006). - 39.
Thingstad, T. F. & Lignell, R. Theoretical models for the control of bacterial growth rate, abundance, diversity and carbon demand.

*Aquatic Microbial Ecol.***13,**19–27 (1997). - 40.
Giovannoni, S., Tempterton, B. & Zhao, Y.

*Nature***499,**E4–E5 (2013). - 41.
Carreira, C., Larsen, M., Glud, R. N., Brussaard, C. P. D. & Middelboe, M. Heterogeneous distribution of prokaryotes and viruses at the microscale in a tidal sediment.

*Aquatic Microbial Ecol.***69,**183–192 (2013). - 42.
Bratbak, G. & Heldal, M. in

*Molecular Ecology of Aquatic Microbes*Vol. 238 (ed. Joint, I.) 249–264 (Springer-Verlag, 1995). - 43.
Williamson, S. J., Houchin, L. A., McDaniel, L. & Paul, J. H. Seasonal variation in lysogeny as depicted by prophage induction in Tampa Bay, Florida.

*Appl. Environ. Microbiol.***68,**4307–4314 (2002). - 44.
Hatton, I. A.

*et al.*The predator–prey power law: biomass scaling across terrestrial and aquatic biomes.*Science***349,**aac6284 (2015).

## Acknowledgements

This work was supported by National Science Foundation (NSF) grants OCE-1233760 (to J.S.W.) and OCE-1061352 (to A.B. and S.W.W.), a Career Award at the Scientific Interface from the Burroughs Wellcome Fund (to J.S.W.) and a Simons Foundation SCOPE grant (to J.S.W.). This work was conducted as part of the Ocean Viral Dynamics Working Group at the National Institute for Mathematical and Biological Synthesis, sponsored by the National Science Foundation through NSF Award DBI-1300426, with additional support from The University of Tennessee, Knoxville.

## Author information

## Affiliations

### School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30332, USA

- Charles H. Wigington
- & Joshua S. Weitz

### Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, Arizona 86011, USA

- Derek Sonderegger

### Department of Biological Oceanography, Royal Netherlands Institute for Sea Research (NIOZ), 1790 AB Den Burg, Texel, The Netherlands

- Corina P. D. Brussaard

### Department of Aquatic Microbiology, Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, 1090 GE, Amsterdam, The Netherlands

- Corina P. D. Brussaard

### Department of Microbiology, The University of Tennessee, Knoxville, Tennessee 37996, USA

- Alison Buchan
- & Steven W. Wilhelm

### Department of Earth, Ocean and Atmospheric Sciences, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada

- Jan F. Finke
- & Curtis A. Suttle

### Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA

- Jed A. Fuhrman

### Department of Biology, Indiana University, Bloomington, Indiana 47405, USA

- Jay T. Lennon

### Marine Biological Section, Department of Biology, University of Copenhagen, DK-3000, Helsingør, Denmark

- Mathias Middelboe

### Department of Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada

- Curtis A. Suttle

### Department of Botany, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada

- Curtis A. Suttle

### Program in Integrated Microbial Diversity, Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, Canada

- Curtis A. Suttle

### Geophysical Fluid Dynamics Laboratory, Princeton, New Jersey 08540, USA

- Charles Stock

### Sir Alister Hardy Foundation for Ocean Science, The Laboratory, Citadel Hill, Plymouth PL1 2PB, UK

- William H. Wilson

### Plant and Soil Sciences, Delaware Biotechnology Institute, Delaware Technology Park, Newark, Delaware 19711, USA

- K. Eric Wommack

### School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA

- Joshua S. Weitz

## Authors

### Search for Charles H. Wigington in:

### Search for Derek Sonderegger in:

### Search for Corina P. D. Brussaard in:

### Search for Alison Buchan in:

### Search for Jan F. Finke in:

### Search for Jed A. Fuhrman in:

### Search for Jay T. Lennon in:

### Search for Mathias Middelboe in:

### Search for Curtis A. Suttle in:

### Search for Charles Stock in:

### Search for William H. Wilson in:

### Search for K. Eric Wommack in:

### Search for Steven W. Wilhelm in:

### Search for Joshua S. Weitz in:

### Contributions

C.H.W. developed the code and implemented all scripts, analysed data, performed statistical analysis and contributed to writing the manuscript. D.S. developed and reviewed the code, analysed data, performed statistical analysis and provided feedback on the manuscript. A.B., J.F., J.T.L., M.M., C.A.S., C.S., W.H.W. and K.E.W. contributed to the design and implementation of the study, the assessment and collection of data sets, and provided feedback on the manuscript. C.P.D.B. and J.F.F. contributed to the assessment and collection of data sets and provided feedback on the manuscript. S.W.W. co-led the design of the study, led the data collection and assessment component, and contributed to writing the manuscript. J.S.W. co-led the design of the study, led the code and statistical analysis component, and wrote the manuscript.

### Competing interests

The authors declare no competing financial interests.

## Corresponding authors

Correspondence to Steven W. Wilhelm or Joshua S. Weitz.

## Supplementary information

## PDF files

- 1.
### Supplementary Information

Containing Supplementary Text, Tables 1-6, Figures 1-6 and References

## Excel files

- 1.
### Supplementary Data 1

Supplementary Dataset