Distinct patterns of within-host virus populations between two subgroups of human respiratory syncytial virus

Lin, Gu-Lung; Drysdale, Simon B.; Snape, Matthew D.; O’Connor, Daniel; Brown, Anthony; MacIntyre-Cockett, George; Mellado-Gomez, Esther; de Cesare, Mariateresa; Bonsall, David; Ansari, M. Azim; Öner, Deniz; Aerssens, Jeroen; Butler, Christopher; Bont, Louis; Openshaw, Peter; Martinón-Torres, Federico; Nair, Harish; Bowden, Rory; Golubchik, Tanya; Pollard, Andrew J.

doi:10.1038/s41467-021-25265-4

Download PDF

Article
Open access
Published: 26 August 2021

Distinct patterns of within-host virus populations between two subgroups of human respiratory syncytial virus

Nature Communications volume 12, Article number: 5125 (2021) Cite this article

5858 Accesses
14 Citations
25 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 07 October 2021

This article has been updated

Abstract

Human respiratory syncytial virus (RSV) is a major cause of lower respiratory tract infection in young children globally, but little is known about within-host RSV diversity. Here, we characterised within-host RSV populations using deep-sequencing data from 319 nasopharyngeal swabs collected during 2017–2020. RSV-B had lower consensus diversity than RSV-A at the population level, while exhibiting greater within-host diversity. Two RSV-B consensus sequences had an amino acid alteration (K68N) in the fusion (F) protein, which has been associated with reduced susceptibility to nirsevimab (MEDI8897), a novel RSV monoclonal antibody under development. In addition, several minor variants were identified in the antigenic sites of the F protein, one of which may confer resistance to palivizumab, the only licensed RSV monoclonal antibody. The differences in within-host virus populations emphasise the importance of monitoring for vaccine efficacy and may help to explain the different prevalences of monoclonal antibody-escape mutants between the two subgroups.

Long COVID: major findings, mechanisms and recommendations

Article 13 January 2023

Hannah E. Davis, Lisa McCorkell, … Eric J. Topol

Age-specific nasal epithelial responses to SARS-CoV-2 infection

Article Open access 15 April 2024

Maximillian N. J. Woodall, Ana-Maria Cujba, … Claire M. Smith

Mechanisms of SARS-CoV-2 entry into cells

Article 05 October 2021

Cody B. Jackson, Michael Farzan, … Hyeryun Choe

Introduction

Human respiratory syncytial virus (RSV) is the leading cause of lower respiratory tract infection (LRTI) in young children, globally responsible for around 33 million episodes of LRTI in children under 5 years of age annually with a disproportionately high burden in infants younger than 1 year of age¹. Repeated infection is common throughout life², usually resulting in mild symptoms, but it can also cause serious disease in older (age ≥65 years) or immunocompromised adults and people with chronic cardiopulmonary disease³. Despite decades of effort, there is no efficacious antiviral for treatment or licensed vaccine to prevent RSV infection, and thus the standard of care is supportive management only. Palivizumab, an RSV-specific humanised monoclonal antibody, is the only available immunoprophylactic agent. It requires multiple administrations over the RSV season and is very expensive, so its use is limited to the highest-risk populations, namely infants born preterm and those with congenital heart disease, chronic pulmonary disorders or severe combined immunodeficiency⁴.

RSV is a negative-sense single-stranded RNA virus with a genome containing ten genes. The F gene encodes the fusion (F) glycoprotein, which mediates the fusion of host cell and viral membranes. The F protein is the main target for antibody-mediated neutralisation, and has been the focus of the development of vaccines and monoclonal antibodies⁵. Through the fusion process, the F protein changes from the prefusion to postfusion conformation. Several antigenic sites (neutralising epitopes in particular) have been located on the surface of the F protein. Antibodies exclusively targeting prefusion-specific antigenic sites (e.g. sites $\varnothing $ and V) are more potent than those targeting sites that can be found in both conformations (e.g. sites I, II, IV)⁶. Nirsevimab (MEDI8897), a recombinant human monoclonal antibody currently in phase 3 clinical trials, exclusively targets antigenic site $\varnothing $⁷, and suptavumab (REGN2222), another prefusion-specific monoclonal antibody, binds antigenic site V⁸. Palivizumab and its affinity-enhanced variant, motavizumab⁹, target antigenic site II, and antibody 101F binds antigenic site IV¹⁰. Mutations in the antigenic sites that confer resistance to monoclonal antibodies have been identified. For example, mutants with N262S/Y, N268I, K272E/N/M/T/Q or S275F/L in the F protein are less susceptible to palivizumab^11,12,13, and nirsevimab has reduced neutralising activity against mutants with N67I/N208Y, N208S/D, K68N/N201S or K68N/N208S in the F protein⁷.

The G gene encodes the attachment (G) glycoprotein, a transmembrane protein responsible for viral attachment. The extracellular portion (ectodomain) of the G protein consists of two hypervariable mucin-like regions flanking a conserved central domain (CCD)¹⁴. The CCD, containing antigenic sites γ1 and γ2, has been shown to be a target for neutralising antibodies¹⁵ and is another focus of vaccine development^16,17. Outside the CCD, the mucin-like regions also have multiple antigenic sites though less well-defined¹⁸. The mucin-like region II (second hypervariable region) has been shown to have hypermutation at the population level and has thus been used widely in phylogenetic analyses¹⁹.

The two subgroups of RSV (A and B) co-circulate in epidemics, and both exhibit rapid evolutionary dynamics²⁰. Molecular epidemiology and evolutionary dynamics of RSV have been extensively studied at the consensus level; however, little is known about virus populations in each infected individual (i.e. within-host or intrahost virus diversity). Using high-throughput whole-genome sequencing, it is now possible to sequence viruses in sufficient depth to obtain a complete picture of within-host populations. A previous study showed that within-host RSV diversity increased in an immunocompromised infant with persistent RSV infection following a haematopoietic stem cell transplant, and palivizumab escape mutants emerged after multiple administrations of this drug²¹. Another study demonstrated that RSV-A exhibited greater within-host virus diversity in experimentally infected adults than in naturally infected infants²². However, these results were limited to RSV-A infection and did not look at natural infections in adult populations. Analysing within-host virus genetic diversity in infections that represent general seasonal epidemics can aid understanding of the patterns of virus evolution and its driving forces, informing the development of preventative and treatment measures.

In this study, we seek to characterise within-host RSV populations for the two subgroups, RSV-A and RSV-B, using deep sequencing of samples collected from participants in three prospective clinical studies. We find that RSV-B exhibits greater within-host diversity than RSV-A, with two RSV-B consensus strains and one RSV-B minor variant likely conferring resistance to nirsevimab or palivizumab. We also show that temporal changes of intrahost viral populations follow stochastic patterns. Our work highlights the importance of continued genetic surveillance of RSV to ensure the effectiveness of future RSV vaccines and therapeutics.

Results

Sample population

We sequenced RSV from 858 nasopharyngeal swabs collected from 459 RSV-infected patients in the United Kingdom, Spain and the Netherlands during 2017–2020. Of these, 327 samples had sufficient viral load to generate more than 10,000 unique (deduplicated) RSV reads. After removing five samples containing both RSV-A and RSV-B, 322 samples were included in the within-host virus diversity analysis. Sequencing was carried out in four batches, with 11, 113, 41 and 157 of the included samples from each batch respectively (Supplementary Table 1). The 322 samples were collected from 267 different participants, among which 34 participants had multiple samples (mean 2.6, range 2–5) collected on different days (ranging from 1 to 8 days apart).

Cumulative minor allele frequencies and minor variants

Genomic positions with a read depth of less than 200 were excluded from the analysis. Nearly 90% of the samples had ≥80% of the genome passing this threshold. Three samples had a significantly high mean cumulative minor allele frequency (MAF) per sample: 0.52% (from an RSV-A-infected infant; batch 4), 0.19% (from an RSV-B-infected adult; batch 2) and 0.17% (from an RSV-B-infected infant; batch 4). These samples presumably represented a real or artefactual mixture of genetically distinct strains of the same RSV subgroup and were thus excluded from the following analysis. The sources and sequencing yields of the remaining 319 samples (collected from 264 participants) are shown in Table 1.

Table 1 Characteristics of RSV samples by subgroup.

Full size table

The median of the mean cumulative MAF per sample was 0.039% (range 0.025–0.068%) for the 319 samples. The distributions of the mean cumulative MAF per sample were significantly different between samples from different sequencing batches (Supplementary Fig. 1a), likely due to the differences in the ratio of duplicate read counts to total RSV read counts (percent duplication rate) between batches (Supplementary Table 1). After adjusting for the observed batch effects (e.g. Supplementary Fig. 1b), RSV-B samples had a higher mean cumulative MAF per sample than RSV-A samples (median of the original data: 0.042% vs. 0.037%; multiple linear regression with batch and the number of unique RSV reads as covariates, P = 0.016; Mann–Whitney U-test on standardised data, P = 0.016).

On average, each sample had 3.7 minor variants (range 0–30; defined as variants with a frequency of ≥3%). Of the samples, 18.8% (60/319) did not have any minor variants. An inverse correlation was noted between the number of unique RSV reads and the number of minor variants (r = −0.41, P = 4.2 × 10⁻¹⁴; Supplementary Fig. 2), consistent with a greater variance of MAF when the sampling fraction was small (i.e. few unique reads were sequenced)²³. Variation rarely occurred at the same genomic position in different samples. Among all minor variants found in this study, only 5.9% (57/972) were shared by multiple samples (excluding 17 minor variants only shared by sequential samples from the same participants), usually no more than five samples. However, there was one minor variant shared by 59% (85/144) of the RSV-B samples, with a frequency between 3 and 11%. This minor variant had a G to A substitution at position 3403 of the L gene, causing an amino acid alteration from glutamic acid to lysine at position 1135 (E1135K) of the RNA-dependent RNA polymerase.

Potential antigenic variants

The sequences encoding the antigenic sites of the F protein were highly conserved at the consensus level in this study. However, two RSV-B isolates from two infant participants, both of whom had only one sample collected, had an A to T substitution at nucleotide position 204 of the F gene. This substitution results in an amino acid alteration from lysine to asparagine (K68N), which in a previous study was associated with a fourfold reduction in susceptibility to nirsevimab neutralisation in vitro⁷. No minor variant was found at this position in these two samples.

The frequencies and distribution of all minor variants across the coding sequence of the F gene are shown in Fig. 1a. There were one, eight, two and three minor variants identified in the antigenic sites $\varnothing $, II, IV and V of the F protein, respectively (Table 2). 0, 6.0% (6/100) and 1.6% (2/124) of the participants had potential antigenic variants (i.e. minor variants encoding a nonsynonymous substitution in the antigenic sites) in the 2017–18, 2018–19 and 2019–20 RSV seasons, respectively. One of these minor variants had two nucleotide substitutions with a frequency of ≥3% in a single codon, encoding an amino acid substitution from isoleucine to threonine at position 261 (I261T). Other minor variants identified in the antigenic sites were from different samples. To date, none of these variants have been reported to confer resistance to monoclonal antibodies.

**Fig. 1: Minor variants in the coding region of the F and G genes among 175 RSV-A and 144 RSV-B samples.**

Table 2 Characteristics of minor variants within the antigenic sites of the fusion protein.

Full size table

We also looked at the frequencies and distribution of minor variants in the coding region of the G gene (Fig. 1b). The median frequency of minor variants was significantly higher in the G gene than in the F gene, either at potential antigenic sites (median: 9.3% vs. 4.6%; Mann–Whitney U-test, P = 0.022) or across the whole coding sequences (median: 8.3% vs. 4.4%; Mann–Whitney U-test, P = 0.004), consistent with previous studies identifying the G gene as the most variable gene in the virus genome¹⁴. The median minor variant frequency in the mucin-like region II of the G gene (13.7%) was greater than that in the mucin-like region I (9.2%), which was greater than that in the CCD (4.0%). However, these differences were not statistically significant (Kruskal–Wallis test, P = 0.20).

Pairwise nucleotide diversity

Within-host virus genetic diversity was estimated as pairwise nucleotide diversity (see Methods). Pairwise nucleotide diversity did not correlate with the number of unique RSV reads after adjusting for the batch effects (Supplementary Table 2 and Supplementary Fig. 3a), but was highly consistent with the mean cumulative MAF per sample (r = 0.997, P < 2.2 × 10⁻¹⁶; Supplementary Fig. 3b). The median pairwise nucleotide diversity of the whole dataset was 0.0007 (range 0.0005–0.0014). Gene-wise comparisons showed that the L gene had significantly higher pairwise nucleotide diversity than the NS2, P, SH and G genes, but the other genes did not have significant differences in pairwise nucleotide diversity between each other (Supplementary Fig. 4). These significant differences were by definition due to the mean proportion of pairwise nucleotide differences at each genomic position within the L gene instead of the length of the L gene.

RSV-B had greater pairwise nucleotide diversity than RSV-A after adjusting for the batch effects (multiple linear regression, P = 0.044, Supplementary Table 2 and Fig. 2a), and older adults had a more diverse intrahost RSV-B population than infants (multiple linear regression, P = 0.0006, Supplementary Table 2 and Fig. 2b). The subgroup difference was still significant if excluding adult samples (Mann–Whitney U-tests on standardised data, P = 0.039). The number of RSV reads and the duration between symptom onset and sample collection were similar between both RSV subgroups and between both age groups. Samples collected from different countries or seasons or patients with different severity of RSV infections did not have significant differences in pairwise nucleotide diversity (Supplementary Table 2).

Genetic distance

Within-host diversity levels between samples were compared using pairwise Manhattan distances²⁴ at consensus-identical positions, where allele frequencies below the 3% threshold were converted to 0. In contrast, consensus variations between samples were compared using pairwise patristic distances, which are phylogenetic distances on RSV phylogenies (Supplementary Fig. 5). To eliminate the batch effects, we only included pairwise distances between samples in the second batch (n = 112; excluding one outlier). To reduce potential bias from geographical and temporal differences, only pairwise distances between samples from the same country and the same season were calculated.

Serial sample pairs (i.e. pairs with both samples collected from the same participant) had within-host diversity levels comparable to those of samples from different participants (range: 0–3.34 vs. 0–5.03), despite having identical or nearly identical consensus sequences, as indicated by their small patristic distances (range 2.0 × 10⁻⁶ − 7.5 × 10⁻⁵). Excluding the serial sample pairs, RSV-B sample pairs had significantly greater within-host diversity levels than RSV-A pairs (median: 1.24 vs. 0.86), whereas the comparison of consensus sequences showed the opposite effect (Fig. 2c, d). Pairwise patristic distances between RSV-A samples formed three clusters, corresponding to the three main clades of the phylogenetic tree (Supplementary Fig. 5a). When using all allele frequencies, including those below 3% MAF, to calculate Manhattan distances, RSV-B sample pairs still had significantly greater pairwise Manhattan distances than RSV-A pairs (median: 20.5 vs. 18.2, P = 8.2 × 10⁻⁵⁸; Supplementary Fig. 6).

Temporal change of intrahost virus population

Putting all samples together, standardised pairwise nucleotide diversity did not have a significant temporal change within 7 days of symptom onset (R² = 0.008; P = 0.122). For the 34 participants with multiple samples collected daily during hospitalisation, pairwise nucleotide diversity was also evaluated in each set of serially collected samples, excluding those sequenced in different batches (Fig. 3). No significant trend was noted either in each participant or when combining all samples and adjusting for the batch effects. The only exception was the samples from GB-058, where pairwise nucleotide diversity increased by 0.000063 daily (95% confidence interval, 0.000046 to 0.000080; P = 0.004). This patient was a 19-day-old preterm neonate (gestational age of 33 weeks 6 days) with severe RSV infection requiring intensive care and mechanical ventilation.

**Fig. 3: Temporal change of pairwise nucleotide diversity.**

The changes in minor variants and variant frequencies in the serial samples were also evaluated at polymorphic sites where minor alleles were identified at more than three time points (Fig. 4). Of these minor variants, 79% had a nonsynonymous substitution. Only one minor variant with a G to A substitution at position 3403 of the L gene from participant NL-091, which was shared by 71 participants (85 samples), remained above the 3% threshold throughout the sampling period. This patient was a 42-day-old previously healthy infant with severe RSV infection requiring intensive care and mechanical ventilation. All other variants (including the aforementioned variant in other participants) were only detected either early, late or intermittently during the course of sample collection.

**Fig. 4: Temporal change in minor alleles.**

Discussion

In this study, we sequenced 858 nasopharyngeal samples collected in three clinical studies during 2017–2020 and profiled within-host RSV populations from 319 samples. We demonstrated that RSV-B had greater within-host diversity than RSV-A, whereas RSV-A had greater consensus diversity than RSV-B. Two RSV-B isolates’ consensus sequences had a mutation in the F protein (K68N), previously associated with reduced susceptibility to nirsevimab neutralisation. Several other minor variants were also identified in the antigenic sites of the F protein. None of these variants have been reported before except for S255N²⁵, whose susceptibility to monoclonal antibodies has not been examined. Stochastic (random) patterns were found in the temporal changes of within-host virus diversity and minor variants.

Low input genetic material (i.e. viral load) has been shown to reduce the sensitivity and specificity of variant calling²⁶. In this study, we applied the quantitative methodology of targeted metagenomics to library construction and used the number of unique RSV reads as a proxy for viral load²⁷. The inclusion criterion of more than 10,000 unique RSV reads corresponded with a viral load of ~2.4 × 10⁶ copies/mL and above, sufficient input levels for accurate minority variant calling²⁸. Given a large number of samples in this study, batching was required for sequencing, resulting in variable percent duplication rates and hence some batch effects on diversity metrics. We adopted two approaches to account for the batch effects on the comparisons of mean cumulative MAF per sample and pairwise nucleotide diversity: (i) including batch as a regression covariate and (ii) standardising the values within each batch to z-scores (see Methods for details). Both methods showed the same significant findings, making cross-batch comparisons robust. To avoid any residual bias, for pairwise comparisons of genetic distances we used only samples from the same batch (batch 2), which had very high percent duplication rates and similar read counts for RSV-A and RSV-B (Table 1 and Supplementary Table 1), consistent with capture saturation, and from which we could be confident of recovering the full range of intrahost diversity.

The extent of intrahost virus diversity depends not only on the rate of virus evolution (partly associated with the ability of proofreading for viral replication errors) but also on the duration of infection. RNA viruses generally have a higher mutation rate than DNA viruses²⁹, and are usually not able to correct the errors of viral replication, which DNA viruses can³⁰. In our study, RSV had greater pairwise nucleotide diversity than has been reported for influenza virus, another RNA virus causing acute respiratory infection (range 0.0005–0.0014 vs. 0–0.0002³¹). RSV intrahost diversity appears to be comparable with, or slightly higher than, that of the DNA viruses in the family Herpesviridae, which cause chronic infections³², but up to one to two orders of magnitude lower than that of persistent RNA viruses (e.g. hepatitis C virus and human immunodeficiency virus) and persistent DNA viruses (e.g. hepatitis B virus), which generally have pairwise nucleotide diversity above 0.005³².

Neutralisation escape mutants have been isolated in 0.7% of immunoprophylaxis-naïve RSV-infected subjects¹³, 5–9% of RSV-breakthrough patients receiving palivizumab^12,33 and 8% of RSV-breakthrough cases receiving nirsevimab³⁴. In our study, isolates collected from 0.8% (2/264) of the immunoprophylaxis-naïve participants were found to contain a nirsevimab resistance-associated substitution at the consensus level. We also identified an RSV-B minor variant with an amino acid change from serine to proline at position 275 (S275P) of the F protein. Other amino acid substitutions at this position have demonstrated resistance to palivizumab (S275F/L)¹². Whether the mutation S275P also alters the neutralising activity of palivizumab requires further investigation; however, all three mutations at this position replaced a polar amino acid with a nonpolar one, which may result in significant conformational or functional changes. It is important to identify neutralisation escape mutants in immunoprophylaxis-naïve children in the era before RSV monoclonal antibodies become extensively used. It indicates the circulation of escape mutants in the community even though they generally have a selective disadvantage in the absence of monoclonal antibodies¹³.

Our findings that RSV-B had greater pairwise nucleotide diversity and pairwise Manhattan distances than RSV-A both indicate that, at least in our dataset, RSV-B had a more diverse intrahost virus population than RSV-A. These results do not correlate with the duration between symptom onset and sample collection (Table 1), but are consistent with previous studies on global RSV strains, which found that RSV-B has a higher genome-wide evolutionary rate than RSV-A (7.47–7.76 × 10⁻⁴ substitutions/site/year vs. 5.68–6.47 × 10⁻⁴ substitutions/site/year)^35,36. This difference extends below the 3% threshold for minority variant calling (Supplementary Fig. 6). On the basis of these findings, we hypothesise that RSV-B is subject to greater immune pressure (e.g. by innate immunity, neutralising antibodies or T cell-mediated cytotoxicity) than RSV-A. This hypothesis is in line with previous studies showing that intrahost RSV diversity increased in response to an established immunity²¹ and that RSV-B has more amino acid alterations³⁷, predicted O glycosylation site changes³⁷ and indel mutations³⁶ in the G gene than RSV-A, suggesting a stronger selective pressure acting on RSV-B than on RSV-A.

RSV-B exhibited higher within-host diversity in older adults than in infants in response to different immune pressures between the two age groups. Of note, our dataset included only eight adults, and this comparison was limited to seven adult samples and 137 infant samples collected from those with RSV-B infection. Further studies enrolling more adults would be of value to delineate the difference in within-host diversity between different age groups. Furthermore, the temporal changes of pairwise nucleotide diversity and minor variants were stochastic within each infected individual, suggesting the driving force of evolutionary dynamics in global RSV populations is more likely from the selective pressure imposed at the population level than within an individual host. Only samples that yielded sufficient RSV reads were included in this study, so these temporal trends were confined to samples collected over a short time frame (mostly within 5 days of symptom onset). Nonetheless, a study on seasonal influenza virus also found limited evidence of positive selection at the within-host evolutionary scale²⁴.

The greater within-host virus diversity observed in RSV-B than in RSV-A warrants separate testing and close monitoring of the anti-RSV-B efficacy of vaccines and monoclonal antibodies that are being developed. This is because the development of several RSV vaccines in preclinical or clinical trials is based on the nucleotide sequences or structure of RSV-A strains^38,39,40. Some studies have also shown that RSV-B had more fixed mutations in the antigenic sites of the F protein at the consensus level⁴¹, resulting in more variable in vitro and clinical susceptibility to monoclonal antibodies than RSV-A. For example, in a phase 2b trial of nirsevimab, the drug had reduced neutralising activity against two RSV-B isolates collected from its recipients; one had a mutation of N208S and the other had multiple mutations of I64T, K68E, I206M and Q209R in the F protein³⁴. A phase 3 trial of another investigational RSV monoclonal antibody, suptavumab, failed to meet its primary end point because all RSV-B strains identified in the trial carried two amino acid changes in the F protein (L172Q and S173L), conferring resistance to the drug⁸. All RSV-B samples in our study also harboured these two amino acid substitutions, except for one that encoded isoleucine instead of leucine at position 173 (a nonpolar-to-nonpolar substitution).

We excluded genomic positions where consensus bases were different from the calculation of Manhattan distance, to ensure that between-host genetic distance would be driven by differences in minor alleles rather than differences at the consensus level²⁴. We found that, outside the consensus-different positions, serial samples from the same individual did not have a shorter pairwise Manhattan distance than that of a randomly taken between-host pair from the same country and season. This methodology change makes our results robust to inter-host variation, in contrast to previous studies on influenza virus and RSV, where distance metrics were largely driven by consensus differences^42,43.

Our findings suggest that RSV-B has a more diverse within-host population than RSV-A, likely driven by selection pressure at the host-population level. This difference between the two subgroups warrants close monitoring of vaccine efficacy and emergence of neutralisation escape variants.

Methods

Sample collection

Nasopharyngeal swabs were collected from patients with respiratory symptoms under 1 year old or over 60 years old, from London and Oxford, United Kingdom, Santiago de Compostela, Spain and Utrecht, the Netherlands, during 2017–2020. These patients were enrolled in three clinical studies of the REspiratory Syncytial virus Consortium in EUrope project (RESCEU, ClinicalTrials.gov identifiers: NCT03627572⁴⁴, NCT03756766⁴⁵ and NCT03621930⁴⁶), a European multicentre project investigating epidemiological, virological and immunological characteristics of RSV infection. None of these participants had received any RSV monoclonal antibody or investigational vaccine. RSV infection was diagnosed using molecular point-of-care testing on the Alere^TM i RSV platform (Abbott, Illinois, US) in infant participants and on the GeneXpert^Ⓡ influenza/RSV system (Cepheid, California, US) in adult participants in a community setting, and using antigen and/or PCR tests at a central laboratory in a hospital setting. A nasopharyngeal swab was collected from each participant within 7 days of symptom onset, and daily swabs were also collected from RSV-positive hospitalised infant participants where possible until hospital discharge. After collection, swabs were immersed in an M4RT^Ⓡ transport medium, aliquoted, and frozen at −80 ^∘C until use.

The severity of an RSV infection was defined using the ReSVinet scale⁴⁷ in infants. This scale accounts for several clinical variables, including feeding intolerance, medical intervention, respiratory difficulty, respiratory frequency, apnoea, general condition and fever. The score ranges from 0 to 20; a score of 0–7 was defined as mild, a score of 8–13 as moderate and a score of 14–20 as severe. In older adults, those who did not require any treatment or medical attendance were defined as having mild disease, those requiring hospitalisation were defined as having severe disease and the rest were defined as having a moderate RSV disease.

These clinical studies were conducted in accordance with the provisions of the Declaration of Helsinki and were approved by the relevant ethics committees at each site, including the University of Oxford, the Health Research Authority (IRAS IDs: 224156 and 231136), the NHS National Research Ethics Service Oxfordshire Committee A (reference number: 15/SC/0335), the South Central and Hampshire A Research Ethics Committee (reference number: 17/SC/0522) and the London-Central Research Ethics Committee (reference number: 17/LO/1210) in the UK; Hospital Clínico Universitario de Santiago de Compostela, and Comité de Ética de la Investigación de Santiago-Lugo (reference number: 2017/395) in Spain; the Medical Ethical Committee, University Medical Center Utrecht (reference number: 17/563) and the Ethical Review Authority (reference number: NL60910.041.17) in the Netherlands. All adult participants and the parents or guardians of all infant participants provided written informed consent.

Nucleic acid isolation and whole-genome sequencing

All RSV-positive samples were selected for whole-genome sequencing. Nucleic acid isolation, library construction and sequencing were performed in four different batches. To minimise the risk of RNA degradation, nucleic acid was extracted locally from primary samples, and the extractions were scheduled as close as practical to the time of sequencing.

Total nucleic acid extraction was carried out using the NucliSENS^Ⓡ easyMAG^Ⓡ system (BioMérieux, Marcy-l’Étoile, France), following the manufacturer’s instructions. Wherever possible, 500 μL of each sample was used to get 25 μL eluate in the first and fourth batches, and 35 μL in the second and third batches.

Sequencing libraries were constructed using the methodology of targeted metagenomics²⁷, a modification of the veSEQ-HIV protocol⁴⁸. A 12-μL aliquot of each nucleic acid sample was first concentrated to 3 μL with RNAClean XP magnetic beads (Beckman Coulter, California, United States). Dual-indexed libraries for Illumina sequencing were then constructed using the SMARTer Stranded Total RNA-Seq Kit v2 - Pico Input Mammalian (Takara Bio USA, California, United States), where first-strand reverse transcription was primed with tagged random hexamers and double-stranded cDNA was synthesised with sets of i5 and i7 index primers, as previously described elsewhere⁴⁹. These gave unique dual indexing (UDI) for the samples, thus minimising the risk of index misassignment during sequencing. After 12 cycles of PCR amplification of the cDNA, 10 μL of each library was pooled and purified using AMPure XP (Beckman Coulter). A 750-ng aliquot was taken from the pool and captured using a predesigned SureSelect RNA Target Enrichment multi-pathogen probe set (Agilent, California, United States). This probe set (each 120 nucleotides long) targeted more than 100 pathogenic bacteria and viruses, including both RSV-A and RSV-B⁵⁰. Sixteen cycles of PCR were performed for post-capture amplification, and the final product was purified by AMPure XP.

Sequencing was performed on the Illumina MiSeq platform (Illumina, California, US) with the MiSeq Reagent Kit v3 (600-cycle) for the first and third batches, generating 265-bp and 300-bp paired-end reads, respectively. The second and fourth batches were sequenced on the Illumina NovaSeq 6000 system with the NovaSeq 6000 SP Reagent Kit v1.5 (300-cycle), generating 151-bp paired-end reads.

Genome reconstruction

The first six bases of read 1 and the first three bases of read 2 were clipped off to remove random hexamer primers and the SMARTer adaptor sequences, respectively. An extra three bases at the 5′ end of MiSeq-generated read 2 were also cut off as they had reduced quality. Trimmomatic (v0.39)⁵¹ was then used to trimmed off adaptor sequences and low-quality bases with a Phred score below 20 (option: Adaptors:2:10:7:1:true LEADING:20 TRAILING:20 SLIDINGWINDOW:4:20 MINLEN:50). De novo assembly of the trimmed reads was carried out using both IVA (v1.0.8)⁵² and SPAdes (v3.14.1)⁵³, in each case selecting the contig sequences with a higher N50 for genome reconstruction using shiver⁵⁴. Internally, BLASTN (v2.7.1+)⁵⁵ was used for read and contig classification, MAFFT (v7.471)⁵⁶ was used for sequence alignment and Bowtie 2 (v2.4.1)⁵⁷ was used for read alignment (option: --very-sensitive-local). A minimum base quality of 35 and mapping quality of 30 were required for a base or an alignment to be counted as mapped. Mapped RSV reads were deduplicated with Picard MarkDuplicates (v2.18.14, https://broadinstitute.github.io/picard/). Pre-deduplicated per-position mapped read counts, generated by shiver, were used for downstream within-host virus diversity analysis.

Within-host virus diversity analysis

Only samples generating more than 10,000 unique (i.e. deduplicated) RSV reads and containing a single subgroup of RSV were included in within-host virus genetic diversity analysis. We have previously shown that RSV viral load highly correlates with the number of unique RSV reads generated by this sequencing method²⁷, consistent with high-quality RNA being recovered in a quantitative way. Ten thousand unique RSV reads correspond to a viral load of ~2.4 × 10⁶ copies/mL. Allele frequencies were calculated at each genomic position, excluding those supported by fewer than 200 reads. The choice of this cut-off was based on a predefined criterion that 90% of the included samples had at least 80% of the genome fulfilling this cut-off (Supplementary Fig. 7). Cumulative MAF was defined as 1 minus major allele frequency, and polymorphic sites were those with a cumulative MAF of ≥3%. Mean cumulative MAF per sample was calculated as the sum of cumulative MAF at each genomic position divided by the total number of positions. Minor variants, or intrahost single nucleotide variants, were defined as variants with an allele frequency of ≥3% and <50%.

Intrahost virus diversity was estimated as pairwise nucleotide diversity (π)⁵⁸. The proportion of pairwise nucleotide differences (D) at each genomic position was calculated as

$${D}_{i}=\frac{{A}_{i}\times {C}_{i}+{A}_{i}\times {G}_{i}+{A}_{i}\times {T}_{i}+{C}_{i}\times {G}_{i}+{C}_{i}\times {T}_{i}+{G}_{i}\times {T}_{i}}{({N}_{i}^{2}-{N}_{i})/2}$$

(1)

where A_i, C_i, G_i and T_i represent the copy number of allele A, C, G and T, respectively, and N_i is the total count of the four alleles (i.e. depth of coverage) at a given locus i, so N_i = A_i + C_i + G_i + T_i. Loci with a total count of less than 200 were excluded. Pairwise nucleotide diversity across a genome (π) was then calculated as

$$\pi =\mathop{\sum }\limits_{i=1}^{L}\frac{{D}_{i}}{L}$$

(2)

where L is the number of genomic positions with a read depth of at least 200×.

Manhattan (L1-norm) distance was used to compare within-host diversity levels between samples, calculated as

$${d}_{i}({{{{{\bf{p}}}}}},{{{{{\bf{q}}}}}})=\mathop{\sum }\limits_{k=1}^{4}\left|{{{{{{\bf{p}}}}}}}_{k}-{{{{{{\bf{q}}}}}}}_{k}\right|$$

(3)

$$M=\mathop{\sum }\limits_{i=1}^{N}{d}_{i}\times \frac{S}{N}$$

(4)

where d_i is the distance between two samples at a given locus i with vectors p and q containing relative frequencies of four possible alleles (i.e. A, C, G and T), M is the Manhattan distance between the coding sequences of two samples, N is the number of coding sequence positions where both samples have the same consensus base and a read depth of at least 200× and S is the total length of the coding sequence. To remove potential background noise in Manhattan distance calculations, allele frequencies of <3% were changed to 0, and those of >97% were changed to 100%.

Nucleotide positions were numbered from the first base of the coding sequence of each gene according to the NCBI reference sequences with the accession numbers of NC_038235 and NC_001781 for RSV-A and RSV-B, respectively. Amino acid positions were numbered from the first methionine of each protein according to the same NCBI reference sequences.

Phylogeny reconstruction

Maximum likelihood phylogenies of consensus coding sequences, supported by at least two unique (deduplicated) RSV reads, were estimated using RAxML (v8.2.12)⁵⁹ with the general time-reversible nucleotide substitution model and gamma-distributed rate heterogeneity. Bootstrapping with 1000 replicates was used to assess the robustness of tree topologies. Pairwise patristic distances were calculated from the maximum-likelihood trees using the cophenetic function of the R package ape (v5.4-1)⁶⁰. Phylogenetic trees were visualised using the R package ggtree (v2.2.4)⁶¹.

Statistical analysis

Continuous variables were summarised using mean, median, maximum and minimum. All comparisons of continuous variables between groups were conducted by two-tailed Mann–Whitney U-tests (two groups) or Kruskal–Wallis tests (three groups). Post hoc application of the Benjamini–Hochberg procedure was used to control false discovery rates for multiple testing. Chi-square tests with Yates’ continuity correction were used for contingency analysis; Fisher’s exact tests were performed when the expected value of a cell was less than 5. Logistic regression was employed to model a binary dependent variable while adjusting for a covariate. Two-tailed Pearson correlation analysis was used to evaluate the relationship between two variables. Temporal changes of a variable were determined by ordinary least-squares linear regression. Two approaches were applied to account for batch effects on the comparisons of diversity metrics: (i) including batch as a regression covariate (e.g. regression of pairwise nucleotide diversity on sampling country, sampling season, RSV subgroup, RSV read count, participant age group, disease severity and ‘batch’ as in Supplementary Table 2); and (ii) standardising the values within each batch to z-scores, that is, to a mean of zero and a standard deviation of 1 (e.g. Mann–Whitney U-test on z-score standardised pairwise nucleotide diversity as in Fig. 2). Missing data were imputed using the aregImpute function, implemented in the R package Hmisc (v4.5-0)⁶². All statistical analyses were performed using R (v4.0.2)⁶³. P values or adjusted P values of less than 0.05 were considered to indicate statistical significance.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The sequencing read data generated in this study have been deposited in the European Nucleotide Archive under study accession PRJEB34042. The RSV genomic sequences generated in this study have been deposited in GenBank under accession numbers LR699315 LR699726, LR699734, LR699736-LR699744 and MZ515551-MZ516143. The RSV reference sequences used in this study are available in GenBank under accession numbers NC_038235 and NC_001781. The associated sample and de-identified clinical information used in this study is provided in Supplementary Data 1.

Change history

07 October 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-26291-y

References

Shi, T. et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: a systematic review and modelling study. Lancet 390, 946–958 (2017).
Article PubMed PubMed Central Google Scholar
Varga, S. M. & Braciale, T. J. The adaptive immune response to respiratory syncytial virus. Curr. Top. Microbiol. Immunol. 372, 155–171 (2013).
CAS PubMed Google Scholar
Falsey, A. R., Hennessey, P. A., Formica, M. A., Cox, C. & Walsh, E. E. Respiratory syncytial virus infection in elderly and high-risk adults. N. Engl. J. Med. 352, 1749–1759 (2005).
Article CAS PubMed Google Scholar
American Academy of Pediatrics. Updated guidance for palivizumab prophylaxis among infants and young children at increased risk of hospitalization for respiratory syncytial virus infection. Pediatrics 134, e620–e638 (2014).
Ruckwardt, T. J., Morabito, K. M. & Graham, B. S. Immunological lessons from respiratory syncytial virus vaccine development. Immunity 51, 429–442 (2019).
Article CAS PubMed Google Scholar
McLellan, J. S. Neutralizing epitopes on the respiratory syncytial virus fusion glycoprotein. Curr. Opin. Virol. 11, 70–75 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Q. et al. Prevalence and significance of substitutions in the fusion protein of respiratory syncytial virus resulting in neutralization escape from antibody medi8897. J. Infect. Dis. 218, 572–580 (2018).
Article CAS PubMed Google Scholar
Simões, E. A. F. et al. Suptavumab for the prevention of medically attended respiratory syncytial virus infection in preterm infants. Clin. Infect. Dis. (2020).
Wu, H. et al. Development of motavizumab, an ultra-potent antibody for the prevention of respiratory syncytial virus infection in the upper and lower respiratory tract. J. Mol. Biol. 368, 652–665 (2007).
Article CAS PubMed Google Scholar
Wu, S. J. et al. Characterization of the epitope for anti-human respiratory syncytial virus f protein monoclonal antibody 101f using synthetic peptides and genetic approaches. J. Gen. Virol. 88, 2719–2723 (2007).
Article CAS PubMed Google Scholar
Zhao, X., Chen, F. P., Megaw, A. G. & Sullender, W. M. Variable resistance to palivizumab in cotton rats by respiratory syncytial virus mutants. J. Infect. Dis. 190, 1941–1946 (2004).
Article CAS PubMed Google Scholar
Zhu, Q. et al. Analysis of respiratory syncytial virus preclinical and clinical variants resistant to neutralization by monoclonal antibodies palivizumab and/or motavizumab. J. Infect. Dis. 203, 674–682 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Q. et al. Natural polymorphisms and resistance-associated mutations in the fusion protein of respiratory syncytial virus (rsv): effects on rsv susceptibility to palivizumab. J. Infect. Dis. 205, 635–638 (2012).
Article CAS PubMed Google Scholar
Battles, M. B. & McLellan, J. S. Respiratory syncytial virus entry and how to block it. Nat. Rev. Microbiol. 17, 233–245 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fedechkin, S. O., George, N. L., Wolff, J. T., Kauvar, L. M. & DuBois, R. M. Structures of respiratory syncytial virus g antigen bound to broadly neutralizing antibodies. Sci. Immunol. 3, eaar3534 (2018).
Article PubMed PubMed Central Google Scholar
Power, U. F. et al. Safety and immunogenicity of a novel recombinant subunit respiratory syncytial virus vaccine (bbg2na) in healthy young adults. J. Infect. Dis. 184, 1456–1460 (2001).
Article CAS PubMed Google Scholar
Choi, Y. et al. Antibodies to the central conserved region of respiratory syncytial virus (rsv) g protein block rsv g protein cx3c-cx3cr1 binding and cross-neutralize rsv a and b strains. Viral Immunol. 25, 193–203 (2012).
CAS PubMed PubMed Central Google Scholar
Lee, J., Klenow, L., Coyle, E. M., Golding, H. & Khurana, S. Protective antigenic sites in respiratory syncytial virus g attachment protein outside the central conserved and cysteine noose domains. PLoS Pathog. 14, e1007262 (2018).
Article PubMed PubMed Central Google Scholar
Eshaghi, A. et al. Genetic variability of human respiratory syncytial virus a strains circulating in ontario: a novel genotype with a 72 nucleotide g gene duplication. PLoS ONE 7, e32807 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Peret, T. C., Hall, C. B., Schnabel, K. C., Golub, J. A. & Anderson, L. J. Circulation patterns of genetically distinct group a and b strains of human respiratory syncytial virus in a community. J. Gen. Virol. 79, 2221–2229 (1998).
Article CAS PubMed Google Scholar
Grad, Y. H. et al. Within-host whole-genome deep sequencing and diversity analysis of human respiratory syncytial virus infection reveals dynamics of genomic diversity in the absence and presence of immune pressure. J. Virol. 88, 7286–7293 (2014).
Article PubMed PubMed Central Google Scholar
Lau, J. W. et al. Deep sequencing of rsv from an adult challenge study and from naturally infected infants reveals heterogeneous diversification dynamics. Virology 510, 289–296 (2017).
Article CAS PubMed Google Scholar
Lythgoe, K. A. et al. Sars-cov-2 within-host diversity and transmission. Science 372, eabg0821 (2021).
Article CAS PubMed PubMed Central Google Scholar
McCrone, J. T. et al. Stochastic processes constrain the within and between host evolution of influenza virus. Elife 7, e35962 (2018).
Article PubMed PubMed Central Google Scholar
Tabor, D. E. et al. Global molecular epidemiology of respiratory syncytial virus from the 2017-2018 inform-rsv study. J. Clin. Microbiol. 59, e01828–20 (2020).
Article PubMed PubMed Central Google Scholar
McCrone, J. T. & Lauring, A. S. Measurements of intrahost viral diversity are extremely sensitive to systematic errors in variant calling. J. Virol. 90, 6884–6895 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lin, G. L. et al. Simultaneous viral whole-genome sequencing and differential expression profiling in respiratory syncytial virus infection of infants. J. Infect. Dis. 222, S666–S671 (2020).
Article CAS PubMed Google Scholar
Xue, K. S., Moncla, L. H., Bedford, T. & Bloom, J. D. Within-host evolution of human influenza virus. Trends. Microbiol. 26, 781–793 (2018).
Article CAS PubMed PubMed Central Google Scholar
Duffy, S., Shackelton, L. A. & Holmes, E. C. Rates of evolutionary change in viruses: patterns and determinants. Nat. Rev. Genet. 9, 267–276 (2008).
Article CAS PubMed Google Scholar
Sanjuan, R. & Domingo-Calap, P. Mechanisms of viral mutation. Cell. Mol. Life Sci. 73, 4433–4448 (2016).
Article CAS PubMed PubMed Central Google Scholar
Valesano, A. L. et al. Influenza b viruses exhibit lower within-host diversity than influenza a viruses in human hosts. J. Virol. 94, e01710–19 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cudini, J. et al. Human cytomegalovirus haplotype reconstruction reveals high diversity due to superinfection and evidence of within-host recombination. Proc. Natl Acad. Sci. USA 116, 5693–5698 (2019).
Article CAS PubMed PubMed Central Google Scholar
Papenburg, J. et al. Molecular evolution of respiratory syncytial virus fusion gene, canada, 2006-2010. Emerg. Infect. Dis. 18, 120–124 (2012).
Article PubMed PubMed Central Google Scholar
Griffin, M. P. et al. Single-dose nirsevimab for prevention of rsv in preterm infants. N. Engl. J. Med. 383, 415–425 (2020).
Article CAS PubMed Google Scholar
Tan, L. et al. The comparative genomics of human respiratory syncytial virus subgroups a and b: genetic variability and molecular evolutionary dynamics. J Virol. 87, 8213–8226 (2013).
Article CAS PubMed PubMed Central Google Scholar
Schobel, S. A. et al. Respiratory syncytial virus whole-genome sequencing identifies convergent evolution of sequence duplication in the c-terminus of the g gene. Sci. Rep. 6, 26311 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Matheson, J. W. et al. Distinct patterns of evolution between respiratory syncytial virus subgroups a and b from new zealand isolates collected over thirty-seven years. J. Med. Virol. 78, 1354–1364 (2006).
Article CAS PubMed Google Scholar
Smith, G. et al. Respiratory syncytial virus fusion glycoprotein expressed in insect cells form protein nanoparticles that induce protective immunity in cotton rats. PLoS ONE 7, e50852 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pierantoni, A. et al. Mucosal delivery of a vectored rsv vaccine is safe and elicits protective immunity in rodents and nonhuman primates. Mol. Ther. Methods Clin. Dev. 2, 15018 (2015).
Article PubMed PubMed Central Google Scholar
Crank, M. C. et al. A proof of concept for structure-based vaccine design targeting rsv in humans. Science 365, 505–509 (2019).
Article ADS CAS PubMed Google Scholar
Bin, L. et al. Emergence of new antigenic epitopes in the glycoproteins of human respiratory syncytial virus collected from a us surveillance study, 2015-17. Sci. Rep. 9, 3898 (2019).
Article ADS Google Scholar
Poon, L. L. et al. Quantifying influenza virus diversity and transmission in humans. Nat. Genet. 48, 195–200 (2016).
Article CAS PubMed PubMed Central Google Scholar
Githinji, G. et al. Assessing the utility of minority variant composition in elucidating RSV transmission pathways. Preprint at bioRxiv https://doi.org/10.1101/411512 (2018).
Wildenbeest, J. G. et al. Respiratory syncytial virus consortium in europe (resceu) birth cohort study: defining the burden of infant respiratory syncytial virus disease in europe. J. Infect. Dis. 222, S606–S612 (2020).
Article PubMed Google Scholar
Jefferies, K. et al. Presumed risk factors and biomarkers for severe respiratory syncytial virus disease and related sequelae: protocol for an observational multicenter, case-control study from the respiratory syncytial virus consortium in europe (resceu). J. Infect. Dis. 222, S658–S665 (2020).
Article CAS PubMed Google Scholar
Korsten, K. et al. Burden of respiratory syncytial virus infection in community-dwelling older adults in europe (resceu): an international prospective cohort study. Eur. Respir. J. 57, 2002688 (2021).
Article PubMed Google Scholar
Justicia-Grande, A. J. et al. Development and validation of a new clinical scale for infants with acute respiratory infection: the resvinet scale. PLoS ONE 11, e0157665 (2016).
Article PubMed PubMed Central Google Scholar
Bonsall, D. et al. A comprehensive genomics solution for HIV surveillance and clinical monitoring in low-income settings. J. Clin. Microbiol. 58, e00382–20 (2020).
Article CAS PubMed PubMed Central Google Scholar
Faircloth, B. C. & Glenn, T. C. Not all sequence tags are created equal: designing and validating sequence identification tags robust to indels. PLoS ONE 7, e42543 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Goh, C. et al. Targeted metagenomic sequencing enhances the identification of pathogens associated with acute infection. Preprint at bioRxiv https://doi.org/10.1101/716902 (2019).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hunt, M. et al. Iva: accurate de novo assembly of rna virus genomes. Bioinformatics 31, 2374–2376 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bankevich, A. et al. Spades: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Wymant, C. et al. Easy and accurate reconstruction of whole hiv genomes from short-read sequence data with shiver. Virus Evol. 4, vey007 (2018).
Article PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Katoh, K. & Standley, D. M. Mafft multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nelson, C. W. & Hughes, A. L. Within-host nucleotide diversity of virus populations: insights from next-generation sequencing. Infect. Genet. Evol. 30, 1–7 (2015).
Article CAS PubMed Google Scholar
Stamatakis, A. Raxml version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Paradis, E. & Schliep, K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in r. Bioinformatics 35, 526–528 (2019).
Article CAS PubMed Google Scholar
Yu, G., Smith, D. K., Zhu, H., Guan, Y. & Lam, T. T. Y. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol. Evol. 8, 28–36 (2017).
Article Google Scholar
Harrell Jr., F. E., with contributions from Charles Dupont & many others. Hmisc: Harrell miscellaneous. R Package Version 4.5-0 https://CRAN.R-project.org/package=Hmisc (2021).
R Core Team. R: a language and environment for statistical computing (R Foundation for Statistical Computing, 2018).

Download references

Acknowledgements

This work was supported by the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre, the NIHR Thames Valley and South Midlands Clinical Research Network, the British Research Council, and the REspiratory Syncytial virus Consortium in EUrope (RESCEU) project. RESCEU has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (grant number 116019). This Joint Undertaking receives support from the European Union Horizon 2020 Research and Innovation Programme and the European Federation of Pharmaceutical Industries and Associations.

Author information

Simon B. Drysdale
Present address: Paediatric Infectious Diseases Research Group, Institute for Infection and Immunity, St George’s, University of London, London, UK
Rory Bowden
Present address: Division of Advanced Technology and Biology, Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia
These authors jointly supervised this work: Tanya Golubchik, Andrew J Pollard

Authors and Affiliations

Oxford Vaccine Group, Department of Paediatrics, University of Oxford, Oxford, UK
Gu-Lung Lin, Simon B. Drysdale, Matthew D. Snape, Daniel O’Connor, Elizabeth Clutterbuck, Joseph McGinley & Andrew J. Pollard
NIHR Oxford Biomedical Research Centre, Oxford, UK
Gu-Lung Lin, Simon B. Drysdale, Matthew D. Snape, Daniel O’Connor & Andrew J. Pollard
Peter Medawar Building for Pathogen Research, University of Oxford, Oxford, UK
Anthony Brown
Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
George MacIntyre-Cockett, Esther Mellado-Gomez, Mariateresa de Cesare, David Bonsall, M. Azim Ansari & Rory Bowden
Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
David Bonsall & Tanya Golubchik
Translational Biomarkers, Infectious Diseases Therapeutic Area, Janssen Pharmaceutica NV, Beerse, Belgium
Deniz Öner & Jeroen Aerssens
Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
Christopher Butler
Department of Pediatrics, Wilhelmina Children’s Hospital, University Medical Center Utrecht, Utrecht, Netherlands
Louis Bont, Debby Bogaert & Joanne Wildenbeest
ReSViNET Foundation, Zeist, Netherlands
Louis Bont
National Heart and Lung Institute, Imperial College London, London, UK
Peter Openshaw, Ryan Thwaites & Dexter Wiseman
Translational Pediatrics and Infectious Diseases, Hospital Clínico Universitario de Santiago de Compostela, Santiago de Compostela, Spain
Federico Martinón-Torres
Genetics, Vaccines, Infectious Diseases, and Pediatrics Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain
Federico Martinón-Torres, Alberto Gómez-Carballa, Carmen Rodriguez-Tenreiro, Irene Rivero-Calle & Ana Dacosta-Urbieta
Centre for Global Health, Usher Institute, Edinburgh Medical School, University of Edinburgh, Edinburgh, UK
Harish Nair, Harry Campbell & Steve Cunningham
Queen’s Medical Research Institute, University of Edinburgh, Edinburgh, UK
Debby Bogaert
Centre for Health Economics Research and Modelling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium
Philippe Beutels
Department of Pediatrics, University of Turku, Turku University Hospital, Turku, Finland
Terho Heikkinen
National Institute for Public Health and the Environment, Bilthoven, Netherlands
Adam Meijer
Statens Serum Institut, Copenhagen, Denmark
Thea Kølsen Fischer
Department of Pulmonary Diseases, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
Maarten van den Berge
PENTA Foundation, Padua, Italy
Carlo Giaquinto
AstraZeneca, Gaithersburg, MD, US
Michael Abram
Pfizer, Pearl River, NY, US
Philip Dormitzer
GlaxoSmithKline, Potomac, MD, US
Sonia Stoszek
Sanofi Pasteur, Toronto, Ontario, Canada
Scott Gallichan
Novavax, Potomac, MD, US
Brian Rosen
Team-It Research, Barcelona, Spain
Eva Molero, Nuria Machin & Martina Spadetto

Authors

Gu-Lung Lin
View author publications
You can also search for this author in PubMed Google Scholar
Simon B. Drysdale
View author publications
You can also search for this author in PubMed Google Scholar
Matthew D. Snape
View author publications
You can also search for this author in PubMed Google Scholar
Daniel O’Connor
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Brown
View author publications
You can also search for this author in PubMed Google Scholar
George MacIntyre-Cockett
View author publications
You can also search for this author in PubMed Google Scholar
Esther Mellado-Gomez
View author publications
You can also search for this author in PubMed Google Scholar
Mariateresa de Cesare
View author publications
You can also search for this author in PubMed Google Scholar
David Bonsall
View author publications
You can also search for this author in PubMed Google Scholar
M. Azim Ansari
View author publications
You can also search for this author in PubMed Google Scholar
Deniz Öner
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen Aerssens
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Butler
View author publications
You can also search for this author in PubMed Google Scholar
Louis Bont
View author publications
You can also search for this author in PubMed Google Scholar
Peter Openshaw
View author publications
You can also search for this author in PubMed Google Scholar
Federico Martinón-Torres
View author publications
You can also search for this author in PubMed Google Scholar
Harish Nair
View author publications
You can also search for this author in PubMed Google Scholar
Rory Bowden
View author publications
You can also search for this author in PubMed Google Scholar
Tanya Golubchik
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Pollard
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

RESCEU Investigators

Harry Campbell
, Steve Cunningham
, Debby Bogaert
, Philippe Beutels
, Joanne Wildenbeest
, Elizabeth Clutterbuck
, Joseph McGinley
, Ryan Thwaites
, Dexter Wiseman
, Alberto Gómez-Carballa
, Carmen Rodriguez-Tenreiro
, Irene Rivero-Calle
, Ana Dacosta-Urbieta
, Terho Heikkinen
, Adam Meijer
, Thea Kølsen Fischer
, Maarten van den Berge
, Carlo Giaquinto
, Michael Abram
, Philip Dormitzer
, Sonia Stoszek
, Scott Gallichan
, Brian Rosen
, Eva Molero
, Nuria Machin
& Martina Spadetto

Contributions

G.-L.L., T.G. and A.J.P. conceived and designed the work. G.-L.L., S.B.D., M.D.S., D.Ö., J.A., C.B., L.B., P.O., F.M.-T., H.N. and A.J.P. conducted and supervised the clinical studies. M.A.A. designed the probe set that was used for capture. M.d.C., D.B. and R.B. designed the sequencing protocol. G.-L.L., A.B., G.M.-C., E.M.-G. and M.d.C. performed the experiments. G.-L.L., T.G., D.O’C. and A.J. analysed and interpreted the data. G.-L.L. drafted the manuscript and T.G., D.O’C. and A.J.P. substantively revised it. T.G. and A.J.P. supervised the work. All authors have approved the submitted version and agreed to submit the manuscript.

Corresponding author

Correspondence to Gu-Lung Lin.

Ethics declarations

Competing interests

S.B.D. has been an investigator for clinical trials of vaccines and antimicrobials for pharmaceutical companies including AstraZeneca, Merck and Janssen, and sits on an RSV advisory board for Sanofi Pastuer. M.A.A. is supported by a Sir Henry Dale Fellowship jointly funded by the Royal Society and Wellcome Trust (220171/Z/20/Z). D.Ö. and J.A. are employees of Janssen Pharmaceutica NV. F.M.-T. has received honoraria from GSK, Pfizer Inc., Sanofi Pasteur, MSD, Seqirus and Janssen for taking part in advisory boards and expert meetings and for acting as a speaker in congresses outside the scope of the submitted work. F.M.-T. has also acted as principal investigator in randomised controlled trials of the above-mentioned companies as well as Ablynx, Regeneron, Roche, Abbott, Novavax and MedImmune, with honoraria paid to his institution. F.M.-T. receives support for his research activities from the Instituto de Salud Carlos III (Proyecto de Investigación en Salud, Acción Estratégica en Salud): Fondo de Investigación Sanitaria (FIS;PI1601569/PI1901090) del plan nacional de I+D+I and ‘fondos FEDER’. A.J.P. is a National Institute for Health Research (NIHR) Senior Investigator with funding from the British Research Council. The remaining authors declare no competing interests. The views expressed in this article are those of the authors and may not be understood or quoted as being made on behalf of or reflecting the position of the organisations with which the authors are employed/affiliated.

Additional information

Peer review information Nature Communications thanks Rebecca Rockett, Michael Teng and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lin, GL., Drysdale, S.B., Snape, M.D. et al. Distinct patterns of within-host virus populations between two subgroups of human respiratory syncytial virus. Nat Commun 12, 5125 (2021). https://doi.org/10.1038/s41467-021-25265-4

Download citation

Received: 17 February 2021
Accepted: 21 July 2021
Published: 26 August 2021
DOI: https://doi.org/10.1038/s41467-021-25265-4

This article is cited by

Targeted metagenomics reveals association between severity and pathogen co-detection in infants with respiratory syncytial virus
- Gu-Lung Lin
- Simon B. Drysdale
- Andrew J. Pollard
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sample population

Cumulative minor allele frequencies and minor variants

Potential antigenic variants

Pairwise nucleotide diversity

Genetic distance

Temporal change of intrahost virus population

Discussion

Methods

Sample collection

Nucleic acid isolation and whole-genome sequencing

Genome reconstruction

Within-host virus diversity analysis

Phylogeny reconstruction

Statistical analysis

Reporting Summary

Data availability

Change history

07 October 2021

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

RESCEU Investigators

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links