Abstract
Coxsackievirus (CV)-B5 is a common human enterovirus reported worldwide; swine vesicular disease virus (SVDV) is a porcine variant of CV-B5. To clarify the transmission dynamics and molecular basis of host switching between CV-B5 and SVDV, we analysed and compared the VP1 and partial 3Dpol gene regions of these two viruses. Spatiotemporal dynamics of viral transmission were estimated using a Bayesian statistical inference framework. The detected selection events were used to analyse the key molecules associated with host switching. Analyses of VP1 sequences revealed six CV-B5 genotypes (A1–A4 and B1–B2) and three SVDV genotypes (I–III). Analyses of partial 3Dpol revealed five clusters (A–E). The genotypes evolved sequentially over different periods, albeit with some overlap. The major hub of CV-B5 transmission was in China whereas the major hubs of SVDV transmission were in Italy. Network analysis based on deduced amino acid sequences showed a diverse extension of the VP1 structural protein, whereas most sequences were clustered into two haplotypes in the partial 3Dpol region. Residue 178 of VP1 showed four epistatic interactions with residues known to play essential roles in viral host tropism, cell entry, and viral decoating.
Similar content being viewed by others
Introduction
Coxsackievirus (CV)-B5 is a small, naked, single positive-stranded RNA virus in the enterovirus (EV)-B species of the Picornaviridae family. Enteroviral infections are highly contagious and are typically disseminated via faecal-oral and respiratory routes. As in EV-B, infection with CV-B5 is typically self-limiting; however, it can cause potentially fatal conditions (e.g., encephalitis, myocarditis, and neonatal sepsis-like disease) and chronic autoimmune diseases (e.g., insulin-dependent diabetes mellitus)1,2. Approximately half of all documented CV-B5 infections have been in infants who are highly vulnerable to infections3,4. The distinctive pattern of CV-B5 circulation is characterized by a sudden peak in infectious activity every 3–6 years. In intervening years, infections are sporadic but generally occur annually5,6. In Taiwan, CV-B5 epidemics occurred in 1995, 2002, 2011, and 20153. Interestingly, heavy precipitation (130–200 mm) is a more important risk factor for EV infection (relative risk, RR = 2.45, 1.59–3.78) than for dengue fever (RR = 1.96, 1.53–2.52)7 because naked viruses such as EV are more stable during harsh weather conditions such as heavy rainfall compared to enveloped viruses such as dengue virus. For example, EVs can contaminate groundwater even after being washed out of their dominant areas by heavy rainfall. Although the prevalence of EV peaks in warmer seasons, factors such as global warming and migration resulting from economic or political events such as the Syrian conflict8 mandate year-round surveillance.
Swine vesicular disease (SVD) was first described in Italy in 1966. Since then, numerous outbreaks have occurred in Europe and Asia9,10,11. Swine vesicular disease is difficult to detect because, although morbidity varies by strain, symptoms are usually mild. Additionally, SVD infections are usually detected during outbreaks of foot and mouth disease, which is clinically indistinguishable from SVD. In 2015, SVD was deleted from the Office of International Epizootics Listed Diseases12. In Taiwan, surveillance data show that SVD infections occurred only sporadically during 1997–2000. The pathogenic cause of SVD is the SVD virus (SVDV), which comprises a single serotype. Due to the sequence homology of CV-B5 and SVDV and their antigenic cross-reactivity, SVDV is now classified as a porcine variant of CV-B513. Although pigs are currently the only known SVDV host, researchers hypothesize that SVDV evolved from a shared ancestor of CV-B5, and then crossed the species barrier from human to pig through adaptation10,11. Given that biological sequences (DNA, RNA, and protein) are footprints of evolution, sites of sequence variation between ancestor and recent strains reveal not only the evolutionary history of a virus, but also the sequence conserved patterns thought to have functional value for survival14,15. Owing to the close genetic relationship between CV-B5 and SVDV, a comparison of their sequence variation signatures could reveal the key molecular sites of cross-host species transmission.
The VP1 gene of EVs encodes outer capsid proteins that determine host range and receptor binding16. The RNA-dependent RNA polymerase (RdRp) is a 3Dpol gene product that plays an important hereditary role in RNA viruses not only because mutations result in the loss of proofreading activity, but also because RdRp mediates template switching, which is a well-known recombination mechanism in RNA viruses17. Both mutation and recombination contribute to the rapid evolution rate needed to maintain viral adaptability under selective pressure. Gene alignment can reveal the evolutionary history of a virus whereas analysis of amino acid (aa) sequence conservation can reveal not only the constraints on protein structure and function, but also the effects of evolutionary dynamics. Therefore, identifying sequence variations in the VP1 and 3Dpol regions can reveal the evolutionary characteristics of CV-B5 and SVDV.
Phylodynamic analysis can elucidate the epidemiological, immunological, and evolutionary interactions of a pathogen18, whereas selection detection can further reveal how the direction of evolution is affected by interactions between gene sequence and molecular function. Therefore, we hypothesized that combining phylodynamic and epistatic analyses would yield useful information about viral host barrier crossing and adaptation. To clarify the molecular characteristics of CV-B5 and SVDV and the spatiotemporal characteristics of their evolution, a Bayesian statistical inference framework was used to analyse VP1 and partial 3Dpol regions in both viruses. Relationships among genealogies were also depicted by haplotype network analysis, and patterns of variation associated with host range switching were analysed by comparing CV-B5 and SVDV sequences.
Results
VP1-based phylogenetic and demographic analyses of CV-B5 and SVDV
Three datasets were used to estimate spatiotemporal dynamics in the 849-nucleotide (-nt) VP1 region: a CV-B5 dataset (195 sequences isolated from 28 countries during 1952–2015, including 28 Taiwan strains isolated in this study), an SVDV dataset (53 sequences sampled from 14 countries during 1966–2006), and a combined dataset designated 248_data (195 CV-B5 sequences and 53 SVDV sequences). Supplementary Table 1 shows the model compositions used. The maximum clade credibility (MCC) tree and maximum likelihood (ML) tree for 248_data showed that the two main branches evolved from a common ancestor (Fig. 1 and Supplementary Fig. S1). Previous studies showed a bifurcating tree with two major circulating genogroups19,20. This study applied the genogroup A and B designations used by Henquell et al. because dichotomous grouping is simple and clear19. The genotype A strains were prevalent mainly in Asia and Europe, while the genotype B strains were prevalent mainly in America and Europe. The two main branches evolved continuously in separate subclusters. Genotype analysis was limited to sequence with higher than 15% similarity21 and nodes with high support values (posterior probability value, PP > 0.9 or bootstrap value >70%), criteria that are widely used to discriminate non-polio enterovirus genotypes22,23,24,25,26, including EV-B. Sequential evolution of six CV-B5 genotypes (A1−A4 and B1−B2) were observed in this study. In each genogroup, some ancestor strains were separated from the main subcluster. Compared with the main subcluster, these ancestor strains had a wider range of isolation years, wider distribution of isolation locations, lower sequence similarity, and/or lower nodal support values, and were assigned as 0. For example, genotype A successively yielded A0, A1−A4, and SVDV; and genotype B successively yielded B0, B1, and B2. All SVDV strains were rooted in a single branch of genogroup A and clustered together with genogroup A1. The SVDV was further subdivided into genotypes I−III. Although all 248 strains were rooted together with high support values in both Bayesian Markov chain Monte Carlo (BMCMC) tree and ML trees, these results indicate that SVDV evolved from CV-B5. However, support values on the rooted nodes of SVDV with genogroup A are low (Fig. 1 and Supplementary Fig. S1). Supplementary Figs 2 and 3 also show within-genotype similarities. Dominant sublineages may have circulated for several years before being replaced by newer sublineages with extended branches. Different sublineages may have co-circulated during a given period in the same location, especially in recent decades.
The MCC trees for the CV-B5 and 248_data datasets showed similar topologies and clustering. On rooted branches, both trees had a balanced structure indicating co-survival in bifurcating sub-branches. On terminal branches, both trees had an imbalanced structure indicating temporal and geographic clustering of CV-B5 isolates, especially in those isolated after 2000, with a ladder-like backbone (Fig. 1 and Supplementary Fig. 1). After the prototype strain of CV-B5 appeared in the US in 1952, it reappeared in France and Ecuador in the 1970s and later in Poland in 2001. This cluster, which had a low PP value and low sequence similarity, was not included in the GenBank after 2001. Genotype A1 strains were isolated from Asia (Kyrgyzstan and Pakistan, 1989−1992); whereas genotype A2 strains were isolated from Europe and Asia (1995–2003); genotype A3 strains were isolated from Europe (1996−2005); and genotype A4 strains were isolated from Asia and Europe (2002−2015). In genogroup B, genotype B1 strains had circulated in America and Europe (1970−1985), whereas genotype B2 strains had circulated only in Europe (1983−2011).
Unlike the dichotomous phylogeny of CV-B5, SVDV clustered monophyletically with genotype A1 with high support values. All three genotypes included strains isolated in Europe and Asia but sub-clustered by their isolation locations and timespans. Genotype I comprised strains isolated during 1966−1973, including the prototype strain KT284997 isolated in Italy (1966); Genotype II included strains prevalent during 1972–1992; Genotype ASIII strains were isolated only after 1990. Topology and clustering in the SVDV tree resembled those in the 248_data except for five strains isolated from Hong Kong (1973−1985), which clustered together with genotype III with high support values in 248_data. In contrast, certain earlier Asian strains (isolated from Hong Kong and Japan during 1970–1976) clustered with genotype III in the SVDV tree and had low support values (Fig. 1 and Supplementary Fig. 2).
VP1-based estimation of time to most recent common ancestor (TMRCA), substitution rates, and viral demographic history
The analytical results for 248_data resembled those for CV-B5 except that CV-B5 had a slightly higher EPS during 1965–2000. For 248_data, the estimated mean (95% highest probability density, HPD) TMRCAs were 1891 (1856–1920) for CV-B5 and 1955 (1948–1961) for SVDV with an evolution rate of 4.47 × 10−3 (3.90 × 10−3 –5.05 × 10−3) substitutions/site/year (s/s/y). When analysed separately, the TMRCAs were 1895 (1863–1926) for CV-B5 and 1955 (1847–1962) for SVDV, and their evolution rates were 4.76 × 10−3 (4.11 × 10−3–5.47 × 10−3) s/s/y and 3.61 × 10−3 (2.87 × 10−3–4.34 × 10−3) s/s/y, respectively. Figure 2 shows the Bayesian skyline plots (BSPs) of the viral demographic histories for CV-B5, SVDV and 248_data. The results for CV-B5 show that a one-step increase in effective population size (EPS) occurred in 1960. During 1960-2015, the EPS was generally stable, i.e., fluctuation in EPS (e.g., a dip in 1965) did not appreciably affect the 95% credibility interval. On the other hand, the EPS of SVDV started from only 8.8 (1.7–31.1) and gradually increased to a plateau in 1999 before decreasing after 2000.
3Dpol-based phylodynamic analysis
Five phylogenetic clusters (A–E) with high support values were revealed in this partial 3Dpol-based analysis (Fig. 3). Most prototypical CV-B strains were alone in a separate branch, and most strains were in clusters that corresponded with VP1 genotypes. The cluster A sequences of 3Dpol tree included earlier isolates from the United Kingdom, France, and United States, and most of these strains were clustered in genotype B2 of VP1. The cluster B included all SVDV and CV-B5 strains that corresponded with genotype B1 of VP1. The cluster C strains included those isolated from Taiwan during 1995–2003 and one each from Korea (2000) and China (2009), which were mainly clustered in genotype A1 of VP1. The cluster D strains were isolated from Taiwan (1983–2011) and China (2008−2011) and were mainly clustered in genotype A4 of VP1. The cluster E included strains isolated from Dominica (1971), Hong Kong (1987), Pakistan (1991), China (2010–2013), and Taiwan (2013–2015); these were also clustered in genotype A4 of VP1. This partial 3Dpol-based analysis-revealed a notable incongruence: SVDV clustered together with genotype B1 in the 3Dpol tree instead of with genotype A1 in the VP1 tree.
Geographic transmission of CV-B5 and SVDV
The diffusion histories for both viruses were reconstructed using Bayesian estimation and annotated using the Spatial Phylogenetic Reconstruction of Evolutionary Dynamics 3 (SpreaD3) program. Based on the analysis of VP1, 11 transmission routes were supported by a Bayes factor >3 (Fig. 4). The 248_data revealed three transmission routes: one from the United States (to Denmark), one from Estonia (to Russia), and one from Australia (to Kuwait). The CV-B5 dataset revealed six transmission routes: one from Germany (to Denmark), three from China (to Taiwan, France, and Denmark), one from France (to the Netherlands), and one from Taiwan (to Denmark). The SVDV dataset revealed two routes: both from Italy (to the Netherlands and Spain). The analysis of the 3Dpol dataset revealed five transmission routes supported by a BF >3: one from Dominica (to Honduras), one from Russia (to the United States), and three from Italy (to the Netherlands, Spain, and Malta). In the 3Dpol dataset, Italy, the Netherlands, Spain, Malta, and Russia only had SVDV strains, suggesting that these transmission routes were important in SVDV transmission. Notably, transmissions from Italy to Netherlands and from Italy to Spain were supported by both analyses based on VP1 region of SVDV data and 3Dpol data, revealing the important role played by Italy in SVDV transmission.
Detection of variation in VP1 and 3Dpol regions
Although no recombination events were detected in either the VP1 (structural) or partial-3Dpol (nonstructural) regions, the incongruity of constructed phylogenies indicated that recombination occurred between these regions. To investigate the codon sequence composition of SVDV, we analysed 27 CV-B5 and 50 SVDV full codon sequences in GenBank. According to the SimPlot results (Supplementary Fig. 3), genotype A1 of CV-B5 and genotype B1 of CV-B5 had high similarity in the VP1 and 3Dpol regions, respectively. Additionally, the similarity between CV-B5 and SVDV was lower (<75%) within nt 2650–3000. The segmental arrangement of various CV-B5 genotypes in SVDV indicated that multiple recombination events have occurred as SVDV evolved.
The genealogical relationships of deduced aa sequences in the VP1 and 3Dpol regions of CV-B5 and SVDV were further visualized using the network in population analysis with reticulate trees (PopART) program27. Figure 5A clearly shows that based on the analysis of the VP1 region, CV-B5 and SVDV strains clearly clustered in separate branches (blue branch in the left and salmon branch in the right). The prototype strain of CV-B5 and SVDV were located near the junction of the blue branch and salmon branch. Both CV-B5 and SVDV networks revealed radical extensions, which were roughly grouped by genotype. The network revealed that the branch separated into genotype A1 and SVDV from prototype strain of CV-B5, each extending towards opposite directions. Next, the CV-B5 branch continuously extended A1 and formed a network that consist of genotypes A2 to A4 (on the upper-left of Fig. 5A), and further extends to the network consisting of strains formed by mixing of genotypes B1 and B2 (in the lower-left of Fig. 5A). Notably, this genogroup B network revealed a major group (the biggest circle) with short random extensions by mixing of genotype B1 and B2 strains (Supplementary Fig. S4). The biggest haplotype mixing by random extension short branches suggests that the founder effect was established by the fittest haplotype.
The prototype SVDV strain, from the junction of blue and salmon branches, extended to a mixed network of genotype I and II strains (light and dark purple circle on the right). This network then prolonged into two genotype II branches (upper-right) and genotype III network (lower-right), which corroborated the results of phylogenetic analysis and indicated that the SVDV strains had evolved with a spatiotemporal trend (Supplementary Fig. 5).
In the 3Dpol region, all strains were clustered into one large and small haplotype cluster each (Fig. 5B). The small haplotype included the prototype CV-B1 strain, 12 CV-B5 strains (four cluster C, six cluster D, and two cluster E), and five SVDV strains (Supplementary Fig. 6). All other strains were grouped together in the large haplotype cluster and its short branches. The presence of numerous haplotype clusters with short random extension branches also indicated the existence of a common ancestor and establishment of dominant traits.
Patterns conserved between CV-B5 and SVDV were visualized using the WebLogo program28. The two viruses showed aa sequence similarity in almost every VP1 and 3Dpol residue (Fig. 6). The single likelihood ancestor counting (SLAC) method obtained dN/dS values of 0.0334 and 0.0265 for the VP1 and 3Dpol regions, respectively. A dN/dS value of <1 indicates negative selection, in which the aa sequence tends to be fixed. Neither VP1 nor 3Dpol revealed sites of significant positive selection by SLAC. Both regions showed scattered sites of significant negative selection. However, the two viruses had five conserved nonsynonymous mutation sites (G83D, T84S, S186G, N210S, and S227G) in VP1. The epistatic analysis based on 248_data revealed 25 pairs of epistatic interactions (PP >0.5). Intriguingly, position 186 showed multiple epistatic interactions with positions 83, 84, 210, 227, and 6. These epistatic interactions occurred among all five nonsynonymous mutation sites. The partial 3Dpol region revealed no positive sites and no mutation pattern common to CV-B5 and SVDV. Six pairs of epistatic interactions were identified; however, none of them differed between CV-B5 and SVDV. The only residue difference between these two haplotypes was residue 117 immediately before motif E, where T in the large haplotype cluster was replaced by A in the small haplotype cluster.
Discussion
For a historical reconstruction of the spatiotemporal transmission of CV-B5 and SVDV, sequence changes during host switching were analysed in 834 nt of the VP1 region (248 sequences, 248_data) and in 420 nt of a partial 3Dpol region (129 sequences, including prototypical sequences of CV-B1–B4 and CVB6 as an outgroup). Phylodynamic analyses were used to estimate the epidemic, evolutionary, and immunological characteristics of CV-B5 and SVDV transmission. Analyses of evolutionary history revealed haplotype networks with deduced aa sequences. However, virus phylogenetic analysis should be carefully considered owing to several limitations. Ancestral sequences are undersampled owing to insufficient data deposited in GenBank, whereas recently isolated strains are oversampled owing to the use of improved laboratory techniques. Meanwhile, virulent strains are reported more frequently than viruses with low pathogenicity. Furthermore, the lack of reports of a virus in a given country does not mean that the virus is not currently circulating in that country. Thus, the results of the analysis should be cautiously interpreted in these conditions.
Recombination occurs frequently in EV-B, particularly in the non-structural region29,30. Previous results obtained from the phylogenetic analyses of structural and nonstructural regions of EV-B are often incongruent31,32. Earlier studies have reported that clusters in the structural region (e.g., VP1) are monophyletic by serotype whereas clusters in the nonstructural region (e.g., 3Dpol) are monophyletic by species (e.g., EV-B). This study did not directly detect recombination events within the VP1 or 3Dpol regions. However, CV-B5, similar to other serotypes in EV-B species, showed incongruent phylogenetic results for the VP1 and 3Dpol regions. The VP1 tree had a monophylogenetic topology, but the 3Dpol region was rooted together with other serotypes in EV-B23. The incongruent phylogenies constructed from structural (i.e., VP1) and nonstructural (i.e., 3Dpol) regions indicate the occurrence of recombination events between these two regions. On the countrary, the SVDV phylogenetic trees constructed in this study were also incongruent; SVDV strains clustered together with genogroup A of CV-B5 in the VP1 tree, but clustered together with genogroup B of CV-B5 in the 3Dpol tree. Interestingly, all SVDV isolates were monophyletic not only in the VP1 tree, but also in the 3Dpol tree, and support values were high in both regions. This result supports a previous hypothesis that SVDV originated from a common ancestor after the host was transferred from human to pig11, as the 3Dpol tree of SVDV clustered together with SVDV strains alone instead of rooting at the species level with other EV-B strains.
A previous report has suggested Hong Kong as an important transmission hub10. Hong Kong is a small region with limited natural resources, and is consequently reliant on imports. Thus, transmission of swine-associated pathogens to Hong Kong via live pigs or pork has been reported33. However, no transmission route from Hong Kong has been detected with high BF value in the current study. The geographic analysis revealed that China played important roles in CV-B5 transmission, and Italy played important roles in SVDV transmission.
Owing to a high selectivity for immune escape, VP1 has the lowest similarity of all EV genes. A “75/85% rule” is the common scheme used for molecular typing of EV species. That is, complete or partial VP1 sequences are assumed to have the same serotype if their nt or aa sequence similarity exceeds 75%34. A cutoff value of 15% sequence similarity is widely used in EV genotype discrimination21. Previous CV-B5 phylogenetic analyses have reported 2–5 genotypes, and all of these studies revealed a similar topology19,20,35,36. In the current study, the TMRCAs for two genogroups, both of which included SVDV strains, were similar to those reported by Gullberg, et. al. (Supplementary Table. 3)27. Continuous bipartite branches have been reported in genotype A19. This topology indicates that the sublineages co-circulated in a wide geographic area during their evolution. The genogroup B consists of one imbalanced ladder-like backbone and strains that mainly circulated in Europe19. We observed that ladder-like backbones were common not only in genogroup B but also in the terminal branches of recently circulating genotypes such as genotype A1 and genotype A4. These two genotypes included the strains after 2000. Improved laboratory techniques have reduced the loss of branch data. Given that tree imbalance is mainly determined by directional selection18, these data imply that, after evolution of the fittest sequence, the population size increased, but variation was fixed. That is, balanced structures suggest a founder effect in adaptive haplotypes. Thus, haplotype size and a mix of early and recent strains are also useful fitness indicators.
Sequence comparisons showed that VP1 starts with a conserved PALTAVETGHT motif (associated with viral uncoating) followed by a conical β-barrel commonly seen in the viral capsid protein. This β-barrel is composed of eight antiparallel β-strands (strand B to strand I). The main variations among VP1 are in the N- and C-terminus and in the loops that connect the β strands, especially in the outermost BC and DE loops37. As reported in previous studies, five conserved mutations (83, 84, 201, 227 and 186) of VP1 were identified in CV-B5 and SVDV19,38. A novel finding of the current study is the multiple epistatic interactions observed in the 186 residue. Positions 83 and 84 are located in antigenic site I (within the BC-loop), position 210 is located in the putative site of coxsackievirus and adenovirus receptor (CAR) binding, position 186 is located in the hydrophobic pocket, position 227 is located within the five-fold axis, and position 6 is located in the N-terminus before the PALTAVETGHT motif19. Notably, CAR is a cellular receptor of CV-B, and mutations of residues 83 and 84 are often reported in association with neutralization escape. The hydrophobic pocket is a determinant of pocket factor selectivity and, like the cellular receptor, is an important determinant of viral tropism in Picornaviridae. Meanwhile, pocket factor expulsion is associated with capsid destabilization. Subsequent extrusion of the VP1 N termini (positions 5 to 18) then forms the viral genome propeller tip26. Thus, all positions involved in the multiepistatic interaction were associated with viral tropism and viral decoating.
The RdRp proteins were organized as a closed right hand structure with three domains (fingers, palm, and thumb) consisting of seven motifs in the order G, F, A–E from the N terminus to the C terminus39,40. In the partial 3Dpol region, this study compared aa residues in positions 254–392 of motifs B to E in the palm region. Motif B is involved in the specific selection of rNTPs as substrates during RdRp polymerization. Motif C includes the strongly conserved GDD motif and has a catalytic residue Asp329 between β11 and β12. Motif E (β14 and β15) is the transition region between the palm and thumb domains34. The comparisons showed no signature variation between CV-B5 and SVDV, which indicated that this partial 3Dpol region does not functionally differ between human and swine hosts.
In conclusion, comparing homologous virus variants in different host species can reveal genetic components needed for viral infection and adaptation to a different host. The VP1 gene has had a particularly important role in host tropism and transmission through the host barrier whereas the partial 3Dpol region is associated with viral heredity and thus does not differ in a host switch from human to pig. This study used sequencing data to reconstruct the spatiotemporal transmission histories of CV-B5 and SVDV and detected selection events to reveal the key molecular sites associated with different host species.
Methods
Specimen collection and ethics statement
For each year of positive CV-B5 isolation (1999–2015), 28 isolates were randomly selected from a pool of de-identified virus stock from two medical centres in southern Taiwan (Kaohsiung Veterans General Hospital and Kaohsiung Medical University Hospital). The ethics committees of both hospitals approved this study. Experiments were limited to viral isolates obtained from clinically necessary laboratory procedures; medical histories were not reviewed, and no patients were harmed. As such, informed consent was not required. The SVDV sequences were obtained from the Animal Health Research Institute, Taiwan (AHRI-TW). Samples used in this study were obtained from certified farms that voluntarily sent samples to AHRI-TW. Animals were investigated according to the regulations established by the Central Epidemic Command Center of Taiwan. As such, this study did not require approval by the ethics committee of the institutional review board.
Viral RNA extraction, reverse transcriptase (RT-)PCR, and sequencing
Sampled viral strains were amplified in confluent rhabdomyosarcoma cells. The procedures for RNA purification, RT-PCR, and sequencing were performed as described previously41,42. Briefly, viral RNA was extracted from 200 μl of viral culture supernatant using the QIAmp viral RNA purification kit (Qiagen, Chatsworth, CA, USA). The One-Step RT-PCR kit (Qiagen) was used to perform RT-PCR with 200 ng total RNA using specific primer pairs (Table 1) for 35 amplification cycles. Purified PCR products were sequenced with the ABI Prism Ready Reaction Dideoxy Terminator cycle sequencing kit (Model 3730 v.3.4; Applied Biosystems, Foster City, CA, USA). The sequences for forward and reverse strands were obtained simultaneously and edited with Sequence Navigator v.3.01 software (Applied Biosystems). The obtained VP1 (880 nt) and partial 3Dpol (420 nt) gene sequences were submitted to GenBank under the accession numbers listed in Supplementary Table 2.
Detection of sequence variation and selection
Full-length VP1 and partial 3Dpol (positions 6671–7090 of accession no. AF114383) sequences of CV-B5 and SVDV were obtained using the Basic Local Alignment Search Tool program. Multiple sequence alignments were performed with T-coffee43. Sequences were manually excluded if they (1) lacked isolation date or location data or (2) if they had ambiguously codons, nonsense, or frameshift mutations. Sequences with the same isolation location and year were then stratified by random sampling. Recombination events were detected using Recombination Detection Program v.3.44 with default settings44 and SimPlot v.3.5.112. Pairwise comparisons of nt and aa sequences were estimated with the p-distance method in the MEGA v.7 program45. The SLAC method was used to estimate site-specific selection, i.e., the ratio of non-synonymous substitutions to synonymous substitutions (dN/dS). A p-value less than 0.05 was considered statistically significant. The Spidermonkey/Bayesian graphical model (BGM) was used to detect sites with epistatic interaction, and PP <0.5 was considered statistically significant. All programs used to detect selection, including SLAC and BGM, were implemented on the DataMonkey website46. For a comprehensive comparison of population differences between CV-B5 and SVDV, variations in deduced aa sequences between VP1 and 3Dpol proteins of both viruses were graphically visualized with the WebLogo24. The genealogical relationships were further visualized by using the method developed by Templeton, Crandall, and Singh (TSC)47 to construct a network in PopART program27.
Phylogenetic and phylodynamic analysis
The best-fit substitution model by jMODELTEST v.2.1.7 program48 under Akaike information criteria was performed. Maximum likelihood phylogenetic analyses were conducted using the MEGA v.7 program. Further, the Bayesian evolutionary analysis sampling trees (BEAST) v.1.8.4 program was used for BMCMC tree analysis49. All sampled sequences were stamped with isolation times and locations. Eight candidate model compositions (substitution-clock-tree) were used for Bayesian estimation in all four datasets (three for VP1 sequences and one for partial 3D sequences). All model compositions included the SRD06 substitution model, relaxed uncorrelated clock model with either exponential distribution or lognormal distribution, and one of the four coalescent tree priors: constant size, exponential growth, logistic growth, and BSP (1*2*4 = 8). The best-fit model composition for each dataset was selected with the path sampling and stepping-stone methods. The MCMC tree analysis was also used to co-estimate rate of growth, substitution rate, and TMRCA. The TRACER v.1.6 program was used to calculate effective sample size (ESS) based on the stationarity of post-burn-in distributions and estimated parameters, which were considered significant if ESS >200. These estimated parameters were expressed as mean value with 95% HPD. FigTree v.1.4.3 software was used to visualize the MCC tree. Nodal support was estimated by calculating PP, and significant support was defined as PP > 0.9. A BEAST log file with rate indicators was generated by asymmetric Bayesian stochastic search variable selection in location reconstruction. The SPREAD3 v0.9.6 program was used to convert the tree into KML format and to estimate non-zero expectancy rates50. Discrete sampling locations were plotted as the centre of the country in which the strain was isolated.
References
Pallansch, M. A. & Roos, R. P. In Fields Virology (eds Knipe, D. M. et al.) 723–776 (Lippincott Willians & Wilkins, 2001).
Kim, K. W. et al. Coxsackievirus B5 Infection Induces Dysregulation of microRNAs Predicted to Target Known Type 1 Diabetes Risk Genes in Human Pancreatic Islets. Diabetes 65, 996–1003 (2016).
Tsuey-Li, L., Ting-Han, L., Yuan-Pin, H. & Ho-Sheng, W. Introduction of Laboratory-based Surveillance System for Enterovirus in Taiwan. Taiwan Epidem Bull 31, 211–221 (2015).
Centers for Disease Control, Taiwan. Weekly Report of Enterovirus Infection http://www.cdc.gov.tw/english/submenu.aspx?treeid=00ed75d6c887bb27&nowtreeid=fc82bf602877f050 (2018).
Abedi, G. R. et al. Enterovirus and Human Parechovirus Surveillance - United States, 2009–2013. MMWR Morb Mortal Wkly Rep 64, 940–943 (2015).
Hyeon, J. Y. et al. Accuracy of diagnostic methods and surveillance sensitivity for human enterovirus, South Korea, 1999–2011. Emerg Infect Dis 19, 1268–1275 (2013).
Jean, J. S. et al. The association between rainfall rate and occurrence of an enterovirus epidemic due to a contaminated well. J Appl Microbiol 101, 1224–1231 (2006).
Condell, O. et al. Evaluation of the enterovirus laboratory surveillance system in Denmark, 2010 to 2013. Euro Surveill 21 (2016).
Nardelli, L. et al. A foot and mouth disease syndrome in pigs caused by an enterovirus. Nature 219, 1275–1276 (1968).
Zhang, G., Haydon, D. T., Knowles, N. J. & McCauley, J. W. Molecular evolution of swine vesicular disease virus. J Gen Virol 80, 639–651 (1999).
Bruhn, C. A. et al. Viral meningitis epidemics and a single, recent, recombinant and anthroponotic origin of swine vesicular disease virus. Evolution, medicine, and public health 2015, 289–303 (2015).
Lole, K. S. et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol 73, 152–160 (1999).
King, A. M. Q. et al. In Virus Taxonomy. Seventh Report of the International Committee for the Taxonomy of Viruses (ed Van Regenmortel M. H. V. et al.) 657–673 (New-York, San Diego, 2000).
Simossis, V., Kleinjung, J. & Heringa, J. An overview of multiple sequence alignment. Current protocols in bioinformatics (ed. Andreas, D. et al.) Chapter 3, Unit 3.7 (2003).
Wilkins, A. D. et al. Accounting for epistatic interactions improves the functional analysis of protein structures. Bioinformatics 29, 2714–2721 (2013).
Whitton, J. L., Cornell, C. T. & Feuer, R. Host and virus determinants of picornavirus pathogenesis and tropism. Nature reviews. Microbiology 3, 765–776 (2005).
Ahlquist, P. & RNA-dependent, R. N. A. polymerases, viruses, and RNA silencing. Science 296, 1270–1273 (2002).
Grenfell, B. T. et al. Unifying the epidemiological and evolutionary dynamics of pathogens. Science 303, 327–332 (2004).
Henquell, C. et al. Phylogenetic patterns of human coxsackievirus B5 arise from population dynamics between two genogroups and reveal evolutionary factors of molecular adaptation and transmission. J Virol 87, 12249–12259 (2013).
Gullberg, M. et al. Characterization of a putative ancestor of coxsackievirus B5. J Virol 84, 9695–9708 (2010).
Rico-Hesse, R., Pallansch, M. A., Nottay, B. K. & Kew, O. M. Geographic distribution of wild poliovirus type 1 genotypes. Virology 160, 311–322 (1987).
Song, Y. et al. Persistent circulation of Coxsackievirus A6 of genotype D3 in mainland of China between 2008 and 2015. Scientific reports 7, 5491 (2017).
Chu, P. Y. et al. Transmission and Demographic Dynamics of Coxsackievirus B1. PloS one 10, e0129272 (2015).
Palacios, G. et al. Molecular epidemiology of echovirus 30: temporal circulation and prevalence of single lineages. J Virol 76, 4940–4949 (2002).
Mulders, M. N., Salminen, M., Kalkkinen, N. & Hovi, T. Molecular epidemiology of coxsackievirus B4 and disclosure of the correct VP1/2A(pro) cleavage site: evidence for high genomic diversity and long-term endemicity of distinct genotypes. J Gen Virol 81, 803–812 (2000).
Brown, B. A., Oberste, M. S., Alexander, J. P. Jr., Kennett, M. L. & Pallansch, M. A. Molecular epidemiology and evolution of enterovirus 71 strains isolated from 1970 to 1998. J Virol 73, 9969–9975 (1999).
Leigh, J. W. & Bryant, D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol 6, 1110–1116 (2015).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome research 14, 1188–1190 (2004).
Oberste, M. S., Penaranda, S. & Pallansch, M. A. RNA recombination plays a major role in genomic change during circulation of coxsackie B viruses. J Virol 78, 2948–2955 (2004).
Zhang, H. et al. Molecular characterization of two novel echovirus 18 recombinants associated with hand-foot-mouth disease. Scientific reports 7, 8448 (2017).
Lukashev, A. N. et al. Recombination in circulating Human enterovirus B: independent evolution of structural and non-structural genome regions. J Gen Virol 86, 3281–3290 (2005).
Oberste, M. S., Maher, K. & Pallansch, M. A. Evidence for frequent recombination within species human enterovirus B based on complete genomic sequences of all thirty-seven serotypes. J Virol 78, 855–867 (2004).
Pappas, G. Socio-economic, industrial and cultural parameters of pig-borne infections. Clin Microbiol Infect 19, 605–610 (2013).
Oberste, M. S. et al. Typing of human enteroviruses by partial sequencing of VP1. J Clin Microbiol 37, 1288–1293 (1999).
Rezig, D. et al. Update on molecular characterization of coxsackievirus B5 strains. J Med Virol 83, 1247–1254 (2011).
Liu, N. et al. An outbreak of aseptic meningitis caused by a distinct lineage of coxsackievirus B5 in China. Int J Infect Dis 23, 101–104 (2014).
Oberste, S., Drescher, K. & Tracy, S. Group B coxsackieviruses. Preface. Curr Top Microbiol Immunol 323, v–vi (2008).
Verdaguer, N., Jimenez-Clavero, M. A., Fita, I. & Ley, V. Structure of swine vesicular disease virus: mapping of changes occurring during adaptation of human coxsackie B5 virus to infect swine. J Virol 77, 9780–9789 (2003).
Gruez, A. et al. The crystal structure of coxsackievirus B3 RNA-dependent RNA polymerase in complex with its protein primer VPg confirms the existence of a second VPg binding site on Picornaviridae polymerases. J Virol 82, 9577–9590 (2008).
Ferrer-Orta, C., Ferrero, D. & Verdaguer, N. RNA-Dependent RNA Polymerases of Picornaviruses: From the Structure to Regulatory Mechanisms. Viruses 7, 4438–4460 (2015).
Huang, H. W. et al. Phylodynamic reconstruction of the spatiotemporal transmission and demographic history of coxsackievirus B2. BMC Bioinformatics 16, 302 (2015).
Oberste, M. S., Nix, W. A., Maher, K. & Pallansch, M. A. Improved molecular identification of enteroviruses by RT-PCR and amplicon sequencing. J Clin Virol 26, 375–377 (2003).
Notredame, C., Higgins, D. G. & Heringa, J. T-Coffee: A novel method for fast and accurate multiple sequence alignment. Journal of molecular biology 302, 205–217 (2000).
Martin, D. P. et al. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics 26, 2462–2463 (2010).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol Biol Evol (2016).
Delport, W., Poon, A. F. & Frost, S. D. & Kosakovsky Pond, S. L. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics 26, 2455–2457 (2010).
Templeton, A. R., Crandall, K. A. & Sing, C. F. A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping and DNA sequence data. III. Cladogram estimation. Genetics 132, 619–633 (1992).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nature methods 9, 772 (2012).
Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol 29, 1969–1973 (2012).
Bielejec, F. et al. SpreaD3: Interactive Visualization of Spatiotemporal History and Trait Evolutionary Processes. Mol Biol Evol 33, 2167–2169 (2016).
Acknowledgements
We gratefully acknowledge the Taiwan Centers for Disease Control (CDC-TW) and Animal Health Research Institute (AHRI-TW) for providing data regarding enterovirus prevalence. However, the interpretations of data and the conclusions contained herein do not represent those of CDC-TW and AHRI-TW. This study was supported by Ministry of Science and Technology, Taiwan grant nos. MOST106–2320-B-037-026 and MOST105-2320-B-037-014 to PYC; Center for Infectious Disease and Cancer Research, Kaohsiung Medical University grant nos. KMU-TP105PR20 and KMU-TP104PR14 to PYC and grant no. KMU-TP104E12 to YCT; Chi-Mei Medical Center and Kaohsiung Medical University Research Foundation grant nos. 104CMKMU012 and 105CM-KMU-13 to PYC; and Chi-Mei Foundation Medical Center grant no. CLFHR10510 to YCY. The funders had no roles in the design of the study, the collection and analysis of data, the decision to publish, or the preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
Experiments were conceived and designed by Y.C.Y., P.Y.C. and H.W.H. Experiments were performed by C.F.W., C.C.L., C.H.P. and Y.Y.S. Data analyses were performed by C.H.P., Y.Y.S., Y.C.T. and C.H.C. Reagents/materials/analysis tools were contributed by C.H.P., P.L.L., Y.S.C., H.J.S., L.C.C., Y.Y.L., H.F.L., B.C.C. and T.S.H. The manuscript was prepared by H.W.H., P.H.C. and C.H.P. All authors have read the final version of the manuscript and have agreed to its contents.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Huang, HW., Chu, PH., Pan, CH. et al. Evolutionary histories of coxsackievirus B5 and swine vesicular disease virus reconstructed by phylodynamic and sequence variation analyses. Sci Rep 8, 8821 (2018). https://doi.org/10.1038/s41598-018-27254-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-27254-y
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.