Abstract
Streptococcus agalactiae is a major pathogen and a hindrance on tilapia farming worldwide. The aims of this work were to analyze the genomic evolution of Brazilian strains of S. agalactiae and to establish spatial and temporal relations between strains isolated from different outbreaks of streptococcosis. A total of 39 strains were obtained from outbreaks and their whole genomes were sequenced and annotated for comparative analysis of multilocus sequence typing, genomic similarity and whole genome multilocus sequence typing (wgMLST). The Brazilian strains presented two sequence types, including a newly described ST, and a non-typeable lineage. The use of wgMLST could differentiate each strain in a single clone and was used to establish temporal and geographical correlations among strains. Bayesian phylogenomic analysis suggests that the studied Brazilian population was co-introduced in the country with their host, approximately 60 years ago. Brazilian strains of S. agalactiae were shown to be heterogeneous in their genome sequences and were distributed in different regions of the country according to their genotype, which allowed the use of wgMLST analysis to track each outbreak event individually.
Similar content being viewed by others
Introduction
Streptococcus agalactiae (Lancefield’s group B Streptococcus, GBS) is a Gram-positive coccus that causes septicaemia and meningoencephalitis in many species of marine and freshwater fish worldwide1,2,3,4,5. This bacterium may also cause septicaemia and meningitis in human new-borns6 and has already been reported in other animals, including guinea pigs, camels, cats, dolphins, horses and frogs7. This disease is a major obstacle to the expansion of Brazilian aquaculture because it causes high prevalence in Nile tilapia (Oreochromis niloticus), the most frequently farmed fish in Brazil3. GBS streptococcosis in tilapia farming occurs mainly in temperatures above 27 °C and it leads to a high economic impact due to high mortality and its ability to evolve quickly3,5.
Several genotyping methods have been used to study the population structure of S. agalactiae infecting humans and animals, including the evaluation of likelihood of cross-species transmission8,9, as capsular serotyping10 and multilocus sequence typing (MLST)11. The capsular serotyping has been used to discriminate the GBS strains into ten serotypes (Ia, Ib, II-IX)12, since each serotype is considered antigenically and structurally unique7. The fish GBS isolates are commonly characterized as serotypes Ia, Ib and III13. However, the discriminatory potential of this technique has been described as low for evaluation of epidemiological studies14. The main epidemiological tool applied in studies of GBS diseases is the MLST11,15, which can discriminate strains in lineages (sequence type, ST) and combine them based on genetic proximity in clonal complexes (CC). This method has proven efficient for understanding evolutionary stories between lineages and has so far discriminated fish GBS isolates worldwide in several CCs, including the fish-specific CC5524,16. However, MLST cannot fully discriminate many strains from different host species, geographical origins or outbreaks, grouping them into the same sequence type or clonal complex. Examples of this are several strains from human, bovine, feline and rodent sources belonging to ST-1034,6,17,18 and ST-7, which groups isolates from fish, humans and dolphins13. Also, a previous study revealed that Brazilian fish GBS were almost exclusively from CC552, including ST-552, ST-553 or ST-2604 been also this CC predominant in Latin America19.
With the advancements in the next-generation high-throughput sequencing and statistical tools based on molecular evolutionary theory, it became possible to perform comparative genome analysis among closely related strains. This strategy had a high resolution, enabling the discrimination of microorganisms at the single nucleotide level, providing thus a new method for typing known as genome-level typing tool20. In recent studies of different bacterial species, the use of a larger set (or even all) of alleles from a genome, through the whole-genomes MLST (wgMLST) has shown the possibility to distinguish between closely related strains of a given species even with epidemiological conditions of difficult discrimination21,22, as a single outbreak or different anatomic niches in a single patient17,18. As such, the wgMLST is emerging as a tool for epidemiological and evolutionary studies and has a promising future in molecular epidemiology of microbial pathogens23,24. This kind of information is very strategic in order to apply control measures to avoid he spread of specific bacterial lineages as well as the monitoring of vaccination programs, since the efficacy of S. agalactiae vaccine to Nile tilapia seems to be linked to strain specificity25. In addition, genome sequencing also allows the elucidation of the temporal and spatial dynamics of evolution of a pathogen through Bayesian phylogenetic methods26. This present work aimed to evaluate the population structure and evolutionary trends of the fish pathogen S. agalactiae in Brazil and to establish a farm-to-farm approach for the epidemiological tracking of different bacterial clones using genomic methods.
Results
Assembly, genomic features and in silico serotyping
In order to provide data to perform a large-scale analysis of S. agalactiae in Brazil, the 38 strains of the Next Generation Sequence (NGS) project from the National Reference Laboratory for Aquatic Animal Diseases (AQUACEN) culture collection were sequenced. The main features of these 38 genomes along with one previously sequenced strain by our group27 are shown in Table 1. The vertical coverage from Ion Torrent sequencing for all sequenced strains of this work (n = 38) was 224 ± 88 fold. All isolates were comprised of one single chromosome with an average size of 1,844,131 ± 4,460 bp. The number of coding DNA sequence (CDS) varied from 1,503 to 1,729, the number of pseudogenes varied from 98 to 320 and the average G + C content was 35.48%, for the 39 genomes. A search on the whole-genome sequence was performed to assign the capsular serotype using an in-house script, and all strains were typed as Ib.
MLST and eBURST profiles
From the whole genomes sequenced, the ST and clonal complexes were reconstructed in order to compare the population structure of Brazilian GBS strains with other GBS fish strains isolated in other countries. The 39 strains presented two distinct sequence types: one previously reported in Brazil (ST-260) plus a new ST that was submitted to PubMLST and named ST-927 (Table 1, Fig. 1). Additionally, several strains (n = 25) were non-typeable due to a partial deletion in the glcK locus and the consequential hindrance of allelic typing, an event previously described by Assis, et al.28. The eBURST analysis of all piscine-related STs is shown in Fig. 2. This analysis grouped the strains of three previously established CCs, as follows: CC10 with ST-283, ST-491 and ST-739; CC7 with ST-6, ST-7, ST-500 and ST-735; and CC261 with ST-246, ST-261 and ST-891. In addition, it established two newly proposed eBurst groups: the CC260 with ST-260 as a founder, ST-259, and the newly ST-927, and the group comprised of ST-257 and the non-typeable strains with a SLV, a close genetic relation. Strain STIR-CD-17 (GenBank accession number ALXB01) has no established ST, but its ST is different from ST-260 by a single nucleotide polymorphism at tkt. Neither ST-103 nor ST-258 shared enough alleles with any of the other piscine strains to be considered as genetically related.
Genomic similarity
Considering the low level of ST diversity observed in the Brazilian GBS fish strains, the 39 genomes were submitted to Gegennes, whose strategy is the comparison of similarities of whole genome contents through Blastn percentage of identity. The all versus all analysis of genomic similarity is shown in Fig. 3. All the studied piscine S. agalactiae strains showed high similarity (>98%), but two groups were slightly distinct. One was composed of MLST non-typeable isolates, and the other strains ST-260 plus ST-927, showed higher similarities within groups rather than between groups.
Phylogenomic analysis based on wgMLST
As genomic similarities did not improve discriminatory results after using pre-genomic MLST based techniques, a survey of high-resolution tools was conducted. Thus the wgMLST technique, which is known to provide a scalable means to study the whole-genome variation29, was designated to this study. The NeighborNet phylogenomic network separated all isolates into four main groups based on the “All_loci” comparison (Fig. 4A). One group was composed by the single strain 138spar (ST-261, USA) and the other group included the strains STIR-CD-14 (ST-491, Vietnam) and GD201008-001 (ST-7, China). The Brazilian strains were divided in two groups (Fig. 4B), with one group formed by all non-typeable strains (all arising from the Central-South region) and the other major group formed by ST-260 and ST-927 strains (arising mainly from the Northeast region).
Bayesian analysis of evolution and emergence of the Brazilian S. agalactiae clade
In order to evaluate the evolutionary trends of the population of Brazilian GBS fish strains, particularly the temporal emergence of different clones, a Bayesian analysis was performed based on substitutions found in the core genome of S. agalactiae. The substitution rate and evolution analysis showed a highly convergence with ST grouping of MLST technique (Fig. 5). The analysis was performed using 100 million generations that showed to be an Effective Sample Size (ESS) with values >=200 for key parameters (output variables). The substitution rate of this population was 6.21 × 10−7 substitution/site/year (95% highest probability density [HPD], 3.42 × 10−7 to 8.97 × 10−7). One clade, named taxa V, was formed by the ST-261, whose strains came from the USA, Israel and China and along with ST-927, ST-260 and the non-typeable strains of this work, they seemed to have emerged 1,233 years ago (95% HPD, 738 to 1886), whereas Taxa IV, containing the reported STs of S. agalactiae from fish in Brazil, seemed to have emerged approximately 585 years ago (95% HPD, 341 to 890). A third group (Taxa III) was composed only of the ST-260 from Brazilian isolates (although this ST also occurs outside Brazil) and had a time to most recent common ancestor (tMRCA) of approximately 33 years ago, in 1983 (95% HPD, 1963 to 1999). Finally, two groups exclusively from the Brazilian samples (Taxa II – non-typeable strains and Taxa I – ST-927) emerged respectively, in 1956 (95% HPD, 1923 to 1984) and in 1978 (95% HPD, 1955 to 1997).
Discussion
MLST is the main epidemiological tool for studying human and animal diseases caused by S. agalactiae, and it has been widely used to study evolutionary relationships between strains from different epidemiological sources, such as hosts species, geographical distances, and periods of isolation4,9,13,30,31,32. Nevertheless, it is already known that by using only seven genetic loci might result in the neglect of some genetic information, such as lateral genetic transfer events or phage-related genes, which compromises the resolution of the molecular characterization33. Alternatively, the use of genomic tools would be suitable to find divergence in close related GBS strains.
The MLST utilizes seven genes that play important cellular functions11,34 and whose mutations could possibly be subject to purifying selection. The studied strains could be discriminated into two STs and one non-typeable group. A previous study4, had described the ST-552 and ST-553, however theses STs were composed only by strains identified as non-typeable that shared a partial gene deletion event on glcK gene, as described by Assis et al.28. A PCR reaction targeting the glcK gene were performed for all Brazilian strains based on Godoy et al.4, and as a result all PCRs were found to be negative using electrophoresis on a 1.5% agarose gel (data not shown). This discovery resulted in the deconstruction of the previously described CC5524, which were comprised of ST-257, ST-259, ST-260, ST-552 and ST-553. Therefore, it was then established two eBurst groups, as follows: firstly the CC260 group, comprising ST-260 (founder, which formerly comprised CC552), ST-927 (newly described), ST-259 (which formerly comprised CC552), and the strain STIR-CD-17; and secondly, the group comprised of ST-257 (which cannot be considered as a CC since it does not have a founder; it was formerly comprised of CC552) and the non-typeable strains. These two eBurst groups and the CC261 had a similar MLST profile, since they shared five identical alleles, suggesting an evolutionary relationship.
In the present work, the three obtained MLST groups (two STs and the non-typeable strains) presented a geographical distribution pattern, with the non-typeable strains occurring exclusively in the Central-South region and the ST-260 and ST-927, occurring mainly in the Northeast, which is in accordance with a previous study4. The occurrence of ST-260 and ST-927 (SA245 and SA218, respectively) in the Southeast region shows that these two types have been circulating through the country, which is not surprising, as S. agalactiae is believed to be transmitted by contact between animals2,3.
Genomic similarity results (Fig. 3) showed that each ST comprises strains with similar sequences (>99.9%), and that the differences between the STs were small (<2%). It has been previously shown through comparative genomics that S. agalactiae has a genome structured with a stable “back-bone” and that other elements are responsible for differences between many lineages17. Therefore, small differences between genomes probably occur due to polymorphisms in the gene sequences. Fish-associated GBS isolates are known to harbour some genetic content that is not shared with other CCs16 and that may be associated with host adaptation. However, there is no information regarding genomic diversity in a population of closely related S. agalactiae strains that could be used to track every source of an outbreak of streptococcosis. The genomic features of the Brazilian isolates of S. agalactiae showed a number of pseudogenes varying from 98 to 320 between isolates and even varied inside each genomic lineage. It is known that piscine GBS strains are going through a reductive evolution, and a high percentage of pseudogenes are commonly seen, over 10% of the genome, which is believed to be an adaptative strategy of these hosts34. The analysis of genomic similarities corroborated with the MLST results, though it was not was sufficient to discriminate the closely related strains (Fig. 3).
Thus, a wgMLST was performed to depict the population structure of S. agalactiae isolated from fish in Brazil. According to Maiden, et al.21, the wgMLST has the potential to discriminate very closely related strains or single clone pathogens. These results strengthened the previous comparative analyses, since the phylogenomic network based on all loci comparisons revealed that the genomic types were similar. The geographical discrimination of the isolates was in accordance with the MLST, but by the use of the wgMLST approach it was possible to observe the relative genetic distances between each strain. The closest contemporary strains (SA623 and SA627, both isolated in 2015) diverged by 22 alleles, suggesting that each analyzed Brazilian isolate belongs to a single clone.
As shown in Fig. 4, in the ST-927 group, the two strains from the state of Pernambuco (SA102 and SA97) were closely related, and the same was seen for the strains from the Alagoas state (SA85 and SA95). However, in the ST-927 (mainly from the Northeast region), the strain SA218 from the Southeast region was most closely related to SA95, and in the ST-260 (mainly from the Northeast region), the SA245 from the Southeast region was most closely related to SA256, reinforcing the assumption of pathogen transmission by infected fish dislocations, which was raised after the MLST analysis, even with the interregional trade.
Moreover, in the non-typeable group, the lower axis contained the older isolates, while the upper axis contained the newest ones and showed that more contemporary strains had fewer allelic differences, suggesting that mutations were occurring through a temporal scale. Inside this group, the genome of the catfish isolate (SA81) was very closely related to some tilapia isolates (Fig. 4B). Currently, tilapia is the main piscine host for S. agalactiae in Brazil. Nevertheless, the emergence of new Brazilian fish species as potential hosts for this pathogen suggests that the host range might be even broader (tilapia and the parental catfish species of the hybrid catfish belong to different taxonomic orders) and it represents a great potential hazard for the Brazilian aquaculture.
Based on the Bayesian evolutionary tree (Fig. 5), all piscine GBS strains from Brazil have emerged from a single branch and are grouped in accordance with the MLST. This group emerged approximately 585 years ago, which is more recent than the group composed by other American, Chinese and Israeli piscine S. agalactiae strains from ST-261 and that they emerged approximately 1,234 years ago. It is evident that since the divergence from ST-261, the group composed of ST-260, ST-927 and the non-typeable strains are evolving more rapidly and further diversifying (intra-groups I, II and III in Fig. 5). These results showed that the main ST that nowadays occurs in Brazil had emerged quite recently (apart from 1956, for non-typeable strains) and, together with the genetic diversity depicted by the wg-MLST, might represent a concern for the development and use of vaccines with broad protection and the possibility of adaptation to new hosts. Inside the non-typeable group, SA81 (marked with a circle in Fig. 5), isolated from catfish, emerged from a different branch of the GBS strains isolated from tilapia, which suggests that microevolution is occurring within this group that is leading to different host tropisms. However, the genetic basis of this possible adaptation needs to be evaluated.
Similarly, the human ST-7 strain A909 isolated in 1975 (GenBank accession number CP000114, marked with a star in Fig. 5) emerged previously from a single branch contrast the other six ST-7 strains that were isolated between 2009–2015 (Fig. 5, Supplementary Table 2), which shared a single branch, arose from tilapia host, and emerged contemporaneously with the Brazilian strains in 1981 (1964–1996). Since the emergence of the ST-7, there has been a diversification to different host tropisms, which is in line with previous genomic studies that revealed a close genomic relationship between these isolates34,35. Although the evolutionary data pointed to a very recent emergence of new GBS STs, it should be considered that this analysis has some limitations, since other genetics events, such as horizontal gene transfer, are not evaluated by this method.
This study revealed that the Brazilian S. agalactiae population of piscine isolates is diverse, spatially distributed according to their sequence type and that many recent evolutionary events are leading the creation of new groups. Furthermore, the study revealed that there are contemporary events leading to the increased diversity of piscine S. agalactiae in Brazil, probably evolving along with the expansion of aquaculture in the last five decades. The gene-by-gene approach is a powerful tool that allows us to track outbreaks of streptococcosis by S. agalactiae farm-to-farm and to establish spatial links between disease events.
Methods
Bacterial strains
The strains of S. agalactiae were selected from the AQUACEN culture collection. All strains were previously characterized4,36, and its geographic origin and year of isolation of each strain is presented in the Supplementary Table 1.
DNA extraction and sequencing
For DNA extraction, strains were thawed and grown in blood agar at 28 °C for 48 h. Colonies were collected and submitted to the Maxwell® 16 MDx Research Instrument (Promega, USA) following the manufacturer’s instructions. Genomic DNA were quantified using a Qubit 2.0 Fluorometer (Life Technologies, Thermo Scientific, USA). The strain SA2037 was re-sequenced by our group27 on a MiSeq sequencer (Illumina, USA) using a Nextera™ DNA Library Prep Kit. All other strains were sequenced on an Ion Torrent Personal Genome Machine™ (PGM) (Life Technologies) with the Ion PGM Sequencing 200 bp Kit following the manufacturer’s recommendations.
Genomes assembly and annotation
The quality of all raw sequenced data was analysed using FastQC 0.11.138, and an in-house script (https://www.github.com/aquacen/fast_sample) was used to obtain reads with a PHRED quality score of at least 20 (i.e., -q 20 parameter) and exclude adaptors sequence (i.e., -l 17 parameter). Genomes were then de novo assembled using Newbler 2.0 (Roche, USA) with -urt -noace -m -a 50 parameters, and scaffolds were generated using CONTIGuator39, using SA20 as a reference strain and default parameters. Gap filling was performed manually using CLC Genomics Workbench 7.0 (CLC-gw) (Qiagen, USA), mapping the reads to the genomes and gradually extending the flanks of the gaps. Annotation was performed manually for SA20 using Uniprot database (http://www.uniprot.org), which was then used to annotate the other strains by the software Prokka version 1.1140, with modification to use nested databases in this order: manually curated CDSs from SA20, RefSeq database only with S. agalactiae proteins, and finally all proteins from RefSeq. The annotated genomes visualization and the manual correction of frameshifts were performed on Artemis41 and CLC-gw, respectively.
In silico capsular serotyping
An in-house script was developed (available at https://www.github.com/aquacen/serotype_Sagalactiae) to automate the protocol designed by Sheppard, et al.42, to access the capsular serotype of S. agalactiae isolates through the whole genome sequence. In summary, this script uses the complete genome sequence of each genome as query in Blastn algorithm, against a database containing the sequences proposed by Sheppard et al.42, with e-value 1e-100, query coverage >=90% and percentage of identity >=95%.
MLST genotyping and eBURST analysis
All seven of the MLST loci adhP (alcohol dehydrogenase), pheS (phenylalanyl-tRNA ligase subunit alpha), atr (amino acid ABC transporter), glnA (glutamine synthetase), sdhA (L-serine dehydratase subunit alpha), glcK (glucokinase) and tkt (transketolase) were selected from the 39 GBS genomes and then STs were obtained by an in-house script (available at https://www.github.com/aquacen/mlst_Sagalactiae). This script searches designated alleles deposited on pubMLST in the genomes through nucleotide-Blast alignment. Moreover, one strain from each ST previously related on fish was selected from the pubMLST and GenBank databases, and all data were submitted to the eBURST algorithm43. Clonal complexes were defined using SLV bias with default parameters. The map with the strains geographic location was generated in the OpenJump software version 1.6.344, using Brazilian geographic data available at Brazilian Ministry of Environment web site (http://mapas.mma.gov.br/i3geo/datadownload.htm) and isolation coordinates obtained in the AQUACEN culture collection metadata.
Genomic similarity percentage analysis
The similarity of whole genomes were compared using Gegenees software version 2.045 along with other piscine S. agalactiae strains that had complete genome sequences, 138spar (ST-261) and GD201008-001 (ST-7) (GenBank access numbers CP007565 and CP003810, respectively), to obtain a percentage identity matrix. This matrix was then used to build a similarity heatmap. The parameters set in Genenees were a Fragment size = 200 bp, Step size = 100 bp and a threshold in heat settings of 0%.
Phylogenomic analysis based on wgMLST
The complete genome sequences of the 39 presented strains together with 138spar and GD201008-001, in addition to the draft genome sequence of ST-491 S. agalactiae strain STIR-CD-14 (GenBank access number ANEJ01), were submitted to Bacterial Isolate Genome Sequence Database (BIGSdb). All loci were compared one by one between isolates using a gene-by-gene approach21 on a GenomeComparator plugin. All loci scheme was previously generated using all genes cluster from a cd-hit-est software v4.646, with a multi-fasta file containing all predicted genes from all strains (n = 42) and the default parameters. A distance matrix with the relative genomic divergence between all isolates was obtained and used to construct a phylogenomic NeighborNet network using SplitsTree 4.047.
Bayesian estimation of clonal emergence
The 39 strains from this study, along with the S. agalactiae genomes available on GenBank that were at least at the scaffold stage with a number of scaffolds >=30 (n = 103), were used to perform the substitution rate and Bayesian analyses. Using a highly stringent nucleotide-Blast search with an e-value of 1e-20, the CDS of SA20 were aligned against all other strains. All CDSs that met the following requirements were retained: minimal percentage of identity of 98%, difference of length between query and subject <=5 bp, lacking paralogous sequences, and present in all genomes (n = 382 genes) were extracted and concatenated to form the core genome of the species. The core genome of the 142 strains was aligned using MAFFT v7.302b48 with the parameter “–auto” enabled. The BEAUti package of BEAST v1.8.349 was used to generate an xml file to run a Bayesian phylogenetic analysis with the following parameters: output files of MAFFT as data, five taxa groups (Taxa I – all the MLST non-typeable strains of this work; Taxa II – all strains of ST-927; Taxa III – all strains of ST-260; Taxa IV – all strains of group I, II, III; and Taxa V – all strains of group I, II, III and ST-261); tip dates (year of isolation, for all fish strains, and year precision, for some strains that lack year of isolation), as described in the Supplementary Table 2, to calibrate the clock rate; general time-reversible (GTR) model with gamma correction plus invariant sites; strict clock; and coalescent exponential growth tree. Markov Chain Monte Carlo (MCMC) was performed using BEAST with 100 million generations and log collection every 100 generations. The Tracer package of BEAST was used to analyze the log collection with a burn-in value of 10% of generations. TreeAnnotator of BEAST was used to output the maximum clade credibility (MCC) tree as a nexus file, and TempEst software v1.550 was used to show the tree previously generated by TreeAnnotator.
BEAST implements strict, lognormal-relaxed, and exponential-relaxed molecular clock models and coalescent constant, exponential and expansion growths, and Bayesian skyline tree models. Furthermore, several substitution models for nucleotides are available. The clock and substitution models combinations were tested with MCMC as described above. Models of Bayesian skyline tree, lognormal-relaxed molecular clock and exponential-relaxed molecular clock, that failed to perform the MCMC algorithm, were discarded. Only results with ESS values greater than 200 for all key parameters were compared. The used model described above had been chosen by the substitution rate HPD cover described in a previous work with the Streptococcus genus51.
References
Eldar, A., Bejerano, Y. & Livoff, A. et al. Experimental streptococcal meningo-encephalitis in cultured fish. Veterinary microbiology 43, 33–40 (1995).
Evans, J. J. et al. Characterization of β-haemolytic Group B Streptococcus agalactiae in cultured seabream, Sparus auratus L., and wild mullet, Liza klunzingeri (Day), in Kuwait. Journal of Fish Diseases 25, 505–513 (2002).
Mian, G. F. et al. Aspects of the natural history and virulence of S. agalactiae infection in Nile tilapia. Veterinary microbiology 136, 180–183, https://doi.org/10.1016/j.vetmic.2008.10.016 (2009).
Godoy, D. T. et al. Genetic diversity and new genotyping scheme for fish pathogenic Streptococcus agalactiae. Letters in applied microbiology 57, 476–483, https://doi.org/10.1111/lam.12138 (2013).
Soto, E. et al. Characterization of Isolates of Streptococcus agalactiae from Diseased Farmed and Wild Marine Fish from the U.S. Gulf Coast, Latin America, and Thailand. Journal of aquatic animal health 27, 123–134, https://doi.org/10.1080/08997659.2015.1032439 (2015).
Bohnsack, J. F. et al. Population structure of invasive and colonizing strains of Streptococcus agalactiae from neonates of six U.S. Academic Centers from 1995 to 1999. Journal of clinical microbiology 46, 1285–1291, https://doi.org/10.1128/JCM.02105-07 (2008).
Johri, A. K. et al. Group B Streptococcus: global incidence and vaccine development. Nature reviews. Microbiology 4, 932–942, https://doi.org/10.1038/nrmicro1552 (2006).
Ismail, A. Q. & Anthony, M. Cross-species transfer of group B streptococcus via ingestion? Journal of perinatal medicine 40, 201–202, https://doi.org/10.1515/jpm.2011.136 (2012).
Chen, M. et al. Streptococcus agalactiae isolates of serotypes Ia, III and V from human and cow are able to infect tilapia. Veterinary microbiology 180, 129–135, https://doi.org/10.1016/j.vetmic.2015.07.033 (2015).
Cieslewicz, M. J. et al. Structural and genetic diversity of group B streptococcus capsular polysaccharides. Infect Immun 73, 3096–3103, https://doi.org/10.1128/IAI.73.5.3096-3103.2005 (2005).
Maiden, M. C. J. Multilocus sequence typing of bacteria. Annu Rev Microbiol 60, 561–588, https://doi.org/10.1146/annurev.micro.59.030804.121325 (2006).
Poyart, C. et al. Multiplex PCR assay for rapid and accurate capsular typing of group B streptococci. Journal of clinical microbiology 45, 1985–1988, https://doi.org/10.1128/JCM.00159-07 (2007).
Evans, J. J. et al. Phylogenetic relationships among Streptococcus agalactiae isolated from piscine, dolphin, bovine and human sources: a dolphin and piscine lineage associated with a fish epidemic in Kuwait is also associated with human neonatal infections in Japan. Journal of medical microbiology 57, 1369–1376, https://doi.org/10.1099/jmm.0.47815-0 (2008).
Martinez, G. et al. Characterization of Streptococcus agalactiae Isolates of Bovine and Human Origin by Randomly Amplified Polymorphic DNA Analysis. J Clin Microbiol 38, 71–78 (2000).
Maiden, M. C. J. et al. Multilocus sequence typing: A portable approach to the identification of clones within populations of pathogenic microorganisms. P Natl Acad Sci USA 95, 3140–3145, https://doi.org/10.1073/pnas.95.6.3140 (1998).
Delannoy, C. M. et al. Genomic comparison of virulent and non-virulent Streptococcus agalactiae in fish. J Fish Dis 39, 13–29, https://doi.org/10.1111/jfd.12319 (2016).
Brochet, M. et al. Genomic diversity and evolution within the species Streptococcus agalactiae. Microbes and infection / Institut Pasteur 8, 1227–1243, https://doi.org/10.1016/j.micinf.2005.11.010 (2006).
Springman, A. C. et al. Selection, recombination, and virulence gene diversity among group B streptococcal genotypes. Journal of bacteriology 191, 5419–5427, https://doi.org/10.1128/JB.00369-09 (2009).
Delannoy, C. M. J. et al. Human Streptococcus agalactiae strains in aquatic mammals and fish. BMC Microbiology 13 (2013).
Grad, Y. H. & Lipsitch, M. Epidemiologic data and pathogen genome sequences: a powerful synergy for public health. Genome Biol 15, 538, https://doi.org/10.1186/s13059-014-0538-4 (2014).
Maiden, M. C. et al. MLST revisited: the gene-by-gene approach to bacterial genomics. Nature reviews. Microbiology 11, 728–736, https://doi.org/10.1038/nrmicro3093 (2013).
Cody, A. J., Bennett, J. S. & Maiden, M. C. J. Multi-Locus Sequence Typing and the Gene-by-Gene Approach to Bacterial Classification and Analysis of Population Variation. 41, 201-219, https://doi.org/10.1016/bs.mim.2014.06.001 (2014).
Kingry, L. C. et al. Whole genome multilocus sequence typing as an epidemiologic tool for Yersinia pestis. Diagnostic Microbiology and Infectious Disease 84, 275–280, https://doi.org/10.1016/j.diagmicrobio.2015.12.003 (2016).
Kluytmans-van den Bergh, M. F. et al. Whole-Genome Multilocus Sequence Typing of Extended-Spectrum-Beta-Lactamase-Producing Enterobacteriaceae. J Clin Microbiol 54, 2919–2927, https://doi.org/10.1128/jcm.01648-16 (2016).
Chen, M. et al. Screening vaccine candidate strains against Streptococcus agalactiae of tilapia based on PFGE genotype. Vaccine 30, 6088–6092, https://doi.org/10.1016/j.vaccine.2012.07.044 (2012).
Gray, R. R. et al. Testing spatiotemporal hypothesis of bacterial evolution using methicillin-resistant Staphylococcus aureus ST239 genome-wide data within a bayesian framework. Mol Biol Evol 28, 1593–1603, https://doi.org/10.1093/molbev/msq319 (2011).
Pereira, F. L., Soares, S. C. & Dorella, F. A. et al. Evaluating the efficacy of the new Ion PGM Hi-Q Sequencing Kit applied to bacterial genomes. Genomics 107, 189–198, https://doi.org/10.1016/j.ygeno.2016.03.004 (2016).
Assis, G. B., Tavares, G. C., Pereira, F. L., et al. Natural coinfection by Streptococcus agalactiae and Francisella noatunensis subsp. orientalis in farmed Nile tilapia (Oreochromis niloticus L.). J Fish Dis, https://doi.org/10.1111/jfd.12493 (2016).
Jolley, K. A. & Maiden, M. C. J. Using MLST to study bacterial variation: prospects in the genomic era. Future Microbiology 9, 623–630, https://doi.org/10.2217/fmb.14.24 (2014).
Bergal, A. et al. Molecular epidemiology and distribution of serotypes, genotypes, and antibiotic resistance genes of Streptococcus agalactiae clinical isolates from Guelma, Algeria and Marseille, France. European journal of clinical microbiology & infectious diseases: official publication of the European Society of Clinical Microbiology 34, 2339–2348, https://doi.org/10.1007/s10096-015-2487-6 (2015).
Lusiastuti, A. M., Seeger, H. & Indrawati, A. et al. The Comparison of Streptococcus agalactiae Isolated from Fish and Bovine using Multilocus Sequence Typing. HAYATI Journal of Biosciences 20, 157–162, https://doi.org/10.4308/hjb.20.4.157 (2013).
Sun, J. et al. Inapparent Streptococcus agalactiae infection in adult/commercial tilapia. Scientific reports 6, 26319, https://doi.org/10.1038/srep26319 (2016).
Laing, C. R., Zhang, Y. & Thomas, J. E. et al. Everything at once: comparative analysis of the genomes of bacterial pathogens. Veterinary microbiology 153, 13–26, https://doi.org/10.1016/j.vetmic.2011.06.014 (2011).
Rosinski-Chupin, I. et al. Reductive evolution in Streptococcus agalactiae and the emergence of a host adapted lineage. BMC Genomics 14, 1–15, https://doi.org/10.1186/1471-2164-14-252 (2013).
Liu, G. Zhang, W. & Lu, C. Comparative genomics analysis of Streptococcus agalactiae reveals that isolates from cultured tilapia in China are closely related to the human strain A909. BMC Genomics 14(1), 775 (2013).
Assis, G. B. N. et al. Use of MALDI-TOF Mass Spectrometry for the Fast Identification of Gram-Positive Fish Pathogens. Frontiers in Microbiology 8, https://doi.org/10.3389/fmicb.2017.01492 (2017).
Pereira, Ud. P. et al. Complete genome sequence of Streptococcus agalactiae strain SA20-06, a fish pathogen associated to meningoencephalitis outbreaks. Standards in genomic sciences 8, 188–197, https://doi.org/10.4056/sigs.3687314 (2013).
Andrews, S. FastQC A quality control application for high throughput sequence data, http://www.bioinformatics.babraham.ac.uk/projects/download.html#fastqc (2010).
Galardini, M., Biondi, E.G., Bazzicalupo, M. et al. CONTIGuator a bacterial genomes finishing tool for structural insights on draft genomes. Source Code for Biology and Medicine 6 (2011).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069, https://doi.org/10.1093/bioinformatics/btu153 (2014).
Rutherford, K. et al. Artemis: sequence visualization and annotation. Bioinformatics 16, 944–945 (2000).
Sheppard, A. E. et al. Capsular Typing Method for Streptococcus agalactiae Using Whole-Genome Sequence Data. Journal of clinical microbiology 54, 1388–1390, https://doi.org/10.1128/jcm.03142-15 (2016).
Feil, E. J., Li, B. C. & Aanensen, D. M. et al. eBURST: Inferring Patterns of Evolutionary Descent among Clusters of Related Bacterial Genotypes from Multilocus Sequence Typing Data. Journal of bacteriology 186, 1518–1530, https://doi.org/10.1128/jb.186.5.1518-1530.2004 (2004).
Steiniger, S. & Bocher, E. An overview on current free and open source desktop GIS developments. International Journal of Geographical Information Science 23, 1345–1370, https://doi.org/10.1080/13658810802634956 (2009).
Agren, J., Sundstrom, A. & Hafstrom, T. et al. Gegenees: fragmented alignment of multiple genomes for determining phylogenomic distances and genetic signatures unique for specified target groups. PloS one 7, e39107, https://doi.org/10.1371/journal.pone.0039107 (2012).
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659, https://doi.org/10.1093/bioinformatics/btl158 (2006).
Huson, D. H. & Bryant, D. Application of phylogenetic networks in evolutionary studies. Molecular biology and evolution 23, 254–267, https://doi.org/10.1093/molbev/msj030 (2006).
Katoh, K. & Standley, D. M. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Molecular biology and evolution 30, 772–780, https://doi.org/10.1093/molbev/mst010 (2013).
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology 7, 214, https://doi.org/10.1186/1471-2148-7-214 (2007).
Rambaut, A., Lam, T. T., Max Carvalho, L. et al. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evolution 2 (2016).
Turner, C. E. et al. Emergence of a New Highly Successful A capsular Group A Streptococcus Clade of Genotype emm89 in the United Kingdom. mBio 6, e00622-00615, https://doi.org/10.1128/mBio.00622-15 (2015).
Acknowledgements
We thank the Coordination for the Improvement of Higher Education Personnel (CAPES), the National Counsel of Technological and Scientific Development (CNPq) and Ministry of Agriculture, Livestock and Food Supply for financing this study.
Author information
Authors and Affiliations
Contributions
G.M.B., F.L.P. and H.C.P.F.: wrote the manuscript. G.M.B. and F.L.P.: performed the bioinformatics analyses. G.M.B., G.C.T., A.F.C., F.A.D.: performed the experiments. C.A.G.L., H.C.P.F.: conceived and designed the experiments. H.C.P.F.: Coordinated all analyses of the project. All authors read and approved the final manuscript
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Barony, G.M., Tavares, G.C., Pereira, F.L. et al. Large-scale genomic analyses reveal the population structure and evolutionary trends of Streptococcus agalactiae strains in Brazilian fish farms. Sci Rep 7, 13538 (2017). https://doi.org/10.1038/s41598-017-13228-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-017-13228-z
This article is cited by
-
Delineation of the pan-proteome of fish-pathogenic Streptococcus agalactiae strains using a label-free shotgun approach
BMC Genomics (2019)
-
Metabolic engineering of Lactococcus lactis for high level accumulation of glutathione and S-adenosyl-l-methionine
World Journal of Microbiology and Biotechnology (2019)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.