Deep sequencing analysis of the heterogeneity of seed and commercial lots of the bacillus Calmette-Guérin (BCG) tuberculosis vaccine substrain Tokyo-172

Wada, Takayuki; Maruyama, Fumito; Iwamoto, Tomotada; Maeda, Shinji; Yamamoto, Taro; Nakagawa, Ichiro; Yamamoto, Saburo; Ohara, Naoya

doi:10.1038/srep17827

Download PDF

Article
Open access
Published: 04 December 2015

Deep sequencing analysis of the heterogeneity of seed and commercial lots of the bacillus Calmette-Guérin (BCG) tuberculosis vaccine substrain Tokyo-172

Takayuki Wada¹^na1,
Fumito Maruyama²^na1,
Tomotada Iwamoto³^na1,
Shinji Maeda⁴^na1,
Taro Yamamoto¹^na1,
Ichiro Nakagawa²^na1,
Saburo Yamamoto⁵^na1 &
…
Naoya Ohara^6,7^na1

Scientific Reports volume 5, Article number: 17827 (2015) Cite this article

2367 Accesses
10 Citations
1 Altmetric
Metrics details

Subjects

Abstract

BCG, only vaccine available to prevent tuberculosis, was established in the early 20th century by prolonged passaging of a virulent clinical strain of Mycobacterium bovis. BCG Tokyo-172, originally distributed within Japan in 1924, is one of the currently used reference substrains for the vaccine. Recently, this substrain was reported to contain two spontaneously arising, heterogeneous subpopulations (Types I and II). The proportions of the subpopulations changed over time in both distributed seed lots and commercial lots. To maintain the homogeneity of live vaccines, such variations and subpopulational mutations in lots should be restrained and monitored. We incorporated deep sequencing techniques to validate such heterogeneity in lots of the BCG Tokyo-172 substrain without cloning. By bioinformatics analysis, we not only detected the two subpopulations but also detected two intrinsic variations within these populations. The intrinsic variants could be isolated from respective lots as colonies cultured on plate media, suggesting analyses incorporating deep sequencing techniques are powerful, valid tools to detect mutations in live bacterial vaccine lots. Our data showed that spontaneous mutations in BCG vaccines could be easily monitored by deep sequencing without direct isolation of variants, revealing the complex heterogeneity of BCG Tokyo-172 and its daughter lots currently in use.

A guide to vaccinology: from basic principles to new developments

Article 22 December 2020

Andrew J. Pollard & Else M. Bijker

Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis

Article Open access 06 November 2019

Jethro S. Johnson, Daniel J. Spakowicz, … George M. Weinstock

The evolution of SARS-CoV-2

Article 05 April 2023

Peter V. Markov, Mahan Ghafari, … Aris Katzourakis

Introduction

Mycobacterium bovis bacillus Calmette-Guérin (BCG) was established in 1921 after 13 years and 230 subcultures of its virulent parental Nocard strain on a glycerinated beef-bile-potato medium¹. BCG has been the only officially registered live vaccine against tuberculosis (TB) for almost a century and effective BCG mass vaccination campaigns against TB have been conducted worldwide. Since its development, the original strain of BCG, maintained at the Pasteur Institute, has been distributed and used to produce substrains^2,3. These substrains have been used for vaccination and for culture with appropriate passaging to maintain the strains; this has caused accumulation of genetic difference among the substrains. Recent studies have reported the genetic diversity among these substrains^4,5,6; this variation has affected the respective substrains phenotypically, modifying the lipid components of the cell wall and affecting carbon metabolism and antigenicity, for example^{5,7,8,9,10,11}. Moreover, such genetic diversity has the potential to affect the stability of the vaccine during medium- and long-term storage. Recently, the World Health Organization (WHO) declared that BCG vaccine substrains should be evaluated for appropriate approval and application worldwide^10,12.

In addition to difference among substrains, variations within each substrain have arisen by spontaneous mutations. A well-studied example is BCG Tokyo (BCG Japan), one of the ancestral substrains, which was initially transferred from the Pasteur Institute to Japan in 1924¹³. A freeze-dried seed lot, Tokyo-172, was found to contain at least two subpopulations that exhibited different colony morphologies (Types I and II)^14,15. Consecutive seed lots of the substrain were propagated and subcultured to mass-produce the final bulk stock prior to the improvement of freeze-drying techniques, which may have been involved in the generation of mutations and the accumulation of genetic variants within the substrain. As a concrete example, the Type I subpopulation of Tokyo-172, which exhibits smooth colonies, is genetically characterized by a unique 22-bp deletion, which has been named RD16¹⁵, in the JTY3475 gene (an ortholog of Rv3405 of Mycobacterium tuberculosis H37Rv) compared with reference sequences of clinical strains, such as M. bovis AF2122/97 and M. tuberculosis H37Rv. In contrast to the thorough description of the Type I genomic sequence¹⁶, the Type II subpopulation, which can be distinguished from Type I organisms by its rough-shaped colonies, has remained genetically uncharacterized. Recently, a frameshift missense error of ppsA, found exclusively in the Type II subpopulation, was reported to affect the cell wall components¹⁷. Therefore, a complicated heterogeneous mixture of the two subpopulations exists in the BCG Tokyo-172 substrain. Such natural diversification, e.g., heterogeneity on duplication of the DU2 region in the BCG Danish substrain⁵, commonly occurs in BCG substrains and can be exacerbated by continuous passage without appropriate cloning due to natural mutations and selective pressure in culture media.

Deep sequencing using next-generation sequencing (NGS) technologies has been developing rapidly and has enabled researchers to analyse variations in nucleotides in genetically distinct populations of various species^18,19,20,21. Deep sequencing may unveil not only a priori differences between substrains but also de novo heterogeneity within a substrain without requiring the cloning of variants. As an example, the intra-lot variation in the oral poliovirus vaccine has already been successfully evaluated by NGS techniques^22,23, providing important information for maintenance of the homogeneity of live vaccines. Such an approach allows us to observe, characterize and examine undesirable heterogeneity in microbial resources caused by spontaneous genetic mutations.

In this study, we used NGS techniques to describe the heterogeneity in separate lots of the BCG Tokyo-172 substrain, including two commercial lots from overseas distributees. This approach can be used to detect genetic variations in live resources and contributes to the management of heterogeneity in cell seed lot systems.

Results

Genomic characterization of two subpopulations in the BCG Tokyo-172 substrain

In a prior investigation of the heterogeneity of seed and commercial lots, the genomic variation between Type I and II subpopulations of the BCG Tokyo-172 substrain was verified by mapping analysis using NGS short reads. In addition to an already-known difference (a frameshift mutation in the ppsA gene)¹⁷, six nucleotide substitutions were newly identified (Table 1). In comparison with outgroup substrains, these nucleotide changes were assigned to both Type I and II subpopulations, indicating that genetic diversification occurred not in one of the two subpopulations unilaterally, but in both subpopulations after their divergence.

Table 1 Single nucleotide polymorphisms between subpopulations of M. bovis BCG Tokyo.

Full size table

Subpopulations in seed/commercial lots of BCG Tokyo-172

Next, to assess the heterogeneity in lots of BCG Tokyo-172, five domestic seed and commercial lots and two foreign commercial lots were sampled and characterized genetically (Fig. 1). The domestic lots included two sequential seeds, Tokyo-172 and its daughter seed lot, Tokyo-172-1. Three remaining lots (K-1242-F, K1478L and KH130) were commercially available; these had been propagated and freeze-dried from Tokyo-172 or Tokyo-172-1. The two foreign lots, BG039304 and 1A-825-43, were commercial lots received from Taiwan and Thailand, respectively. BCG substrains in current use in both areas had been obtained from Japan after the establishment of Tokyo-172^2,24,25 and were then produced independently as applied vaccines in the respective countries.

First, from each lot, the two subpopulations, Types I and II, were quantified according to the RD16 genomic deletion. Two seed lots, Tokyo-172 and Tokyo-172-1, contained 36.8% ± 1.6% and 14.9% ± 0.6% of the Type II subpopulation, respectively (Fig. 2). The Type II subpopulation was predominated in domestic commercial lots (K-1242-F: 1.7% ± 0.1%, K1478L: 12.8% ± 0.9% and KH130: 5.8% ± 0.2%). Overall, the proportion of the Type II subpopulation decreased with increasing passages. These results correspond with those of a previous report¹⁴. A commercial lot from Taiwan (BG039304) contained 40.4% ± 1.9% of the Type II subpopulation, showing genetic similarity to the original seed lot, Tokyo-172. In contrast, the RD16 deletion genotype was not detected by polymerase chain reaction (PCR) in a commercial lot from Thailand 1A-825-43.

Deep sequencing analysis

Deep sequencing based on an Illumina platform provides massive sequence data at each nucleotide position in a genome; this analysis can capture genetic fluctuations quantitatively in a population of bacilli. With this perspective, the collected BCG lots were subjected to deep sequencing to evaluate the potential for use of this technique to measure the proportion of subpopulations and to detect spontaneous inherent variations.

To monitor genomic regions as broadly as possible, the read data from each BCG lot were mapped to reference sequences (BCG Tokyo-172 substrain Type I). The average depths mapped to the references ranged from 299.07 to 484.65 (average: 400.05, by a mapping tool Bowtie 2; Supplementary Table 1). With the read quantity, about 90% of nucleotide sequences of the reference genomes were mapped by the reads with a read depth of more than 200 (Fig. 3), which guaranteed subsequent detection of heterogeneity in a broad range with high accuracy. Indeed, the seven polymorphisms found between the two subpopulations (Table 1) were successfully detected and the proportions of Type I and II subpopulations in each lot were determined for all lots, except for K-1242-F, the commercial lot having the lowest proportion of the Type II subpopulation in our samples (Table 2).

Table 2 Detection of heterogeneity of type I/II variants in seed and commercial lots of M. bovis BCG Tokyo.

Full size table

We incorporated three bioinformatics tools to detect heterogeneity of BCG lots. LoFreq called the variation most consistently, although this tool cannot detect indel variations, such as the insertion variant at 3,192,638 (ppsA). SNVer and Breseq can detect both substitution and indel variations in their separate, respective files. However, the proportion of variants with this insertion, as determined by SNVer, deviated from that of the other variant calls inconsistently. In contrast, the insertion could be detected by Breseq, consistent with other substitutions. Breseq could also detect the 22-bp deletion of RD16, although its sensitivity for detection of substitutive variants seemed lower than those of LoFreq and SNVer (Table 2). For example, Breseq failed to consistently detect variations at positions 644,562 (JTY_0567) and 3,606,131 (JTY_3278), although these variations were detected by LoFreq and SNVer.

To apply this method to other substrains, a reference sequence must be applied by integration of several genomes to account for the deleted regions of each substrain. For this purpose, we supplemented the entire sequence of BCG Tokyo-172 with other genome sequences (BCG Moreau RDJ, another candidate of the reference substrain of BCG and M. bovis AL2122/97; Supplementary GenBank File 1). When it was used to replace the original sequence, the variation could be detected exactly as described in Table 2.

Monitoring intrinsic heterogeneity in BCG lots

In addition to the complexity of Type I and II substrains, spontaneous heterogeneity in the BCG lots was also determined by variant calls. From monitoring of NGS data, we estimated that only two lots may have contained minor heterogeneity (Table 3) in over 5% of the total population.

Table 3 Variant calls of the intrinsic heterogeneity in the respective lots.

Full size table

One of the two intrinsic variants (Tokyo-172) was detected in the most ancestral seed lot in this study (Table 3). The variant was not found in descendant lots, indicating that the variant had not been positively selected in subsequent passages. The other heterogeneous lot was 1A-825-43; this was a commercial lot and the variant would not be inherited in subsequent propagation, although the variant may be contaminated from its seed lot. To verify this bioinformatic estimation, we cultured the BCG lots on agar plates and tried to isolate variant clones directly from the colonies grown. As a result, mutated colonies were indeed isolated from the two BCG lots (Table 3).

Discussion

BCG Tokyo-172 is one of four candidate WHO reference strains^26,27. Two types of colonies showing different morphologies were found when Tokyo-172 was cultured on agar plates¹⁵; and these variants were thought to be caused by complex heterogeneity^16,17. In this study, to monitor hidden heterogeneity in lot stocks of BCG Tokyo-172, deep sequencing analysis was performed. We successfully identified and characterized two subpopulations (Types I and II) of this strain, providing insights into the acquisition of heterogeneity in vaccine strains.

BCG has been used in vaccine production for almost a century. This long period of maintenance and worldwide distribution has resulted in wide genetic diversity among substrains^5,15. This could also cause genetic heterogeneity within respective substrains, such as Type I and II subpopulations in BCG Tokyo-172. Although there is no evidence showing that heterogeneity affects the efficacy or antigenicity of the BCG vaccine directly, this feature should be controlled by certain procedures in order to maintain the stability of the vaccine. Monitoring of unpredictable variation across the entire genome was difficult in the past; however, with major advancements in deep sequencing analysis based on NGS technologies, such monitoring has become possible. Indeed, deep sequencing is now suitable for effective detection of spontaneous variation within a biotic resource and may become the new standard for quality control of new lots.

In this study, we examined seven derivative lots of Tokyo-172 to investigate their heterogeneity. The substrain BCG Tokyo-172 has been reported to include two subpopulations, identifiable by their colony morphology and genetic differences^15,17. This readily detectable diversity provides a good indicator to evaluate the use of NGS techniques to identify heterogeneity within BCG lots. In our study, seven nucleotide polymorphisms were identified between Type I and II subpopulations and could be representative of heterogeneous mutations in each lot. The results showed that deep sequencing analysis could be applies as a standard method to test the heterogeneity of BCG vaccines quantitatively.

If we are to assume that there was a historical process of subdivision of the BCG Tokyo-172 substrain, it is important that both Type I and II subpopulations possess their own unique polymorphisms. Other substrains (BCG Russia and Moreau) that were distributed from the Pasteur Institute during the same period as BCG Tokyo do not share the seven polymorphisms of the subpopulations. Thus, these polymorphisms may have accumulated separately during passages in Japan. However, it is not clear why such heterogeneity has occurred, particularly because the cloning processes of the substrain over this period have not been recorded.

The heterogeneity in each lot of BCG Tokyo-172 can be interpreted as a mixture of the major polymorphisms between the two subpopulations; in contrast, the number of spontaneous variants was low. Respective subpopulations were highly clonal, suggesting that they may have been generated via cloning steps just before the establishment of Tokyo-172. After establishment of Tokyo-172, the proportion of the Type II subpopulation may have decreased during passaging. The most dramatic decrease was found in K-1242-F, a commercial lot propagated directly from Tokyo-172. However, it is not clear why this lot had such a lower proportion of the Type II subpopulation than the other lots derived from Tokyo-172-1. It is possible that some conditions of lot preparation, such as nutrient components and temperature in culture, may affect the proportions of certain subpopulations. Further in vitro experiments are needed to check this hypothesis. In contrast to the heterogeneity in domestic lots, the Type I subpopulation could not be detected in a commercial lot from Thailand. Therefore, some type of cloning process may have been used to purify the Type II subpopulation in this country.

In contrast to the distinct colony morphology of the subpopulations¹⁵, it is generally more difficult to identify heterogeneity because there are no obvious clues to isolate distinct variants. From this background, deep sequencing analyses based on NGS techniques can be used to survey unavoidable spontaneous variations without colony isolation. This strategy has been already reported antecedently for the monitoring of live attenuated poliovirus vaccines^21,22,23. If genome-wide screening of heterogeneity can be incorporated into production of the BCG vaccine, it would be possible to assess the genetic uniformity of the vaccine, which could provide a more sustainable supply of stable vaccine lots.

Our approach revealed the complex heterogeneity of a substrain of BCG Tokyo-172. It is important to establish a corresponding reference genome sequence for other substrains in order to maintain high mapping quality. In this study, we constructed an artificial reference genome from three complete genome sequences of M. bovis and BCG substrains; however, further studies are require in order to verify the usefulness of this reference sequence for other substrains. Currently, the entire genome sequences of two of four candidate WHO reference substrains (i.e., BCG Danish 1331 and Russia I) have not been completed. Accumulation of such basic data may improve a variety of analytical methods for evaluation of various BCG substrains in the future.

Methods

Culture of BCG lots and DNA preparation

Cloned subpopulations (Types I and II) and seed/commercial lots of BCG Tokyo-172 and were cultured in 40 mL of Middlebrook 7H9 liquid broth with gentle shaking at 37 °C. The culture time was restricted to 2 days (about two generations of the substrain) to restrain heterogeneity in the culture as much as possible. Genomic DNAs were prepared from cultured bacilli according to a previous report²⁸. Briefly, centrifuged bacilli were delipidated using chloroform and methanol and then lysed using egg-white lysozyme and proteinase K. Genomic DNA was purified by phenol/chloroform extraction from the lysates. The quality and quantity of purified genomic DNA were evaluated based on the OD₂₆₀ and agarose electrophoresis.

Determination of nucleotide variations between BCG Tokyo-172 subpopulations

Purified DNAs from two substrains were sequenced on an Illumina MiSeq platform to obtain short reads (DRA: DRR029468 and DRR029469) and which were mapped to the complete genome sequence of BCG Tokyo-172 Type I (GenBank: NC_012207) to identify nucleotide variations that were distinct between the two subpopulations using CLC Genomics Workbench (Qiagen Science Inc.). To determine genetic differences between the Type I and II subpopulations, PCR fragments with the respective variant positions were verified by conventional Sanger sequencing. Primer sequences are listed in Supplementary Table 2.

Quantification of the Type I subpopulation in BCG lots

The proportion of subpopulations Types I and II in respective BCG lots was determined by the ratio of the genome region RD16 deleted genotype, a unique genetic feature of the Type I subpopulation, to the intact genotype, corresponding to the Type II subpopulation, using specific real-time PCR probes (Supplementary Table 3). The DNA concentration of both types was analysed in triplicate in three independent experiments, using a modified protocol of a previous report¹⁴. Subsequently, the ratios of the subpopulations were calculated, taking into account the propagation of errors²⁹.

Deep sequencing

Purified DNAs from BCG lots were subsequently analysed using an Illumina Genome Analyzer IIx (GAIIx; Illumina, Inc.). Libraries were constructed using a Paired-End DNA Sample Prep Kit (Illumina, Inc.) and each library was assigned to one lane per lot to obtain sufficient data quantity to detect minor heterogenic mutations. To avoid variants that arose during library construction as much as possible, libraries were constructed twice independently from the same purified DNAs. In this study, 75 bp of both strands were read by GAIIx, but only the forward strands were used for analysis owing to the low quality of the reverse strands. The quality score (QV) of the read data was checked by FastQC (ver. 0.10.1; Supplementary Figure S1).

Detection of heterogenic variation

To find heterogenic variation in the reads, three bioinformatics tools, Breseq (ver. 0.24rc6)³⁰, LoFreq (ver. 0.6.1)³¹ and SNVer (ver. 0.5.2)³² were used. All three tools are based on mapping to a reference genome sequence, finding subpopulational reads possessing nucleotide variants among the total mapped reads and applying the statistical criteria of strand bias, base call quality and mapping quality to remove pseudo-positive heterogeneity. Before analysis, the original read data were trimmed by SolexaQA (ver. 2.2)³³ to remove low-quality base calls; from this analysis, base calls with a quality score (QV) lower than 20 were excluded and reads shorter than 25 bp were removed. As a condition of Breseq, to filter variant calls, base calls with a QV lower than 20 were excluded from the calculation. For LoFreq and SNVer calculations, Burrows-Wheeler Alignment (ver 0.7.7)³⁴ was used as the mapping tool. BCG Tokyo-172 Type I (RefSeq: NC_012207) was used as the reference genome sequence. Mapping results are summarized in Supplementary Table S1.

Construction of a supplemented reference sequence

To supplement unique deletions in the genome sequence of BCG Tokyo-172, other entire genome sequences, i.e., those of BCG Moreau RDJ (GenBank: AM412059.2) and M. bovis AL2122/97 (RefSeq: NC_002945.3), were used. In brief, the three sequences were aligned using mauveAligner (ver 2.4.0)³⁵ and we verified that these sequences could be regarded as entirely syntenic. Deleted regions longer than 20 bp in BCG Tokyo-172 were determined by MAFFT (ver. 7)³⁶. Finally, the regions were supplemented manually to construct an intentional genomic sequence (4,419,199 bp, 47,788 bp larger than the original sequence). This sequence was automatically annotated by MiGAP³⁷ and edited using In silico Molecular Cloning (ver. 5.3; In Silico Biology, Inc.) and Artemis (ver. 16.0.0)³⁸.

Sequencing of variant positions

Putative intrinsic variations emerging in BCG lots were identified from the variation calls described above as variants representing over 5% of the read data, in duplicate, of each lot called by more than two of the three tools. Unreliable variants (located in paralogous or highly polymorphic genes, transposons and repetitive unit regions) were excluded to avoid false-positive or -negative calls. To verify the filtered variants in the two BCG lots (Tokyo-172 and 1A-825-43), bacilli were cultured directly on 7H11 agar plates and colonies were isolated. After the respective colonies were suspended in DNA-free water and heat killed, the supernatants were used as PCR templates to check the putative variant nucleotides by sequencing. PCR and sequencing primers are listed in Supplementary Table S4.

Additional Information

How to cite this article: Wada, T. et al. Deep sequencing analysis of the heterogeneity of seed and commercial lots of the bacillus Calmette-Guérin (BCG) tuberculosis vaccine substrain Tokyo-172. Sci. Rep. 5, 17827; doi: 10.1038/srep17827 (2015).

References

Albert, C. La Vaccination Preventive Contre la Tubeculose par le “BCG” (eds Albert, C. et al.) Ch. 3, 131–141 (Masson, 1927).
Jackson, M. & Yamamoto, S. BCG -Vaccine and Adjuvant- (eds Takii, T. et al.) Ch. 1, 3–12 (JATA press, 2011).
Zwerling, A. et al. The BCG World Atlas: a database of global BCG vaccination policies and practices. PLoS Med 8, e1001012 (2011).
Article Google Scholar
Behr, M. A. et al. Comparative genomics of BCG vaccines by whole-genome DNA microarray. Science 284, 1520–1523 (1999).
Article CAS ADS Google Scholar
Brosch, R. et al. Genome plasticity of BCG and impact on vaccine efficacy. Proc Natl Acad Sci USA 104, 5596–5601 (2007).
Article CAS ADS Google Scholar
Bedwell, J., Kairo, S. K., Behr, M. A. & Bygraves, J. A. Identification of substrains of BCG vaccine using multiplex PCR. Vaccine 19, 2146–2151 (2001).
Article CAS Google Scholar
Hayashi, D. et al. Biochemical characteristics among Mycobacterium bovis BCG substrains. FEMS Microbiol Lett 306, 103–109 (2010).
Article CAS Google Scholar
Aguirre-Blanco, A. M., Lukey, P. T., Cliff, J. M. & Dockrell, H. M. Strain-dependent variation in Mycobacterium bovis BCG-induced human T-cell activation and gamma interferon production in vitro. Infect Immun 75, 3197–3201 (2007).
Article CAS Google Scholar
Belley, A. et al. Impact of methoxymycolic acid production by Mycobacterium bovis BCG vaccines. Infect Immun 72, 2803–2809 (2004).
Article CAS Google Scholar
Orduna, P. et al. Genomic and proteomic analyses of Mycobacterium bovis BCG Mexico 1931 reveal a diverse immunogenic repertoire against tuberculosis infection. BMC Genomics 12, 493 (2011).
Article CAS Google Scholar
Dubos, R. J. & Pierce, C. H. Differential characteristics in vitro and in vivo of several substrains of BCG. IV. Immunizing effectiveness. Am Rev Tuberc 74, 699–717 (1956).
CAS PubMed Google Scholar
Ho, M. M., Southern, J., Kang, H. N. & Knezevic, I. WHO Informal Consultation on standardization and evaluation of BCG vaccines Geneva, Switzerland 22-23 September 2009. Vaccine 28, 6945–6950 (2010).
Article Google Scholar
Yamamoto, S. & Yamamoto, T. Historical review of BCG vaccine in Japan. Jpn J Infect Dis 60, 331–336 (2007).
PubMed Google Scholar
Shibayama, K. et al. Quantification of two variant strains contained in freeze-dried Japanese BCG vaccine preparation by real-time PCR. Biologicals 35, 139–143 (2007).
Article CAS Google Scholar
Honda, I. et al. Identification of two subpopulations of Bacillus Calmette-Guérin (BCG) Tokyo172 substrain with different RD16 regions. Vaccine 24, 4969–4974 (2006).
Article CAS Google Scholar
Seki, M. et al. Whole genome sequence analysis of Mycobacterium bovis bacillus Calmette-Guérin (BCG) Tokyo 172: a comparative study of BCG vaccine substrains. Vaccine 27, 1710–1716 (2009).
Article CAS Google Scholar
Naka, T. et al. Lipid phenotype of two distinct subpopulations of Mycobacterium bovis Bacillus Calmette-Guérin Tokyo 172 substrain. J Biol Chem 286, 44153–44161 (2011).
Article CAS Google Scholar
Totoki, Y. et al. High-resolution characterization of a hepatocellular carcinoma genome. Nat Genet 43, 464–469 (2011).
Article CAS Google Scholar
Hohenlohe, P. A. et al. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet 6, e1000862 (2010).
Article Google Scholar
Woods, R. J. et al. Second-order selection for evolvability in a large Escherichia coli population. Science 331, 1433–1436 (2011).
Article CAS ADS Google Scholar
Macalalad, A. R. et al. Highly sensitive and specific detection of rare variants in mixed viral populations from massively parallel sequence data. PLoS Comput Biol 8, e1002417 (2012).
Article CAS Google Scholar
Neverov, A. & Chumakov, K. Massively parallel sequencing for monitoring genetic consistency and quality control of live viral vaccines. Proc Natl Acad Sci USA 107, 20063–20068 (2010).
Article CAS ADS Google Scholar
Victoria, J. G. et al. Viral nucleic acids in live-attenuated vaccines: detection of minority variants and an adventitious virus. J Virol 84, 6033–6040 (2010).
Article CAS Google Scholar
Jou, R., Huang, W. L. & Su, W. J. Tokyo-172 BCG vaccination complications, Taiwan. Emerg Infect Dis 15, 1525–1526 (2009).
Article Google Scholar
Puthanakit, T. et al. Immune reconstitution syndrome due to bacillus Calmette-Guérin after initiation of antiretroviral therapy in children with HIV infection. Clin Infect Dis 41, 1049–1052 (2005).
Article Google Scholar
Ho, M. M., Markey, K., Rigsby, P., Hockley, J. & Corbel, M. J. Report of an international collaborative study to establish the first WHO reference reagents for BCG vaccines of three different sub-strains. Vaccine 29, 512–518 (2010).
Article Google Scholar
Dagg, B., Hockley, J., Rigsby, P. & Ho, M. M. The establishment of sub-strain specific WHO Reference Reagents for BCG vaccine. Vaccine 32, 6390–6395 (2014).
Article CAS Google Scholar
Belisle, J. T. & Sonnenberg, M. G. Isolation of genomic DNA from mycobacteria. In Methods in molecular biology Vol. 101 (eds Parish, T. & Walker, J. M. ) 31–44 (Humana Press, 1998).
Google Scholar
Ku, H. H. Notes on the use of propagation of error formulas. J Res Natl Bur Stand 70C, 263–273 (1966).
Google Scholar
Barrick, J. E. et al. Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 461, 1243–1247 (2009).
Article CAS ADS Google Scholar
Wilm, A. et al. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res 40, 11189–11201 (2012).
Article CAS Google Scholar
Wei, Z., Wang, W., Hu, P., Lyon, G. J. & Hakonarson, H. SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data. Nucleic Acids Res 39, e132 (2011).
Article CAS Google Scholar
Cox, M. P., Peterson, D. A. & Biggs, P. J. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11, 485 (2010).
Article Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS Google Scholar
Darling, A. C., Mau, B., Blattner, F. R. & Perna, N. T. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14, 1394–1403 (2004).
Article CAS Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30, 772–780 (2013).
Article CAS Google Scholar
Sugawara, H., Ohyama, A., Mori, H. & Kurokawa, K. Microbial Genome Annotation Pipeline (MiGAP) for diverse users. The 20th International Conference on Genome Informatics (GIW2009). Yokohama, Japan (2009).
Carver, T., Harris, S. R., Berriman, M., Parkhill, J. & McQuillan, J. A. Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data. Bioinformatics 28, 464–469. (2012).
Article CAS Google Scholar

Download references

Acknowledgements

This work was partially supported by a Health and Labor Science Research Grant for Research on Emerging and Re-emerging Infectious Diseases from the Ministry of Health, Labor and Welfare of Japan; and by the United States-Japan Cooperative Medical Science Committee.

Author information

Wada Takayuki and Maruyama Fumito contributed equally to this work.

Authors and Affiliations

Department of International Health, Institute of Tropical Medicine, Nagasaki University, Nagasaki, 852-8523, Japan
Takayuki Wada & Taro Yamamoto
Department of Microbiology, Kyoto University Graduate School of Medicine, Kyoto, 606-8501, Japan
Fumito Maruyama & Ichiro Nakagawa
Department of Infectious Diseases, Kobe Institute of Health, Kobe, 650-0046, Japan
Tomotada Iwamoto
Department of Mycobacterium Reference and Research, The Research Institute of Tuberculosis, Tokyo, 204-0022, Japan
Shinji Maeda
Japan BCG Laboratory, Tokyo, 204-0022, Japan
Saburo Yamamoto
Department of Oral Microbiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, 700-8558, Japan
Naoya Ohara
The Advanced Research Center for Oral and Craniofacial Sciences, Dental School, Okayama University, Okayama, 700-8558, Japan
Naoya Ohara

Authors

Takayuki Wada
View author publications
You can also search for this author in PubMed Google Scholar
Fumito Maruyama
View author publications
You can also search for this author in PubMed Google Scholar
Tomotada Iwamoto
View author publications
You can also search for this author in PubMed Google Scholar
Shinji Maeda
View author publications
You can also search for this author in PubMed Google Scholar
Taro Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Ichiro Nakagawa
View author publications
You can also search for this author in PubMed Google Scholar
Saburo Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Naoya Ohara
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.Y. and N.O. conceived the study; T.W., S.M. and N.O. performed the experiments; T.W., F.M., T.I. and S.M. analysed the data; and T.W., F.M., T.Y., I.N. and S.Y. wrote the manuscript. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Materials

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Wada, T., Maruyama, F., Iwamoto, T. et al. Deep sequencing analysis of the heterogeneity of seed and commercial lots of the bacillus Calmette-Guérin (BCG) tuberculosis vaccine substrain Tokyo-172. Sci Rep 5, 17827 (2015). https://doi.org/10.1038/srep17827

Download citation

Received: 04 June 2015
Accepted: 06 November 2015
Published: 04 December 2015
DOI: https://doi.org/10.1038/srep17827

This article is cited by

Genome sequences of BCG Pasteur ATCC 35734 and its derivative, the vaccine candidate BCGΔBCG1419c
- Giuseppe D’Auria
- Yordan Hodzhev
- Mario Alberto Flores-Valdez
BMC Genomics (2023)
First insight into the whole-genome sequence variations in Mycobacterium bovis BCG-1 (Russia) vaccine seed lots and their progeny clinical isolates from children with BCG-induced adverse events
- Olga Narvskaya
- Daria Starkova
- Igor Mokrousov
BMC Genomics (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A guide to vaccinology: from basic principles to new developments

Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis

The evolution of SARS-CoV-2

Introduction

Results

Genomic characterization of two subpopulations in the BCG Tokyo-172 substrain

Subpopulations in seed/commercial lots of BCG Tokyo-172

Deep sequencing analysis

Monitoring intrinsic heterogeneity in BCG lots

Discussion

Methods

Culture of BCG lots and DNA preparation

Determination of nucleotide variations between BCG Tokyo-172 subpopulations

Quantification of the Type I subpopulation in BCG lots

Deep sequencing

Detection of heterogenic variation

Construction of a supplemented reference sequence

Sequencing of variant positions

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Materials

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Genome sequences of BCG Pasteur ATCC 35734 and its derivative, the vaccine candidate BCGΔBCG1419c

First insight into the whole-genome sequence variations in Mycobacterium bovis BCG-1 (Russia) vaccine seed lots and their progeny clinical isolates from children with BCG-induced adverse events

Comments

Search

Quick links