Chinese hamster genome sequenced from sorted chromosomes

Brinkrolf, Karina; Rupp, Oliver; Laux, Holger; Kollin, Florian; Ernst, Wolfgang; Linke, Burkhard; Kofler, Rudolf; Romand, Sandrine; Hesse, Friedemann; Budach, Wolfgang E; Galosy, Sybille; Müller, Dethardt; Noll, Thomas; Wienberg, Johannes; Jostock, Thomas; Leonard, Mark; Grillari, Johannes; Tauch, Andreas; Goesmann, Alexander; Helk, Bernhard; Mott, John E; Pühler, Alfred; Borth, Nicole

doi:10.1038/nbt.2645

Download PDF

Correspondence
Open access
Published: 08 August 2013

Chinese hamster genome sequenced from sorted chromosomes

Karina Brinkrolf^1,2^na1,
Oliver Rupp¹^na1,
Holger Laux³^na1,
Florian Kollin¹,
Wolfgang Ernst⁴,
Burkhard Linke¹,
Rudolf Kofler⁵,
Sandrine Romand³,
Friedemann Hesse⁴,
Wolfgang E Budach³,
Sybille Galosy⁶,
Dethardt Müller⁴,
Thomas Noll¹,
Johannes Wienberg⁵,
Thomas Jostock³,
Mark Leonard⁶,
Johannes Grillari⁴,
Andreas Tauch^1,2,
Alexander Goesmann¹,
Bernhard Helk³,
John E Mott⁶,
Alfred Pühler¹ &
…
Nicole Borth^2,4

Nature Biotechnology volume 31, pages 694–695 (2013)Cite this article

13k Accesses
127 Citations
61 Altmetric
Metrics details

Subjects

Genomics

To the Editor:

In recent years, the number of published genome sequences has increased substantially owing to major developments in next-generation sequencing (NGS) technologies, concomitant reduction of sequencing costs and improvements in assembly strategies. In 2011, your journal published the genome of Chinese hamster ovary (CHO)-K1 cells, the most frequently used mammalian production cell line for biopharmaceutical products¹. In this issue, the genomes of several related CHO cell lines as well as of the genome of the Chinese hamster are also presented². Although this information provides long-awaited and necessary insights for scientists working with these important production hosts, it also highlights a major drawback of short-read NGS technology, namely, the difficulty of assembling short-read data and scaffolding these sequences into a fully structured genome. This is especially critical for CHO cells, which are known to be genomically unstable, with frequent chromosome rearrangements and loss^3,4. In the following correspondence, we describe how a chromosome sorting approach can facilitate genome assembly from short-read sequences.

The effects of chromosome rearrangements on behavior relevant to individual bioprocesses of different CHO cell lines is not clear and will require more detailed analysis in the future. Although it seems less likely that large segments of genomic DNA are lost completely, which would entail the loss of necessary cellular functions, presumably leading to cell death, rearrangements probably lead to subtle changes in transcription patterns. These may affect cellular properties relevant to bioprocessing, such as growth, robustness and productivity of CHO cell lines and clones. For future studies on these changes and their impact on cell behavior in industrial cell lines, it is thus of prime importance to have, on the one hand, a reference genome that includes the allocation of scaffolds and contigs to chromosomes and, on the other hand, a method that enables characterization of chromosomal translocations present in CHO cell lines being sequenced.

Current NGS technology yields short-read sequences typically in the range of 100–500 bp, so that common repeats cannot be assembled and the precise location of duplicated sequences is likely to be missed⁵. De novo assembly generates, on average, scaffolds of 1–2 Mb if genome coverage is sufficiently high (50- to 100-fold). As chromosomes are several fold larger (typically 90-200 Mb), chromosomal rearrangements and translocations can be captured only in part.

Here, we address this dilemma by isolating individual chromosomes by flow cytometric cell sorting, followed by NGS of the obtained material in separate sequencing reactions. After curation and assembly, the resulting scaffolds can be assigned to specific chromosomes. We applied our approach to cells from the Chinese hamster strain 17A/GY and came across several challenges, such as cross-contamination by chromosomes that were too close in the flow histogram and which required a bioinformatic procedure for curation (Fig. 1). The most severely affected chromosomes in this respect were chromosomes 5 and 6. Chromosomes 9 and 10 could only be separated as a pool and chromosome Y was not sorted at all. For library construction, we obtained 80–620 ng of DNA for each sorted chromosome and prepared, in addition, a 5,000-bp mate-pair sequencing library from whole genome DNA. We sequenced the libraries on an Illumina (San Diego) Genome Analyzer IIx, using TrueSeq PE Cluster Kit v5-CS-GA and TrueSeq SBS Kit v5-GA and generated ∼70-fold genome coverage, assuming a genome size of 2.8 Gb for the Chinese hamster⁶. Subsequently, 1.4 billion reads were assembled into a draft sequence for the separated chromosomes using ALLPATHS-LG⁷. As mentioned above, sequencing libraries from separated chromosomes might be contaminated with sequences from other hamster chromosomes. The separated chromosome assemblies were therefore analyzed to identify and eliminate contaminating scaffolds from the data. This filtering led to high-quality assemblies of separated Chinese hamster chromosomes with the total number of scaffolds ranging from 517 for chromosome 8 to 5,348 for chromosomes 9+10, and a total genome size of 2.33 Gb (Table 1).

**Figure 1: Bivariate flow cytometric analysis of Chinese hamster chromosomes.**

Table 1 Assembly statistics of separated Chinese hamster chromosomes

Full size table

We mapped scaffolds of the separated hamster chromosome libraries to the mouse genome together with the published CHO-K1 genomic sequence¹ (Supplementary Fig. 1). This revealed that, in principle, the entire genome of the mouse can be covered by Chinese hamster sequences, even though complex chromosomal rearrangements have occurred. The only exceptions are mouse chromosomes 7, 14, 17 and X, which are incompletely covered by both Chinese hamster and CHO-K1 sequences. Gaps detected between the Chinese hamster scaffolds and mouse chromosomes occur primarily in regions with a high frequency of interspersed repeats and low complexity regions, which cannot be assembled properly from short sequence reads. As the missing regions on mouse chromosomes 7 and 12 are in part covered by short scaffolds and as the corresponding CHO-K1 genome has even more sequences mapping to these locations, it seems likely that these sequences are not missing in the Chinese hamster, but might have been difficult to assemble owing to sequence repeats. Also notable is that despite the severe chromosomal rearrangements that have occurred in CHO-K1 (refs. 3,4), no major parts of the genome are completely missing: gaps relative to the mouse chromosomes occur at the same positions of high repeat density as for the Chinese hamster reference genome, and only very small regions are missing in CHO-K1 that are present in the Chinese hamster genome. Homologies between the Chinese hamster chromosome sequences and mouse chromosomes identified by sequence mapping compare well to reciprocal chromosome painting results of hamster and mouse chromosomes⁸.

The sequence of the Chinese hamster provides a reference for future research of sufficient quality and precision to enable characterization and study of chromosomal rearrangements and stability in CHO cell lines. In addition, the results of this study suggest that the approach of using sorted chromosomes for library generation may prove beneficial for sequencing of complex reference genomes of other eukaryotes.

Accession code. GenBank: APMK00000000. The version described in this paper is the first version, APMK01000000.

Author contributions

N.B., J.G., J.E.M. and A.P. originated the concept of the study. H.L. contributed the chromosome sorting strategy. The project was further developed by W.E.B., D.M., T.J., M.L. and B.H. K.B., T.N., A.T. and A.G. carried out the sequencing project design. W.E. and F.H. contributed to study planning and generated samples of cells and genomic DNA of the Chinese hamster. R.K. and J.W. sorted Chinese hamster chromosomes. H.L. and S.R. prepared DNA from sorted chromosomes. O.R., F.K. and B.L. performed data analysis. All authors contributed to drafting and reviewing the manuscript.

References

Xu, X. et al. Nat. Biotechnol. 29, 735–741 (2011).
Article CAS Google Scholar
Lewis, N.E. et al. Nat. Biotechnol. 31, 759–765 (2013).
Article CAS Google Scholar
Derouazi, M. et al. Biochem. Biophys. Res. Commun. 340, 1069–1077 (2006).
Article CAS Google Scholar
Cao, Y. et al. Biotechnol. Bioeng. 109, 1357–1367 (2012).
Article CAS Google Scholar
Alkan, C. et al. Nat. Methods 8, 61–65 (2011).
Article CAS Google Scholar
Omasa, T. et al. Biotechnol. Bioeng. 104, 986–994 (2009).
Article CAS Google Scholar
Gnerre, S. et al. Proc. Natl. Acad. Sci. USA 108, 1513–1518 (2011).
Article CAS Google Scholar
Yang, F. et al. Chromosome Res. 8, 219–227 (2000).
Article CAS Google Scholar

Download references

Acknowledgements

F.K. acknowledges the receipt of a scholarship from the CLIB Graduate Cluster Industrial Biotechnology. Part of this research was supported by ACIB, the Austrian Center of Industrial Biotechnology, a K2 competence center within the COMET program of the Austrian FFG (the Austrian Research Promotion Agency).

Author information

Karina Brinkrolf, Oliver Rupp and Holger Laux: These authors contributed equally to this work.

Authors and Affiliations

Center for Biotechnology, Bielefeld University, Germany
Karina Brinkrolf, Oliver Rupp, Florian Kollin, Burkhard Linke, Thomas Noll, Andreas Tauch, Alexander Goesmann & Alfred Pühler
ACIB, Austrian Center of Industrial Biotechnology, Austria
Karina Brinkrolf, Andreas Tauch & Nicole Borth
Novartis Pharma, Basel, Switzerland
Holger Laux, Sandrine Romand, Wolfgang E Budach, Thomas Jostock & Bernhard Helk
Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria
Wolfgang Ernst, Friedemann Hesse, Dethardt Müller, Johannes Grillari & Nicole Borth
Molecular Cytogenetics, Chrombios, Nussdorf, Germany
Rudolf Kofler & Johannes Wienberg
Pfizer, New York, New York, USA
Sybille Galosy, Mark Leonard & John E Mott

Authors

Karina Brinkrolf
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Rupp
View author publications
You can also search for this author in PubMed Google Scholar
Holger Laux
View author publications
You can also search for this author in PubMed Google Scholar
Florian Kollin
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Ernst
View author publications
You can also search for this author in PubMed Google Scholar
Burkhard Linke
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Kofler
View author publications
You can also search for this author in PubMed Google Scholar
Sandrine Romand
View author publications
You can also search for this author in PubMed Google Scholar
Friedemann Hesse
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang E Budach
View author publications
You can also search for this author in PubMed Google Scholar
Sybille Galosy
View author publications
You can also search for this author in PubMed Google Scholar
Dethardt Müller
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Noll
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Wienberg
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Jostock
View author publications
You can also search for this author in PubMed Google Scholar
Mark Leonard
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Grillari
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Tauch
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Goesmann
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard Helk
View author publications
You can also search for this author in PubMed Google Scholar
John E Mott
View author publications
You can also search for this author in PubMed Google Scholar
Alfred Pühler
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Borth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Alfred Pühler or Nicole Borth.

Ethics declarations

Competing interests

M.L., S.G. and J.E.M. are employees of Pfizer and H.L., S.R., W.E.B., T.J. and B.H. are employees of Novartis; both companies use CHO cells for manufacturing purposes. R.K. and J.W. are employees of Chrombios, which offers services for chromosome analyses.

Supplementary information

Supplementary Text and Figures

Supplementary Figure 1 (PDF 344 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Brinkrolf, K., Rupp, O., Laux, H. et al. Chinese hamster genome sequenced from sorted chromosomes. Nat Biotechnol 31, 694–695 (2013). https://doi.org/10.1038/nbt.2645

Download citation

Published: 08 August 2013
Issue Date: August 2013
DOI: https://doi.org/10.1038/nbt.2645

This article is cited by

Recent developments in miRNA based recombinant protein expression in CHO
- Masoume Bazaz
- Ahmad Adeli
- Noushin Davoudi
Biotechnology Letters (2022)
Mapping the molecular basis for growth related phenotypes in industrial producer CHO cell lines using differential proteomic analysis
- Laura Bryan
- Michael Henry
- Paula Meleady
BMC Biotechnology (2021)
EvalDNA: a machine learning-based tool for the comprehensive evaluation of mammalian genome assembly quality
- Madolyn L. MacDonald
- Kelvin H. Lee
BMC Bioinformatics (2021)
Highly efficient synchronization of sheep skin fibroblasts at G2/M phase and isolation of sheep Y chromosomes by flow cytometric sorting
- Yanzhu Yao
- Yuanyuan Zhang
- Xuemei Deng
Scientific Reports (2020)
Evolution from adherent to suspension: systems biology of HEK293 cell line development
- Magdalena Malm
- Rasool Saghaleyni
- Johan Rockberg
Scientific Reports (2020)

Chinese hamster genome sequenced from sorted chromosomes

Subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Supplementary Text and Figures

Rights and permissions

About this article

Cite this article

This article is cited by

Recent developments in miRNA based recombinant protein expression in CHO

Mapping the molecular basis for growth related phenotypes in industrial producer CHO cell lines using differential proteomic analysis

EvalDNA: a machine learning-based tool for the comprehensive evaluation of mammalian genome assembly quality

Highly efficient synchronization of sheep skin fibroblasts at G2/M phase and isolation of sheep Y chromosomes by flow cytometric sorting

Evolution from adherent to suspension: systems biology of HEK293 cell line development

Search

Quick links

Subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Supplementary Text and Figures

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Recent developments in miRNA based recombinant protein expression in CHO

Mapping the molecular basis for growth related phenotypes in industrial producer CHO cell lines using differential proteomic analysis

EvalDNA: a machine learning-based tool for the comprehensive evaluation of mammalian genome assembly quality

Highly efficient synchronization of sheep skin fibroblasts at G2/M phase and isolation of sheep Y chromosomes by flow cytometric sorting

Evolution from adherent to suspension: systems biology of HEK293 cell line development

Search

Quick links