Whole-genome sequencing of liver cancers identifies etiological influences on mutation patterns and recurrent mutations in chromatin regulators

Journal name:
Nature Genetics
Volume:
44,
Pages:
760–764
Year published:
DOI:
doi:10.1038/ng.2291
Received
Accepted
Published online

Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. We sequenced and analyzed the whole genomes of 27 HCCs, 25 of which were associated with hepatitis B or C virus infections, including two sets of multicentric tumors. Although no common somatic mutations were identified in the multicentric tumor pairs, their whole-genome substitution patterns were similar, suggesting that these tumors developed from independent mutations, although their shared etiological backgrounds may have strongly influenced their somatic mutation patterns. Statistical and functional analyses yielded a list of recurrently mutated genes. Multiple chromatin regulators, including ARID1A, ARID1B, ARID2, MLL and MLL3, were mutated in ~50% of the tumors. Hepatitis B virus genome integration in the TERT locus was frequently observed in a high clonal proportion. Our whole-genome sequencing analysis of HCCs identified the influence of etiological background on somatic mutation patterns and subsequent carcinogenesis, as well as recurrent mutations in chromatin regulators in HCCs.

At a glance

Figures

  1. Somatic substitution patterns of HCCs.
    Figure 1: Somatic substitution patterns of HCCs.

    (a) The number of somatic substitutions and indels (top) and somatic substitution patterns (bottom) of the 27 HCC genomes. (b) Repair on the transcribed strand. Fitted curves show the effect of gene expression and strand bias on substitution prevalence. We used Agilent microarray expression data (Whole Human Genome 8 × 60K Oligonucleotide Microarray) in this transcription-coupled repair (TCR) analysis, and expression level indicates Agilent microarray intensity level units with a log2 scale. UT, untranscribed strands; T, transcribed strands.

  2. Mutation patterns of MCTs.
    Figure 2: Mutation patterns of MCTs.

    (a) Circos plots20 of the MCTs from two subjects (HC3 and HC7). Each circle plot represents validated rearrangements (inner arcs) and copy-number alternations (inner rings). In rearrangements, lines show translocations (green), deletions (blue), inversions (orange) and tandem duplications (red). Copy-number gain and loss regions are shown in green and red. (b) PCA of the somatic substitution patterns of 25 HCC genomes. Two sets of MCT pairs (HC3 and HC7) are shown by green and blue, respectively, and are circled in red. (c) Three-dimensional plot of principal components (PCs) for 25 HCCs on somatic substitution pattern. Two sets of MCT pairs (HC3 and HC7) are shown in green and blue, respectively.

  3. Mutant allele proportions of point mutations.
    Figure 3: Mutant allele proportions of point mutations.

    HMG, highly mutated genes, genes whose mutation frequency was greater than 3% in the validation set (ARID1A, IGSF10, ATM, ZNF226, ZIC3, WWP1 and ERRFI1); TSG, known tumor suppressor genes annotated by MutationAssessor21; NS, nonsynonymous; S, synonymous. Non-coding includes point mutations in non-coding regions except for in splice sites. The edges of the boxes represent the 25th and 75th percentile values. The whiskers represent the most extreme data points, which are no more than 1.5 times the interquartile range from the boxes. *P < 0.05; **P < 0.01; ***P < 0.001.

  4. Mutations in chromatin regulators and functional analysis of potential driver genes.
    Figure 4: Mutations in chromatin regulators and functional analysis of potential driver genes.

    (a) Mutations in chromatin regulators in 27 HCC genomes. Mutations in chromatin-regulator genes are summarized. In addition to point mutations, 55.6-kb genomic deletion of ARID2 in NBNC2, which was identified by the read-pair method, and several copy-number alternations of chromatin-regulator genes are included. HC6 had both 1-bp deletion and low-level loss in ARID1A. (b) Functional assays of potential driver genes in HCC cell lines. Changes in cell proliferation in five HCCs compared to proliferation with control siRNA treatment are presented. Magenta and blue boxes represent more than 2-fold and less than 0.5-fold changes in the cell number, respectively. Genes involved in chromatin modification are indicated by the line.

  5. Clonal proportion of HBV integration sites in cancer cell populations of four HBV-integrated HCCs.
    Figure 5: Clonal proportion of HBV integration sites in cancer cell populations of four HBV-integrated HCCs.

    Integration sites (ISs) in the TERT locus are indicated by red. Digital PCR analysis indicates 4.0–57.8% clonal population of HBV integration at each locus. The average proportion of the TERT integration sites (41%) was higher than that of other integration sites (32%). Error bars, s.e.m. from four replicate measures.

Accession codes

Primary accessions

Gene Expression Omnibus

Referenced accessions

NCBI Reference Sequence

References

  1. Totoki, Y. et al. High-resolution characterization of a hepatocellular carcinoma genome. Nat. Genet. 43, 464469 (2011).
  2. Greenman, C. et al. Patterns of somatic mutation in human cancer genomes. Nature 446, 153158 (2007).
  3. El-Serag, H.B. & Rudolph, K.L. Hepatocellular carcinoma: epidemiology and molecular carcinogenesis. Gastroenterology 132, 25572576 (2007).
  4. Laurent-Puig, P. & Zucman-Rossi, J. Genetics of hepatocellular tumors. Oncogene 25, 37783786 (2006).
  5. Wiegand, K.C. et al. ARID1A mutations in endometriosis-associated ovarian carcinomas. N. Engl. J. Med. 363, 15321543 (2010).
  6. Jones, S. et al. Somatic mutations in the chromatin remodeling gene ARID1A occur in several tumor types. Hum. Mutat. 33, 100103 (2012).
  7. Ferby, I. et al. Mig6 is a negative regulator of EGF receptor–mediated skin morphogenesis and tumor formation. Nat. Med. 12, 568573 (2006).
  8. Reschke, M. et al. Mitogen-inducible gene–6 is a negative regulator of epidermal growth factor receptor signaling in hepatocytes and human hepatocellular carcinoma. Hepatology 51, 13831390 (2010).
  9. Li, Y., Zhou, Z., Alimandi, M. & Chen, C. WW domain containing E3 ubiquitin protein ligase 1 targets the full-length ErbB4 for ubiquitin-mediated degradation in breast cancer. Oncogene 28, 29482958 (2009).
  10. Shah, S.P. et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature published online doi:10.1038/nature10933 (4 April 2012).
  11. Huang, W., Sherman, B.T. & Lempicki, R.A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 113 (2009).
  12. Parsons, D.W. et al. The genetic landscape of the childhood cancer medulloblastoma. Science 331, 435439 (2011).
  13. Varela, I. et al. Exome sequencing identifies frequent mutation of the SWI/SNF complex gene PBRM1 in renal carcinoma. Nature 469, 539542 (2011).
  14. Lee, J. et al. A tumor suppressive coactivator complex of p53 containing ASC-2 and histone H3-lysine-4 methyltransferase MLL3 or its paralogue MLL4. Proc. Natl. Acad. Sci. USA 106, 85138518 (2009).
  15. Khalil, A.M. et al. Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expression. Proc. Natl. Acad. Sci. USA 106, 1166711672 (2009).
  16. Jiang, Z. et al. The effects of hepatitis B virus integration into the genomes of hepatocellular carcinoma patients. Genome Res. 22, 593601 (2012).
  17. Paterlini-Bréchot, P. et al. Hepatitis B virus–related insertional mutagenesis occurs frequently in human liver cancers and recurrently targets human telomerase gene. Oncogene 22, 39113916 (2003).
  18. Shafritz, D.A., Shouval, D., Sherman, H.I., Hadziyannis, S.J. & Kew, M.C. Integration of hepatitis B virus DNA into the genome of liver cells in chronic liver disease and hepatocellular carcinoma. Studies in percutaneous liver biopsies and post-mortem tissue specimens. N. Engl. J. Med. 305, 10671073 (1981).
  19. Pleasance, E.D. et al. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 463, 191196 (2010).
  20. Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 16391645 (2009).
  21. Reva, B., Antipin, Y. & Sander, C. Determinants of protein function revealed by combinatorial entropy optimization. Genome Biol. 8, R232 (2007).
  22. Hudson, T.J. et al. International network of cancer genome projects. Nature 464, 993998 (2010).
  23. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 17541760 (2009).
  24. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 20782079 (2009).
  25. Fujimoto, A. et al. Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing. Nat. Genet. 42, 931936 (2010).
  26. Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289300 (1995).
  27. Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 215, 403410 (1990).
  28. Andersson, R. et al. A segmental maximum a posteriori approach to genome-wide copy number profiling. Bioinformatics 24, 751758 (2008).
  29. Zhou, Y.Z., Butel, J.S., Li, P.J., Finegold, M.J. & Melnick, J.L. Integrated state of subgenomic fragments of hepatitis B virus DNA in hepatocellular carcinoma from mainland China. J. Natl. Cancer Inst. 79, 223231 (1987).
  30. Gozuacik, D. et al. Identification of human cancer-related genes by naturally occurring Hepatitis B Virus DNA tagging. Oncogene 20, 62336240 (2001).
  31. Puente, X.S. et al. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia. Nature 475, 101105 (2011).
  32. Qin, J., Jones, R.C. & Ramakrishnan, R. Studying copy number variations using a nanofluidic platform. Nucleic Acids Res. 36, e116 (2008).

Download references

Author information

  1. These authors contributed equally to this work.

    • Akihiro Fujimoto &
    • Yasushi Totoki

Affiliations

  1. Center for Genomic Medicine, RIKEN, Yokohama, Japan.

    • Akihiro Fujimoto,
    • Tetsuo Abe,
    • Keith A Boroevich,
    • Ha Hai Nguyen,
    • Masayuki Aoki,
    • Naoya Hosono,
    • Michiaki Kubo,
    • Fuyuki Miya,
    • Kaoru Nakano,
    • Kumiko Watanabe-Makino,
    • Kazuaki Chayama,
    • Naoyuki Kamatani,
    • Yusuke Nakamura,
    • Tatsuhiko Tsunoda &
    • Hidewaki Nakagawa
  2. Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan.

    • Yasushi Totoki,
    • Fumie Hosoda,
    • Yasuhito Arai,
    • Hiroyuki Takahashi,
    • Takuya Shirakihara,
    • Hiromi Nakamura &
    • Tatsuhiro Shibata
  3. Laboratory of DNA Informatics Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan.

    • Masao Nagasaki,
    • Tetsuo Shibuya,
    • Hiroko Tanaka &
    • Satoru Miyano
  4. National Institute of Biomedical Innovation, Ibaraki, Osaka, Japan.

    • Jun Kusuda
  5. Division of Molecular Pathology, National Cancer Center Research Institute, Tokyo, Japan.

    • Hidenori Ojima &
    • Hitoshi Nakagama
  6. Hepatobiliary and Pancreatic Surgery Division, National Cancer Center Hospital, Tokyo, Japan.

    • Kazuaki Shimada &
    • Tomoo Kosuge
  7. Hepatobiliary and Pancreatic Oncology Division, National Cancer Center Hospital, Tokyo, Japan.

    • Takuji Okusaka
  8. Department of Gastroenterological Surgery, Wakayama Medical University, Wakayama, Japan.

    • Masaki Ueno,
    • Yoshinobu Shigekawa &
    • Hiroki Yamaue
  9. Department of Medicine & Molecular Science, Hiroshima University School of Medicine, Hiroshima, Japan.

    • Yoshiiku Kawakami &
    • Kazuaki Chayama
  10. Department of Gastroenterological Surgery, Hiroshima University School of Medicine, Hiroshima, Japan.

    • Koji Arihiro
  11. Department of Anatomical Pathology, Hiroshima University School of Medicine, Hiroshima, Japan.

    • Hideki Ohdan
  12. Department of Surgery, Osaka Medical Center for Cancer and Cardiovascular Diseases, Osaka, Japan.

    • Kunihito Gotoh,
    • Osamu Ishikawa &
    • Terumasa Yamada
  13. Department of Gastroenterological Surgery, Tokyo Women's Medical University, Tokyo, Japan.

    • Shun-ichi Ariizumi &
    • Masakazu Yamamoto
  14. Division of Cancer Development System, National Cancer Center Research Institute, Tokyo, Japan.

    • Hitoshi Nakagama
  15. Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan.

    • Yusuke Nakamura

Contributions

A.F., Y.T., T.A., K.A.B., F.M., H. Nakamura, T.T., T. Shibata and H. Nakagawa performed data analyses. F.H., Y.A., H. Takahashi, T. Shirakihara, K.N., K.W.-M., T. Shibata and H. Nakagawa performed whole-genome sequencing. F.H., H.H.N., K.N. and K.W.-M. performed the validation sequencing study. F.H., Y.A., H. Takahashi, T. Shirakihara and T. Shibata performed siRNA experiments. A.F., M.A., N.H. and M.K. performed digital PCR and SNP microarray experiments. M.N., T. Shibuya, H. Tanaka and S.M. operated the supercomputer system. H. Ojima, K.S., T.O., M.U., Y.S., Y.K., K.A., H. Ohdan, K.G., O.I., S.A., M.Y., T.Y., K.C., T.K. and H.Y. collected clinical samples. A.F., Y.T., T.T., T. Shibata and H. Nakagawa wrote the manuscript. Y.N., T.T., T. Shibata and H. Nakagawa conceived the study and led the design of the experiments. J.K., N.K., H. Nakagama, Y.N., T. Shibata and H. Nakagawa contributed to the findings for this study.

Competing financial interests

The authors declare no competing financial interests.

Corresponding authors

Correspondence to:

Author details

Supplementary information

PDF files

  1. Supplementary Text and Figures (9 MB)

    Supplementary Note, Supplementary Figures 1–11 and Supplementary Tables 1 and 2 and 7–14

Excel files

  1. Supplementary Tables 3–6 (10 MB)

    Supplementary Tables 3–6

Additional data